BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy13810
         (386 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|350413829|ref|XP_003490125.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
           impatiens]
          Length = 467

 Score =  540 bits (1392), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 268/418 (64%), Positives = 317/418 (75%), Gaps = 35/418 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  IAKHYV+RLLFVEQPVPQAVIASW SK H +EH++    L EL             
Sbjct: 49  ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLHFEEHQKVVLILNELNVWKEASIPGGLP 108

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNTTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168

Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q EGISADAVR L        +E   +PVIT+ GFQFLLLET +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEAR 228

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GLDLVECLTFLFQL FST G DYSTEGMS+ L  FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLSTFLQHLREFGLVYQRKRKAGRFYPTRL 288

Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           ALN+AT       R+P   G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGQNKPLSRDPEKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSI 348

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
           LTRDSVR AL+SGITAAQI+G+L+ HA   +     P LPP + DQI+LWE ER R   S
Sbjct: 349 LTRDSVRQALKSGITAAQIVGYLQQHAHGKMIEAGPPVLPPTIVDQIKLWENERNRFIFS 408

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           EGVLYSQF+SQ DFE LRD+A   GVL WQNE+KRT+VVTK GH D+KKFW  + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQNERKRTIVVTKAGHDDVKKFWKRYSKGS 466


>gi|242021693|ref|XP_002431278.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Pediculus humanus corporis]
 gi|212516535|gb|EEB18540.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Pediculus humanus corporis]
          Length = 467

 Score =  540 bits (1390), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 262/417 (62%), Positives = 326/417 (78%), Gaps = 34/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL +IA+HYV RLLFVEQPVPQAVIASW+S+TH+KEH  AT  LT+LR            
Sbjct: 50  ELPQIARHYVSRLLFVEQPVPQAVIASWISQTHSKEHLTATKVLTDLRVWQEAAIPGGLP 109

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                    +N K++LLGGG+PW+MSA+LE D+K RD+ +L+ YA+ERWEC+L +MVGSQ
Sbjct: 110 GWILNSVFKKNMKISLLGGGRPWTMSAQLEPDTKPRDIPYLDNYAMERWECVLHYMVGSQ 169

Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
             EGISADAVR L        +E   + VIT++GFQFLLL+T +QVWYF+L+YL+TV ++
Sbjct: 170 AQEGISADAVRILLHANLMKRDEEDGSCVITREGFQFLLLDTASQVWYFMLQYLDTVSAR 229

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
            LDLVECLTFLFQL FST G DYST GMS+ L VFLQHLREFGL+YQRKR+ GRFYPT+L
Sbjct: 230 NLDLVECLTFLFQLSFSTLGKDYSTIGMSEGLLVFLQHLREFGLIYQRKRRGGRFYPTRL 289

Query: 213 ALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
           ALN+A     +L+Q+ + G++I+ETN+RVYAYTDSNL+VAL+GLFCE++YRFPNL+VG++
Sbjct: 290 ALNIACGENKSLQQMNKEGYIIIETNYRVYAYTDSNLQVALLGLFCEMLYRFPNLSVGLI 349

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALP---SVSCP-LPPVVSDQIRLWEGERERLTTSE 326
           TRDSVR A +SGITA QI+GFLRLHA P   +V  P LPP V DQI+LWE E  RL  S+
Sbjct: 350 TRDSVRQAFKSGITAEQIVGFLRLHAHPRMIAVGPPTLPPTVVDQIKLWENELNRLVYSD 409

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLYSQF+SQADFE LRD A +LGVL W+N+KKRT+VVTK GH D+KKFW ++ K +
Sbjct: 410 GVLYSQFLSQADFEALRDRANELGVLVWENDKKRTMVVTKSGHDDVKKFWKSYSKAN 466


>gi|340710070|ref|XP_003393621.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
           terrestris]
          Length = 467

 Score =  539 bits (1389), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 267/418 (63%), Positives = 317/418 (75%), Gaps = 35/418 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  IAKHYV+RLLFVEQPVPQAVIASW SK H +EH++    L EL             
Sbjct: 49  ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLHFEEHQKVVLILNELNVWKEASIPGGLP 108

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNTTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168

Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q EGISADAVR L        +E   +PVIT+ GFQFLLLET +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEAR 228

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GLDLVECLTFLFQL FST G DYSTEGMS+ L  FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLSTFLQHLREFGLVYQRKRKAGRFYPTRL 288

Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           ALN+AT       R+P   G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGQNKPLSRDPEKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSI 348

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
           LTRDSVR AL+SGITAAQI+G+L+ HA   +     P LPP + DQI+LWE ER R   +
Sbjct: 349 LTRDSVRQALKSGITAAQIVGYLQQHAHGKMIEAGPPVLPPTIVDQIKLWENERNRFIFN 408

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           EGVLYSQF+SQ DFE LRD+A   GVL WQNE+KRT+VVTK GH D+KKFW  + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQNERKRTIVVTKAGHDDVKKFWKRYSKGS 466


>gi|345492819|ref|XP_001602702.2| PREDICTED: general transcription factor IIH subunit 4-like [Nasonia
           vitripennis]
          Length = 428

 Score =  538 bits (1386), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 265/419 (63%), Positives = 315/419 (75%), Gaps = 35/419 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+HYV+RLLFVEQPVPQAVIASW SK H + H      + EL             
Sbjct: 10  ELPEIARHYVMRLLFVEQPVPQAVIASWCSKLHVENHLNVVQVMNELNIWKEAAIPGGLP 69

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   R+N K+ LLGGG PW+MS +LE+DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 70  GWILNATFRKNLKIVLLGGGAPWTMSKQLEIDSKPRDIAFLDSYALERWECVLHYMVGSQ 129

Query: 100 QTEGISADAVRTLFNEGAE-------NPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q EGISADAVR L + G         +PVIT+ GFQFLLL+T AQVWYFIL+YL+TVE++
Sbjct: 130 QQEGISADAVRILLHAGLMKRDEEDGSPVITQAGFQFLLLDTSAQVWYFILQYLDTVEAR 189

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GLDLVECLTFLFQL FST G DYST+GMSD L +FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 190 GLDLVECLTFLFQLNFSTLGKDYSTQGMSDGLLMFLQHLREFGLVYQRKRKAGRFYPTRL 249

Query: 213 ALNMAT---RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           ALN+AT   +   + + +  F+IVETN+RVYAYT+SNL+VALIGLFCEL+YRFPNL V +
Sbjct: 250 ALNIATGQDKPISRDLEKERFVIVETNYRVYAYTNSNLQVALIGLFCELLYRFPNLVVAI 309

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSVSCP----LPPVVSDQIRLWEGERERLTTS 325
           LTRDSVRAAL+SGITA QI+G+L  HA   +  P    LPP + DQI+LWE ER R   S
Sbjct: 310 LTRDSVRAALKSGITAVQIVGYLNQHAHNKMIDPGPPTLPPTIVDQIKLWENERNRFIFS 369

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
           EGVLYSQF SQ DFE LRD+A  LGV+ WQ+++KRT+VVTK GH D+KKFW  + KG+ 
Sbjct: 370 EGVLYSQFHSQIDFEVLRDHAVSLGVMIWQSDRKRTMVVTKTGHDDVKKFWKRYSKGAN 428


>gi|383852611|ref|XP_003701820.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Megachile rotundata]
          Length = 467

 Score =  537 bits (1384), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 266/418 (63%), Positives = 317/418 (75%), Gaps = 35/418 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  IAKHYV+RLLFVEQPVPQAVIASW SK H +EH++    L EL             
Sbjct: 49  ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLHFEEHQKVVSILNELNVWKEASIPGGLP 108

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNTTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168

Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q EGISADAVR L        +E   +PVIT+ GFQFLLLET +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEAR 228

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GLDLVECLTFLFQL FST G DYSTEGMS+ L  FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288

Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           ALN+AT       R+P   G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGQNKPLSRDPDKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSI 348

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
           LTRDSVR AL+SGITA+QI+G+L+ HA   +     P LPP + DQI+LWE ER R   S
Sbjct: 349 LTRDSVRQALKSGITASQIVGYLQQHAHSKMIEAGPPVLPPTIVDQIKLWENERNRFIFS 408

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           EGVLYSQF+SQ DFE LRD+A   GVL WQ+E+KRT+VVTK GH D+KKFW  + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQSERKRTMVVTKAGHDDVKKFWKRYSKGS 466


>gi|48102983|ref|XP_395476.1| PREDICTED: general transcription factor IIH subunit 4 [Apis
           mellifera]
          Length = 467

 Score =  536 bits (1380), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 266/418 (63%), Positives = 316/418 (75%), Gaps = 35/418 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  IAKHYV+RLLFVEQPVPQAVIASW SK H +EH++    L EL             
Sbjct: 49  ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLHFEEHQKVVLVLNELNVWNEASIPGGLP 108

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNTTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168

Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q EGISADAVR L        +E   +PVIT+ GFQFLLLET +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEAR 228

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GLDLVECLTFLFQL FST G DYSTEGMS+ L  FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288

Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           ALN+AT       R+P   G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGQNKPLSRDPEKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSI 348

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
           LTRDSVR AL+SGITA QI+G+L+ HA   +     P LPP + DQI+LWE ER R   S
Sbjct: 349 LTRDSVRQALKSGITATQIVGYLQQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFIFS 408

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           EGVLYSQF+SQ DFE LRD+A   GVL WQ+E+KRT+VVTK GH D+KKFW  + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQSERKRTMVVTKAGHDDVKKFWKRYSKGS 466


>gi|380014187|ref|XP_003691121.1| PREDICTED: general transcription factor IIH subunit 4-like [Apis
           florea]
          Length = 467

 Score =  536 bits (1380), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 266/418 (63%), Positives = 316/418 (75%), Gaps = 35/418 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  IAKHYV+RLLFVEQPVPQAVIASW SK H +EH++    L EL             
Sbjct: 49  ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLHFEEHQKVVLILNELNVWNEASIPGGLP 108

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNTTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168

Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q EGISADAVR L        +E   +PVIT+ GFQFLLLET +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEAR 228

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GLDLVECLTFLFQL FST G DYSTEGMS+ L  FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288

Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           ALN+AT       R+P   G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGQNKPLSRDPEKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSI 348

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
           LTRDSVR AL+SGITA QI+G+L+ HA   +     P LPP + DQI+LWE ER R   S
Sbjct: 349 LTRDSVRQALKSGITATQIVGYLQQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFIFS 408

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           EGVLYSQF+SQ DFE LRD+A   GVL WQ+E+KRT+VVTK GH D+KKFW  + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQSERKRTMVVTKAGHDDVKKFWKRYSKGS 466


>gi|307212060|gb|EFN87943.1| General transcription factor IIH subunit 4 [Harpegnathos saltator]
          Length = 467

 Score =  530 bits (1364), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 262/418 (62%), Positives = 315/418 (75%), Gaps = 35/418 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  IAKHYV+RLLFVEQPVPQAVIASW SK + +EH++    L EL             
Sbjct: 49  ELPIIAKHYVMRLLFVEQPVPQAVIASWCSKLYFEEHQKVVQILNELYVWKEASIPGGLP 108

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N K+ LLGGG+PW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNNTFKKNLKIVLLGGGRPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168

Query: 100 QTEGISADAVRTLFNEGAE-------NPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q EGISADAVR L + G         +P+IT+ GFQFLLL+T +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDETDGSPIITQAGFQFLLLDTASQVWYFILQYLDTIEAR 228

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GLDL+ECLTFLFQL FST G DYSTEGMS+ L  FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLIECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288

Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           ALN+AT  T    R+    G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPNL V +
Sbjct: 289 ALNIATGETKPLTRDTDKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMIYRFPNLVVSI 348

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA----LPSVSCPLPPVVSDQIRLWEGERERLTTS 325
           LTRDSVR AL+SGITA QI+G+L+ HA    + +    LPP + DQI LWE ER R   S
Sbjct: 349 LTRDSVRQALKSGITATQIVGYLQQHAHSKTIEAGPPILPPTIVDQIMLWENERNRFLFS 408

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           EGVLYSQF+SQ DFE LRD+A   GVL WQNE+KRT+VVTK GH DIKKFW  + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALTTGVLIWQNERKRTIVVTKAGHDDIKKFWKRYSKGS 466


>gi|307177222|gb|EFN66420.1| General transcription factor IIH subunit 4 [Camponotus floridanus]
          Length = 467

 Score =  526 bits (1354), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 258/418 (61%), Positives = 316/418 (75%), Gaps = 35/418 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  IAKHYV+RLLFVEQPVPQAVIASW SK + +EH++    L +L             
Sbjct: 49  ELPIIAKHYVMRLLFVEQPVPQAVIASWCSKLYFEEHQKVVQVLNDLYVWKEASIPGGLP 108

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWILNNIFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168

Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q EGISADAVR L        +E   +P+IT+ GFQFLLL+T +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPIITQAGFQFLLLDTASQVWYFILQYLDTIEAR 228

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GLDLVECLTFLFQL FST G DYSTEGMS+ L  FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288

Query: 213 ALNMAT---RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           ALN+AT   +   K   + G+++VETN+RVYAYT+SNL+VAL+GLFCE++YRFPN+ V +
Sbjct: 289 ALNIATGENKPLAKDTDKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNVVVSI 348

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCP-LPPVVSDQIRLWEGERERLTTS 325
           LTRDS+R AL+SGITA+QI+G+L+ H    +     P LPP + DQI+LWE ER R   S
Sbjct: 349 LTRDSIRQALKSGITASQIVGYLQQHVHSKMIERGPPILPPTIVDQIKLWENERNRFLFS 408

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           EGVLYSQF+SQ DFE LRD+A   GVL WQ+E+KRT+VVTK GH D+KKFW  + KGS
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHALSTGVLIWQSERKRTMVVTKTGHDDVKKFWKRYSKGS 466


>gi|91094191|ref|XP_971121.1| PREDICTED: similar to TFIIH basal transcription factor complex p52
           subunit [Tribolium castaneum]
 gi|270010854|gb|EFA07302.1| hypothetical protein TcasGA2_TC015892 [Tribolium castaneum]
          Length = 472

 Score =  522 bits (1344), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 261/421 (61%), Positives = 316/421 (75%), Gaps = 38/421 (9%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL E+A+ YVIR+LFVEQPVPQAV+ASW ++  +KEH   +  L+EL             
Sbjct: 52  ELPELARQYVIRILFVEQPVPQAVVASWGAQAFSKEHVYVSKVLSELSVWQEAAIPGGLL 111

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N K+ALLGGGKPWSMS+ LEVDSKARD+ FL+ Y+LERWEC+L +MVGSQ
Sbjct: 112 GWILSVTFKKNLKIALLGGGKPWSMSSALEVDSKARDVSFLDAYSLERWECVLHYMVGSQ 171

Query: 100 QTEGISADAVRTLFNEGA-------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q EGISADAVR L + G         +PVIT+ GFQFLLL+  AQVW+F+L+YL+TVE +
Sbjct: 172 QQEGISADAVRILLHAGLMKRDEEDGSPVITRQGFQFLLLDRQAQVWHFLLQYLDTVEQR 231

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GL LVECLTFLFQL FST G DYSTEGMS  L +FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 232 GLSLVECLTFLFQLSFSTLGKDYSTEGMSPGLLIFLQHLREFGLVYQRKRKAGRFYPTRL 291

Query: 213 ALNM-ATRGTLKQIRE----PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
           ALN+  ++G   +I E     G++IVETN+RVYAYTDSNL+VALIGLF ELMYRFPNL V
Sbjct: 292 ALNITCSQGPETRILEEDTPKGYIIVETNYRVYAYTDSNLQVALIGLFTELMYRFPNLVV 351

Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-----SCPLPPVVSDQIRLWEGERERL 322
           GV+TRDS+R AL+ GITA QIIG+L+ HA P +       PLPP V DQI+LWE ER RL
Sbjct: 352 GVITRDSIRQALKGGITADQIIGYLKQHAHPQMLEGEAKHPLPPTVVDQIKLWEIERNRL 411

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
           T SEGVLYSQF+SQADF  L++YAQ  G L W N++KRT+++ K  H D+KKFW  + KG
Sbjct: 412 TYSEGVLYSQFLSQADFNILKEYAQSNGHLIWCNKEKRTLIINKSAHDDVKKFWKRYSKG 471

Query: 383 S 383
           +
Sbjct: 472 N 472


>gi|332030486|gb|EGI70174.1| General transcription factor IIH subunit 4 [Acromyrmex echinatior]
          Length = 550

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 251/416 (60%), Positives = 310/416 (74%), Gaps = 36/416 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  IAKHYV+RLLFVEQPVPQAVIASW SK + +EH++    L EL             
Sbjct: 49  ELPVIAKHYVMRLLFVEQPVPQAVIASWCSKLYFEEHQKVVQVLNELYVWKEASIPGGLP 108

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 109 GWTLNSTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 168

Query: 100 QTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q EGISADAVR L        +E   +P+IT+ GFQFLLL+T +QVWYFIL+YL+T+E++
Sbjct: 169 QQEGISADAVRILLHAGLMKRDEADGSPIITQAGFQFLLLDTASQVWYFILQYLDTIEAR 228

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GLDLVECLTFLFQL FST G DYSTEGMS+ L  FLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 229 GLDLVECLTFLFQLNFSTLGKDYSTEGMSEGLLTFLQHLREFGLVYQRKRKAGRFYPTRL 288

Query: 213 ALNMATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           ALN+AT  T    R+    G++IVETN+RVYAYT+SNL+VAL+GLFCE++YRFPN+ V +
Sbjct: 289 ALNIATGETKPLTRDTDKEGYIIVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNVVVSI 348

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHA---LPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
           LTRDSVR AL+SGITA+QI+G+LR HA   +     P LPP + DQI+LWE ER R   S
Sbjct: 349 LTRDSVRQALKSGITASQIVGYLRQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFLFS 408

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVV-TKQGHSDIKKFWVNHQ 380
           EGVLYSQF+SQ DFE LRD+A   GVL WQ+E++ T++  T+   S+ ++ +  +Q
Sbjct: 409 EGVLYSQFLSQTDFEVLRDHAVSTGVLIWQSERRSTLLADTRNIVSEYRRLFGKYQ 464


>gi|442757633|gb|JAA70975.1| Putative rna polymer [Ixodes ricinus]
          Length = 459

 Score =  495 bits (1275), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 236/415 (56%), Positives = 305/415 (73%), Gaps = 34/415 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  I++HY++RL+FV+QPVPQAV++SW  + + KEH E+ + LT L             
Sbjct: 43  ELPIISRHYIMRLMFVDQPVPQAVVSSWNEQKYVKEHLESLEALTALHIWADSSLPGGLP 102

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   R+N ++ALLGGG+PW++ + LE D   RD  FL++YA+ERWEC+L FMVG  
Sbjct: 103 GWSLSVVFRKNIQIALLGGGQPWAVYSTLEKDKHGRDAQFLDRYAMERWECVLHFMVGCH 162

Query: 100 QTEGISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             EGISADAVR L + G           P+IT +GFQFLL++T +QVW+F+L+YL+T+ES
Sbjct: 163 TKEGISADAVRILLHAGLMKSEEEEGSAPLITMEGFQFLLMDTASQVWHFVLQYLDTLES 222

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +GL+LVECLTFLFQL F T G DYSTEGMS+ L VFLQHLREFGLVYQRKR++GRFYPT+
Sbjct: 223 RGLNLVECLTFLFQLSFLTLGKDYSTEGMSESLLVFLQHLREFGLVYQRKRRSGRFYPTR 282

Query: 212 LALNMATRGTLKQIR--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+A+      +R  E G+++VETN+RVYAYTDS L+VAL+ LFCELMYR PNL VGV
Sbjct: 283 LAINLASGLKETNLRSYESGYIVVETNYRVYAYTDSQLQVALLALFCELMYRLPNLVVGV 342

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           LTR+SVR ALRSGIT+ QII FL++HA P     S  +PP + DQ+RLWE ER+R    E
Sbjct: 343 LTRESVRQALRSGITSNQIIKFLQMHAHPEAQKQSPVIPPTIMDQLRLWELERDRFNFRE 402

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
           GVLYSQF+SQ+DF+ LR+YA DLGVL W N  KR +VV + GH ++K+FW  H++
Sbjct: 403 GVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKRVMVVNRNGHDEVKRFWKRHRQ 457


>gi|157113167|ref|XP_001651923.1| TFIIH basal transcription factor complex p52 subunit [Aedes
           aegypti]
 gi|108877858|gb|EAT42083.1| AAEL006356-PA [Aedes aegypti]
          Length = 487

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 243/431 (56%), Positives = 308/431 (71%), Gaps = 49/431 (11%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ +VIR+LFVEQ +PQAV++SW S+ +AKE+   +  LTEL             
Sbjct: 55  ELPEIARQFVIRILFVEQAIPQAVVSSWGSQVYAKENVLVSRVLTELGVWRTAGLTGGLE 114

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   ++N K+ALLGGG+PWSMS  L+ D K+RD++FL+ YA+ RW C+L +MVG+ 
Sbjct: 115 AWELCPTFKKNLKIALLGGGRPWSMSNALDPDQKSRDIEFLDAYAMSRWRCVLHYMVGAG 174

Query: 99  ----QQTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLE 147
                + EGIS DAVR L        +E   +PVIT+ GFQFLLL+T AQVW+F+L+YL+
Sbjct: 175 SSKGMEGEGISPDAVRILLHANLMKRDETDGSPVITRQGFQFLLLDTQAQVWHFMLQYLD 234

Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
           T E++GLDL ECL+ LFQL FST G DYS+EG+S  L  FLQHLREFGLVYQRKRK GRF
Sbjct: 235 TCEARGLDLAECLSMLFQLSFSTLGRDYSSEGLSQGLLTFLQHLREFGLVYQRKRKEGRF 294

Query: 208 YPTKLALNMATRGTLKQIRE-------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
           YPT+LA N+ ++  ++ I+E        G++IVETN+RVYAYTDSNL+VAL+GLF EL+Y
Sbjct: 295 YPTRLAHNITSKNAVQTIQEDGSSVQDKGYIIVETNYRVYAYTDSNLQVALLGLFTELLY 354

Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---------SCPLPPVVSDQ 311
           RFPNL VGVLTRDSVR A R GITA QII +L  HA P++           PLPP V DQ
Sbjct: 355 RFPNLVVGVLTRDSVRQAFRGGITAEQIISYLEQHAHPTMLNMEQAINSKSPLPPTVVDQ 414

Query: 312 IRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSD 371
           I+LWE ER R T +EGV+Y+QF+SQ DF  LRDYAQ +GV+TWQNE+ RT+VVTK GH D
Sbjct: 415 IKLWENERNRFTYTEGVVYNQFLSQGDFNTLRDYAQSIGVMTWQNERTRTMVVTKNGHDD 474

Query: 372 IKKFWVNHQKG 382
           +K+FW  + KG
Sbjct: 475 VKRFWKRYSKG 485


>gi|170039509|ref|XP_001847575.1| TFIIH basal transcription factor complex p52 subunit [Culex
           quinquefasciatus]
 gi|167863052|gb|EDS26435.1| TFIIH basal transcription factor complex p52 subunit [Culex
           quinquefasciatus]
          Length = 492

 Score =  493 bits (1268), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 242/432 (56%), Positives = 305/432 (70%), Gaps = 49/432 (11%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ +VIR+LFVEQP+P AV++SW S+ +AKE+   +  LTEL             
Sbjct: 60  ELPEIARQFVIRILFVEQPIPHAVVSSWGSQIYAKENTAVSKVLTELGVWRSTGLTGGLE 119

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   ++N K+ALLGGG+PWSMS  L+ D K+RD++FL+ YA+ RW C+L +MVG+ 
Sbjct: 120 GWELCSTFKKNLKIALLGGGRPWSMSNALDPDQKSRDIEFLDGYAMSRWRCVLHYMVGAG 179

Query: 99  ----QQTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLE 147
                + EGIS DAVR L        +E   +PVIT+ GFQFLLL+T AQVW+F+L+YL+
Sbjct: 180 SSKGMEGEGISPDAVRILLHANLMKRDETDGSPVITRQGFQFLLLDTQAQVWHFMLQYLD 239

Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
           T E++GLDL ECL+ LFQL FST G DYS+EG+S  L  FLQHLREFGLVYQRKRK GRF
Sbjct: 240 TCEARGLDLAECLSMLFQLSFSTLGRDYSSEGLSQGLLTFLQHLREFGLVYQRKRKEGRF 299

Query: 208 YPTKLALNMATRGTLKQIRE-------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
           YPT+LA N+ +R  +  I+E        G+++VETN+RVYAYTDSNL+VAL+GLF EL+Y
Sbjct: 300 YPTRLAHNITSRNAVPTIQEDGSAAQDKGYIVVETNYRVYAYTDSNLQVALLGLFTELLY 359

Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---------SCPLPPVVSDQ 311
           RFPNL VGVLTRDSVR A R GITA QII +L  HA P++            LPP V DQ
Sbjct: 360 RFPNLVVGVLTRDSVRQAFRGGITADQIISYLEQHAHPTMHNVEQTINTKSALPPTVVDQ 419

Query: 312 IRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSD 371
           I+LWE ER R T +EGV+Y+QF+SQ DF  LRDYAQ +GV+ WQNE+ RT+VVTK GH D
Sbjct: 420 IKLWENERNRFTYTEGVVYNQFLSQGDFNTLRDYAQSIGVMIWQNERTRTMVVTKNGHDD 479

Query: 372 IKKFWVNHQKGS 383
           +KKFW  + KG 
Sbjct: 480 VKKFWKRYSKGG 491


>gi|31201525|ref|XP_309710.1| AGAP010984-PA [Anopheles gambiae str. PEST]
 gi|21293295|gb|EAA05440.1| AGAP010984-PA [Anopheles gambiae str. PEST]
          Length = 501

 Score =  491 bits (1264), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 244/435 (56%), Positives = 308/435 (70%), Gaps = 53/435 (12%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ +VIR+LFVEQP+PQAV++SW ++ +AKE+   +  LTEL             
Sbjct: 65  ELPEIARQFVIRILFVEQPIPQAVVSSWATQVYAKENTSVSQVLTELGVWRSAAYPGGLA 124

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   ++N K+ALLGGG+PWSMS  L+ D K+RD+DFL+ YA+ RW C+L +MVG+ 
Sbjct: 125 AWELCPTFKKNLKIALLGGGRPWSMSNALDPDQKSRDIDFLDTYAMSRWRCVLHYMVGAG 184

Query: 99  ----QQTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLE 147
                + EGIS DAVR L        +E   +PVIT+ GFQFLLL+T AQVW+F+L+YL+
Sbjct: 185 SSKGMEGEGISPDAVRILLHANLMKRDESDGSPVITRQGFQFLLLDTQAQVWHFMLQYLD 244

Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
           T E++GL+L ECL+ LFQL FST G DYS+EG+S  L  FLQHLREFGLVYQRKRK GRF
Sbjct: 245 TCEARGLNLPECLSMLFQLSFSTLGRDYSSEGLSPGLLTFLQHLREFGLVYQRKRKEGRF 304

Query: 208 YPTKLALNMATRG-----TLKQIREP------GFLIVETNFRVYAYTDSNLKVALIGLFC 256
           YPT+LA N+ ++      TL Q +E       G++IVETN+RVYAYTDSNL+VAL+GLF 
Sbjct: 305 YPTRLAHNITSKNATHATTLAQDQESNATKDKGYIIVETNYRVYAYTDSNLQVALLGLFT 364

Query: 257 ELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---------SCPLPPV 307
           EL+YRFPNL VGVL+RDSVR A R GITA QII +L  HA P++            LPP 
Sbjct: 365 ELLYRFPNLVVGVLSRDSVRQAFRGGITAEQIISYLEQHAHPTMLTVEQAINSKSSLPPT 424

Query: 308 VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
           V DQI+LWE ER R T +EGV+Y+QF+SQADF  LRDYAQ +GV+ WQNE+ RT+VVTK 
Sbjct: 425 VVDQIKLWENERNRFTYTEGVVYNQFLSQADFITLRDYAQSIGVMIWQNERIRTMVVTKN 484

Query: 368 GHSDIKKFWVNHQKG 382
           GH D+KKFW  + KG
Sbjct: 485 GHDDVKKFWKRYSKG 499


>gi|427785675|gb|JAA58289.1| Putative rna polymer [Rhipicephalus pulchellus]
          Length = 457

 Score =  490 bits (1262), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 237/416 (56%), Positives = 306/416 (73%), Gaps = 36/416 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL +I++HY++RLLFVEQPVPQAV++SW  + + K+H EA + LT L             
Sbjct: 41  ELPKISRHYIMRLLFVEQPVPQAVVSSWNEQKYVKDHLEALEALTALHIWTDASLPGGLP 100

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   R+N ++ALLGGGKPW++ + LE D   RD  FL+QYA ERWEC+L FMVG  
Sbjct: 101 GWSLSAVFRKNIQIALLGGGKPWAVYSALEKDKHGRDAAFLDQYAAERWECVLHFMVGCH 160

Query: 100 QTEGISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
            TEGISADAVR L + G          +P+IT +GFQFLL++T +QVW+F+L+YL+T+ES
Sbjct: 161 TTEGISADAVRILLHAGLMKSDEGEGSSPLITMEGFQFLLMDTASQVWHFVLQYLDTLES 220

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +GL+LVECLTFLFQL F T G DYSTEGMS+ L VFLQHLREFGLVYQRKR++GRFYPT+
Sbjct: 221 RGLNLVECLTFLFQLSFLTLGKDYSTEGMSESLLVFLQHLREFGLVYQRKRRSGRFYPTR 280

Query: 212 LALNMAT---RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
           LA+N+A+     TL+   E G+++VETN+RVYAYT+S L+VAL+ LFCEL+YRFPNL V 
Sbjct: 281 LAINLASGLKETTLRSF-EAGYIMVETNYRVYAYTNSQLQVALLALFCELLYRFPNLVVA 339

Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTS 325
            LTR+SVR ALRSGIT+ QII FLR++A P     +  +P  + DQ+RLWE ER+R    
Sbjct: 340 RLTRESVRQALRSGITSNQIIKFLRMYAHPEALKQTPVIPATIMDQLRLWEMERDRFVFR 399

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
           EGVLYSQF+SQ+DF+ LR+YA +LGVL W N  KR +VV + GH ++KKFW  H++
Sbjct: 400 EGVLYSQFISQSDFQLLRNYASELGVLIWDNPSKRVMVVNRNGHDEVKKFWKRHRQ 455


>gi|357618296|gb|EHJ71332.1| putative TFIIH basal transcription factor complex p52 subunit
           [Danaus plexippus]
          Length = 603

 Score =  486 bits (1251), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 241/425 (56%), Positives = 295/425 (69%), Gaps = 42/425 (9%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL E+A+H+VIRLLFVEQPVPQAV+ASW+++THAKE  +A + L+EL             
Sbjct: 179 ELPELARHFVIRLLFVEQPVPQAVVASWVTQTHAKEQHKACEALSELSVWQEAPIPGGLP 238

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N KVALLGGG+PWSMS+ LE D KARD+ FL+ YALERWEC+L +MVGS 
Sbjct: 239 GWMLSQSFKKNLKVALLGGGRPWSMSSSLEPDGKARDVSFLDAYALERWECVLHYMVGST 298

Query: 100 QTEGISADAVRTLFNEGAEN-------PVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           QTEGISADAVR L   G  N        VIT+ GFQFLLL T  QVW F+  YL T E +
Sbjct: 299 QTEGISADAVRILLQAGLMNRDAEDGTAVITRAGFQFLLLSTAKQVWLFLQHYLHTAEKR 358

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
            L   ECL FL+QL FST G DYSTEGMS+ + VFLQHLREFGLVYQRKRKAGRFYPT+L
Sbjct: 359 SLSAAECLAFLYQLSFSTLGKDYSTEGMSNNMLVFLQHLREFGLVYQRKRKAGRFYPTRL 418

Query: 213 ALNMAT----RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
           ALN+         L+     G++I ETN+RVYAYT S L+VAL+GLF EL+YRFPN+ VG
Sbjct: 419 ALNITCVKDGVAPLQTAASSGYIIAETNYRVYAYTTSALQVALLGLFTELVYRFPNVVVG 478

Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVSDQIRLWEGE 318
           VLTR+SVRAALR GI+A QII +L  H+ P +          S  LPP V DQIRLWE E
Sbjct: 479 VLTRESVRAALRGGISAQQIITYLEQHSHPQMLKSDQGGIRSSSSLPPTVLDQIRLWESE 538

Query: 319 RERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVN 378
           R R T +EGV+Y+QF+SQA+F  LRDY +  G L W  ++ RT+VV +  H D+K++W  
Sbjct: 539 RNRFTYTEGVVYNQFLSQAEFNVLRDYGRSSGALVWAADRTRTMVVARAAHDDVKRYWKR 598

Query: 379 HQKGS 383
           + K +
Sbjct: 599 YSKAT 603


>gi|346464903|gb|AEO32296.1| hypothetical protein [Amblyomma maculatum]
          Length = 436

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 230/395 (58%), Positives = 290/395 (73%), Gaps = 34/395 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  I++ Y++RLLFVEQPVPQAVI+SW  + + K+H EA + LT L             
Sbjct: 42  ELPIISRQYIMRLLFVEQPVPQAVISSWNEQKYVKDHLEALEALTALHIWTDASLPGGLP 101

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   R+N ++ALLGGGKPW++   LE D   RD  FL+QYA+ERWEC+L FMVG  
Sbjct: 102 GWSLSAVFRKNIQIALLGGGKPWAVYNPLEKDKHGRDATFLDQYAVERWECVLHFMVGCH 161

Query: 100 QTEGISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
            TEGISADAVR L + G          +P+IT +GFQFLL++TP+QVW+F+L+YL+T+ES
Sbjct: 162 TTEGISADAVRILLHAGLMKSEEGEGSSPLITMEGFQFLLMDTPSQVWHFVLQYLDTIES 221

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +GL+LVECLTFLFQL F T G DYSTEGMS+ L VFLQHLREFGLVYQRKR++GRFYPT+
Sbjct: 222 RGLNLVECLTFLFQLSFLTLGKDYSTEGMSESLLVFLQHLREFGLVYQRKRRSGRFYPTR 281

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+A+  R T  +  E G+++VETN+RVYAYT+S L+VAL+ LFCEL+YRFPNL V  
Sbjct: 282 LAINLASGLRETNLRSYEAGYIMVETNYRVYAYTNSQLQVALLALFCELLYRFPNLVVAR 341

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           LTR+SVR ALRSGIT+ QII FLR++A P     +  +P  + DQ+RLWE ER+R    E
Sbjct: 342 LTRESVRQALRSGITSNQIIKFLRMYAHPEALKQTPVIPATIMDQLRLWELERDRFVYRE 401

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
           GVLYSQF+SQ+DF+ LR+YA DLGVL W N  KR 
Sbjct: 402 GVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKRV 436


>gi|195019521|ref|XP_001984999.1| GH16811 [Drosophila grimshawi]
 gi|193898481|gb|EDV97347.1| GH16811 [Drosophila grimshawi]
          Length = 493

 Score =  456 bits (1174), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 229/432 (53%), Positives = 298/432 (68%), Gaps = 52/432 (12%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ +VIR+LFV+QPVPQAV++SW ++  AKE  EAT  LT L             
Sbjct: 60  ELPEIARQFVIRILFVDQPVPQAVVSSWGAQRIAKEQMEATSCLTALNVWRVTAIPGGLA 119

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW+M+  L+ DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 120 AWELSPTFKKSVRQVLLGGGKPWAMTNTLDKDSKPRDIAFLDTYAMARWRCVLHYMVGTG 179

Query: 99  -----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLE 147
                   E IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L+YLE
Sbjct: 180 SRNGGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLE 239

Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
           T E +G  L ECL+ LFQL FST G DYS+EGMS ++ +FLQHLREFGLV+QRKRK GRF
Sbjct: 240 TCEERGFSLPECLSMLFQLSFSTLGRDYSSEGMSHQMLMFLQHLREFGLVFQRKRKEGRF 299

Query: 208 YPTKLALNM--------ATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           YPT+LALN+        +T    ++++E G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 300 YPTRLALNVTNKDAAQASTSADDERMQERGYIVVETNYRVYAYTDSQLQVAVLGLFTELL 359

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
           YRFPNL VGVLTRDSVR ALR GITA QI+ +L  +A P++          SC LPP V 
Sbjct: 360 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMKLVESAIQSKSC-LPPTVV 418

Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
           DQI+LWE ER R T +EGV+Y+QF+SQ DF  LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 419 DQIKLWELERNRFTYTEGVVYNQFLSQTDFVTLRDYAQSINMLVWQNERTRTMVVQKNGH 478

Query: 370 SDIKKFWVNHQK 381
            D+K++W  + K
Sbjct: 479 DDVKRYWKKYSK 490


>gi|194749905|ref|XP_001957376.1| GF10392 [Drosophila ananassae]
 gi|190624658|gb|EDV40182.1| GF10392 [Drosophila ananassae]
          Length = 497

 Score =  456 bits (1174), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 230/432 (53%), Positives = 300/432 (69%), Gaps = 52/432 (12%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ ++IR+LFV+QPVPQAV++SW ++  AKE  +AT  LT L             
Sbjct: 64  ELPEIARQFIIRILFVDQPVPQAVVSSWGAQRCAKEQTDATSCLTALNVWRVTAIPGGLS 123

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW M+  LE DSK RD++FL+ YA+ RW C+L +MVG+ 
Sbjct: 124 AWELSPTFKKSVRQVLLGGGKPWPMTNTLEKDSKPRDINFLDTYAMSRWRCVLHYMVGTG 183

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                +TE IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L YL+T
Sbjct: 184 SRNGAETEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLHYLDT 243

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            E +G+ L ECL+ LFQL FST G DYS+EGM+ ++  FLQHLREFGLVYQRKRK GRFY
Sbjct: 244 CEERGMPLPECLSMLFQLSFSTLGRDYSSEGMNKQMLDFLQHLREFGLVYQRKRKEGRFY 303

Query: 209 PTKLALNMATR-----GTL----KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           PT+LALN+ ++      TL    + ++E G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 304 PTRLALNVTSKEAAVAATLTTDEEGVQESGYIVVETNYRVYAYTDSPLQVAVLGLFTELL 363

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
           YRFPNL VGVLTRDSVR ALR GITA QI+ +L  +A P++          SC LPP V 
Sbjct: 364 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRLVESAIQSKSC-LPPTVV 422

Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
           DQI+LWE ER R T +EGV+Y+QF+SQ DF  LRDYAQ + VL WQNE+ RT+VV K GH
Sbjct: 423 DQIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQSIHVLVWQNERTRTMVVQKNGH 482

Query: 370 SDIKKFWVNHQK 381
            D+K++W  + K
Sbjct: 483 DDVKRYWKKYSK 494


>gi|195378984|ref|XP_002048261.1| GJ11446 [Drosophila virilis]
 gi|194155419|gb|EDW70603.1| GJ11446 [Drosophila virilis]
          Length = 492

 Score =  455 bits (1170), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 226/431 (52%), Positives = 298/431 (69%), Gaps = 51/431 (11%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ +VIR+LF++QPVPQAV+ SW ++  AKE  EAT  LT L             
Sbjct: 60  ELPEIARQFVIRILFIDQPVPQAVVTSWGAQRVAKEQMEATSCLTALCVWRVTAIPGGLA 119

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW+M+  L+ DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 120 AWELSPTFKKSVRQVLLGGGKPWAMTNTLDRDSKPRDIAFLDSYAMSRWRCVLHYMVGTG 179

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                  E IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L+YLET
Sbjct: 180 SRGGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLET 239

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            E +G+ L ECL+ LFQL FST G DYS+EGM++++  FLQHLREFGLV+QRKRK GRFY
Sbjct: 240 CEERGICLPECLSMLFQLSFSTLGRDYSSEGMNNQMLTFLQHLREFGLVFQRKRKEGRFY 299

Query: 209 PTKLALNM--------ATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
           PT+LALN+        +T    ++++E G+++VETN+RVYAYTDS L+VA++GLF EL+Y
Sbjct: 300 PTRLALNVTNKDAAQASTSADEERMQERGYIVVETNYRVYAYTDSQLQVAVLGLFTELLY 359

Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVSD 310
           RFPNL VGVLTRDSVR ALR GITA QI+ +L  +A P++          SC LPP + D
Sbjct: 360 RFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMKLVESAIQSKSC-LPPTIVD 418

Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHS 370
           QI+LWE ER R T +EGV+Y+QF+SQ DF  LRDYAQ + +L WQNE+ RT+VV K GH 
Sbjct: 419 QIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQSINMLVWQNERTRTMVVQKNGHD 478

Query: 371 DIKKFWVNHQK 381
           D+K++W  + K
Sbjct: 479 DVKRYWKKYSK 489


>gi|195162678|ref|XP_002022181.1| GL25529 [Drosophila persimilis]
 gi|198464237|ref|XP_001353141.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
 gi|194104142|gb|EDW26185.1| GL25529 [Drosophila persimilis]
 gi|198149631|gb|EAL30642.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
          Length = 494

 Score =  454 bits (1167), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 228/432 (52%), Positives = 300/432 (69%), Gaps = 51/432 (11%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL E+A+ ++IR+LFV+QPVPQAV+ SW ++  AKE  EAT  L++L             
Sbjct: 62  ELPELARQFIIRILFVDQPVPQAVVTSWGAQRFAKEQTEATSCLSDLNVWRVTAIPGGLS 121

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW M+  LE DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 122 AWELSPTFKKSVRQVLLGGGKPWPMTNTLEKDSKPRDIAFLDSYAMSRWRCVLHYMVGTG 181

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                  E IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 182 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 241

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            E +G+ L ECL+ LFQL FST G DYS+EGM+ ++  FLQHLREFGLV+QRKRK GRFY
Sbjct: 242 CEERGVSLPECLSMLFQLSFSTLGRDYSSEGMNSQMLSFLQHLREFGLVFQRKRKEGRFY 301

Query: 209 PTKLALNM-----ATRGTL---KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
           PT+LALN+     AT  T+   + +++ G+++VETN+RVYAYTDS L+VA++GLF EL+Y
Sbjct: 302 PTRLALNVTSKEAATTMTVSDEEAMQDSGYIVVETNYRVYAYTDSPLQVAVLGLFTELLY 361

Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVSD 310
           RFPNL VGVLTRDSVR ALR GITA QI+ +L+ +A P++          SC LPP V D
Sbjct: 362 RFPNLVVGVLTRDSVRQALRGGITAEQIVSYLQQYAHPNMKLVESAIQSKSC-LPPTVVD 420

Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHS 370
           QI+LWE ER R T +EGV+Y+QF+SQ DF  LRDYAQ + VL WQNE+ RT+VV K GH 
Sbjct: 421 QIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQSIHVLVWQNERTRTMVVQKHGHD 480

Query: 371 DIKKFWVNHQKG 382
           D+K++W  + KG
Sbjct: 481 DVKRYWKKYSKG 492


>gi|195126587|ref|XP_002007752.1| GI12214 [Drosophila mojavensis]
 gi|193919361|gb|EDW18228.1| GI12214 [Drosophila mojavensis]
          Length = 492

 Score =  454 bits (1167), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 227/433 (52%), Positives = 300/433 (69%), Gaps = 51/433 (11%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ +VIR+LFV+QPVPQAV++SW ++  AKE  EAT+ L+ L             
Sbjct: 60  ELPEIARQFVIRILFVDQPVPQAVVSSWGAQRVAKEQLEATNCLSALSVWRVTAIPGGLA 119

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW+++  LE DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 120 AWELSPTFKKSVRQVLLGGGKPWAITNSLERDSKPRDVAFLDAYAMSRWRCVLHYMVGTG 179

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                +TE IS DAVR L +      +  E   IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 180 NRSNTETEAISPDAVRILLHANLMKRDEREGITITRQGFQFLLLDTRAQVWHFMLQYLDT 239

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            + +GL L ECL+ LFQL FST G DYS+EGM+ ++ +FLQHLREFGLV+QRKRK  RFY
Sbjct: 240 CQERGLVLAECLSMLFQLSFSTLGRDYSSEGMNKQMLMFLQHLREFGLVFQRKRKEHRFY 299

Query: 209 PTKLALNMATRGTL--------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
           PT+LALN+  +           ++++E G+++VETN+RVYAYTDS L+VA++GLF EL+Y
Sbjct: 300 PTRLALNVTNKEAAEASISVDEERMQERGYIVVETNYRVYAYTDSQLQVAVLGLFTELLY 359

Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVSD 310
           RFPNL VGVLTRDSVR ALR GITA QI+ +L  +A P++          SC LPP V D
Sbjct: 360 RFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMKLVESAIQSKSC-LPPTVVD 418

Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHS 370
           QI+LWE ER R T +EGV+Y+QF+SQ DF  LRDYAQ   VL WQNE+ RT+VV K GH 
Sbjct: 419 QIKLWEMERNRFTYTEGVVYNQFLSQNDFVTLRDYAQSQNVLVWQNERTRTMVVQKNGHD 478

Query: 371 DIKKFWVNHQKGS 383
           D+K++W  + K +
Sbjct: 479 DVKRYWKKYSKSA 491


>gi|195441209|ref|XP_002068409.1| GK20438 [Drosophila willistoni]
 gi|194164494|gb|EDW79395.1| GK20438 [Drosophila willistoni]
          Length = 512

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 226/440 (51%), Positives = 299/440 (67%), Gaps = 58/440 (13%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           +L EIA+ ++IR+LFV+QPVPQAV++SW ++  AKE +EAT  LT L             
Sbjct: 73  DLPEIARQFIIRILFVDQPVPQAVVSSWGAQRCAKEQQEATSCLTSLNVWRVTAIPGGLA 132

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW M+  LE DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 133 AWELSPTFKKSVRQILLGGGKPWPMTNTLEKDSKPRDIAFLDSYAMSRWRCVLHYMVGTG 192

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                  E IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+++YL+T
Sbjct: 193 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMMQYLDT 252

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            E +G+ L ECL+ LFQL FST G DYS+EGMS ++  FLQHLREFGLV+QRKRK GRFY
Sbjct: 253 CEERGISLPECLSMLFQLSFSTLGRDYSSEGMSHQMLAFLQHLREFGLVFQRKRKEGRFY 312

Query: 209 PTKLALNMATRGTL---------------KQIREPGFLIVETNFRVYAYTDSNLKVALIG 253
           PT+LALN+ ++ +                +++++ G+++VETN+RVYAYTDS L+VA++G
Sbjct: 313 PTRLALNVTSKESADATTAIVTMSASEEEERMQDRGYIVVETNYRVYAYTDSPLQVAVLG 372

Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCP 303
           LF EL+YRFPNL VGVLTRDSVR ALR GITA QII +L  +A P++          SC 
Sbjct: 373 LFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIISYLEQYAHPNMKLVESAINSKSC- 431

Query: 304 LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVV 363
           LPP V DQI+LWE ER R T +EGV+Y+QF+S  DF  LRDYAQ + VL WQNEK RT+V
Sbjct: 432 LPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSTTDFVTLRDYAQSIQVLVWQNEKTRTMV 491

Query: 364 VTKQGHSDIKKFWVNHQKGS 383
           V K GH D+K++W  + K +
Sbjct: 492 VQKNGHDDVKRYWKKYSKSA 511


>gi|194872905|ref|XP_001973103.1| GG13537 [Drosophila erecta]
 gi|190654886|gb|EDV52129.1| GG13537 [Drosophila erecta]
          Length = 499

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 224/432 (51%), Positives = 293/432 (67%), Gaps = 52/432 (12%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ ++IR+LFV+QPVPQAV+ SW ++  AKE  EAT  LT L             
Sbjct: 66  ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW M+  LE DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 126 AWELSPTFKKSVRQVLLGGGKPWPMTNTLEKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 185

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                  E IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 186 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 245

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            E +G+ L ECL+ LFQL FST G DYS+EGM+ ++  FLQHLREFGLV+QRKRK GRFY
Sbjct: 246 CEERGISLPECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFY 305

Query: 209 PTKLALNMATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           PT+LALN+ ++            +  ++ G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 306 PTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELL 365

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
           YRFPNL VGVLTRDSVR ALR GITA QI+ +L  +A P++          SC LPP V 
Sbjct: 366 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRMVESAIQSKSC-LPPTVV 424

Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
           DQI+LWE ER R T +EGV+Y+QF+S  DF  LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 425 DQIKLWELERNRFTYTEGVVYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGH 484

Query: 370 SDIKKFWVNHQK 381
            D+K++W  + K
Sbjct: 485 DDVKRYWKKYSK 496


>gi|260810774|ref|XP_002600123.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
 gi|229285409|gb|EEN56135.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
          Length = 457

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 223/416 (53%), Positives = 295/416 (70%), Gaps = 34/416 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL+++AKHYV+RLLF+EQ VPQAV+ SW++  +  EH  A   L+ELR            
Sbjct: 42  ELTDLAKHYVMRLLFLEQAVPQAVVGSWVTSGNHSEHLRAVSQLSELRVWHEHQISGGLA 101

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                     N + ALLGGGKPW    +L  D  A+D+  L+ Y+LERWE +L F+VGS 
Sbjct: 102 GWILSPTFRANLRTALLGGGKPWFSVVQLPPDKHAKDVAALDTYSLERWEVLLNFIVGSG 161

Query: 100 QTEGISADAVRTLFN-------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           + + +S D +  L         EG+ +P IT  GFQFLL++TP+QVWY IL+YL+T++S+
Sbjct: 162 EAQ-VSKDIMEILIKSGLMKSEEGSLHPTITPAGFQFLLMDTPSQVWYIILQYLDTMQSR 220

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           GL+LVE L FLFQ+ FST G DY TEGMSD +Q FLQHLRE GLV QRKRK+GRFYPT+L
Sbjct: 221 GLNLVEALQFLFQISFSTLGKDYPTEGMSDSMQQFLQHLRELGLVNQRKRKSGRFYPTRL 280

Query: 213 ALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
           A+++A+      K   + G+L+VE+N+R+YAYTDS L+VALIGLF E++YRFPN+ V  L
Sbjct: 281 AIHLASGISDVEKDFHKEGYLVVESNYRIYAYTDSELQVALIGLFSEILYRFPNMVVANL 340

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPL-PPVVSDQIRLWEGERERLTTSEG 327
           TRDSV+ A+  GITA QI+ FLR++A P      P+ PP +SDQIRLWE ER+RLT +EG
Sbjct: 341 TRDSVQEAVVRGITADQILHFLRVNAHPKALHRVPIVPPTISDQIRLWEMERDRLTFTEG 400

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           VLY+QF+SQ DFE LR+YA+DLGVL W+N  KR +VV+K GH D+K++W   +KG+
Sbjct: 401 VLYNQFLSQPDFEMLRNYAKDLGVLLWENNPKRLMVVSKAGHDDVKRYWKTIRKGN 456


>gi|443688956|gb|ELT91478.1| hypothetical protein CAPTEDRAFT_18660 [Capitella teleta]
          Length = 463

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 228/415 (54%), Positives = 293/415 (70%), Gaps = 35/415 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  ++KHYV+R+LFVEQPV  +V++SW++ ++  EH  A  +L++L             
Sbjct: 47  ELPSLSKHYVLRILFVEQPVSHSVVSSWVNSSNQTEHTAAVKSLSDLCVWQDHCLPGGLP 106

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   R N KVALLGGG PW+ S  L  D  ++D+ FL QY++ERWEC+L +M GS+
Sbjct: 107 GYLLSDVFRTNLKVALLGGGSPWAGSGALGDDKHSKDVAFLEQYSMERWECVLHYMAGSK 166

Query: 100 Q-TEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
           +   G+S D V  L + G       + +P IT  GFQFLLL+T  QVW+F+++YL TVES
Sbjct: 167 EGVSGVSRDVVHILLHSGLMKTEQSSPDPCITPAGFQFLLLDTSTQVWFFMIEYLNTVES 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           + +DLVECL+FLFQL FST G DYSTE M+   Q FLQHLREFGLVYQRKR + RFYPT+
Sbjct: 227 RKMDLVECLSFLFQLSFSTLGKDYSTESMTSNQQRFLQHLREFGLVYQRKRSSQRFYPTR 286

Query: 212 LALNMAT---RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
           LA+N+AT    G  +   E GF++VETN+RVYAYT+S+LKVALI LFC +MYRFPN+ VG
Sbjct: 287 LAVNLATGSKGGDSETASEDGFIVVETNYRVYAYTNSSLKVALISLFCSMMYRFPNMAVG 346

Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTS 325
           V++RDSVR AL  GITA QII FLR HA P        LPP VSDQIRLWE ER+R   +
Sbjct: 347 VVSRDSVREALSRGITAEQIINFLRNHAHPHTQKRKPILPPTVSDQIRLWELERDRFHFN 406

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQ 380
           EGVLY+QF+SQ DFE LRDYA+DLGVL+++N  +R +VVT+ GH D+K+FW  H+
Sbjct: 407 EGVLYNQFLSQTDFEVLRDYAKDLGVLSYENIPRRLMVVTRAGHDDVKRFWKRHK 461


>gi|195477975|ref|XP_002086439.1| GE22863 [Drosophila yakuba]
 gi|194186229|gb|EDW99840.1| GE22863 [Drosophila yakuba]
          Length = 499

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 224/432 (51%), Positives = 293/432 (67%), Gaps = 52/432 (12%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ ++IR+LFV+QPVPQAV+ SW ++  AKE  EAT  LT L             
Sbjct: 66  ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW M+  LE DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 126 AWELSPTFKKSVRQVLLGGGKPWPMTNTLEKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 185

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                  E IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 186 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 245

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            E +G+ L ECL+ LFQL FST G DYS+EGM+ ++  FLQHLREFGLV+QRKRK GRFY
Sbjct: 246 CEERGISLPECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFY 305

Query: 209 PTKLALNMATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           PT+LALN+ ++            +  ++ G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 306 PTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELL 365

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
           YRFPNL VGVLTRDSVR ALR GITA QI+ +L  +A P++          SC LPP V 
Sbjct: 366 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRLVESAIQSKSC-LPPTVV 424

Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
           DQI+LWE ER R T +EGV+Y+QF+S  DF  LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 425 DQIKLWELERNRFTYTEGVVYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGH 484

Query: 370 SDIKKFWVNHQK 381
            D+K++W  + K
Sbjct: 485 DDVKRYWKKYSK 496


>gi|195327811|ref|XP_002030611.1| GM24480 [Drosophila sechellia]
 gi|194119554|gb|EDW41597.1| GM24480 [Drosophila sechellia]
          Length = 499

 Score =  447 bits (1150), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 224/432 (51%), Positives = 293/432 (67%), Gaps = 52/432 (12%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ ++IR+LFV+QPVPQAV+ SW ++  AKE  EAT  LT L             
Sbjct: 66  ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW M+  L+ DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 126 AWELSPTFKKSVRQVLLGGGKPWPMTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 185

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                  E IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 186 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 245

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            E +G+ L ECL+ LFQL FST G DYS+EGM+ ++  FLQHLREFGLV+QRKRK GRFY
Sbjct: 246 CEERGISLPECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFY 305

Query: 209 PTKLALNMATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           PT+LALN+ ++            +  ++ G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 306 PTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELL 365

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
           YRFPNL VGVLTRDSVR ALR GITA QI+ +L  +A P++          SC LPP V 
Sbjct: 366 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRMVESAIHSKSC-LPPTVV 424

Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
           DQI+LWE ER R T +EGVLY+QF+S  DF  LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 425 DQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGH 484

Query: 370 SDIKKFWVNHQK 381
            D+K++W  + K
Sbjct: 485 DDVKRYWKKYSK 496


>gi|21357821|ref|NP_648780.1| marionette [Drosophila melanogaster]
 gi|7294263|gb|AAF49614.1| marionette [Drosophila melanogaster]
 gi|220942562|gb|ACL83824.1| Tfb2-PA [synthetic construct]
          Length = 499

 Score =  447 bits (1150), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 224/432 (51%), Positives = 293/432 (67%), Gaps = 52/432 (12%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ ++IR+LFV+QPVPQAV+ SW ++  AKE  EAT  LT L             
Sbjct: 66  ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW M+  L+ DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 126 AWELSPTFKKSVRQVLLGGGKPWPMTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 185

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                  E IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 186 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 245

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            E +G+ L ECL+ LFQL FST G DYS+EGM+ ++  FLQHLREFGLV+QRKRK GRFY
Sbjct: 246 CEERGISLPECLSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFY 305

Query: 209 PTKLALNMATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           PT+LALN+ ++            +  ++ G+++VETN+RVYAYTDS L+VA++GLF EL+
Sbjct: 306 PTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELL 365

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVS 309
           YRFPNL VGVLTRDSVR ALR GITA QI+ +L  +A P++          SC LPP V 
Sbjct: 366 YRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRMVESAIHSKSC-LPPTVV 424

Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
           DQI+LWE ER R T +EGVLY+QF+S  DF  LRDYAQ + +L WQNE+ RT+VV K GH
Sbjct: 425 DQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGH 484

Query: 370 SDIKKFWVNHQK 381
            D+K++W  + K
Sbjct: 485 DDVKRYWKKYSK 496


>gi|321470926|gb|EFX81900.1| hypothetical protein DAPPUDRAFT_210868 [Daphnia pulex]
          Length = 463

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 224/423 (52%), Positives = 293/423 (69%), Gaps = 44/423 (10%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL E+++ Y++R+LFV+Q VP+A++ SW+S   A+E ++    LT+LR            
Sbjct: 41  ELPELSRIYIMRILFVDQAVPKAIMGSWVSPNSARELEDIVKLLTDLRVWQEVEMQGGLK 100

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                    RN K ALLGGG  WSM    + D KAR +  L++YA+ RWEC+L FMVGS 
Sbjct: 101 GWLLNPTFRRNLKGALLGGGNEWSMKPNTDADPKARGIAVLDEYAMGRWECVLHFMVGSH 160

Query: 100 QTEGISADAVRTLFNEG------AENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           Q E IS+DA++ L + G       EN + ITKDGFQFLL++T AQVWYF+L+YL+T  S+
Sbjct: 161 QHEAISSDALQILQHAGLMKKEPGENQLSITKDGFQFLLMDTSAQVWYFLLQYLDTANSR 220

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
            LDL++CL FLFQL FST G DYST+ M+D LQ FLQHLREFGLVYQRKRK GRFYPT+L
Sbjct: 221 NLDLIDCLGFLFQLSFSTLGQDYSTDSMNDGLQKFLQHLREFGLVYQRKRKDGRFYPTRL 280

Query: 213 ALNMAT---RGTLKQI----REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
           AL++A    +  L  +    +  G+++VETN+RVYAYTDS+L+VALI LFCEL+YRFPNL
Sbjct: 281 ALDIAAGPKKSMLNSLNASSQTTGYIVVETNYRVYAYTDSSLQVALIALFCELIYRFPNL 340

Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHAL------PSVSCPLPPVVSDQIRLWEGER 319
            VG++TR+SVR AL+ GITA QI+ FLR HA       P V   LP  ++DQI+LW  ER
Sbjct: 341 VVGIITRESVREALKRGITADQIVSFLRQHAHSECYKEPPV---LPSTIADQIKLWAIER 397

Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNH 379
           +R    +GVLY+QF SQADFE LR+YAQ+ G+L W     R +VVTK+GH +++KFW  H
Sbjct: 398 DRFLYKDGVLYNQFDSQADFEILRNYAQERGLLVWHTLNGRKMVVTKEGHDEVRKFWKRH 457

Query: 380 QKG 382
            + 
Sbjct: 458 SRN 460


>gi|195590491|ref|XP_002084979.1| GD12553 [Drosophila simulans]
 gi|194196988|gb|EDX10564.1| GD12553 [Drosophila simulans]
          Length = 500

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 224/433 (51%), Positives = 295/433 (68%), Gaps = 53/433 (12%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ ++IR+LFV+QPVPQAV+ SW ++  AKE  EAT  LT L             
Sbjct: 66  ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 125

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW M+  L+ DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 126 AWELSPTFKKSVRQVLLGGGKPWPMTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 185

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                  E IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 186 NRSGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 245

Query: 149 VESKGLDLVECLTFLFQL-KFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
            E +G+ L ECL+ LFQL +FST G DYS+EGM++++  FLQHLREFGLV+QRKRK GRF
Sbjct: 246 CEERGISLPECLSMLFQLTQFSTLGRDYSSEGMNNQMLTFLQHLREFGLVFQRKRKEGRF 305

Query: 208 YPTKLALNMATRGTL---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCEL 258
           YPT+LALN+ ++            +  ++ G+++VETN+RVYAYTDS L+VA++GLF EL
Sbjct: 306 YPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTEL 365

Query: 259 MYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVV 308
           +YRFPNL VGVLTRDSVR ALR GITA QI+ +L  +A P++          SC LPP V
Sbjct: 366 LYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMRMVESAIHSKSC-LPPTV 424

Query: 309 SDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
            DQI+LWE ER R T +EGVLY+QF+S  DF  LRDYAQ + +L WQNE+ RT+VV K G
Sbjct: 425 VDQIKLWELERNRFTYTEGVLYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNG 484

Query: 369 HSDIKKFWVNHQK 381
           H D+K++W  + K
Sbjct: 485 HDDVKRYWKKYSK 497


>gi|193633122|ref|XP_001945987.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Acyrthosiphon pisum]
          Length = 471

 Score =  443 bits (1139), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 211/412 (51%), Positives = 285/412 (69%), Gaps = 32/412 (7%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L +I + + +R+LF+EQPVPQ+V++SW+   +++E  E+ +  T L              
Sbjct: 58  LPDITQQFTLRILFIEQPVPQSVLSSWVPANYSRELDESIEVATNLHIWKLTSVSGGLKG 117

Query: 48  -------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                  ++  KVAL+GGG+    ++ +  D KARD+DFL+ YA ERWECIL +MVGS+ 
Sbjct: 118 WILNSTFKKKLKVALMGGGRSTVPNSDMTADPKARDIDFLDSYAYERWECILHYMVGSKH 177

Query: 101 TEGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
            EGIS+DAVR L N G       ++PVIT  GFQFLLL+   QVWYF+L+Y+ETVES+GL
Sbjct: 178 -EGISSDAVRVLLNAGLMVRDTDDSPVITSTGFQFLLLDMATQVWYFMLRYMETVESRGL 236

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           DL +CLTFLFQ+   T G DY T+ MS+ LQ FLQHLREFGLVYQRKRKAGRFYPT+L +
Sbjct: 237 DLAQCLTFLFQIHLGTLGWDYITDEMSENLQAFLQHLREFGLVYQRKRKAGRFYPTRLVI 296

Query: 215 NMA---TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
            M    +R + +   +  +++VETNFR+YA TDS+LKVAL+ LF  ++YRFPN++ G+LT
Sbjct: 297 EMGQGNSRTSERMKNKERYIVVETNFRIYAMTDSDLKVALVALFTHMLYRFPNMSAGILT 356

Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSV-SCPLPPVVSDQIRLWEGERERLTTSEGVLY 330
           RDSVR ALRSGITAAQI+ FL +H  P +  C +P  V DQI LWE ER RLT ++GVLY
Sbjct: 357 RDSVRTALRSGITAAQIVRFLTVHTHPQMQECGMPQTVIDQIYLWENERNRLTYTDGVLY 416

Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
           S   +  D+E +++YA D+G L W +E++R +VV+  GH D++KFW    K 
Sbjct: 417 SNINTPNDYETIKNYAADIGALVWCDERRRNIVVSTDGHDDVRKFWKKQPKS 468


>gi|147899535|ref|NP_001085110.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
           laevis]
 gi|47939830|gb|AAH72322.1| MGC83106 protein [Xenopus laevis]
          Length = 455

 Score =  432 bits (1112), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 219/416 (52%), Positives = 291/416 (69%), Gaps = 34/416 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK+YV+R+LF+EQP+PQA +A W+ K +  EH+E T  LT LR            
Sbjct: 41  ELPGLAKNYVMRMLFLEQPLPQAAVALWVKKENFNEHEENTQVLTGLRLWHTQQLPGGLQ 100

Query: 49  ---------RNAKVALLGGGKPWSMSAKL-EVDSKARDLDFLNQYALERWECILRFMVGS 98
                     N K+ALLGGGK W+    L   D   +D+  L++YA ERWE IL FMVGS
Sbjct: 101 GLILNPIFRENLKIALLGGGKAWADDGCLLGPDKHGKDVVSLDKYAEERWEVILHFMVGS 160

Query: 99  QQTEGISADAVRTLF------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
             +  +S D  + L       NE  E P I+  GFQFLLL+TP+Q+WYF+L+YL++ ES+
Sbjct: 161 P-SAAVSQDLAQLLIQAGLMKNESGEAPCISSAGFQFLLLDTPSQLWYFMLQYLKSAESR 219

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           G++LVE L+F+FQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 220 GMNLVEILSFMFQLSFSTLGKDYSVEGMSDSLLTFLQHLREFGLVFQRKRKSRRYYPTRL 279

Query: 213 ALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
           A+N+A+   G++    + GF++VETN+R+YAYTDS L++ALI LF E++YRFPNL V  +
Sbjct: 280 AINLASGISGSVVDSHKQGFIVVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQV 339

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEG 327
           TR++V+ A+ +GITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  SEG
Sbjct: 340 TRENVQQAIGNGITAEQIIHFLRTRAHPVMLQQNPALPPTITDQIRLWELERDRLRFSEG 399

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           VLY+QF+SQ DFE LR+YA+DLGVL ++N  KR +VVT  GHSD+K+FW   ++ S
Sbjct: 400 VLYNQFLSQVDFELLRNYARDLGVLVFENPAKRVMVVTPGGHSDVKRFWKRQKQSS 455


>gi|62751795|ref|NP_001015845.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
           (Silurana) tropicalis]
 gi|58475911|gb|AAH90134.1| general transcription factor II H, polypeptide 4 [Xenopus
           (Silurana) tropicalis]
 gi|89272862|emb|CAJ82116.1| transcription factor tfb2 [Xenopus (Silurana) tropicalis]
 gi|114107989|gb|AAI22900.1| gtf2h4 protein [Xenopus (Silurana) tropicalis]
          Length = 455

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 220/416 (52%), Positives = 291/416 (69%), Gaps = 34/416 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK+YV+R+LF+EQP+PQA +A W+ K + KEH+E T  LT LR            
Sbjct: 41  ELPGLAKNYVMRMLFLEQPLPQAAVALWVKKENFKEHEENTQVLTGLRLWHTQQLPGGLQ 100

Query: 49  ---------RNAKVALLGGGKPWSMSAKL-EVDSKARDLDFLNQYALERWECILRFMVGS 98
                     N K+ALLGGGKPW+    L   D   +D+  L++YA ERWE IL FMVGS
Sbjct: 101 GLILNPIFRENLKIALLGGGKPWADDGCLLGPDKHGKDVVSLDKYAEERWEVILHFMVGS 160

Query: 99  QQTEGISADAVRTLF------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
             +  +S D  + L       +E  E P I+  GFQFLLL+TP+Q+WYF+L+YL++ ES+
Sbjct: 161 P-SAAVSQDLAQLLIQAGLMKSESGEAPCISSAGFQFLLLDTPSQLWYFMLQYLKSAESR 219

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           G+ LVE L+F+FQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 220 GMILVEILSFMFQLSFSTLGKDYSVEGMSDSLLTFLQHLREFGLVFQRKRKSRRYYPTRL 279

Query: 213 ALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
           A+N+A+   G++    + GF++VETN+R+YAYTDS L++ALI LF E++YRFPNL V  +
Sbjct: 280 AINLASGISGSVVDSHKQGFIVVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQV 339

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEG 327
           TR++V+ A+ +GITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  SEG
Sbjct: 340 TRENVQQAIGNGITAEQIIHFLRTRAHPVMLQQNPVLPPTITDQIRLWELERDRLRFSEG 399

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           VLY+QF+SQ DFE LR+YA+DLGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 400 VLYNQFLSQVDFELLRNYARDLGVLVFENPAKRVMVVTPGGHSDVKRFWKRQKHSS 455


>gi|327266348|ref|XP_003217968.1| PREDICTED: general transcription factor IIH subunit 4-like [Anolis
           carolinensis]
          Length = 460

 Score =  431 bits (1107), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 221/417 (52%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK+YV+R+L +EQP+PQA +ASW+ K + KE +E++D L  LR            
Sbjct: 45  ELPGLAKNYVMRMLLLEQPLPQAAVASWVKKEYTKEQEESSDILLGLRLWHKQLLPGGLQ 104

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                     N +VALLGGGK WS  +++L  D  ARD+  L++YA ERWE IL FMVGS
Sbjct: 105 GIVLNPIFKENLRVALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVILDFMVGS 164

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+T +Q+WYF+L+YL++ E+
Sbjct: 165 P-SAAVSQDLAQLLTEAGLMKSSEPGEPPCITSSGFQFLLLDTSSQLWYFMLQYLQSAET 223

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMS+ L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 224 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSESLLNFLQHLREFGLVFQRKRKSRRYYPTR 283

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G     R  GF+IVETN+R+YAYTDS L++ALI LF E++YRFPNL V  
Sbjct: 284 LAINLSSGISGITIDTRNQGFIIVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQ 343

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ +GITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  SE
Sbjct: 344 VTRESVQQAIANGITADQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFSE 403

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE LRD+A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 404 GVLYNQFLSQVDFELLRDHARELGVLIFENPSKRLMVVTPAGHSDVKRFWKRQKHSS 460


>gi|47059175|ref|NP_997666.1| general transcription factor II H, polypeptide 4 [Rattus
           norvegicus]
 gi|46237651|emb|CAE84027.1| general transcription factor II H, polypeptide 4 [Rattus
           norvegicus]
 gi|117558335|gb|AAI27470.1| General transcription factor II H, polypeptide 4 [Rattus
           norvegicus]
          Length = 463

 Score =  426 bits (1095), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 216/417 (51%), Positives = 292/417 (70%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|395831929|ref|XP_003789035.1| PREDICTED: general transcription factor IIH subunit 4 [Otolemur
           garnettii]
          Length = 463

 Score =  426 bits (1095), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 216/417 (51%), Positives = 292/417 (70%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|4504201|ref|NP_001508.1| general transcription factor IIH subunit 4 [Homo sapiens]
 gi|113865855|ref|NP_001038968.1| general transcription factor IIH subunit 4 [Pan troglodytes]
 gi|114050799|ref|NP_001040607.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|297677636|ref|XP_002816695.1| PREDICTED: general transcription factor IIH subunit 4 isoform 2
           [Pongo abelii]
 gi|332245940|ref|XP_003272109.1| PREDICTED: general transcription factor IIH subunit 4 [Nomascus
           leucogenys]
 gi|426352309|ref|XP_004043656.1| PREDICTED: general transcription factor IIH subunit 4 [Gorilla
           gorilla gorilla]
 gi|17380328|sp|Q92759.1|TF2H4_HUMAN RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=Basic transcription factor 2 52 kDa subunit;
           Short=BTF2 p52; AltName: Full=General transcription
           factor IIH polypeptide 4; AltName: Full=TFIIH basal
           transcription factor complex p52 subunit
 gi|38503276|sp|P60027.1|TF2H4_PANTR RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=Basic transcription factor 2 52 kDa subunit;
           Short=BTF2 p52; AltName: Full=General transcription
           factor IIH polypeptide 4; AltName: Full=TFIIH basal
           transcription factor complex p52 subunit
 gi|1514597|emb|CAA68870.1| transcription factor TFIIH [Homo sapiens]
 gi|13436278|gb|AAH04935.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|15277224|dbj|BAB63317.1| Transcription factor II H [Homo sapiens]
 gi|16740884|gb|AAH16302.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|21655317|gb|AAM64222.1| general transcription factor IIH, polypeptide 4 (52kD subunit)
           [Homo sapiens]
 gi|27544403|dbj|BAC54936.1| transcription factor II H [Homo sapiens]
 gi|30583481|gb|AAP35985.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|32127781|dbj|BAC78171.1| transcription factor II H [Pan troglodytes]
 gi|55700798|dbj|BAD69753.1| general transcription factor IIH, polypeptide 4, 52kDa [Macaca
           mulatta]
 gi|60655839|gb|AAX32483.1| general transcription factor IIH polypeptide 4 [synthetic
           construct]
 gi|60655841|gb|AAX32484.1| general transcription factor IIH polypeptide 4 [synthetic
           construct]
 gi|86197966|dbj|BAE78622.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|90960934|dbj|BAE92819.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
 gi|90960936|dbj|BAE92820.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
 gi|114306784|dbj|BAF31271.1| TFIIH protein [Homo sapiens]
 gi|119623752|gb|EAX03347.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|123992979|gb|ABM84091.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
           construct]
 gi|123999907|gb|ABM87462.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
           construct]
 gi|355561507|gb|EHH18139.1| General transcription factor IIH polypeptide 4, partial [Macaca
           mulatta]
 gi|355748409|gb|EHH52892.1| General transcription factor IIH polypeptide 4 [Macaca
           fascicularis]
 gi|380812138|gb|AFE77944.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|380812140|gb|AFE77945.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|383408563|gb|AFH27495.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|410214852|gb|JAA04645.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
 gi|410250640|gb|JAA13287.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
 gi|410290944|gb|JAA24072.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
 gi|410352807|gb|JAA43007.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
          Length = 462

 Score =  426 bits (1095), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 216/417 (51%), Positives = 292/417 (70%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 47  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 106

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 107 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 166

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 167 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 225

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 226 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 285

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 286 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 345

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 346 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 405

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 406 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 462


>gi|410958682|ref|XP_003985944.1| PREDICTED: general transcription factor IIH subunit 4 [Felis catus]
          Length = 463

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 217/417 (52%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   S  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQSPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|6754094|ref|NP_034494.1| general transcription factor IIH subunit 4 [Mus musculus]
 gi|18202124|sp|O70422.1|TF2H4_MOUSE RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=Basic transcription factor 2 52 kDa subunit;
           Short=BTF2 p52; AltName: Full=General transcription
           factor IIH polypeptide 4; AltName: Full=TFIIH basal
           transcription factor complex p52 subunit
 gi|2997755|gb|AAC08594.1| TFIIH transcription/DNA repair factor p52 subunit [Mus musculus]
 gi|15215027|gb|AAH12638.1| Gtf2h4 protein [Mus musculus]
 gi|26348003|dbj|BAC37650.1| unnamed protein product [Mus musculus]
 gi|74194835|dbj|BAE26009.1| unnamed protein product [Mus musculus]
 gi|74222199|dbj|BAE26910.1| unnamed protein product [Mus musculus]
          Length = 463

 Score =  426 bits (1094), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 216/417 (51%), Positives = 292/417 (70%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|344307658|ref|XP_003422497.1| PREDICTED: general transcription factor IIH subunit 4 [Loxodonta
           africana]
          Length = 463

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 217/417 (52%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF+IVETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|281337500|gb|EFB13084.1| hypothetical protein PANDA_018954 [Ailuropoda melanoleuca]
          Length = 463

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 216/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|338718559|ref|XP_003363845.1| PREDICTED: general transcription factor IIH subunit 4 [Equus
           caballus]
          Length = 463

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 216/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|426250586|ref|XP_004019016.1| PREDICTED: general transcription factor IIH subunit 4 [Ovis aries]
          Length = 463

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 216/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHNS 463


>gi|178056954|ref|NP_001116592.1| general transcription factor IIH subunit 4 [Sus scrofa]
 gi|41529162|dbj|BAD08424.1| general transcription factor IIH, polypeptide 4 [Sus scrofa]
 gi|47496805|dbj|BAD08426.2| general transcription factor IIH, polypeptide 4 [Sus scrofa]
          Length = 463

 Score =  424 bits (1089), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 215/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLRE GLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLRELGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|359320969|ref|XP_003639474.1| PREDICTED: general transcription factor IIH subunit 4-like [Canis
           lupus familiaris]
          Length = 463

 Score =  423 bits (1088), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 217/417 (52%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPFLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF+IVETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|440896165|gb|ELR48176.1| General transcription factor IIH subunit 4 [Bos grunniens mutus]
          Length = 463

 Score =  423 bits (1087), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 216/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVLSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHNS 463


>gi|155371845|ref|NP_001094527.1| general transcription factor IIH subunit 4 [Bos taurus]
 gi|148878127|gb|AAI46241.1| GTF2H4 protein [Bos taurus]
 gi|296474219|tpg|DAA16334.1| TPA: general transcription factor IIH, polypeptide 4, 52kDa [Bos
           taurus]
          Length = 463

 Score =  423 bits (1087), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 216/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVLSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLMFENSAKRLMVVTPAGHSDVKRFWKRQKHNS 463


>gi|62896773|dbj|BAD96327.1| general transcription factor IIH, polypeptide 4, 52kDa variant
           [Homo sapiens]
          Length = 462

 Score =  423 bits (1087), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 215/417 (51%), Positives = 291/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 47  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 106

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 107 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 166

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 167 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 225

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 226 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 285

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 286 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 345

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE  R+RL  +E
Sbjct: 346 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELGRDRLRFTE 405

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 406 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 462


>gi|41053973|ref|NP_956221.1| general transcription factor IIH subunit 4 [Danio rerio]
 gi|39794736|gb|AAH64301.1| General transcription factor IIH, polypeptide 4 [Danio rerio]
          Length = 466

 Score =  422 bits (1086), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 215/428 (50%), Positives = 284/428 (66%), Gaps = 46/428 (10%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK+YV+R+LF++ P+PQA +A W+ K   K+H +    LT LR            
Sbjct: 40  ELPPLAKNYVMRMLFLDHPLPQAAVALWVKKGSQKDHDQCVSVLTGLRLWHSQHLQGGLQ 99

Query: 49  ---------RNAKVALLGGGKPWS-MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                     N ++ALLGGGKPW+   A L  D  ARD++ L++YA+ERWE IL FMVGS
Sbjct: 100 GFVLNPVFKDNLRIALLGGGKPWADEGANLGPDRHARDVESLDRYAMERWEVILHFMVGS 159

Query: 99  QQTEGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
             +  +S D  + L   G       E P IT  GFQFLLL+T +Q+WYF L+YL+T +S+
Sbjct: 160 P-SAAVSQDLAQLLIQAGLMKSETGEAPCITSAGFQFLLLDTASQLWYFTLQYLKTAQSR 218

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           G+DLVE L+FLFQL FST G DYS EGMS+ L  FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 219 GMDLVEILSFLFQLSFSTLGRDYSVEGMSESLLTFLQHLREFGLVFQRKRKSRRYYPTRL 278

Query: 213 ALNMATR--------------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCEL 258
           A+ +A                G +    + GF++VETN+R+YAYT+S L++AL+ LF E+
Sbjct: 279 AITLAAGVTANPASGSASSALGAIPGTGDTGFIVVETNYRIYAYTNSELQIALVALFSEM 338

Query: 259 MYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLW 315
           +YRFPNL V  +TR+SV+ A+ +GITA QII FLR  A P +   +  LPP ++DQIRLW
Sbjct: 339 LYRFPNLVVAQVTRESVQQAISNGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLW 398

Query: 316 EGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           E E++RL  +EGVLY+QF+SQADFE LRD AQ LGVL WQN   R +VVT  GHS++K+F
Sbjct: 399 ELEKDRLQFTEGVLYNQFLSQADFEVLRDRAQGLGVLVWQNPAHRVMVVTPHGHSEVKRF 458

Query: 376 WVNHQKGS 383
           W   +  S
Sbjct: 459 WKRQKSHS 466


>gi|417401351|gb|JAA47565.1| Putative rna polymer [Desmodus rotundus]
          Length = 463

 Score =  421 bits (1082), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 214/417 (51%), Positives = 287/417 (68%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK +V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKSWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSSLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D   RD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHTRDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
                +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 PDA-AVSQDLAQLLIQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GH D+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHGDVKRFWKRQKHSS 463


>gi|351713119|gb|EHB16038.1| General transcription factor IIH subunit 4 [Heterocephalus glaber]
          Length = 463

 Score =  420 bits (1080), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 217/417 (52%), Positives = 290/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGRDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNM--ATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+     G    + +PGF++VETN+R+YAYT+S L+VALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGSVHQPGFIVVETNYRLYAYTESELQVALIALFSEMLYRFPNVVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIGSGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|74143475|dbj|BAE28812.1| unnamed protein product [Mus musculus]
          Length = 463

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 213/417 (51%), Positives = 289/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+ PAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDMPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+  + SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQVIASGITAQQIIHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ  FE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVVFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|156378083|ref|XP_001630974.1| predicted protein [Nematostella vectensis]
 gi|156218005|gb|EDO38911.1| predicted protein [Nematostella vectensis]
          Length = 471

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 211/422 (50%), Positives = 285/422 (67%), Gaps = 41/422 (9%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL E+AKH+V+R LF +QPVP++++++W+   + K +  + +TL +LR            
Sbjct: 50  ELPELAKHFVMRTLFADQPVPESIVSTWVKSAYYKYNIGSLNTLKQLRIWREVPSGVHKR 109

Query: 49  --------RNAKVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                    N K AL GGGK W  S + L  D   R+ DFL++YA+ERWE +L FM GS 
Sbjct: 110 YEMNATFRTNMKAALCGGGKSWMGSTQHLGPDKHTREPDFLDKYAIERWESVLHFMTGST 169

Query: 100 QTE----GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLET 148
           +      G+S D V+ L   G         NP+I+  GFQFLLL+ P+QVWYF+L+ LET
Sbjct: 170 EMADNAGGVSQDVVKVLVLSGLMKCESPGSNPIISPAGFQFLLLDRPSQVWYFMLQCLET 229

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
           VE++G+DLVECL+ LFQL FS+ G DY T+G++D    FLQ LRE GLV+QRKRK+ R+Y
Sbjct: 230 VEARGMDLVECLSLLFQLSFSSPGKDYPTDGLTDSQMKFLQQLREIGLVFQRKRKSRRYY 289

Query: 209 PTKLALNMATRG-----TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
           PTKL++N+   G     T  QI E GF+I+ETN+RVYAYT+S L+V+LIGLFCE++ RFP
Sbjct: 290 PTKLSVNLTAAGKGINTTDSQI-EAGFIIIETNYRVYAYTESCLQVSLIGLFCEILCRFP 348

Query: 264 NLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERE 320
           NL V  LTR+S + AL SGI+A QI+ FL+  A P +   +  +P  +SDQ+RLWE ER 
Sbjct: 349 NLCVASLTRESCQQALASGISAEQILNFLQTRAHPEMLKRTPIIPSTISDQVRLWEMERS 408

Query: 321 RLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQ 380
           R+  +EGVLY+QF+SQADFE LR YA+DLGVL W N  KR VVV++ GH D+K+FW   +
Sbjct: 409 RMKFTEGVLYNQFLSQADFEMLRKYAEDLGVLIWANSTKRVVVVSRSGHDDVKRFWKRQR 468

Query: 381 KG 382
           +G
Sbjct: 469 QG 470


>gi|344252968|gb|EGW09072.1| General transcription factor IIH subunit 4 [Cricetulus griseus]
          Length = 463

 Score =  417 bits (1072), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 215/417 (51%), Positives = 289/417 (69%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+T AQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTSAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNM--ATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+     G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGSVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 463


>gi|391335225|ref|XP_003741996.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Metaseiulus occidentalis]
          Length = 457

 Score =  416 bits (1069), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 214/418 (51%), Positives = 283/418 (67%), Gaps = 40/418 (9%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +A+ YVIRLL + Q VPQAVI SW     A++H  + D L  LR             
Sbjct: 39  LPPLARQYVIRLLHIRQAVPQAVINSWYDPEAARDHTSSEDALLSLRLWHETTLPGGLPG 98

Query: 49  --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                      + AL GGG PW +  +L+ D   RD  FL+QYA ERW+C+L +MVGS+ 
Sbjct: 99  WQLNPTFQEMLRQALAGGGDPWIVYGELDKDKHGRDTKFLDQYAQERWDCVLHYMVGSEV 158

Query: 101 TEGISADAVRTLFNEG-----AENP----VITKDGFQFLLLETPAQVWYFILKYLETVES 151
             GIS DAVR L + G      ++P    +IT +GFQFLL++T  QVWYFIL+YL TVE+
Sbjct: 159 ESGISQDAVRILLHSGLMKKEDDSPNSKNLITMEGFQFLLMDTEDQVWYFILQYLSTVET 218

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+ LV+CL F+FQL F T G DYS +G+S+ L VFLQHLREFGLVYQRKRK+GRFYPT+
Sbjct: 219 RGVSLVDCLQFIFQLSFLTLGKDYSIKGVSENLLVFLQHLREFGLVYQRKRKSGRFYPTR 278

Query: 212 LALNMATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
           LA+ +A+   LK+++    E G++IVETN+RVYAYTDS L+VAL+ LFCEL+YRFPNL V
Sbjct: 279 LAIGLAS--GLKELQATKDEQGYIIVETNYRVYAYTDSPLQVALLSLFCELLYRFPNLLV 336

Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHA----LPSVSCPLPPVVSDQIRLWEGERERLT 323
            VLTR+SVR AL+ GIT+ QI  FL+  +    L      +P  V+DQ+RLWE ER+R  
Sbjct: 337 AVLTRESVRQALKGGITSNQITHFLKSRSHRVVLEREEGIIPMTVTDQLRLWELERDRFK 396

Query: 324 TSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
             E VLYSQF +  DFE LR+YA+D+GVL ++N +KR +VV+K G +++++FW  H+K
Sbjct: 397 FGESVLYSQFQTTGDFEMLRNYARDIGVLKYENPQKRFLVVSKSGDAEVRQFWKRHKK 454


>gi|348542188|ref|XP_003458568.1| PREDICTED: general transcription factor IIH subunit 4 [Oreochromis
           niloticus]
          Length = 461

 Score =  415 bits (1066), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 212/417 (50%), Positives = 280/417 (67%), Gaps = 43/417 (10%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK+YV+R+LF++QP+PQA +A W+ K   K+H E    L  LR            
Sbjct: 40  ELPSLAKNYVMRMLFLDQPLPQAAVALWVKKDSQKDHDECISVLAGLRLWHSQQLQGGLQ 99

Query: 49  ---------RNAKVALLGGGKPWS-MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                     N K ALLGGG+ W+   + L  D  ARD++ L++YA+ERWE IL+FMVGS
Sbjct: 100 GYILNPVFKDNLKTALLGGGRAWADEGSTLGPDRHARDIESLDRYAMERWEIILQFMVGS 159

Query: 99  QQTEGISADAVRTLF------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
                +S D  + L       +E  E P IT  GFQFLLL+T +Q+WYF L+YL+T +S+
Sbjct: 160 --PSAVSQDLAQLLVQAGLMKSEAGEAPYITSAGFQFLLLDTASQLWYFTLQYLKTAQSR 217

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           G+DLVE L+FLFQL FS+ G DYS EGMS+ L  FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 218 GMDLVEILSFLFQLSFSSLGRDYSVEGMSESLLTFLQHLREFGLVFQRKRKSRRYYPTRL 277

Query: 213 ALNMATRGTLKQIR----------EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
           A+ +A   T               + GF++VETN+R+YAYT+S L++AL+ LF E++YRF
Sbjct: 278 AITLAAGVTSSSSVSNLSSSPGTGDAGFIVVETNYRIYAYTNSELQIALVALFSEMLYRF 337

Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGER 319
           PN+ V  LTR+SV+ A+ +GITA QII FLR  A P +   +  LPP ++DQIRLWE ER
Sbjct: 338 PNVVVAQLTRESVQQAIANGITAQQIIHFLRTRAHPVMLTQTPVLPPTITDQIRLWELER 397

Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
           +RL  +EGVLY+QF+SQADFE LRD AQ LG L WQ+   R +VVT QGHS++K+FW
Sbjct: 398 DRLQFTEGVLYNQFLSQADFEVLRDRAQGLGCLVWQDVAHRVMVVTPQGHSEVKRFW 454


>gi|339521903|gb|AEJ84116.1| general transcription factor IIH subunit 4 [Capra hircus]
          Length = 463

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 212/417 (50%), Positives = 286/417 (68%), Gaps = 35/417 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHAQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
                +S D  + L   G        E P IT  GFQFLLL+TPA +WYF+L+YL+T +S
Sbjct: 168 PSV-AVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAHLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF++VETN+R+YAYT+  L++A I LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTEPELQIARIALFSEMLYRFPNMVVPQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSE 326
           +TR SV+ A+ SGITA QI+ FLR  A P +   +  LPP ++DQIRLWE ER+RL  +E
Sbjct: 347 VTRGSVQQAIASGITAQQIVHFLRTRADPVMLKQTPVLPPTITDQIRLWELERDRLRFTE 406

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           GVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 407 GVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKDNS 463


>gi|432883135|ref|XP_004074222.1| PREDICTED: general transcription factor IIH subunit 4-like [Oryzias
           latipes]
          Length = 460

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 216/420 (51%), Positives = 281/420 (66%), Gaps = 50/420 (11%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK+YV+R+LF+EQP+PQA +A W+ K   ++H E    LT LR            
Sbjct: 40  ELPLLAKNYVMRMLFLEQPLPQAAVALWVRKDGQRDHDECVSVLTGLRLWHSQQLQGGLQ 99

Query: 49  ---------RNAKVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGS 98
                     N ++ALLGGGK W+     L  D  ARD++ L++YA+ERWE IL+FMVGS
Sbjct: 100 GYILNPVFKDNLRIALLGGGKVWADEGIILGPDRHARDIESLDRYAMERWEVILQFMVGS 159

Query: 99  QQTEGISADAVRTLF------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
                +S D  + L       +E  E P IT  GFQFLLL+T +Q+WY  L+YL T +S+
Sbjct: 160 --PSAVSQDLAQLLVQAGLMKSEAGEAPYITSAGFQFLLLDTASQLWYLTLQYLNTAQSR 217

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           G++LVE L+FLFQL FST G DYS EGMS+ L  FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 218 GMELVEILSFLFQLSFSTLGRDYSVEGMSESLLTFLQHLREFGLVFQRKRKSRRYYPTRL 277

Query: 213 ALNMA-------------TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           A+ +A             T GT     + GF++VETN+R+YAYT+S L++AL+ LF E++
Sbjct: 278 AITLAAGVSSNSPSNMTNTPGT----GDAGFIVVETNYRIYAYTNSELQIALVALFSEML 333

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWE 316
           YRFPN+ V  LTR+SV+ A+ +GITA QII FLR  A P +   S  LPP ++DQIRLWE
Sbjct: 334 YRFPNVVVAHLTRESVQQAIANGITAQQIIHFLRTRAHPVMLKQSPALPPTITDQIRLWE 393

Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
            ER+RL  +EGVLY+QF+SQADFE LRD AQ LG L WQ+   R +VVT QGHS++KKFW
Sbjct: 394 LERDRLQFTEGVLYNQFLSQADFEVLRDRAQGLGCLVWQDASHRAMVVTPQGHSEVKKFW 453


>gi|410911638|ref|XP_003969297.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Takifugu rubripes]
          Length = 465

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 211/420 (50%), Positives = 279/420 (66%), Gaps = 45/420 (10%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF++QP+PQA ++ W++K + K+H E    L  LR            
Sbjct: 40  ELPSLAKNFVMRMLFLDQPLPQAAVSLWVNKDNQKDHDECVSVLAGLRLWHSQHLQGGLQ 99

Query: 49  ---------RNAKVALLGGGKPWS-MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                     N K+ALLGGG+ W+     L  D  ARD+  L++YA ERWE IL FMVGS
Sbjct: 100 GYILNPVFKDNLKIALLGGGRAWADEGGTLGPDRHARDIGSLDRYATERWEVILHFMVGS 159

Query: 99  QQTEGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
                +S D  + L + G       E P IT  GFQFLLL+T +Q+WYF L+YL+T +S+
Sbjct: 160 P-CAAVSQDLAQLLVHAGLMKSEAGEPPYITSAGFQFLLLDTASQLWYFTLQYLKTAQSR 218

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+L
Sbjct: 219 GMDLVEILSFLFQLSFSTLGRDYSVEGMSDSLLTFLQHLREFGLVFQRKRKSRRYYPTRL 278

Query: 213 ALNMATRGTLKQIREP-------------GFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           A+ +AT  +   +  P             GF++VETN+R+YAYT+S L++AL+ LF E++
Sbjct: 279 AITLATGDSSSSLHTPTASLASTPGSGDSGFIVVETNYRIYAYTNSELQIALVALFSEML 338

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCP-LPPVVSDQIRLWE 316
           YRFPN+ V  +TR+SV+ A+ +GITA QII FLR  A P +    P LPP ++DQIRLWE
Sbjct: 339 YRFPNVVVAQVTRESVQQAIANGITAQQIIHFLRTRAHPVMLRQTPFLPPTITDQIRLWE 398

Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
            ER+RL  +EGVLY+QF+SQ DFE LRD A+ LG L WQ+   R +VVT  GHS++KKFW
Sbjct: 399 LERDRLQFTEGVLYNQFLSQTDFEVLRDRAKSLGCLVWQDAAHRVMVVTLWGHSEVKKFW 458


>gi|72074363|ref|XP_796111.1| PREDICTED: general transcription factor IIH subunit 4
           [Strongylocentrotus purpuratus]
          Length = 469

 Score =  409 bits (1052), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 201/419 (47%), Positives = 286/419 (68%), Gaps = 37/419 (8%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  + KH ++R+LF++ P+PQA + SW+     ++   A  TLT+LR            
Sbjct: 48  ELPVLGKHCIMRVLFIDTPIPQAAVTSWIQSNQQEQLHAALKTLTDLRLFRDQSLPGGLP 107

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEVDSK---ARDLDFLNQYALERWECILRFMV 96
                     N K AL+GGGKPW++S K     K    ++   L++Y+ ERWEC+L F+V
Sbjct: 108 GWLLNLTFRTNLKTALIGGGKPWAISGKGGKGGKDKKVKETATLDKYSSERWECVLHFLV 167

Query: 97  GSQQT-EGISADAVRTL-------FNEGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
           GS +  + +S D    L         E    PVIT  GFQFLLL+TP+QVW+F+L+YLET
Sbjct: 168 GSSKAVDTLSRDIATVLTHSGLMRLGESGGTPVITPSGFQFLLLDTPSQVWFFMLQYLET 227

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            +++GLD+V+ L+FLFQL FST G DYS+EGM+++   FLQHLRE GLV+QRKRK+ R+Y
Sbjct: 228 SQARGLDIVDALSFLFQLSFSTLGKDYSSEGMTEQQLHFLQHLRELGLVFQRKRKSMRYY 287

Query: 209 PTKLALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT 266
           PT+LA+N+A+      K   + GF++VETNFRVYAYT+S+L+V ++GLFC +MYRFPNL+
Sbjct: 288 PTRLAINLASGVSSMAKDDHKDGFIVVETNFRVYAYTESDLQVEILGLFCSMMYRFPNLS 347

Query: 267 VGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPL-PPVVSDQIRLWEGERERLT 323
           V  LTR+SV+ A+ +GITA QI+ FLR HA P++    P+ PP +SDQ+RLWE ER+RL+
Sbjct: 348 VAALTRESVQLAISNGITAEQILSFLRTHAHPNMRLKTPIVPPTISDQVRLWELERDRLS 407

Query: 324 TSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
            ++G++Y++F+S  DFE LRDYA+DLGVL W +  +R ++V+  GH  +KK+W   +KG
Sbjct: 408 FTQGIIYNEFLSLHDFEVLRDYAKDLGVLIWDSTARRIMIVSPAGHDSVKKYWKRLKKG 466


>gi|431907067|gb|ELK11185.1| General transcription factor IIH subunit 4 [Pteropus alecto]
          Length = 447

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 208/414 (50%), Positives = 283/414 (68%), Gaps = 45/414 (10%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF++QP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLDQPLPQAAVALWVKKEFSKAQEESTGLLSSLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G      +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 287 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 346

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVL 329
           +TR+SV+ A+ SGITA Q             +  LPP ++DQIRLWE ER+RL  +EGVL
Sbjct: 347 VTRESVQQAIASGITAQQ-------------TPVLPPTITDQIRLWELERDRLRFTEGVL 393

Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           Y+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 394 YNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 447


>gi|334323685|ref|XP_003340422.1| PREDICTED: valyl-tRNA synthetase, mitochondrial-like [Monodelphis
           domestica]
          Length = 1641

 Score =  403 bits (1035), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 217/441 (49%), Positives = 289/441 (65%), Gaps = 66/441 (14%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N +VALLGGGK WS  +++L  D  ARD+  L++YA ERWE IL FMVGS
Sbjct: 108 GLILNPVFRQNLRVALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVILHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSSEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 286

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAY--------------------------- 242
           LA+N+++   G      +PGF++VETN+R+YAY                           
Sbjct: 287 LAINLSSGVTGAGGTAHQPGFIVVETNYRLYAYTGEVGGMRMGIGMLEGRNCGFFGIILF 346

Query: 243 ----TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP 298
               T+S L++ALI LF E++YRFPN+ V  +TR+SV+ A+ +GITA QII FL+  A P
Sbjct: 347 FSLATESELQIALIALFSEMLYRFPNMVVAQITRESVQQAIANGITAQQIIHFLKTRAHP 406

Query: 299 SV--SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
            +    P LPP ++DQIRLWE ER+RL  SEGVLY+QF+SQ DFE L  +A++LGVL ++
Sbjct: 407 VMLKQIPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLLSHARELGVLVFE 466

Query: 356 NEKKRTVVVTKQGHSDIKKFW 376
           N  KR +VVT  GHSD+K+FW
Sbjct: 467 NTAKRLMVVTPAGHSDVKRFW 487


>gi|395533817|ref|XP_003768949.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Sarcophilus harrisii]
          Length = 1638

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 216/451 (47%), Positives = 291/451 (64%), Gaps = 69/451 (15%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 81  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 140

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N +VALLGGGK WS  +++L  D  ARD+  L++YA ERWE IL FMVGS
Sbjct: 141 GFILNPVFRQNLRVALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVILHFMVGS 200

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 201 P-SAAVSQDLAQLLSQAGLMKSSEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 259

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 260 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 319

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAY--------------------------- 242
           LA+N+++   G      +PGF++VETN+R+YAY                           
Sbjct: 320 LAINLSSGVTGAGGTAHQPGFIVVETNYRLYAYTGEVGGIKMRIEIRGGGIGGELNLLWS 379

Query: 243 -------TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
                  T+S L++ALI LF E++YRFPN+ V  +TR+SV+ A+ +GITA QII F++  
Sbjct: 380 SIVFSLETESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIANGITAQQIIHFMKTR 439

Query: 296 ALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
           A P +   +  LPP ++DQIRLWE ER+RL  SEGVLY+QF+SQ DFE L  +A++LGVL
Sbjct: 440 AHPVMLKQTPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLLSHARELGVL 499

Query: 353 TWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
            ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 500 VFENTAKRLMVVTPAGHSDVKRFWKRQKHSS 530


>gi|402866404|ref|XP_003897374.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial,
           partial [Papio anubis]
          Length = 1647

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 213/448 (47%), Positives = 290/448 (64%), Gaps = 68/448 (15%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
           + K++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR                
Sbjct: 97  LGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLIL 156

Query: 49  -----RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
                +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS  + 
Sbjct: 157 NPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SA 215

Query: 103 GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
            +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+D
Sbjct: 216 AVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMD 275

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N
Sbjct: 276 LVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAIN 335

Query: 216 MAT--RGTLKQIREPGFLIVETNFRVYAYT------------------------------ 243
           +++   G    + +PGF++VETN+R+YAYT                              
Sbjct: 336 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEVGQSAPEEKKEGLEGVWVWGWPPHPLSI 395

Query: 244 ---DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
              +S L++ALI LF E++YRFPN+ V  +TR+SV+ A+ SGITA QII FLR  A P +
Sbjct: 396 PGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVM 455

Query: 301 ---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
              +  LPP ++DQIRLWE ER+RL  +EGVLY+QF+SQ DFE L  +A++LGVL ++N 
Sbjct: 456 LKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENS 515

Query: 358 KKRTVVVTKQGHSDIKKFWVNHQKGSQH 385
            KR +VVT  GHSD+K+FW   +  S+ 
Sbjct: 516 AKRLMVVTPAGHSDVKRFWKRQKHSSRQ 543


>gi|62088538|dbj|BAD92716.1| VARS2L protein variant [Homo sapiens]
          Length = 1653

 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 213/452 (47%), Positives = 290/452 (64%), Gaps = 73/452 (16%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
           + K++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR                
Sbjct: 98  LGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLIL 157

Query: 49  -----RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
                +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS  + 
Sbjct: 158 NPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SA 216

Query: 103 GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
            +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+D
Sbjct: 217 AVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMD 276

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N
Sbjct: 277 LVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAIN 336

Query: 216 MAT--RGTLKQIREPGFLIVETNFRVYAYT------------------------------ 243
           +++   G    + +PGF++VETN+R+YAYT                              
Sbjct: 337 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEAGQRAPGRGGWGHLKEGLEGVWVWGWPP 396

Query: 244 --------DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
                   +S L++ALI LF E++YRFPN+ V  +TR+SV+ A+ SGITA QII FLR  
Sbjct: 397 LPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTR 456

Query: 296 ALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
           A P +   +  LPP ++DQIRLWE ER+RL  +EGVLY+QF+SQ DFE L  +A++LGVL
Sbjct: 457 AHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVL 516

Query: 353 TWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
            ++N  KR +VVT  GHSD+K+FW   +  S+
Sbjct: 517 VFENSAKRLMVVTPAGHSDVKRFWKRQKHSSR 548


>gi|403308524|ref|XP_003944708.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Saimiri boliviensis boliviensis]
          Length = 1752

 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 213/451 (47%), Positives = 289/451 (64%), Gaps = 73/451 (16%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
           + K++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR                
Sbjct: 196 LGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLIL 255

Query: 49  -----RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
                +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS  + 
Sbjct: 256 NPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SA 314

Query: 103 GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
            +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+D
Sbjct: 315 AVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMD 374

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N
Sbjct: 375 LVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAIN 434

Query: 216 MAT--RGTLKQIREPGFLIVETNFRVYAYT------------------------------ 243
           +++   G    + +PGF++VETN+R+YAYT                              
Sbjct: 435 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEVGQRAPGRAGWGRGNARLEGVWVWGWPP 494

Query: 244 --------DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
                   +S L++ALI LF E++YRFPN+ V  +TR+SV+ A+ SGITA QII FLR  
Sbjct: 495 HPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTR 554

Query: 296 ALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
           A P +   +  LPP ++DQIRLWE ER+RL  +EGVLY+QF+SQ DFE L  +A++LGVL
Sbjct: 555 AHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVL 614

Query: 353 TWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
            ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 615 VFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 645


>gi|301786953|ref|XP_002928911.1| PREDICTED: LOW QUALITY PROTEIN: valyl-tRNA synthetase,
           mitochondrial-like [Ailuropoda melanoleuca]
          Length = 1637

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 215/459 (46%), Positives = 291/459 (63%), Gaps = 77/459 (16%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           +L  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 71  QLPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 130

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 131 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 190

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 191 P-SAAVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 249

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 250 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 309

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYT-------------------------- 243
           LA+N+++   G      +PGF++VETN+R+YAYT                          
Sbjct: 310 LAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTGEARQRAPNSRXGVGTFLVTLEGLEG 369

Query: 244 ----------------DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQ 287
                           +S L++ALI LF E++YRFPN+ V  +TR+SV+ A+ SGITA Q
Sbjct: 370 VWVWGXPSSSLPIPGPESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQ 429

Query: 288 IIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
           II FLR  A P +   +  LPP ++DQIRLWE ER+RL  +EGVLY+QF+SQ DFE L  
Sbjct: 430 IIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLA 489

Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           +A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 490 HARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 528


>gi|390461382|ref|XP_002746360.2| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Callithrix jacchus]
          Length = 1675

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 213/452 (47%), Positives = 289/452 (63%), Gaps = 74/452 (16%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
           + K++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR                
Sbjct: 118 LGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLIL 177

Query: 49  -----RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
                +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS  + 
Sbjct: 178 NPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SA 236

Query: 103 GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
            +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+D
Sbjct: 237 AVSQDLAQLLSQAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMD 296

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N
Sbjct: 297 LVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAIN 356

Query: 216 MAT--RGTLKQIREPGFLIVETNFRVYAYT------------------------------ 243
           +++   G    + +PGF++VETN+R+YAYT                              
Sbjct: 357 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEAGQRAPGRAGWGXHLKERFEGVWVWGWP 416

Query: 244 ---------DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRL 294
                    +S L++ALI LF E++YRFPN+ V  +TR+SV+ A+ SGITA QII FLR 
Sbjct: 417 PHPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRT 476

Query: 295 HALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
            A P +   +  LPP ++DQIRLWE ER+RL  +EGVLY+QF+SQ DFE L  +A++LGV
Sbjct: 477 RAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGV 536

Query: 352 LTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           L ++N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 537 LVFENSAKRLMVVTPAGHSDVKRFWKRQKHSS 568


>gi|193654859|ref|XP_001944632.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Acyrthosiphon pisum]
          Length = 470

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/413 (47%), Positives = 275/413 (66%), Gaps = 33/413 (7%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L +IA+ + +R+LF+EQPVP + ++SW+   +++E +E+ +    L              
Sbjct: 58  LPDIAQVFTLRILFIEQPVPLSALSSWVPAKYSRELEESIEVAINLHIWKLTSVSGGLKG 117

Query: 48  -------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                  ++  KVA++ GG+  ++    E   KARD+D L+ YA ERWECIL +MVGS+ 
Sbjct: 118 WILNSTFKKKLKVAIMSGGRT-TVPNSDETALKARDIDILDSYAYERWECILYYMVGSKH 176

Query: 101 TEGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
            +GIS+DAVR L N G        +PVIT  G+QFLLL+   QVWYF+L Y+ETVES+GL
Sbjct: 177 -KGISSDAVRVLLNAGLMVRDTDNSPVITSTGYQFLLLDMATQVWYFMLHYMETVESRGL 235

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           DL + L FLFQ+  ST G DY T+GM + LQ FLQHL+EFGLVYQRK KAGRF PT+L +
Sbjct: 236 DLAQYLIFLFQIHLSTLGWDYITDGMPENLQTFLQHLQEFGLVYQRKLKAGRFCPTRLVI 295

Query: 215 NMA---TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
            M    +  + +  ++  +++VETNFR+YA TDS+LKVAL+ LF  ++YRFPN++ G+LT
Sbjct: 296 EMGRENSHTSKRMKKKERYIVVETNFRIYAMTDSDLKVALVALFTHMLYRFPNMSAGILT 355

Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSV-SCPLPPVVSDQIRLWEGERERLTTSEGVLY 330
           RDSV+ ALR+GITAAQI+ FL +H  P +  C +P  V DQI LWE ER RLT  +GVLY
Sbjct: 356 RDSVQTALRNGITAAQIVRFLTVHTHPQMQECGMPQTVIDQIYLWEYERNRLTYRDGVLY 415

Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           S   +  D+E +++YA D+G L W +E++R ++V+  GH+D+  FW    K +
Sbjct: 416 SDINTPNDYEAIKNYAADIGALLWCDERQRNIIVSTDGHNDVTTFWKKQPKSN 468


>gi|397471696|ref|XP_003807420.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Pan paniscus]
          Length = 1673

 Score =  397 bits (1019), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 213/459 (46%), Positives = 290/459 (63%), Gaps = 80/459 (17%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
           + K++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR                
Sbjct: 111 LGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLIL 170

Query: 49  -----RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
                +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS  + 
Sbjct: 171 NPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SA 229

Query: 103 GISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
            +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+D
Sbjct: 230 AVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMD 289

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N
Sbjct: 290 LVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAIN 349

Query: 216 MAT--RGTLKQIREPGFLIVETNFRVYAYT------------------------------ 243
           +++   G    + +PGF++VETN+R+YAYT                              
Sbjct: 350 LSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEAGQRAPGRGGWGALXHFSXHLKEGLEGV 409

Query: 244 ---------------DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
                          +S L++ALI LF E++YRFPN+ V  +TR+SV+ A+ SGITA QI
Sbjct: 410 WVWGCPPHPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 469

Query: 289 IGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
           I FLR  A P +   +  LPP ++DQIRLWE ER+RL  +EGVLY+QF+SQ DFE L  +
Sbjct: 470 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 529

Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
           A++LGVL ++N  KR +VVT  GHSD+K+FW   +  S+
Sbjct: 530 ARELGVLVFENSAKRLMVVTPAGHSDVKRFWKRQKHSSR 568


>gi|16198089|gb|AAL13841.1| LD30622p [Drosophila melanogaster]
          Length = 350

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 188/349 (53%), Positives = 245/349 (70%), Gaps = 31/349 (8%)

Query: 64  MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS-----QQTEGISADAVRTLFN---- 114
           M+  L+ DSK RD+ FL+ YA+ RW C+L +MVG+        E IS DAVR L +    
Sbjct: 1   MTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTGNRNGTDAEAISPDAVRILLHANLM 60

Query: 115 --EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFG 172
             +  +   IT+ GFQFLLL+T AQVW+F+L+YL+T E +G+ L ECL+ LFQL FST G
Sbjct: 61  KRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGISLPECLSMLFQLSFSTLG 120

Query: 173 TDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTL---------K 223
            DYS+EGM+ ++  FLQHLREFGLV+QRKRK GRFYPT+LALN+ ++            +
Sbjct: 121 RDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFYPTRLALNVTSKEAAATASVAMDEE 180

Query: 224 QIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGI 283
             ++ G+++VETN+RVYAYTDS L+VA++GLF EL+YRFPNL VGVLTRDSVR ALR GI
Sbjct: 181 ATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGI 240

Query: 284 TAAQIIGFLRLHALPSV----------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           TA QI+ +L  +A P++          SC LPP V DQI+LWE ER R T +EGVLY+QF
Sbjct: 241 TAEQIVSYLEQYAHPNMRMVESAIHSKSC-LPPTVVDQIKLWELERNRFTYTEGVLYNQF 299

Query: 334 VSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
           +S  DF  LRDYAQ + +L WQNE+ RT+VV K GH D+K++W  + K 
Sbjct: 300 LSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSKS 348


>gi|198428509|ref|XP_002131018.1| PREDICTED: similar to General transcription factor II H,
           polypeptide 4 [Ciona intestinalis]
          Length = 463

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 186/421 (44%), Positives = 274/421 (65%), Gaps = 39/421 (9%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL E+AK+Y++R+LFVEQP+ +A +++W+     ++H EA  ++  LR            
Sbjct: 42  ELPELAKYYIMRILFVEQPISKAAVSAWVKVNAKQDHNEAVKSMCSLRVWMESNLQGSAS 101

Query: 49  ----------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                     RN + AL+GGG+PWS +A L  D   +D++ L++YA ERWE +L ++VGS
Sbjct: 102 TAFIMSSIFRRNLQKALVGGGEPWSSTAHLGPDKHGKDIESLDKYASERWEMLLHYLVGS 161

Query: 99  QQTEGISADAVRTLFNE----------GAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
           +    IS D ++ L N+          G    +IT  GFQFLLL+  +QVWYF+L+YL+ 
Sbjct: 162 ETNSTISQD-IKDLINQAGLMKQGSEGGRSYSIITAKGFQFLLLDNASQVWYFVLEYLDW 220

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
           V  +G++LV  L F+F+L FS+ G D  T+G  D + + LQH RE GLV QRKRK+ RFY
Sbjct: 221 VRERGMNLVPILRFIFELSFSSPGKDLPTDGRDDHVLICLQHFREMGLVMQRKRKSRRFY 280

Query: 209 PTKLALNMATRGT---LKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
           PT LA+N+A R            GF++ ETNFRVYAYTDS L+ +++ LFC+++YRFPN+
Sbjct: 281 PTYLAINLANRANASVTNVTNSRGFILAETNFRVYAYTDSELQYSILSLFCKMLYRFPNV 340

Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLP---PVVSDQIRLWEGERERL 322
            V  LTR S+++A+ +GITA QI+ +++ +A P +    P   P ++DQ+RLW  ER+RL
Sbjct: 341 CVLQLTRKSIQSAVVNGITADQILHYIKANAHPDMLKNDPIIAPTLADQVRLWAMERDRL 400

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
           T  +GVLY+QF++Q DFE LR+YA++LG L W+N  KR +VVTK GH  +K++W  ++K 
Sbjct: 401 TYRDGVLYNQFLAQKDFEVLRNYAKELGALIWENSSKRYMVVTKDGHDQVKRYWKKYKKE 460

Query: 383 S 383
           S
Sbjct: 461 S 461


>gi|195999048|ref|XP_002109392.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
 gi|190587516|gb|EDV27558.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
          Length = 452

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 196/417 (47%), Positives = 270/417 (64%), Gaps = 41/417 (9%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           ELS++AKHYV+R LF  Q +P + I++W++  H KEH+ A + LTE+R            
Sbjct: 40  ELSQLAKHYVMRTLFTIQAIPTSTISAWVNSQHQKEHQAALNQLTEIRIWQSHTLPGGMQ 99

Query: 49  ---------RNAKVALLGGGKPW-----SMSAKLEVDSKARDLDFLNQYALERWECILRF 94
                     N K AL GGG+ W     S + K +  S+A DL      + ERWEC+L +
Sbjct: 100 GIKLNESFRANLKTALCGGGESWYGPQLSKADKYQFTSEALDLK-----SKERWECVLHY 154

Query: 95  MVGSQQTEGISADAVRTLFNEG-AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
           +VG+ +   +S +    L N G      IT  GFQFLLL+ P+QVWY +L+YL++ ES G
Sbjct: 155 LVGAGEDLALSGEIQSLLNNCGLVSKNGITAAGFQFLLLDRPSQVWYILLQYLDSAESLG 214

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
           +DLVE L+FLFQL +STFG +YSTEG++      LQH+RE GLV QRKRK  +FYPT+LA
Sbjct: 215 MDLVEILSFLFQLSYSTFGQNYSTEGLTQTQLTCLQHMREIGLVVQRKRKDMKFYPTQLA 274

Query: 214 LNMAT---RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
           +N+A+   +  L   +  G++IVETN+R+YAYT+S L +AL+ LFCE+MYR P L VG++
Sbjct: 275 INLASGAKQEELDHSKSSGYIIVETNYRLYAYTESPLDIALVALFCEMMYRLPGLCVGLI 334

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALP----SVSCPL-PPVVSDQIRLWEGERERLTTS 325
           TR+SV+ A  +GITA +II F+R HA P     V+ P+ P  + DQ+ LWE ER RL+ S
Sbjct: 335 TRESVQQAFTNGITANKIINFIRTHAHPEAKKKVTTPIVPSTIIDQLYLWELERGRLSCS 394

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWV-NHQK 381
           +GVLY+Q +S +DFE LR YA D+GVL W +  KR +VV + GH+ +K FW  N QK
Sbjct: 395 DGVLYNQILSSSDFEALRKYADDMGVLLWASPAKRLLVVNRDGHNHVKHFWKRNKQK 451


>gi|221120480|ref|XP_002159945.1| PREDICTED: general transcription factor IIH subunit 4-like [Hydra
           magnipapillata]
          Length = 458

 Score =  350 bits (899), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 186/411 (45%), Positives = 256/411 (62%), Gaps = 38/411 (9%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L E+AKHY++R LF  Q +  A + SW  K  AK+H EA + L  L              
Sbjct: 44  LPELAKHYIMRTLFTNQSLSDAFVLSWCKKEWAKDHIEAVNKLKGLHIWVSFEIGTPMLR 103

Query: 48  -------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                  + N ++ L GGG   +   K   +  ARD+ FL+ Y+ ERWECIL +M GSQ 
Sbjct: 104 YEFNITFQTNLRIGLCGGGPVQATPLKAVEEKHARDVAFLDNYSKERWECILYYMTGSQV 163

Query: 101 T--EGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
           T   G+S D  R L N G        +   IT  GFQFLLL+T +QVWYF++++L ++E 
Sbjct: 164 TGNAGVSQDVARVLINAGLLIFDHQEQATCITSSGFQFLLLDTSSQVWYFMVQHLNSMEP 223

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +   LV+CL+FLFQ  FS  G DY    +++    FLQ LRE GL +QRKRK+ RFYPT+
Sbjct: 224 E--ILVQCLSFLFQTSFSVLGKDYPVNDLTEAQFSFLQLLREIGLAFQRKRKSKRFYPTR 281

Query: 212 LALNMA---TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
           LA+N+    T  +    ++ GFL+VETN+RVYAYTDS L +AL+ LF ++  RFP+ TV 
Sbjct: 282 LAINLGSAVTGNSDSSSQQQGFLVVETNYRVYAYTDSVLHIALLSLFTDIKARFPSFTVA 341

Query: 269 VLTRDSVRAALRSGITAAQIIGFL--RLHALPSVSCPL-PPVVSDQIRLWEGERERLTTS 325
           +L+R+SV+ AL  GI+A QII FL  R H+  + S P+    ++DQI+LWE ER+RL  S
Sbjct: 342 LLSRESVQQALACGISAEQIIDFLKTRAHSQMTTSSPIIASTITDQIKLWEMERDRLRYS 401

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
           +GVLY+QF+SQ+DFE LR +A +   L W N +KR +VV+K GH D+KK+W
Sbjct: 402 QGVLYNQFLSQSDFEMLRKFADEKNHLLWANNQKRLMVVSKSGHEDVKKYW 452


>gi|340371925|ref|XP_003384495.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Amphimedon queenslandica]
          Length = 446

 Score =  350 bits (898), Expect = 6e-94,   Method: Compositional matrix adjust.
 Identities = 180/414 (43%), Positives = 262/414 (63%), Gaps = 35/414 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW--------------LSKTHAKE-------HKEA 40
           LSE+ + Y++R+LF+E+P+P A I SW              L+K H  E        ++ 
Sbjct: 36  LSELGQLYIMRVLFIERPLPLADINSWALPGNNLHAETLSLLNKLHIFEPVLIRTGQQQG 95

Query: 41  TDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                  + N ++ + GGG+ W++      D  +RD+ +LN YA ++WE IL FMVG   
Sbjct: 96  YQLNQSYQTNLRILICGGGQLWAVPGHRGDDKHSRDIPYLNDYADKKWENILNFMVGG-- 153

Query: 101 TEGISADAVRTLFNEG------AENP-VITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
           +E + ++ V+ L   G      + +P  IT  GFQFLL++T +Q+W FIL+YL T + +G
Sbjct: 154 SEKLDSEIVQVLQYAGLMSQASSRHPSFITTLGFQFLLMDTQSQIWQFILQYLNTAQDRG 213

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
           +DL++CL F+FQL FST G DYST+G+++    F+ HL +FGLVYQRK  + R+YPTKL 
Sbjct: 214 MDLIDCLKFIFQLSFSTLGKDYSTKGLTESQLTFMYHLCQFGLVYQRKSSSKRYYPTKLV 273

Query: 214 LNMATRGTL-KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
           + +    T+     + GF+IVETN+RV AYTDS L +A + LFC++ YRFPN+ VG +TR
Sbjct: 274 IQLTAGETIGMNTSQAGFIIVETNYRVIAYTDSKLHIATLALFCQMQYRFPNVAVGTITR 333

Query: 273 DSVRAALRSGITAAQIIGFLRLHA---LPSVSCPLPPVVSDQIRLWEGERERLTTSEGVL 329
           +S++ AL  GI A QII FL  HA   + S +  LPP V+DQI+LWE ER RL+  EG+L
Sbjct: 334 ESIQQALVHGIKADQIISFLTQHAHHNMLSKAHILPPTVTDQIKLWEIERNRLSFQEGIL 393

Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           YS+F+S  D+EK++ YA+DLGVL W N ++R +V+    H +I+ FW   QK S
Sbjct: 394 YSEFLSVTDYEKVKKYAEDLGVLLWSNRQRRLMVIHPDRHDEIRHFW-KRQKSS 446


>gi|194388768|dbj|BAG60352.1| unnamed protein product [Homo sapiens]
          Length = 386

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 170/329 (51%), Positives = 229/329 (69%), Gaps = 32/329 (9%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 53  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 112

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 113 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 172

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 173 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 231

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YPT+
Sbjct: 232 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYPTR 291

Query: 212 LALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           LA+N+++   G    + +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V  
Sbjct: 292 LAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQ 351

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALP 298
           +TR+SV+ A+ SGITA QII FLR  A P
Sbjct: 352 VTRESVQQAIASGITAQQIIHFLRTRAHP 380


>gi|405952750|gb|EKC20525.1| General transcription factor IIH subunit 4 [Crassostrea gigas]
          Length = 380

 Score =  321 bits (822), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 177/374 (47%), Positives = 226/374 (60%), Gaps = 73/374 (19%)

Query: 35  KEHKEATDTLTELR---------------------RNAKVALLGGGKPWSMSA-KLEVDS 72
           +EHK A   L+ELR                      N KVALLGGG     +   L  D 
Sbjct: 40  REHKFAAKVLSELRVWHEQQMQGGLLGWVLNGTFRSNMKVALLGGGMDSQFTGGPLPPDK 99

Query: 73  KARDLDFLNQYALERWECILRFMVGSQQ-TEGISADAVRTLFNEGAEN-------PVITK 124
            A+D+ FL++YALERWEC+L FMVGS + TEG+S D +  L N G          P IT 
Sbjct: 100 HAKDVPFLDKYALERWECVLHFMVGSTEGTEGVSKDIIDVLLNAGLMTMDGVDPMPSITP 159

Query: 125 DGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKL 184
            GFQFLL++  +QVWYF+L+YL+TVE++G+DL++CL+FLFQL FST G            
Sbjct: 160 AGFQFLLMDIGSQVWYFMLQYLDTVEARGMDLIDCLSFLFQLSFSTLG------------ 207

Query: 185 QVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT-RGTLKQI-REPGFLIVETNFRVYAY 242
                           KRK  R+YPT+LA+N+A  +     + +  G+L+VETN+RVYAY
Sbjct: 208 ----------------KRKIQRYYPTRLAINLAAGQSDFTSVGKNTGYLMVETNYRVYAY 251

Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC 302
           TDS L+VAL+ LFCE++YRFP  +VG LTR SVR AL  GITA Q             + 
Sbjct: 252 TDSPLQVALVALFCEMLYRFPTFSVGNLTRVSVRDALIRGITADQ-------------TP 298

Query: 303 PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTV 362
            +P  V+DQ+RLWE ER+R    EGVLY QF+SQ DFE LRDYA+DLGVL W N  KR +
Sbjct: 299 VIPSTVTDQVRLWELERDRFKFMEGVLYDQFLSQNDFELLRDYAKDLGVLLWDNAIKRVM 358

Query: 363 VVTKQGHSDIKKFW 376
           VVTK GH D+K++W
Sbjct: 359 VVTKGGHDDVKRYW 372


>gi|353231086|emb|CCD77504.1| putative tfiih, polypeptide [Schistosoma mansoni]
          Length = 486

 Score =  319 bits (817), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 184/453 (40%), Positives = 254/453 (56%), Gaps = 73/453 (16%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL E+AKH+V+RLLF+EQP+P+++++ W+ K  +    ++   LT+LR            
Sbjct: 37  ELPELAKHFVMRLLFIEQPIPKSIVSGWVEKGSSALLNDSCKALTDLRIWHSTDSNVSRG 96

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEV---DSKARDLDFLNQYALERWECILRFMV 96
                     + +++L GGGKP  +   L V   D  ++ +DFL  YA ERW+ IL FMV
Sbjct: 97  SWSLNKRYQESIRISLFGGGKP--LLGDLGVVTNDKYSKSVDFLKSYAAERWDAILHFMV 154

Query: 97  GSQQTE-GISADAVRTLFN----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVE 150
           GS+  E G     V  L N    EG + P+ ITK GF FLL+    QV  FIL Y + ++
Sbjct: 155 GSESAEVGSVVKDVLLLSNLMKCEGNDCPIGITKHGFHFLLMSRQFQVLVFILHYFDYLK 214

Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
               +LV  L F+FQL F      Y  E +S   Q  LQH+RE GL YQRKR A RFY T
Sbjct: 215 ENSKNLVGALQFVFQLSFLCPTKSYPVEALSTAQQEVLQHMRELGLAYQRKRTAPRFYVT 274

Query: 211 KLALNMA-------------------------------------TRGTLKQIREPGFLIV 233
            LAL+ A                                     ++ ++    + G++++
Sbjct: 275 PLALDFAGGHTTFLESKSGGWGPQSGVIPTGVSKSDPTDSNKMFSQISVSNSSDVGYILL 334

Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
           ETNFR+YAYTDS L+ AL+ LF ++  RFPNL V  +TRDSVR AL  GITA QI+ FL 
Sbjct: 335 ETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITANQILSFLT 394

Query: 294 LHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLG 350
            +A P +   +  LPP ++DQIRLWE ER+R    EG LY QF    DFE +RDYA+ +G
Sbjct: 395 ANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTDFEMVRDYAKSIG 454

Query: 351 VLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           VL W+N ++R +VV+K GH D++KFW  H++ S
Sbjct: 455 VLLWENPERRLMVVSKAGHEDVRKFW-KHKRPS 486


>gi|256086358|ref|XP_002579367.1| tfiih polypeptide [Schistosoma mansoni]
          Length = 486

 Score =  318 bits (815), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 184/453 (40%), Positives = 253/453 (55%), Gaps = 73/453 (16%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL E+AKH+V+RLLF+EQP+P+++++ W+ K       ++   LT+LR            
Sbjct: 37  ELPELAKHFVMRLLFIEQPIPKSIVSGWVEKGSIALLNDSCKALTDLRIWHSTDSNVSRG 96

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEV---DSKARDLDFLNQYALERWECILRFMV 96
                     + +++L GGGKP  +   L V   D  ++ +DFL  YA ERW+ IL FMV
Sbjct: 97  SWSLNKRYQESIRISLFGGGKP--LLGDLGVVTNDKYSKSVDFLKSYAAERWDAILHFMV 154

Query: 97  GSQQTE-GISADAVRTLFN----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVE 150
           GS+  E G     V  L N    EG + P+ ITK GF FLL+    QV  FIL Y + ++
Sbjct: 155 GSESAEVGSVVKDVLLLSNLMKCEGNDCPIGITKHGFHFLLMSRQFQVLVFILHYFDYLK 214

Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
               +LV  L F+FQL F      Y  E +S   Q  LQH+RE GL YQRKR A RFY T
Sbjct: 215 ENSKNLVGALQFVFQLSFLCPTKSYPVEALSTAQQEVLQHMRELGLAYQRKRTAPRFYVT 274

Query: 211 KLALNMA-------------------------------------TRGTLKQIREPGFLIV 233
            LAL+ A                                     ++ ++    + G++++
Sbjct: 275 PLALDFAGGHTTFLESKSGGWGPQSGVIPTGVSKSDPTDSNKMFSQISVSNSSDVGYILL 334

Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
           ETNFR+YAYTDS L+ AL+ LF ++  RFPNL V  +TRDSVR AL  GITA QI+ FL 
Sbjct: 335 ETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITANQILSFLT 394

Query: 294 LHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLG 350
            +A P +   +  LPP ++DQIRLWE ER+R    EG LY QF    DFE +RDYA+ +G
Sbjct: 395 ANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTDFEMVRDYAKSIG 454

Query: 351 VLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           VL W+N ++R +VV+K GH D++KFW  H++ S
Sbjct: 455 VLLWENPERRLMVVSKAGHEDVRKFW-KHKRPS 486


>gi|320167307|gb|EFW44206.1| transcription factor tfb2 [Capsaspora owczarzaki ATCC 30864]
          Length = 406

 Score =  317 bits (812), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 165/354 (46%), Positives = 227/354 (64%), Gaps = 20/354 (5%)

Query: 47  LRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
            R   + AL GGGK W+ +  LE D+ A+D+  L QY+  RWE +L+FM  +  TEG++ 
Sbjct: 33  FRVGLRTALSGGGKSWAGADPLEPDAHAKDISELEQYSRARWEAVLQFMAAAN-TEGVTQ 91

Query: 107 DAVRTLFN----------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
           + V  L +          EGA  PVIT  GFQFLL +   QVWYF+ +YL +++ +G D 
Sbjct: 92  EVVNVLIDAELISLSTTGEGAGRPVITNKGFQFLLQDVATQVWYFLTQYLNSLQKRGADP 151

Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
           VE L  LF+L FST G DY  EG+++     LQHLRE GLV++RKR + RFYPT LA+N+
Sbjct: 152 VEALALLFRLSFSTVGMDYPVEGLTEGQLDLLQHLREIGLVFRRKRTSRRFYPTPLAINL 211

Query: 217 ATRGTLKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
           A+ G+ K +      G+++VETNFR+YAYTDS L++AL+ LF +L YRFPN+  G+L+RD
Sbjct: 212 AS-GSAKNLDAADVKGYIVVETNFRIYAYTDSPLQLALLSLFVDLRYRFPNMVCGLLSRD 270

Query: 274 SVRAALRSGITAAQIIGFLRLHALP---SVSCPLPPVVSDQIRLWEGERERLTTSEGVLY 330
           SVR AL  G+TA Q+I FLR HA P   S +  LP  +SDQ+RLWE ER RL     VLY
Sbjct: 271 SVRKALVKGLTAEQMIRFLRTHAHPQMRSRTPVLPETISDQLRLWELERNRLRVLPAVLY 330

Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
            +F +Q + + L  Y +DLGV    + +   +VVT +GH  IK FW  H++ ++
Sbjct: 331 ERFSNQREHDLLHHYGRDLGVELAHSSQH--IVVTFEGHEQIKTFWQAHRQQAE 382


>gi|444727354|gb|ELW67853.1| Valyl-tRNA synthetase, mitochondrial [Tupaia chinensis]
          Length = 1382

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 178/407 (43%), Positives = 245/407 (60%), Gaps = 82/407 (20%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGK 60
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T             LL G +
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEEST------------GLLSGLR 95

Query: 61  PWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTLFNEGAENP 120
            W                              + + G     G+    +  +F +     
Sbjct: 96  IWHT----------------------------QLLPG-----GLQGLILNPIFRQNLRIA 122

Query: 121 VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGT------- 173
           ++     QFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE L+FLFQL FST G        
Sbjct: 123 LLEIVA-QFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKRRGERGR 181

Query: 174 ------------------------DYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
                                   DYS EGMSD L  FLQHLREFGLV+QRKRK+ R+YP
Sbjct: 182 VHTVMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRKRKSRRYYP 241

Query: 210 TKLALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
           T+LA+N+++   G      +PGF++VETN+R+YAYT+S L++ALI LF E++YRFPN+ V
Sbjct: 242 TRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVV 301

Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTT 324
             +TR+SV+ A+ SGITA QII FLR  A P +   +  LPP ++DQIRLWE ER+RL  
Sbjct: 302 AQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRF 361

Query: 325 SEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSD 371
           +EGVLY+QF+SQ DFE L  +A++LGVL ++N  KR +VVT  GHSD
Sbjct: 362 TEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKRLMVVTPAGHSD 408


>gi|313224499|emb|CBY20289.1| unnamed protein product [Oikopleura dioica]
          Length = 411

 Score =  287 bits (735), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 157/407 (38%), Positives = 237/407 (58%), Gaps = 28/407 (6%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL +IAKH +IR+LF+ Q + + +I SW++  H ++ + A + +T LR            
Sbjct: 6   ELPQIAKHIIIRILFINQQIAKPLIESWVADEHREKFEVAMEIITGLRIWENTNDGIAFN 65

Query: 49  -RNA-----KVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
             N+     + AL GGG+ W  + + L  D  A+ ++ L+ Y  ERW+ IL F+    Q 
Sbjct: 66  LNNSFRTYLQEALFGGGETWRPAVETLGADKNAKTVEQLDTYTKERWDQILSFLT---QE 122

Query: 102 EGISADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
           +G  ++ V +L           +  FQFLLL+  +QVWY +++YL  V+  GL LV  L 
Sbjct: 123 QGKLSEEVISLLKYAGLCDANGEKRFQFLLLDRSSQVWYLLVQYLGYVQKLGLSLVNVLA 182

Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT 221
           F+ QL + +FGTDY  +  ++++   +QH RE GL+++RK K  RFYPT+LA +++  G 
Sbjct: 183 FVLQLGYCSFGTDYPCDNSNNEISRVIQHFREMGLIFKRKSKEQRFYPTRLAQSISIAGG 242

Query: 222 LKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAA 278
            K   E     F++VETN+R+YAYTDS L  ALI LF E+ YRFP + V  ++RDS++ +
Sbjct: 243 KKASSEDVQEQFILVETNYRIYAYTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQS 302

Query: 279 LRSGITAAQIIGFLRLHALPSVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVS 335
              GI+A QI+ +LR  A P        +P VV D I LW  ERERL  ++G+LY QF+ 
Sbjct: 303 ADYGISAEQILNYLRSSAHPIARKNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLD 362

Query: 336 QADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
           Q  FE L+ YAQD+  L W N+++R +VV    H  IK ++   + G
Sbjct: 363 QEAFEMLKSYAQDIRALVWANDERRFMVVAPWSHDQIKSYYKQIKDG 409


>gi|313245784|emb|CBY34779.1| unnamed protein product [Oikopleura dioica]
          Length = 456

 Score =  287 bits (734), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 157/407 (38%), Positives = 237/407 (58%), Gaps = 28/407 (6%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL +IAKH +IR+LF+ Q + + +I SW++  H ++ + A + +T LR            
Sbjct: 51  ELPQIAKHIIIRILFINQQIAKPLIESWVADEHREKFEVAMEIITGLRIWENTNDGIAFN 110

Query: 49  -RNA-----KVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
             N+     + AL GGG+ W  + + L  D  A+ ++ L+ Y  ERW+ IL F+    Q 
Sbjct: 111 LNNSFRTYLQEALFGGGETWRPAVETLGADKNAKTVEQLDTYTKERWDQILSFLT---QE 167

Query: 102 EGISADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
           +G  ++ V +L           +  FQFLLL+  +QVWY +++YL  V+  GL LV  L 
Sbjct: 168 QGKLSEEVISLLKYAGLCDANGEKRFQFLLLDRSSQVWYLLVQYLGYVQKLGLSLVNVLA 227

Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT 221
           F+ QL + +FGTDY  +  ++++   +QH RE GL+++RK K  RFYPT+LA +++  G 
Sbjct: 228 FVLQLGYCSFGTDYPCDNSNNEISRVIQHFREMGLIFKRKSKEQRFYPTRLAQSISIAGG 287

Query: 222 LKQIREP---GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAA 278
            K   E     F++VETN+R+YAYTDS L  ALI LF E+ YRFP + V  ++RDS++ +
Sbjct: 288 KKASSEDVQEQFILVETNYRIYAYTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQS 347

Query: 279 LRSGITAAQIIGFLRLHALPSVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVS 335
              GI+A QI+ +LR  A P        +P VV D I LW  ERERL  ++G+LY QF+ 
Sbjct: 348 ADYGISAEQILNYLRSSAHPIARKNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLD 407

Query: 336 QADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
           Q  FE L+ YAQD+  L W N+++R +VV    H  IK ++   + G
Sbjct: 408 QEAFEMLKSYAQDIRALVWANDERRFMVVAPWSHDQIKSYYKQIKDG 454


>gi|393221241|gb|EJD06726.1| transcription factor Tfb2 [Fomitiporia mediterranea MF3/22]
          Length = 464

 Score =  284 bits (726), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 166/413 (40%), Positives = 237/413 (57%), Gaps = 40/413 (9%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L+ + +  V+ LL++E  +P A ++SW+++   K +  A  TL+ L              
Sbjct: 43  LAPMERQLVMNLLWLESSIPTATMSSWINRKGIKLYDSALSTLSRLHIIGNPSQKLNFNP 102

Query: 48  --RRNAKVALLGGGKPWSMSAKLEVDSKAR----DLDFLNQYALERWECILRFMV--GSQ 99
             + + + A+ GGG   S      VD + +     L+ L+ YA+ERWE IL +MV  G+ 
Sbjct: 103 TFKASLRQAITGGGTHRSFGVP-AVDGEEKRSSVTLEMLDAYAVERWETILHYMVSSGTG 161

Query: 100 QTEGISADAV------RTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
           Q     +  V        L +    N  IT  GFQFLL    AQ+W  +L+YL   E + 
Sbjct: 162 QMPARPSQGVLFLLERSGLMSGNGGNMKITSFGFQFLLHSPHAQLWELLLQYLHMAEERQ 221

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
           +DL+E L FLF L     G  YSTE + +  +  L+ LR++GL++Q+K  + RFYPT+LA
Sbjct: 222 MDLIEVLGFLFMLSTMELGRGYSTENLGETQKAMLEDLRDYGLIWQKKASSRRFYPTRLA 281

Query: 214 LNMA-------TRGTL-KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
             +        T G+     +  GF+I+ETN+R+YAYTD+ L+ A++ LF  L YRFPNL
Sbjct: 282 TTLTSSLPPLPTSGSGGANAQAQGFIILETNYRIYAYTDNPLQTAVLNLFVTLKYRFPNL 341

Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERERL 322
            VG LTRDSVR AL +GITA QII +L  HA P +  + PL PV V DQIRLWE ER RL
Sbjct: 342 VVGSLTRDSVRKALANGITADQIIKYLTTHAHPQMRKNDPLIPVTVQDQIRLWELERNRL 401

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            + EG LY+ F SQAD+E +  YA+ LGV+ W+N  KR    + +GH++I+ F
Sbjct: 402 KSQEGYLYTAFASQADYEFVLKYAKQLGVVLWENAAKRCFFGSLEGHANIRGF 454


>gi|302684027|ref|XP_003031694.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
 gi|300105387|gb|EFI96791.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
          Length = 476

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 170/428 (39%), Positives = 237/428 (55%), Gaps = 55/428 (12%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  + +  V+ LL++E  VP + + +W+ + H K ++ A +TL +L              
Sbjct: 42  LGPLERQIVMNLLWLESAVPVSTMGAWVVREHQKIYEAALETLGKLHIVPTSPVKLAVHP 101

Query: 48  --RRNAKVALLGGGKPWSMSAKLEVDSK--ARDLDFLNQYALERWECILRFMV--GSQQT 101
             + + + AL GGG   S     E D K  A D+  L+ YALERWE IL FMV  G+ Q 
Sbjct: 102 TFKSSLRQALTGGGTQGSFGIPAEPDGKQQAPDVGKLDGYALERWETILHFMVSSGTGQE 161

Query: 102 EGISADAVRTLFNEGA------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
               A  V  L  E            IT  GFQFLLL   AQ+W  ++ YL   E+  +D
Sbjct: 162 SAQPAAGVTYLLKESGLMESRGGTLRITSAGFQFLLLTPHAQLWELLIHYLRLQEANNID 221

Query: 156 LVECLTFLFQLKFSTFGT------------DYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
           LVE L+FLF +  + FG             +YSTE +S    V +  LR +GL++QRK+ 
Sbjct: 222 LVEVLSFLFMVSTTEFGQGNLEMLTGDLHQEYSTENLSTSQLVVMDDLRHYGLLWQRKKN 281

Query: 204 AGRFYPTKLALNMATR--------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLF 255
             RF PT+LA  + +         G     +E GF+++ETN+RVYAYTD+ L+ A++ LF
Sbjct: 282 YPRFSPTRLATTLTSSAPPLPTSSGPTSGPKE-GFIVLETNYRVYAYTDNALQTAVLNLF 340

Query: 256 CELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------VSCPLPPV- 307
               YRFPNL VG +TR+SV+ A+ +GITA QII +L  HA P        V+ PL PV 
Sbjct: 341 ITPKYRFPNLVVGSITRESVKRAMSNGITADQIISYLITHAHPQMHKNVCLVANPLLPVT 400

Query: 308 VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
           V DQIRLWE ER R  + EG LY++F SQAD+E + +YA+ LGV+ W+N  KR     ++
Sbjct: 401 VQDQIRLWELERNRTKSQEGFLYTEFASQADYEYVLNYAKQLGVVLWENHAKRCFFAAQE 460

Query: 368 GHSDIKKF 375
           GH++IK F
Sbjct: 461 GHANIKSF 468


>gi|195477962|ref|XP_002086438.1| GE22864 [Drosophila yakuba]
 gi|194186228|gb|EDW99839.1| GE22864 [Drosophila yakuba]
          Length = 241

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 136/240 (56%), Positives = 174/240 (72%), Gaps = 20/240 (8%)

Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT 221
            LFQL FST G DYS+EGM+ ++  FLQHLREFGLV+QRKRK GRFYPT+LALN+ ++  
Sbjct: 1   MLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLVFQRKRKEGRFYPTRLALNVTSKEA 60

Query: 222 L---------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
                     +  ++ G+++VETN+RVYAYTDS L+VA++GLF EL+YRFPNL VGVLTR
Sbjct: 61  AATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTR 120

Query: 273 DSVRAALRSGITAAQIIGFLRLHALPSV----------SCPLPPVVSDQIRLWEGERERL 322
           DSVR ALR GITA QI+ +L  +A P++          SC LPP V DQI+LWE ER R 
Sbjct: 121 DSVRQALRGGITAEQIVSYLEQYAHPNMRLVESAIQSKSC-LPPTVVDQIKLWELERNRF 179

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
           T +EGV+Y+QF+S  DF  LRDYAQ + +L WQNE+ RT+VV K GH D+K++W  + K 
Sbjct: 180 TYTEGVVYNQFLSHTDFVTLRDYAQSIHMLVWQNERTRTMVVQKNGHDDVKRYWKKYSKS 239


>gi|432098035|gb|ELK27922.1| General transcription factor IIH subunit 4 [Myotis davidii]
          Length = 377

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 147/331 (44%), Positives = 205/331 (61%), Gaps = 67/331 (20%)

Query: 49  RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISAD 107
           +N + ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FM GS         
Sbjct: 104 QNLRTALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMAGSPSAA----- 158

Query: 108 AVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLK 167
                                FLLL+TPAQ+WYF+L++L+T +S+G+DLVE L+FLFQL 
Sbjct: 159 ---------------------FLLLDTPAQLWYFMLQHLQTAQSRGMDLVEILSFLFQLS 197

Query: 168 FSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT--RGTLKQI 225
           FS+ G DYS EGMSD         REFGLV+QRKRK+ R+YPT+LA+++++   G     
Sbjct: 198 FSSLGKDYSVEGMSDS-------RREFGLVFQRKRKSRRYYPTRLAISLSSGVSGAGGTA 250

Query: 226 REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITA 285
            +PGF++VETN+++YAYT+S L++ALI LF E++YRFPN+ V                  
Sbjct: 251 HQPGFIVVETNYQLYAYTESELQIALIALFSEMLYRFPNMVV------------------ 292

Query: 286 AQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
                        + +  LPP ++DQIRLWE ER+RL  +EGVLY+QF+SQ DF+ L   
Sbjct: 293 -------------AQTPVLPPTITDQIRLWELERDRLLFTEGVLYNQFLSQVDFDLLLAR 339

Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
           A +LG L ++N  K+ +VVT  GH ++K+FW
Sbjct: 340 AWELGALVFENMAKQLMVVTPAGHREVKRFW 370


>gi|336370852|gb|EGN99192.1| hypothetical protein SERLA73DRAFT_90464 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336383609|gb|EGO24758.1| hypothetical protein SERLADRAFT_361686 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 469

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 163/418 (38%), Positives = 235/418 (56%), Gaps = 40/418 (9%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL----RRNAKVAL----- 55
           + +  ++ LL++E  +P A +++W+ +   K +  A  TL  L    + N K+AL     
Sbjct: 52  LERQIIMNLLWLESAIPIATMSAWVIREGKKLYDAALTTLANLHILPQSNVKLALNPTFK 111

Query: 56  -------LGGGKPWSMSAKLEVDSK--ARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
                   GGG   S     E D K     ++ L+ YALERWE IL +MV S Q    + 
Sbjct: 112 ASFRQAITGGGNSSSFGVPAEKDDKRSTNTVETLDAYALERWETILHYMVSSGQGSLPTK 171

Query: 107 DAVRTLF----------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
            +   L+          N G+    IT  GFQFLL     Q+W  +L+YL   E + +DL
Sbjct: 172 PSQGVLYLLQRSGLMTQNHGSAL-QITSAGFQFLLHTPHDQLWDLLLQYLHMAEERQMDL 230

Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
           VE L+FLF L     G +YSTEG+S+  +  L+ LR++GLV+QRK  + RF PT+LA  +
Sbjct: 231 VEVLSFLFMLSTMDLGREYSTEGLSETQKAMLEDLRDYGLVWQRKATSKRFSPTRLATTL 290

Query: 217 ATR--------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
            +         GT    +  GF+++ETN+R+YAYTD  L+ A++ LF  + YRFPNL VG
Sbjct: 291 TSSCPPLPTSTGTSGGPQSQGFIVLETNYRIYAYTDKPLQTAVLNLFITMKYRFPNLVVG 350

Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERERLTTS 325
           +LTR+SV+ AL +GI+A QII +L  HA P +  + PL PV V DQIRLWE ER RL + 
Sbjct: 351 MLTRESVKKALSNGISAEQIISYLTTHAHPQMRKNNPLLPVTVQDQIRLWELERNRLKSE 410

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           EG LY  F S AD+E + +YA+ L V+ W+N  +R    +  GH++I+ F      G+
Sbjct: 411 EGYLYKDFGSHADYEYVLNYAKQLDVVLWENTSRRCFFGSLDGHTNIRGFIERRTNGA 468


>gi|328766914|gb|EGF76966.1| hypothetical protein BATDEDRAFT_92242 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 471

 Score =  277 bits (708), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 157/419 (37%), Positives = 226/419 (53%), Gaps = 45/419 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L ++AKH ++RLL+    V  + I +W +  ++ +   A   L +L              
Sbjct: 42  LPDLAKHTILRLLYTSMTVKISDIETWETHEYSGQFSHAVTGLAKLHILSIKNEGSHKSV 101

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQQ 100
                 +++   AL+G G   S     E   K R  + FL  YA E WE +L ++VG+  
Sbjct: 102 HINSVFQKSIHNALVGSGSHTSFGMPCETIDKHRPSIAFLETYAKECWESVLHYLVGTPS 161

Query: 101 TEGISADAVRTLFNEGAENPV------------ITKDGFQFLLLETPAQVWYFILKYLET 148
            +   A  V+ L   G   P+            IT  GFQFLL +   Q+W F+L+YLE 
Sbjct: 162 DKRPKA-IVKLLEKSGLMAPISSTNAAHNGDLRITSKGFQFLLQDVNVQIWAFLLQYLEM 220

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            E   ++LVE L F FQL     G DYS E ++   +  L  L+  GL+YQRK+K+ RFY
Sbjct: 221 AEQLNMELVEVLNFFFQLGSLELGQDYSVEVLTPTQKHMLDDLKHLGLIYQRKKKSTRFY 280

Query: 209 PTKLALNMATRGTLKQIREP--------GFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
           PT LA ++ +  +   +  P        GF+I+ETN+RVYAYTDS L++A++ LF E+  
Sbjct: 281 PTHLATSLTSGASAGAVLSPRSLEADSDGFIIIETNYRVYAYTDSPLQIAVLSLFIEMRV 340

Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEG 317
           RF N+ +G++TRDSVR AL  GI+A QII +L  HA P +   S  LP  + DQ+RLWE 
Sbjct: 341 RFANMVIGIITRDSVREALAKGISAEQIIAYLTTHAHPEMKKGSPVLPTTIVDQVRLWEM 400

Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
           ER RL  S G LY  F  + ++ ++  YA DLG   W ++ KR VVV+ +GH  IK F+
Sbjct: 401 ERNRLRISRGHLYQMFSGEQEYREILKYATDLGYELWHSDSKRLVVVSAEGHEHIKVFF 459


>gi|213402795|ref|XP_002172170.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces japonicus yFS275]
 gi|212000217|gb|EEB05877.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces japonicus yFS275]
          Length = 444

 Score =  276 bits (707), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 165/406 (40%), Positives = 228/406 (56%), Gaps = 34/406 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
           L  IA+ YV+ +LF   PV       W   +      E+ + L                +
Sbjct: 36  LPIIARQYVMAMLFNPSPVALNDFDLWTKLSSKVYQTESFNKLVRMHIFQVDGQNVILNS 95

Query: 46  ELRRNAKVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGI 104
           E R+    AL GGG   S     ++ D +  D++FL++YA + WE IL FMVG+   + I
Sbjct: 96  EFRQQFITALTGGGTHNSFGVPCIDEDRQRVDIEFLDKYATDTWETILHFMVGTS-VKKI 154

Query: 105 SADAVRTLFNEGA-----ENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVE 158
             + V +L   G      +N V IT  GFQFLL +   Q+W  +L+YL+  E   +D V+
Sbjct: 155 PGEGVLSLLRRGGLMTGTKNQVRITHSGFQFLLQDINTQIWTLLLEYLKLSEDTHMDPVQ 214

Query: 159 CLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT 218
            L FLF L     G DYS   ++D  Q+ L+ LRE+GLVYQ+K  + RFYPT+LA ++ T
Sbjct: 215 VLHFLFMLGSLELGRDYSVHFLTDTQQIMLEDLREYGLVYQKKSSSRRFYPTRLATSLTT 274

Query: 219 ------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
                 +G      + GF+IVETN+R+YAYTDS L++A++GLF  L  RF NL VGV+TR
Sbjct: 275 EYHSPVKGAGSDAGK-GFIIVETNYRLYAYTDSPLQIAILGLFTNLRARFSNLIVGVITR 333

Query: 273 DSVRAALRSGITAAQIIGFLRLHALPSV--SCP-LPPVVSDQIRLWEGERERLTTSEGVL 329
           DS+R AL+SGITA QII +L  HA P +    P LPP + DQI LWE ER RL  + G+L
Sbjct: 334 DSIRRALQSGITAEQIITYLTTHAHPQMRKEIPLLPPTLVDQIYLWELERNRLRATPGIL 393

Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           +  F++  DFEK   YA++LGVL W +  KR   +T  G   +  F
Sbjct: 394 FRDFLTDKDFEKAVQYAKELGVLVWDSSLKRMFFITNAGAQPMINF 439


>gi|388857860|emb|CCF48525.1| probable TFB2-TFIIH subunit (transcription/repair factor) [Ustilago
           hordei]
          Length = 492

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 158/416 (37%), Positives = 230/416 (55%), Gaps = 42/416 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +A+  ++ +LF++ P+      +W+ K   K+   A D L+ L              
Sbjct: 71  LPMMARQLIMHMLFLDVPLAADDFIAWIKKEARKDFDAAVDKLSRLSIVQLKSTGGKQIL 130

Query: 49  -------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQQ 100
                     + AL GGGK  S     +  D  A D+ FL+QYA  +WE IL +MVGS+ 
Sbjct: 131 LLNAVFTDGMRRALTGGGKHRSFGVPCDTEDKNAVDVAFLDQYARNKWETILHYMVGSEN 190

Query: 101 TEGISADAVRTLFNEGAENP-----------VITKDGFQFLLLETPAQVWYFILKYLETV 149
           +       +  L       P            IT  GFQFLL +   Q+W  +L+YL+  
Sbjct: 191 SSTPREPVLYLLRRSNLMQPRPSSGSGSGGLNITSRGFQFLLEDVNTQLWDLLLQYLDMA 250

Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
           E + +DLVE L FLF L     G DYSTE + +     L+  R++GLVYQRK  + RFYP
Sbjct: 251 EERNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLEDFRDYGLVYQRKASSRRFYP 310

Query: 210 TKLALNMATRGTLKQI-------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
           T+LA  + +   +  +        E G++I+ETN+R+YAYT + L+VA++ LF  +  RF
Sbjct: 311 TRLATTLTSSAAIPLLSSNGAEQEERGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARF 370

Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGER 319
           PNL VG +TRDSV++AL +GITA QII +L  HA   +  + PL PV VSDQIRLWE E+
Sbjct: 371 PNLVVGSITRDSVKSALANGITAEQIITYLTHHAHVQMHRNDPLLPVTVSDQIRLWEREK 430

Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            R+  + G L++ F SQ DFE++R+YA+ LGVL W++E KR   V + G+  ++ +
Sbjct: 431 NRVVQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVWEDESKRRFFVDEAGNEPVRDY 486


>gi|71024427|ref|XP_762443.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
 gi|46097692|gb|EAK82925.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
          Length = 491

 Score =  275 bits (702), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 159/415 (38%), Positives = 229/415 (55%), Gaps = 41/415 (9%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +A+  ++ +LF++ P+      +W+ K    E   A D L+ L              
Sbjct: 71  LPMMARQLIMHMLFLDVPLAADDFLAWIKKEVKNEFDAAVDKLSRLSIIQLKASAGKQML 130

Query: 49  -------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQQ 100
                     + AL GGGK  S     +  D  A D+ FL+QYA  +WE IL +MVGS  
Sbjct: 131 LLNAGFTEGMRRALTGGGKHRSFGVPCDTEDKNAVDVAFLDQYARTKWETILHYMVGSDN 190

Query: 101 TEGISADAVRTLFNEGAENP----------VITKDGFQFLLLETPAQVWYFILKYLETVE 150
           +       +  L       P           IT  GFQFLL +   Q+W  +L+YL+  E
Sbjct: 191 SSTPREPVLYLLRRSNLMQPRTASSSNGGLNITSRGFQFLLEDVNTQLWDLLLQYLDMAE 250

Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
            + +DLVE L FLF L     G DYSTE + +     L+  R++GLVYQRK  + RFYPT
Sbjct: 251 ERNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLEDFRDYGLVYQRKASSRRFYPT 310

Query: 211 KLALNMATRGTLKQI-------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
           +LA  + +   +  +        E G++I+ETN+R+YAYT + L+VA++ LF  +  RFP
Sbjct: 311 RLATTLTSSAAVPLLSSNGTEQEERGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARFP 370

Query: 264 NLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERE 320
           NL VG +TRDSV++AL +GITA QII +L  HA   +  + PL PV VSDQIRLWE E+ 
Sbjct: 371 NLVVGSITRDSVKSALANGITAEQIITYLTHHAHLQMHRNDPLLPVTVSDQIRLWEREKN 430

Query: 321 RLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           R+  + G L++ F SQ DFE++R+YA+ LGVL WQ+E+KR   V + G+  ++ +
Sbjct: 431 RVQQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVWQDEEKRRFFVDEAGNEPVRDY 485


>gi|390594544|gb|EIN03954.1| transcription factor Tfb2 [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 467

 Score =  274 bits (700), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 162/416 (38%), Positives = 234/416 (56%), Gaps = 47/416 (11%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKT-------HA------KEHKEATDTLTELR 48
           L  + +  V+ LL++E  VP   +  W+          HA      + H  + +T   L+
Sbjct: 48  LRPLDQQLVMNLLWLESAVPAQTMQQWIVPGGNNRKLYHASLEILSRLHILSGNTQVRLQ 107

Query: 49  RNA------KVALLGG---------GKPWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
            N+      + A+ GG         G P    AK  +D      D L+ +ALERWE IL 
Sbjct: 108 LNSTWKASLRAAITGGTVATSSGSFGIPAERGAKDSMDK-----DGLDAFALERWETILH 162

Query: 94  FMVGSQQTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
           FMV S         +   LF         G  NP IT  GFQFLL E  AQ+W  +L+YL
Sbjct: 163 FMVSSGTAHNPPRPSAGVLFLLQRSGLMGGGGNPQITSAGFQFLLHEPHAQLWELLLQYL 222

Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGR 206
              E + +DLVE L+F+F L  +  G +YSTE +SD  +  L+ LR++GL++Q+K  + R
Sbjct: 223 RMAEERQMDLVEVLSFIFMLSTTELGREYSTENLSDTQKAMLEDLRDYGLIWQQKPTSKR 282

Query: 207 FYPTKLALNMATRG----TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
           F PT+LA  + +      T       GF+++ETN+R+YAYTD+ L+ A++ LF  L  RF
Sbjct: 283 FSPTRLATTLTSSSPPLPTSGSGVSEGFIVLETNYRLYAYTDNPLQTAVLALFTTLRSRF 342

Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGER 319
           PNL VG +TR+SV+ AL+SGI+A QII +L  HA P +  + PL PV V DQIRLWE E+
Sbjct: 343 PNLVVGQITRESVKRALQSGISAEQIISYLSTHAHPQMRKNNPLIPVTVQDQIRLWELEK 402

Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            RL + EG LY++F SQ D+E + +YA++L V+ W+N  KR    + +GH++I+ F
Sbjct: 403 NRLKSQEGFLYTEFASQGDYEYVLNYARELNVVLWENPVKRCFFGSMEGHANIRGF 458


>gi|343428936|emb|CBQ72481.1| probable TFB2-TFIIH subunit (transcription/repair factor)
           [Sporisorium reilianum SRZ2]
          Length = 493

 Score =  271 bits (694), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 157/417 (37%), Positives = 229/417 (54%), Gaps = 43/417 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +A+  ++ +LF++ P+      +W+ K   K+   A D L+ L              
Sbjct: 71  LPMMARQLIMHMLFLDVPLAADDFMAWIKKEARKDFDAAVDKLSRLSIVQLKTTAAKQIL 130

Query: 49  -------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQQ 100
                     + AL GGGK  S     +  D  A D+ FL++YA  +WE IL +MVGS  
Sbjct: 131 LLNAVFTDGMRRALTGGGKHRSFGVPCDTEDKNAVDVAFLDEYARTKWETILHYMVGSDN 190

Query: 101 TEGISADAVRTLFNEGAENP------------VITKDGFQFLLLETPAQVWYFILKYLET 148
           +       +  L       P             IT  GFQFLL +   Q+W  +L+YL+ 
Sbjct: 191 SSTPREPVLYLLRRSNLMQPRSTSTSSSSGGLNITSRGFQFLLEDVNTQLWDLLLQYLDM 250

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            E + +DLVE L FLF L     G DYSTE + +     L+  R++GLVYQRK  + RFY
Sbjct: 251 AEERNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLEDFRDYGLVYQRKASSRRFY 310

Query: 209 PTKLALNMATRGTLKQI-------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYR 261
           PT+LA  + +   +  +        E G++I+ETN+R+YAYT + L+VA++ LF  +  R
Sbjct: 311 PTRLATTLTSSAAVPLLSSNGTEQEERGYIILETNYRLYAYTSNRLRVAVLSLFVTIKAR 370

Query: 262 FPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGE 318
           FPNL VG +TRDSV++AL +GITA QII +L  HA   +  + PL PV VSDQIRLWE E
Sbjct: 371 FPNLVVGSITRDSVKSALANGITAEQIITYLTHHAHVQMHRNDPLLPVTVSDQIRLWERE 430

Query: 319 RERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           + R+  + G L++ F SQ DFE++R+YA+ LGVL W++E KR   V + G+  ++ +
Sbjct: 431 KNRVQQNLGSLFTDFTSQFDFEEVRNYAKQLGVLVWEDEGKRRFFVDEAGNVPVRDY 487


>gi|392594388|gb|EIW83712.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
          Length = 486

 Score =  268 bits (686), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 161/428 (37%), Positives = 239/428 (55%), Gaps = 54/428 (12%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR----RNAKVAL-- 55
           L+ +++  V+ LL+++ PV    +A+W+     K + EA +TLT L       AK+AL  
Sbjct: 50  LAPLSRQLVMNLLWLDSPVLAGTMAAWVVPDGKKLYDEALNTLTRLHILQSSGAKLALNP 109

Query: 56  ------------LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ--- 100
                        G    + + A+ +    A  ++ L+ YA++RWE IL +MV S Q   
Sbjct: 110 TFKASMRQAITCSGTTGSFGVPAQRDDRRDAPSVETLDGYAVQRWETILHYMVSSGQGQY 169

Query: 101 ----TEGISADAVRT--LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
               T+G+     R+  + +       IT  GFQFLL     Q+W  +L+YL  VE + +
Sbjct: 170 PTKPTQGVLYLLQRSGLMASYHGSTLQITSSGFQFLLYSPHDQLWDLLLQYLHMVEERQM 229

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           DLVE L+F F L     G +YSTE +S+  +  L+ LR++GLV+QRK  + RF PT+LA 
Sbjct: 230 DLVEVLSFFFMLSTMELGREYSTEPLSETQKAMLEDLRDYGLVWQRKASSKRFSPTRLAT 289

Query: 215 NMATRG----TLKQIREPG--------------------FLIVETNFRVYAYTDSNLKVA 250
            + +      T    R PG                    F+++ETN+RVYAYTD+ L++A
Sbjct: 290 TLTSVSPSLPTAGGSRNPGSAPGASSVANSTGINTNDHGFIVLETNYRVYAYTDNPLQIA 349

Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV- 307
           ++ LF  L YRFPNL VG+LTR+SVR AL +GI+A Q+I +L  HA P +  + PL PV 
Sbjct: 350 VLNLFITLKYRFPNLVVGMLTRESVRRALGNGISAEQVISYLTAHAHPQMRKNNPLLPVT 409

Query: 308 VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
           V DQ+RLWE ER RL + +G LY  F +QAD+E +  YA++L V+ W+N  KR    T  
Sbjct: 410 VQDQVRLWELERNRLKSDDGFLYKDFATQADYEYVLAYAKELDVVLWENTSKRCFFGTLD 469

Query: 368 GHSDIKKF 375
           GH++++ F
Sbjct: 470 GHTNVRGF 477


>gi|170120252|ref|XP_001891159.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164633568|gb|EDQ98188.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 475

 Score =  267 bits (683), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 160/417 (38%), Positives = 239/417 (57%), Gaps = 43/417 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  I++  ++ LL++E  +P + + +W+++     + EA  TLT+L              
Sbjct: 51  LDPISRQLIMNLLWLESAIPASTMQAWVTREGKPVYTEALATLTKLHIFPNSAIKLAVVP 110

Query: 48  --RRNAKVALLGGGKPWSMSAKLEVDSKARDLD--FLNQYALERWECILRFMVGSQQTEG 103
             + + + A+ GGG   S     E D K  D+D  FL+ +ALERWE IL +MV S   + 
Sbjct: 111 TFKTSFRQAITGGGTSGSFGVPSEKDDKRSDVDIEFLDVFALERWETILHYMVSSGSGQN 170

Query: 104 ISADAVRTLF--------NEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
            +  +VR LF            + P+ IT  GFQFLL     Q+W  +L+YL   E + +
Sbjct: 171 PTKPSVRVLFLLQRSGLMTTVGQGPLQITSAGFQFLLHSPHEQLWELLLQYLHLTEERQM 230

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           DLV+ L FLF L     G +YSTE +S      L+ L ++GL+++RK  +  F+PT+LA 
Sbjct: 231 DLVDVLGFLFMLSTMELGREYSTERLSKTQTAMLEDLVDYGLIWRRKPSSKGFHPTRLAT 290

Query: 215 NMATRGTL-------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
            + +           +   + GF+++ETN+R+YAYTD+ L+ A++ LF  L YRFPNL V
Sbjct: 291 TLTSSSPPLPSSIGNRSGPQEGFIVLETNYRIYAYTDNPLQTAILNLFVSLKYRFPNLVV 350

Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------SC-PLPPV-VSDQIRLWEGE 318
           G +TRDSV+ AL +GI+A QII +L  HA P +       SC PL PV V DQIRLWE E
Sbjct: 351 GAITRDSVKKALLNGISADQIISYLMSHAHPQMRKNVRLRSCNPLLPVTVQDQIRLWELE 410

Query: 319 RERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           + RL + EG LY+ F SQAD++ + +YA++LGV+ W+N  KR    +  GH++I+ F
Sbjct: 411 KNRLKSQEGYLYTSFASQADYDFVLNYAKELGVVLWENPVKRCFFGSLDGHANIRGF 467


>gi|392564310|gb|EIW57488.1| transcription factor Tfb2 [Trametes versicolor FP-101664 SS1]
          Length = 477

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 157/414 (37%), Positives = 236/414 (57%), Gaps = 42/414 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  + +  V+ LL+++ P+P + +A+W+++   K +++A  TL  L              
Sbjct: 56  LRPLERQLVMNLLWLDLPIPPSTMAAWVTREGKKPYQDALSTLARLHILPPSTSKHSLNP 115

Query: 48  --RRNAKVALLGGGKPWSMSAKLEVDSK--ARDLDFLNQYALERWECILRFMVGSQQTEG 103
             +   + A+ GGG   S    +E D K    ++D L+ YALERWE IL +MV S   E 
Sbjct: 116 TFKSGLRQAITGGGTSGSFGVLIENDDKNPPPEIDALDGYALERWETILYYMVSSGTGEF 175

Query: 104 ISADAVRTLF-----------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
            S  +   L+           + GA    IT  GFQFLL     Q+W  +L+YL+  E +
Sbjct: 176 PSQPSKGVLYLLERSGLMARVHSGALQ--ITSGGFQFLLHPPHVQLWELLLQYLQMAEER 233

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
            +DLVE ++FL  L  +  G +YSTE +S   +  L  LR++GL+ QR   + RF PT+L
Sbjct: 234 QMDLVEVISFLLMLSTTELGKNYSTENLSPTQKTMLDDLRDYGLIKQRTPTSRRFSPTRL 293

Query: 213 ALNMATR--------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
           A  + +         G+       GF+++ETN+R+YAYTD+ L++A++ LF  L  RFPN
Sbjct: 294 ATTLTSSSPPLPTSAGSGDGSHAQGFIVLETNYRLYAYTDNPLQIAVLNLFVTLKSRFPN 353

Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERER 321
           L +G +TRDSV+ AL SGITA QII +L  HA P +  + PL PV V DQIRLWE E+ R
Sbjct: 354 LVIGAITRDSVKKALASGITADQIISYLVTHAHPQMRKNQPLLPVTVQDQIRLWELEKNR 413

Query: 322 LTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           + + EG LY+ F SQAD+E + +YA+ L V+ W+N  +R    + +GH++I+ F
Sbjct: 414 MKSQEGYLYTAFASQADYEYVLNYAKQLDVVLWENASRRCFFGSVEGHANIRGF 467


>gi|19113355|ref|NP_596563.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces pombe 972h-]
 gi|62901133|sp|P87303.2|TFB2_SCHPO RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|3080538|emb|CAA18666.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces pombe]
          Length = 447

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 157/398 (39%), Positives = 222/398 (55%), Gaps = 31/398 (7%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
           L  +A+ YV+ +LF   PV  +    W+  +      E+ + L                +
Sbjct: 36  LPILARQYVMSMLFNPMPVALSDFDLWIKLSSKVYQSESFNKLVRMHIFQFDGQYITLNS 95

Query: 46  ELRRNAKVALLGGGKPWSMSAKL-EVDSKARDLDFLNQYALERWECILRFMVGSQQT--- 101
           E R+    AL GGG   S      + D     +DFL+ YA E WE IL FMVG+ +    
Sbjct: 96  EFRKQFITALTGGGNHNSFGVPCTDEDKHLVTVDFLDAYAKETWETILHFMVGTPEAKFP 155

Query: 102 -EGISADAVRTLFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
            EG+ +   R     G +N + IT+ GFQFLL +  AQ+W  +L YL+  E   +D V+ 
Sbjct: 156 GEGVLSLLKRGGLMSGPKNQLRITRAGFQFLLQDINAQIWTLLLDYLKLSEDTHMDPVQV 215

Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
           L FLF L     G  YS + ++D  Q+ L+ LRE+GLVYQRK  + RFYPT+LA  + T 
Sbjct: 216 LHFLFMLGSLDLGRAYSVDFLTDTQQIMLEDLREYGLVYQRKITSKRFYPTRLATGLTTD 275

Query: 220 ------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
                    +   + GF+IVETN+R+YAYT S L++A+IGLF  L  RF NL VGV+TRD
Sbjct: 276 YRSLHGKQSENDDDKGFIIVETNYRLYAYTSSPLQIAIIGLFANLRARFSNLVVGVITRD 335

Query: 274 SVRAALRSGITAAQIIGFLRLHALPSV--SCP-LPPVVSDQIRLWEGERERLTTSEGVLY 330
           S+R AL +GI A QII +L  HA P +  + P LPP + DQI LWE E+ RL  + G+L+
Sbjct: 336 SIRRALMNGIAAEQIITYLTTHAHPQMRSNVPLLPPTLVDQIYLWELEKNRLRATPGILF 395

Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
             F++ + F++  +YA++LGVL W +  KR   +T  G
Sbjct: 396 RDFLTDSGFDQAVEYAKELGVLVWDSSLKRMFFITTTG 433


>gi|443894209|dbj|GAC71558.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB2 [Pseudozyma antarctica
           T-34]
          Length = 491

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 153/414 (36%), Positives = 225/414 (54%), Gaps = 40/414 (9%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +A+  ++ +LF+  P+      +WL K    E   A   L  L              
Sbjct: 72  LPMLARQLIMHMLFLHAPLAADDFLAWLRKDVRNEFDVAVAKLARLSIVQLKPASAKQIL 131

Query: 49  -------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQQ 100
                     + AL GGG   S     +  D  A D+ FL+ YA  +WE IL +MVGS +
Sbjct: 132 LLNAVFTEGMRRALTGGGSHRSFGVPCDTEDKNAVDIAFLDDYARTKWETILHYMVGSDK 191

Query: 101 TEGIS---------ADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
           +             ++ +++       +  IT  GFQFLL +   Q+W+ +L+YL+  E 
Sbjct: 192 SSTPREPVLYLLRRSNLMQSRSAAATASLNITSRGFQFLLEDVNTQLWHLLLQYLDMAEE 251

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           + +DLVE L FLF L     G DYSTE + +     L+  R++GLVYQRK  + RFYPT+
Sbjct: 252 RNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLEDFRDYGLVYQRKASSRRFYPTR 311

Query: 212 LALNMATRGTL-------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
           LA  + +            +  E G++I+ETN+R+YAYT + L+VA++ LF  +  RFPN
Sbjct: 312 LATTLTSSAAAPLLSTNGSEPEERGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARFPN 371

Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERER 321
           L VG +TRDSV++AL +GITA QII +L  HA   +  + PL PV VSDQIRLWE E+ R
Sbjct: 372 LLVGSITRDSVKSALANGITAEQIITYLTHHAHLQMHRNDPLLPVTVSDQIRLWEREKNR 431

Query: 322 LTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           +  + G L++ F SQ DF ++R+YA  LGVL W +E KR + V + G+  ++ +
Sbjct: 432 VQQNLGSLFTDFTSQPDFAEVRNYASQLGVLVWHDEPKRRLFVDEAGNEPVRDY 485


>gi|393246879|gb|EJD54387.1| transcription factor Tfb2 [Auricularia delicata TFB-10046 SS5]
          Length = 480

 Score =  260 bits (665), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 157/427 (36%), Positives = 233/427 (54%), Gaps = 57/427 (13%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           LS + +  V+ LL+++  +  A +  W+++   + +K +   L +L              
Sbjct: 45  LSPVERQIVMTLLWLDNNIENATLTPWVTRDGLEIYKASVMELCKLHIVPATSSTPNLLP 104

Query: 48  ---------RRNAKVALLGGG------KPWSMSAKLEVDSKARDLDFLNQYALERWECIL 92
                    + N ++AL+GGG      KP   SA+L   +    +  L+ YA+ERWE IL
Sbjct: 105 TTLDLNGVFKANLRLALVGGGDHNSFGKPVKRSAELPPVT----VSALDAYAVERWETIL 160

Query: 93  RFMV--GSQQTEGISADAVRTLFNEGA----------ENPVITKDGFQFLLLETPAQVWY 140
            FMV  G+ Q+    + AV  L  +             +  IT  GFQFLL    AQ+W 
Sbjct: 161 HFMVSSGTDQSPATPSGAVCNLLRKSGLMIRLDPRSDSSMKITSRGFQFLLSSPHAQLWE 220

Query: 141 FILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQR 200
            +L YLE  E +GL L+E ++FLF L     G +YST+ ++      L  L ++GL+YQR
Sbjct: 221 LLLHYLELAEERGLGLMEVVSFLFMLSTMELGQEYSTDNLTKDQATVLGELLDYGLIYQR 280

Query: 201 KRK--AGRFYPTKLALNMATR-------GTLKQIREPGFLIVETNFRVYAYTDSNLKVAL 251
                + RF+PT+LA  + +          +      GF+I+ETN+R+YAYTD+ L++A+
Sbjct: 281 ALPGMSKRFFPTRLATTLMSSLPELPRTAGVASATSGGFIILETNYRLYAYTDNPLQIAV 340

Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-V 308
           + LF     RFPNL VG++TRDSV+ AL +GITA QII +L  HA P +  + PL PV V
Sbjct: 341 LNLFVSFKSRFPNLVVGMVTRDSVKKALANGITAEQIITYLSAHAHPQMRKNNPLLPVTV 400

Query: 309 SDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
            DQ+RLWE E+ R+   EG LY  F SQADFE + DYA+ LGV+ W+N ++R       G
Sbjct: 401 QDQVRLWELEKNRVKAEEGFLYMDFTSQADFELVLDYARKLGVVIWENGRQRMFFGKADG 460

Query: 369 HSDIKKF 375
           H++I+ F
Sbjct: 461 HNNIRTF 467


>gi|389749775|gb|EIM90946.1| transcription factor Tfb2 [Stereum hirsutum FP-91666 SS1]
          Length = 494

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 166/425 (39%), Positives = 234/425 (55%), Gaps = 51/425 (12%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-RNAKVALLGGGK 60
           L+ + +  V+ LL++E  +P   ++SW+ K + K   EA   LT L       A LG   
Sbjct: 49  LAPVERQLVMNLLWLESAIPTNTMSSWVVKENKKFFDEALAQLTRLHILPNPSAKLGLNP 108

Query: 61  PWSMSAK-----------LEVDSKARD------LDFLNQYALERWECILRFMVGS----- 98
            +  S +             V ++A D      ++ L+ YALERWE IL FMV S     
Sbjct: 109 TFKTSLRHAITGGGTSGSFGVPAEADDKRGPPSVEILDTYALERWETILHFMVSSGTGQM 168

Query: 99  --QQTEGISADAVRTLFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
             + ++G+     R+      +  + IT  GFQFLL    AQ+W  +L+YL  VE + +D
Sbjct: 169 PTKPSQGVLYLLQRSKLMAPTQGTLQITSSGFQFLLHTPHAQLWELLLQYLAMVEERQMD 228

Query: 156 LVECLTFLFQLKFSTFGT-----DYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
           LVE L+FLF L     G      DYS E +SD  +  L+ LR++GL++QRK  + RF PT
Sbjct: 229 LVEVLSFLFMLSTMELGRVRQFHDYSAESLSDTQKAMLEDLRDYGLIWQRKATSRRFSPT 288

Query: 211 KLALNMATR----------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
           +LA  + +           GT    +  GF+I+ETN+R+YAYTD+ L+ A++ LF  +  
Sbjct: 289 RLATTLTSSLPPLPTASGTGTSGTGQTQGFIILETNYRLYAYTDNPLQTAVLNLFVTMKS 348

Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVS---------CPLPPV-VSD 310
           RFPNL VG +TRDSVR AL +GITA QII +L  +A P +           PL PV V D
Sbjct: 349 RFPNLVVGSITRDSVRKALSNGITADQIISYLTTYAHPQMRKNVSRCRSFTPLLPVTVQD 408

Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHS 370
           QIRLWE ER RL + EG LY+ F SQAD+E +  YA+ L V+ W+N  KR    T +GH+
Sbjct: 409 QIRLWELERNRLKSQEGFLYTAFASQADYEYVLTYAKQLDVVLWENPTKRCFFGTMEGHA 468

Query: 371 DIKKF 375
           +I+ F
Sbjct: 469 NIRGF 473


>gi|409077631|gb|EKM77996.1| hypothetical protein AGABI1DRAFT_114851 [Agaricus bisporus var.
           burnettii JB137-S8]
 gi|426198975|gb|EKV48900.1| hypothetical protein AGABI2DRAFT_191077 [Agaricus bisporus var.
           bisporus H97]
          Length = 467

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 161/410 (39%), Positives = 234/410 (57%), Gaps = 36/410 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L+ + +  V+ LL++E  +  + + +W+ +   K ++EA   L+ L              
Sbjct: 48  LAPLERQIVMNLLWLESAIATSTMVAWVQRDGRKLYEEALGNLSRLHILPNSAAKLALNF 107

Query: 48  --RRNAKVALLGGGKPWSMSAKLEVDSKA-RDLDFLNQYALERWECILRFMVGS---QQT 101
             + + + AL GGG   S     E D  +   +  L+ +ALERWE IL +MV S   QQ 
Sbjct: 108 TFKTSLRQALTGGGTSGSFGVPAEKDEHSGPSIKTLDSFALERWETILHYMVSSGTGQQP 167

Query: 102 EGISADAV-----RTLFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
              S   +      +L +     P+ IT  GFQFLL    AQ+W  +L+YL   + + +D
Sbjct: 168 AKPSQGVLFLLQRSSLMSSFHGGPLQITSSGFQFLLHSPHAQLWDLLLQYLHLAQERQMD 227

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LVE L FLF L     G +Y TE +S    V L+ LR++GL++Q K K+ RF PT+L   
Sbjct: 228 LVEVLGFLFMLSTMELGREYLTENLSTTQGVLLEDLRDYGLIWQSKPKSRRFSPTRLGTT 287

Query: 216 MATRG-----TLKQIREP--GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
           + +       T+     P  GF+++ETN+R+YAYTD+ L+ A++ LF  L YRFPNL VG
Sbjct: 288 LTSSSPPLPTTISASSGPLDGFIVLETNYRIYAYTDNPLQTAVLNLFVSLKYRFPNLVVG 347

Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERERLTTS 325
           ++TRDSVR AL +GITA QII +L  HA P +  + PL PV V DQIRLWE E+ RL +S
Sbjct: 348 MITRDSVRRALMNGITADQIISYLITHAHPQMRKNNPLLPVTVQDQIRLWELEKNRLKSS 407

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           EG LY+ F SQAD++ + +YA+ L V+ W+N  KR    + +GH  IK+F
Sbjct: 408 EGYLYTAFTSQADYDLVLNYAKQLDVVLWENAAKRCFFGSLEGHLRIKEF 457


>gi|32487401|emb|CAE05735.1| OSJNBb0017I01.15 [Oryza sativa Japonica Group]
          Length = 451

 Score =  257 bits (656), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 151/413 (36%), Positives = 232/413 (56%), Gaps = 45/413 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK Y +++L+V  PV  A +  W+   +A +H+ A D L +LR             
Sbjct: 40  LPPLAKKYALQMLYVSAPVAAAAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVS 99

Query: 49  --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
                    N +  L+ GG       P S++A+L        L  L  +ALE+WEC L  
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPIPSSVTARLPT------LAELESFALEQWECFLLQ 153

Query: 95  MVGSQQTE---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
           ++ S Q E     S+  +RT     L +   E P +T++GFQFLL+ET AQ+WY + +Y+
Sbjct: 154 LINSSQVERGTSFSSSMMRTFQRGLLSSRDGEAPRLTENGFQFLLMETNAQLWYIMREYI 213

Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAG 205
            + E +G+D  E ++FL +L F T G  YS   ++D  +  ++ L E GLV  Q+ RK  
Sbjct: 214 SSAEERGVDPTELISFLLELSFHTLGEAYSLNTLTDVQRNAIRDLAELGLVKLQQGRKDS 273

Query: 206 RFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
            F PTKLA N++   +     + GF++VETNFR+YAY+ S L   ++ LF  + Y+ PNL
Sbjct: 274 WFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNL 333

Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERL 322
            VG +T++S+  A  +GITA QII FL+ +A P V+  +P V   V+DQIRLWE +R R+
Sbjct: 334 IVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRV 393

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             +   LY  F S+  F++  DYA+D G L W++ KK  ++V  + HS++++F
Sbjct: 394 DMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLIVRVEFHSEMREF 446


>gi|409048320|gb|EKM57798.1| hypothetical protein PHACADRAFT_171015 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 472

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 150/413 (36%), Positives = 226/413 (54%), Gaps = 39/413 (9%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
           L  + +  V+ LL++E  +P + +A+W+     K ++ +   L+ L    + A+     P
Sbjct: 51  LGPLERQIVMNLLWLESSIPASTMATWVPHEVRKTYENSLSILSRLHILPQSAVKLALNP 110

Query: 62  ------------------WSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEG 103
                             + + A  +    +  ++ L+ YALERWE IL +MV S     
Sbjct: 111 TFKTSLRHAIIGGGSSGSFGVPATRDEKHSSPSIEVLDAYALERWETILHYMVSSGIATR 170

Query: 104 ISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLV 157
            S   +  L   G    +      IT  GFQFLL    AQ+W  +L+YL  VE + +DLV
Sbjct: 171 PSQGVLFLLQRSGLMASIHGGSLQITSLGFQFLLHSPHAQLWELLLQYLHMVEERQMDLV 230

Query: 158 ECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM- 216
           E L+FLF L     G +YS E +S      L+ LR++G+++QR+  + RF PT+LA  + 
Sbjct: 231 EVLSFLFMLSTMELGREYSVEHLSPTQTAMLEDLRDYGIIWQRRATSKRFCPTRLATTLT 290

Query: 217 -------ATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
                  A  G        GF+I+ETN+R+YAYTD+ L++A++ LF  L  R+PNL VG 
Sbjct: 291 SSSPPLPAAGGVSASAHGQGFIILETNYRLYAYTDNPLQIAVLNLFVTLKSRYPNLVVGA 350

Query: 270 LTRDSVRAALRSGITA----AQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERERL 322
           +TR+SV+ AL +GITA     QII +L  HA P +  + PL PV V DQIRLWE E+ R+
Sbjct: 351 ITRESVKKALTNGITADQASRQIISYLTTHAHPQMRKNKPLLPVTVQDQIRLWELEKNRM 410

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            + EG LY+ F SQAD+E + +YA+ LGV+ W++  +R    T +GH +I+ F
Sbjct: 411 KSQEGYLYTAFASQADYEYVLNYAKQLGVVLWESSGRRCFFGTLEGHPNIRGF 463


>gi|395332344|gb|EJF64723.1| transcription factor Tfb2 [Dichomitus squalens LYAD-421 SS1]
          Length = 506

 Score =  253 bits (647), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 157/441 (35%), Positives = 229/441 (51%), Gaps = 68/441 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
           L  + +  V+ LL++E P+P A +A+W+++   + ++EA  TL  L    + +    G P
Sbjct: 58  LQPLERQIVMNLLWLETPIPGATMAAWVTREGKRPYQEALSTLQSLHILPQQST-RPGPP 116

Query: 62  WSMSAKLEVDSKAR---------------------------DLDFLNQYALERWECILRF 94
             +   L    KA                            D++ L+ +ALERWE IL +
Sbjct: 117 QPVKHNLNPTFKASLRQAITGGGTSGSFGILAENDDKHPPPDIEALDAFALERWETILYY 176

Query: 95  MV--GSQQTEGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKY 145
           MV  G+ Q     +  V  L                IT  GFQFLL     Q+W  +L+Y
Sbjct: 177 MVSSGTGQFPAKPSQGVLYLLQRSGLMATVHGATLQITSAGFQFLLHPPHEQLWELLLQY 236

Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGT--------------------DYSTEGMSDKLQ 185
           L+  E + +DLVE L+FL  L  +  G                      YSTE +S   +
Sbjct: 237 LQMAEERQMDLVEVLSFLLMLSTTELGKVGVHTIQFIAQVTQAIPHAQSYSTENLSATQK 296

Query: 186 VFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR--------GTLKQIREPGFLIVETNF 237
             L+ LR++GL+ QR   + +F PT+LA  + +         GT       GF+++ETN+
Sbjct: 297 AMLEDLRDYGLIRQRTATSRKFSPTRLATTLTSSSPPLPTSAGTGDGSHVQGFIVLETNY 356

Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
           R+YAYTD+ L++A++ LF  L YRFPNL VG +TRDSV+ AL +GITA QII +L  HA 
Sbjct: 357 RLYAYTDNPLQIAVLNLFVTLKYRFPNLVVGQITRDSVKKALANGITADQIISYLITHAH 416

Query: 298 PSV--SCPLPPV-VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
           P +  + PL PV V DQIRLWE ER R+ + EG LY+ F S AD+E + DYA+ LGV+ W
Sbjct: 417 PQMRKNKPLLPVTVQDQIRLWELERNRVKSQEGYLYTAFASHADYEWVLDYAKKLGVVLW 476

Query: 355 QNEKKRTVVVTKQGHSDIKKF 375
           +N  KR    + +GH +I+ +
Sbjct: 477 ENPSKRCFFGSVEGHMNIRGY 497


>gi|358333989|dbj|GAA34589.2| transcription initiation factor TFIIH subunit 4 [Clonorchis
           sinensis]
          Length = 456

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 160/408 (39%), Positives = 215/408 (52%), Gaps = 84/408 (20%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR----------RN 50
           EL E+AKH+V RLLFVEQP+P+++++ W+ K      ++A   L++LR          R 
Sbjct: 37  ELPELAKHFVARLLFVEQPIPKSIVSGWVEKGSQALLQDACRALSDLRIWHSVDGNVARG 96

Query: 51  A-----------KVALLGGGKPWSMS-AKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
           A           +++L GGGKP         VD  ++D+DFL  YA ERW+ +L FMVGS
Sbjct: 97  AWCLSKKYQESLRISLFGGGKPLLGDLGSTAVDKYSKDVDFLETYAAERWDALLHFMVGS 156

Query: 99  QQTE-GISADAVRTLFN--EGA--ENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           + +E G     V  L N  +G   + PV ITK GF FLL+  P QV  F+L Y + ++S 
Sbjct: 157 ESSEVGSVVQDVLLLSNLMKGGTPDTPVGITKHGFHFLLMSRPDQVRLFLLHYFDYLKS- 215

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
                                 Y  + ++   Q  LQ +RE GL YQRKR A RFY T L
Sbjct: 216 ----------------------YPVDALTAPQQEVLQQMRELGLAYQRKRTAPRFYVTPL 253

Query: 213 ALNMA------------------------------TRGTLKQIREPGFLIVETNFRVYAY 242
           A  ++                               + T     + G++++ETNFR+YAY
Sbjct: 254 ATVLSGSRCHQPAMSSGSVLSAIPTGVSHLEGTDSVQPTASSTSDVGYILLETNFRLYAY 313

Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-- 300
           TDS L+ AL+ LF ++  RFPNL V  +TRDSVR AL  GITA QII FL  +A P +  
Sbjct: 314 TDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITADQIISFLTTNAHPDMLR 373

Query: 301 SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQ 347
             P LPP + DQIRLWE ER+R    EG LY QF   ADFE +RD+A+
Sbjct: 374 EPPILPPTLVDQIRLWELERDRFVFQEGCLYEQFSKSADFEMVRDFAK 421


>gi|218195833|gb|EEC78260.1| hypothetical protein OsI_17944 [Oryza sativa Indica Group]
          Length = 459

 Score =  252 bits (643), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 150/421 (35%), Positives = 232/421 (55%), Gaps = 53/421 (12%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK Y +++L+V  PV  A +  W+   +A +H+ A D L +LR             
Sbjct: 40  LPPLAKKYALQMLYVSAPVAAAAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVS 99

Query: 49  --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
                    N +  L+ GG       P S++A+L        L  L  +ALE+WEC L  
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPIPSSVTARLPT------LAELESFALEQWECFLLQ 153

Query: 95  MVGSQQTE---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
           ++ S Q E     S+  +RT     L +   E P +T++GFQFLL+ET AQ+WY + +Y+
Sbjct: 154 LINSSQVERGTSFSSSMMRTFQRGLLSSRDGEAPRLTENGFQFLLMETNAQLWYIMREYI 213

Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK----- 201
            + E +G+D  E ++FL +L F T G  YS   ++D  +  ++ L E GLV  ++     
Sbjct: 214 SSAEERGVDPTELISFLLELSFHTLGEAYSLNTLTDVQRNAIRDLAELGLVKLQQMSRVK 273

Query: 202 ----RKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCE 257
               RK   F PTKLA N++   +     + GF++VETNFR+YAY+ S L   ++ LF  
Sbjct: 274 PWQGRKDSWFIPTKLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSRLHCEILRLFSR 333

Query: 258 LMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRL 314
           + Y+ PNL VG +T++S+  A  +GITA QII FL+ +A P V+  +P V   V+DQIRL
Sbjct: 334 VEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRL 393

Query: 315 WEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKK 374
           WE +R R+  +   LY  F S+  F++  DYA+D G L W++ KK  ++V  + HS++++
Sbjct: 394 WETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLIVRVEFHSEMRE 453

Query: 375 F 375
           F
Sbjct: 454 F 454


>gi|242074798|ref|XP_002447335.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
 gi|241938518|gb|EES11663.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
          Length = 451

 Score =  250 bits (639), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 145/413 (35%), Positives = 229/413 (55%), Gaps = 45/413 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK Y +++L+V  PV  A +  W+   +A +HK A D L +LR             
Sbjct: 40  LPPLAKKYALQMLYVLAPVTAAAMEEWVLNEYAAKHKVAIDKLLQLRVFVEVRDRRRDVS 99

Query: 49  --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
                    N +  L+ GG       P S++ +L   +       L  YAL++WEC L  
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPLPLSVTGRLPTPAD------LEAYALDQWECFLLQ 153

Query: 95  MVGSQQTE---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
           ++ S Q E     S+  ++T     L +   E   +T++GFQFLL+ET AQ+WY + +Y+
Sbjct: 154 LINSSQVEKGTSFSSSMMKTFQRGLLSSRDGEASKLTENGFQFLLMETNAQLWYIMREYI 213

Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGR 206
            + E +G+D  E ++FL +L F   G  YS   ++D  ++ ++ L E GLV Q++ +  R
Sbjct: 214 SSAEERGVDPTELISFLLELSFHKLGAAYSLNTLTDVQRIAIRDLAELGLVKQQQGRTDR 273

Query: 207 -FYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
            F PT+LA N++   +     + GF++VETNFR+YAY+ S L   ++ LF  + Y+ PNL
Sbjct: 274 WFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSKLHCEILRLFSRVEYQLPNL 333

Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERL 322
            VG +T++S+  A  +GITA QII FLR +A P V+  +P V   V+DQIRLWE +R R+
Sbjct: 334 IVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIPTVPENVTDQIRLWETDRNRV 393

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
                 LY  F S+  FE+  DYA+D G L W++ K+  ++V  + H ++++F
Sbjct: 394 EMIPSHLYEDFPSKEWFEQCCDYARDNGYLLWEDSKRMRLIVRGEFHPEMREF 446


>gi|357166758|ref|XP_003580836.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Brachypodium distachyon]
          Length = 451

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 143/404 (35%), Positives = 225/404 (55%), Gaps = 33/404 (8%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
           +AK Y +++L+V  P+P A +  W+   +A +HK A D L +LR                
Sbjct: 43  LAKKYALQMLYVSAPMPAAAMEEWVLDEYASKHKVAIDRLLQLRVFVEVRDRRKEVSYKM 102

Query: 49  -----RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE- 102
                 N +  L+ GG          V  +   L  L  YALE+WEC L  ++ S Q E 
Sbjct: 103 NDKFQGNMQKYLVDGGSLPREPISSSVTGRLPTLTELENYALEQWECFLLQLINSSQVER 162

Query: 103 --GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
               S+  ++T     L +   + P ++++GFQFLL+ET AQ+WY + +Y+ + E +G+D
Sbjct: 163 GTSFSSSMMKTFQRGLLSSRDGDAPKLSENGFQFLLMETNAQLWYIMREYISSAEERGVD 222

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLAL 214
             E ++FL +L F T G  YS   ++D  ++ ++ L E GLV  Q+ RK   F PTKLA 
Sbjct: 223 PTELISFLLELSFHTLGAAYSFNTLTDVQRIAIRDLAELGLVKVQQGRKDSWFIPTKLAT 282

Query: 215 NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
           N+++  +     + G ++VETNFR+YAY+ S L   ++ LF  + Y+ PNL VG +T++S
Sbjct: 283 NLSSSLSDSSASKEGIVVVETNFRLYAYSASRLHCEILRLFSRVEYQLPNLIVGAITKES 342

Query: 275 VRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLYS 331
           +  A  +GITA QII FL+ +A P V+  +P V   V+DQIRLWE +R R+      +Y 
Sbjct: 343 LYGAFDNGITAEQIISFLKQNAHPRVADKIPVVPENVTDQIRLWETDRNRVDMVLSHVYE 402

Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            F S+  FE+  D A+D G L W++ KK  ++V  + H ++++F
Sbjct: 403 DFPSKDMFEQCCDLARDNGFLLWEDSKKMRLIVRVEFHQEMREF 446


>gi|326435722|gb|EGD81292.1| hypothetical protein PTSG_11329 [Salpingoeca sp. ATCC 50818]
          Length = 545

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 139/396 (35%), Positives = 229/396 (57%), Gaps = 40/396 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  ++K  V+RLL++   + +A++++W+    A+   E+ + L +L+             
Sbjct: 42  LDGLSKQMVLRLLYLPSALSKAIVSTWIVPEQARLWTESLEQLQKLQILAVSGDGAVAVA 101

Query: 49  ---RNAKVALL-GGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGI 104
              +N  +A L G GK    +   +    A   DFL QYA E+WE +L +MVGS   + +
Sbjct: 102 QPFKNGLIAALQGTGKERCQAP--DPGKHAASKDFLQQYADEKWEDVLDYMVGS---DSL 156

Query: 105 SADAVR--------TLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
             D V+          + E  ++  I+  GFQFLL +   Q+W++++K+L+T +++G+D+
Sbjct: 157 EEDTVKQALYQSHLMTWREDVQDFDISSLGFQFLLQDRRTQLWFYVIKFLQTCQNEGIDI 216

Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
            + L F+F++ ++TFG  Y TE ++    + L+H  + GL Y RK+K+ ++YPT+LAL +
Sbjct: 217 EDALGFVFRVAYTTFGNYYPTEVLNKTEHLVLRHFSKIGLCYHRKKKSKQYYPTRLALAL 276

Query: 217 ATRGTL--KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
            T   L  K     GFL++ETNFR+YAYT S+LK++L+ LF + +Y+ P + V  +TR S
Sbjct: 277 RTSSALVAKDAMRCGFLVIETNFRIYAYTKSDLKISLLKLFAKPLYKLPTMLVASITRSS 336

Query: 275 VRAALRSGITAAQIIGFLRLHALP-------SVSCPLPPVVSDQIRLWEGERERLTTSEG 327
           VR A+  GITA QI+ FL++H          SV  P+PP VSDQI LWE E  RL+ +EG
Sbjct: 337 VRQAMAQGITARQILHFLKVHCSSRMRHQPHSVFSPVPPTVSDQILLWEQETSRLSKTEG 396

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVV 363
           VL+ + + +  F KL  + ++     W +E KR ++
Sbjct: 397 VLF-RHLPKPIFPKLLAFVKERQACIWYDETKRAII 431


>gi|402224480|gb|EJU04542.1| transcription factor Tfb2 [Dacryopinax sp. DJM-731 SS1]
          Length = 466

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 145/370 (39%), Positives = 214/370 (57%), Gaps = 29/370 (7%)

Query: 34  AKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDL---DFLNQYALERWEC 90
           AKE       L   +   + AL GGG+  S        ++ +++   + L+ YA ERWE 
Sbjct: 91  AKEANGRILLLPAFQSGLRRALTGGGEHRSFGVPCP-GARGKEVMATEQLDDYASERWES 149

Query: 91  ILRFMVGSQQTEG-ISADAVRTLF---NEGAENPV---------ITKDGFQFLLLETPAQ 137
           IL FMV S  + G +   ++  +F     G   P+         IT  GFQFLL ++  Q
Sbjct: 150 ILHFMVSSGSSAGRVPPPSIAVIFLLRRSGLMVPIGAERHPESRITSKGFQFLLEDSHTQ 209

Query: 138 VWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV 197
           +W  +L+YL   E +G DLVE + FLF L     G +YSTE +S+   V LQ   ++GL+
Sbjct: 210 LWELLLQYLAMSEDQGRDLVEVIGFLFMLGSMQLGQEYSTENLSETQDVMLQDFLDYGLI 269

Query: 198 YQRK-RKAGRFYPTKLALNMATRGTL--------KQIREPGFLIVETNFRVYAYTDSNLK 248
           Y+R      RFYPT+LA  + +  +L        K     GF+I+ETN+RVYAYT++ L+
Sbjct: 270 YRRNPDDHTRFYPTRLATTLTSTSSLAFTSSKHEKAASSEGFIILETNYRVYAYTENPLQ 329

Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPP 306
           +A++ LF  L  RF NL +G LTR+S++AAL +GITA QII +L +HA P +  + P+ P
Sbjct: 330 IAVLNLFVALHSRFENLVIGRLTRESIKAALANGITADQIISYLTVHAHPMMHKNNPVLP 389

Query: 307 V-VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
           V V DQIRLW+ E+ RL +  G LY  F SQ DF+ + +YA+ L V+ W+N +KR + V 
Sbjct: 390 VTVQDQIRLWQLEKNRLKSENGYLYEDFNSQGDFDLVLNYAKQLDVVLWENREKRKMFVR 449

Query: 366 KQGHSDIKKF 375
           + GH ++++F
Sbjct: 450 EDGHENVREF 459


>gi|299750117|ref|XP_001836551.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
           cinerea okayama7#130]
 gi|298408751|gb|EAU85259.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
           cinerea okayama7#130]
          Length = 403

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 195/314 (62%), Gaps = 18/314 (5%)

Query: 80  LNQYALERWECILRFMV--GSQQTEGISADAVR------TLFNEGAENPVITKDGFQFLL 131
           L+ YALERWE IL +MV  GS Q     +  VR       L NE  +   I+  GFQFLL
Sbjct: 81  LDAYALERWETILHYMVSSGSGQRPTEPSPGVRFLLKTSGLMNEIHDALHISSAGFQFLL 140

Query: 132 LETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHL 191
                Q+W+ +L+YL+  E + +DLV+ L+F F L     G +YS + ++      L+ L
Sbjct: 141 HSPHEQLWHLLLQYLQLAEQRRMDLVDVLSFFFMLSTMELGREYSVQNLTKTQSAMLEDL 200

Query: 192 REFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-------EPGFLIVETNFRVYAYTD 244
           R++GL++QRK+ + RF PT+L+  + +               + GF+I+ETN+R+YAYTD
Sbjct: 201 RDYGLIWQRKQTSKRFSPTRLSTTLTSSSPPLPSTSGASSGPQEGFIILETNYRIYAYTD 260

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SC 302
           + L+ A++ LF  L YRFPNL VG +TR+SV+ AL +GI+A QII +L  HA P++  + 
Sbjct: 261 NPLQTAVLSLFASLKYRFPNLVVGSITRESVKKALLNGISADQIISYLITHAHPNMRKNN 320

Query: 303 PLPPV-VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
           PL PV V DQIRLWE E+ RL + EG LY+ F SQAD+E +  YA++L V+ W+N  KR 
Sbjct: 321 PLLPVTVQDQIRLWELEKNRLKSREGFLYTAFASQADYELVLQYAKELDVVLWENASKRC 380

Query: 362 VVVTKQGHSDIKKF 375
              + +GH++IK F
Sbjct: 381 FFGSLEGHANIKGF 394


>gi|414584751|tpg|DAA35322.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
          Length = 451

 Score =  248 bits (632), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 145/413 (35%), Positives = 228/413 (55%), Gaps = 45/413 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK Y +++L+V  PV  A +  W+   +A +HK A D L +LR             
Sbjct: 40  LPPLAKKYALQMLYVLAPVTAAAMEEWVLDEYAAKHKVAIDKLLQLRVFVEVRDRRRDVS 99

Query: 49  --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
                    N +  L+ GG       P S++ +L   +       L  YAL++WEC L  
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPLPLSVTGRLPTPAD------LEAYALDQWECFLLQ 153

Query: 95  MVGSQQTE---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
           ++ S Q E     S+  ++T     L +   E   +T++GFQFLL+ET AQ+WY + +Y+
Sbjct: 154 LINSSQVEKGSSFSSSMMKTFQRGLLSSRDGEASKLTENGFQFLLMETNAQLWYIMREYI 213

Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK-RKAG 205
            + E +G+D  E ++FL +L F   G  YS   ++D  ++ ++ L E GLV Q++ RK  
Sbjct: 214 SSAEERGVDPTELISFLLELSFHKLGAAYSLNTLTDVQRIAIRDLAELGLVKQQQGRKDS 273

Query: 206 RFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
            F PT+LA N++   +     + GF++VETNFR+YAY+ S L   ++ LF  + Y+ PNL
Sbjct: 274 WFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSKLHCEILRLFARVEYQLPNL 333

Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERL 322
            VG +T++S+  A  +GITA QII FLR +A P V+  +P V   V+DQIRLWE +  R+
Sbjct: 334 IVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIPAVPENVTDQIRLWETDLNRV 393

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
                 LY  F S+  FE+  DYA+D G L W++ K+  ++V  + H ++++F
Sbjct: 394 EMIPSHLYEDFPSKEWFEQCCDYARDNGYLLWEDPKRMRLIVRGEFHPEMREF 446


>gi|449540295|gb|EMD31288.1| hypothetical protein CERSUDRAFT_119839 [Ceriporiopsis subvermispora
           B]
          Length = 412

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 148/403 (36%), Positives = 223/403 (55%), Gaps = 38/403 (9%)

Query: 11  IRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR----RNAKVALLGGGKP----- 61
           + LL++E  +P + +A+W+++     +  +  TL  L        K+AL    K      
Sbjct: 1   MNLLWLESAIPVSTMAAWVTREGRTTYHASLSTLARLHILPTSATKLALNPTFKSSLRQA 60

Query: 62  ---------WSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQTEGISADAVR 110
                    + + A  +  +    ++ L+ YALERWE IL +MV  G+ Q     +DAV 
Sbjct: 61  ITGGGSSGSFGVPAPYDEINPGPSIETLDSYALERWETILYYMVSSGNGQYPTKPSDAVL 120

Query: 111 TLFN-EGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFL 163
            L    G    V      IT  GFQFLL     Q+W  +L+YL   E + +DLV+ L FL
Sbjct: 121 YLLKRSGLMTSVRGAALQITSSGFQFLLHPPHEQLWELLLQYLHLTEERQMDLVDVLGFL 180

Query: 164 FQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-- 221
             L     G +YST+G+S   +  L+ LR++GL++QR   + RF PT+LA  +       
Sbjct: 181 LMLSTMELGREYSTDGLSPTQKAMLEDLRDYGLLWQRSATSQRFSPTRLATTLTASSNPL 240

Query: 222 ------LKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
                     +  GF+++ETN+RVYAYTD+ L++A++ LF  +  RFPNL +G +TR+SV
Sbjct: 241 PTSSSASADSQSQGFIVLETNYRVYAYTDNPLQIAVLNLFVTMKSRFPNLVIGAVTRESV 300

Query: 276 RAALRSGITAAQIIGFLRLHALPSVS--CPLPPV-VSDQIRLWEGERERLTTSEGVLYSQ 332
           + AL +GITA QII +L  HA P +    PL PV V DQIRLWE E+ R+ + EG LY+ 
Sbjct: 301 KKALANGITADQIISYLTAHAHPQMRRYKPLLPVTVQDQIRLWELEKNRVKSQEGYLYTA 360

Query: 333 FVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           F SQAD+E + +YA+ L V+ W++  +R    + +GHS+I+ F
Sbjct: 361 FASQADYEYVLNYAKQLDVVIWESPTRRCFFGSLEGHSNIRGF 403


>gi|149031822|gb|EDL86757.1| rCG41794 [Rattus norvegicus]
          Length = 209

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 118/209 (56%), Positives = 159/209 (76%), Gaps = 5/209 (2%)

Query: 180 MSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT--RGTLKQIREPGFLIVETNF 237
           MSD L  FLQHLREFGLV+QRKRK+ R+YPT+LA+N+++   G    + +PGF++VETN+
Sbjct: 1   MSDSLLNFLQHLREFGLVFQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNY 60

Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
           R+YAYT+S L++ALI LF E++YRFPN+ V  +TR+SV+ A+ SGITA QII FLR  A 
Sbjct: 61  RLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAH 120

Query: 298 PSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
           P +   +  LPP ++DQIRLWE ER+RL  +EGVLY+QF+SQ DFE L  +A++LGVL +
Sbjct: 121 PVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVF 180

Query: 355 QNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           +N  KR +VVT  GHSD+K+FW   +  S
Sbjct: 181 ENSAKRLMVVTPAGHSDVKRFWKRQKHSS 209


>gi|451847848|gb|EMD61155.1| hypothetical protein COCSADRAFT_97372 [Cochliobolus sativus ND90Pr]
          Length = 482

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 152/424 (35%), Positives = 227/424 (53%), Gaps = 62/424 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK  V+ +L++  P P A + +W   T  KE + AT TL  L              
Sbjct: 39  LPHLAKSIVMAMLYMPTPFPAADLDTWFKPTARKEKERATFTLDRLHIITSARQDNGTLS 98

Query: 48  -------RRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                  +R+ + A+ G G  + + + A  E   K   ++FL++Y+  +WE IL ++V  
Sbjct: 99  WTLNPGFQRSLRNAIEGSGTHRSFGVPATKEESGKRVSIEFLDEYSRSQWEGILYYLVSG 158

Query: 99  QQTEGISADAV---------RTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
               G+S D++         + L + G        +P ITKDGF F+L ET AQVW  ++
Sbjct: 159 --AAGLSKDSISRAEVGPGTKKLLHTGDLVRTIHGSPRITKDGFSFVLQETNAQVWSLLI 216

Query: 144 KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
            YL+     G+   E L+FLF L     G DYST  +S      L+ L   GLVY+ +R 
Sbjct: 217 VYLKMTNELGMSETEVLSFLFMLGSLELGQDYSTSTLSATQLQMLEDLSAMGLVYRSERN 276

Query: 204 AGRFYPTKLALNM------ATRGTLKQIREP-------------GFLIVETNFRVYAYTD 244
           A  FYPT+LA  +      A   + K I +              GF+I+ETN+R+YAYT+
Sbjct: 277 ARTFYPTRLATTLTSDSGSAMSASSKDIAQASTSTTGPPTAANKGFIIIETNYRLYAYTN 336

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SC 302
           S +++A++ LF +L +RFPNL  G LT++SV  A+++GIT+AQII +L  +A P +  + 
Sbjct: 337 SLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITSAQIISYLTTYAHPQMQKTV 396

Query: 303 P-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
           P +PP V DQIRLWE E ER+ T+ G L  +F S A++  + +YA  LGVL WQN+  R 
Sbjct: 397 PYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAEYRDVLNYASALGVLVWQNDANRC 456

Query: 362 VVVT 365
             V+
Sbjct: 457 FFVS 460


>gi|326510735|dbj|BAJ91715.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 451

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 143/404 (35%), Positives = 226/404 (55%), Gaps = 33/404 (8%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
           +AK YVI++L+V  P+P A +  W+   +A +HK A D L +LR                
Sbjct: 43  LAKKYVIQMLYVSAPMPAAAMQEWVLDEYASKHKVAIDRLLQLRVFVEVRDRRKEVSYKM 102

Query: 49  -----RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEG 103
                 N +  L+ GG          V ++   L  L  YALE+WEC L  ++ S Q E 
Sbjct: 103 NNKFQANMQKYLVSGGCLPREPLPFNVTARLPTLVELENYALEQWECFLLQLINSSQVEK 162

Query: 104 ---ISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
               S+  ++T     L +   E   ++++GFQFLL+ET AQ+WY + +Y+ + E +G+D
Sbjct: 163 GTTFSSSMMKTFQRGLLSSRDGEAAKLSENGFQFLLMETNAQLWYIMREYISSAEERGVD 222

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLAL 214
             + ++FL +L F T G  YS   +++  ++ +  L E GLV  Q+ RK   F PTKLA 
Sbjct: 223 PTDLISFLLELSFHTQGAAYSLSTLTEVQRIAVMDLMELGLVKLQQGRKDSWFIPTKLAT 282

Query: 215 NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
           N+++  +     + G ++VETNFR+YAY+ S L   ++ LF  + Y+ PNL VG +T++S
Sbjct: 283 NLSSSLSDSAASKEGIVVVETNFRLYAYSASKLHCEILRLFSRVEYQLPNLIVGAITKES 342

Query: 275 VRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLYS 331
           +  A  +GITA QII FL+ +A P V   +P V   V+DQIRLWE +R R+      +Y 
Sbjct: 343 LYGAFDNGITAEQIISFLQQNAHPRVIDKIPIVPENVTDQIRLWENDRNRVEMILSHVYE 402

Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            F S+  FE+  D+A+D G L W++ KK  ++V+ + H ++++F
Sbjct: 403 DFPSKDMFEQCCDHARDNGYLLWEDAKKMRLIVSGEFHQEMREF 446


>gi|403417620|emb|CCM04320.1| predicted protein [Fibroporia radiculosa]
          Length = 472

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 148/412 (35%), Positives = 230/412 (55%), Gaps = 38/412 (9%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  + +  V+ LL++E  VP + + +W+++     + ++  TL  L              
Sbjct: 52  LQPLQRQLVMNLLWLETAVPVSTMFAWVTREGRSVYSDSLSTLARLHILPSSPVKLALNP 111

Query: 48  --RRNAKVALLGGGKPWSMSAKLEVDSK--ARDLDFLNQYALERWECILRFMVGSQQTEG 103
             + + + A+ GGG   S    ++ D +  A +++ L+ YALERWE IL +MV S     
Sbjct: 112 TFKASLRQAVTGGGSSRSFGVPVDPDRQYPAPNINTLDAYALERWETILHYMVSSGTGHY 171

Query: 104 ISADAVRTLF---------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
            +  +   LF         N       IT  GFQFLL    AQ+W  +++YL  VE + +
Sbjct: 172 PTRPSQEVLFLLQRSGLMTNAHGGALQITSSGFQFLLHSPHAQLWELLVQYLYMVEERQM 231

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           DLV+ L+FL  L     G +YSTE ++      L  LR +GLV+Q+   + RF P++LA 
Sbjct: 232 DLVDVLSFLLMLSTMELGREYSTENLTATQTAMLTDLRNYGLVWQQSPSSRRFSPSRLAT 291

Query: 215 NMATRGTLKQIRE--------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT 266
            + +  +     +         GF+++ETN+R+YAYTD+ L++A++ LF  L  RFPNL 
Sbjct: 292 TLTSSSSPLPTSQNSGTGPHGEGFIVLETNYRIYAYTDNPLQIAVLNLFVTLKSRFPNLV 351

Query: 267 VGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWEGERERLT 323
           +G +TR+SV+ AL +GITA QII +L  HA P +  + PL PV V DQIRLWE ER R+ 
Sbjct: 352 IGAITRESVKKALANGITADQIISYLTAHAHPQMHKNKPLLPVTVQDQIRLWELERNRVK 411

Query: 324 TSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           + EG LY+ F SQAD+E + +YA+ L V+ W+N  +R+   +  GH++I+ F
Sbjct: 412 SEEGYLYTSFASQADYEYVLNYAKQLDVVLWENPSRRSFFGSLDGHANIRGF 463


>gi|50557260|ref|XP_506038.1| YALI0F30173p [Yarrowia lipolytica]
 gi|62901293|sp|Q6BZX4.1|TFB2_YARLI RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|49651908|emb|CAG78851.1| YALI0F30173p [Yarrowia lipolytica CLIB122]
          Length = 467

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 137/355 (38%), Positives = 205/355 (57%), Gaps = 27/355 (7%)

Query: 48  RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQQTEGISA 106
           ++N + AL+GG +  S     + + K + D+ FL+ +A+ +WE IL FMVG+      S 
Sbjct: 100 KKNLRAALVGGDQNISFGVPCDTEDKHKVDVAFLDAHAVSQWEMILHFMVGTSIGRTPSD 159

Query: 107 DAVRTLFNEGAENPV-----ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
             +  L + G   P      IT  GFQFLL +  AQ+W  +L+YL   E   +D V+ L 
Sbjct: 160 GVLNLLKHSGLMEPERGGLRITNAGFQFLLQDVNAQIWTLLLQYLNMSEYLQMDPVDVLN 219

Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA-------- 213
           F+F L     G DYS   +S+  +  L+ LR++G+VYQRK  + RFYPT+LA        
Sbjct: 220 FIFMLGSLELGQDYSLSALSETQKHMLEDLRDYGIVYQRKASSRRFYPTRLATTLTSETA 279

Query: 214 --------LNMATRGTLKQ--IREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
                   +  AT+ T+      + GF+I+ETNFR+YAYT+S L++A++ LF  L  RF 
Sbjct: 280 ALRTASQSMEAATQDTISSSVAADSGFIILETNFRLYAYTESPLQIAVLNLFVNLKTRFA 339

Query: 264 NLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP---SVSCPLPPVVSDQIRLWEGERE 320
           N+  G + RDSVR AL +GITA QII +L +HA P    +   LPP V DQI+LW+ E +
Sbjct: 340 NMVTGQINRDSVRFALSNGITAEQIITYLSVHAHPRMKGMEHVLPPTVVDQIKLWQLEMD 399

Query: 321 RLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           R+  ++G L+S+F +  +++ +  YA++LGVL ++N  KR  V T  G   I +F
Sbjct: 400 RIRATDGYLFSEFKNFDEYKDVSTYAKELGVLLYENPGKRKFVSTLAGSQQIVEF 454


>gi|339237219|ref|XP_003380164.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
 gi|316977049|gb|EFV60220.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
          Length = 674

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 147/405 (36%), Positives = 214/405 (52%), Gaps = 33/405 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +A+  V+RL  V      A I  W+         E+   L +L              
Sbjct: 40  LPSVAQQIVMRLSLVSSGTTIADIEGWMVDEKKDILHESLKYLRQLHILQECNLSSIESV 99

Query: 49  -------RNAKVALLGGGKPWSMSAKLEVDSK-ARDLDFLNQYALERWECILRFMV--GS 98
                  +N ++ALL   K       + VD K  +    L+ YA ERWE +L+++    +
Sbjct: 100 VLNRVFAKNLRLALLC--KDAICFKTVTVDPKHQKSFADLDSYASERWESVLKYLALPSA 157

Query: 99  QQTEGISADAVRTLFNEGA-----ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
           Q  + +S +  R L + G          +T DGFQF+L +   Q+W ++L YL  +E KG
Sbjct: 158 QSEKSVSVETKRVLQDSGLIQLCDSKMQLTSDGFQFILYDRRQQLWTYLLHYLAQLEKKG 217

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
             + +C+  + Q    +    YSTE +++    F+QHLRE GLV+QRKR AG FY T L 
Sbjct: 218 SPVHDCIMLILQACLGSHRAAYSTENLTEAALNFIQHLREIGLVHQRKRSAGWFYYTPLI 277

Query: 214 LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
             +    +     + GFLIVETNFRVY YTDS L +A++  FCE +YRFPNL   +L R+
Sbjct: 278 SVLTGLKSSSSSSKEGFLIVETNFRVYCYTDSVLDLAIVSTFCEPLYRFPNLVACILNRE 337

Query: 274 SVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLY 330
           SVR A +  I+A QII +L  +A  ++   +  +P  V+DQI+LWE ER+R     GV+Y
Sbjct: 338 SVRRAFQVNISAEQIIQYLFSNAHKNMQKQTPTIPSTVTDQIKLWEMERDRFKFDPGVMY 397

Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           S F S  D+  +RDYA+DLGVL  ++E  R +VV+  GH    +F
Sbjct: 398 SNFFSDTDYITIRDYAKDLGVLLCEHEANRALVVSADGHEQSNQF 442


>gi|345569654|gb|EGX52519.1| hypothetical protein AOL_s00043g13 [Arthrobotrys oligospora ATCC
           24927]
          Length = 452

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 149/416 (35%), Positives = 223/416 (53%), Gaps = 52/416 (12%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK +V+ +LF   PVP + + +++     +  +EA   L +L              
Sbjct: 38  LPNLAKTFVMAMLFNPDPVPVSSLEAFVRPDQNRLKEEAFGKLRQLYIVTERQGTFALDA 97

Query: 48  --RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVG--SQQTE 102
             +   + AL GGG P S     + + K R D+ FL++YA E+WE IL FMVG  +Q ++
Sbjct: 98  TFKVELRKALTGGGSPGSFGLPCDTEDKNRVDISFLDRYAAEQWESILHFMVGNETQSSQ 157

Query: 103 GISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
             S   VR L +        A   VIT+ GF FLL E  +QVW  +L YL+  E   +D 
Sbjct: 158 RPSEGVVRLLLHGKLMEQNRASGTVITQLGFSFLLQEVNSQVWTLLLLYLDMAEQLRMDP 217

Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
           V+ L FLF L     G DY  + ++   +  L+ LR++G+VYQRK  + RFYPT+LA  +
Sbjct: 218 VDILHFLFLLGSLELGQDYDMKSLTPTQKGMLEDLRDYGIVYQRKSSSRRFYPTRLATGL 277

Query: 217 ATRGTLKQIREP--------------GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
            +   ++ +R P              GFLI+ETN+       S L++A++ LF  L+ RF
Sbjct: 278 TSE--IRSLRSPAATLANATPGNDENGFLIIETNYH------SPLQIAVLNLFARLISRF 329

Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGER 319
           PNL    +TR+S+R A+  GITA QII ++  +A P +   S  LPP V DQIRLW+ E 
Sbjct: 330 PNLVTARITRESIRQAIGKGITAEQIIDYMTANAHPQLRKNSPVLPPTVVDQIRLWQIEG 389

Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           ER+  + G L+ +F + A+F  +  YA D+GVL W ++ K  + V+K  H  I  +
Sbjct: 390 ERMKVTSGYLFKEFATNAEFNAVCKYADDIGVLVWIDKNKGMLFVSK--HEQIADY 443


>gi|189205068|ref|XP_001938869.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
 gi|187985968|gb|EDU51456.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
          Length = 482

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 150/425 (35%), Positives = 223/425 (52%), Gaps = 62/425 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK  V+ +L++  P P A + +W   +  KE + AT TL  L              
Sbjct: 39  LPHLAKSIVMAMLYMPSPFPAADLDAWFKPSARKEKERATFTLDRLHIITSSRQDNGTLS 98

Query: 48  -------RRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                  +R+ + A+ G G  + + + A  E   K   ++FL++Y+  +WE IL ++V  
Sbjct: 99  WTLSPGFQRSLRNAIEGSGTHRSFGVPATKEESGKRVSVEFLDEYSRAQWEGILYYLVSG 158

Query: 99  QQTEGISADAV---------RTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
               G+  D +         +TL N G        +P ITKDGF F+L ET AQVW  ++
Sbjct: 159 --AAGLGKDNISRAEVSPSTKTLLNTGDLVRTIHGSPRITKDGFSFVLQETNAQVWSLLI 216

Query: 144 KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
            YL+     G+   E L FLF L     G DYST  +S      L  L   GL+Y+  + 
Sbjct: 217 IYLKVTNELGMSETEVLAFLFMLGSLELGQDYSTSTLSPTQLRMLDDLSSMGLIYRSDKN 276

Query: 204 AGRFYPTKLALNM---------ATRGTLKQIRE----------PGFLIVETNFRVYAYTD 244
           A  FYPT+LA  +         A+   + Q  +           GF+I+ETN+R+YAYT+
Sbjct: 277 ARTFYPTRLATTLTSDSGSAMSASSNDIAQAGQGNAGPSATANKGFIIIETNYRLYAYTN 336

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SC 302
           S +++A++ LF +L +RFPNL  G LT++SV  A++SGIT+AQII +L  +A P +  + 
Sbjct: 337 SLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITSAQIISYLTTYAHPQMQKTV 396

Query: 303 P-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
           P +PP V DQIRLWE E ER+ T+ G L  +F S A++  +  YA  LGVL WQN+  R 
Sbjct: 397 PYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDAEYRDVMGYASALGVLVWQNDGARC 456

Query: 362 VVVTK 366
             V++
Sbjct: 457 FFVSQ 461


>gi|451996931|gb|EMD89397.1| hypothetical protein COCHEDRAFT_1141584 [Cochliobolus
           heterostrophus C5]
          Length = 482

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 149/424 (35%), Positives = 224/424 (52%), Gaps = 62/424 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK  V+ +L++  P P A + +W   T  KE + AT TL  L              
Sbjct: 39  LPHLAKSIVMAMLYMPTPFPAADLDTWFKPTARKEKERATFTLDRLHIITSARQDNGTLS 98

Query: 48  -------RRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                  +R+ + A+ G G  + + + A  E       ++FL++Y+  +WE IL ++V  
Sbjct: 99  WTLNPGFQRSLRNAIEGSGTHRSFGVPATKEESGNRVSIEFLDEYSRSQWEGILYYLVSG 158

Query: 99  QQTEGISADAV---------RTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
               G+S D++         + L + G        +P ITKDGF F+L ET AQVW  ++
Sbjct: 159 --AAGLSKDSISRAEVGPGTKKLLHTGDLVRTIHGSPRITKDGFSFVLQETNAQVWSLLI 216

Query: 144 KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
            YL+     G+   E L+FLF L     G DYST  +S      L+ L   GLVY+  R 
Sbjct: 217 VYLKMTNELGMSETEVLSFLFMLGSLELGQDYSTSTLSATQLQMLEDLSAMGLVYRSDRN 276

Query: 204 AGRFYPTKLALNMAT--------------RGTLKQIREP-----GFLIVETNFRVYAYTD 244
           A  FYPT+LA  + +              + +      P     GF+I+ETN+R+YAYT+
Sbjct: 277 ARTFYPTRLATTLTSDSGSAMSTSSNDIAQASTSTTGPPTAANKGFIIIETNYRLYAYTN 336

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SC 302
           S +++A++ LF +L +RFPNL  G LT++SV  A+++GIT+AQII +L  +A P +  + 
Sbjct: 337 SLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITSAQIISYLTTYAHPQMQKTV 396

Query: 303 P-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
           P +PP V DQIRLWE E ER+ T+ G L  +F S A++  + +YA  LGVL WQN+  R 
Sbjct: 397 PYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAEYRDVLNYASALGVLVWQNDANRC 456

Query: 362 VVVT 365
             V+
Sbjct: 457 FFVS 460


>gi|357123176|ref|XP_003563288.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
           subunit 4-like [Brachypodium distachyon]
          Length = 452

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 142/405 (35%), Positives = 223/405 (55%), Gaps = 34/405 (8%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASW-LSKTHAKEHKEATDTLTELR--------------- 48
           + K Y +++L+V  P+P A +  W L   +A +HK A D L +LR               
Sbjct: 43  LPKKYTLQMLYVSAPMPAAAMEEWVLLDEYASKHKVAIDRLLQLRVFVEVRDXRKEVSYK 102

Query: 49  ------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
                  N +  L+ GG          V  +   L  L  YALE+WEC L  ++ S Q E
Sbjct: 103 MNDKFQGNMQKYLVDGGSLPREPIPSSVTGRLPTLTELENYALEQWECFLLQLINSSQVE 162

Query: 103 ---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
                S+  ++T     L +   + P ++++GFQFLL+ET  Q+WY + +Y+ + E +G+
Sbjct: 163 RGTSFSSSMMKTFQRGLLSSRDGDAPKLSENGFQFLLMETNVQLWYIMREYISSAEERGV 222

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLA 213
           D  E ++FL +L F T G  YS   ++D  ++ ++ L E GLV  Q+ RK   F PTKLA
Sbjct: 223 DPTELISFLLELSFHTLGAAYSFNTLTDVQRIAIRDLAELGLVKVQQGRKDSWFIPTKLA 282

Query: 214 LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
            N+++  +     + G ++VETNFR+YAY+ S L   ++ LF  + Y+ PNL VG +T++
Sbjct: 283 TNLSSSLSDSSASKEGIVVVETNFRLYAYSASRLHCEILRLFSRVEYQLPNLIVGAITKE 342

Query: 274 SVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLY 330
           S+  A  +GITA QII FL+ +A P V+  +P V   V+DQIRLWE +R R+      +Y
Sbjct: 343 SLYGAFDNGITAEQIISFLKQNAHPXVADKIPVVPENVTDQIRLWETDRNRVDMVLSHVY 402

Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             F S+  FE+  D A+D G L W++ KK  ++V  + H ++++F
Sbjct: 403 EDFPSKDLFEQCCDLARDNGFLLWEDSKKMRLIVRVEFHQEMREF 447


>gi|330918142|ref|XP_003298106.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
 gi|311328891|gb|EFQ93800.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
          Length = 482

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 149/425 (35%), Positives = 222/425 (52%), Gaps = 62/425 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK  V+ +L++  P P A + +W   +  KE + AT TL  L              
Sbjct: 39  LPHLAKSIVMAMLYMPSPFPAADLDAWFKPSARKEKERATFTLDRLHIITSVRQDNGTLS 98

Query: 48  -------RRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                  +R+ + A+ G G  + + + A  E   K   ++FL++Y+  +WE IL ++V  
Sbjct: 99  WTLSPGFQRSLRNAIEGSGTHRSFGVPATKEESGKRVSVEFLDEYSRSQWEGILYYLVSG 158

Query: 99  QQTEGISADAV---------RTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
               G+  D +         +TL N G        +P ITKDGF F+L ET AQVW  ++
Sbjct: 159 --AAGLGKDNISRAEVSPSTKTLLNTGDLVRTIHGSPRITKDGFSFVLQETNAQVWSLLI 216

Query: 144 KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
            YL+     G+   E L FLF L     G DYST  +S      L  L   GL+Y+  + 
Sbjct: 217 IYLKVTNELGMSETEVLAFLFMLGSLELGQDYSTSTLSPTQLRMLDDLSSMGLIYRSDKN 276

Query: 204 AGRFYPTKLALNM---------ATRGTLKQIRE----------PGFLIVETNFRVYAYTD 244
           A  FYPT+LA  +         A+   + Q  +           GF+I+ETN+R+YAYT+
Sbjct: 277 ARTFYPTRLATTLTSDSGSAMSASSNDIAQAGQGNAGPSAAANKGFIIIETNYRLYAYTN 336

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SC 302
           S +++A++ LF +L +RFPNL  G LT++SV  A++SGIT+AQII +L  +A P +  + 
Sbjct: 337 SLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITSAQIISYLTTYAHPQMQKTV 396

Query: 303 P-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
           P +PP V DQIRLWE E ER+ T+ G L  +F S  ++  +  YA  LGVL WQN+  R 
Sbjct: 397 PYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDTEYRDVMGYASALGVLVWQNDAARC 456

Query: 362 VVVTK 366
             V++
Sbjct: 457 FFVSQ 461


>gi|308452336|ref|XP_003089005.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
 gi|308243592|gb|EFO87544.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
          Length = 468

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 142/358 (39%), Positives = 211/358 (58%), Gaps = 30/358 (8%)

Query: 54  ALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQTEGISADAVRT 111
           A+LG     S+   LE + + R    + + A+ERW+CILR++     + T+ +S +  R 
Sbjct: 114 AMLGAASISSLV--LETNDEKRRGKEVEKKAVERWDCILRYLALPSEENTQAVS-ETTRN 170

Query: 112 LFNE-----GAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQL 166
           LF +     G     IT  GFQFLLL    Q+W ++++YL+   S+G D+VE +  L Q+
Sbjct: 171 LFKKANFTSGDSRIEITTSGFQFLLLSPVKQMWTYVIEYLKLEMSQGQDIVEVIEPLIQI 230

Query: 167 ------KFSTFGTDYSTE-GMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
                  F      Y  +   S   Q  L HLRE G+++ RKRK G F+ T L  ++AT 
Sbjct: 231 VLLANRGFKAEKECYQIDPSWSIPQQELLNHLRELGVIFIRKRKDGVFFLTHLLTHLATN 290

Query: 220 GTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
            T+          G +IVETNFRVYAYT S L++A+I LF E+ YRF +++VG++TR+SV
Sbjct: 291 ETIDDTSAERVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESV 350

Query: 276 RAALRSGITAAQIIGFLRLHALP-------SVSCPLPPVVSDQIRLWEGERERLTTSEGV 328
           R AL+ GITAAQII FLR +A P       +++C LP  V+DQIRLWE ER R+   +  
Sbjct: 351 RGALQHGITAAQIISFLRANAHPQCIATSGAINC-LPITVADQIRLWEDERRRMDLKDAY 409

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQHS 386
           +YS F S+ +F+ + DYAQ+ G+L W N +++ V+V ++GH  +++ W    KG + S
Sbjct: 410 IYSHFESEDEFQGVCDYAQERGILLWANAQQKLVIVNEEGHEYVRQ-WYKRSKGGETS 466


>gi|308460924|ref|XP_003092760.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
 gi|308252519|gb|EFO96471.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
          Length = 470

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 142/358 (39%), Positives = 211/358 (58%), Gaps = 30/358 (8%)

Query: 54  ALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQTEGISADAVRT 111
           A+LG     S+   LE + + R    + + A+ERW+CILR++     + T+ +S +  R 
Sbjct: 117 AMLGAASISSLV--LETNDEKRRGKEVEKKAVERWDCILRYLALPSEENTQAVS-ETTRN 173

Query: 112 LFNE-----GAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQL 166
           LF +     G     IT  GFQFLLL    Q+W ++++YL+   S+G D+VE +  L Q+
Sbjct: 174 LFKKANFTSGDSRIEITTSGFQFLLLSPVKQMWTYVIEYLKLEMSQGQDIVEVIEPLIQI 233

Query: 167 ------KFSTFGTDYSTE-GMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
                  F      Y  +   S   Q  L HLRE G+++ RKRK G F+ T L  ++AT 
Sbjct: 234 VLLANRGFKAEKECYQIDPSWSIPQQELLNHLRELGVIFIRKRKDGVFFLTHLLTHLATN 293

Query: 220 GTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
            T+          G +IVETNFRVYAYT S L++A+I LF E+ YRF +++VG++TR+SV
Sbjct: 294 ETIDDTSAERVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESV 353

Query: 276 RAALRSGITAAQIIGFLRLHALP-------SVSCPLPPVVSDQIRLWEGERERLTTSEGV 328
           R AL+ GITAAQII FLR +A P       +++C LP  V+DQIRLWE ER R+   +  
Sbjct: 354 RGALQHGITAAQIISFLRANAHPQCIATSGAINC-LPITVADQIRLWEDERRRMDLKDAY 412

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQHS 386
           +YS F S+ +F+ + DYAQ+ G+L W N +++ V+V ++GH  +++ W    KG + S
Sbjct: 413 IYSHFESEDEFQGVCDYAQERGILLWANAQQKLVIVNEEGHEYVRQ-WYKRSKGGETS 469


>gi|357461579|ref|XP_003601071.1| General transcription factor IIH subunit [Medicago truncatula]
 gi|355490119|gb|AES71322.1| General transcription factor IIH subunit [Medicago truncatula]
          Length = 452

 Score =  241 bits (614), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 144/408 (35%), Positives = 222/408 (54%), Gaps = 34/408 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------RNAK 52
           L  +AK YV++LL ++ PVP  ++A W+      +HK A D L +LR         +N K
Sbjct: 40  LPPLAKKYVLQLLHIDGPVPAKLLAEWVLPDGLSKHKVAIDRLVQLRVFVEAFDNRKNEK 99

Query: 53  V-------------ALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                          L+ GG     S    +  +   L+ L  YALE+WEC L  ++   
Sbjct: 100 TYQVNSTYQKSLQNLLVHGGTLPRESMPSNITVRLPTLEDLETYALEQWECFLLQLISPS 159

Query: 99  --QQTEGISADAV-----RTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
              +T  IS+  +     R L     E P +T+ GFQFLL++T AQ+WY I +Y+   E 
Sbjct: 160 HVDKTLNISSSLMKVFQRRLLSQRDREAPKLTESGFQFLLMDTNAQLWYIIREYITNSEE 219

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
           +G+D  + ++F+ +L F   G  Y+   +++  +  ++ L + GLV  Q+ RK   F PT
Sbjct: 220 RGVDAGDLISFMLELSFHVIGEAYNINTLTEFQRNIIKDLADLGLVKLQQGRKESWFIPT 279

Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
           KLA N++   T    R+ GF++VETNFRVYAY+ S L   ++ LF  + Y+ PNL VG +
Sbjct: 280 KLATNLSVSMTESSSRKEGFVVVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAI 339

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEG 327
           T++S+  A  +GITA QI+ FLR +A P V+  +P V   V+DQIRLWE +  R+  +E 
Sbjct: 340 TKESLYNAFDNGITADQIVSFLRQNAHPRVAQRVPAVPENVTDQIRLWESDLNRVEMTEA 399

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             Y +F S+  FE   D A++   L W++ KK  +VV  + H+ ++ F
Sbjct: 400 YYYDEFPSRDVFEGACDCAREWSGLLWEDSKKMHLVVKSEVHTYVRDF 447


>gi|145248676|ref|XP_001400677.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus niger CBS 513.88]
 gi|134081344|emb|CAK41847.1| unnamed protein product [Aspergillus niger]
          Length = 478

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 152/437 (34%), Positives = 223/437 (51%), Gaps = 65/437 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-----------RN 50
           L ++AK +V+ LL+++ P+P A + +W+     KE   A   L  L            R 
Sbjct: 39  LPDLAKCFVMALLYLKDPLPAADLEAWVKSESLKERDNALSILGRLHILSNTVTADNVRA 98

Query: 51  AKV----------ALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQ 99
            KV          AL G  +  S     +V + +   +  L++YA  +WE +L +MVG+ 
Sbjct: 99  YKVTDPFATSLRHALTGADQTQSFGVLSQVPAGSEVSIPELDEYARRQWEGVLGYMVGTG 158

Query: 100 QTEGIS---ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVE 150
              G     +  V+ L   G    +      IT+DGF F+L +   QVW+ ++ Y+E+  
Sbjct: 159 GMGGPQVTLSKGVKELLQAGHLVEIRDRRVEITQDGFAFVLQDVSTQVWHILILYVESAS 218

Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
           +  +D VE L+F+F L     G  Y    +S K    L  L +FG+VYQ    A RFYPT
Sbjct: 219 AIKMDSVEVLSFVFLLSSLELGKSYEKNHLSSKQLRMLTDLADFGIVYQESPDAPRFYPT 278

Query: 211 KLALNMATRGT------LKQIREP----------GFLIVETNFRVYAYTDSNLKVALIGL 254
           +LA  + +  +      L  +  P          GF+I+ETN+RVYAYT S L+++LI L
Sbjct: 279 RLATTLTSDSSALSNPLLGSLSGPAGEASSKAGTGFIIIETNYRVYAYTSSPLQISLIAL 338

Query: 255 FCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------------S 301
           F  L YRFPNL  G +TR SVR A+  GITA QII +L  HA P +              
Sbjct: 339 FTTLKYRFPNLITGKITRQSVRRAVEMGITADQIISYLSTHAHPQMRKHNVSRSTSNQAG 398

Query: 302 CP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
            P   LPP V DQIRLW+ ER+R+  + G L+  FVS A++E    YA+++GVL W++++
Sbjct: 399 IPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKDFVSLAEYEAPCRYAEEIGVLAWKSDR 458

Query: 359 KRTVVVTKQGHSDIKKF 375
           KR   VT+  H  +  F
Sbjct: 459 KRMFFVTR--HEQVAAF 473


>gi|121714341|ref|XP_001274781.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus clavatus NRRL 1]
 gi|119402935|gb|EAW13355.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus clavatus NRRL 1]
          Length = 478

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 151/438 (34%), Positives = 224/438 (51%), Gaps = 66/438 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEA----------TDTLT------ 45
           L ++AK +V+ LL+++ P+P A + +W+     KE   A          T+T+T      
Sbjct: 38  LPDLAKCFVMALLYLKDPLPAADLETWVKPDSLKERDNALSTLGRLHILTNTMTADNVRA 97

Query: 46  -----ELRRNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGS- 98
                E   + + AL G     S      V D  A  +  L++Y+  +WE +L +MVG+ 
Sbjct: 98  YMVTKEFASSLRQALTGAEHTQSFGVVSNVSDEAAVSIADLDEYSRRQWEGVLGYMVGTG 157

Query: 99  ---QQTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
               Q +   +  V+ L   G    +      ITKDGF F+L +   QVW+ ++ Y+E+ 
Sbjct: 158 GLGMQRDVNLSKGVKQLLQAGHLVEIRDRRVEITKDGFAFVLQDVGTQVWHILILYVESA 217

Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
           E+ G+D VE L F+F L     G  Y  + M+      L  L +FG+VYQ   +A  FYP
Sbjct: 218 EAIGMDSVEVLKFVFFLSSLELGRSYEKKQMTSNQLRTLTDLADFGIVYQESPEATHFYP 277

Query: 210 TKLALNMAT----------------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIG 253
           T+LA  + +                 GT       GF+I+ETN+R+YAYT S L+++LI 
Sbjct: 278 TRLATTLTSDSSALSNPIAGSLSGPTGTASNKAGSGFIIIETNYRLYAYTSSPLQISLIA 337

Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------- 300
           LF  L YRFPNL  G +TR S+R A+  GITA QII +L  HA P +             
Sbjct: 338 LFTTLKYRFPNLITGKITRQSIRRAVEMGITADQIISYLSTHAHPQMRKHNASRSTSNQA 397

Query: 301 SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
             P   LPP V DQIRLW+ ER+R+  + G L+  FVS A++E    YA+++GVL W+++
Sbjct: 398 GMPVSVLPPTVVDQIRLWQLERDRVKATPGFLFKDFVSLAEYEAPCRYAEEIGVLVWKSD 457

Query: 358 KKRTVVVTKQGHSDIKKF 375
           +KR   VT+  H  +  F
Sbjct: 458 RKRMFFVTR--HEQVAAF 473


>gi|222629781|gb|EEE61913.1| hypothetical protein OsJ_16644 [Oryza sativa Japonica Group]
          Length = 427

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 144/405 (35%), Positives = 222/405 (54%), Gaps = 53/405 (13%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK Y +++L+V  PV  A +  W+   +A +H+ A D L +LR             
Sbjct: 40  LPPLAKKYALQMLYVSAPVAAAAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVS 99

Query: 49  --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
                    N +  L+ GG       P S++A+L        L  L  +ALE+WE  +R 
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPIPSSVTARLPT------LAELESFALEQWE--VRM 151

Query: 95  MVGSQQTEGISADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
                  +G              E P +T++GFQFLL+ET AQ+WY + +Y+ + E +G+
Sbjct: 152 TKAYVCIDG--------------EAPRLTENGFQFLLMETNAQLWYIMREYISSAEERGV 197

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLA 213
           D  E ++FL +L F T G  YS   ++D  +  ++ L E GLV  Q+ RK   F PTKLA
Sbjct: 198 DPTELISFLLELSFHTLGEAYSLNTLTDVQRNAIRDLAELGLVKLQQGRKDSWFIPTKLA 257

Query: 214 LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
            N++   +     + GF++VETNFR+YAY+ S L   ++ LF  + Y+ PNL VG +T++
Sbjct: 258 TNLSASLSDSSSNKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKE 317

Query: 274 SVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLY 330
           S+  A  +GITA QII FL+ +A P V+  +P V   V+DQIRLWE +R R+  +   LY
Sbjct: 318 SLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLY 377

Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             F S+  F++  DYA+D G L W++ KK  ++V  + HS++++F
Sbjct: 378 EDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLIVRVEFHSEMREF 422


>gi|444320663|ref|XP_004180988.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
 gi|387514031|emb|CCH61469.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
          Length = 518

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 143/455 (31%), Positives = 225/455 (49%), Gaps = 84/455 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L ++AK +++ ++F E  V    +  W+      + +EA  ++  L              
Sbjct: 39  LPQLAKFFIMSMVFNENEVMLRDLDRWVKSNGKLQFQEAIKSMKSLHLLVPGKNSGAVMI 98

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
                 R + K AL GG    S    ++ D  A  +  L +Y+ ++WE IL FMVG+   
Sbjct: 99  NLNSTFRSSLKNALTGGEINNSFGVIVDNDRDAVKVSMLEEYSTKKWETILHFMVGTPLA 158

Query: 102 EGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
              S + +  L        NE      IT +GFQFLL E+ +Q+W  +L+YL+  E+  +
Sbjct: 159 TTPSENVLNLLKHSKLMEENEETNEFKITNEGFQFLLQESNSQIWTLLLQYLKLTETLHM 218

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           D V+ L F+F L    FG  YS   +S+  ++ L+ +R++GL++Q+   A  FYPT +A 
Sbjct: 219 DPVDVLNFIFMLGALEFGKAYSISSLSETQKIMLKDMRDYGLIFQKNSNANVFYPTSMAT 278

Query: 215 NMATR------------------------------------GTLKQIREPGFLIVETNFR 238
            + +                                     G+  Q    G LI+ETNF+
Sbjct: 279 MLTSDARNVRTASGAIDHILQRSQETNKNSNNDDDEDVDQVGSNTQATADGALIIETNFK 338

Query: 239 VYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP 298
           +Y+Y++S L++A++ LF  L  RF N+  G +TRDS+R ALR+GITA QII +L  H+ P
Sbjct: 339 LYSYSNSPLQIAVLSLFVHLKTRFSNMVTGQITRDSIRRALRNGITAEQIIAYLLTHSHP 398

Query: 299 SVS-------------CP--------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
            +               P        LPP V DQI+LW+ E +R+ + EG LYS F +  
Sbjct: 399 QMRRLAEGNLEKKLELDPNAKDSLQILPPTVVDQIKLWQLELDRIISHEGSLYSDFENNQ 458

Query: 338 DFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
           ++  L  YA+D+GVL W+N+KKR   V K+G+S +
Sbjct: 459 EYNLLSTYAEDIGVLLWKNDKKRKFFVLKEGNSQV 493


>gi|358370616|dbj|GAA87227.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus kawachii IFO 4308]
          Length = 478

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 154/440 (35%), Positives = 223/440 (50%), Gaps = 71/440 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-----------RN 50
           L ++AK +V+ LL+++ P+P A + +W+     KE   A   L  L            R 
Sbjct: 39  LPDLAKCFVMALLYLKDPLPAADLEAWVKSESLKERDNALSILGRLHILSNTVTADNVRA 98

Query: 51  AKV----------ALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQ 99
            KV          AL G  +  S      V + +   +  L++YA  +WE +L +MVG  
Sbjct: 99  YKVTDPFATSLRHALTGADQTQSFGVLSHVPAGSEVSISDLDEYARRQWEGVLGYMVG-- 156

Query: 100 QTEGISA------DAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLE 147
            T G+ A        V+ L   G    +      IT+DGF F+L +   QVW+ ++ Y+E
Sbjct: 157 -TGGMGAPQVTLSKGVKELLQAGHLVEIRDRRVEITQDGFAFVLQDVSTQVWHILILYVE 215

Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
           +  +  +D VE L+F+F L     G  Y    +S K    L  L +FG+VYQ    A RF
Sbjct: 216 SASAIKMDSVEVLSFVFLLSSLELGKSYEKNHLSSKQLRMLTDLADFGIVYQESPDAPRF 275

Query: 208 YPTKLALNMATRGT------LKQIREP----------GFLIVETNFRVYAYTDSNLKVAL 251
           YPT+LA  + +  +      L  +  P          GF+I+ETN+RVYAYT S L+++L
Sbjct: 276 YPTRLATTLTSDSSALSNPLLGSLSGPAGEASSKAGTGFIIIETNYRVYAYTSSPLQISL 335

Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------- 300
           I LF  L YRFPNL  G +TR SVR A+  GITA QII +L  HA P +           
Sbjct: 336 IALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQIISYLSTHAHPQMRKHNVSRSTSN 395

Query: 301 --SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
               P   LPP V DQIRLW+ ER+R+  + G L+  FVS A++E    YA+++GVL W+
Sbjct: 396 QAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKDFVSLAEYEAPCRYAEEIGVLAWK 455

Query: 356 NEKKRTVVVTKQGHSDIKKF 375
           +++KR   VT+  H  +  F
Sbjct: 456 SDRKRMFFVTR--HEQVAAF 473


>gi|350639203|gb|EHA27557.1| transcription factor Tfb2 [Aspergillus niger ATCC 1015]
          Length = 478

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 153/438 (34%), Positives = 225/438 (51%), Gaps = 67/438 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-----------RN 50
           L ++AK +V+ LL+++ P+P A + +W+     KE   A   L  L            R 
Sbjct: 39  LPDLAKCFVMALLYLKDPLPAADLEAWVKSESLKERDNALSILGRLHILSNTVTADNVRA 98

Query: 51  AKV----------ALLGGGKPWSMS--AKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
            KV          AL G  +  S    +++ V S+   +  L++YA  +WE +L +MVG+
Sbjct: 99  YKVTDPFATSLRHALTGADQTQSFGVLSQVPVGSEV-SIPELDEYARRQWEGVLGYMVGT 157

Query: 99  QQTEGIS---ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
               G     +  V+ L   G    +      IT+DGF F+L +   QVW+ ++ Y+E+ 
Sbjct: 158 GGMGGPQVTLSKGVKELLQAGHLVEIRDRRVEITQDGFAFVLQDVSTQVWHILILYVESA 217

Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
            +  +D VE L+F+F L     G  Y    +S K    L  L +FG+VYQ    A RFYP
Sbjct: 218 SAIKMDSVEVLSFVFLLSSLELGKSYEKNHLSSKQLRMLTDLADFGIVYQESPDAPRFYP 277

Query: 210 TKLALNMATRGT------LKQIREP----------GFLIVETNFRVYAYTDSNLKVALIG 253
           T+LA  + +  +      L  +  P          GF+I+ETN+RVYAYT S L+++LI 
Sbjct: 278 TRLATTLTSDSSALSNPLLGSLSGPAGEASSKAGTGFIIIETNYRVYAYTSSPLQISLIA 337

Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------- 300
           LF  L YRFPNL  G +TR SVR A+  GITA QII +L  HA P +             
Sbjct: 338 LFTTLKYRFPNLITGKITRQSVRRAVEMGITADQIISYLSTHAHPQMRKHNVSRSTSNQA 397

Query: 301 SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
             P   LPP V DQIRLW+ ER+R+  + G L+  FVS A++E    YA+++GVL W+++
Sbjct: 398 GIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKDFVSLAEYEAPCRYAEEIGVLAWKSD 457

Query: 358 KKRTVVVTKQGHSDIKKF 375
           +KR   VT+  H  +  F
Sbjct: 458 RKRMFFVTR--HEQVAAF 473


>gi|255721741|ref|XP_002545805.1| TFIIH basal transcription factor complex subunit 2 [Candida
           tropicalis MYA-3404]
 gi|240136294|gb|EER35847.1| TFIIH basal transcription factor complex subunit 2 [Candida
           tropicalis MYA-3404]
          Length = 494

 Score =  238 bits (606), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 150/450 (33%), Positives = 221/450 (49%), Gaps = 77/450 (17%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           LS +AK Y++ ++F E+P+    +  W   +  K   EA   L  L              
Sbjct: 39  LSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLESLHLIEYDSKGSHIRL 98

Query: 48  ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R+N +  L G   P +  S     D    D+ FL+ +A  +WE IL FMVG++ T 
Sbjct: 99  HSTFRKNFRDCLTGSQNPNAFGSISTTADKYKVDIAFLDNFASSKWETILHFMVGTEAT- 157

Query: 103 GISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
              +D+V +L       EG  N + IT  GFQFLL +  AQ+W  +L+YL   +   +D 
Sbjct: 158 ATPSDSVLSLLKLGGLMEGPNNKLRITNSGFQFLLQDVNAQIWTLLLQYLNLTQELNMDP 217

Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN- 215
           V+ L F+F L     G  Y+   +S+     L  L+++GLVYQR   +GRFYPT+LA   
Sbjct: 218 VDVLNFIFVLGSLELGKSYAVSTLSETQVSMLADLKDYGLVYQRSDTSGRFYPTRLATTL 277

Query: 216 -----------MATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
                      MA     +Q+     +I+ETNF++YAYT S L++A++ LF +L  RF N
Sbjct: 278 TSDSAALKTPAMAMDEEEQQVVTKESIIIETNFKLYAYTKSPLEIAILNLFVQLKTRFAN 337

Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------------------ 300
           +  G +TR+S+R AL +GITA QII FL  HA   +                        
Sbjct: 338 MVCGQITRESIRNALYNGITADQIIKFLETHAHSQMRILAKEKLDKKIEFDTSHNINTAG 397

Query: 301 SCP---------------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
             P               LPP V DQI+LW+ E +R+ T +G L+  F +Q +++ L +Y
Sbjct: 398 GAPQSKTDGSISQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFKDFANQQEYDILSNY 457

Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           A +LGVL W ++ KR   VTK G + +  F
Sbjct: 458 ASELGVLIWADKVKRKFFVTKDGMTQVADF 487


>gi|261187837|ref|XP_002620336.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Ajellomyces dermatitidis SLH14081]
 gi|239593453|gb|EEQ76034.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Ajellomyces dermatitidis SLH14081]
 gi|239608433|gb|EEQ85420.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Ajellomyces dermatitidis ER-3]
 gi|327356043|gb|EGE84900.1| RNA pol II transcription initiation subunit [Ajellomyces
           dermatitidis ATCC 18188]
          Length = 485

 Score =  238 bits (606), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 157/435 (36%), Positives = 218/435 (50%), Gaps = 70/435 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK +V+ LL+++ P+P A +  W+     +E   A   L  L              
Sbjct: 38  LPHLAKCFVMALLYLKDPLPAADLELWVKPESKRERDNALSILGRLHILSNTTTSDNVRA 97

Query: 49  --------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                    + + AL GG K  S     L  D     +  L+ YA  +WE +L +MVG+ 
Sbjct: 98  YTVTDPFASSLRQALTGGNKQQSFGVPSLTADENPMTVALLDDYARSQWEGVLGYMVGTS 157

Query: 99  ----QQTEGISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
               QQ   +S   V+ L       E  +  V ITKDGF F+L +   QVW+ ++ Y+E 
Sbjct: 158 ALGVQQAVTLS-KGVKQLLQACHLVEIRDRRVEITKDGFAFVLQDLNTQVWHILILYVEN 216

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
            E  G+D VE L+FLF L     G  Y  + ++      L  L +FG+VYQ    ++A R
Sbjct: 217 AEQIGMDSVEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLADLTDFGIVYQHPPHQEATR 276

Query: 207 FYPTKLA-----------------LNMATRGTLKQIREPG--FLIVETNFRVYAYTDSNL 247
           FYPT+LA                 L   + G      EPG  F+I+ETN+R+YAYT S L
Sbjct: 277 FYPTRLATTLTSDSITLGSSISSSLTAPSGGPAAVSNEPGTGFIIIETNYRLYAYTSSPL 336

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------- 298
           +++LI LF  L YRFPNL  G LTR S+R A+  GITA QII +L  HA P         
Sbjct: 337 QISLISLFTTLKYRFPNLITGKLTRQSIRRAVEMGITADQIISYLTTHAHPQMRKYHASK 396

Query: 299 ------SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
                  V   LPP V DQIRLW+ ER+R+  + G L+  FVS A+FE    YA+++GVL
Sbjct: 397 SGANPAGVPAVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVSLAEFEGPCRYAEEIGVL 456

Query: 353 TWQNEKKRTVVVTKQ 367
            W++EK+R   VT+ 
Sbjct: 457 LWKSEKRRMFFVTRH 471


>gi|407927340|gb|EKG20235.1| Transcription factor Tfb2 [Macrophomina phaseolina MS6]
          Length = 466

 Score =  238 bits (606), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 141/414 (34%), Positives = 217/414 (52%), Gaps = 49/414 (11%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK  V+ +L++  P P   + +W+    + E  +A   L  L              
Sbjct: 38  LPHLAKMIVMAMLYMPGPFPATELDTWIRDGSSGEKDKAISVLESLHIITQAQETVGARA 97

Query: 48  -------RRNAKVALLGGGKPWSMSAK-LEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                  + + + AL GGG   S     +   SK   ++ L++ A  +WE IL++MVGS 
Sbjct: 98  YRISSGFQTSLRQALTGGGNHRSFGVPCMNTGSKRVSIEALDEIARAKWETILQYMVGST 157

Query: 100 QTE-GISAD---AVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETV 149
            T  G+ ++     +TL   G         P IT+ GF FLL E  AQVW  ++ YLE  
Sbjct: 158 GTALGVGSEITTGTKTLLEVGQFIEMRGGRPSITQGGFSFLLQEVNAQVWSLLIVYLENS 217

Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
               +D V+ L+FLF L     G DYST+ ++   Q  L  L +FG+VY+   K+ RFYP
Sbjct: 218 HRLNMDNVDVLSFLFMLGSLELGQDYSTDSLTATQQHMLDDLNDFGIVYRSPPKSNRFYP 277

Query: 210 TKLALNMATRGTLKQIREP--------------GFLIVETNFRVYAYTDSNLKVALIGLF 255
           T+LA  + +        +               G++I+ETN+R+YAYT S L++A++ LF
Sbjct: 278 TRLATTLTSDAAALNNSDSFSSALNSTSASGSKGYIIIETNYRLYAYTSSPLQIAVLALF 337

Query: 256 CELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQI 312
            +L  RFPNL  G LT++SV+ A+  GITA QII +L  +A P +   +  LPP V DQI
Sbjct: 338 TDLETRFPNLVSGKLTKESVQRAIGLGITADQIISYLDTYAHPQMQRKTPVLPPTVVDQI 397

Query: 313 RLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
           RLW+ E +R+  +EG L+  F    +++ L+ YA  +GVL W N+++R + +T+
Sbjct: 398 RLWQIEGDRMRATEGFLFKDFRDWDEYQDLKKYADSIGVLVWSNDQRRLMFLTR 451


>gi|255556021|ref|XP_002519045.1| tfiih, polypeptide, putative [Ricinus communis]
 gi|223541708|gb|EEF43256.1| tfiih, polypeptide, putative [Ricinus communis]
          Length = 451

 Score =  237 bits (605), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 137/407 (33%), Positives = 224/407 (55%), Gaps = 33/407 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK YV+++L+++ PV    +  WL    + +H+ A D LT+LR             
Sbjct: 40  LPPLAKKYVVQILYIDGPVADKTLEEWLLPDGSSKHRVAIDRLTQLRIFTEIVDRKKEIS 99

Query: 49  --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                    N +  L+ GG          +  +   L+ L+ YAL +WEC L  ++ S  
Sbjct: 100 YKLNPTFQTNLQKHLIDGGVLPGEPLASNIAVRLPTLEELDTYALGQWECFLLHLINSGH 159

Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
            E  +  + ++  +F  G       E P +T+ GFQFLL++T AQ+WY I +Y+   E +
Sbjct: 160 AERSTNFSSSMMKIFQRGLLTQRDKEAPRLTESGFQFLLMDTNAQLWYIIREYISNSEER 219

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTK 211
           GLD  + ++FL +L F   G  Y+   +++  +  ++ L + GLV  Q+ RK   F PTK
Sbjct: 220 GLDSADLISFLLELSFHITGEAYNMIMLTEFQRNMIKDLADLGLVKLQQGRKESWFIPTK 279

Query: 212 LALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
           LA N++   T    R+ GF++VETNFR+YAY+ S L   ++ LF  + Y+ PNL VG +T
Sbjct: 280 LATNLSMSLTDSSSRKQGFVVVETNFRMYAYSTSKLHCEIMRLFSRVEYQLPNLVVGAMT 339

Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGV 328
           ++S+ +A  +GITA QII FL+ +A P V+  +P V   V+DQIRLWE +  R+  +   
Sbjct: 340 KESLYSAFENGITAEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWESDMNRVEMTPAH 399

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           LY +F S+  FE   ++A+D   L W++ K+  +VV  + H +++++
Sbjct: 400 LYDEFPSRDVFEAACNFARDWNGLLWEDSKRMRMVVKAEIHLNMREY 446


>gi|312069992|ref|XP_003137940.1| transcription factor Tfb2 family protein [Loa loa]
          Length = 470

 Score =  237 bits (605), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 143/366 (39%), Positives = 211/366 (57%), Gaps = 41/366 (11%)

Query: 48  RRNAKVALLGGGKPWSMSAKLEVDSKAR---DLDFLNQYALERWECILRFM-VGSQQTE- 102
           R   K   +G  K   +SA  + D K R   + D L + A+ERWECIL ++ + SQ++E 
Sbjct: 100 RSYLKATRIGLCKASELSAVTDCDEKTRKSANKD-LGKKAVERWECILHYLALPSQKSEQ 158

Query: 103 GISADAVRTLFNEG---AENP----VITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
           G+S    R   + G   +EN      IT  GFQFLLL    Q+W ++L Y    E  G+D
Sbjct: 159 GVSGTTKRLFRSAGLTSSENDDGDIEITSAGFQFLLLNRSEQIWTYLLYYFHMQEVAGID 218

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQV----------FLQHLREFGLVYQRKRKAG 205
           +++ L FLF++   T G+D      S    +          FL HLRE GLV+ RKRK G
Sbjct: 219 IIKELDFLFRV---TNGSDKERPAGSRAFLIDECWPEPTKDFLMHLRELGLVFIRKRKDG 275

Query: 206 RFYPTKL-----ALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMY 260
            F+ T L     +++ A+   ++   + GF+IVETN+RVYAYTDSNL++A++  F E++Y
Sbjct: 276 FFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNYRVYAYTDSNLQLAILSTFTEMLY 335

Query: 261 RFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC----------PLPPVVSD 310
           RF +++VGVL+R++VR A + GITAAQII FLR +A P+              +P  V+D
Sbjct: 336 RFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAHPTTYAVTTEYGGMIQSVPITVAD 395

Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHS 370
           QIRLWE ER RL      +YS F S+ ++  L+ Y    G+L W ++ +R +V+T++GH 
Sbjct: 396 QIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVSSQGILLWCDDVQRLMVITEEGHE 455

Query: 371 DIKKFW 376
            +K +W
Sbjct: 456 SVKSWW 461


>gi|254584224|ref|XP_002497680.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
 gi|238940573|emb|CAR28747.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
          Length = 509

 Score =  237 bits (605), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 149/460 (32%), Positives = 231/460 (50%), Gaps = 90/460 (19%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L ++AK +++ ++F E  +    +  W+  +   + ++A  ++  L              
Sbjct: 40  LPQLAKFFIMSMVFNESAISLRDLDRWVKSSGKLQFQDAIKSMKSLHLLIPVRSNGPMLI 99

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
                 R + K AL GG    S    +E+D+     D L++Y+ ++WE IL FMVG+   
Sbjct: 100 DLNHTFRESFKNALTGGEVSNSFGIVMELDNDTVTTDMLDKYSADKWETILHFMVGTSLA 159

Query: 102 EGISADAVRTLFNEGA----ENPV----ITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
             I ++ V  L          N      IT +GFQFLL E  +Q+W  +L+YL+  E   
Sbjct: 160 H-IPSENVLNLLKHSKLMEESNTTGEFKITNEGFQFLLQEVNSQLWALLLQYLKMTEMLQ 218

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
           +D V+ L F+F L    F   YST G+S+  +V L+ +R++GLV+Q    A  FYPT+LA
Sbjct: 219 MDPVDILNFIFMLGALEFSKPYSTNGLSETQKVMLKDMRDYGLVFQMNSNANVFYPTRLA 278

Query: 214 LNMAT-----RGT-------LKQIREP----------------------------GFLIV 233
             + +     RG        LKQ +E                             G LIV
Sbjct: 279 SMLTSDPKSIRGASGAMDSVLKQNKEDVLNKSGGAGANADDDADEIGSEGQNMQDGALIV 338

Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
           ETNF++Y+Y++S L++A++ LF  +  RF N+  G LTR+S+R AL +GITA QII +L 
Sbjct: 339 ETNFKLYSYSNSPLQIAILSLFVHMKSRFSNMVTGQLTRESIRQALLNGITADQIIAYLE 398

Query: 294 LHALPSV------------------SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQ 332
            HA P +                    P   LPP V DQI+LW+ E +R+ + EG LYS 
Sbjct: 399 THAHPQMRRMAEEQLEKRLELDPNSKDPLQILPPTVVDQIKLWQLELDRIISFEGSLYSD 458

Query: 333 FVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
           F +  +F  L  YAQD+GVL W++++K+   V+K+G+S +
Sbjct: 459 FENHQEFTLLSSYAQDIGVLIWKDDRKKKFFVSKEGNSQV 498


>gi|358346685|ref|XP_003637396.1| General transcription factor IIH subunit [Medicago truncatula]
 gi|355503331|gb|AES84534.1| General transcription factor IIH subunit [Medicago truncatula]
          Length = 451

 Score =  237 bits (604), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 140/407 (34%), Positives = 220/407 (54%), Gaps = 33/407 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK YV++L  ++ PVP  ++A W+      +HK A D L +L              
Sbjct: 40  LPPLAKKYVLQLFHIDGPVPAKLLAEWVLPDGLSKHKVAIDRLVQLGVFVEALDSKNGKT 99

Query: 48  -------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS-- 98
                  +R+ +  L+ GG     S    +  +   L+ L  YALE+WEC L  ++    
Sbjct: 100 YKVNPTYQRSLQKLLVHGGTLPRESMPSNITVRLPTLEDLETYALEQWECFLLQLISPSH 159

Query: 99  -QQTEGISADAV-----RTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
             +T  IS+  +     R L     E P +T+ GFQFLL++T AQ+WY I +Y+   E +
Sbjct: 160 VDKTLNISSSLMKVFQRRLLSQRDREAPKLTESGFQFLLMDTNAQLWYIIREYITNSEER 219

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTK 211
           G+D  + ++F+ +L F   G  Y+   +++  +  ++ L + GLV  Q+ RK   F PTK
Sbjct: 220 GVDAGDLISFMLELSFHVIGEAYNINTLTEFQRNIIKDLADLGLVKLQQGRKESWFIPTK 279

Query: 212 LALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
           LA N++   T    R+ GF++VETNFRVYAY+ S L   ++ LF  + Y+ PNL VG +T
Sbjct: 280 LATNLSVSMTESSSRKEGFVVVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAIT 339

Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGV 328
           ++S+  A  +GITA QI+ F R +A P V+  +P V   V+DQIRLWE +  R+  +E  
Sbjct: 340 KESLYNAFDNGITADQIVSFFRQNAHPRVAQRIPAVPENVTDQIRLWESDLNRVEMTEAY 399

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            Y +F S+  FE   D A++   L W++ KK  +VV  + H+ ++ F
Sbjct: 400 YYDEFPSRDVFEGACDCAREWSGLLWEDSKKMHLVVKSEVHTYVRDF 446


>gi|169784197|ref|XP_001826560.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus oryzae RIB40]
 gi|238508739|ref|XP_002385554.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus flavus NRRL3357]
 gi|83775305|dbj|BAE65427.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|220688446|gb|EED44799.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus flavus NRRL3357]
 gi|391868531|gb|EIT77745.1| RNA polymerase II transcription initiation protein [Aspergillus
           oryzae 3.042]
          Length = 480

 Score =  237 bits (604), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 153/439 (34%), Positives = 222/439 (50%), Gaps = 68/439 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-----------RN 50
           L ++AK +V+ LL+++ P+P   + +W+     KE   A   L  L            R 
Sbjct: 40  LPDLAKCFVMALLYLKDPLPATDLEAWVKAESLKERDNALSILGRLHILSNTVITDNVRA 99

Query: 51  AKV----------ALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
            KV          AL G  K  S      + D  A  +  L++YA  +WE +L +MVG+ 
Sbjct: 100 YKVTDPFASSLRQALTGAEKTQSFGVLWHISDGAAVSIAELDEYARRQWEGVLGYMVGTS 159

Query: 100 QTEGISADA-----VRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
              GI  DA     V+ L   G    +      IT+DGF F+L +   QVW+ ++ Y+E+
Sbjct: 160 GL-GIQRDANLSKGVKQLLQAGHLVEIRDRRVEITQDGFAFVLQDVSTQVWHILMLYVES 218

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY 208
            ++ G+  VE L+F+F L     G  Y  + ++      L  L +FG+VYQ    A  FY
Sbjct: 219 ADAIGMSSVEVLSFVFLLSSLELGKAYDKKHLTSNQLRTLTDLADFGIVYQDSPDASHFY 278

Query: 209 PTKLALNMAT----------------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALI 252
           PT+LA  + +                 GT       GF+IVETN+R+YAYT S L+++LI
Sbjct: 279 PTRLATTLTSDSSALSNPISGSLSGPSGTGSNKAGSGFIIVETNYRLYAYTSSPLQISLI 338

Query: 253 GLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------ 300
            LF  L YRFPNL  G +TR SVR A+  GITA QII +L  HA P +            
Sbjct: 339 ALFTTLKYRFPNLITGKITRQSVRRAIEMGITADQIISYLATHAHPQMRKHNVSRSTSNQ 398

Query: 301 -SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
              P   LPP V+DQIRLW+ ER+R+  + G L+  FVS A++E    YA+++GVL W++
Sbjct: 399 AGIPPSVLPPTVTDQIRLWQLERDRVKATPGFLFKDFVSLAEYEAPCRYAEEIGVLVWKS 458

Query: 357 EKKRTVVVTKQGHSDIKKF 375
           ++KR   VT+  H  +  F
Sbjct: 459 DRKRMFFVTR--HEQVAAF 475


>gi|406604717|emb|CCH43852.1| RNA polymerase II transcription factor B subunit 2 [Wickerhamomyces
           ciferrii]
          Length = 488

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 143/441 (32%), Positives = 223/441 (50%), Gaps = 67/441 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK +++ ++F E+P       SW+      +  EA  ++  L              
Sbjct: 39  LPGLAKFFIMSMVFNEKPQAVRDFDSWVKPNGKMDLVEALRSIKSLHLLKEANGIAIMNP 98

Query: 48  --RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQQTEGI 104
             R++ +  L G     S     + D   +  ++FL+QYA  +WE IL FMVG++  +  
Sbjct: 99  VFRKSFRNVLTGNEIGNSFGVLCDTDDTHKVSVEFLDQYAANKWETILHFMVGTELEQSP 158

Query: 105 SADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVE 158
           S   +  L + G       ++ +IT +GFQFLL +  AQ+W  +L+YL   ES  +D V+
Sbjct: 159 SPGVLSLLTHSGLMEGKSVKDMLITNEGFQFLLQDVNAQLWTLLLQYLRMAESLQMDPVD 218

Query: 159 CLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL------ 212
            L F+F L     G DYS   +S+     L  LR++GL+YQRK  + RFYPT+L      
Sbjct: 219 VLNFIFMLGSLELGKDYSLSALSETQVNMLGDLRDYGLIYQRKSTSRRFYPTRLATTLTS 278

Query: 213 ----------ALNMATRG------TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFC 256
                     A+N           T   I   G +I+ETNF+VYAYT+S L++A++ LF 
Sbjct: 279 DTTSLRSASSAMNKVIENAKDSAYTPTNIENSGTIIIETNFKVYAYTNSPLQIAILNLFV 338

Query: 257 ELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCP------------- 303
            L  RF NL  G +TR+S+R AL SGIT+ QII +L  HA P +                
Sbjct: 339 HLKARFSNLVTGQITRESIRKALVSGITSEQIISYLESHAHPQLRRAAEEELNKKNGFES 398

Query: 304 ---------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
                    L P ++DQI+LW+ E +R+ + +G L++ F S  +++ L  Y++++GVL W
Sbjct: 399 NNHGEKIQILQPTIADQIKLWQLELDRIMSFDGYLFTDFSSDQEYQVLSKYSEEIGVLLW 458

Query: 355 QNEKKRTVVVTKQGHSDIKKF 375
            +  K+   VTK+G+S +  F
Sbjct: 459 NDSSKKKFFVTKEGNSQVIAF 479


>gi|324512881|gb|ADY45318.1| General transcription factor IIH subunit 4 [Ascaris suum]
          Length = 481

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 152/440 (34%), Positives = 241/440 (54%), Gaps = 67/440 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAK---------------EHKEATDTLTE 46
           L ++++  ++++L++++ V  ++  SW+   H+                E KE   TL  
Sbjct: 37  LPDVSQQCILKVLWLQEGVQPSLWQSWVKHEHSSLVENHFDLLRRLGIIESKEQI-TLNP 95

Query: 47  LRRNA--KVALLGGGKPWSMSAKLEVDSKARDL--DFLNQYALERWECILRFM-VGSQQT 101
           + R +  +   +G  +   + A  ++D K+R      L + A ERWECIL ++ + SQ++
Sbjct: 96  IFRKSYIRAVQMGLYRASQLKAMTDLDEKSRKSASKDLGKKATERWECILHYLALPSQKS 155

Query: 102 EGISADAVRTLF--------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
           E   + A + LF         E   +  IT  GFQFLLL    Q+W +IL YL   ES G
Sbjct: 156 EQGVSGATKQLFRAAGLTSGGESEGDMEITSAGFQFLLLNQTEQIWMYILHYLRLEESMG 215

Query: 154 LDLVECLTFLFQL---------------------KFSTFGTDYSTEGMSDKLQVFLQHLR 192
            +++  L FL +L                     K +T+   +  E  S+ +  FL HLR
Sbjct: 216 KNVMAELDFLLKLTLCVDHTHTNGRLRVVDGDREKKATWRAFFIDESWSETITNFLMHLR 275

Query: 193 EFGLVYQRKRKAGRFYPT----KLALNMATRGTLKQIR-EPGFLIVETNFRVYAYTDSNL 247
           E GLV+ RKRK G F+ T     L ++ ++   + + R   G++IVETN+RVYAYTDS+L
Sbjct: 276 ELGLVFIRKRKDGYFFITPLFAHLTVSSSSETAVAEKRTHSGYIIVETNYRVYAYTDSSL 335

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------- 299
           ++A++  F E++YRF +++VG+LTRDSVR AL+ GITA+QII FLR +A P         
Sbjct: 336 QLAILSTFTEMLYRFNDMSVGMLTRDSVRRALQVGITASQIISFLRANAHPETVAAANAS 395

Query: 300 ---VSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
              + C +P  V+DQIRLWE ER RL   +  LYS F S+ ++  +++Y +   +L W +
Sbjct: 396 PGVIHC-VPVTVADQIRLWEDERHRLVFFDSALYSTFESEREYVGVKEYTRSQDILLWFD 454

Query: 357 EKKRTVVVTKQGHSDIKKFW 376
           + +R VVVT++GH  +K +W
Sbjct: 455 DVQRLVVVTEEGHDMVKAWW 474


>gi|393910938|gb|EFO26137.2| transcription factor Tfb2 family protein [Loa loa]
          Length = 486

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 147/377 (38%), Positives = 213/377 (56%), Gaps = 50/377 (13%)

Query: 48  RRNAKVALLGGGKPWSMSAKLEVDSKAR---DLDFLNQYALERWECILRFM-VGSQQTE- 102
           R   K   +G  K   +SA  + D K R   + D L + A+ERWECIL ++ + SQ++E 
Sbjct: 103 RSYLKATRIGLCKASELSAVTDCDEKTRKSANKD-LGKKAVERWECILHYLALPSQKSEQ 161

Query: 103 GISADAVRTLFNEG---AENP----VITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
           G+S    R   + G   +EN      IT  GFQFLLL    Q+W ++L Y    E  G+D
Sbjct: 162 GVSGTTKRLFRSAGLTSSENDDGDIEITSAGFQFLLLNRSEQIWTYLLYYFHMQEVAGID 221

Query: 156 LVECLTFLFQLKF---STFGTDYSTEGMSDK------------------LQVFLQHLREF 194
           +++ L FLF+L      T G    T G SDK                   + FL HLRE 
Sbjct: 222 IIKELDFLFRLTLYSGDTSGGTRVTNG-SDKERPAGSRAFLIDECWPEPTKDFLMHLREL 280

Query: 195 GLVYQRKRKAGRFYPTKL-----ALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKV 249
           GLV+ RKRK G F+ T L     +++ A+   ++   + GF+IVETN+RVYAYTDSNL++
Sbjct: 281 GLVFIRKRKDGFFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNYRVYAYTDSNLQL 340

Query: 250 ALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC------- 302
           A++  F E++YRF +++VGVL+R++VR A + GITAAQII FLR +A P+          
Sbjct: 341 AILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAHPTTYAVTTEYGG 400

Query: 303 ---PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
               +P  V+DQIRLWE ER RL      +YS F S+ ++  L+ Y    G+L W ++ +
Sbjct: 401 MIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVSSQGILLWCDDVQ 460

Query: 360 RTVVVTKQGHSDIKKFW 376
           R +V+T++GH  +K +W
Sbjct: 461 RLMVITEEGHESVKSWW 477


>gi|393910939|gb|EJD76097.1| transcription factor Tfb2 family protein, variant [Loa loa]
          Length = 389

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 147/377 (38%), Positives = 213/377 (56%), Gaps = 50/377 (13%)

Query: 48  RRNAKVALLGGGKPWSMSAKLEVDSKAR---DLDFLNQYALERWECILRFM-VGSQQTE- 102
           R   K   +G  K   +SA  + D K R   + D L + A+ERWECIL ++ + SQ++E 
Sbjct: 6   RSYLKATRIGLCKASELSAVTDCDEKTRKSANKD-LGKKAVERWECILHYLALPSQKSEQ 64

Query: 103 GISADAVRTLFNEG---AENP----VITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
           G+S    R   + G   +EN      IT  GFQFLLL    Q+W ++L Y    E  G+D
Sbjct: 65  GVSGTTKRLFRSAGLTSSENDDGDIEITSAGFQFLLLNRSEQIWTYLLYYFHMQEVAGID 124

Query: 156 LVECLTFLFQLKF---STFGTDYSTEGMSDK------------------LQVFLQHLREF 194
           +++ L FLF+L      T G    T G SDK                   + FL HLRE 
Sbjct: 125 IIKELDFLFRLTLYSGDTSGGTRVTNG-SDKERPAGSRAFLIDECWPEPTKDFLMHLREL 183

Query: 195 GLVYQRKRKAGRFYPTKL-----ALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKV 249
           GLV+ RKRK G F+ T L     +++ A+   ++   + GF+IVETN+RVYAYTDSNL++
Sbjct: 184 GLVFIRKRKDGFFFLTPLLNHLTSISYASETGIENRNQNGFVIVETNYRVYAYTDSNLQL 243

Query: 250 ALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC------- 302
           A++  F E++YRF +++VGVL+R++VR A + GITAAQII FLR +A P+          
Sbjct: 244 AILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITAAQIIAFLRANAHPTTYAVTTEYGG 303

Query: 303 ---PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
               +P  V+DQIRLWE ER RL      +YS F S+ ++  L+ Y    G+L W ++ +
Sbjct: 304 MIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFESEKEYFGLKGYVSSQGILLWCDDVQ 363

Query: 360 RTVVVTKQGHSDIKKFW 376
           R +V+T++GH  +K +W
Sbjct: 364 RLMVITEEGHESVKSWW 380


>gi|344303892|gb|EGW34141.1| RNA polymerase II transcription factor B subunit 2 [Spathaspora
           passalidarum NRRL Y-27907]
          Length = 519

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 153/471 (32%), Positives = 222/471 (47%), Gaps = 98/471 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           LS +AK +++ ++F E+PV    +  W   +  K   EA   L  L              
Sbjct: 40  LSPMAKFFIMSMIFNEKPVALRDLDKWCMPSARKLEYEALKQLESLHLIEYDSKASHIRL 99

Query: 48  ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R+N +  L G   P +  S    VD    D+ FL+ +A  +WE IL FMVG++ T 
Sbjct: 100 HSTFRQNFRDCLTGSQDPTAFGSISTTVDKHQVDVAFLDSFASSKWETILHFMVGTEST- 158

Query: 103 GISADAVRTLF---------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
              +++V +L            GA N  IT  GFQFLL +  AQ+W  +L+YL   +   
Sbjct: 159 ATPSNSVLSLLRLGGLMEGPGRGASNLKITNTGFQFLLQDVNAQIWTLLLQYLNLTQELN 218

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
           +D V+ L F+F L     G  Y+   +S+     L  L+++GLVY+R   + RFYPT+LA
Sbjct: 219 MDPVDVLNFIFVLGSLELGKSYAVSSLSETQVSMLADLKDYGLVYRRTDTSSRFYPTRLA 278

Query: 214 LNM-----------------------ATRGTLKQIREPGFL----IVETNFRVYAYTDSN 246
             +                        T   + Q   PG +    I+ETNF++YAYT+S 
Sbjct: 279 TTLTSDSSALKTPAMAVEQALDSVEGGTDNEVSQTAPPGNIQGTVIIETNFKLYAYTNSP 338

Query: 247 LKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------ 300
           L++A++ LF  L  RF N+  G +TR+S+R AL +GITA QII FL  HA P +      
Sbjct: 339 LEIAILNLFVHLRTRFANMVCGQITRESIRNALYNGITADQIIRFLETHAHPQMKVLAKE 398

Query: 301 ------------------SCP------------------LPPVVSDQIRLWEGERERLTT 324
                               P                  LPP V DQI+LW+ E +R+ T
Sbjct: 399 KLDKKIEFDTSHNINTAGGAPQSQALRADGSVAQHKLEILPPNVVDQIKLWQLELDRIQT 458

Query: 325 SEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            +G L+  F +Q +F+KL  YA +LGVL W ++ KR   VTK G S +  F
Sbjct: 459 FDGYLFKDFPNQQEFDKLSSYASELGVLIWSDKIKRKFFVTKDGMSQVADF 509


>gi|224077128|ref|XP_002305144.1| predicted protein [Populus trichocarpa]
 gi|222848108|gb|EEE85655.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 140/409 (34%), Positives = 227/409 (55%), Gaps = 39/409 (9%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK YV+++LF + PV   ++  W+      +H+ A D L +LR             
Sbjct: 40  LPPLAKKYVVQILFFDGPVSGKLLEQWVLADGVTKHRVAIDRLIQLRIFIEVSDKKRESS 99

Query: 49  --------RNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    N +  L  GG     +M+A +++ S    L+ L+ YALE+WEC L  ++ S
Sbjct: 100 YKLNQTFQANLRKHLTNGGVLPRETMAAVVKLPS----LEELDTYALEQWECFLLLLISS 155

Query: 99  QQTEGISA--DAVRTLFNEGA------ENPVITKDGFQFLLLETPAQVWYFILKYLETVE 150
            Q E  ++   ++  +F  G       + P +T+ GFQFLL++T AQ+WY I +Y+   E
Sbjct: 156 GQAEKPTSLSSSMMKIFQRGLLSQRDRDAPRLTEGGFQFLLMDTNAQLWYIIREYITNSE 215

Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYP 209
            +G +  + ++FL +L F   G  Y+   +++  +  ++ L E GLV  Q+ RK   F P
Sbjct: 216 ERGTEPADLISFLLELSFHVTGEAYNMNTLTEIQRNTIKDLAELGLVKLQQGRKESWFIP 275

Query: 210 TKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           TKLA N++   T    R+ G+++VETNFR+YAY+ S L   ++ LF ++ Y+ PNL VG 
Sbjct: 276 TKLATNLSVSLTDSSSRKQGYVVVETNFRLYAYSSSKLHCEILRLFSKIEYQLPNLIVGA 335

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSE 326
           +T++S+  A  +GIT+ QII FL+ +A P V+  LP V   V+DQIRLWE +  R+  + 
Sbjct: 336 ITKESLYTAFENGITSDQIISFLQQNAHPRVAERLPSVPENVTDQIRLWEADLNRVEITP 395

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
              Y +F S+  FE   D+A++   L W++ KK  VVV  + H ++++F
Sbjct: 396 SHFYDEFPSRDTFEAACDFAREWNGLLWEDSKKMRVVVKAEIHMNMREF 444


>gi|388581217|gb|EIM21527.1| transcription factor Tfb2 [Wallemia sebi CBS 633.66]
          Length = 449

 Score =  234 bits (598), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 143/373 (38%), Positives = 214/373 (57%), Gaps = 31/373 (8%)

Query: 36  EHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFM 95
           ++   + TL E  +++   +L G +  S+S     +S    ++ L+ +A ++WE IL +M
Sbjct: 76  DNSNGSITLNEQFKHSLKQILIGREIESLS-----NSITTTIEQLDTFATQQWELILHYM 130

Query: 96  VGSQQTEGISADAVRTLFNEGAENPV--------ITKDGFQFLLLETPAQVWYFILKYLE 147
           VGS      S  +   LF     N +        I+  GFQFLL +   Q+W  +L+YL 
Sbjct: 131 VGSYN----STPSKGVLFLLEKSNLIYTANKQHRISSKGFQFLLEDVQVQLWQLLLEYLS 186

Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
             E + +DLVE L+FLF L     G DYS E +++  Q  L  LR++GLV+QRK  + RF
Sbjct: 187 LSEIRQMDLVEVLSFLFVLGTLELGKDYSIESLTNTQQAMLSDLRDYGLVWQRKTSSKRF 246

Query: 208 YPTKLALNMATRG-----------TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFC 256
           YPT+LA  + +                      F+I+ETN+R+YAYT + L+++++ LF 
Sbjct: 247 YPTRLATTLTSSAPPLLPTTESSSFTTSSDNKRFIILETNYRLYAYTSNPLQISILNLFV 306

Query: 257 ELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIR 313
            L  R+PNL +GV+TRDS+R+AL +GITA QIIG+L  HA   +  + PL PV VSDQIR
Sbjct: 307 TLKARYPNLVIGVITRDSIRSALSNGITAEQIIGYLTSHAHTQMHRNNPLLPVTVSDQIR 366

Query: 314 LWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIK 373
           LWE E+ RL   +GVLY++F SQ D+E L +YA+    + W N+ KR   VT +GH  ++
Sbjct: 367 LWELEKNRLKADDGVLYAEFRSQPDYEILLNYAKSYDCVLWSNDIKRMFFVTLEGHQIVR 426

Query: 374 KFWVNHQKGSQHS 386
           +F      G+ +S
Sbjct: 427 EFVRRRIPGNNNS 439


>gi|268534706|ref|XP_002632484.1| Hypothetical protein CBG13719 [Caenorhabditis briggsae]
          Length = 482

 Score =  234 bits (598), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 141/355 (39%), Positives = 205/355 (57%), Gaps = 30/355 (8%)

Query: 54  ALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQTEGISADAVRT 111
           A+LG     S+   LE + + R    + + A+ERW+CILR++     + TE +S D  + 
Sbjct: 121 AMLGAASISSLV--LETNDEKRRGKDVEKKAVERWDCILRYLALPSEENTEAVS-DTTKD 177

Query: 112 LFNE-----GAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQL 166
           LF +     G     IT  GFQFLLL    Q+W ++++YL+   SKG D+VE +  L Q+
Sbjct: 178 LFRKANFTSGESRIEITTFGFQFLLLSPVKQMWTYVIEYLKLEISKGNDIVEVIEPLIQI 237

Query: 167 ------KFSTFGTDYST-EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
                  F      Y   E  +      L HLRE G+++ RKRK G F+ T L  ++AT 
Sbjct: 238 VLLANRGFKAEKECYQIDEQWTQPQHDLLNHLRELGVIFIRKRKDGVFFLTHLLTHLATN 297

Query: 220 GTLKQIREP----GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
            T+          G +IVETNFRVYAYT S L++A+I LF E+ YRF +++VG++TR+SV
Sbjct: 298 ETIDDTSSEKASNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFTDMSVGMITRESV 357

Query: 276 RAALRSGITAAQIIGFLRLHALPS-------VSCPLPPVVSDQIRLWEGERERLTTSEGV 328
           R AL+ GITAAQII FLR +A P        V+C LP  V+DQIRLWE ER R+   +  
Sbjct: 358 RGALQHGITAAQIISFLRANAHPQCVATSGPVNC-LPITVADQIRLWEDERRRMDLKDAY 416

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           +YS F S  ++  +  YAQ+ G+L W N +++ V+V ++GH  +++ W    KG+
Sbjct: 417 IYSHFESDDEYHGVVRYAQERGILLWANPQQKLVIVNEEGHEAVRQ-WYKRSKGA 470


>gi|403217068|emb|CCK71563.1| hypothetical protein KNAG_0H01490 [Kazachstania naganishii CBS
           8797]
          Length = 519

 Score =  234 bits (597), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 143/467 (30%), Positives = 231/467 (49%), Gaps = 96/467 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L ++AK +++ ++F +  V    +  W+      + +EA  ++  L              
Sbjct: 41  LPQLAKFFIMSMVFNDNDVSLRDLDRWVKANGKIQFQEAIKSMKSLHLIIPTKGAGQLMI 100

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVDSKAR----DLDFLNQYALERWECILRFMVG 97
                 R + + AL GG    S    +  D   R    ++  L++Y+  +WE IL FMVG
Sbjct: 101 GLNPTFRDSFRNALTGGEVSNSFGTVVNDDEDLRTGVLNMRLLDEYSANKWETILHFMVG 160

Query: 98  SQQTEGISADAVRTLFN-------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVE 150
           +  +E  S++ +  L +       E      IT +GFQFLL E  +Q+W  +L+YL+  E
Sbjct: 161 TPMSEVPSSNVLNLLKHSKLMEESEATGAFKITNEGFQFLLQEINSQIWTLLLQYLKMSE 220

Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
           S  +D V+ L F+F L     G  YSTEG+S+  ++ ++ +R++G+V+Q+   +  FYPT
Sbjct: 221 SLQMDAVDVLNFIFMLGALEVGKAYSTEGLSETQRIMIKDMRDYGIVFQKNSTSNVFYPT 280

Query: 211 KLALNMATR--------------------------------------------GTLKQIR 226
            LA+ + +                                             GT  Q  
Sbjct: 281 SLAIMLTSDSKTVLRTASGAMNTVLSQNKEDTSSTNKSATDKANNDTAELDQVGTQSQEV 340

Query: 227 EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAA 286
           + G LIVETNF+VY+Y++S L++A++ LF  L  RF N+  G +T+ S+  AL +GITA 
Sbjct: 341 QDGVLIVETNFKVYSYSNSPLQIAILSLFVHLKSRFINMVTGQITKRSIINALDNGITAE 400

Query: 287 QIIGFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTS 325
           QII +L  HA P +                +C      LPP V DQI+LW+ E++R+   
Sbjct: 401 QIIAYLETHAHPQMRRLAEAQLEKKRELDQNCKDPLQILPPTVVDQIKLWQLEKDRVMAF 460

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
           EG LYS F S A++  L+ YAQD+GV+ W  +KKR + V+++G+S +
Sbjct: 461 EGSLYSDFESNAEYNILKKYAQDIGVMLWSEDKKRKLFVSQEGNSQV 507


>gi|356521347|ref|XP_003529318.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
           max]
          Length = 451

 Score =  234 bits (596), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 136/404 (33%), Positives = 218/404 (53%), Gaps = 33/404 (8%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
           +AK YVI++L ++ PV   ++  W+      +H+ A D L +LR                
Sbjct: 43  LAKKYVIQMLHIDVPVAAKLLEEWVLPAGVSKHRVAIDRLVQLRVFLEAVDRKNEKTYKV 102

Query: 49  -----RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE- 102
                R+ +  L+ GG     S    +  +   L+ L  YALE+WEC L  ++   Q E 
Sbjct: 103 NPTFQRSLQKLLVQGGTLPRESMPSNITVRLPTLENLEAYALEQWECFLLQLISPSQVEK 162

Query: 103 --GISADAV-----RTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
              IS+  +     R L +   E P +T+ GFQFLL++T AQ+WY I +Y+   E +G+D
Sbjct: 163 PLNISSSLMKVFQRRILSHRDKEAPKLTESGFQFLLMDTNAQLWYIIREYISNSEDRGVD 222

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLAL 214
             + ++F+ +L F   G  YS + ++   +  +  L + GLV  Q+ RK   F PTKLA 
Sbjct: 223 AADLISFMLELSFHVIGEAYSVDTLTSFQRTIINDLADLGLVKIQQGRKGSWFIPTKLAT 282

Query: 215 NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
           N++        R+ GF++VETNFRVYAY+ S L   ++ LF  + Y+ PNL VG +T++S
Sbjct: 283 NLSMSLADSSSRKQGFVVVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAITKES 342

Query: 275 VRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLYS 331
           + +A  +GITA QI+ FL+ +A P V+  +P V   V++QIRLWE +  R+  ++   Y 
Sbjct: 343 LYSAYENGITAEQIVTFLQQNAHPRVAERVPSVPENVTEQIRLWEADLNRVEMTDTYYYD 402

Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           +F S+  FE   D A++   L W++ KK  +VV  + H  ++ +
Sbjct: 403 EFPSRDVFEGACDCAREWNGLLWEDSKKMHMVVKTEVHPYVRDY 446


>gi|296422658|ref|XP_002840876.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637102|emb|CAZ85067.1| unnamed protein product [Tuber melanosporum]
          Length = 460

 Score =  234 bits (596), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 143/408 (35%), Positives = 217/408 (53%), Gaps = 47/408 (11%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRR-------NAKVA 54
           L  +AK  V  LL+ E PV    +    +    + ++E  D  ++LRR       N  V+
Sbjct: 37  LPNLAKTIVTTLLYNETPV---TLGDLEALVRPESYREREDAFSKLRRLHIITENNLDVS 93

Query: 55  L----------LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGI 104
           L           G    + +    E D K   + FL+ YA  ++E IL +MVG+   E  
Sbjct: 94  LDPVFKKNFRLAGDHHSFGVPCNAE-DKKKITVQFLDDYATRQFEAILHYMVGTHN-EVK 151

Query: 105 SADAVRTLFNEGA--------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
            +  V TL  +G          +  ITK+GF FLL +   Q+W  +++YLE  E  G++ 
Sbjct: 152 PSKGVITLLTKGGLMERYGAGSSATITKNGFSFLLQDGNPQIWALLIQYLEMSEELGMEQ 211

Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
            + L FL  L     G  YS   +++  ++ L  LR++G+VYQRK  + RFYPT+LA  +
Sbjct: 212 TDVLHFLLMLGSLELGQAYSVNTLTETQKLMLADLRDYGIVYQRKSSSDRFYPTRLATTL 271

Query: 217 ATR-GTLKQIRE-------------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
            +  G L+                  GF+I+ETN+RVYAYTDS L++A++ LF +L  R+
Sbjct: 272 TSESGGLRSASASMSSAMAKDAEEGKGFIILETNYRVYAYTDSPLQIAVLNLFVKLSTRY 331

Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVS---CPLPPVVSDQIRLWEGER 319
           PNL  G ++R S++ A++ GITA Q+I +L  HA P +      LPP V DQIRLW+ E 
Sbjct: 332 PNLVSGRISRRSIQEAIKMGITADQVIDYLSAHAHPQMRKSLVTLPPTVVDQIRLWQIEG 391

Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
           ER+ T+ G L+  F S  +FE++  YA++LGVL W+   +R + VT+ 
Sbjct: 392 ERMRTTTGFLFKDFHSTQEFEEVAKYAEELGVLKWKKPARRCMFVTRH 439


>gi|241958748|ref|XP_002422093.1| TFIIH subunit, putative; tanscription initiation factor IIB,
           putative [Candida dubliniensis CD36]
 gi|223645438|emb|CAX40094.1| TFIIH subunit, putative [Candida dubliniensis CD36]
          Length = 494

 Score =  233 bits (595), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 151/450 (33%), Positives = 224/450 (49%), Gaps = 77/450 (17%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           LS +AK Y++ ++F E+P+    +  W   +  K   EA   L  L              
Sbjct: 39  LSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLESLHLIEYDSKGSHIRL 98

Query: 48  ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R+N +  L G   P +  S    VD    D+ FL+ +A ++WE IL FMVG++ T 
Sbjct: 99  HSIFRKNFRDCLTGSQNPNAFGSISTTVDKHKVDIPFLDLFASQKWETILHFMVGTEST- 157

Query: 103 GISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
              +D+V +L       EG  N + IT  GFQFLL +  AQ+W  +L+YL   +   +D 
Sbjct: 158 ATPSDSVLSLLKLGGLMEGPNNDLRITNSGFQFLLQDVNAQIWTLLLQYLNLTQELNMDP 217

Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL-- 214
           V+ L F+F L     G  YS   +S+     L  L+++GLVYQR   +GRFYPT+LA   
Sbjct: 218 VDVLNFIFILGSLELGKSYSVSSLSETQVSMLADLKDYGLVYQRSDTSGRFYPTRLATTL 277

Query: 215 ----------NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
                     +MA     +Q+     +I+ETNF++YAYT S L++A++ LF     RF N
Sbjct: 278 TSDSAALKTPSMAMDEEEQQVVSKESIIIETNFKIYAYTKSPLEIAILNLFVHFKTRFAN 337

Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------------------ 300
           +  G +TR+S+R AL +GITA QII FL  HA P +                        
Sbjct: 338 MVCGQITRESIRNALYNGITADQIIKFLETHAHPQMRILAKEKLDKKIEFDTSHNINTAG 397

Query: 301 SCP---------------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
             P               LPP V DQI+LW+ E +R+ T +G L+  F +Q +++ L +Y
Sbjct: 398 GAPQSKTDGMISQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNY 457

Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           A +LGVL W ++ K+   VTK+G + +  F
Sbjct: 458 ASELGVLIWADKVKKKFFVTKEGMTQVADF 487


>gi|297800414|ref|XP_002868091.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313927|gb|EFH44350.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 136/418 (32%), Positives = 230/418 (55%), Gaps = 43/418 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK YV+++L++  PVP A++  W+    A +H+ A D L +LR             
Sbjct: 40  LPPLAKKYVLQMLYINVPVPAAMMEEWVLADGASKHRVAIDRLIQLRIFSETSDRKRGIS 99

Query: 49  --------RNAKVALLGGG----KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV 96
                    N +  ++ GG    +P      +++ S    L  L  YAL++WEC L  ++
Sbjct: 100 YSLNPTFQNNLQKHIISGGVLPREPMHSDNAIKLPS----LQELETYALKQWECFLLQLI 155

Query: 97  GSQQTE---GISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLE 147
            S Q E   GIS+  +R +F  G       + P +T+ GFQFLL++T AQ+WY I +Y+ 
Sbjct: 156 NSGQGEKLTGISSSMMR-IFQRGLLSQRDKDGPRLTESGFQFLLMDTNAQLWYIIREYIS 214

Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGR 206
             E + ++  + ++FL +L F   G  Y++  +++     L+ L + GLV  Q+ RK   
Sbjct: 215 NAEERDVEPADLISFLLELSFHVTGEAYNSNTLTEVQNNTLKDLADLGLVKLQQGRKDSW 274

Query: 207 FYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT 266
           F PTKLA N++        R+ GF+++ETNFR+YAY+ S L+  ++ LF  + Y+ PNL 
Sbjct: 275 FIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLI 334

Query: 267 VGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLT 323
              +T++S+  A  +GIT+ QII FL+ ++ P  +  +P +   V+DQIRLWE + +R+ 
Sbjct: 335 ACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLKRIE 394

Query: 324 TSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
            ++   Y +F S+  FE   D+A++ G L W++ K+  +VV  + H+ +++F  N  K
Sbjct: 395 MTQAHFYDEFPSKDVFEAACDFAREWGGLLWEDSKRMRLVVKSEVHNQMREFLHNQTK 452


>gi|147854406|emb|CAN81292.1| hypothetical protein VITISV_005315 [Vitis vinifera]
          Length = 451

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 137/407 (33%), Positives = 222/407 (54%), Gaps = 33/407 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK Y++++L+++ PV    +  W+      +H+ A D L +LR             
Sbjct: 40  LPPLAKKYILQMLYIDVPVTAKAMEEWVLADGFSKHRVAIDRLIQLRVFTETSDRKKETS 99

Query: 49  --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                    N +  L+ GG          +  +   LD L  YAL +WEC L  ++ S Q
Sbjct: 100 YRLNPTFQTNLQKHLIYGGVLPREPMPSNITVRLPSLDDLEAYALGQWECFLLQLISSTQ 159

Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           TE ++  + ++  +F  G       E P +T+ GFQFLL++T AQ+WY + +Y+   E +
Sbjct: 160 TEKLTNFSSSLMKVFQRGLLTQREKEAPRLTESGFQFLLMDTNAQLWYIMREYISNSEER 219

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTK 211
           G+D  + ++FL +L F   G  Y+   +++  +  ++ L + GLV  Q+ RK   F PTK
Sbjct: 220 GVDPADLISFLLELSFHVTGEAYNINTLTEFQRNTIKDLVDLGLVKLQQGRKESWFIPTK 279

Query: 212 LALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
           LA N++   +    R+ GF++VETNFR+YAY+ S L   ++ LF  + Y+ PNL VG +T
Sbjct: 280 LATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRVEYQLPNLIVGAIT 339

Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGV 328
           ++S+  A  +GITA QII FL+ +A P V+   P V   V+DQIRLWE +  R+ T    
Sbjct: 340 KESLYNAFENGITAEQIISFLQQNAHPRVAERTPAVPENVTDQIRLWETDLNRVETMPSH 399

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           LY +F S+  FE   D+A++ G L W++ KK  +VV  + H  ++++
Sbjct: 400 LYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRLVVKAEIHLHMREY 446


>gi|347830304|emb|CCD46001.1| similar to RNA polymerase II transcription factor B subunit 2
           [Botryotinia fuckeliana]
          Length = 471

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 143/421 (33%), Positives = 222/421 (52%), Gaps = 57/421 (13%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK +V+ LL++ +P+P   +  W+     K+  +A   L+ L              
Sbjct: 38  LPHLAKTFVMALLYMPKPLPLIALDQWVQSKSKKQKDQALSLLSRLHIVDITALSREDPQ 97

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEVDSKAR--DLDFLNQYALERWECILRFMV- 96
                     + ++AL GGG   S       D  A   D+DFL+ +A  +WE IL +MV 
Sbjct: 98  TVALTKNFGASLRLALTGGGNHQSFGVP-SSDHIAPHVDIDFLDTHARMQWEGILHYMVN 156

Query: 97  ----GSQQTEGISADAVRTLFNEG-----AENPVITKDGFQFLLLETPAQVWYFILKYLE 147
               GS +     A++V+ L + G          IT+ GF FLL E  AQVW  +L ++E
Sbjct: 157 TVTSGSGKDGNGPANSVKALLDAGKLVTRGRGIGITQAGFSFLLQEANAQVWTLLLLWIE 216

Query: 148 TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRF 207
             ES G+D V+ L+FLF L     G  YST+ +++  +  L +L + GL+Y       +F
Sbjct: 217 NAESMGMDSVDVLSFLFMLGSLELGRAYSTKTLTEAQKGMLANLIDLGLIYLPPSAPTQF 276

Query: 208 YPTKLALNMATRGT-LKQI------------REPGFLIVETNFRVYAYTDSNLKVALIGL 254
           +PT+LA  + +  + L+ +             + GF+I+ETN+R+YAYT+S L++A++ L
Sbjct: 277 FPTRLATTLTSDASALRTVAAGFDAASKSAASQKGFIIIETNYRLYAYTNSPLQIAVLSL 336

Query: 255 FCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------SVSCP-LP 305
           F +L  R+PN+  G ++RDS+R A+  GIT+ QII +L  HA P        S   P LP
Sbjct: 337 FTKLNTRYPNMVSGRVSRDSIRTAIAHGITSDQIITYLSTHAHPQLVKASSASHGGPVLP 396

Query: 306 PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
           P V DQIRLW+ E ER+    G L   F +Q ++E    YA+++GVL W+++ KR   VT
Sbjct: 397 PTVVDQIRLWQLENERMKAVPGFLMKDFETQKEYEGCAKYAEEVGVLVWKSDAKRMFFVT 456

Query: 366 K 366
           +
Sbjct: 457 R 457


>gi|328873270|gb|EGG21637.1| general transcription factor IIH [Dictyostelium fasciculatum]
          Length = 453

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 145/413 (35%), Positives = 224/413 (54%), Gaps = 44/413 (10%)

Query: 6   AKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR-----RNAKVALLGGGK 60
           AK Y++RLL V+    Q+V   W +++  + HK+A   L +L+      N K  +     
Sbjct: 36  AKQYLMRLLVVDSISMQSVY-QWTTQSSIQAHKDAIRKLLDLKILTRVPNPKGVVSQDQL 94

Query: 61  PWSMSAKLEVDSKARDLDF-----------------------LNQYALERWECILRFMV- 96
             +   K  +     DLD                        L++Y+  +WE +L ++  
Sbjct: 95  FLNQRFKDNIKDSLTDLDSVVLKSVKDNQSAQQHIKPMTVEELDKYSKGQWEKVLYYLSD 154

Query: 97  -GSQQTEGISADAVRT-LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
            G    E +S   + + L  +   +  IT +GF+F+L +   Q+W  I+ YL T+ES+G 
Sbjct: 155 EGKAPPELVSDLLLSSNLTKKNGTSIAITSEGFKFMLKDIYTQIWTLIIVYLNTLESRGR 214

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
              E L FLF+L F T G  Y    ++   +  L  L+EFGLVY R  K+  FYPT+LA+
Sbjct: 215 PRKEALIFLFKLSFLTLGKSYYLADLNPNQRDMLFDLKEFGLVYVRSEKSDVFYPTRLAI 274

Query: 215 NMATRGTLKQI-----------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
           ++AT  T+  +           ++ G+LI+ETNFR+YAYT S+L+++LI LF +++YR P
Sbjct: 275 SLATGRTVSLMNDLAQEISTSQKDQGYLILETNFRIYAYTQSSLQISLISLFVKMLYRLP 334

Query: 264 NLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-VSCPLPPVVSDQIRLWEGERERL 322
           NL VG+LTR+SVR+AL  GITA QI+ F++ +A P+ V    P VV +QIRLWE ER R+
Sbjct: 335 NLAVGILTRESVRSALLHGITADQIVDFVKQNAHPNMVISGYPEVVFEQIRLWESERNRI 394

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           T  + VL+  F +   F K   +A+D   L W +E K+ +VV  +G   I+ +
Sbjct: 395 TYKKAVLFDSFPNAESFSKTVTFAKDQYFLVWFDETKKMLVVNDEGSEPIRNY 447


>gi|225555033|gb|EEH03326.1| RNA pol II transcription initiation subunit [Ajellomyces capsulatus
           G186AR]
          Length = 485

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 151/434 (34%), Positives = 215/434 (49%), Gaps = 68/434 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK +V+ LL+++ P+P A +  W+     +E   A   L+ L              
Sbjct: 38  LPYLAKCFVMALLYLKDPLPAADLELWVKTESKRERDNALSILSRLHILSNTTTSDNVRA 97

Query: 49  --------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                    + + AL GG K  S     +  D     +  L+ YA  +WE +L +MVG+ 
Sbjct: 98  YTVTDPFSSSLRQALTGGDKQQSFGVPSVTDDENPMTVAQLDHYARSQWEGVLGYMVGTS 157

Query: 100 ----QTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
               Q     +  V+ L        V      ITKDGF F+L +   QVW+ ++ Y+E  
Sbjct: 158 ALGVQQAATLSKGVKQLLQACHLVEVRDRRVEITKDGFAFVLQDLNTQVWHILILYVENA 217

Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGRF 207
           E  G+D +E L+FLF L     G  Y  + ++      L  L +FG+VYQ    ++A RF
Sbjct: 218 EQIGMDSIEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLADLTDFGIVYQHSPHQEATRF 277

Query: 208 YPTKLALNMATRGTL-----------------KQIREP--GFLIVETNFRVYAYTDSNLK 248
           YPT+LA  + +                         EP  GF+I+ETN+R+YAYT S L+
Sbjct: 278 YPTRLATTLTSDSITLGSSISSSLTAPNGVPSASSNEPSTGFIIIETNYRLYAYTSSPLQ 337

Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP---------- 298
           ++LI LF  L YRFPNL  G LTR SVR A+  GITA QII +L  HA P          
Sbjct: 338 ISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITADQIISYLTTHAHPQMRKYHATKA 397

Query: 299 ---SVSCP--LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
               V  P  LPP V DQIRLW+ ER+R+  + G L+  FV+ A+FE    YA+++GVL 
Sbjct: 398 GANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVNLAEFEGPCRYAEEIGVLV 457

Query: 354 WQNEKKRTVVVTKQ 367
           W++E+KR   VT+ 
Sbjct: 458 WKSERKRMFFVTRH 471


>gi|240279262|gb|EER42767.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
           capsulatus H143]
          Length = 485

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 155/435 (35%), Positives = 216/435 (49%), Gaps = 70/435 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK +V+ LL+++ P+P A +  W+     +E   A   L  L              
Sbjct: 38  LPYLAKCFVMALLYLKDPLPAADLELWVKAESKRERDNALSILGRLHILSNTTTSDNVRA 97

Query: 49  --------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                    + + AL GG K  S     L  D     +  L+ YA  +WE +L +MVG+ 
Sbjct: 98  YTVTDPFSSSLRQALTGGDKQQSFGVPSLTDDENPMTVAQLDHYARSQWEGVLGYMVGTS 157

Query: 99  ----QQTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
               QQ   +S   V+ L        V      ITKDGF F+L +   QVW+ ++ Y+E 
Sbjct: 158 ALGVQQAVTLS-KGVKQLLQACHLVEVRDRRVEITKDGFAFVLQDLNTQVWHILILYVEN 216

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
            E  G+D +E L+FLF L     G  Y  + ++      L  L +FG+VYQ    ++A R
Sbjct: 217 AEQIGMDSIEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLADLTDFGIVYQHSPHQEATR 276

Query: 207 FYPTKLALNMATRGTLKQI-----------------REP--GFLIVETNFRVYAYTDSNL 247
           FYPT+LA  + +                         EP  GF+I+ETN+R+YAYT S L
Sbjct: 277 FYPTRLATTLTSDSITLSSSISSSLTAPNLVPSASSNEPSTGFIIIETNYRLYAYTSSPL 336

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------- 298
           +++LI LF  L YRFPNL  G LTR SVR A+  GITA QII +L  HA P         
Sbjct: 337 QISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITADQIISYLTTHAHPQMRKYHATK 396

Query: 299 ----SVSCP--LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
                V  P  LPP V DQIRLW+ ER+R+  + G L+  FVS A+FE    YA+++GVL
Sbjct: 397 SGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVSLAEFEGPCRYAEEIGVL 456

Query: 353 TWQNEKKRTVVVTKQ 367
            W++E+KR   VT+ 
Sbjct: 457 VWKSERKRMFFVTRH 471


>gi|325089533|gb|EGC42843.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
           capsulatus H88]
          Length = 485

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 155/435 (35%), Positives = 216/435 (49%), Gaps = 70/435 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK +V+ LL+++ P+P A +  W+     +E   A   L  L              
Sbjct: 38  LPYLAKCFVMALLYLKDPLPAADLELWVKAESKRERDNALSILGRLHILSNTTTSDNVRA 97

Query: 49  --------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                    + + AL GG K  S     L  D     +  L+ YA  +WE +L +MVG+ 
Sbjct: 98  YTVTDPFSSSLRQALTGGDKQQSFGVPSLTDDENPMTVAQLDHYARSQWEGVLGYMVGTS 157

Query: 99  ----QQTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
               QQ   +S   V+ L        V      ITKDGF F+L +   QVW+ ++ Y+E 
Sbjct: 158 ALGVQQAVTLS-KGVKQLLQACHLVEVRDRRVEITKDGFAFVLQDLNTQVWHILILYVEN 216

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
            E  G+D +E L+FLF L     G  Y  + ++      L  L +FG+VYQ    ++A R
Sbjct: 217 AEQIGMDSIEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLADLTDFGIVYQHSPHQEATR 276

Query: 207 FYPTKLALNMATRGTLKQI-----------------REP--GFLIVETNFRVYAYTDSNL 247
           FYPT+LA  + +                         EP  GF+I+ETN+R+YAYT S L
Sbjct: 277 FYPTRLATTLTSDSITLSSSISSSLTAPNLVPSASSNEPSTGFIIIETNYRLYAYTSSPL 336

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------- 298
           +++LI LF  L YRFPNL  G LTR SVR A+  GITA QII +L  HA P         
Sbjct: 337 QISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITADQIISYLTTHAHPQMRKYHATK 396

Query: 299 ----SVSCP--LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
                V  P  LPP V DQIRLW+ ER+R+  + G L+  FVS A+FE    YA+++GVL
Sbjct: 397 SGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVSLAEFEGPCRYAEEIGVL 456

Query: 353 TWQNEKKRTVVVTKQ 367
            W++E+KR   VT+ 
Sbjct: 457 VWKSERKRMFFVTRH 471


>gi|119486893|ref|XP_001262366.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Neosartorya fischeri NRRL 181]
 gi|119410523|gb|EAW20469.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Neosartorya fischeri NRRL 181]
          Length = 468

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 144/433 (33%), Positives = 225/433 (51%), Gaps = 55/433 (12%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTE----------LRRNA 51
           L ++AK +V+ LL+++ P+P A + +W+     K H   + T ++             + 
Sbjct: 38  LPDLAKCFVMALLYLKDPLPAADLEAWVRPDSLKLHIMTSTTTSDNVKAYTVTDPFASSL 97

Query: 52  KVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ----QTEGISA 106
           + AL G  +  S     ++ D +A  +  L++YA  +WE +L +MVG+     Q +   +
Sbjct: 98  RQALTGAEQTQSFGVLSQISDEEAVSIADLDEYARRQWEGVLGYMVGTSGLGMQRDVSLS 157

Query: 107 DAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECL 160
             V+ L   G    +      IT+DGF F+L +   QVW+ ++ Y+E+  + G+D VE L
Sbjct: 158 KGVKELLQAGHLVEIRDRRVEITQDGFAFVLQDVGTQVWHILVLYVESAAAIGMDSVEVL 217

Query: 161 TFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT-- 218
           +F+F L     G  Y  + ++      L  L +FG+VYQ    A  FYPT+LA  + +  
Sbjct: 218 SFVFFLSSLELGKSYEKKQLTSNQLRTLTDLADFGIVYQETPDATHFYPTRLATTLTSDS 277

Query: 219 --------------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
                          GT       GF+I+ETN+R+YAYT S L+++LI LF  L YRFPN
Sbjct: 278 STLSNPLAGSLSGPTGTSSSKAGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPN 337

Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------------SCP---LPPVV 308
           L  G +TR SVR A+  GITA QII +L  HA P +               P   LPP V
Sbjct: 338 LITGKITRQSVRRAVEMGITADQIISYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTV 397

Query: 309 SDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
            DQIRLW+ ER+R+  + G L+  F + A++E    YA+++GVL W++++KR   VT+  
Sbjct: 398 VDQIRLWQLERDRVKATHGFLFRDFNTLAEYEAPCRYAEEIGVLVWKSDRKRMFFVTR-- 455

Query: 369 HSDIKKFWVNHQK 381
           H  +  F  + +K
Sbjct: 456 HHQVAAFLKSRRK 468


>gi|50310215|ref|XP_455127.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|62901296|sp|Q6CLR2.1|TFB2_KLULA RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|49644263|emb|CAG97834.1| KLLA0F01056p [Kluyveromyces lactis]
          Length = 496

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 143/451 (31%), Positives = 221/451 (49%), Gaps = 77/451 (17%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           LS +AK +++ +LF +  V    +  W+      + + +  ++  L              
Sbjct: 39  LSPMAKFFIMSMLFQDHDVSLRDLDKWVKPDAKYQLQYSIKSMKSLNLIIEGESKQPLLI 98

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
                 +++ K  L GG    S     + D+       L+QY+ E+WE IL +MVG+  T
Sbjct: 99  RLNPIFKKSFKNVLTGGEINNSFGDVADDDTNPVSTATLDQYSAEKWETILHYMVGTPNT 158

Query: 102 EGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
                  +  L + G           IT  GFQFLL +  AQ+W  +L+YL+  ES  +D
Sbjct: 159 NTPGGKVLDLLQHSGLMEEAEYGELKITNQGFQFLLQDVNAQMWTLLLQYLKMAESLQMD 218

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA-- 213
            V+ L F+F L     G  Y  + +S+  +  LQ +R++GL+YQ +    +FYPT+LA  
Sbjct: 219 PVDVLNFIFMLGALQLGKAYKCDQLSNTQRTMLQDMRDYGLIYQNQSDYAKFYPTRLATL 278

Query: 214 ------------------LNMATRGT----------LKQIREPGFLIVETNFRVYAYTDS 245
                             LN A   T            +  + G LI+ETNF++Y+Y++S
Sbjct: 279 LTSDTKAFRSASVALDSVLNKANETTAVEGDSGQDETTERTQDGALIIETNFKLYSYSNS 338

Query: 246 NLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVS---- 301
            L++A++ LF  L  RF N+  G LTR+SVR AL +GITA QII +L  HA P +     
Sbjct: 339 PLQIAILSLFVHLKSRFANMVTGQLTRESVRNALLNGITAEQIIAYLETHAHPRMRRLAE 398

Query: 302 -----------------CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
                              LPP V DQIRLW+ E +R+ + +G LY+ F S  +++ + D
Sbjct: 399 ENLSKKLELDPTVKETLQVLPPTVVDQIRLWQLELDRIISYDGYLYTDFESYQEYQTVAD 458

Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           YA+D+GVL WQNEKK+   V+ +G+S +  F
Sbjct: 459 YAKDIGVLLWQNEKKKMFFVSTEGNSQVLDF 489


>gi|68490740|ref|XP_710812.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
 gi|68490763|ref|XP_710801.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
 gi|46432046|gb|EAK91553.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
 gi|46432058|gb|EAK91564.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
 gi|238879997|gb|EEQ43635.1| TFIIH basal transcription factor complex subunit 2 [Candida
           albicans WO-1]
          Length = 494

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 151/450 (33%), Positives = 223/450 (49%), Gaps = 77/450 (17%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           LS +AK Y++ ++F E+P+    +  W   +  K   EA   L  L              
Sbjct: 39  LSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLESLHLIEYDSKGSHVRL 98

Query: 48  ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R+N +  L G   P +  S    VD    D+ FL+ +A ++WE IL FMVG++ T 
Sbjct: 99  HSIFRKNFRDCLTGSQNPNAFGSISTTVDKHKVDIPFLDLFASQKWETILHFMVGTEST- 157

Query: 103 GISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
              +D+V +L       EG  N + IT  GFQFLL +  AQ+W  +L+YL   +   +D 
Sbjct: 158 ATPSDSVLSLLKLGGLMEGPNNDLRITNSGFQFLLQDVNAQIWTLLLQYLNLTQELNMDP 217

Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL-- 214
           V+ L F+F L     G  YS   +S+     L  L+++GLVYQR   +GRFYPT+LA   
Sbjct: 218 VDVLNFIFVLGSLELGKSYSVLSLSETQVSMLADLKDYGLVYQRSDTSGRFYPTRLATTL 277

Query: 215 ----------NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
                     +MA     +Q+     +I+ETNF++YAYT S L++A++ LF     RF N
Sbjct: 278 TSDSAALKTPSMAMDEEEQQVATKESIIIETNFKIYAYTKSPLEIAILNLFVHFKTRFAN 337

Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------------------ 300
           +  G +TR+S+R AL +GITA QII FL  HA P +                        
Sbjct: 338 MVCGQITRESIRNALYNGITADQIIKFLETHAHPQMRILAKEKLDKKIEFDTSHNINTAG 397

Query: 301 SCP---------------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
             P               LPP V DQI+LW+ E +R+ T +G L+  F +Q +++ L +Y
Sbjct: 398 GAPQSKTDGMVSQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNY 457

Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           A +LGVL W ++ K+   VTK G + +  F
Sbjct: 458 ASELGVLIWADKVKKKFFVTKDGMTQVADF 487


>gi|353240543|emb|CCA72408.1| probable TFB2-TFIIH subunit (transcription/repair factor)
           [Piriformospora indica DSM 11827]
          Length = 462

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 149/419 (35%), Positives = 220/419 (52%), Gaps = 50/419 (11%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR--RNAKVALL--- 56
           LS + +  V+ LL++E PV    I +W++      ++ A  +L +L+   N+K  +L   
Sbjct: 42  LSPLERQLVMSLLWLEGPVETQSITNWITLEGRSAYEHALKSLAKLQILPNSKEQILLQA 101

Query: 57  -----------GGGKPWSMSAKLEVDSKARDL--DFLNQYALERWECILRFMV--GSQQT 101
                      G G+  S  A +E D+    L  + L+ YA+ERW+ IL FMV  G++Q 
Sbjct: 102 SFKSGLRNGLTGSGQVASFGALVEPDNDMGTLPTEMLDNYAVERWDTILHFMVTSGTEQA 161

Query: 102 EGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV-ESKG 153
               ++ V  L        NE     +IT  GFQFLL    AQ+W FIL YL  + E + 
Sbjct: 162 SARPSEGVLYLLEHSGLMSNEHGRR-IITSAGFQFLLQSPHAQLWEFILSYLRMMAERED 220

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
           +D+V+ L F F L  +  G  YST  +S    + +  LR+FGLVY        F PT+ A
Sbjct: 221 MDMVDILGFFFMLSMTQPGQHYSTHTLSPTQLIMVSDLRDFGLVYFPSDTTTSFQPTRFA 280

Query: 214 --------------LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
                         L +     L Q     F+++ETN+ VYAYT++ L++A++ LF    
Sbjct: 281 TTLTSYTSSFSDHDLTIENGADLSQ----EFVVLETNYHVYAYTNNPLQIAVLNLFVSFK 336

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPLPPV-VSDQIRLWE 316
            RFPN+ +G LTRDSV+ AL +GITA QI+ +L  HA P +  + P+ PV V DQIRLWE
Sbjct: 337 ARFPNMIMGSLTRDSVKKALVNGITADQILSYLVTHAHPQMRKNNPIIPVTVQDQIRLWE 396

Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            ER R+   +G LY +F S  D+E +  YA++LGV+ W+N  +R       G   I+ +
Sbjct: 397 LERHRVKGQDGYLYKEFASMNDYEVVVQYARELGVVLWENASRRMFFADAAGRVHIRSY 455


>gi|295662078|ref|XP_002791593.1| RNA polymerase II transcription factor B subunit 2
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226279719|gb|EEH35285.1| RNA polymerase II transcription factor B subunit 2
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 484

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 156/434 (35%), Positives = 223/434 (51%), Gaps = 69/434 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR--RNA-------- 51
           L  +AK++V+ LL+++ P+P A +  W+     +E   A   L  L    NA        
Sbjct: 38  LPYLAKYFVMALLYLKDPLPAADLELWVKPESKRERDNALSILGRLHILSNATTTDHVRA 97

Query: 52  -----------KVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                      + AL GG K  S     +  D+    +  L++YA  +WE +L +MVG+ 
Sbjct: 98  YVVTEPFASSLRQALTGGDKQQSFGVPSVTPDANLMTVAQLDEYASRQWEGVLGYMVGTS 157

Query: 99  ----QQTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
               QQT  +S   V+ L        V      ITK+GF F+L +   QVW  ++ Y+E 
Sbjct: 158 ALGAQQTVTLS-KGVKQLLQACHLVEVHDRRVDITKEGFAFVLQDVNTQVWNILILYVEN 216

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQR--KRKAGR 206
            E  GL+ VE L+FLF L     G  Y  + ++      L  L +FG+VYQ    ++A R
Sbjct: 217 AEQIGLESVEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLTDLTDFGIVYQHFPPQEATR 276

Query: 207 FYPTKLALNM----ATRG-------------TLKQIREP--GFLIVETNFRVYAYTDSNL 247
           FYPT+L   +    AT G             T     EP  GF+++ETN+R+YAYT S L
Sbjct: 277 FYPTRLVTTLTSDSATLGSSISTSLTGPSGPTAANSSEPSTGFIVIETNYRLYAYTSSPL 336

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------- 298
           +++LI LF  L YRFPNL  G +TR S+R A+  GITA QII +L  HA P         
Sbjct: 337 QISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITADQIISYLTTHAHPQMRKYHASK 396

Query: 299 ---SVSCP--LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
              +V  P  LPP V DQIRLW+ ER+R+  + G L+  FVS A+F+    YA+++GVL 
Sbjct: 397 SSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVSLAEFDGPCRYAEEIGVLV 456

Query: 354 WQNEKKRTVVVTKQ 367
           W++EK+R   VT+ 
Sbjct: 457 WKSEKRRMFFVTRH 470


>gi|170589001|ref|XP_001899262.1| Transcription factor Tfb2 family protein [Brugia malayi]
 gi|158593475|gb|EDP32070.1| Transcription factor Tfb2 family protein [Brugia malayi]
          Length = 465

 Score =  232 bits (591), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 146/384 (38%), Positives = 215/384 (55%), Gaps = 54/384 (14%)

Query: 44  LTELRRNA--KVALLGGGKPWSMSAKLEVDSKAR---DLDFLNQYALERWECILRFM-VG 97
           L ++ R A  K   +G  K   +S   + D K R   + D L + A+ERWECIL ++ + 
Sbjct: 76  LNQVYRRAYLKATRIGLCKASELSVVTDCDEKTRKSANKD-LGKKAVERWECILHYLALP 134

Query: 98  SQQTE-GISADAVRTLFNEGAENP-------VITKDGFQFLLLETPAQVWYFILKYLETV 149
           SQ++E G+S    R   + G  +         IT  GFQFLLL  P Q+W ++L Y    
Sbjct: 135 SQKSEQGVSGTTKRLFRSAGLTSSGNDDGDIEITSAGFQFLLLNRPEQIWTYLLHYFHMQ 194

Query: 150 ESKGLDLVECLTFLFQLKF---STFGTDYSTEGMSDK------------------LQVFL 188
           E  G+D+V+ L FLF+L     ST G      G SDK                  ++ FL
Sbjct: 195 EVAGIDIVKELDFLFKLTLYSGSTCGGTRVVNG-SDKERPTGSRAFIIDENWPETIKDFL 253

Query: 189 QHLREFGLVYQRKRKAGRFYPTKL-----ALNMATRGTLKQIREPGFLIVETNFRVYAYT 243
            HLRE GLV+ RKRK G F+ T L      ++  +   ++   + GF+IVETN+RVYAYT
Sbjct: 254 MHLRELGLVFIRKRKDGFFFLTPLLNHLTGISNTSEAGMENRNQNGFVIVETNYRVYAYT 313

Query: 244 DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCP 303
           DSNL++A++  F E++YRF +++VGVL+R++VR A + GITAAQII FL  +A P ++C 
Sbjct: 314 DSNLQLAILSTFTEMLYRFSDMSVGVLSREAVRRAFQVGITAAQIIAFLSRNAHP-ITCA 372

Query: 304 -----------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
                      +P  V DQI+LWE ER RLT     +YS F S+ ++  L+ Y     +L
Sbjct: 373 VTTEYGRMIQSVPITVVDQIQLWEDERRRLTFCSAAVYSSFESEKEYFGLKGYVISQNIL 432

Query: 353 TWQNEKKRTVVVTKQGHSDIKKFW 376
            W ++ +R +V+T++GH ++K +W
Sbjct: 433 LWCHDVQRLMVITEEGHENVKVWW 456


>gi|449433857|ref|XP_004134713.1| PREDICTED: general transcription factor IIH subunit 4-like [Cucumis
           sativus]
          Length = 451

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 136/406 (33%), Positives = 219/406 (53%), Gaps = 33/406 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK +V+++L+++ PV    +  W+      ++K A D L +LR             
Sbjct: 40  LPPLAKKFVLQMLYIDGPVSAKSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETT 99

Query: 49  --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                    N +  L+ G           +  +   L+ L  YAL++WEC L  ++ S Q
Sbjct: 100 YRLNPTFQANLQKLLIHGEVLAREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQ 159

Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
            E  S  + +V  +F +G       E P +T+ GFQFLL+ET AQ+WY I +Y+   E +
Sbjct: 160 AEKPSNISSSVMKVFQKGLLSQRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEER 219

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTK 211
           G+D  + ++FL +L F   G  Y  + +SD+ +  ++ L + GLV  Q+ RK   F PTK
Sbjct: 220 GVDPADLISFLLELSFHVTGEAYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTK 279

Query: 212 LALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
           LA N++        R+ GF++VETNFR+YAY+ S L   ++ LF  + Y+ PNL VG +T
Sbjct: 280 LATNLSMSLADSSSRKLGFVVVETNFRMYAYSTSKLHCEILRLFSRIEYQLPNLIVGAIT 339

Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGV 328
           ++S+  A ++GITA QI+ FL+ +A P V+  +P V   V+DQIRLWE +  R+  +   
Sbjct: 340 KESLYNAFKNGITAEQIVTFLQQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAH 399

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKK 374
            Y +F S+  FE   DYA++   L W++ K   +VV    H+ +++
Sbjct: 400 FYDEFPSREVFEAACDYAREWNGLLWEDSKNLRLVVKADIHTHMRE 445


>gi|341886092|gb|EGT42027.1| hypothetical protein CAEBREN_00374 [Caenorhabditis brenneri]
          Length = 460

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 139/357 (38%), Positives = 206/357 (57%), Gaps = 36/357 (10%)

Query: 54  ALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQTEGISADAVRT 111
           A+LG     S+   LE + + R    + + A+ERW+CILR++     + T+ +S +  R 
Sbjct: 105 AMLGAASISSLV--LETNEEKRRGKEVEKKAVERWDCILRYLALPSEENTQAVS-ETTRN 161

Query: 112 LF-----NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQL 166
           LF       G     IT  GFQFLLL    Q+W ++++YL+   S+G D+VE +  L Q+
Sbjct: 162 LFKMANFTSGDSRIEITTSGFQFLLLSPVKQMWTYVIEYLKLEMSQGHDIVEVIEPLIQI 221

Query: 167 KF----------STFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM 216
                         +  D S  G   +L   L HLRE G+++ RKRK G F+ T+L  ++
Sbjct: 222 VLLANRGYKAEKECYQIDPSWTGPQQEL---LNHLRELGVIFIRKRKDGVFFLTQLLTHL 278

Query: 217 ATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
           AT  T+          G +IVETNFRVYAYT S L++A+I LF E+ YRF +++VG++TR
Sbjct: 279 ATNETIDDTSAEKVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITR 338

Query: 273 DSVRAALRSGITAAQIIGFLRLHALPS-------VSCPLPPVVSDQIRLWEGERERLTTS 325
           +SVR AL+ GITAAQII FLR +A P        V+C LP  V+DQIRLWE ER R+   
Sbjct: 339 ESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNC-LPITVADQIRLWEDERRRMDLK 397

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
           +  +YS F S  +F  + DYA++  +L W + +++ V+V ++GH  +++ W   QK 
Sbjct: 398 DSYIYSNFESDEEFNGVCDYAKERKILLWADYQRKLVIVNEEGHEYVRQ-WYKQQKS 453


>gi|402592023|gb|EJW85952.1| transcription factor Tfb2 family protein [Wuchereria bancrofti]
          Length = 470

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 136/358 (37%), Positives = 207/358 (57%), Gaps = 41/358 (11%)

Query: 56  LGGGKPWSMSAKLEVDSKAR---DLDFLNQYALERWECILRFM-VGSQQTE-GISADAVR 110
           +G GK   +S   + D K R   + D L + A+ERWECIL ++ + SQ++E G+S    R
Sbjct: 108 IGLGKASELSVVKDCDEKTRKSANKD-LGKKAVERWECILHYLALPSQKSEQGVSGTTKR 166

Query: 111 TLFNEGAENP-------VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFL 163
              + G  +         IT  GFQFLLL  P Q+W ++L Y    E  G+D+++ L FL
Sbjct: 167 LFRSAGLTSSGNDDGDIEITSAGFQFLLLNRPEQIWTYLLHYFHMQEVAGIDIIKELDFL 226

Query: 164 FQLKFST----------FGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL- 212
           F++   +          F  D   E   + ++ FL HLRE GLV+ RKRK G F+ T L 
Sbjct: 227 FKVVNDSDKERPAGSRAFVID---ENWPETIKDFLIHLRELGLVFIRKRKDGVFFLTPLL 283

Query: 213 ----ALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
                ++  +  +++   + GF+IVETN+RVYAYTDSNL++A++  F E++YRF +++VG
Sbjct: 284 NHLTGISNTSEASMENRNQHGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVG 343

Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC----------PLPPVVSDQIRLWEGE 318
           VL+R++VR A + GITAAQII FL  +A P               +P  V+DQI+LWE E
Sbjct: 344 VLSREAVRRAFQVGITAAQIIAFLSTNAHPITYAVTTEYGRMIQSVPITVADQIQLWEDE 403

Query: 319 RERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
           R RLT     +YS F S+ ++  L+ Y     +L W ++ +R +V+T++GH ++K +W
Sbjct: 404 RRRLTFCNATVYSSFESEKEYFGLKGYVISQNILLWCHDVQRLMVITEEGHENVKAWW 461


>gi|403163300|ref|XP_003323393.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163998|gb|EFP78974.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 887

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 145/429 (33%), Positives = 219/429 (51%), Gaps = 56/429 (13%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHK-EATDTLTELRR----------- 49
           L  IA+H V+ LL+ +  + ++ +  W    H++  +    D+LT+L R           
Sbjct: 76  LPLIARHLVLNLLWSDHSIDKSQLELWADINHSESRRIHVEDSLTKLFRLNIVEQSSLDS 135

Query: 50  ------------------------NAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYA 84
                                   N + AL GGG   S        S A   +D L++Y 
Sbjct: 136 SSSSFDDPRNHHHLTLTLNRSFQQNFRRALTGGGNHCSFGVPCNTSSNAVVSIDELDRYG 195

Query: 85  LERWECILRFMVGSQQTEGISADAVRTLFNEG---AENP------VITKDGFQFLLLETP 135
            E+WE IL +MVGS+     S + +  L   G   + +P       IT  GF FLL +  
Sbjct: 196 TEKWETILHYMVGSRLPTKPSHNILSLLGQSGLMTSSDPRSLQSLKITSKGFGFLLEDVN 255

Query: 136 AQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFG 195
            Q+W  +L+YL+  E  GLD+V+ L  LF L     G +YS    +      LQ L ++G
Sbjct: 256 TQLWDILLQYLKMTEVNGLDVVDVLACLFMLGSLELGQEYSFSNWTPTQTQVLQDLVDYG 315

Query: 196 LVYQRKRKAGRFYPTKLALNMATRGTL-----KQIREPGFLIVETNFRVYAYTDSNLKVA 250
           LV        RFYPT+LA  + +         +   E GFL++ETN+R+YAYT + L++A
Sbjct: 316 LVL--VSAPDRFYPTRLATTLTSTAPPLVSAERAQEEHGFLVLETNYRIYAYTSNPLQIA 373

Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPV 307
           ++ LF  L YRFPNL VG +TR+S+++AL +GITA Q+I +L  HA P +      LPP 
Sbjct: 374 VLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQVIMYLHTHAHPQMRKLEPLLPPT 433

Query: 308 VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
           V DQIRLWE E+ R+   EG LY  F S A+++ +  Y++ LG++ W++   R + V   
Sbjct: 434 VVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQYSRKLGIVLWEHAGLRKLFVGYD 493

Query: 368 GHSDIKKFW 376
           GH  +++F+
Sbjct: 494 GHLTLREFF 502


>gi|150863860|ref|XP_001382481.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
           II transcription factor B p52 subunit) (RNA polymerase
           II transcription factor B 52 kDa subunit)
           [Scheffersomyces stipitis CBS 6054]
 gi|149385116|gb|ABN64452.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
           II transcription factor B p52 subunit) (RNA polymerase
           II transcription factor B 52 kDa subunit)
           [Scheffersomyces stipitis CBS 6054]
          Length = 515

 Score =  231 bits (588), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 152/468 (32%), Positives = 227/468 (48%), Gaps = 96/468 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHK-EATDTLTEL------------- 47
           L+ +AK Y++ +LF ++PV    +  W  K HAK+ + ++   LT L             
Sbjct: 40  LTPMAKFYIMTMLFNDRPVALRDLDRW-CKPHAKKLQFDSLKRLTSLHLIEEDRGGNHIR 98

Query: 48  -----RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQQT 101
                R+N +  L G  +  +         K R D+ FL+ +A  +WE IL FMVG++ T
Sbjct: 99  LNPTFRQNFRNCLTGSQQSNAFGNLCTTTDKHRVDISFLDNFASSKWETILHFMVGTEAT 158

Query: 102 EGISADAVRTLFNEGA---------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
                 +V +L   G          +N  IT  GFQFLL +  AQ+W  +L+YL   +  
Sbjct: 159 -ATPTTSVLSLLKSGGLMEGPGNSPQNLKITNTGFQFLLQDVNAQIWTLLLQYLNLTQEL 217

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
            +D V+ L F+F L     G  Y    +S      L  LR++GLVYQR   +GRFYPT+L
Sbjct: 218 HMDPVDVLNFIFVLGSLELGKSYLVSSLSATQVSMLADLRDYGLVYQRTDSSGRFYPTRL 277

Query: 213 --------------------ALNMATRGTLKQIREP----GFLIVETNFRVYAYTDSNLK 248
                               AL  +  G  +QI  P    G +I+ETNF++YAYT+S L+
Sbjct: 278 ATTLTSDSAALKTPSMAIEQALGASDGGEEQQIDTPTGNQGTVIIETNFKLYAYTNSPLE 337

Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------- 300
           +A++ LF  L  RF N+  G +TR+S+R+AL +GITA QII FL  HA P +        
Sbjct: 338 IAILNLFVHLKSRFTNMVCGQITRESIRSALYNGITADQIIKFLETHAHPQMKLLAKEKL 397

Query: 301 ----------------SCP-----------------LPPVVSDQIRLWEGERERLTTSEG 327
                             P                 LPP V DQI+LW+ E +R+ T +G
Sbjct: 398 DKKIEFDTSHNINTAGGAPQSKVSGEGNVAQHKLEILPPTVVDQIKLWQLELDRIQTFDG 457

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            L+  F +Q +F+ L +YA ++GVL W ++ K+   VT +G + + +F
Sbjct: 458 YLFKDFSNQQEFDTLSNYASEVGVLVWSDKIKKKFFVTAEGMTQVAEF 505


>gi|169610043|ref|XP_001798440.1| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
 gi|160701967|gb|EAT84391.2| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
          Length = 426

 Score =  230 bits (587), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 141/409 (34%), Positives = 214/409 (52%), Gaps = 56/409 (13%)

Query: 11  IRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL---------------------RR 49
           + +L++    P A + +W   +   E  +A D L  L                     +R
Sbjct: 1   MAMLYMPSSFPAADLEAWFKPSARSERVQALDILERLHILGSKREDDKSLSYSLIPGFQR 60

Query: 50  NAKVALLGGGKPWSM---SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
           + + A+ G G   +    ++K E   K   ++FL+ +A E+WE IL FMV      G   
Sbjct: 61  SLRHAIEGSGTHRTFGVPASKAERGEKRLSIEFLDAHAREQWEGILFFMVSG--AAGFQP 118

Query: 107 DAVRTLFNEGAE--------------NPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
            +V+     G +               P ITK+GF F+L ET AQVW  ++ YL+ V+  
Sbjct: 119 GSVKMDVGPGTKKLLHAGDLVRTVHGTPRITKEGFSFVLQETNAQVWNLLIVYLKMVDDL 178

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
           G+   E L+FLF L     G DYST  +S      L  L   G+VY+  + +  FYPT+L
Sbjct: 179 GMSETEVLSFLFMLGSLELGQDYSTSTLSPSQLHMLDDLSAMGIVYRSSKDSPTFYPTRL 238

Query: 213 ALNMAT-RGTL------------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           A  + +  G L             + +  GF+IVETN+R+YAYT+S +++A++ LF +L 
Sbjct: 239 ATTLTSDSGALPGSDVGTTEKPDSKAQNKGFIIVETNYRLYAYTNSLIQIAILSLFTKLQ 298

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWE 316
           +RFPNL  G LT++SV  A+++GIT+AQII +L  +A P +   + PLPP V DQIRLWE
Sbjct: 299 HRFPNLVSGKLTKESVHRAVQAGITSAQIISYLTTYAHPQMQKSNPPLPPTVMDQIRLWE 358

Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
            E ER+  + G L  +F S++++  +  YAQ LGVL W+N+  R   V+
Sbjct: 359 YEGERVEVTTGYLMREFGSESEYRDVMGYAQALGVLVWKNDANRCFFVS 407


>gi|356548733|ref|XP_003542754.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
           max]
          Length = 451

 Score =  230 bits (587), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 136/404 (33%), Positives = 216/404 (53%), Gaps = 33/404 (8%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------- 48
           +AK YVI++L ++  V   ++  W+      +H+ A D L +LR                
Sbjct: 43  LAKKYVIQMLHIDVSVAAKLLEEWVLPGGVSKHRVAIDRLVQLRVFLEAVDRKNEKTYKV 102

Query: 49  -----RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE- 102
                R+ +  L+ GG     S    +  +   L+ L  YALE+WEC L  ++   Q E 
Sbjct: 103 NPTFQRSLQKLLVQGGTLPRESMPSNITVRLPTLENLEAYALEQWECFLLQLISPSQVEK 162

Query: 103 --GISADAV-----RTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
              IS+  +     R L     E P +T+ GFQFLL++T AQ+WY I +Y+   E +G+D
Sbjct: 163 PLNISSSLMKVFQRRILSYRDKEAPKLTESGFQFLLMDTNAQLWYIIREYISNSEDRGVD 222

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLAL 214
             + ++F+ +L F   G  YS + ++   +  +  L + GLV  Q+ RK   F PTKLA 
Sbjct: 223 AADLISFMLELSFHDIGEAYSVDTLTSFQRTIINDLADLGLVKIQQGRKGSWFIPTKLAT 282

Query: 215 NMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
           N++        R+ GF++VETNFRVYAY+ S L   ++ LF  + Y+ PNL VG +T++S
Sbjct: 283 NLSMSLADSSSRKQGFVVVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAITKES 342

Query: 275 VRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLYS 331
           + +A  +GITA QII FL+ +A P V+  +P V   V++QIRLWE +  R+  ++   Y 
Sbjct: 343 LYSAFENGITAEQIITFLQQNAHPRVAERIPSVPENVTEQIRLWEADLNRVEMTDAYYYD 402

Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           +F S+  FE   D A++   L W++ KK  +VV  + H  ++ +
Sbjct: 403 EFPSRDVFEGACDCAREWNGLLWEDSKKMHMVVKTEVHQYVRDY 446


>gi|367009216|ref|XP_003679109.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
 gi|359746766|emb|CCE89898.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
          Length = 513

 Score =  230 bits (586), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 141/459 (30%), Positives = 232/459 (50%), Gaps = 88/459 (19%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L ++AK +++ ++F E  +    +  W+  +   + ++A  ++  L              
Sbjct: 40  LPQMAKFFIMSMVFNENEISLRDLDRWVKASGKNQFQDAIKSMKSLHLLIPVRTNGPMMI 99

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
                 R + + AL GG    S    ++  +   + + L+ Y+ ++WE IL FMVG+   
Sbjct: 100 NLNQTFRESFRNALTGGEVNNSFGIVVDEPNDVVNTEMLDAYSADKWETILHFMVGTSLA 159

Query: 102 EGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
              S + +  L +      V       IT +GFQFLL E  +Q+W  +L+YL+  E+  +
Sbjct: 160 STPSGNVLNLLKHSKLMEEVSTTGEFKITNEGFQFLLQEVNSQIWALLLQYLKMTETLQM 219

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           D VE L F+F L    FG  YS +G+SD  ++  + +R++GLV+Q+   +  FYPT+LA 
Sbjct: 220 DPVEVLNFIFMLGALEFGKPYSMDGLSDTQKLMSKDMRDYGLVFQKNSNSKVFYPTRLAT 279

Query: 215 NMAT-----RGT-------LKQIRE----------------------------PGFLIVE 234
            + +     RG        LKQ ++                             G LIVE
Sbjct: 280 MLTSDAKSIRGASGAMDSVLKQNKDEASNKAASSNADADSDDDEVGINGQPIQDGALIVE 339

Query: 235 TNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRL 294
           TNF++Y+Y++S L++A++ LF  L  RF N+  G +TR+S+R AL +GITA QII +L+ 
Sbjct: 340 TNFKLYSYSNSPLQIAILSLFVHLKSRFTNMVTGQITRESIRRALINGITADQIIAYLQT 399

Query: 295 HALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           HA P +                +C      LPP V DQI+LW+ E +R+ + +G LYS  
Sbjct: 400 HAHPQMRRLAEEKLEKKLELDANCKDTLQILPPTVVDQIKLWQLELDRIISYDGSLYSDI 459

Query: 334 VSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
            S  ++  L  YAQD+GVL W++++KR   V+K+G+S +
Sbjct: 460 DSHQEYILLSTYAQDIGVLLWKDDRKRIFFVSKEGNSQV 498


>gi|294660113|ref|XP_002777731.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
 gi|218511847|sp|Q6BGW8.2|TFB2_DEBHA RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|199434471|emb|CAR66042.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
          Length = 515

 Score =  230 bits (586), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 152/475 (32%), Positives = 220/475 (46%), Gaps = 95/475 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  IAK Y++ +LF E PV    +  W   T  K   ++   L  L              
Sbjct: 40  LPPIAKFYIMTMLFYETPVALKDLDKWCKPTSRKFQFDSLKRLRALHLIEEDSSGSHIRL 99

Query: 48  ----RRNAKVALLGG-GKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R+N +  L G   +    +     D ++  + FL+ +AL++WE IL FMVG++ + 
Sbjct: 100 HPTFRQNFRDCLTGSQARNAFGNLSTTPDEQSVSIKFLDSFALQKWESILHFMVGTELSV 159

Query: 103 GISADAVRTL--------FNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
             S   +  L         N+G     IT  GFQFLL +  AQ+W  +L+YL   +   +
Sbjct: 160 TPSNSVLSLLKSGGLMEGTNKGGNRLNITNSGFQFLLQDINAQIWTLLLQYLNLTQDLNM 219

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           D V+ L F+F L     G  Y+   +S+     L  LR++GL+YQR   + RFYPT+LA 
Sbjct: 220 DPVDVLNFIFILGSLELGKGYTVSSLSETQISMLADLRDYGLIYQRSENSERFYPTRLAT 279

Query: 215 ------------NMATRGTLKQIREP------------GFLIVETNFRVYAYTDSNLKVA 250
                       +MA    L+   E             G +I+ETNF++YAYT+S L++A
Sbjct: 280 TLTSDSAALKSPSMAMEQALESTTETEEQQNLASNSNQGTIILETNFKIYAYTNSPLEIA 339

Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---------- 300
           ++ LF  L  RF N+  G +TR+S+R AL +GITA QII FL  HA   +          
Sbjct: 340 ILNLFVHLKTRFSNMVCGQITRESIRNALYNGITADQIIKFLETHAHSQMKILAKERLDK 399

Query: 301 --------------SCP--------------------LPPVVSDQIRLWEGERERLTTSE 326
                           P                    LPP V DQI+LW+ E +R+ T E
Sbjct: 400 KIEFDTSNNINTAGGAPQSQMMTNENGTTVAQHKLEVLPPTVVDQIKLWQLELDRIQTFE 459

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
           G L+  F SQ +FE L +YA ++GVL W N+ K    VTK G S +  F   +Q+
Sbjct: 460 GYLFKDFASQQEFEALSNYASEIGVLLWSNKSKMKFFVTKDGISQVADFASRNQR 514


>gi|225682227|gb|EEH20511.1| RNA polymerase II transcription factor B subunit 2
           [Paracoccidioides brasiliensis Pb03]
 gi|226289599|gb|EEH45083.1| TFIIH basal transcription factor complex p52 subunit
           [Paracoccidioides brasiliensis Pb18]
          Length = 484

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 155/434 (35%), Positives = 222/434 (51%), Gaps = 69/434 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR--RNA-------- 51
           L  +AK +V+ LL+++ P+P A +  W+     +E   A   L  L    NA        
Sbjct: 38  LPYLAKCFVMALLYLKDPLPAADLELWVKSESKRERDNALSILGRLHILSNATTTDHVRA 97

Query: 52  -----------KVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                      + AL GG K  S     +  D+    +  L++YA  +WE +L +MVG+ 
Sbjct: 98  YVVTEPFASSLRQALTGGDKQQSFGVPSVTPDANLMTVAQLDEYASRQWEGVLGYMVGTS 157

Query: 99  ----QQTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
               QQT  +S   V+ L        V      ITK+GF F+L +   QVW  ++ Y+E 
Sbjct: 158 VLGAQQTVTLS-KGVKQLLQACHLVEVHDRRVDITKEGFAFVLQDVNTQVWNILILYVEN 216

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQR--KRKAGR 206
            E  GL+ VE L+FLF L     G  Y  + ++      L  L +FG+VYQ    ++A R
Sbjct: 217 AEQIGLESVEVLSFLFVLSSLELGQSYEKKHLTSTQLKTLTDLTDFGIVYQHFPPQEATR 276

Query: 207 FYPTKLALNM----ATRGT-------------LKQIREP--GFLIVETNFRVYAYTDSNL 247
           FYPT+L   +    AT G+                  EP  GF+++ETN+R+YAYT S L
Sbjct: 277 FYPTRLVTTLTSDSATLGSSISTSLTGPSGPAAANSSEPSTGFIVIETNYRLYAYTSSPL 336

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------- 298
           +++LI LF  L YRFPNL  G +TR S+R A+  GITA QII +L  HA P         
Sbjct: 337 QISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITADQIISYLTTHAHPQMRKYHASK 396

Query: 299 ---SVSCP--LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
              +V  P  LPP V DQIRLW+ ER+R+  + G L+  FVS A+F+    YA+++GVL 
Sbjct: 397 SSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKDFVSLAEFDGPCRYAEEIGVLV 456

Query: 354 WQNEKKRTVVVTKQ 367
           W++EK+R   VT+ 
Sbjct: 457 WKSEKRRMFFVTRH 470


>gi|366997767|ref|XP_003683620.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
 gi|357521915|emb|CCE61186.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
          Length = 504

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 140/458 (30%), Positives = 230/458 (50%), Gaps = 85/458 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK +++ ++F +  +    +  W++     + ++A  ++  L              
Sbjct: 39  LPHMAKFFIMSMVFNDNEISIRDLDRWVNSNGKLQFQDAIKSMKSLHILIPGKEQGSMFV 98

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
                 R + + AL GG    S    ++      D   L +YA ++WE IL FMVG+   
Sbjct: 99  NLNSTFRTSFRNALTGGEVNNSFGNIIDKIDDLVDSSVLLEYAKDKWETILHFMVGTPLI 158

Query: 102 EGISADAVRTLFN--------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
           + + +D V  L          E +    IT DGFQFLL +  +Q+W  +L+YL+  E+  
Sbjct: 159 K-MPSDNVLNLLKHSKLMEELESSGEFKITNDGFQFLLQDVNSQIWTLLLQYLKIAEALH 217

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
           ++ V+ L F+F L    FG  YS EG+SD  +  L+ +R++GL++Q+   A  FYPT LA
Sbjct: 218 MNPVDILNFIFMLGSLEFGKAYSVEGLSDTQKKMLKDMRDYGLIFQKTSNATVFYPTTLA 277

Query: 214 LNMATRGT------------LKQIREP-----------------------GFLIVETNFR 238
             + +  +            L Q +E                        G L+VETNF+
Sbjct: 278 TQLTSDSSTVRSASGAIDSLLSQTKEANNNDSQNLGTDTDDLVGNQSVEVGALVVETNFK 337

Query: 239 VYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP 298
           +Y+Y++S L++A++ LF  L  RF N+  G +TR+S+R AL +G+TA QII +L+ HA P
Sbjct: 338 LYSYSNSPLQIAILSLFVHLRTRFSNMVTGQITRESIRNALANGVTADQIIAYLQSHAHP 397

Query: 299 SVS-------------CP--------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
            +               P        LPP V DQI+LW+ E +R+ + +G LYS F +  
Sbjct: 398 QMRRLAETTLKKKLELDPNTKDQLQILPPTVVDQIKLWQLELDRILSYDGSLYSDFDNNQ 457

Query: 338 DFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           ++  L +YA+D+GVL W+N++KR   V+K+G+S +  F
Sbjct: 458 EYNMLYNYAKDIGVLLWKNDRKRKFFVSKEGNSQVLDF 495


>gi|448100135|ref|XP_004199281.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
 gi|359380703|emb|CCE82944.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
          Length = 509

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 147/466 (31%), Positives = 218/466 (46%), Gaps = 92/466 (19%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK Y++ +LF E P+    ++ W + T  K   EA   L  L              
Sbjct: 40  LPPLAKFYIMSMLFYEHPIALKDMSKWCNPTAKKIQFEALKRLRSLHLIEEHNKGSHLKL 99

Query: 48  ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R+N +  L G     +  +     D    D+ FL+ +A ++WE IL FMVG++ + 
Sbjct: 100 HQTFRKNFRDCLTGSQTSNAFGNLCTSPDKNHIDVKFLDMFASQKWESILHFMVGTELSV 159

Query: 103 GISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
             S   +  L + G        A N  IT  GFQFLL +   Q+W  +L+YL   +   +
Sbjct: 160 APSKSVLSLLKSGGLMEGPGTSASNLKITNKGFQFLLQDVNVQIWTLLLQYLSLTQDLNM 219

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           D V+ L F+F L     G  Y+   +S+     L  L+++GL+YQ      RFYPT+LA 
Sbjct: 220 DPVDVLNFIFVLGCLELGKSYTVSSLSETQISMLADLKDYGLIYQNHNDKERFYPTRLAT 279

Query: 215 NMATRGT--------LKQIREP---------------GFLIVETNFRVYAYTDSNLKVAL 251
            + +           L Q+ +                G +I+ETNF++YAYT+S L++A+
Sbjct: 280 TLTSEAAALRTPSMALNQVVDTANSEATEEQMFSNTGGSIILETNFKIYAYTNSPLEIAI 339

Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA--------------- 296
           + LF  L  RFPN+  G +TR+S+R AL +GITA QII FL  HA               
Sbjct: 340 LNLFVHLKTRFPNMVTGQITRESIRKALFNGITADQIIKFLESHAHSQMKQMAKENLDKK 399

Query: 297 ----------------------LPSVSC-----PLPPVVSDQIRLWEGERERLTTSEGVL 329
                                   S S       LPP V DQI+LW+ E +R+ T EG L
Sbjct: 400 IEFDANNNINTATDSQQLSVGEFGSASSYHKLEILPPTVVDQIKLWQLELDRIQTFEGYL 459

Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           +  F SQ +++ LR+YA +LGVL W ++ ++   VTK G S +  F
Sbjct: 460 FKDFSSQQEYDALRNYASELGVLLWSDKNRKKFFVTKDGMSQVADF 505


>gi|115388809|ref|XP_001211910.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114195994|gb|EAU37694.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 460

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 143/420 (34%), Positives = 216/420 (51%), Gaps = 48/420 (11%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAK---EHKEATDTLTELRRNAKVALLGG 58
           L ++AK +V+ LL+++ P+P   +  W+     K   ++  A         + + AL G 
Sbjct: 38  LPDLAKCFVMALLYLKDPLPAKDLELWVKSDSLKTLADNVRAYIVTNPFATSLRQALTGT 97

Query: 59  GKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ----QTEGISADAVRTLF 113
            +  S     ++ D     +  L++YA  +WE +L +MVG+     Q +   +  V+ L 
Sbjct: 98  EETQSFGVLSQLPDHPPVTIADLDEYARRQWEGVLGYMVGTSGLGIQRDVTLSKGVKQLL 157

Query: 114 NEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLK 167
             G    +      IT+DGF F+L +   QVW+ ++ Y+E+ E+ G+D VE L+F+F L 
Sbjct: 158 QAGHLVEIRDRRVEITQDGFAFVLQDVSTQVWHILILYVESAEAIGMDSVEVLSFVFLLS 217

Query: 168 FSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR-------- 219
               G  Y  + ++      L  L +FG+VYQ    A  FYPT+LA  + +         
Sbjct: 218 SLELGKSYEKQHLTSNQLRTLTDLADFGIVYQHSPDATHFYPTRLATTLTSDASSLSNPI 277

Query: 220 --------GTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLT 271
                   GT       GF+IVETN+R+YAYT S L+++LI LF  L YRFPNL  G +T
Sbjct: 278 SGSLSGPDGTSSAKAGSGFIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKIT 337

Query: 272 RDSVRAALRSGITAAQIIGFLRLHALPSV-------------SCP---LPPVVSDQIRLW 315
           R SVR A+  GITA QII +L  HA P +               P   LPP V DQIRLW
Sbjct: 338 RQSVRRAIEMGITADQIISYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLW 397

Query: 316 EGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           + ER+R+  + G L+  F S A++E    YA+++GVL W++ +KR   VT+  H  +  F
Sbjct: 398 QLERDRVKATAGFLFKDFASLAEYEDPCRYAEEIGVLVWKSNRKRMFFVTR--HEQVAAF 455


>gi|392901992|ref|NP_001255864.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
 gi|290457493|emb|CBK19514.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
          Length = 431

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 137/371 (36%), Positives = 215/371 (57%), Gaps = 35/371 (9%)

Query: 37  HKEATDTLTELRRNAK-----VALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECI 91
           H++ TD    +  + K      A+LG  +  S+   LE +   R    + + A+ERW+CI
Sbjct: 45  HRQTTDGKLNIDTDYKRSYMYAAMLGAAQISSLV--LEPNEGNRRGKDVEKKAVERWDCI 102

Query: 92  LRFMV--GSQQTEGISADAVRTLFNE-----GAENPV-ITKDGFQFLLLETPAQVWYFIL 143
           LR++     + T+ +S +  R LF +     G +  + IT  GFQFLLL    Q+W +++
Sbjct: 103 LRYLALPSEENTQAVS-ETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVI 161

Query: 144 KYLETVESKGLDLVECLTFLFQL------KFSTFGTDYSTEGM-SDKLQVFLQHLREFGL 196
           +YL+   S+G D+VE +  L Q+       F      Y  +   ++     L HLRE G+
Sbjct: 162 EYLKLEMSQGQDIVEVIEPLIQIVLLANRGFKAERECYQIDANWTEPQNELLNHLRELGV 221

Query: 197 VYQRKRKAGRFYPTKLALNMATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALI 252
           ++ RKRK G F+ T+L  ++AT  T+  +       G +IVETNFRVYAYT S L++A+I
Sbjct: 222 IFIRKRKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVYAYTSSLLQLAII 281

Query: 253 GLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------VSCPLP 305
            LF E+ YRF +++VG++TR+SVR AL+ GITAAQII FLR +A P        V+C LP
Sbjct: 282 ALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNC-LP 340

Query: 306 PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
             V+DQIRLWE ER R+   +  +YS F S+ +F  + +YA+   +L W + +++ V+V 
Sbjct: 341 ITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLWSDNQQKLVIVN 400

Query: 366 KQGHSDIKKFW 376
           + GH  +++++
Sbjct: 401 EDGHELVRQWY 411


>gi|323302721|gb|EGA56527.1| Tfb2p [Saccharomyces cerevisiae FostersB]
          Length = 505

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 149/467 (31%), Positives = 233/467 (49%), Gaps = 94/467 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
           L  +AK +++ ++F E  VP   +  W++     + + A  ++                 
Sbjct: 39  LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSLHLLIPNKSSGTLMI 98

Query: 46  ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
            L    K++L   L GG+  + S  + V+     LD L++Y+  +WE IL FMVG+   +
Sbjct: 99  NLNPTFKISLRNALTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAK 157

Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
             S   +  L +      V       IT +GFQFLL E  +Q+W  +L+YL+ +E+  +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMD 217

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LV+ L F+F L     G  Y  + +S+  ++ LQ +R++GLV+Q+      FYPTKLAL 
Sbjct: 218 LVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGLVFQKHSNDSIFYPTKLALM 277

Query: 216 M------------ATRGTLKQIRE----------------------------------PG 229
           +            A    L+Q RE                                   G
Sbjct: 278 LTSDTKTIRSASNAMDSVLRQNREEPSVNEDGANGKSTTDITTSDDLNKAGLKNQDIPDG 337

Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
            LIVETNF++Y+Y++S L++A++ LF  L  RF N+ +G +TR+S+R AL +GITA QII
Sbjct: 338 SLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQII 397

Query: 290 GFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGV 328
            +L  HA P +                +C      LPP V DQIRLW+ E +R+ T EG 
Sbjct: 398 AYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGS 457

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           LYS F +  ++  L  YAQD+GVL W+++KK+   ++K+G+S +  F
Sbjct: 458 LYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504


>gi|396459783|ref|XP_003834504.1| similar to RNA polymerase II transcription factor B subunit 2
           [Leptosphaeria maculans JN3]
 gi|312211053|emb|CBX91139.1| similar to RNA polymerase II transcription factor B subunit 2
           [Leptosphaeria maculans JN3]
          Length = 483

 Score =  228 bits (581), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 147/428 (34%), Positives = 217/428 (50%), Gaps = 66/428 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK  V+ LL++   +P   + +W   T  KE ++A   L  L              
Sbjct: 39  LPHLAKSIVMALLYMPTSLPTTDLDAWFKSTARKEKEQALFILDCLHIITQQRNDDGKTS 98

Query: 48  --------RRNAKVALLGGGKPWSM---SAKLEVDSKARDLDFLNQYALERWECILRFMV 96
                   +R+ + AL G G   S    + + E   K   ++FL++Y+  +WE IL ++V
Sbjct: 99  SWALTPGFQRSLRNALEGAGSHCSFGVPATREESGGKKVSVEFLDEYSRGQWEGILYYLV 158

Query: 97  GSQQTEGISADAV---------RTLFNEG------AENPVITKDGFQFLLLETPAQVWYF 141
                 G+ +D++         + L  EG         P ITKDGF F+L ET AQVW  
Sbjct: 159 SG--AAGLRSDSISRAEVGPGTKKLLMEGDLVRVIHGTPRITKDGFSFVLQETNAQVWSL 216

Query: 142 ILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK 201
           ++ YL+ V   G+   E L+FLF L     G DYS   +SD     L  L   GLVY+  
Sbjct: 217 LIVYLKMVHELGMSETEVLSFLFMLGSLDLGRDYSISTLSDTQLQMLDDLSAMGLVYRTS 276

Query: 202 RKAGRFYPTKLAL--------------------NMATRGTLKQIREPGFLIVETNFRVYA 241
           + A  FYPT+LA                     N    GTL    + GF+I+ETN+R+YA
Sbjct: 277 KDARTFYPTRLATTLTSDSGSAMSASSNDIAQANQGNAGTLATANK-GFIIIETNYRLYA 335

Query: 242 YTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVS 301
           YT+S +++A++ LF +L +RFPNL  G LT++SV  A+ +GIT+AQII +L  +A P + 
Sbjct: 336 YTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVSAGITSAQIISYLSTYAHPQMQ 395

Query: 302 ---CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
                +PP V DQIRLWE E ER+  + G L   F S A++  +  YA  LGVL W+ ++
Sbjct: 396 KNVSFIPPTVMDQIRLWEYEGERVEVTPGFLMKDFGSDAEYRDVLGYADALGVLVWRRDE 455

Query: 359 KRTVVVTK 366
           +R   V++
Sbjct: 456 QRCFFVSQ 463


>gi|6325135|ref|NP_015203.1| Tfb2p [Saccharomyces cerevisiae S288c]
 gi|62901134|sp|Q02939.1|TFB2_YEAST RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=General transcription and DNA repair
           factor IIH subunit TFB2; Short=TFIIH subunit TFB2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|1163092|gb|AAB68240.1| Tfb2p: TFIIH subunit [Saccharomyces cerevisiae]
 gi|1778059|gb|AAB40628.1| transcription/repair factor TFIIH subunit Tfb2, partial
           [Saccharomyces cerevisiae]
 gi|151942676|gb|EDN61022.1| transcription initiation factor TFIIH subunit [Saccharomyces
           cerevisiae YJM789]
 gi|190407835|gb|EDV11100.1| RNA polymerase II transcription factor B subunit 2 [Saccharomyces
           cerevisiae RM11-1a]
 gi|256274256|gb|EEU09164.1| Tfb2p [Saccharomyces cerevisiae JAY291]
 gi|259150035|emb|CAY86838.1| Tfb2p [Saccharomyces cerevisiae EC1118]
 gi|285815419|tpg|DAA11311.1| TPA: Tfb2p [Saccharomyces cerevisiae S288c]
 gi|323331200|gb|EGA72618.1| Tfb2p [Saccharomyces cerevisiae AWRI796]
 gi|323335024|gb|EGA76314.1| Tfb2p [Saccharomyces cerevisiae Vin13]
 gi|323346175|gb|EGA80465.1| Tfb2p [Saccharomyces cerevisiae Lalvin QA23]
 gi|323352000|gb|EGA84539.1| Tfb2p [Saccharomyces cerevisiae VL3]
 gi|365762780|gb|EHN04313.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|392295887|gb|EIW06990.1| Tfb2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 513

 Score =  228 bits (581), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 149/467 (31%), Positives = 233/467 (49%), Gaps = 94/467 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
           L  +AK +++ ++F E  VP   +  W++     + + A  ++                 
Sbjct: 39  LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSLHLLIPNKSSGTLMI 98

Query: 46  ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
            L    K++L   L GG+  + S  + V+     LD L++Y+  +WE IL FMVG+   +
Sbjct: 99  NLNPTFKISLRNALTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAK 157

Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
             S   +  L +      V       IT +GFQFLL E  +Q+W  +L+YL+ +E+  +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMD 217

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LV+ L F+F L     G  Y  + +S+  ++ LQ +R++GLV+Q+      FYPTKLAL 
Sbjct: 218 LVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGLVFQKHSNDSIFYPTKLALM 277

Query: 216 M------------ATRGTLKQIRE----------------------------------PG 229
           +            A    L+Q RE                                   G
Sbjct: 278 LTSDTKTIRSASNAMDSVLRQNREEPSVNEDGANGKSTTDITTSDDLNKAGLKNQDIPDG 337

Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
            LIVETNF++Y+Y++S L++A++ LF  L  RF N+ +G +TR+S+R AL +GITA QII
Sbjct: 338 SLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQII 397

Query: 290 GFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGV 328
            +L  HA P +                +C      LPP V DQIRLW+ E +R+ T EG 
Sbjct: 398 AYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGS 457

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           LYS F +  ++  L  YAQD+GVL W+++KK+   ++K+G+S +  F
Sbjct: 458 LYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504


>gi|70982185|ref|XP_746621.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2) [Aspergillus fumigatus Af293]
 gi|66844244|gb|EAL84583.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus fumigatus Af293]
 gi|159122143|gb|EDP47265.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus fumigatus A1163]
          Length = 479

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 145/444 (32%), Positives = 223/444 (50%), Gaps = 66/444 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L ++AK +V+ LL+++ P+P A + +W+     KE   A   L  L              
Sbjct: 38  LPDLAKCFVMALLYLKDPLPAADLEAWVRPGSLKERDNALSILGRLHIMTSTTTSDNIKA 97

Query: 49  --------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
                    + + AL G  +  S     ++ D +A  +  L++Y+  +WE +L +MVG+ 
Sbjct: 98  YTVTDPFASSLRQALTGAEQTQSFGVLSQISDKEAVSIADLDEYSRRQWEGVLGYMVGTS 157

Query: 100 ----QTEGISADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
               Q +   +  V+ L   G    +      IT+DGF F L +   QVW+ ++ Y+E+ 
Sbjct: 158 GLGMQRDVSLSKGVKELLQAGHLVEIRDRRVEITQDGFAFALQDVGTQVWHILVLYVESA 217

Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYP 209
            + G+D VE L+F+F L     G  Y  + ++      L  L +FG+VYQ    A  FYP
Sbjct: 218 AAIGMDSVEVLSFVFFLSSLELGKSYEKKQLTSNQLRTLTDLADFGIVYQETPDATHFYP 277

Query: 210 TKLALNMAT----------------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIG 253
           T+LA  + +                 GT       GF+I+ETN+R+YAYT S L+++LI 
Sbjct: 278 TRLATTLTSDSSTLSNPLAGSLSGPTGTSSSKAGSGFIIIETNYRLYAYTSSPLQISLIA 337

Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------- 300
           LF  L YRFPNL  G +TR SVR A+  GITA QII +L  HA P +             
Sbjct: 338 LFTTLKYRFPNLITGKITRQSVRRAVEMGITADQIISYLSTHAHPQMRKHNVSRSTSNQA 397

Query: 301 SCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
             P   LPP V DQIRLW+ ER+R+  + G L+  F + A++E    YA+++GVL W+++
Sbjct: 398 GMPVSVLPPTVVDQIRLWQLERDRVKATHGFLFRDFNTLAEYEAPCRYAEEIGVLVWKSD 457

Query: 358 KKRTVVVTKQGHSDIKKFWVNHQK 381
           +KR   VT+  H  +  F  + +K
Sbjct: 458 RKRMFFVTR--HHQVAAFLKSRRK 479


>gi|401623322|gb|EJS41426.1| tfb2p [Saccharomyces arboricola H-6]
          Length = 513

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 148/467 (31%), Positives = 233/467 (49%), Gaps = 94/467 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
           L  +AK +++ ++F E  VP   +  W++     + ++A  ++                 
Sbjct: 39  LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQDAIKSMKSLHLLIPNKSSGTLMI 98

Query: 46  ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
            L    K++L   L GG+  + S  + V+     L  L++Y+  +WE IL FMVG+   +
Sbjct: 99  NLNPTFKISLRNALTGGEVRN-SFGIVVEDNVVSLKMLDEYSANKWETILHFMVGTPLAK 157

Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
             S   +  L +      +       IT +GFQFLL E  +Q+W  +L+YL+ +E+  +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEINSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKVIETSKMD 217

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LV+ L F+F L     G  Y  + +SD  ++ LQ +R++GLV+Q+      FYPTKLAL 
Sbjct: 218 LVDVLHFIFMLGALEVGKAYKIDALSDTQRIMLQDMRDYGLVFQKHSNDTIFYPTKLALM 277

Query: 216 M------------ATRGTLKQIRE----------------------------------PG 229
           +            A    L+Q RE                                   G
Sbjct: 278 LTSDTKTIRSASNAMDSVLRQNREEPSANEEGANGKATTDTTTSDDLNKADLKNQDIPDG 337

Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
            LIVETNF++Y+Y++S L++A++ LF  L  RF N+ +G +TR+S+R AL +GITA QII
Sbjct: 338 SLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITAEQII 397

Query: 290 GFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGV 328
            +L  HA P +                +C      LPP V DQIRLW+ E +R+ T EG 
Sbjct: 398 AYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVMTYEGS 457

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           LYS F +  ++  L  YAQD+GVL W+++KK+   ++K+G+S +  F
Sbjct: 458 LYSDFETSQEYNVLNKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504


>gi|156057729|ref|XP_001594788.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980]
 gi|154702381|gb|EDO02120.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 420

 Score =  228 bits (580), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 133/349 (38%), Positives = 196/349 (56%), Gaps = 35/349 (10%)

Query: 52  KVALLGGGKPWSMSAKLEVDSKAR--DLDFLNQYALERWECILRFMV-----GSQQTEGI 104
           ++AL GGG   S       D  A   D+DFL+ +A  +WE IL +MV     GS Q    
Sbjct: 59  RLALTGGGNHQSFGVP-SSDHIAPHVDIDFLDVHARMQWEGILHYMVNNVAAGSGQEGSG 117

Query: 105 SADAVRTLFNEG-----AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
            A +V+ L + G       +  IT+ GF FLL E  AQVW  +L ++E  ES G+D V+ 
Sbjct: 118 PAGSVKKLLDAGKLVTRGRSVGITQAGFSFLLQEANAQVWTLLLLWIENAESMGMDSVDV 177

Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
           L+FLF L     G  YST  ++D     L +L +FGL+Y       +F+PT+LA  + + 
Sbjct: 178 LSFLFMLGSLELGRAYSTTTLTDAQHNMLGNLIDFGLIYLPPSAPTQFFPTRLATTLTSD 237

Query: 220 GT-LKQI------------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT 266
            + L+ +             + GF+I+ETN+R+YAYT+S L++A++ LF +L  R+PN+ 
Sbjct: 238 ASALRTVSAGFDAASKSAANQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMV 297

Query: 267 VGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--------SCP-LPPVVSDQIRLWEG 317
            G ++RDS+R A+  GIT+ QII +L  HA P +          P LPP V DQIRLW+ 
Sbjct: 298 SGRVSRDSIRTAISHGITSDQIITYLSTHAHPQLVKASTAAHGGPVLPPTVVDQIRLWQL 357

Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
           E ER+    G L+ +F  Q ++E    YA+++GVL W+N+ KR   VT+
Sbjct: 358 ENERMKAVPGFLFKEFDGQKEYEGCAKYAEEVGVLVWKNDAKRMFFVTR 406


>gi|448103836|ref|XP_004200137.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
 gi|359381559|emb|CCE82018.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
          Length = 512

 Score =  227 bits (579), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 147/473 (31%), Positives = 218/473 (46%), Gaps = 93/473 (19%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK Y++ +LF E P+    ++ W   T  K   EA   L  L              
Sbjct: 40  LPPLAKFYIMSMLFYEHPIALKDMSKWCKPTAKKIQFEALKRLRSLHLIEEHNKGSHLKL 99

Query: 48  ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R+N +  L G     +  +     D    D+ FL+ +A ++WE IL FMVG++ + 
Sbjct: 100 HQTFRKNFRDCLTGSQTSNAFGNLCTSPDKNHIDVKFLDMFASQKWESILHFMVGTELSV 159

Query: 103 GISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
             S   +  L + G        A N  IT  GFQFLL +   Q+W  +L+YL   +   +
Sbjct: 160 APSKSVLSLLKSGGLMEGPGTSASNLKITNKGFQFLLQDVNVQIWTLLLQYLSLTQDLNM 219

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           D V+ L F+F L     G  Y+   +S+     L  L+++GL+YQ      RFYPT+LA 
Sbjct: 220 DPVDVLNFIFVLGCLELGKSYTVSSLSETQISMLADLKDYGLIYQNHNDKERFYPTRLAT 279

Query: 215 NMATRGT--------LKQIREP---------------GFLIVETNFRVYAYTDSNLKVAL 251
            + +           L Q+ +                G +I+ETNF++YAYT+S L++A+
Sbjct: 280 TLTSEAAALRTPSMALNQVVDTANSEATEEQMFSNTGGSIILETNFKIYAYTNSPLEIAI 339

Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC--------- 302
           + LF  L  RFPN+  G +TR+S+R AL +GITA QII FL  HA   +           
Sbjct: 340 LNLFVHLKTRFPNMVTGQITRESIRKALFNGITADQIIKFLESHAHSQMKQMAKENLDKK 399

Query: 303 ---------------------------------PLPPVVSDQIRLWEGERERLTTSEGVL 329
                                             LPP V DQI+LW+ E +R+ T EG L
Sbjct: 400 IEFDANNNINTATDNQQLSVGEFGSANSYHKLEVLPPTVVDQIKLWQLELDRIQTFEGYL 459

Query: 330 YSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
           +  F SQ +++ LR YA +LGVL W ++ ++   VTK G S +  F  N  +G
Sbjct: 460 FKDFSSQQEYDALRSYASELGVLLWSDKNRKKFFVTKDGMSQVADF-ANKNRG 511


>gi|67541703|ref|XP_664619.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
 gi|40742471|gb|EAA61661.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
 gi|259483674|tpe|CBF79257.1| TPA: TFIIH and nucleotide excision repair factor 3 complexes
           subunit (Tfb2), putative (AFU_orthologue; AFUA_4G04360)
           [Aspergillus nidulans FGSC A4]
          Length = 482

 Score =  227 bits (579), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 146/444 (32%), Positives = 224/444 (50%), Gaps = 73/444 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW-----LSKTHAKEHKEATDTLTELR-------- 48
           L ++AK +V+ LL+++ P+P A + +W     L  +  +E   A   L  L         
Sbjct: 37  LPDLAKCFVMALLYLKDPLPAADLETWVRSESLRLSFLRERDSALSILGRLHILTNTTTS 96

Query: 49  -------------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRF 94
                         + + AL G  +  S      + D +   +  L++YA  +WE +L +
Sbjct: 97  GNVRAYMVTDPFAASLRQALTGAKETQSFGVLHHIPDDQTVPIHDLDEYARRQWEGVLGY 156

Query: 95  MVGSQQTEGISAD-----AVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFIL 143
           MVG+    GI  D      V+ L   G    +      IT+DGF F+L +   QVW+ ++
Sbjct: 157 MVGTSGL-GIQRDVNLSKGVKQLLQAGHLVEIRDRRVEITQDGFAFVLQDVGTQVWHILI 215

Query: 144 KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRK 203
            Y+E+ E+ G+D VE L+F+F L     G  Y  + ++      L  L +FG+VYQ   +
Sbjct: 216 LYVESAEAIGMDSVEVLSFIFLLSSLELGKSYEKKHLTSNQLRTLTDLADFGIVYQDSPE 275

Query: 204 AGRFYPTKLALNMATRGTL------KQIREP----------GFLIVETNFRVYAYTDSNL 247
           A  FYPT+LA  + +  +         + +P          GF+I+ETN+R+YAYT S L
Sbjct: 276 ASHFYPTRLATTLTSDSSALSNPISGALSDPDGGDSNQPGSGFIIIETNYRLYAYTSSPL 335

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------- 300
           +++LI LF  L YRFPNL  G +TR S+R A+  GITA QII +L  HA P +       
Sbjct: 336 QISLIALFTTLKYRFPNLVTGKVTRQSIRRAIEMGITADQIISYLATHAHPQMRKHNVAR 395

Query: 301 ---------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
                       LPP V DQIRLW+ ER+R+  + G L+  FVS A++E    YA+++GV
Sbjct: 396 STSNQAGMPPSVLPPTVVDQIRLWQLERDRVKATAGFLFKDFVSLAEYEAPCRYAEEIGV 455

Query: 352 LTWQNEKKRTVVVTKQGHSDIKKF 375
           L W++++KR   VT+  H  +  F
Sbjct: 456 LIWKSDRKRMFFVTR--HEQVAAF 477


>gi|328863850|gb|EGG12949.1| hypothetical protein MELLADRAFT_46475 [Melampsora larici-populina
           98AG31]
          Length = 454

 Score =  227 bits (579), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 127/342 (37%), Positives = 197/342 (57%), Gaps = 17/342 (4%)

Query: 48  RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISAD 107
           ++N ++AL GGG   S     +  SK  DL  L+QYA + WE IL FMVGS  +   S+ 
Sbjct: 106 KQNFQLALTGGGDHKSFGLPCKTSSKV-DLLTLDQYAKQNWETILHFMVGSNWSNRPSSK 164

Query: 108 AVRTLFNEG---AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLF 164
            +  L   G   ++N  IT  GFQFLL +   Q+W  +L+YL+  E+  LD+V+ L+ LF
Sbjct: 165 IITLLTFSGLISSQNSKITSVGFQFLLDDINTQLWELLLQYLKMAEANDLDIVDVLSCLF 224

Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRG---- 220
            L     G +YS +  SD     L  L ++GLVY  + K   FYPT+L   + +      
Sbjct: 225 MLGSLELGKEYSMKNFSDTQVQVLNDLVDYGLVYLTQSKT--FYPTRLVTTLTSTAPPIV 282

Query: 221 ---TLKQIREPG-FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVR 276
              + +   +P  FLI+ETN+R+YAYT + L++A++ LF     RFPNL +G +TR+S++
Sbjct: 283 SNPSDQSSSDPNEFLILETNYRIYAYTSNPLQIAILNLFISFKSRFPNLVIGSITRESIK 342

Query: 277 AALRSGITAAQIIGFLRLHA---LPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
            A ++GI A QII +L  H+   +  +   LP  V DQIRLWE E+ R+   EG LY  F
Sbjct: 343 MAFKNGIRADQIISYLEFHSHSQMKLLKPILPNTVVDQIRLWELEKNRVRDQEGYLYEDF 402

Query: 334 VSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            S +D+E + +Y++ + ++ W+N + +   V+  GH+ +++F
Sbjct: 403 KSVSDYEIVINYSKQIDIILWENPELKKFFVSLDGHTALREF 444


>gi|392901990|ref|NP_001255863.1| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
 gi|26985915|emb|CAB70230.2| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
          Length = 481

 Score =  227 bits (579), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 139/377 (36%), Positives = 216/377 (57%), Gaps = 36/377 (9%)

Query: 37  HKEATDTLTELRRNAK-----VALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECI 91
           H++ TD    +  + K      A+LG  +  S+   LE +   R    + + A+ERW+CI
Sbjct: 95  HRQTTDGKLNIDTDYKRSYMYAAMLGAAQISSLV--LEPNEGNRRGKDVEKKAVERWDCI 152

Query: 92  LRFMV--GSQQTEGISADAVRTLFNE-----GAENPV-ITKDGFQFLLLETPAQVWYFIL 143
           LR++     + T+ +S +  R LF +     G +  + IT  GFQFLLL    Q+W +++
Sbjct: 153 LRYLALPSEENTQAVS-ETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVI 211

Query: 144 KYLETVESKGLDLVECLTFLFQL------KFSTFGTDYSTEGM-SDKLQVFLQHLREFGL 196
           +YL+   S+G D+VE +  L Q+       F      Y  +   ++     L HLRE G+
Sbjct: 212 EYLKLEMSQGQDIVEVIEPLIQIVLLANRGFKAERECYQIDANWTEPQNELLNHLRELGV 271

Query: 197 VYQRKRKAGRFYPTKLALNMATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLKVALI 252
           ++ RKRK G F+ T+L  ++AT  T+  +       G +IVETNFRVYAYT S L++A+I
Sbjct: 272 IFIRKRKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVYAYTSSLLQLAII 331

Query: 253 GLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------VSCPLP 305
            LF E+ YRF +++VG++TR+SVR AL+ GITAAQII FLR +A P        V+C LP
Sbjct: 332 ALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNC-LP 390

Query: 306 PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
             V+DQIRLWE ER R+   +  +YS F S+ +F  + +YA+   +L W + +++ V+V 
Sbjct: 391 ITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLWSDNQQKLVIVN 450

Query: 366 KQGHSDIKKFWVNHQKG 382
           + GH  +++ W   +K 
Sbjct: 451 EDGHELVRQ-WYKREKS 466


>gi|349581695|dbj|GAA26852.1| K7_Tfb2p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 513

 Score =  227 bits (579), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 149/467 (31%), Positives = 232/467 (49%), Gaps = 94/467 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
           L  +AK +++ ++F E  VP   +  W++     + + A  ++                 
Sbjct: 39  LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSLHLLIPNKSSGTLMI 98

Query: 46  ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
            L    K++L   L GG+  + S  + V+     LD L++Y+  +WE IL FMVG+   +
Sbjct: 99  NLNPTFKISLRNALTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAK 157

Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
             S   +  L +      V       IT +GFQFLL E  +Q+W  +L+YL+ +E+  +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMD 217

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LV+ L F+F L     G  Y  + +S   ++ LQ +R++GLV+Q+      FYPTKLAL 
Sbjct: 218 LVDVLHFIFMLGALEVGKAYKIDALSQTQRIMLQDMRDYGLVFQKHSNDSIFYPTKLALM 277

Query: 216 M------------ATRGTLKQIRE----------------------------------PG 229
           +            A    L+Q RE                                   G
Sbjct: 278 LTSDTKTIRSASNAMDSVLRQNREEPSVNEDGANGKSTTDITTSDDLNKAGLKNQDIPDG 337

Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
            LIVETNF++Y+Y++S L++A++ LF  L  RF N+ +G +TR+S+R AL +GITA QII
Sbjct: 338 SLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQII 397

Query: 290 GFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGV 328
            +L  HA P +                +C      LPP V DQIRLW+ E +R+ T EG 
Sbjct: 398 AYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGS 457

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           LYS F +  ++  L  YAQD+GVL W+++KK+   ++K+G+S +  F
Sbjct: 458 LYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 504


>gi|453232318|ref|NP_001263825.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
 gi|412976516|emb|CCO25635.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
          Length = 435

 Score =  227 bits (578), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 137/375 (36%), Positives = 218/375 (58%), Gaps = 39/375 (10%)

Query: 37  HKEATDTLTELRRNAK-----VALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECI 91
           H++ TD    +  + K      A+LG  +  S+   LE +   R    + + A+ERW+CI
Sbjct: 45  HRQTTDGKLNIDTDYKRSYMYAAMLGAAQISSLV--LEPNEGNRRGKDVEKKAVERWDCI 102

Query: 92  LRFMV--GSQQTEGISADAVRTLFNE-----GAENPV-ITKDGFQFLLLETPAQVWYFIL 143
           LR++     + T+ +S +  R LF +     G +  + IT  GFQFLLL    Q+W +++
Sbjct: 103 LRYLALPSEENTQAVS-ETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVI 161

Query: 144 KYLETVESKGLDLVECLTFLFQL-------KFSTFGTD---YSTEGM-SDKLQVFLQHLR 192
           +YL+   S+G D+VE +  L Q+       + + F  +   Y  +   ++     L HLR
Sbjct: 162 EYLKLEMSQGQDIVEVIEPLIQIVLLANRVQVAGFKAERECYQIDANWTEPQNELLNHLR 221

Query: 193 EFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLK 248
           E G+++ RKRK G F+ T+L  ++AT  T+  +       G +IVETNFRVYAYT S L+
Sbjct: 222 ELGVIFIRKRKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVYAYTSSLLQ 281

Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------VS 301
           +A+I LF E+ YRF +++VG++TR+SVR AL+ GITAAQII FLR +A P        V+
Sbjct: 282 LAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVN 341

Query: 302 CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
           C LP  V+DQIRLWE ER R+   +  +YS F S+ +F  + +YA+   +L W + +++ 
Sbjct: 342 C-LPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLWSDNQQKL 400

Query: 362 VVVTKQGHSDIKKFW 376
           V+V + GH  +++++
Sbjct: 401 VIVNEDGHELVRQWY 415


>gi|453232316|ref|NP_001263824.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
 gi|412976517|emb|CCO25636.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
          Length = 485

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 139/381 (36%), Positives = 219/381 (57%), Gaps = 40/381 (10%)

Query: 37  HKEATDTLTELRRNAK-----VALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECI 91
           H++ TD    +  + K      A+LG  +  S+   LE +   R    + + A+ERW+CI
Sbjct: 95  HRQTTDGKLNIDTDYKRSYMYAAMLGAAQISSLV--LEPNEGNRRGKDVEKKAVERWDCI 152

Query: 92  LRFMV--GSQQTEGISADAVRTLFNE-----GAENPV-ITKDGFQFLLLETPAQVWYFIL 143
           LR++     + T+ +S +  R LF +     G +  + IT  GFQFLLL    Q+W +++
Sbjct: 153 LRYLALPSEENTQAVS-ETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVI 211

Query: 144 KYLETVESKGLDLVECLTFLFQL-------KFSTFGTD---YSTEGM-SDKLQVFLQHLR 192
           +YL+   S+G D+VE +  L Q+       + + F  +   Y  +   ++     L HLR
Sbjct: 212 EYLKLEMSQGQDIVEVIEPLIQIVLLANRVQVAGFKAERECYQIDANWTEPQNELLNHLR 271

Query: 193 EFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR----EPGFLIVETNFRVYAYTDSNLK 248
           E G+++ RKRK G F+ T+L  ++AT  T+  +       G +IVETNFRVYAYT S L+
Sbjct: 272 ELGVIFIRKRKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVYAYTSSLLQ 331

Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS-------VS 301
           +A+I LF E+ YRF +++VG++TR+SVR AL+ GITAAQII FLR +A P        V+
Sbjct: 332 LAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVN 391

Query: 302 CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT 361
           C LP  V+DQIRLWE ER R+   +  +YS F S+ +F  + +YA+   +L W + +++ 
Sbjct: 392 C-LPITVADQIRLWEDERRRMNLKDAYIYSHFESEDEFHGVCEYARQQNILLWSDNQQKL 450

Query: 362 VVVTKQGHSDIKKFWVNHQKG 382
           V+V + GH  +++ W   +K 
Sbjct: 451 VIVNEDGHELVRQ-WYKREKS 470


>gi|154273196|ref|XP_001537450.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150415962|gb|EDN11306.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 455

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 152/403 (37%), Positives = 209/403 (51%), Gaps = 53/403 (13%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
           L  +AK +V+ LL+++ P+P A +  W+ K  +K     TD  +   R A   L GG K 
Sbjct: 38  LPYLAKCFVMALLYLKDPLPAADLELWV-KAESKRAYTVTDPFSSSLRQA---LTGGDKQ 93

Query: 62  WSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGS-----QQTEGISADAVRTLFNE 115
            S     L  D  +  +  L+ YA  +WE +L +MVG+     QQ   +S   V+ L   
Sbjct: 94  QSFGVPSLTDDENSMTVAQLDYYARSQWEGVLGYMVGTSALGVQQAVTLS-KGVKQLLQA 152

Query: 116 GAENPV------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFS 169
                V      ITKDGF F+L +   QVW+ ++ Y+E  E  G+D +E L+FLF L   
Sbjct: 153 CHLVEVRDRRVEITKDGFAFVLQDLNTQVWHILILYVENAEQIGMDSIEVLSFLFVLSSL 212

Query: 170 TFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGRFYPTKLALNM----------- 216
             G  Y  + ++      L  L +FG+VYQ    ++A RFYPT+LA  +           
Sbjct: 213 ELGQSYEKKHLTSTQLKTLADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLGSSI 272

Query: 217 -----ATRGTLKQI-REP--GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
                A  G       EP  GF+I+ETN+R+YAYT S L+++LI LF  L YRFPNL  G
Sbjct: 273 SSSLTAPNGVPSATSNEPSTGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITG 332

Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALP---------------SVSCPLPPVVSDQIR 313
            LTR SVR A+  GITA QII +L  HA P                V   LPP V DQIR
Sbjct: 333 KLTRQSVRRAVEMGITADQIISYLTTHAHPQMRKYHATKPGANPVGVHTVLPPTVVDQIR 392

Query: 314 LWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
           LW+ ER+R+  + G L+  FVS A+FE    YA+++GVL  +N
Sbjct: 393 LWQLERDRIKATPGFLFKDFVSLAEFEGPCRYAEEIGVLIQRN 435


>gi|334186626|ref|NP_001190745.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
 gi|332658440|gb|AEE83840.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
          Length = 482

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 129/408 (31%), Positives = 225/408 (55%), Gaps = 35/408 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
           L  +AK YV+++L+++ PVP  ++  W+      +H+ A D L +LR  ++++    G  
Sbjct: 40  LPPLAKKYVLQMLYIDVPVPATMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTS 99

Query: 62  WSMSAKLEVD---------------------SKARDLDFLNQYALERWECILRFMVGSQQ 100
           +S++   + +                      K   L  L  YAL++WEC L  ++ S Q
Sbjct: 100 YSLNPTFQNNLQKHIISGGVLPREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQ 159

Query: 101 TE---GISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
            E   GIS+  ++ +F  G       + P +T+ GFQFLL++T AQ+WY I +Y+   E 
Sbjct: 160 GEKLTGISSSMMK-IFQRGLLSQRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEE 218

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
           + +D  + ++FL +L F   G  Y+   +++     L+ L + GLV  Q+ RK   F PT
Sbjct: 219 RDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPT 278

Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
           KLA N++        R+ GF+++ETNFR+YAY+ S L+  ++ LF  + Y+ PNL    +
Sbjct: 279 KLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAI 338

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEG 327
           T++S+  A  +GIT+ QII FL+ ++ P  +  +P +   V+DQIRLWE + +R+  ++ 
Sbjct: 339 TKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQA 398

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             Y +F S+  FE   D+A++   L W++ K+  +VV  + H+ +++F
Sbjct: 399 HFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREF 446


>gi|42572935|ref|NP_974564.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
 gi|332658438|gb|AEE83838.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
          Length = 462

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 129/408 (31%), Positives = 225/408 (55%), Gaps = 35/408 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
           L  +AK YV+++L+++ PVP  ++  W+      +H+ A D L +LR  ++++    G  
Sbjct: 40  LPPLAKKYVLQMLYIDVPVPATMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTS 99

Query: 62  WSMSAKLEVD---------------------SKARDLDFLNQYALERWECILRFMVGSQQ 100
           +S++   + +                      K   L  L  YAL++WEC L  ++ S Q
Sbjct: 100 YSLNPTFQNNLQKHIISGGVLPREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQ 159

Query: 101 TE---GISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
            E   GIS+  ++ +F  G       + P +T+ GFQFLL++T AQ+WY I +Y+   E 
Sbjct: 160 GEKLTGISSSMMK-IFQRGLLSQRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEE 218

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
           + +D  + ++FL +L F   G  Y+   +++     L+ L + GLV  Q+ RK   F PT
Sbjct: 219 RDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPT 278

Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
           KLA N++        R+ GF+++ETNFR+YAY+ S L+  ++ LF  + Y+ PNL    +
Sbjct: 279 KLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAI 338

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEG 327
           T++S+  A  +GIT+ QII FL+ ++ P  +  +P +   V+DQIRLWE + +R+  ++ 
Sbjct: 339 TKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQA 398

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             Y +F S+  FE   D+A++   L W++ K+  +VV  + H+ +++F
Sbjct: 399 HFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREF 446


>gi|42566894|ref|NP_193435.2| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
 gi|51969678|dbj|BAD43531.1| unnamed protein product [Arabidopsis thaliana]
 gi|115646777|gb|ABJ17114.1| At4g17020 [Arabidopsis thaliana]
 gi|332658439|gb|AEE83839.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
          Length = 452

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 129/408 (31%), Positives = 225/408 (55%), Gaps = 35/408 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
           L  +AK YV+++L+++ PVP  ++  W+      +H+ A D L +LR  ++++    G  
Sbjct: 40  LPPLAKKYVLQMLYIDVPVPATMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTS 99

Query: 62  WSMSAKLEVD---------------------SKARDLDFLNQYALERWECILRFMVGSQQ 100
           +S++   + +                      K   L  L  YAL++WEC L  ++ S Q
Sbjct: 100 YSLNPTFQNNLQKHIISGGVLPREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQ 159

Query: 101 TE---GISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
            E   GIS+  ++ +F  G       + P +T+ GFQFLL++T AQ+WY I +Y+   E 
Sbjct: 160 GEKLTGISSSMMK-IFQRGLLSQRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEE 218

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
           + +D  + ++FL +L F   G  Y+   +++     L+ L + GLV  Q+ RK   F PT
Sbjct: 219 RDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPT 278

Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
           KLA N++        R+ GF+++ETNFR+YAY+ S L+  ++ LF  + Y+ PNL    +
Sbjct: 279 KLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAI 338

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEG 327
           T++S+  A  +GIT+ QII FL+ ++ P  +  +P +   V+DQIRLWE + +R+  ++ 
Sbjct: 339 TKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQA 398

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             Y +F S+  FE   D+A++   L W++ K+  +VV  + H+ +++F
Sbjct: 399 HFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREF 446


>gi|410082501|ref|XP_003958829.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
 gi|372465418|emb|CCF59694.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
          Length = 514

 Score =  225 bits (573), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 142/464 (30%), Positives = 229/464 (49%), Gaps = 93/464 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L +++K +V+ ++F E  V    +  W+      + +EA  ++  L              
Sbjct: 39  LPQLSKFFVMAMVFNENDVSLRDLDRWVKTNGKLQFQEAIKSMKSLHLIIPSKGNGTLMI 98

Query: 48  ------RRNAKVALLGG--GKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                 + + + AL GG     + +    E +     L+FL++Y+  +WE IL FMVG+ 
Sbjct: 99  SLNPTFKESFRNALTGGEINNSFGIVVDNEAEQSRITLNFLDEYSANKWETILHFMVGTP 158

Query: 100 QTEGISADAVRTLFN----EGAENP---VITKDGFQFLLLETPAQVWYFILKYLETVESK 152
                S + +  L N    E A+      IT +GFQFLL E  +Q+W  +L+YL+  ES 
Sbjct: 159 LANIPSHNVLNLLKNSRLMEEADKTGTLKITNEGFQFLLQEINSQIWTLLLQYLKMSESL 218

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
            +D VE L F+F L     G  YS +G+++  +  L+  R++G+++Q+   +  FYPT L
Sbjct: 219 QMDPVEVLNFIFMLGALEIGKAYSIDGLTETQKTMLRDFRDYGIIFQKNLTSKSFYPTNL 278

Query: 213 ALNM-------------ATRGTLKQIREP------------------------------G 229
           A+ +             A    L+Q +E                               G
Sbjct: 279 AIMLTSDTRSILRTASGAMDSVLRQNKEDSKKQTSGDNVTEDTANDIEQIGLENQEGRDG 338

Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
            LI+ETNF++Y+Y++S L++A++ LF  L  RF N+  G +TR+S+R AL +GITA QII
Sbjct: 339 SLIIETNFKLYSYSNSPLQIAILSLFVHLKSRFVNMVTGQITRESIRRALVNGITADQII 398

Query: 290 GFLRLHALPSV-------------SCP--------LPPVVSDQIRLWEGERERLTTSEGV 328
            +L  HA P +               P        LPP V DQI+LW+ E +R+ T +G 
Sbjct: 399 AYLETHAHPRMRRLAEEKLEKKLELDPNSKDTLQILPPTVVDQIKLWQLELDRVITYDGS 458

Query: 329 LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
           LYS F +  ++  L  YAQD+GVL W+++KKR   V+K+G++ +
Sbjct: 459 LYSDFENSTEYNMLSKYAQDIGVLIWKDDKKRKFFVSKEGNTQV 502


>gi|51969958|dbj|BAD43671.1| unnamed protein product [Arabidopsis thaliana]
          Length = 452

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 129/408 (31%), Positives = 225/408 (55%), Gaps = 35/408 (8%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKP 61
           L  +AK YV+++L+++ PVP  ++  W+      +H+ A D L +LR  ++++    G  
Sbjct: 40  LPPLAKKYVLQMLYIDVPVPATMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTS 99

Query: 62  WSMSAKLEVD---------------------SKARDLDFLNQYALERWECILRFMVGSQQ 100
           +S++   + +                      K   L  L  YAL++WEC L  ++ S Q
Sbjct: 100 YSLNPTFQNNLQKHIISGGVLPREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQ 159

Query: 101 TE---GISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
            E   GIS+  ++ +F  G       + P +T+ GFQFLL++T AQ+WY I +Y+   E 
Sbjct: 160 GEKLTGISSSMMK-IFQRGLLSQRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEE 218

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
           + +D  + ++FL +L F   G  Y+   +++     L+ L + GLV  Q+ RK   F PT
Sbjct: 219 RDVDPADLISFLLELSFHVTGQAYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPT 278

Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
           KLA N++        R+ GF+++ETNFR+YAY+ S L+  ++ LF  + Y+ PNL    +
Sbjct: 279 KLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAI 338

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTTSEG 327
           T++S+  A  +GIT+ QII FL+ ++ P  +  +P +   V+DQIRLWE + +R+  ++ 
Sbjct: 339 TKESLYNAFGNGITSDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQA 398

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             Y +F S+  FE   D+A++   L W++ K+  +VV  + H+ +++F
Sbjct: 399 HFYDEFPSKDVFEAACDFAREWRGLLWEDSKRMRLVVKSEVHNQMREF 446


>gi|344229873|gb|EGV61758.1| transcription factor Tfb2 [Candida tenuis ATCC 10573]
          Length = 498

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 147/456 (32%), Positives = 219/456 (48%), Gaps = 84/456 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK +++ +LF E+ V    +  W          E+ + +  L              
Sbjct: 40  LPPMAKFHIMSMLFYEKNVAIKDLEKWSKPNTRHLQIESLERMRSLHLVQLSESGKYIRL 99

Query: 48  ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               ++N    L G   P +  +    V+    DL FL+ +A  +WE IL +MVG++ + 
Sbjct: 100 HPIFKKNFADCLTGNQAPDAFGNLDTTVEKHTVDLKFLDTFASTKWETILHYMVGTELSS 159

Query: 103 GISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
             S   +R L + G        A+   IT  GFQFLL +  AQ+W  +L+YL+      +
Sbjct: 160 TPSRSVLRLLKSGGLMEGNSDYAQELKITNKGFQFLLQDINAQIWTLLLQYLDLTLELQM 219

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           D V+ L F+F L     G  YS   +SD     L  LR+FGLVYQR   + +FYPT+LA 
Sbjct: 220 DAVDVLNFIFLLGSLELGKSYSVSALSDTQIEMLSDLRDFGLVYQRSATSNKFYPTRLAT 279

Query: 215 NMATRGTLKQIREPGF--------------------LIVETNFRVYAYTDSNLKVALIGL 254
            + +      ++ P                      +I+ETNF+VY+YT S L++A++ L
Sbjct: 280 TLTSDS--NSLQTPSMAIDKANSGIDSTDSKQRQESIIIETNFKVYSYTQSPLEIAILNL 337

Query: 255 FCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP---------------- 298
           F  L  RF N+  G +TR+S+R AL +GITA Q+I FL  HA P                
Sbjct: 338 FVHLRTRFANMVTGQITRESIRNALYNGITAGQVIKFLETHAHPQMKMLAQEKLDKKIEF 397

Query: 299 ------------SVSCP-------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADF 339
                       S   P       LPP V DQI+LW+ E +R+ T +G L+ +F +Q D+
Sbjct: 398 DASNNINTATGKSTDAPMQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFREFNNQIDY 457

Query: 340 EKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           E LR+YA ++GVL W ++KKRT  VTK+G   +  F
Sbjct: 458 ESLRNYASEIGVLVWSDDKKRTCFVTKEGLKQVSNF 493


>gi|281208616|gb|EFA82792.1| general transcription factor IIH [Polysphondylium pallidum PN500]
          Length = 458

 Score =  224 bits (572), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 135/396 (34%), Positives = 228/396 (57%), Gaps = 30/396 (7%)

Query: 6   AKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR--------RNAKVALLG 57
            K Y++++L +E  +    ++SW ++++ + H E+ + L +L+         N K ++  
Sbjct: 61  CKQYILKMLLIES-IQVNNVSSWTAQSNKQTHLESLERLEDLKILIKQEFQSNIKESI-T 118

Query: 58  GGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGSQQTEG-ISADAVR----T 111
             K   +SA   V+   +  +D L+ Y+  +WE +L F+    +T   + AD +     T
Sbjct: 119 NMKNVVVSAVERVEPNLQMSVDQLDSYSKSQWEKVLYFLSDDSETPPELIADLLALSNLT 178

Query: 112 LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTF 171
             N+G+   VIT +GF+FLL +   Q+W  I+ YL ++E++G    + L FLF+L F + 
Sbjct: 179 KLNDGS--MVITSEGFKFLLKDIYTQIWTLIIVYLNSLETRGKSRRDALAFLFKLSFLSL 236

Query: 172 GTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR----- 226
           G+ Y    +++  +  L  LR+FGLVY R  K+  FYPT+L +++ T  T+  I+     
Sbjct: 237 GSAYYVNDLTEDEKSLLFDLRQFGLVYIRSEKSEIFYPTRLIISLTTGKTVTVIKDLAKE 296

Query: 227 ------EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALR 280
                 E G++I+ETNFR+YAYT S+L+++L+ LF +++YR PNL+VG+LTR+S+R A  
Sbjct: 297 MSNTQKEQGYIILETNFRIYAYTASSLQISLLSLFVKMLYRLPNLSVGILTRESIRTAFL 356

Query: 281 SGITAAQIIGFLRLHALPSV-SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADF 339
            GITA QI+ F++ +  P++     P +V +QIR+WE ER R+   + VL+  F +Q  F
Sbjct: 357 HGITADQIVDFIKQNGHPNMLKVGAPEIVFEQIRIWENERNRILYKKAVLFDSFPTQESF 416

Query: 340 EKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
                YA+DL    W +E K+ +VV+  G   IK +
Sbjct: 417 NMTLQYAKDLSFYMWASEAKKVLVVSDNGFDAIKNY 452


>gi|448527133|ref|XP_003869438.1| TFIIH and nucleotide excision repair factor 3 complex subunit
           [Candida orthopsilosis Co 90-125]
 gi|380353791|emb|CCG23303.1| TFIIH and nucleotide excision repair factor 3 complex subunit
           [Candida orthopsilosis]
          Length = 500

 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 146/457 (31%), Positives = 221/457 (48%), Gaps = 85/457 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           LS +AK Y++ ++F E+P+    +  W   +  K   EA   L  L              
Sbjct: 39  LSPMAKFYIMSMIFNEKPIAMRDLTKWCKPSAKKMEFEALKRLESLHLIDYDGKGTHVRL 98

Query: 48  ----RRNAKVALLGGGKP---WSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
               R N +  L G   P    S+S   +VD    ++ FL+++A   WE IL FMVG++ 
Sbjct: 99  NSIFRSNFRDCLTGSQDPNAFGSISTGDDVDK--VEISFLDKFASHEWETILHFMVGTEG 156

Query: 101 TEGISADAVRTL----FNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
           T   S+  +  L      EG     IT  GFQFLL +  AQ+W  +L+YL       ++ 
Sbjct: 157 TPTPSSSVLSLLKLGGLMEGESTLNITNTGFQFLLQDANAQIWTLLLQYLNLTSELNMNP 216

Query: 157 VECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL-- 214
           V+ L F+F L     G  YS   +S+     L  L++ GLVYQ+   + +FYPT+LA   
Sbjct: 217 VDVLNFIFILGCLELGKGYSVSNLSETQVSMLADLKDLGLVYQKSDTSNKFYPTRLATTL 276

Query: 215 ----------NMATRGTLKQIREPGF-------LIVETNFRVYAYTDSNLKVALIGLFCE 257
                     +MA +  L++  E          +I+ETNF++YAYT+S L++A++ LF +
Sbjct: 277 TSDSSALKTPSMAVQQALEENEEQMMASNSRESIIIETNFKIYAYTNSPLEIAILNLFVQ 336

Query: 258 LMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------------- 300
           +  RF N+  G +TR+S+R AL +GIT+ QII FL  HA P +                 
Sbjct: 337 MKTRFSNMVCGQITRESIRNALYNGITSDQIIKFLETHAHPQMRALAKEKLDKKVEFDAS 396

Query: 301 -------SCP---------------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD 338
                    P               +PP V DQI+LW+ E +R+ T EG L+  F +Q +
Sbjct: 397 HNINTAGGAPQSKTDGAISQHKLEVIPPNVVDQIKLWQLELDRIQTVEGYLFKDFANQQE 456

Query: 339 FEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           ++ L +YA +LGVL W ++ KR   VTK G + +  F
Sbjct: 457 YDTLSNYATELGVLVWGDKVKRKFFVTKDGMAQVADF 493


>gi|119179887|ref|XP_001241468.1| hypothetical protein CIMG_08631 [Coccidioides immitis RS]
 gi|392866653|gb|EAS30172.2| transcription factor Tfb2 [Coccidioides immitis RS]
          Length = 487

 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 153/444 (34%), Positives = 222/444 (50%), Gaps = 72/444 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
           L  +AK +V+ LL+++ P+P A +  W                     LS T   +H  A
Sbjct: 38  LPHLAKCFVMALLYLKDPLPAAELELWVKAGSKRERDNALSILSRLHILSSTTTSDHVRA 97

Query: 41  TDTLTELRRNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
                    + + AL GG K  S      + D     +  L+ YA  +WE +L +MVG+ 
Sbjct: 98  YMVTNPFSSSLRQALTGGDKQHSFGVMSSMPDPHPMTVSDLDDYARRQWEGVLGYMVGTN 157

Query: 100 ----QTEGIS-ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
               Q E ++ +  V++L        V      ITK+GF F+L +   QVW+ ++ Y+E 
Sbjct: 158 SLGIQRENVTLSKGVKSLLQACHLVEVRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
            E+ G+D VE L+FLF L     G  Y  + ++      L  L +FG+VYQ     ++ R
Sbjct: 218 AEAIGMDSVEVLSFLFLLSSLELGQSYDKKHLTSTQLRTLADLTDFGIVYQHSPASESTR 277

Query: 207 FYPTKLAL-----NMATRGTLKQIREP--------------GFLIVETNFRVYAYTDSNL 247
           FYPT+LA      +MA   ++     P              GF+I+ETN+R+YAYT S L
Sbjct: 278 FYPTRLATTLTSDSMALSSSISGNLAPAGPNINAATGAPGTGFIIIETNYRLYAYTSSPL 337

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------- 300
           +++LI LF  L YRFPNL  G LTR S+R A+  GITA QII +L  HA P +       
Sbjct: 338 QISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGITADQIISYLTTHAHPQMRKVNASK 397

Query: 301 ---------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
                    +  LPP V DQIRLW+ ER+RL  + G L+  F S A+FE    YA+++GV
Sbjct: 398 STSTTAGLLASVLPPTVVDQIRLWQLERDRLKATPGFLFKDFASSAEFEAPCRYAEEIGV 457

Query: 352 LTWQNEKKRTVVVTKQGHSDIKKF 375
           L W++EK+R   VT+  H  +  F
Sbjct: 458 LVWKSEKRRMFFVTR--HEQVAAF 479


>gi|365986883|ref|XP_003670273.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
 gi|343769043|emb|CCD25030.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
          Length = 522

 Score =  224 bits (570), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 149/471 (31%), Positives = 226/471 (47%), Gaps = 101/471 (21%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L +++K +V+ ++F    V    +  W+      + ++A  ++  L              
Sbjct: 39  LPQLSKFFVMSMVFNNNEVSLRDLDRWVKPDAKLQFQDAIKSMKSLHLIIPSKGNGGPLM 98

Query: 48  -------RRNAKVALLGGGKPWSMSAKLEVDSKARDLD-FLNQYALERWECILRFMVGSQ 99
                  R + K AL GG    S     + D         LN+Y+  +WE IL FMVG+ 
Sbjct: 99  IILNPTFRESFKNALTGGQINNSFGIISDEDENGVVTSTLLNEYSANKWETILHFMVGTP 158

Query: 100 QTEGISADAVRTL----FNEGAENP---VITKDGFQFLLLETPAQVWYFILKYLETVESK 152
            +   S   +  L      E  EN     IT +GFQFLL E  +Q+W  +L+YL+  E+ 
Sbjct: 159 MSSIPSGSVLNLLKHTKLMEEVENTGEFKITNEGFQFLLQELNSQLWTLLLQYLKLSETL 218

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
            +D V+ L F+F L     G  YS +G+S+  ++ L+ +R++GLV+Q+   +  FYPT L
Sbjct: 219 NMDPVDVLNFIFMLGALEVGKGYSIDGLSETQKIMLKDMRDYGLVFQKVSNSKTFYPTNL 278

Query: 213 ALNMATRGTLKQIR------------------------EPGF------------------ 230
           AL M T  T   +R                        EPG                   
Sbjct: 279 AL-MLTSDTKSIVRTASGAIESVLNENRSGSNANENGYEPGTKKKNENTIMGTIDQVGMK 337

Query: 231 --------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSG 282
                   LIVETNF++Y+Y++S L++A++ LF  L +RF N+  G +TR+S+R AL +G
Sbjct: 338 NQDVPDGSLIVETNFKLYSYSNSPLQIAILSLFVHLKFRFVNMVTGQITRESIRRALING 397

Query: 283 ITAAQIIGFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERER 321
           ITA QII +L  HA P +                +C      LPP V DQIRLW+ E +R
Sbjct: 398 ITAEQIIAYLETHAHPQMRRLAEEKLEKKLELDANCKDSLQILPPTVVDQIRLWQLELDR 457

Query: 322 LTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
           +   EG LYS F +  ++  L  YAQD+GVL W+++KKR   V+K+G+S +
Sbjct: 458 VIAYEGSLYSDFENNTEYTTLYKYAQDIGVLLWKDDKKRKFFVSKEGNSQV 508


>gi|406862938|gb|EKD15987.1| tfiih and nucleotide excision repair factor 3 complexes subunit
           [Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
          Length = 421

 Score =  223 bits (569), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 139/391 (35%), Positives = 207/391 (52%), Gaps = 45/391 (11%)

Query: 27  SWLSKTH-------AKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKAR--DL 77
           S LS+ H       +++  +        R + ++AL GGG+  S       DS A   D 
Sbjct: 23  SLLSRLHIVTITAPSRDDPQKVSLTKNFRASLRLALTGGGEQQSFGIPSH-DSTASGVDA 81

Query: 78  DFLNQYALERWECILRFMVGS-----QQTEGISADAVRTLFNEGA-------ENPVITKD 125
           +FL+ YA ERWE IL F+V S     +Q       +VR L   G            IT+ 
Sbjct: 82  EFLDNYARERWEDILHFVVNSVGNSMRQDGTGPPTSVRLLLEAGKLVTTGRRSGGGITQA 141

Query: 126 GFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQ 185
           GF FLL E  AQVW  +L ++E  E   L+ VE L+FLF L     G  YST  +S   +
Sbjct: 142 GFSFLLQEVNAQVWALLLLWIENAEKMELESVEVLSFLFMLGSLELGKSYSTATLSASQR 201

Query: 186 VFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-LKQIRE------------PGFLI 232
             L++L +FGLVY       +F+PT+L+  + +  + L+ +               GF+I
Sbjct: 202 AMLKYLVDFGLVYSPSSTPQQFFPTRLSTTLTSDASGLRSVSAGFDDALKSESGTKGFII 261

Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
           +ETN+R+YAYT+S L++A++ LF ++  R+PN+  G +TR SV  A+  GIT+ QII +L
Sbjct: 262 IETNYRLYAYTNSPLQIAVLALFTKMGVRYPNMVTGRVTRKSVANAISHGITSDQIISYL 321

Query: 293 RLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
           R HA P +        +  LPP V DQIRLW+ E ER+  + G L+ +F +Q ++E    
Sbjct: 322 RAHAHPQLVKAAAVNGNPVLPPTVVDQIRLWQIENERMKATAGFLFKEFETQKEYEGCAR 381

Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           YA + GVL W+N+ KR   VT+  H  ++ +
Sbjct: 382 YADENGVLVWKNDAKRMFFVTR--HEQLRDY 410


>gi|146414792|ref|XP_001483366.1| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 498

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 147/455 (32%), Positives = 215/455 (47%), Gaps = 83/455 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------LSKTHAKEHKEATDTL-- 44
           L+ IAK +++ ++F E+P     +  W               L   H  E     + L  
Sbjct: 39  LTPIAKFFIMLMIFNERPQSVRDLDKWCRPQSKSLQYDALKRLKSLHLVEEDAGGNFLRL 98

Query: 45  -TELRRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R+N +  L G  +  +  +   E D     + FL+ +AL++WE IL +MVG++   
Sbjct: 99  HMTFRKNFRDCLTGNQEATAFGNLCTEDDGHKVTVAFLDSFALQKWETILHYMVGTELPS 158

Query: 103 GISADAVRTL----FNEG----AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
             S   +  L      EG    A N  IT  GFQFLL +  AQ+W  +L+YL   +   +
Sbjct: 159 KPSPSVLSLLKLGGLMEGPGTDARNLKITNSGFQFLLQDMNAQIWTLLLQYLNLTQDLNM 218

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           D V+ L FLF L     G  Y   G+SD     L  LR++GLVYQ   ++ RFYPT+LA 
Sbjct: 219 DPVDVLNFLFVLGSLELGKSYYLSGLSDTQVSMLADLRDYGLVYQYSDQSDRFYPTRLAT 278

Query: 215 NMATRGTLKQIREP--------------------GFLIVETNFRVYAYTDSNLKVALIGL 254
            + +      ++ P                    G +I+ETNF++YAYT+S L++A++ L
Sbjct: 279 TLTSESA--SLKTPSMALDQAVDTSNGLDSGPGQGSIILETNFKLYAYTNSPLEIAILNL 336

Query: 255 FCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC------------ 302
           F  L  RF N+  G +TR+S+R AL +GIT+ QII FL  HA P +              
Sbjct: 337 FVNLRTRFSNMVSGQITRESIRNALYNGITSDQIINFLETHAHPQMRALAKERLDKKVEF 396

Query: 303 ----------------------PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
                                  LPP V DQI+LW+ E +R+   EG L+ +F SQ +F+
Sbjct: 397 DASNNINTAGGGQNSSAQHKLEILPPTVVDQIKLWQLELDRIQAFEGYLFKEFASQQEFD 456

Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            L +YA D+GVL W +  K    VTK G S +  F
Sbjct: 457 VLCNYASDVGVLIWSDRTKMRFFVTKDGISQVADF 491


>gi|303321033|ref|XP_003070511.1| Transcription factor tfb2 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|240110207|gb|EER28366.1| Transcription factor tfb2 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|320036051|gb|EFW17991.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Coccidioides posadasii str. Silveira]
          Length = 487

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 153/444 (34%), Positives = 221/444 (49%), Gaps = 72/444 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
           L  +AK +V+ LL+++ P+P A +  W                     LS T   +H  A
Sbjct: 38  LPHLAKCFVMALLYLKDPLPAAELELWVKAGSKRERDNALSILSRLHILSSTTTSDHVRA 97

Query: 41  TDTLTELRRNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
                    + + AL GG K  S      + D     +  L+ YA  +WE +L +MVG+ 
Sbjct: 98  YMVTNPFSSSLRQALTGGDKQHSFGVMSSMPDPHPMTVSDLDDYARRQWEGVLGYMVGTN 157

Query: 100 ----QTEGIS-ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
               Q E ++ +  V++L        V      ITK+GF F+L +   QVW+ ++ Y+E 
Sbjct: 158 SLGIQRENVTLSKGVKSLLQACHLVEVRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
            E+ G+D VE L+FLF L     G  Y  + ++      L  L +FG+VYQ     ++ R
Sbjct: 218 AEAIGMDSVEVLSFLFLLSSLELGQSYDKKHLTSTQLRTLADLTDFGIVYQHSPASESTR 277

Query: 207 FYPTKLAL-----NMATRGTLKQIREP--------------GFLIVETNFRVYAYTDSNL 247
           FYPT+LA      +MA    +     P              GF+I+ETN+R+YAYT S L
Sbjct: 278 FYPTRLATTLTSDSMALSSPISGNLAPAGPNINAATGAPGTGFIIIETNYRLYAYTSSPL 337

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV------- 300
           +++LI LF  L YRFPNL  G LTR S+R A+  GITA QII +L  HA P +       
Sbjct: 338 QISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGITADQIISYLTTHAHPQMRKVNASK 397

Query: 301 ---------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
                    +  LPP V DQIRLW+ ER+RL  + G L+  F S A+FE    YA+++GV
Sbjct: 398 STSTTAGLPASVLPPTVVDQIRLWQLERDRLKATPGFLFKDFASSAEFEAPCRYAEEIGV 457

Query: 352 LTWQNEKKRTVVVTKQGHSDIKKF 375
           L W++EK+R   VT+  H  +  F
Sbjct: 458 LVWKSEKRRMFFVTR--HEQVAAF 479


>gi|156839389|ref|XP_001643386.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156113994|gb|EDO15528.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 504

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 136/456 (29%), Positives = 224/456 (49%), Gaps = 86/456 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L ++AK +++ ++F +  +    +  W+  +   + +E+   +  L              
Sbjct: 39  LPQLAKFFIMSMVFNDNEISLRDLDRWVKASAKVQFQESIKCMKSLNLLIPGKEPGSIMV 98

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
                 R++ + AL GG    S    ++ D+       L+ Y+ ++WE IL FMVG+   
Sbjct: 99  NLNPTFRKSLRNALSGGEINNSFGIVVDSDNDLVTRAMLDSYSADKWETILHFMVGTPLV 158

Query: 102 EGISADAVRTLFN--------EGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
             I +D V  L          E      IT +GFQFLL E   Q+W  +L+YL+  E+  
Sbjct: 159 -SIPSDNVLNLLKHSKLMEEVESTGEFKITNEGFQFLLQEVNLQIWTLLLQYLKMAETFR 217

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
           ++ V+ L F+F L    FG  YSTE +S   +  ++ +R++GL++Q+      FYPT+LA
Sbjct: 218 MNPVDVLHFIFMLGALEFGKAYSTEELSATQKTMMKDMRDYGLIFQKNSNTNVFYPTRLA 277

Query: 214 LNMATR------------------------------------GTLKQIREPGFLIVETNF 237
             + +                                     GT  Q  + G L+VETNF
Sbjct: 278 TMLTSDSRTVRNASSAMDSVLTQSKDEPSGATSGSADVDEQVGTQGQNIQDGALVVETNF 337

Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
           ++Y+Y++S L++A++ LF  L  RF N+  G +TR+S+R AL +GITA QII +L  HA 
Sbjct: 338 KLYSYSNSPLQIAILSLFVHLKTRFSNMVTGQITRESIRNALSNGITAEQIIAYLETHAH 397

Query: 298 PSVS---------------------CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
           P +                        LPP V DQI+LW+ E +R+ + EG LYS F ++
Sbjct: 398 PQMRRLAEERLKKKLELDQNSKETLQILPPTVVDQIKLWQLELDRIISYEGSLYSDFDNK 457

Query: 337 ADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
            +++ L  YAQD+GVL W++++KR   V+K+ +S +
Sbjct: 458 QEYDMLYSYAQDIGVLIWKDDRKRKFFVSKESNSQV 493


>gi|190347675|gb|EDK39997.2| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 498

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 145/455 (31%), Positives = 214/455 (47%), Gaps = 83/455 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------LSKTHAKEHKEATDTL-- 44
           L+ IAK +++ ++F E+P     +  W               L   H  E     + L  
Sbjct: 39  LTPIAKFFIMSMIFNERPQSVRDLDKWCRPQSKSLQYDALKRLKSLHLVEEDAGGNFLRL 98

Query: 45  -TELRRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R+N +  L G  +  +  +   E D     + FL+ +A ++WE IL +MVG++   
Sbjct: 99  HMTFRKNFRDCLTGNQEATAFGNLCTEDDGHKVTVAFLDSFASQKWETILHYMVGTELPS 158

Query: 103 GISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
             S   +  L + G        A N  IT  GFQFLL +  AQ+W  +L+YL   +   +
Sbjct: 159 KPSPSVLSLLKSGGLMEGPGTDARNLKITNSGFQFLLQDMNAQIWTLLLQYLNLTQDLNM 218

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           D V+ L FLF L     G  Y   G+SD     L  LR++GLVYQ   ++ RFYPT+LA 
Sbjct: 219 DPVDVLNFLFVLGSLELGKSYYLSGLSDTQVSMLADLRDYGLVYQYSDQSDRFYPTRLAT 278

Query: 215 NMATRGTLKQIREP--------------------GFLIVETNFRVYAYTDSNLKVALIGL 254
            + +      ++ P                    G +I+ETNF++YAYT+S L++A++ L
Sbjct: 279 TLTSESA--SLKTPSMALDQAVDTSNGLDSGPGQGSIILETNFKLYAYTNSPLEIAILNL 336

Query: 255 FCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC------------ 302
           F  L  RF N+  G +TR+S+R AL +GIT+ QII FL  HA P +              
Sbjct: 337 FVNLRTRFSNMVSGQITRESIRNALYNGITSDQIINFLETHAHPQMRALAKERLDKKVEF 396

Query: 303 ----------------------PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
                                  LPP V DQI+LW+ E +R+   EG L+ +F SQ +F+
Sbjct: 397 DASNNINTAGGGQNSSAQHKLEILPPTVVDQIKLWQLELDRIQAFEGYLFKEFASQQEFD 456

Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            L +YA D+GVL W +  K    VTK G S +  F
Sbjct: 457 VLCNYASDVGVLIWSDRTKMRFFVTKDGISQVADF 491


>gi|45187599|ref|NP_983822.1| ADL274Wp [Ashbya gossypii ATCC 10895]
 gi|62901343|sp|Q75B51.1|TFB2_ASHGO RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|44982337|gb|AAS51646.1| ADL274Wp [Ashbya gossypii ATCC 10895]
 gi|374107034|gb|AEY95942.1| FADL274Wp [Ashbya gossypii FDAG1]
          Length = 514

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 137/459 (29%), Positives = 228/459 (49%), Gaps = 85/459 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           LS +AK +++ ++F ++ V    +  W+      +  +A  ++  L              
Sbjct: 48  LSPMAKFFIMSMIFQDEEVSLRDLDRWVKPDAKFQLHDAIKSMKLLHLITEGRSGQPLMV 107

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
                 + + K AL GG    S    +E ++    +  L+ YA ++WE IL FMVG+  T
Sbjct: 108 QLNSIFKESFKNALTGGEVKNSFGNVVEEENDPVTMAMLDSYAADKWETILHFMVGTPLT 167

Query: 102 EGISADAVRTL-------FNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
           +    + +  L        +E ++   IT +GFQFLL +  AQ+W  +L+YL   E+  +
Sbjct: 168 KSPGKNVLSLLRHSKLMEVDESSKELKITNEGFQFLLQDANAQIWTLLLQYLTMAETFQM 227

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           D V+ L  +F +     G  YS  G+S+  +  LQ LR++GLV+Q++    +FYPT+LA 
Sbjct: 228 DPVDVLNLIFMIGALELGKAYSVVGLSETQKTMLQDLRDYGLVFQKQSNLSKFYPTRLAT 287

Query: 215 NM------------ATRGTLKQIRE-------------------------PGFLIVETNF 237
            +            A    L+Q  E                          G LIVETNF
Sbjct: 288 MLTSDVVSIRSASGAVNSVLRQRAEGVDGKVLNGTALGDDDLQAGGEGALDGALIVETNF 347

Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
           ++Y+Y++S L++A++ LF  L  RF N+  G +TR+S+R AL +GITA QII ++  HA 
Sbjct: 348 KLYSYSNSPLQIAILSLFIHLKTRFQNMVTGQITRESIRRALHNGITADQIIAYMETHAH 407

Query: 298 PSV----------------SC-----PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
           P +                +C      LPP V DQI+LW+ E +R+ + +G L+  F + 
Sbjct: 408 PQMRRLAGDNLEKKLELDPNCRDTLQVLPPTVVDQIKLWQLELDRIISYDGYLFRDFDNL 467

Query: 337 ADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            +++ L  YA+D+GVL W ++KK+   V+K+G++ +  F
Sbjct: 468 QEYQVLAQYARDIGVLLWSDDKKKMFFVSKEGNAQVIDF 506


>gi|255713706|ref|XP_002553135.1| KLTH0D09790p [Lachancea thermotolerans]
 gi|238934515|emb|CAR22697.1| KLTH0D09790p [Lachancea thermotolerans CBS 6340]
          Length = 506

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 138/457 (30%), Positives = 227/457 (49%), Gaps = 86/457 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L ++AK +++ L+F E       +  W+      +  EA  ++  L              
Sbjct: 39  LPKLAKFFIMSLVFNETDASLRDLDRWVKSGGKYQFNEAIKSMKSLHLLIEGNSGQPFMI 98

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQT 101
                 R + + AL GG    S    +E +++      L+QYA  +WE IL FMVG+  T
Sbjct: 99  NLNPIFRSSFRNALTGGEVNNSFGNVVEDENETVHTSILDQYAANKWETILHFMVGTPLT 158

Query: 102 EGISADAVR-----TLFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
           +  S + +       L  E +   + IT +GFQFLL +  AQ+W  +L+YL   E+  +D
Sbjct: 159 QTPSRNVLSLLQHSKLMEESSSGELQITNEGFQFLLQDANAQIWALLLQYLRLAETLQMD 218

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL--- 212
            V+ L F+F L     G  YS   +SD  ++ LQ +R++GLV+Q+     +FYPT+L   
Sbjct: 219 PVDVLNFIFMLGALELGKAYSDTNLSDTQKIMLQDMRDYGLVFQKASNTHKFYPTRLTAM 278

Query: 213 -------------ALN-MATRGTLKQIREP----------------------GFLIVETN 236
                        A+N + ++GT     E                       G +I+ETN
Sbjct: 279 LTSDTSSIRSASGAMNSVLSQGTSSSKEEAAAVAETDEETAQVGGTTQNIPDGAVILETN 338

Query: 237 FRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA 296
           F++Y+Y++S L++A++ LF  L  RF N+  G +TR+S+R AL +GITA QII ++  HA
Sbjct: 339 FKLYSYSNSPLQIAVLSLFVHLKSRFSNMVTGQITRESIRRALHNGITADQIIAYMETHA 398

Query: 297 LPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGVLYSQFVS 335
            P +                +C      LPP V DQI+LW+ E +R+ + +G L++ F +
Sbjct: 399 HPQMRRLAEQTLDKKMELDPNCNEGLQILPPTVVDQIKLWQLELDRIISYDGYLFTDFEN 458

Query: 336 QADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
             ++  L  YA+D+GVL W ++KK+   V+++G+S +
Sbjct: 459 FQEYNMLSSYAKDIGVLLWSDDKKKRFFVSQEGNSQV 495


>gi|366991461|ref|XP_003675496.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
 gi|342301361|emb|CCC69129.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
          Length = 516

 Score =  221 bits (563), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 146/467 (31%), Positives = 230/467 (49%), Gaps = 96/467 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L ++AK +V+ ++F ++ V    +  W+      + ++A  ++  L              
Sbjct: 39  LPQLAKFFVMSMVFNDKEVSLRDLDRWVKSNGKMQFQDAIKSMKSLHLIIPSKSNGGPLM 98

Query: 48  -------RRNAKVALLGG--GKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                  R + K AL GG     + + A  + D+    L  L++Y+  +WE IL FMVG+
Sbjct: 99  INLNPTFRESFKNALTGGQVDNSFGIVADDDDDNGVVSLQLLDEYSANKWETILHFMVGT 158

Query: 99  QQTEGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +   SA  +  L +      V       IT +GFQFLL E  +Q+W  +L+YL+  E+
Sbjct: 159 PMSSIPSASVLNLLKHTRLMEEVDHTSEFKITNEGFQFLLQELNSQIWTLLLQYLKMSET 218

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
             +D V+ L F+F L     G  YS +G+S+  ++ L+ +R++GLV+Q+   +  FYPT 
Sbjct: 219 LKMDSVDVLNFIFMLGALEVGKGYSIDGLSETQKIMLKDMRDYGLVFQKYTNSNLFYPTN 278

Query: 212 LALNM-------------ATRGTLKQIREP------------------------------ 228
           LAL +             A    L+  RE                               
Sbjct: 279 LALMLTSDTKSIVRTASGALESVLQNKREDSSKQANGEINKDLITNGDELDQVGYNPQDI 338

Query: 229 --GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAA 286
             G LI+ETNF++Y+Y++S L++A + LF  L  RF N+  G +TR+S+R AL +GITA 
Sbjct: 339 PDGSLIIETNFKLYSYSNSPLQIATLSLFVHLKSRFANMVTGQITRESIRRALINGITAD 398

Query: 287 QIIGFLRLHALPSV------------------SCP---LPPVVSDQIRLWEGERERLTTS 325
           QII +L  HA P +                    P   LPP V DQI+LW+ E +R+ T 
Sbjct: 399 QIIAYLETHAHPQMRRLAEERLEKKLELDPNSKDPLQVLPPTVVDQIKLWQLELDRVITY 458

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDI 372
           EG LYS F + A++  L  YAQD+GVL W+++KK+   V+K+G+S +
Sbjct: 459 EGSLYSDFETIAEYTTLSKYAQDIGVLLWKDDKKKKFFVSKEGNSQV 505


>gi|342319113|gb|EGU11064.1| RNA polymerase II transcription factor B subunit 2 [Rhodotorula
           glutinis ATCC 204091]
          Length = 496

 Score =  221 bits (562), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 131/350 (37%), Positives = 187/350 (53%), Gaps = 25/350 (7%)

Query: 47  LRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISA 106
            R + ++AL GGGK  S  A      +   + FL+ +A  +WE I  FMVGS        
Sbjct: 130 FRDSFRMALTGGGKQGSFGAPAAEQDEEVTVQFLDDHAEVQWETIQHFMVGSDAGAKKPG 189

Query: 107 DAVRTLFNEGA---------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLV 157
           + V +L               N  IT  GFQFLL +   Q+W  +L YLE  +    DLV
Sbjct: 190 EKVLSLLERSGLMYSPTRSLRNMRITSKGFQFLLEDVNTQLWDLLLVYLEGSQ----DLV 245

Query: 158 ECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMA 217
           E + FLF L     G  Y T+ +S      L+ L ++GLVY  +R A  FYPT+LA  + 
Sbjct: 246 ETIGFLFMLGSLELGRAYMTDNLSQIQHGVLRDLADYGLVYLPERNAPIFYPTRLATTLT 305

Query: 218 TRG-----TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
           +       +     E GF+++ETN+++YAYT + L++A++GLF  L  RF N   G +TR
Sbjct: 306 SSAPPLVSSRHSNEEKGFIVLETNYKLYAYTSNPLQIAVLGLFAHLKTRFANFVTGHITR 365

Query: 273 DSVRAALRSGITAAQIIGFLRLHALPSVSCP-------LPPVVSDQIRLWEGERERLTTS 325
           +S+R  L +GITA QII +L   A P +          LP  V DQIRLWE ER R+ T+
Sbjct: 366 ESIRRGLANGITANQIISYLASRAHPQMRAQAGSDDKLLPITVVDQIRLWEHERRRIQTT 425

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           EG LY +F S  D+E + +YA+++G +  +  K R V VT  GH  +++F
Sbjct: 426 EGYLYDEFSSTHDYELVVNYAREIGSVLLELPKARKVFVTADGHQQVREF 475


>gi|154308354|ref|XP_001553513.1| hypothetical protein BC1G_08237 [Botryotinia fuckeliana B05.10]
          Length = 410

 Score =  221 bits (562), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 130/349 (37%), Positives = 196/349 (56%), Gaps = 35/349 (10%)

Query: 52  KVALLGGGKPWSMSAKLEVDSKAR--DLDFLNQYALERWECILRFMV-----GSQQTEGI 104
           ++AL GGG   S       D  A   D+DFL+ +A  +WE IL +MV     GS +    
Sbjct: 49  RLALTGGGNHQSFGVP-SSDHIAPHVDIDFLDTHARMQWEGILHYMVNTVTSGSGKDGNG 107

Query: 105 SADAVRTLFNEG-----AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
            A++V+ L + G          IT+ GF FLL E  AQVW  +L ++E  ES G+D V+ 
Sbjct: 108 PANSVKALLDAGKLVTRGRGIGITQAGFSFLLQEANAQVWTLLLLWIENAESMGMDSVDV 167

Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATR 219
           L+FLF L     G  YST+ +++  +  L +L + GL+Y       +F+PT+LA  + + 
Sbjct: 168 LSFLFMLGSLELGRAYSTKTLTEAQKGMLANLIDLGLIYLPPSAPTQFFPTRLATTLTSD 227

Query: 220 GT-LKQI------------REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT 266
            + L+ +             + GF+I+ETN+R+YAYT+S L++A++ LF +L  R+PN+ 
Sbjct: 228 ASALRTVAAGFDAASKSAASQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMV 287

Query: 267 VGVLTRDSVRAALRSGITAAQIIGFLRLHALP--------SVSCP-LPPVVSDQIRLWEG 317
            G ++RDS+R A+  GIT+ QII +L  HA P        S   P LPP V DQIRLW+ 
Sbjct: 288 SGRVSRDSIRTAIAHGITSDQIITYLSTHAHPQLVKASSASHGGPVLPPTVVDQIRLWQL 347

Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
           E ER+    G L   F +Q ++E    YA+++GVL W+++ KR   VT+
Sbjct: 348 ENERMKAVPGFLMKDFETQKEYEGCAKYAEEVGVLVWKSDAKRMFFVTR 396


>gi|255936123|ref|XP_002559088.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211583708|emb|CAP91723.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 483

 Score =  220 bits (561), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 148/448 (33%), Positives = 223/448 (49%), Gaps = 68/448 (15%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR--------RNAK 52
           +L  +AK +V+ LL+++ P+P A +  W++    KE   A   L  L          N +
Sbjct: 40  DLVNVAKCFVMALLYLKDPLPTADLEVWVNSESKKERDNALSILGRLHILSTTLTGDNVR 99

Query: 53  VALLGGGKPWSMSAKLEVDSKARDLDF----------------LNQYALERWECILRFMV 96
             ++    P++ S +  +        F                L++YA  +WE ++ +MV
Sbjct: 100 AYMIT--NPFAASLRQALTGAEDSHSFGVFSQAPEHTLTSITDLDEYARRQWEGVMGYMV 157

Query: 97  GSQQTEGIS-----ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKY 145
           G+    G       +  V+ L   G           ITKDGF F+L +   QVW+ ++ Y
Sbjct: 158 GTSALSGQRDMVNLSKGVKQLLQAGHLVEIRGNRVDITKDGFGFVLQDVNTQVWHILILY 217

Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
           +E+ E+ G+D VE L+FLF L     G+ Y    M+      L  L +FG+VYQ    A 
Sbjct: 218 VESAEAIGMDSVEVLSFLFLLSSLELGSSYDKSHMTPNQLRTLMDLADFGIVYQEHADAT 277

Query: 206 RFYPTKLALNMATRGTLKQIREPG---------------FLIVETNFRVYAYTDSNLKVA 250
           RFYPT+LA  + +  +       G               F+I+ETN+R+YAYT S L+++
Sbjct: 278 RFYPTRLATTLTSDSSALSNPVTGSLTGPAGPSGGSGSGFIIIETNYRLYAYTSSPLQIS 337

Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP-----SVSCP-- 303
           LI LF  L YRFPNL  G +TR SVR A+  GITA QII +L  HA P     S + P  
Sbjct: 338 LISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQIISYLLSHAHPQLRKHSAAQPNG 397

Query: 304 -------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
                  LPP V+DQIRLW+ ER+RL  + G L+  F S A+++    YA ++GVL W++
Sbjct: 398 KGVPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDFTSLAEYQAPCQYAAEIGVLVWKS 457

Query: 357 EKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
           ++KR   VT+  H  +  F  + + G +
Sbjct: 458 DRKRMFFVTR--HEQVAAFLRSQKVGGK 483


>gi|167997877|ref|XP_001751645.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697626|gb|EDQ83962.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 462

 Score =  220 bits (561), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 144/419 (34%), Positives = 211/419 (50%), Gaps = 38/419 (9%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK YV+RLL+VE  VP      W       +H+ A D L +LR             
Sbjct: 40  LPPLAKQYVLRLLWVENAVPLKSWQEWARPEALSKHQIAIDRLEQLRVILPERSDLSNST 99

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--- 96
                    +  + AL  GG P        V ++  +   L  YA+++WE +L  +V   
Sbjct: 100 HYLMNPKLQKQLRQALSTGGGPPRDVVSESVAARVPNSADLENYAMKQWESVLLQLVDCA 159

Query: 97  -----GSQQTEGISADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
                G +    I       L     E+P +T  GFQFLL++T +Q+W  + +Y+ + E 
Sbjct: 160 ADGPAGPKNPFIIKVFQRSGLLTPENESPSLTDLGFQFLLMDTNSQLWQLVREYVTSSED 219

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
           +G D  E + FL +L F   G  YS   +S  LQ  L  L   GLV  Q+  K   + PT
Sbjct: 220 RGTDSGELVGFLLELGFHLVGEAYSVNSLSPALQKVLDELAALGLVKLQQGMKESWYIPT 279

Query: 211 KLALNMA---TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
           KLA N++   +  T  Q  E GF++VETNF+VYAYT S L+  ++  F  L Y+ PNL V
Sbjct: 280 KLASNLSASLSESTDWQSSE-GFVVVETNFKVYAYTSSKLQTEILRCFTRLEYQLPNLVV 338

Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERERLTT 324
             LT++SV  AL SGI+A QII FLR HA P V+  +P V   VSDQ+RLWE +R R+  
Sbjct: 339 ATLTKESVNKALGSGISAEQIISFLRKHAHPHVAQKIPVVPETVSDQLRLWETDRNRVQF 398

Query: 325 SEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
                Y  F + A +E +  +A+DLG L +++   + ++V    H D++++      G+
Sbjct: 399 EPAYFYDDFPTMAIYEAVVAHARDLGGLLFEDASAKRLIVRSDLHEDMRQYIRKQSSGA 457


>gi|149238950|ref|XP_001525351.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
           elongisporus NRRL YB-4239]
 gi|146450844|gb|EDK45100.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
           elongisporus NRRL YB-4239]
          Length = 503

 Score =  220 bits (560), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 144/463 (31%), Positives = 218/463 (47%), Gaps = 93/463 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           LS +AK Y++ ++F E+PV    +  W      K   +A   L  L              
Sbjct: 39  LSSMAKFYIMSMIFTEKPVSMKDLTKWCKPLAKKLEFDALKRLESLHLIEFDSKGTHLRL 98

Query: 48  ----RRNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R N +  L G   P +  S     D    DL FL+ +A ++WE IL FMVG++ T+
Sbjct: 99  NTIFRTNFRDCLTGSQDPNAFGSVSTTADPNKVDLAFLDSFASQKWESILHFMVGTEGTK 158

Query: 103 GISADAVRTLF---------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
             S   +  L           +G  N  IT  GFQFLL +  AQ+W  +L+YL   +   
Sbjct: 159 PPSKSVLSLLKLGGLMEQHEEDGTLN--ITNTGFQFLLQDVNAQIWTLLLQYLNLTQELN 216

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
           +D V+ L F+F L     G  YS   +S+     L  L+++GLVYQ+   + RFYPT+LA
Sbjct: 217 MDAVDVLNFIFILGSLELGNSYSVSNLSETQVSMLADLKDYGLVYQKSDTSSRFYPTRLA 276

Query: 214 LNMATRGTLKQIREPGF----------------------LIVETNFRVYAYTDSNLKVAL 251
             + +      ++ P                        +I+ETNF++YAYT+S L++A+
Sbjct: 277 TTLTSDSA--ALKTPSMAVEQALDETETSLASSTQTKESIIIETNFKMYAYTNSPLEIAI 334

Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV----------- 300
           + LF  +  RF N+  G +TR+S+R AL +GITA QII FL  HA P +           
Sbjct: 335 LNLFVHMKTRFSNMICGQITRESIRNALYNGITADQIIKFLETHAHPQMRALAKERLDKK 394

Query: 301 -------------SCP---------------LPPVVSDQIRLWEGERERLTTSEGVLYSQ 332
                          P               LPP V DQI+LW+ E +R+ T +G L+  
Sbjct: 395 VEFDASHNINTAGGAPQSKTDGAISQHRLEVLPPNVVDQIKLWQLELDRIQTFDGYLFKD 454

Query: 333 FVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           F +Q +++ L +YA ++GVL W ++ K+   VTK G + + +F
Sbjct: 455 FANQQEYDMLSNYAAEIGVLIWADKVKKKFFVTKDGMTQVAEF 497


>gi|212535854|ref|XP_002148083.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
 gi|210070482|gb|EEA24572.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
          Length = 481

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 147/432 (34%), Positives = 222/432 (51%), Gaps = 68/432 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEA----------TDTLTELRRNA 51
           L ++AK  V+ +L++  P+P A +  W+     KE   A          T+T+T  +  A
Sbjct: 38  LPDLAKCLVMAVLYMRDPLPAADLELWVKAESKKERDHAISILGRLHIMTNTMTSNQVRA 97

Query: 52  -----------KVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
                      + AL G     S        D  A  +  L++YA  +WE +L +MVG+ 
Sbjct: 98  YMVTNPFASSLRQALTGAENSQSFGVPCSTPDPNAVSIADLDEYARRQWEGVLGYMVGTS 157

Query: 100 QT---EGIS-ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
                E +S +  V+ L   G    +      IT+DGF F+L +   QVW  I+ Y+E+ 
Sbjct: 158 GPGVRETVSLSRGVKQLLQAGHLVEIHHGRVDITQDGFAFVLQDVNTQVWQIIILYVESA 217

Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS-DKLQVFLQHLREFGLVYQRK--RKAGR 206
           ++  ++ VE L+F+F L     G  Y    ++ D+L+  L  L +FG++YQ     +A R
Sbjct: 218 KALQVNDVEVLSFIFLLSSLELGQSYEKRHLTPDQLRA-LADLTDFGIIYQHAPASEADR 276

Query: 207 FYPTKLALNMATRG--------------TLKQIREPG--FLIVETNFRVYAYTDSNLKVA 250
           FYPT+LA  + +                T     EPG  F+I+ETN+R+YAYT S L+++
Sbjct: 277 FYPTRLATTLTSDSSALSNTMSSSLSAQTNAATGEPGSGFIIIETNYRLYAYTSSPLQIS 336

Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP------------ 298
           LI LF  L YRFPNL  G +TR S+R A+  GITA QII +L  HA P            
Sbjct: 337 LIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQIISYLSTHAHPQMRKEGAVKSTS 396

Query: 299 -SVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
            +   P   LPP V DQIRLW+ ER+R+  + G L+ +F S A++E    YA+++GVL W
Sbjct: 397 NTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEFASLAEYEGPCRYAEEIGVLVW 456

Query: 355 QNEKKRTVVVTK 366
           ++++KR   VT+
Sbjct: 457 KSDRKRMFFVTR 468


>gi|354546081|emb|CCE42810.1| hypothetical protein CPAR2_204530 [Candida parapsilosis]
          Length = 451

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 135/446 (30%), Positives = 210/446 (47%), Gaps = 85/446 (19%)

Query: 13  LLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------------RRNAKVA 54
           ++F E+P+P   +  W   +  +   +A   L  L                  R N +  
Sbjct: 1   MIFNEKPMPMRDLTKWCKPSAKRMEFDALKRLESLHLIEYDSKGTNVRLNTIFRSNFRDC 60

Query: 55  LLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTL- 112
           L G   P +  S   + D+   +  FL+++A   WE IL FMVG++ T   S   +  L 
Sbjct: 61  LTGSQDPNAFGSISTDDDADKIETSFLDKFASHEWENILHFMVGTEGTPTPSRSVLSLLK 120

Query: 113 ---FNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFS 169
                EG     IT  GFQFLL +  AQ+W  +L+YL       ++ V+ L F+F L   
Sbjct: 121 LGGLMEGESELTITNTGFQFLLQDVNAQIWTLLLQYLNLTSESNMNPVDVLNFIFILGCL 180

Query: 170 TFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPG 229
             G  YS   +S+     L  L++ GL+YQ+ + + RFYPT+LA  + +  +   ++ P 
Sbjct: 181 ELGKSYSVLSLSETQVSMLADLKDLGLIYQKSKTSSRFYPTRLATTLTSDSS--ALKTPS 238

Query: 230 F---------------------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVG 268
                                 +I+ETNF++YAYT+S L++A++ LF ++  RF N+  G
Sbjct: 239 MAVEQALEESEGQMMTSTSRESIIIETNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCG 298

Query: 269 VLTRDSVRAALRSGITAAQIIGFLRLHALPSV------------------------SCP- 303
            +TR+S+R AL +GIT+ QII FL  HA P +                          P 
Sbjct: 299 QITRESIRNALYNGITSDQIIKFLETHAHPQMKALAKEKLDKKVEFDASHNINTAGGAPQ 358

Query: 304 --------------LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDL 349
                         +PP V DQI+LW+ E +R+ T EG L+  F +Q +++ L +YA +L
Sbjct: 359 SKTDGTISQHKLEVIPPNVVDQIKLWQLELDRIQTVEGYLFKDFANQQEYDMLSNYATEL 418

Query: 350 GVLTWQNEKKRTVVVTKQGHSDIKKF 375
           GVL W ++ KR   VTK G + +  F
Sbjct: 419 GVLVWGDKAKRKFFVTKDGMAQVADF 444


>gi|327298217|ref|XP_003233802.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           Tfb2 [Trichophyton rubrum CBS 118892]
 gi|326463980|gb|EGD89433.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           Tfb2 [Trichophyton rubrum CBS 118892]
          Length = 484

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 147/442 (33%), Positives = 220/442 (49%), Gaps = 72/442 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
           L  +AK +V+ LL+++ P+P   +  W                     LS T    H  A
Sbjct: 38  LPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILGRLHILSSTTTSNHVRA 97

Query: 41  TDTLTELRRNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                    + + AL+G  +  S     +  D     +  L++YA ++WE +L +MVG  
Sbjct: 98  YMVTEPFSTSLRQALMGEDQQGSFGVISVTPDKYPVSVADLDEYARQQWEAVLGYMVGMS 157

Query: 100 QTEG-----ISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
              G       +  V+ L       E  +  V ITK+GF F+L +   QVW+ ++ Y+E 
Sbjct: 158 SLSGGRETITLSKGVKQLLQACHLVEIRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG--R 206
            E+ G++ VE L+F+F L     G  Y  + ++      L  L +FG+VYQ    +G  R
Sbjct: 218 AEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTLADLTDFGIVYQHTPASGSTR 277

Query: 207 FYPTKLAL-----NMATRGTLKQIREP-----------------GFLIVETNFRVYAYTD 244
           FYPT+LA      ++A  G +    EP                 GF+IVETN+R+YAYT 
Sbjct: 278 FYPTRLATTLTSDSLAMSGPISG--EPTVPTTSAGTTTSGDAGTGFIIVETNYRLYAYTS 335

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---- 300
           S L+++LI LF  L +RFPNL  G +TR SVR A+  GITA QII +L  HA P +    
Sbjct: 336 SPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIISYLTTHAHPQMRKTK 395

Query: 301 -------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
                  S  LPP V DQIRLW+ ER+R+  + G L+ +F + A+FE    YA+++GVL 
Sbjct: 396 KSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAEFEAPCKYAEEIGVLV 455

Query: 354 WQNEKKRTVVVTKQGHSDIKKF 375
           W+++ +R   VT+  H  +  F
Sbjct: 456 WKSDSRRMFFVTR--HEQVAAF 475


>gi|242794214|ref|XP_002482325.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces stipitatus ATCC 10500]
 gi|218718913|gb|EED18333.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces stipitatus ATCC 10500]
          Length = 481

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 148/432 (34%), Positives = 222/432 (51%), Gaps = 68/432 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEA----------TDTLTELRRNA 51
           L ++AK  V+ LL++  P+P A +  W+     KE   A          T+T+T  +  A
Sbjct: 38  LPDLAKCLVMALLYMRDPLPAADLELWIKGESKKERDHAISILGRLHIMTNTMTSNQVRA 97

Query: 52  -----------KVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGSQ 99
                      + AL G     S        D     +  L++YA  +WE +L +MVG+ 
Sbjct: 98  YMVTNPFASSLRQALTGAENSQSFGVPCSTPDPNPVSIADLDEYARRQWEGVLGYMVGTS 157

Query: 100 QT---EGIS-ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLETV 149
                E +S +  V+ L   G    +      IT+DGF F+L +   QVW  I+ Y+E+ 
Sbjct: 158 GPGVRETVSLSRGVKQLLQAGHLVEIHHGRVDITQDGFAFVLQDVNTQVWQIIILYVESA 217

Query: 150 ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS-DKLQVFLQHLREFGLVYQ--RKRKAGR 206
           ++  ++ VE L+F+F L     G  Y  + ++ D+L+  L  L +FG+VYQ     +A R
Sbjct: 218 KALQVNDVEVLSFIFLLSSLELGQSYEKKHLTPDQLRA-LTDLTDFGIVYQYPSAAEADR 276

Query: 207 FYPTKLALNMATRG--------------TLKQIREPG--FLIVETNFRVYAYTDSNLKVA 250
           FYPT+LA  + +                T     EPG  F+I+ETN+R+YAYT S L+++
Sbjct: 277 FYPTRLATTLTSDSSALSNTMSSALSAQTNAATGEPGSGFIIIETNYRLYAYTSSPLQIS 336

Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP------------ 298
           LI LF  L YRFPNL  G +TR S+R A+  GITA QII +L  HA P            
Sbjct: 337 LIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQIISYLSTHAHPQMRKEDAVKSTS 396

Query: 299 -SVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
            +   P   LPP V DQIRLW+ ER+R+  + G L+ +F S A++E    YA+++GVL W
Sbjct: 397 NTAGLPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEFASLAEYEGPCRYAEEIGVLVW 456

Query: 355 QNEKKRTVVVTK 366
           ++++KR   VT+
Sbjct: 457 KSDRKRMFFVTR 468


>gi|323306981|gb|EGA60265.1| Tfb2p [Saccharomyces cerevisiae FostersO]
          Length = 448

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 135/388 (34%), Positives = 199/388 (51%), Gaps = 79/388 (20%)

Query: 49  RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADA 108
           RNA    L GG+  + S  + V+     LD L++Y+  +WE IL FMVG+   +  S   
Sbjct: 60  RNA----LTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAKIPSEKV 114

Query: 109 VRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
           +  L +      V       IT +GFQFLL E  +Q+W  +L+YL+ +E+  +DLV+ L 
Sbjct: 115 LNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMDLVDVLH 174

Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM----- 216
           F+F L     G  Y  + +S+  ++ LQ +R++GLV+Q+      FYPTKLAL +     
Sbjct: 175 FIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGLVFQKHSNDSIFYPTKLALMLTSDTK 234

Query: 217 -------ATRGTLKQIRE----------------------------------PGFLIVET 235
                  A    L+Q RE                                   G LIVET
Sbjct: 235 TIRSASNAMDSVLRQNREEPSVNEDGANGKSTTDITTSDDLNKAGLKNQDIPDGSLIVET 294

Query: 236 NFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
           NF++Y+Y++S L++A++ LF  L  RF N+ +G +TR+S+R AL +GITA QII +L  H
Sbjct: 295 NFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQIIAYLETH 354

Query: 296 ALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGVLYSQFV 334
           A P +                +C      LPP V DQIRLW+ E +R+ T EG LYS F 
Sbjct: 355 AHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGSLYSDFE 414

Query: 335 SQADFEKLRDYAQDLGVLTWQNEKKRTV 362
           +  ++  L  YAQD+GVL W+++ K  +
Sbjct: 415 TSQEYNLLSKYAQDIGVLLWKDDXKEEI 442


>gi|50291141|ref|XP_448003.1| hypothetical protein [Candida glabrata CBS 138]
 gi|62901305|sp|Q6FP41.1|TFB2_CANGA RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|49527314|emb|CAG60954.1| unnamed protein product [Candida glabrata]
          Length = 504

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 143/458 (31%), Positives = 227/458 (49%), Gaps = 86/458 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK +++ ++F ++ +    +  W+      + +EA  ++  L              
Sbjct: 39  LPTLAKFFIMTVIFNDKDISLRDLDRWVKSNGKLQFQEAIKSMKSLHLLIPTRVNGQLLI 98

Query: 48  ------RRNAKVALLGGGKPWSMSAKLEVD--SKARDLDFLNQYALERWECILRFMVGSQ 99
                 R + + AL GG    S    ++ D      +L  L++YA  +WE IL FMVG+ 
Sbjct: 99  NLNPTFRESFRNALTGGEVNNSFGIVVDEDRLDTVVNLAVLDEYAATKWETILHFMVGTP 158

Query: 100 QTEGISADAVRTLFNEG--AENP-----VITKDGFQFLLLETPAQVWYFILKYLETVESK 152
             +  S + +  L +     E P     +IT +GFQFLL E  +Q+W  +L+YL+  ES 
Sbjct: 159 MAKMPSENVLNLLKHSKLMEEVPDSSEFMITNEGFQFLLQEVNSQIWSLLLQYLKLAESL 218

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
            +D V  L F+F L     G  YSTE +S+     L  +R++GLV+Q+      FYPT+L
Sbjct: 219 HMDPVHVLNFIFMLGALETGKAYSTENLSETQLKMLLDMRDYGLVFQKTSNPNIFYPTRL 278

Query: 213 ALNMATRGTLKQIR----------------------------------EPGFLIVETNFR 238
           A  M T  T K +R                                  + G LI+ETNF+
Sbjct: 279 A-QMLTSDT-KSMRTASGAMESVLNKPDDAAKSTDDKYDSLEGKAEDIQDGALIIETNFK 336

Query: 239 VYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP 298
           +Y+Y +S L++A++ LF  L  RF N+  G +TR+S+R AL +GITA Q+I +L  HA P
Sbjct: 337 LYSYCNSPLQIAILSLFVHLKSRFANMVAGQITRESIRRALINGITADQVIAYLESHAHP 396

Query: 299 SV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
            +                +C      LPP V DQI+LW+ E +R+ T EG LY  F +  
Sbjct: 397 QMRRLAEEKLQKKLELDPNCKDPLQVLPPTVVDQIKLWQLELDRVLTYEGSLYIDFDTAQ 456

Query: 338 DFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           DF  L  YAQD+G L W++++KR + V+++G++ + ++
Sbjct: 457 DFNMLCKYAQDIGALLWKDDRKRKLFVSREGNAQVLEY 494


>gi|326483225|gb|EGE07235.1| transcription factor Tfb2 [Trichophyton equinum CBS 127.97]
          Length = 484

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 147/442 (33%), Positives = 219/442 (49%), Gaps = 72/442 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
           L  +AK +V+ LL+++ P+P   +  W                     LS T    H  A
Sbjct: 38  LPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILGRLHILSSTTTSNHVRA 97

Query: 41  TDTLTELRRNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                    + + AL+G  +  S     +  D     +  L++YA ++WE +L +MVG  
Sbjct: 98  YMVTEPFSTSLRQALMGEDQQGSFGVISVTPDKYPVSVADLDEYARQQWEAVLGYMVGMS 157

Query: 100 QTEG-----ISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
              G       +  V+ L       E  +  V ITK+GF F+L +   QVW+ ++ Y+E 
Sbjct: 158 SLSGGRETITLSKGVKQLLQACHLVEIRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG--R 206
            E+ G++ VE L+F+F L     G  Y  + ++      L  L +FG+VYQ    +G  R
Sbjct: 218 AEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTLADLTDFGIVYQHTPASGSAR 277

Query: 207 FYPTKLAL-----NMATRGTLKQIREP-----------------GFLIVETNFRVYAYTD 244
           FYPT+LA      ++A  G +    EP                 GF+IVETN R+YAYT 
Sbjct: 278 FYPTRLATTLTSDSLAMSGPISG--EPAVPTTTSGTTTSGDAGTGFIIVETNHRLYAYTS 335

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---- 300
           S L+++LI LF  L +RFPNL  G +TR SVR A+  GITA QII +L  HA P +    
Sbjct: 336 SPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIISYLTTHAHPQMRKTK 395

Query: 301 -------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
                  S  LPP V DQIRLW+ ER+R+  + G L+ +F + A+FE    YA+++GVL 
Sbjct: 396 KSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAEFEAPCKYAEEIGVLV 455

Query: 354 WQNEKKRTVVVTKQGHSDIKKF 375
           W+++ +R   VT+  H  +  F
Sbjct: 456 WKSDSRRMFFVTR--HEQVAAF 475


>gi|452977455|gb|EME77221.1| hypothetical protein MYCFIDRAFT_146475 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 476

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 138/428 (32%), Positives = 219/428 (51%), Gaps = 51/428 (11%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTH----AKEHKEATDTLTELRRNAKVALLG 57
           L  +AK  V+ +L++  P   A + +W+   H    AKE + A        ++ + AL G
Sbjct: 38  LPHMAKTLVMAMLYMPTPFSVADLDTWVKPEHDSAQAKERRPALRLNPAFSKSLRQALTG 97

Query: 58  GGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGS------------QQTEGI 104
           GG   S        D  +  ++FL+ +A ++WE IL ++VGS              T+ +
Sbjct: 98  GGNHRSFGVPCTTPDKSSVTIEFLDTFARKQWETILYYVVGSANSALSGGMDVHNSTKKL 157

Query: 105 SADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG-LDLVECLTFL 163
             +        G     IT DGF FLL +  AQ+W  ++ YL+T E++  +D V+ L+FL
Sbjct: 158 LENGEFVQMQNGGRQRFITTDGFTFLLQDVNAQIWSLLIVYLQTSETECFMDPVDVLSFL 217

Query: 164 FQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTL- 222
           F L     GT YST  ++      L+ L + GL++    ++ R+YPT+LA  + +     
Sbjct: 218 FTLGSLELGTSYSTSNLTKTQLQMLEDLSDLGLIFH-PEQSDRYYPTRLATTLTSDAPAL 276

Query: 223 ---------------------KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYR 261
                                    E GF+I+ETN+R+YAYT+S L ++++ LF  L  R
Sbjct: 277 LNSSHTSTTTTVSASSNDDLAASANEKGFIILETNYRLYAYTNSPLLISILSLFASLHTR 336

Query: 262 FPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIR 313
           +PNL    +T+ SV+AA+ SGIT+ QII +L+ HA P +       + P LPP V DQIR
Sbjct: 337 YPNLVTAKITKTSVQAAISSGITSNQIISYLQTHAHPILRRTASMHNAPILPPTVVDQIR 396

Query: 314 LWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIK 373
           LW+ E ER+T+++G L  +  S+ D+EK   YA+ LG+L      K++  VT+     +K
Sbjct: 397 LWQIEGERMTSTKGYLIREVGSKEDYEKAVQYAEALGILVKDFAHKQSFFVTRMDQ--MK 454

Query: 374 KFWVNHQK 381
            F+ N  +
Sbjct: 455 IFFANQAQ 462


>gi|315041383|ref|XP_003170068.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
           gypseum CBS 118893]
 gi|311345102|gb|EFR04305.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
           gypseum CBS 118893]
          Length = 484

 Score =  217 bits (552), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 145/442 (32%), Positives = 220/442 (49%), Gaps = 72/442 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
           L  +AK +V+ LL+++ P+P   +  W                     LS T +  H  A
Sbjct: 38  LPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILGRLHILSSTTSSSHVRA 97

Query: 41  TDTLTELRRNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                    + + AL+G  +  S     +  D     +  L++YA ++WE +L +MVG  
Sbjct: 98  YMVTEPFSTSLRQALMGEDQKGSFGVISVTPDKYPVSVADLDEYARQQWEAVLGYMVGMS 157

Query: 100 QTEG-----ISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
              G       +  V+ L       E  +  V ITK+GF F+L +   QVW+ ++ Y+E 
Sbjct: 158 SLSGGRETITLSKGVKQLLQACHLVEIRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG--R 206
            E+ G++ VE L+F+F L     G  Y  + ++      L  L +FG+VYQ    +G  R
Sbjct: 218 AEAIGMESVEVLSFIFLLSSLELGQSYEKKNLTPTQLRTLADLTDFGIVYQHTPASGSTR 277

Query: 207 FYPTKLAL-----NMATRGTLKQIREPG-----------------FLIVETNFRVYAYTD 244
           FYPT+LA      ++A  G +    EP                  F+IVETN+R+YAYT 
Sbjct: 278 FYPTRLATTLTSDSLAMSGPISG--EPTVPAAAAGATTAGDAGTGFIIVETNYRLYAYTS 335

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---- 300
           S L+++LI LF  L +RFPNL  G +TR SVR A+  GITA QI+ +L  HA P +    
Sbjct: 336 SPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIVSYLTTHAHPQMRKTK 395

Query: 301 -------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
                  S  LPP V DQIRLW+ ER+R+  + G L+ +F + A+FE    YA+++GVL 
Sbjct: 396 KSTTNVSSTVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAEFEAPCKYAEEIGVLV 455

Query: 354 WQNEKKRTVVVTKQGHSDIKKF 375
           W+++ +R   VT+  H  +  F
Sbjct: 456 WRSDSRRMFFVTR--HEQVAAF 475


>gi|358057347|dbj|GAA96696.1| hypothetical protein E5Q_03367 [Mixia osmundae IAM 14324]
          Length = 467

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 139/416 (33%), Positives = 215/416 (51%), Gaps = 46/416 (11%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKE-ATDTLTEL------------- 47
           L   A+H ++  L+ E  +PQ  + ++  +   +   + A   L  L             
Sbjct: 47  LPSTAQHLIVNALWQEHDIPQQDLHTYTKRGEGRRQLDTAIAALQRLHIVQLIAGEPTGN 106

Query: 48  ---------RRNAKVALLGGGKPWSMS--AKLEVDSKARDLDFLNQYALERWECILRFMV 96
                    R++ + AL G GK  S    A+   D  A  +  L++YA ++W+ +L  +V
Sbjct: 107 TVFALSHIFRKSLRKALTGAGKDSSFGWFARSSRDENALSIPQLDEYATDQWDSLLHCLV 166

Query: 97  GSQQTEGISADAVRTLFNEGA------ENPVITKDGFQFLLLETPAQVWYFILKYLETVE 150
           GS++++  S   +  L   G       +   IT  GFQFLL +  +Q+W  +L YL+  E
Sbjct: 167 GSERSQQPSKAVIDLLVAAGLLSSGDRDTRRITSLGFQFLLEDVNSQLWSLLLHYLKLSE 226

Query: 151 SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT 210
             G DL E +  +FQ+     G  YS+E ++      L+     GLVY    K+G + PT
Sbjct: 227 DAGTDLKEVIALVFQIGNQELGRVYSSETLNPLQLHILKTFGGLGLVY--VYKSGDYSPT 284

Query: 211 KLALNMAT-------RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
           +LA+ + +        GT ++  E GFLI+ETN+RVYAYT + L++A++ LF  L  RFP
Sbjct: 285 RLAVTLTSGAPPLLKAGTAEE--EQGFLILETNYRVYAYTQNPLQIAVLNLFVALKSRFP 342

Query: 264 NLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERE 320
            L VG++TR+S++A L +GI + QII FL  HA P +      LPP V DQI+LWE E+ 
Sbjct: 343 GLVVGMITRESIKAGLANGIKSDQIIAFLTAHAHPQMRKQEPLLPPTVVDQIKLWEREKN 402

Query: 321 RLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK-KRTVVVTKQGHSDIKKF 375
           R+ T    LY  F SQAD++ + DYA+ +G + W  E   R    T+ GH  ++ F
Sbjct: 403 RVKTEPCFLYDDFRSQADYDLVCDYAKQIGAVLWLGEPGSRRFATTEDGHVQVRGF 458


>gi|384484453|gb|EIE76633.1| hypothetical protein RO3G_01337 [Rhizopus delemar RA 99-880]
          Length = 284

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 115/279 (41%), Positives = 164/279 (58%), Gaps = 40/279 (14%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
           IT  GFQFLL +   QVW F+L+YL+  E   L+L               G +YS + ++
Sbjct: 13  ITNKGFQFLLQDVNTQVWAFLLQYLDMAELGSLEL---------------GENYSVDTLT 57

Query: 182 DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT-RGTLKQIR-------------- 226
                 L+ LR++G+VYQRK+ + R+YPT+LA  + + +  L  +               
Sbjct: 58  QTQLQMLEDLRDYGIVYQRKKHSKRYYPTRLATTLTSGKSALATVAGKYNHMMQETNIDD 117

Query: 227 -------EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAAL 279
                  + GF+I+ETN+++YAYTDS L++A++ LF +L  RF N+  GV+TRDS+R AL
Sbjct: 118 TTDTESVDQGFIILETNYKLYAYTDSPLQIAVLNLFVQLQSRFRNMVTGVITRDSIRNAL 177

Query: 280 RSGITAAQIIGFLRLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
             GITA QII +L+ HA P +   +  LP  V DQIRLWE ER RL  +   LY +F  Q
Sbjct: 178 MKGITAEQIIYYLQSHAHPQMRKETPVLPLTVVDQIRLWEMERNRLKPTPSYLYHEFNVQ 237

Query: 337 ADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           ADF+    YA+DLGVL W N +KRT+ +T+ GH ++K F
Sbjct: 238 ADFDAAEKYARDLGVLLWSNNQKRTMAITEAGHENVKGF 276


>gi|260945437|ref|XP_002617016.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
 gi|238848870|gb|EEQ38334.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
          Length = 511

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 143/468 (30%), Positives = 218/468 (46%), Gaps = 95/468 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK Y++ ++F ++PV    +  W      K   E+   L  L              
Sbjct: 39  LPPLAKFYIVSMVFTDKPVAVKDLEKWCQPQARKLQYESFKRLRALHLIEEDQSGTHVRL 98

Query: 48  ----RRNAKVALLGGGKPWSMSAKLE-VDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
               R N +  L G   P +    +   D    D+ FL+Q+A ++WE IL FMVG++   
Sbjct: 99  HQTFRNNFRDCLTGSQAPNAFGIIVPGSDKHPIDVKFLDQFASQKWETILHFMVGTE-LP 157

Query: 103 GISADAVRTLFNEG---------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKG 153
            + + +V +L   G         A +  IT  GFQFLL +   Q+W  +L+YL   +   
Sbjct: 158 TVPSKSVLSLLKSGGLMEGHGTSAYSLKITNSGFQFLLQDVNVQIWTLLLEYLNLTQELN 217

Query: 154 LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA 213
           +D V+ L F+F L     G  Y+   +SD     L  LR++GLVYQR   + RF+PT+LA
Sbjct: 218 MDPVDVLNFIFILGSLELGKAYAVSSLSDTQLSMLPDLRDYGLVYQRSENSTRFFPTRLA 277

Query: 214 LNMATRGT--------LKQ-IREPGF-----------------LIVETNFRVYAYTDSNL 247
             + +  T        L Q + +PG                  +I+ETNF++YAYT+S L
Sbjct: 278 TTLTSESTGLKTPSMALNQTLEDPGAKDQEHVTSEAGESQSGQIILETNFKLYAYTNSPL 337

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCP---- 303
           ++A++ LF  L  RF N+ +G +TR+S+R AL +GITA QII FL  HA   +       
Sbjct: 338 EIAILNLFVNLKVRFANMVIGQITRESIRKALYNGITADQIIKFLETHAHSQMRTSAKEK 397

Query: 304 ------------------------------------LPPVVSDQIRLWEGERERLTTSEG 327
                                               LPP V DQI+LW+ E +R+ T EG
Sbjct: 398 LDKKIEFDASHNINTAGGAPQSKSSDSGVAQHRLEILPPNVVDQIKLWQLELDRIQTFEG 457

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            L+  F +Q +++ L +YA ++GVL W ++ K    VT +G   +  F
Sbjct: 458 YLFKDFKNQNEYDVLCNYATEVGVLLWSDKTKLRFFVTAEGMHQVADF 505


>gi|296818081|ref|XP_002849377.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
           otae CBS 113480]
 gi|238839830|gb|EEQ29492.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
           otae CBS 113480]
          Length = 484

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 144/442 (32%), Positives = 218/442 (49%), Gaps = 72/442 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
           L  +AK +V+ LL+++ P+P   +  W                     LS T    H  A
Sbjct: 38  LPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILGRLHILSSTTTSNHVRA 97

Query: 41  TDTLTELRRNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                    + + AL+G  +  S     +  D     +  L++YA ++WE +L +MVG  
Sbjct: 98  YMVTEPFSTSLRQALMGEDQRHSFGVISMTPDKHHVSVADLDEYARQQWEAVLGYMVGMS 157

Query: 100 QTEG-----ISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
              G       +  V+ L       E  +  V ITK+GF F+L +   QVW+ ++ Y+E 
Sbjct: 158 SLSGGRETITLSKGVKQLLQACHLVEIRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG--R 206
            E+  ++ VE L+F+F L     G  Y    ++      L  L +FG+VYQ    +G  R
Sbjct: 218 AEAIHMESVEVLSFIFLLSSLELGQSYEKNDLTPTQLRTLADLTDFGIVYQHTPASGSTR 277

Query: 207 FYPTKLAL-----NMATRGTLKQIREP-----------------GFLIVETNFRVYAYTD 244
           FYPT+LA      ++A  G +    EP                 GF+IVETN+R+YAYT 
Sbjct: 278 FYPTRLATTLTSDSLAMPGPVSG--EPVVPAPSTGPTSATDGGTGFIIVETNYRLYAYTS 335

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---- 300
           S L+++LI LF  L +RFPNL  G +TR SVR A+  GITA QI+ +L  HA P +    
Sbjct: 336 SPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIVSYLTTHAHPQMRKTK 395

Query: 301 -------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
                  +  LPP V DQIRLW+ ER+R+  + G L+ +F + A+FE    YA+++GVL 
Sbjct: 396 KSTSNVAASILPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAEFEAPCKYAEEIGVLV 455

Query: 354 WQNEKKRTVVVTKQGHSDIKKF 375
           W+++ +R   VT+  H  +  F
Sbjct: 456 WRSDSRRMFFVTR--HEQVAAF 475


>gi|390370355|ref|XP_783140.3| PREDICTED: general transcription factor IIH subunit 4-like, partial
           [Strongylocentrotus purpuratus]
          Length = 180

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 95/180 (52%), Positives = 139/180 (77%), Gaps = 5/180 (2%)

Query: 202 RKAGRFYPTKLALNMAT--RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           RK+ R+YPT+LA+N+A+      K   + GF++VETNFR+YAYT+S+L+V ++GLFC +M
Sbjct: 1   RKSMRYYPTRLAINLASGVSSMAKDDHKDGFIVVETNFRIYAYTESDLQVEILGLFCSMM 60

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCPL-PPVVSDQIRLWE 316
           YRFPNL+V  LTR+SV+ A+ +GITA QI+ FLR HA P++    P+ PP +SDQ+RLWE
Sbjct: 61  YRFPNLSVAALTRESVQLAISNGITAEQILSFLRTHAHPNMRLKTPIVPPTISDQVRLWE 120

Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
            ER+RL+ ++G++Y++F+S  DFE LRDYA+DLGVL W +  +R ++V+  GH  +KK+W
Sbjct: 121 LERDRLSFTQGIIYNEFLSLHDFEVLRDYAKDLGVLIWDSTARRIMIVSPAGHDSVKKYW 180


>gi|326474968|gb|EGD98977.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           Tfb2 [Trichophyton tonsurans CBS 112818]
          Length = 489

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 143/424 (33%), Positives = 211/424 (49%), Gaps = 70/424 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------------LSKTHAKEHKEA 40
           L  +AK +V+ LL+++ P+P   +  W                     LS T    H  A
Sbjct: 38  LPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILGRLHILSSTTTSNHVRA 97

Query: 41  TDTLTELRRNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                    + + AL+G  +  S     +  D     +  LN+YA ++WE +L +MVG  
Sbjct: 98  YMVTEPFSTSLRQALMGEDQQGSFGVISVTPDKYPVSVADLNEYARQQWEAVLGYMVGMS 157

Query: 100 QTEG-----ISADAVRTLFN-----EGAENPV-ITKDGFQFLLLETPAQVWYFILKYLET 148
              G       +  V+ L       E  +  V ITK+GF F+L +   QVW+ ++ Y+E 
Sbjct: 158 SLSGGRETITLSKGVKQLLQACHLVEIRDRRVEITKEGFAFVLQDVNTQVWHILILYVEN 217

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG--R 206
            E+ G++ VE L+F+F L     G  Y  + ++      L  L +FG+VYQ    +G  R
Sbjct: 218 AEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTLADLTDFGIVYQHTPASGSAR 277

Query: 207 FYPTKLAL-----NMATRGTLKQIREP-----------------GFLIVETNFRVYAYTD 244
           FYPT+LA      ++A  G +    EP                 GF+IVETN R+YAYT 
Sbjct: 278 FYPTRLATTLTSDSLAMSGPISG--EPAVPTTTSGTTTSGDAGTGFIIVETNHRLYAYTS 335

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---- 300
           S L+++LI LF  L +RFPNL  G +TR SVR A+  GITA QII +L  HA P +    
Sbjct: 336 SPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIISYLTTHAHPQMRKTK 395

Query: 301 -------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
                  S  LPP V DQIRLW+ ER+R+  + G L+ +F + A+FE    YA+++GVL 
Sbjct: 396 KSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAEFEAPCKYAEEIGVLV 455

Query: 354 WQNE 357
           W+++
Sbjct: 456 WKSD 459


>gi|389642093|ref|XP_003718679.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae 70-15]
 gi|351641232|gb|EHA49095.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae 70-15]
 gi|440469172|gb|ELQ38293.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae Y34]
 gi|440481925|gb|ELQ62459.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae P131]
          Length = 490

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 139/434 (32%), Positives = 220/434 (50%), Gaps = 70/434 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK +V+ LL++ QP+P   + SW+     K   +A   L  +              
Sbjct: 39  LPPLAKTFVMSLLYMPQPLPLTALDSWVKPEAKKNKDQALSILRSMHITTITPVTKEKPV 98

Query: 48  ---------RRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV 96
                    +++ ++AL GGG    + + + L +  +  D+ FL+++A  RW+ IL ++V
Sbjct: 99  QEMSLTPNFKKSLRLALEGGGSHNSFGVPSSLPIPPQV-DVAFLDKWARSRWDAILHYVV 157

Query: 97  GS-QQTEGISA------------DAVRTLFNEG-------AENPVITKDGFQFLLLETPA 136
            S ++T+ +              D V+TL  +G       +E   ITK GF FLL E  A
Sbjct: 158 NSVEETDSMEPSKKYNFGGSKLQDTVKTLLVQGGLVQRRSSERISITKTGFTFLLQEANA 217

Query: 137 QVWYFILKYLETVESKGLD-LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFG 195
           QVW  +L++L +V     +  V+ L+FLF L     G  Y T  +S++ +  L  L +FG
Sbjct: 218 QVWTLLLQWLHSVNEDNTNRAVDMLSFLFMLGTLELGQAYDTGALSEERRNMLPDLNDFG 277

Query: 196 LVYQRKRKAGRFYPTKLA---------LNMATRGTLKQIREPG------FLIVETNFRVY 240
           LVY       +++PT+LA         L   + G      E G       +I+ETNFR+Y
Sbjct: 278 LVYIPPSNPDQYFPTRLATTLTSGSSALRSVSSGVAAATAEAGENNTKGAIILETNFRIY 337

Query: 241 AYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
           AYT + L++A++ LF  L  RF  +  G L+R S++ A+  GITA QII +L  HA   +
Sbjct: 338 AYTSTPLQIAILALFANLKMRFAGMVTGQLSRHSIKRAISHGITADQIIEYLASHAHEQM 397

Query: 301 -------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
                  + P LPP V DQIRLW+ E ER+    G L+  F SQA+F+ + DYA ++GVL
Sbjct: 398 HRIAAIRNKPVLPPTVVDQIRLWQLETERMQVQRGYLFKDFESQAEFKAIADYADEVGVL 457

Query: 353 TWQNEKKRTVVVTK 366
            W+++ ++    +K
Sbjct: 458 IWRSDARQLFFASK 471


>gi|302495702|ref|XP_003009866.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
 gi|291173385|gb|EFE29221.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
          Length = 386

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 127/342 (37%), Positives = 185/342 (54%), Gaps = 50/342 (14%)

Query: 80  LNQYALERWECILRFMVGSQQTEG-----ISADAVRTLFNEGAENPV------ITKDGFQ 128
           L++YA ++WE +L +MVG     G       +  V+ L        V      ITK+GF 
Sbjct: 40  LDEYARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEVRDRRVEITKEGFA 99

Query: 129 FLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFL 188
           F+L +   QVW+ ++ Y+E  E+ G++ VE L+F+F L     G  Y  + ++      L
Sbjct: 100 FVLQDVNTQVWHILILYVENAEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTL 159

Query: 189 QHLREFGLVYQRKRKAG--RFYPTKLAL-----NMATRGTLKQIREP------------- 228
             L +FG+VYQ    +G  RFYPT+LA      ++A  G +    EP             
Sbjct: 160 ADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISG--EPAVPTTTAGTTTSG 217

Query: 229 ----GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGIT 284
               GF+IVETN+R+YAYT S L+++LI LF  L +RFPNL  G +TR SVR A+  GIT
Sbjct: 218 DAGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGIT 277

Query: 285 AAQIIGFLRLHALPSV-----------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           A QII +L  HA P +           S  LPP V DQIRLW+ ER+R+  + G L+ +F
Sbjct: 278 ADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEF 337

Query: 334 VSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            + A+FE    YA+++GVL W+++ +R   VT+  H  +  F
Sbjct: 338 DTFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR--HEQVAAF 377


>gi|336466164|gb|EGO54329.1| hypothetical protein NEUTE1DRAFT_87562 [Neurospora tetrasperma FGSC
           2508]
 gi|350286986|gb|EGZ68233.1| transcription factor Tfb2 [Neurospora tetrasperma FGSC 2509]
          Length = 475

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 137/411 (33%), Positives = 212/411 (51%), Gaps = 66/411 (16%)

Query: 34  AKEHKEATDTLTELRRNAKVALLGGGKPWS--MSAKLEVDSKARDLDFLNQYALERWECI 91
           +K+  +        ++  ++A  GGG+  S  + + L VD    D+ +L++YA ++W+ I
Sbjct: 68  SKDRTQEIQITASFKKGLQLAFEGGGEDGSFGVPSSLPVDPSI-DIAYLDKYARKKWDDI 126

Query: 92  LRFMVGS----QQTEGISAD--AVRTLFNEG---AENP------VITKDGFQFLLLETPA 136
           L ++V S      +EG+S    +V+ L   G      P       IT+ GF FLL E+ A
Sbjct: 127 LHYVVNSLGVHGSSEGVSGPKASVKELLLAGRLVERRPDTRTGIGITQAGFTFLLQESNA 186

Query: 137 QVWYFILKYLETVE---SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLRE 193
           QVW  +L +LE  +     G+D ++ L+FLF L     GT Y T  +++  +  L  L +
Sbjct: 187 QVWTLLLLWLEAADHTADSGMDSIDMLSFLFMLSSLELGTAYDTSALTETRRNMLPSLVD 246

Query: 194 FGLVYQRKRKAGRFYPTKLALNMA-------------TRGTLKQIREPGF---------- 230
           FGL+Y   R   +F+PT+LA  +              T  T     EPG           
Sbjct: 247 FGLIY-FPRDTRQFFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTTTSTSGGPAA 305

Query: 231 -----------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAAL 279
                      +I+ETN+R+YAYT S L++A++ LFC+L  RFPN+  G LTRDS+R A+
Sbjct: 306 TPDKTQTAAKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRLTRDSIRQAI 365

Query: 280 RSGITAAQIIGFLRLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYS 331
             GITA QII +L  HA   +           LPP V DQIRLW+ E ER+ T+ G L+ 
Sbjct: 366 GFGITADQIISYLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERMQTTAGFLFK 425

Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
            F S  ++  L  YA+++GVL W+N+K++    +K  H  ++ +  + +KG
Sbjct: 426 DFESTEEYAALSRYAEEIGVLVWKNDKRQLFFASK--HEQLRDYLKSRKKG 474


>gi|164425579|ref|XP_960485.2| hypothetical protein NCU05523 [Neurospora crassa OR74A]
 gi|157070982|gb|EAA31249.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 475

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 137/411 (33%), Positives = 212/411 (51%), Gaps = 66/411 (16%)

Query: 34  AKEHKEATDTLTELRRNAKVALLGGGKPWS--MSAKLEVDSKARDLDFLNQYALERWECI 91
           +K+  +        ++  ++A  GGG+  S  + + L VD    D+ +L++YA ++W+ I
Sbjct: 68  SKDRTQEIQITASFKKGLQLAFEGGGEDGSFGVPSSLPVDPSI-DIAYLDRYARKKWDDI 126

Query: 92  LRFMVGS----QQTEGISAD--AVRTLFNEG---AENP------VITKDGFQFLLLETPA 136
           L ++V S      +EG+S    +V+ L   G      P       IT+ GF FLL E+ A
Sbjct: 127 LHYVVNSLGVHGSSEGVSGPKASVKELLLAGRLVERRPDTRTGIGITQAGFTFLLQESNA 186

Query: 137 QVWYFILKYLETVE---SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLRE 193
           QVW  +L +LE  +     G+D ++ L+FLF L     GT Y T  +++  +  L  L +
Sbjct: 187 QVWTLLLLWLEAADHTADSGMDSIDMLSFLFMLSSLELGTAYDTSALTETRRNMLPSLVD 246

Query: 194 FGLVYQRKRKAGRFYPTKLALNMA-------------TRGTLKQIREPGF---------- 230
           FGL+Y   R   +F+PT+LA  +              T  T     EPG           
Sbjct: 247 FGLIY-FPRDTRQFFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTTTSTSGGPAA 305

Query: 231 -----------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAAL 279
                      +I+ETN+R+YAYT S L++A++ LFC+L  RFPN+  G LTRDS+R A+
Sbjct: 306 TPDKTQTAAKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRLTRDSIRQAI 365

Query: 280 RSGITAAQIIGFLRLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYS 331
             GITA QII +L  HA   +           LPP V DQIRLW+ E ER+ T+ G L+ 
Sbjct: 366 GFGITADQIISYLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERMQTTAGFLFK 425

Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
            F S  ++  L  YA+++GVL W+N+K++    +K  H  ++ +  + +KG
Sbjct: 426 DFESTEEYAALSRYAEEIGVLVWKNDKRQLFFASK--HEQLRDYLKSRKKG 474


>gi|302667863|ref|XP_003025510.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
 gi|291189624|gb|EFE44899.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
          Length = 386

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 127/342 (37%), Positives = 185/342 (54%), Gaps = 50/342 (14%)

Query: 80  LNQYALERWECILRFMVGSQQTEG-----ISADAVRTLFNEGAENPV------ITKDGFQ 128
           L++YA ++WE +L +MVG     G       +  V+ L        V      ITK+GF 
Sbjct: 40  LDEYARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEVRDRRVEITKEGFA 99

Query: 129 FLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFL 188
           F+L +   QVW+ ++ Y+E  E+ G++ VE L+F+F L     G  Y  + ++      L
Sbjct: 100 FVLQDVNTQVWHILILYVENAEAIGMESVEVLSFVFLLSSLELGQSYEKKDLTPTQLRTL 159

Query: 189 QHLREFGLVYQRKRKAG--RFYPTKLAL-----NMATRGTLKQIREP------------- 228
             L +FG+VYQ    +G  RFYPT+LA      ++A  G +    EP             
Sbjct: 160 ADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISG--EPAVPTTTAGTTTSG 217

Query: 229 ----GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGIT 284
               GF+IVETN+R+YAYT S L+++LI LF  L +RFPNL  G +TR SVR A+  GIT
Sbjct: 218 DAGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGIT 277

Query: 285 AAQIIGFLRLHALPSV-----------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           A QII +L  HA P +           S  LPP V DQIRLW+ ER+R+  + G L+ +F
Sbjct: 278 ADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEF 337

Query: 334 VSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            + A+FE    YA+++GVL W+++ +R   VT+  H  +  F
Sbjct: 338 DTFAEFEAPCKYAEEIGVLVWKSDSRRMFFVTR--HEQVAAF 377


>gi|194390490|dbj|BAG62007.1| unnamed protein product [Homo sapiens]
          Length = 220

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 111/220 (50%), Positives = 144/220 (65%), Gaps = 30/220 (13%)

Query: 11  IRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR---------------------R 49
           +R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR                     +
Sbjct: 1   MRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQ 60

Query: 50  NAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADA 108
           N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS  +  +S D 
Sbjct: 61  NLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP-SAAVSQDL 119

Query: 109 VRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
            + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S+G+DLVE L+
Sbjct: 120 AQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILS 179

Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK 201
           FLFQL FST G DYS EGMSD L  FLQHLREFGLV+QRK
Sbjct: 180 FLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLVFQRK 219


>gi|342884444|gb|EGU84659.1| hypothetical protein FOXB_04847 [Fusarium oxysporum Fo5176]
          Length = 496

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 137/451 (30%), Positives = 229/451 (50%), Gaps = 80/451 (17%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
           L  +AK +V+R+L+  +P+  + +  W+  +H ++  +A   L                 
Sbjct: 41  LPHLAKTFVMRILYSPKPILLSDLDDWVKPSHKRQKDQALSILRVLHIVSITAPSKERPQ 100

Query: 45  -----TELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVG 97
                T  + + ++AL GGG    + + + L++  +  D+ FL++YA ++WE IL F+V 
Sbjct: 101 EMQLTTNFKLSLRLALSGGGTHNSFGVPSTLQIPPEI-DIAFLDRYARKKWEDILHFVVS 159

Query: 98  S--QQTEGISADAVRT---------LFNEGAENPV-ITKDGFQFLLLETPAQVWYFILKY 145
           S   ++ G S+   ++         L +  A   V IT+ GF FLL E  AQVW  +L +
Sbjct: 160 SVGYKSAGESSGPNKSVKELLVAGRLVDRKASGSVGITQAGFTFLLQEANAQVWTLLLLW 219

Query: 146 LETVESK---GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKR 202
           LE +E     GL+  + L+FLF L     G  Y T  ++++ +  L  L +FGL+Y    
Sbjct: 220 LEAMEVNKMAGLEATDMLSFLFVLASMELGRAYDTNALTEQRKNMLPSLVDFGLIYIPNH 279

Query: 203 KAGRFYPTKLALNM----------------------------ATRG--TLKQIREPGFLI 232
           K   F+PT+LA  +                            A  G  T     + G +I
Sbjct: 280 KRSMFFPTRLATTLTSSSNSLRSISDGVAAATAAALQPGQSGAPGGSVTAANTEQRGSVI 339

Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
           +ETN+R+YAYT S L++A++ LF +L  RFP++  G ++R S+R A++ GITA QII +L
Sbjct: 340 IETNYRIYAYTQSTLQIAVLALFTKLAMRFPDMVAGRISRQSIRQAIQFGITAEQIISYL 399

Query: 293 RLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
             HA   +       + P LPP V DQIRLW+ E ER+ T+ G L+  F    +++ +  
Sbjct: 400 SAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTSGFLFKSFEDDREYKDIAR 459

Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           +A+++GVL W+N+ ++    +K  H  I+ +
Sbjct: 460 FAEEVGVLVWKNDARQMFFASK--HEQIRDY 488


>gi|212535856|ref|XP_002148084.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
 gi|210070483|gb|EEA24573.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
          Length = 422

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 125/340 (36%), Positives = 185/340 (54%), Gaps = 44/340 (12%)

Query: 71  DSKARDLDFLNQYALERWECILRFMVGSQQT---EGIS-ADAVRTLFNEGAENPV----- 121
           D  A  +  L++YA  +WE +L +MVG+      E +S +  V+ L   G    +     
Sbjct: 70  DPNAVSIADLDEYARRQWEGVLGYMVGTSGPGVRETVSLSRGVKQLLQAGHLVEIHHGRV 129

Query: 122 -ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM 180
            IT+DGF F+L +   QVW  I+ Y+E+ ++  ++ VE L+F+F L     G  Y    +
Sbjct: 130 DITQDGFAFVLQDVNTQVWQIIILYVESAKALQVNDVEVLSFIFLLSSLELGQSYEKRHL 189

Query: 181 SDKLQVFLQHLREFGLVYQRK--RKAGRFYPTKLALNMATRG--------------TLKQ 224
           +      L  L +FG++YQ     +A RFYPT+LA  + +                T   
Sbjct: 190 TPDQLRALADLTDFGIIYQHAPASEADRFYPTRLATTLTSDSSALSNTMSSSLSAQTNAA 249

Query: 225 IREPG--FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSG 282
             EPG  F+I+ETN+R+YAYT S L+++LI LF  L YRFPNL  G +TR S+R A+  G
Sbjct: 250 TGEPGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMG 309

Query: 283 ITAAQIIGFLRLHALP-------------SVSCP---LPPVVSDQIRLWEGERERLTTSE 326
           ITA QII +L  HA P             +   P   LPP V DQIRLW+ ER+R+  + 
Sbjct: 310 ITADQIISYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATP 369

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
           G L+ +F S A++E    YA+++GVL W++++KR   VT+
Sbjct: 370 GFLFKEFASLAEYEGPCRYAEEIGVLVWKSDRKRMFFVTR 409


>gi|340518979|gb|EGR49219.1| trancription factor tfb2-like protein [Trichoderma reesei QM6a]
          Length = 480

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 147/448 (32%), Positives = 228/448 (50%), Gaps = 74/448 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------LSKTH-------AKEHKE 39
           L  +AK +V+R+LF+  P+  A + +W               L   H       +KE  +
Sbjct: 39  LPHLAKTFVMRMLFLPHPMTLADLDAWVKPEAKRKKDQSLSILRSLHIVQISAPSKEKPQ 98

Query: 40  ATDTLTELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
               ++  +R+ ++AL GGG       P S+    E+D     L FL++YA ++WE +L 
Sbjct: 99  EIQLMSNFKRSLRLALTGGGNHNSFGVPSSLMVPPEID-----LPFLDRYARKKWEDVLH 153

Query: 94  FMV-----------GSQQTEGISADAVRTLFNEGAENPV--ITKDGFQFLLLETPAQVWY 140
           F+V           GS   +G+    +     +   N    IT+ GF FLL E  AQVW 
Sbjct: 154 FVVSSVGYKSSLGEGSGPNKGVKELLIAGQLVDRRPNGSLGITQAGFTFLLQEANAQVWT 213

Query: 141 FILKYLETV---ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV 197
            +L +LE +   +S GLD V+ L+FLF L     G  Y T  ++++ +  L  L +FGL+
Sbjct: 214 LLLLWLEVLGNNKSSGLDPVDMLSFLFMLASLELGRAYDTNALTEERRNMLPSLVDFGLI 273

Query: 198 YQRKRKAGRFYPTKLALNMATRG-TLKQIRE--------------PGFLIVETNFRVYAY 242
           Y  + K   F+PT+LA  + + G +L+ I E               G +++ETN+R+YAY
Sbjct: 274 YIPQHKRSMFFPTRLATTLTSGGNSLRTISEGSLGLLGGGGTGEQAGSVVIETNYRLYAY 333

Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-- 300
           T S L++A++ LF +L  RFP++  G L+R S+R A+  GITA QII +L  HA   +  
Sbjct: 334 TQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQIISYLSAHAHEQMHR 393

Query: 301 -----SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
                + P LPP V DQIRLW+ E ER+ T+ G L+  F    D+     +++++GVL W
Sbjct: 394 TAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDDKDYLDTARFSEEIGVLVW 453

Query: 355 QNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
           +N+  R     K  H  IK F  + +K 
Sbjct: 454 RNDNTRMFFANK--HEQIKDFLKSRKKA 479


>gi|46127843|ref|XP_388475.1| hypothetical protein FG08299.1 [Gibberella zeae PH-1]
 gi|408390907|gb|EKJ70292.1| hypothetical protein FPSE_09509 [Fusarium pseudograminearum CS3096]
          Length = 494

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 139/451 (30%), Positives = 228/451 (50%), Gaps = 80/451 (17%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
           L ++AK +V+R+LF  +P+  + +  W+  +H ++  +A   +                 
Sbjct: 39  LPQLAKVFVMRILFNPKPILLSDLDDWVKPSHKRQKDQALSVMRVLHIVQISTPSKERPQ 98

Query: 45  -----TELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVG 97
                T  + + ++AL GGG    + + + L++  +  D+DFL++YA ++WE IL F+V 
Sbjct: 99  EMQLTTNFKVSLRLALSGGGSHNSFGVPSTLQIPPEI-DIDFLDRYARKKWEDILHFVVS 157

Query: 98  S--QQTEGISA---DAVRTLFNEG---AENPV----ITKDGFQFLLLETPAQVWYFILKY 145
           S   ++ G S+    +V+ L   G      P     IT+ GF FLL E  AQVW  +L +
Sbjct: 158 SVGYKSAGESSGPNKSVKELLVAGRLVDRRPSGAIGITQAGFTFLLQEANAQVWTLLLLW 217

Query: 146 LETVESK---GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKR 202
           LE ++     GL+  + L+FLF L     G  Y T  ++++ +  L  L +FGL+Y    
Sbjct: 218 LEAMDVNKMAGLEATDMLSFLFVLASMELGRAYDTNALTEQRKNMLPSLVDFGLIYIPNH 277

Query: 203 KAGRFYPTKLALNMATRGTLK------------------QIREPGF------------LI 232
           K   F+PT+LA  + +                       Q R PG             +I
Sbjct: 278 KRSMFFPTRLATTLTSSSNSLRSISDGVAAATAAALQPGQSRAPGSSATGSSNEQRGSVI 337

Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
           +ETN+R+YAYT S L++A++ LF +L  RFP++  G ++R S+R A+  GITA QII +L
Sbjct: 338 IETNYRIYAYTQSTLQIAVLALFTKLQMRFPDMVAGRISRQSIRQAINFGITAEQIISYL 397

Query: 293 RLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
             HA   +       + P LPP V DQIRLW+ E ER+ T+ G L+  F    ++  +  
Sbjct: 398 SAHAHDQMRRTASLNNKPVLPPTVVDQIRLWQLENERMKTTGGFLFKDFEDHKEYMAVAG 457

Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           +A+++GVL W+N+ K     +K  H  I+ +
Sbjct: 458 FAEEVGVLVWRNDVKGMFFASK--HEQIRDY 486


>gi|254573074|ref|XP_002493646.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
           [Komagataella pastoris GS115]
 gi|238033445|emb|CAY71467.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
           [Komagataella pastoris GS115]
 gi|328354526|emb|CCA40923.1| RNA polymerase II transcription factor B subunit 2 [Komagataella
           pastoris CBS 7435]
          Length = 547

 Score =  207 bits (527), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 130/393 (33%), Positives = 198/393 (50%), Gaps = 77/393 (19%)

Query: 48  RRNAKVALLGGGKPWSMSAKLEVDSKARDLD-----FLNQYALERWECILRFMVGSQQTE 102
           R+N + AL G      MS  L  D +  D++     FL++Y+L +WE IL FMVGS+  E
Sbjct: 144 RKNFRNALTGS----QMSNDLNNDIEMTDVEEVSEEFLDRYSLMKWENILHFMVGSEVPE 199

Query: 103 GISADAVRTL-----FNEGAENPVI------------------TKDGFQFLLLETPAQVW 139
             S   +  L      + G EN  I                  T+DGFQFLL +  AQ+W
Sbjct: 200 LPSIGVLSLLRHSGLMDIGEENEFIATPKRFKGDNMDLKSMKITQDGFQFLLQDINAQIW 259

Query: 140 YFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
             +L+YL   E   ++ V+ L F+F L     G  Y    +SD   + L+ L ++GL+Y 
Sbjct: 260 QLLLQYLRMSEKLMMNPVDVLNFIFMLGSLELGKQYPVTLLSDTQVIMLEDLVDYGLIY- 318

Query: 200 RKRKAGRFYPTKLALNMATRGTL---------------KQIREPGFLIVETNFRVYAYTD 244
              K   FYPT++A  + +  T                 ++++ G +++ETNF++Y YT 
Sbjct: 319 YPEKNRYFYPTRMATTLTSEKTTFKTASMSMNQVLEGGPEVKDHGSIVLETNFKLYCYTT 378

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC-- 302
           S L++A++ LF  L  RF N+  G++TR+SVR ALR+GITA QII +L  HA   +    
Sbjct: 379 SPLQIAILNLFVHLRTRFANMVTGMITRESVREALRNGITADQIIKYLETHAHSQMKKLA 438

Query: 303 ---------------------------PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVS 335
                                       +PP V DQI+LW+ E +RL T  G L+  F +
Sbjct: 439 EEKLLKKLEFDSNATETQQIGSQVKLEVIPPTVVDQIKLWQLEMDRLQTFAGFLFKDFAN 498

Query: 336 QADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
             +FE+L +YA ++GV+ W+++ KR   VT++G
Sbjct: 499 AQEFEQLANYADEVGVMLWRDDDKRKFFVTEEG 531


>gi|440636738|gb|ELR06657.1| hypothetical protein GMDG_00274 [Geomyces destructans 20631-21]
          Length = 470

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 139/429 (32%), Positives = 210/429 (48%), Gaps = 58/429 (13%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK +V+ +L++ +P+  + + +W+      E  +A   L+ L              
Sbjct: 38  LPHLAKCFVMGMLYMPKPLLLSELDNWVRPGGKSEKGKALAILSRLHIVTSTAASRDGPP 97

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGS 98
                     + ++AL GGG   S     +    +   + FL++YA  +WE IL ++V S
Sbjct: 98  ALNLTHNFATSLRLALTGGGDHGSFGIPSDSTQDSNITISFLDEYATAQWEGILHYVVNS 157

Query: 99  QQTEGI-----SADAVRTLFNEGA--------ENPVITKDGFQFLLLETPAQVWYFILKY 145
                +      +  VR L   G             IT+ GF FLL E  AQVW  +L +
Sbjct: 158 TGDPSLLGAQYPSANVRQLLQMGTLVDSKGKYSGVGITQAGFSFLLQEANAQVWTVLLLW 217

Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
           LE  E   +D V+ L+F F L     G  YST  +S   +  L  L +FGLVY     + 
Sbjct: 218 LEHAEELAMDHVDLLSFFFLLGSLELGLAYSTTTLSTTQKSMLPILADFGLVYSPPSSSS 277

Query: 206 RFYPTKLALNMATRGT----------------LKQIREPGFLIVETNFRVYAYTDSNLKV 249
            F  T+LA  + +  +                       GF+++ETN+R+YAYT S L++
Sbjct: 278 FFP-TRLATTLTSSASALRSIASGFSAAAASPASSGGTTGFIVIETNYRLYAYTSSPLQI 336

Query: 250 ALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPP 306
           A++ LF +L  R+PN+  G +TRDSVR A+  GITA QII +L  HA P +   S  LPP
Sbjct: 337 AVLALFAKLSTRYPNMVSGRVTRDSVRGAIEHGITADQIISYLTTHAHPQLRRKSPVLPP 396

Query: 307 VVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
            V DQIRLW+ E ER+  + G L+  F S A++E    YA ++GVL W+++ KR   VTK
Sbjct: 397 TVVDQIRLWQIENERMKATAGFLFKDFASLAEYEGCVRYADEIGVLVWRSDAKRNFFVTK 456

Query: 367 QGHSDIKKF 375
             H  ++ +
Sbjct: 457 --HEQLRDY 463


>gi|425777809|gb|EKV15965.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Penicillium digitatum PHI26]
 gi|425782577|gb|EKV20476.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Penicillium digitatum Pd1]
          Length = 382

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 127/345 (36%), Positives = 187/345 (54%), Gaps = 42/345 (12%)

Query: 80  LNQYALERWECILRFMVG----SQQTEGIS-ADAVRTLFNEG------AENPVITKDGFQ 128
           L++YA  +WE ++ +MVG    S Q + ++ +  V+ L   G           ITKDGF 
Sbjct: 40  LDEYARRQWEGVMGYMVGTSALSGQRDAVNLSKGVKQLLQAGHLVEIRGNRVDITKDGFG 99

Query: 129 FLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFL 188
           F+L +   QVW+ ++ Y+E+ E+ G+D VE L+FLF L     G+ Y    M+      L
Sbjct: 100 FVLQDVNTQVWHILILYVESAEAIGMDSVEVLSFLFLLSSLELGSSYDKSHMTSNQLRTL 159

Query: 189 QHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPG---------------FLIV 233
             L +FG+VYQ    A  FYPT+LA  + +  +       G               F+I+
Sbjct: 160 ADLADFGIVYQEDADATHFYPTRLATTLTSDSSALSNPVTGSLTGPVGPSGGSGSGFIII 219

Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
           ETN+R+YAYT S L+++LI LF  L YRFPNL  G +TR SVR A+  GITA QII +L 
Sbjct: 220 ETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQIISYLL 279

Query: 294 LHALPSV--------------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADF 339
            HA P +              +  LPP V+DQIRLW+ ER+RL  + G L+  F S A++
Sbjct: 280 SHAHPQLRKHSAAQSNGKGIPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDFTSLAEY 339

Query: 340 EKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
           +    YA ++GVL W++++KR   VT+  H  +  F  + + G +
Sbjct: 340 QAPCQYAAEIGVLVWKSDRKRMFFVTR--HEQVAAFLRSQKVGGR 382


>gi|430812024|emb|CCJ30551.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 319

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 126/309 (40%), Positives = 172/309 (55%), Gaps = 32/309 (10%)

Query: 71  DSKARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTL-------FNEGAENPVIT 123
           D K  D+ FL+ YA  +WE IL FMVG+  TE +  D V  L       F    +   IT
Sbjct: 17  DKKHVDIPFLDNYAKNKWEMILHFMVGTG-TEKLPGDHVLNLLRYSGLMFGSRCDEMKIT 75

Query: 124 KDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSD- 182
             GFQFLLL   +Q+W  +L YL  +E   +D V+ L FLF L     G  Y    ++  
Sbjct: 76  NSGFQFLLLNINSQIWTLLLHYLNMIEDLEVDPVDILQFLFMLGNLELGQSYLISSLTPI 135

Query: 183 KLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAY 242
           ++QV L++L+++G+VY+RK  + RFYPT+LA  + T  +   IR                
Sbjct: 136 QIQV-LENLKDYGIVYRRK-SSRRFYPTRLA-TILTSNSEILIRH--------------- 177

Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-- 300
             S L++A++ LF  L  RF NL VGV+TR+SVR A  +GITA QII +L  +A P +  
Sbjct: 178 --SPLQIAVLNLFVHLYLRFSNLVVGVITRNSVRQAFMNGITAEQIISYLTSYAHPQMKK 235

Query: 301 SCPL-PPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
           + P+ PP V+DQIRLWE ER RL  +EG L+  F S +DFE    YA +L V+ W+   K
Sbjct: 236 NTPIIPPTVNDQIRLWEMERNRLKATEGYLFRDFNSNSDFELALKYANELNVVVWEAPSK 295

Query: 360 RTVVVTKQG 368
           R   V  QG
Sbjct: 296 RVFFVNIQG 304


>gi|348683175|gb|EGZ22990.1| hypothetical protein PHYSODRAFT_479459 [Phytophthora sojae]
          Length = 490

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 123/367 (33%), Positives = 197/367 (53%), Gaps = 52/367 (14%)

Query: 58  GGKPWS-----MSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEGISADAV--- 109
           GG PW      +    E    A DL+   +YA  RW+ +L +MVGS   +      V   
Sbjct: 118 GGSPWERGRLQLPKDPENTFAAADLE---RYARARWDSVLHYMVGSTAVQEPPQSVVDIL 174

Query: 110 ---RTLFNEGAENPV--ITKDGFQFLLLETPAQVWYFILKYLETVESKG-LDLVECLTFL 163
              R L   GA++    IT  G++F+L +   Q+W F+L+Y+ T+++ G L   + L FL
Sbjct: 175 LRTRLLQASGADSRALHITDTGYEFMLKDIHVQMWIFMLEYIRTLDNTGTLKQEDILQFL 234

Query: 164 FQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNM------- 216
           FQ+ +   G  Y+   +++  ++ L    +FGL+++++  + RFY T LA+N+       
Sbjct: 235 FQISYCQTGEYYAVADLTETQRLLLGDFIDFGLLFRKRPNSDRFYTTSLAVNLIFGGSTG 294

Query: 217 -------------ATRGTLK-QIREPG-----------FLIVETNFRVYAYTDSNLKVAL 251
                          R  LK Q+ +P             ++VETNF++YAYT S L VA+
Sbjct: 295 QKRSHVSLTSSFAGVRAGLKSQVADPRQAPTADHSAQLLVVVETNFKIYAYTTSTLHVAM 354

Query: 252 IGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPLPPVV 308
           + +F +++ R PNL +G +TR+S+R+AL  GI+A QI  FL  HA P +   S  +P  +
Sbjct: 355 LSVFVDIVARLPNLAIGFITRESLRSALIHGISAQQIYDFLMKHAHPKMRRNSPVIPENI 414

Query: 309 SDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
           +DQI LWE ER R+   EG+L+  F ++ D+E +RDYA+DL VLTW +     + +T  G
Sbjct: 415 ADQIYLWERERNRVQFMEGILFDGFNTKEDYESVRDYAKDLKVLTWSDPIHFRLSITTAG 474

Query: 369 HSDIKKF 375
             D++ +
Sbjct: 475 IDDVRHY 481


>gi|47212202|emb|CAF90416.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 471

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 135/358 (37%), Positives = 176/358 (49%), Gaps = 105/358 (29%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  ++K++V+R+LF+EQP+PQA +A W+ K   ++H E    L  LR            
Sbjct: 70  ELPSLSKNFVMRMLFLEQPLPQAAVALWVGKDSQRDHDECVCVLAGLRLWHSQHLQGGLQ 129

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                     N K ALLGGG+ W+     L  D  ARD+  L++YA ERWE IL FMVGS
Sbjct: 130 GYVLNPVFKENLKTALLGGGRAWAEEGGALGPDRHARDIGSLDRYATERWEVILHFMVGS 189

Query: 99  QQTEGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
                +S D  + L + G       E P IT  GFQFLLL+T +Q+WYF L+YL+T +  
Sbjct: 190 P-CAAVSQDLAQLLVHAGLMKSEAGEPPYITSAGFQFLLLDTASQLWYFTLQYLKTAQ-- 246

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
                                DYS EGMSD L  FLQHLREFGLV+QRK  +GR      
Sbjct: 247 ---------------------DYSVEGMSDSLLTFLQHLREFGLVFQRK-ASGR------ 278

Query: 213 ALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
                                                       E++YRFPN+ V  +TR
Sbjct: 279 --------------------------------------------EMLYRFPNVVVAQVTR 294

Query: 273 DSVRAALRSGITAAQIIGF-LRLHALPSVSCP-LPPVVSDQIRLWEGERERLTTSEGV 328
           +SV+ A+ +GITA Q + F L     P    P LPP ++DQIRLWE ER+RL  +E V
Sbjct: 295 ESVQQAIANGITAQQRLTFDLWSSPTPLSQTPVLPPTITDQIRLWELERDRLQFTEAV 352


>gi|414584750|tpg|DAA35321.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
          Length = 386

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 122/350 (34%), Positives = 191/350 (54%), Gaps = 45/350 (12%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK Y +++L+V  PV  A +  W+   +A +HK A D L +LR             
Sbjct: 40  LPPLAKKYALQMLYVLAPVTAAAMEEWVLDEYAAKHKVAIDKLLQLRVFVEVRDRRRDVS 99

Query: 49  --------RNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRF 94
                    N +  L+ GG       P S++ +L   +       L  YAL++WEC L  
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPLPLSVTGRLPTPAD------LEAYALDQWECFLLQ 153

Query: 95  MVGSQQTE---GISADAVRT-----LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYL 146
           ++ S Q E     S+  ++T     L +   E   +T++GFQFLL+ET AQ+WY + +Y+
Sbjct: 154 LINSSQVEKGSSFSSSMMKTFQRGLLSSRDGEASKLTENGFQFLLMETNAQLWYIMREYI 213

Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK-RKAG 205
            + E +G+D  E ++FL +L F   G  YS   ++D  ++ ++ L E GLV Q++ RK  
Sbjct: 214 SSAEERGVDPTELISFLLELSFHKLGAAYSLNTLTDVQRIAIRDLAELGLVKQQQGRKDS 273

Query: 206 RFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
            F PT+LA N++   +     + GF++VETNFR+YAY+ S L   ++ LF  + Y+ PNL
Sbjct: 274 WFIPTQLATNLSASLSDSSSNKEGFVVVETNFRMYAYSTSKLHCEILRLFARVEYQLPNL 333

Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQI 312
            VG +T++S+  A  +GITA QII FLR +A P V+  +P V   V+DQ+
Sbjct: 334 IVGAVTKESIYGAFENGITAEQIISFLRQNAHPRVADKIPAVPENVTDQV 383


>gi|325181847|emb|CCA16302.1| general transcription factor IIH subunit putative [Albugo laibachii
           Nc14]
          Length = 473

 Score =  204 bits (518), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 136/437 (31%), Positives = 215/437 (49%), Gaps = 66/437 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWL------------SKTHAKE----HKEATDTLT 45
           L  +A+ +V+RLL     V + V+  WL             K  AK     ++ +   L 
Sbjct: 32  LPSLAQQFVMRLLSCNTVVQEQVLREWLVTQVQSDPGTSVGKEKAKSIDEIYQHSIQKLR 91

Query: 46  ELR-----------------RNAKVALLG-GGKPWSMSAK-----LEVDSKARDLDFLNQ 82
           +LR                 +  K AL   GG PW +  +     +EV   + +L+   +
Sbjct: 92  KLRVFTQSSTDSYQLHAGFQKQLKYALCHLGGCPWEVGRRKLGKDIEVSFASTELE---R 148

Query: 83  YALERWECILRFMVGSQQTEGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETP 135
           YA   W+ +L +MVGS          V  L                IT  G++F+L +  
Sbjct: 149 YARSHWDAVLHYMVGSTSVPDPPQAVVDILLRTNLMQASVDQRALHITDKGYEFMLKDIH 208

Query: 136 AQVWYFILKYLETVESKG-LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREF 194
            Q+W FIL+Y++T++  G L   + L FLFQ+ +      Y  + ++   Q+ L     F
Sbjct: 209 IQMWIFILEYIKTLDRTGALKQEDILRFLFQISYCQVNAYYPVQDLTKTQQLLLTDFNNF 268

Query: 195 GLVYQRKRKAGRFYPTKLALNMATRGTLK------QIREPG-------FLIVETNFRVYA 241
           GL+Y+++  + RFY T LA+N+   GT        Q+  P          IVETNF+VYA
Sbjct: 269 GLLYRKRSNSDRFYTTSLAINLIFGGTTTSNYSETQVARPRMSSMSDLLTIVETNFKVYA 328

Query: 242 YTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV- 300
           YT S L +A++ +F +++ R PNL +G +TR+S+R+AL  GI+A QI  FL  HA P + 
Sbjct: 329 YTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGISAQQIYDFLLKHAHPRMK 388

Query: 301 -SCPL-PPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
            + P+ P  ++DQI LWE ER R+   EG+L+  F S+ +FE++R+YAQ    LTW +  
Sbjct: 389 KNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQVREYAQKQRFLTWTDPI 448

Query: 359 KRTVVVTKQGHSDIKKF 375
              + ++  G   ++ F
Sbjct: 449 HLKLAISSNGIESVRNF 465


>gi|90076516|dbj|BAE87938.1| unnamed protein product [Macaca fascicularis]
          Length = 358

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 108/223 (48%), Positives = 144/223 (64%), Gaps = 30/223 (13%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 83  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 142

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 143 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 202

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 203 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 261

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREF 194
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQHL+E 
Sbjct: 262 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLQEI 304


>gi|325181222|emb|CCA15636.1| general transcription factor IIH subunit putative [Albugo laibachii
           Nc14]
          Length = 1033

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 136/437 (31%), Positives = 215/437 (49%), Gaps = 66/437 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWL------------SKTHAKE----HKEATDTLT 45
           L  +A+ +V+RLL     V + V+  WL             K  AK     ++ +   L 
Sbjct: 79  LPSLAQQFVMRLLSCNTVVQEQVLREWLVTQVQSDPGTSVGKEKAKSIDEIYQHSIQKLR 138

Query: 46  ELR-----------------RNAKVALLG-GGKPWSMSAK-----LEVDSKARDLDFLNQ 82
           +LR                 +  K AL   GG PW +  +     +EV   + +L+   +
Sbjct: 139 KLRVFTQSSTDSYQLHAGFQKQLKYALCHLGGCPWEVGRRKLGKDIEVSFASTELE---R 195

Query: 83  YALERWECILRFMVGSQQTEGISADAVRTLFNEGAENPV-------ITKDGFQFLLLETP 135
           YA   W+ +L +MVGS          V  L                IT  G++F+L +  
Sbjct: 196 YARSHWDAVLHYMVGSTSVPDPPQAVVDILLRTNLMQASVDQRALHITDKGYEFMLKDIH 255

Query: 136 AQVWYFILKYLETVESKG-LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREF 194
            Q+W FIL+Y++T++  G L   + L FLFQ+ +      Y  + ++   Q+ L     F
Sbjct: 256 IQMWIFILEYIKTLDRTGALKQEDILRFLFQISYCQVNAYYPVQDLTKTQQLLLTDFNNF 315

Query: 195 GLVYQRKRKAGRFYPTKLALNMATRGTLK------QIREPG-------FLIVETNFRVYA 241
           GL+Y+++  + RFY T LA+N+   GT        Q+  P          IVETNF+VYA
Sbjct: 316 GLLYRKRSNSDRFYTTSLAINLIFGGTTTSNYSETQVARPRMSSMSDLLTIVETNFKVYA 375

Query: 242 YTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV- 300
           YT S L +A++ +F +++ R PNL +G +TR+S+R+AL  GI+A QI  FL  HA P + 
Sbjct: 376 YTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGISAQQIYDFLLKHAHPRMK 435

Query: 301 -SCPL-PPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
            + P+ P  ++DQI LWE ER R+   EG+L+  F S+ +FE++R+YAQ    LTW +  
Sbjct: 436 KNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQVREYAQKQRFLTWTDPI 495

Query: 359 KRTVVVTKQGHSDIKKF 375
              + ++  G   ++ F
Sbjct: 496 HLKLAISSNGIESVRNF 512


>gi|302806060|ref|XP_002984780.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
 gi|302808297|ref|XP_002985843.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
 gi|300146350|gb|EFJ13020.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
 gi|300147366|gb|EFJ14030.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
          Length = 459

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 136/420 (32%), Positives = 212/420 (50%), Gaps = 43/420 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK YV+RLL+++  VP  ++  W+      +HK A D L +LR             
Sbjct: 41  LPPLAKLYVLRLLYLDAAVPDKMLLEWIKGDAVAKHKNAVDRLVQLRVFLPEEKILSRKK 100

Query: 49  -----------RNAKVALLGG-GKPWS-MSAKLEVD-SKARDLDFLNQYALERWECILRF 94
                         ++AL  G G P   +   + V    A++LD    YA E+WE     
Sbjct: 101 ETHYRVNPKFQEQLRLALSTGLGTPRDPLPPDIAVRMPSAKELD---DYATEKWEAWFSS 157

Query: 95  MVGSQQTEGISADA---VRTLFNEG-----AENPVITKDGFQFLLLETPAQVWYFILKYL 146
           +V     +G SA     +  LF +       ++P IT  GFQFLLL+  +Q+W  I +Y+
Sbjct: 158 LVNFASPDGPSASTNGFIVQLFQKADLLTSGQDPKITPAGFQFLLLDRNSQLWRVIREYV 217

Query: 147 ETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGR 206
           +  E++ +D  E + FL ++ F + G  YS + + +  + F + L   G++  +K    R
Sbjct: 218 QYAEARQIDTGELIRFLLEIGFYSVGEPYSMDSLPNSQRNFAEELAMLGVLQLQKGMKDR 277

Query: 207 -FYPTKLALNMATRGTLKQI-REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
            F PT+LA  ++   +     +  GF++VETNFRVYAYT S L +  + +F    Y  PN
Sbjct: 278 WFIPTRLATGLSASLSESSAWQTEGFIMVETNFRVYAYTSSKLHIETLHVFVRTEYVLPN 337

Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPV---VSDQIRLWEGERER 321
           + VG +T++SV  A  SGI+A QII FL+ HA P V+  +P V   V DQIRLWE +R R
Sbjct: 338 ILVGSITKESVNGAFASGISADQIIKFLQQHAHPFVAQKVPSVPETVCDQIRLWESDRVR 397

Query: 322 LTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
           +       Y  F S + +E +  +A+D   L W++  ++ +VV  + H  I+ F  N  K
Sbjct: 398 VQYLPAYCYEGFPSTSVYESVVAHARDRNGLLWEDANRKMIVVGGEHHEAIRAFLQNINK 457


>gi|301098671|ref|XP_002898428.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
 gi|262105199|gb|EEY63251.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
          Length = 483

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 136/450 (30%), Positives = 223/450 (49%), Gaps = 75/450 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWL------SKTHAKEHKEATDTLTELR------- 48
           L  +A+ +V+RLL     VP+ ++  W+      +K    +   A   L  LR       
Sbjct: 32  LPSLAQQFVMRLLPTNSAVPRDLLEQWVVPIPGETKRMPPQFNAALQKLEGLRVFVQQDG 91

Query: 49  ---------RNAKVALLG-GGKPWS-----MSAKLEVDSKARDLDFLNQYALERWECILR 93
                    +    AL   GG PW      +    E    A DL+   +YA  RW+ +L 
Sbjct: 92  GYKPHPTFQKQLMYALSNLGGSPWERGRLHLPRDPENTFAAVDLE---RYARARWDLVLH 148

Query: 94  FMVGSQQTEGISADAVRTLFN------EGAENPV--ITKDGFQFLLLETPAQVWYFILKY 145
           +MVGS   +      V  L         GA++    IT  G++F+L +   Q+W F+L+Y
Sbjct: 149 YMVGSTAVQEPPQSVVDILLRTKLLQASGADSRALHITDTGYEFMLKDIHVQMWIFMLEY 208

Query: 146 LETVESKG-LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKA 204
           + T+++ G L   + L FLFQ+ +   G  Y+   +++  ++ L    +FGL++++++ +
Sbjct: 209 IRTLDNTGTLKQEDILQFLFQISYCQTGGYYAVADLTETQRLLLGDFIDFGLLFRKRQNS 268

Query: 205 GRFYPTKLALNM--------------------ATRGTLK-QIREPG-----------FLI 232
            RFY T LA+N+                      R  +K Q+ +P             ++
Sbjct: 269 DRFYTTSLAVNLIFGGSTGQKRSHVSLTSSFAGVRAGMKSQVADPRQAPTVDHGARLLVV 328

Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
           VETNF++YAYT S L VA++ +F +++ R PNL +G +TR+S+R+AL  GI+A QI  FL
Sbjct: 329 VETNFKIYAYTSSTLHVAMLSVFVDIVARLPNLAIGFITRESLRSALIHGISAQQIYDFL 388

Query: 293 RLHALPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDL 349
             HA P +   S  +P  ++DQI LWE ER R+   EG+L+  F ++ D+E +RDYA+ L
Sbjct: 389 MKHAHPKMRRNSPVIPENIADQIYLWERERNRVQFLEGILFDGFNTKEDYESVRDYAKGL 448

Query: 350 GVLTWQNEKKRTVVVTKQGHSDIKKFWVNH 379
            VLTW +     + +   G  +++ F  N 
Sbjct: 449 MVLTWSDPIHFRLSIATAGIDEVRHFIQNQ 478


>gi|402083674|gb|EJT78692.1| RNA polymerase II transcription factor B subunit 2 [Gaeumannomyces
           graminis var. tritici R3-111a-1]
          Length = 496

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 134/444 (30%), Positives = 218/444 (49%), Gaps = 80/444 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLS------KTHA----------------KEHKE 39
           L  +AK +V+ LL++ +P+  A + +W        K HA                KE  +
Sbjct: 38  LPSLAKTFVMALLYMPKPLSLADLDTWAKPDARKQKDHALAILRVLHIVLITAPSKEKSQ 97

Query: 40  ATDTLTELRRNAKVALLGGG---KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV 96
                   +++ ++AL GGG     + + + L V     D+ FL+++A  RW+ IL ++V
Sbjct: 98  EVSLTPNFKKSLRLALEGGGTSHNTFGVPSSLPVPPNV-DIPFLDKFARTRWDAILHYVV 156

Query: 97  GSQQTEGISADAVRTLFNEGAE--------------------NPVITKDGFQFLLLETPA 136
            S + +  +    + L + G++                    +  I+K GF FLL E+ A
Sbjct: 157 NSVEGDEFAQRPSKRLHSGGSKLQDTVKELLVAGGLVERRGGSISISKTGFTFLLQESNA 216

Query: 137 QVWYFILKYLETVESKGLD----LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
           QVW  +L++LE V + G D     V+ L+FLF L     G  Y TE +S++ +  L  L 
Sbjct: 217 QVWTLLLQWLEAVNAAGGDHSAMAVDMLSFLFMLGTLELGQAYDTEELSEQRRNMLPSLV 276

Query: 193 EFGLVYQRKRKAGRFYPTKLALNMA----------------------TRGTLKQIREPGF 230
           +FGLVY       +++PT+LA  +                         G  +     G 
Sbjct: 277 DFGLVYIPPGNTSQYFPTRLATTLTSGSSALRSASSALAAATAEGPNASGGGQSEAAKGS 336

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           +I+ETN+R+YAYT + L++A++GLF EL +RF  +  G L R+S++ A+  GITA Q+I 
Sbjct: 337 IIIETNYRLYAYTSTPLQIAILGLFAELRFRFAGMVTGRLDRESIKRAISYGITADQVIE 396

Query: 291 FLRLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKL 342
           +L  HA   +           LPP V DQIRLW+ E ER+    G L+  F SQA+++ L
Sbjct: 397 YLAAHAHEQMHRTATLRKKPVLPPTVVDQIRLWQLETERMKIMRGFLFRDFDSQAEYDDL 456

Query: 343 RDYAQDLGVLTWQNEKKRTVVVTK 366
             YA ++GVL W+++ ++    +K
Sbjct: 457 AKYADEIGVLLWRSDARQLFFASK 480


>gi|290987279|ref|XP_002676350.1| predicted protein [Naegleria gruberi]
 gi|284089952|gb|EFC43606.1| predicted protein [Naegleria gruberi]
          Length = 532

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 117/323 (36%), Positives = 186/323 (57%), Gaps = 26/323 (8%)

Query: 78  DFLNQYALERWECILRFMVGSQ-QTEGISADAVRTLFN-----EGAENPVITKDGFQFLL 131
           +FL +Y+  +WE +L ++VG+  ++E   +  V  L       + ++   IT +GFQFLL
Sbjct: 204 EFLQEYSKRKWENVLLYLVGTAAESEQTFSGNVNELLKYSQLIKKSDTVRITNEGFQFLL 263

Query: 132 LETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHL 191
            ET  QVW  +  YLET   +     E L F+F+L F   G +YS++ ++   +  L   
Sbjct: 264 QETKVQVWKLLKHYLETSGQRNQVKNEILNFIFELGFLDVGKEYSSKDLTSTQKSLLVDF 323

Query: 192 REFGLVYQRKRKAGR-----FYPTKLA----LNMATR-------GTLKQIREPGFLIVET 235
            + G++Y  + K  +     F+PT LA    ++M+T        G+   I   G++IVET
Sbjct: 324 NDLGIIYLHRDKKKKIKDKYFFPTPLAKSLTVSMSTSYDLISSFGSHNSINN-GYIIVET 382

Query: 236 NFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
           N+RVYAYT+S L++AL+ LF    YR PN+ VG++TR ++R AL++GI+A QI+ FLRL+
Sbjct: 383 NYRVYAYTNSPLQIALLSLFIFPEYRLPNMVVGLITRSTIREALKNGISAHQILQFLRLN 442

Query: 296 ALPSVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
           A P +      +P  VSDQI LWE ER R+  +  V+Y +F + ++     DYA+  G L
Sbjct: 443 AHPQMRLKKPVIPDTVSDQILLWEKERNRVIKTTSVVYDKFTNVSELNATVDYARRQGAL 502

Query: 353 TWQNEKKRTVVVTKQGHSDIKKF 375
            W +E+K  +V  ++ H  +K+F
Sbjct: 503 LWHSEEKMMMVCKREFHGLMKEF 525


>gi|302917488|ref|XP_003052448.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256733388|gb|EEU46735.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 496

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 138/451 (30%), Positives = 226/451 (50%), Gaps = 80/451 (17%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
           L  +AK +V+R+L+  +P+    +  W+  +  ++  +A   L                 
Sbjct: 41  LPHLAKTFVMRILYSPKPILLTDLDDWVKPSAKRQKDQALSILRVLYIVQITAPSKERPQ 100

Query: 45  -----TELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVG 97
                   + + ++AL GGG    + + + L+V  +  D+ FL++YA  +WE IL F+V 
Sbjct: 101 EMQLTANFKTSLRLALSGGGTHNSFGVPSTLQVPPEI-DIIFLDRYARRKWEDILHFVVS 159

Query: 98  S--QQTEGISA---DAVRTLFNEG-----AENPV--ITKDGFQFLLLETPAQVWYFILKY 145
           S   ++ G S+    +V+ L   G       N +  IT+ GF FLL E  AQVW  +L +
Sbjct: 160 SVGYKSTGDSSGPNKSVKELLIAGRLVDRKPNGMVGITQAGFTFLLQEANAQVWTLLLLW 219

Query: 146 LETVE---SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKR 202
           LE ++     GL+  + L+FLF L     G  Y T  ++++ +  L  L +FGL+Y    
Sbjct: 220 LEAMDVNKGAGLEATDMLSFLFVLASMELGRAYDTNALTEQRKNMLPSLVDFGLIYIPNH 279

Query: 203 KAGRFYPTKLALNMATRG-TLKQIREP-----------------------------GFLI 232
           K   F+PT+LA  + + G +L+ I +                              G +I
Sbjct: 280 KRSMFFPTRLATTLTSSGNSLRSISDGVAAATAAALQPGQSGGSGGSTTGSGQEQRGSVI 339

Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
           +ETN+R+YAYT S L++A++ LF +L  RFP++  G L+R S+R A+  GITA QII +L
Sbjct: 340 IETNYRIYAYTQSTLQIAVLALFSKLSMRFPDMVAGRLSRQSIRQAINFGITADQIISYL 399

Query: 293 RLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
             HA   +       + P LPP V DQIRLW+ E ER+ T+ G L+  F    ++  +  
Sbjct: 400 AAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTSGFLFKDFEDHKEYMAVAG 459

Query: 345 YAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           +A+++GVL W+N+ K     +K  H  I+ +
Sbjct: 460 FAEEVGVLVWRNDVKGMFFASK--HEQIRDY 488


>gi|452839401|gb|EME41340.1| hypothetical protein DOTSEDRAFT_156149 [Dothistroma septosporum
           NZE10]
          Length = 497

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 134/431 (31%), Positives = 211/431 (48%), Gaps = 67/431 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEH--KEATDTLTELR----------- 48
           L  +AK  V+ +L++  P   A + +W+      +    +A  TL  LR           
Sbjct: 38  LPHLAKTLVMAMLYMPTPFAAADLDTWIRPDMPSQQARDKALSTLQRLRILFDDQHNGKP 97

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMVGS 98
                    ++ ++AL GGG   S        D  +   ++L+ +A  +WE IL ++VGS
Sbjct: 98  AYKLSPAYGKSLRLALTGGGDHRSFGVPCSTPDKTSISTEYLDTFARRQWEAILYYVVGS 157

Query: 99  QQTEGISAD-----AVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKY 145
                +S +       ++L  +G         +  +I+  GF FLL +  AQVW  ++ Y
Sbjct: 158 AN-RALSGEVDISPGTKSLLQKGEFVNVMSGGKQRMISTTGFTFLLQDVNAQVWSLLIVY 216

Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
           LE   S  +D VE L+FLF L     G  YST  ++   Q  L  L +FGL+Y+R   + 
Sbjct: 217 LEVSSSLRMDPVEVLSFLFTLGSLELGISYSTANLTPTQQQMLDDLADFGLIYRRTPDSH 276

Query: 206 RFYPTKLALNMA------TRGTLKQIR----------------EPGFLIVETNFRVYAYT 243
           R+YPT+LA  +       T  +L +                  E GF+I+ETN+R+YAYT
Sbjct: 277 RYYPTRLATTLTSDAPALTNNSLMKTTVMTGDPSTDLAASASDEKGFIILETNYRLYAYT 336

Query: 244 DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--- 300
            S L ++++ LF  L  R+PNL    +T+ S + A+ +GIT+ QII +L  HA P +   
Sbjct: 337 SSPLLISILSLFASLNTRYPNLITAKITKASTQNAIAAGITSDQIIDYLTTHAHPVLRRQ 396

Query: 301 ----SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
                 P LPP V DQI+LW+ E ER+  ++G L     SQ +++K  +YA  +GVL+ +
Sbjct: 397 AAANDAPILPPTVVDQIKLWQMEGERMEATKGYLIRDIGSQEEYDKAVNYADAIGVLSKE 456

Query: 356 NEKKRTVVVTK 366
              K +  VT+
Sbjct: 457 FRHKGSFFVTR 467


>gi|320581720|gb|EFW95939.1| TFIIH basal transcription factor complex subunit 2 [Ogataea
           parapolymorpha DL-1]
          Length = 544

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 140/486 (28%), Positives = 222/486 (45%), Gaps = 115/486 (23%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW-----------LSKTHAKE---HKEATDTLTEL 47
           L  +AK Y++ +LF +  +P   +  W           LSK +  +     +A + L E 
Sbjct: 47  LPSLAKFYIMTMLFQDAAIPSTDLNRWIKSSSFNGSRNLSKIYQNDSLKRLKALNLLKET 106

Query: 48  RR------------------------NAKVALLGGGKPWSMSAKLEVDS-KARDLDFLNQ 82
           RR                        + + AL G      +     ++S +   ++FL+ 
Sbjct: 107 RRIFTHPQTGQTTQLPFVSLNPTFRQSFRNALTGSRDANEVVEDNNLESIETISVEFLDT 166

Query: 83  YALERWECILRFMVGSQQTEGISADAVRTLFNEGA------------------------- 117
           Y L RWE IL FMVGS+  E  S   +  L   G                          
Sbjct: 167 YCLHRWESILHFMVGSETKELPSVGVLTLLRYSGLMELPSDREQREHLGDDYEFQPKLQQ 226

Query: 118 -------ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFST 170
                  +N +IT++GFQFLL +  +Q+W  +L+YL+  E   ++ VE L F+F L    
Sbjct: 227 SSSIQTLKNLLITQNGFQFLLQDINSQIWTLLLQYLKMSEKLMMNPVEVLNFIFMLGSLE 286

Query: 171 FGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGR-FYPTKLALNMATRGT-------- 221
            G  Y  + +SD  ++ L  L ++GL+Y   +   + FYPT+LA  + +  T        
Sbjct: 287 LGQGYPVDPLSDTQKIMLDDLIDYGLIYTPSKTNKKIFYPTRLATTLTSEHTNFKTSAAV 346

Query: 222 -------LKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
                   K     G ++VETNF++Y YT S L++A++ LF  L  RF N+  GV+TR+S
Sbjct: 347 IDQEITNSKNSNNQGTIVVETNFKIYCYTSSPLQIAILNLFVHLKARFANMVTGVITRES 406

Query: 275 VRAALRSGITAAQIIGFLRLHALP------------------SVSCP----------LPP 306
           VR AL +GITA Q+I +L  HA P                  S+             LPP
Sbjct: 407 VRRALINGITADQMINYLGSHAHPWMVKQAEDKLARKLELESSIGNAAGQRQIDLEILPP 466

Query: 307 VVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
            V DQI+LW+ E +R+ + +G LY  F +  +FEKL  Y +++GV+ W++  ++   VT+
Sbjct: 467 TVVDQIKLWQLELDRIQSFKGYLYKDFSTDLEFEKLLTYGEEIGVIVWKDRARKRFFVTQ 526

Query: 367 QGHSDI 372
           +G+  +
Sbjct: 527 EGNGQL 532


>gi|403180193|ref|XP_003338499.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375166015|gb|EFP94080.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 679

 Score =  200 bits (509), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 106/263 (40%), Positives = 157/263 (59%), Gaps = 10/263 (3%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
           IT  GF FLL +   Q+W  +L+YL+  E+ GLD+V+ L  LF L     G +YS    +
Sbjct: 37  ITSKGFGFLLEDVNTQLWDILLQYLKMTEANGLDVVDVLACLFMLGSLELGQEYSFSNWT 96

Query: 182 DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTL-----KQIREPGFLIVETN 236
                 LQ L ++GLV        RFYPT+LA  + +         +   E GFL++ETN
Sbjct: 97  PTQTQVLQDLVDYGLVL--VSAPDRFYPTRLATTLTSTAPPLVSAERAQEEHGFLVLETN 154

Query: 237 FRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA 296
           +R+YAYT + L++A++ LF  L YRFPNL VG +TR+S+++AL +GITA Q+I +L  HA
Sbjct: 155 YRIYAYTSNPLQIAVLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQVIMYLHTHA 214

Query: 297 LPSV---SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLT 353
            P +      LPP V DQIRLWE E+ R+   EG LY  F S A+++ +  Y++ LG++ 
Sbjct: 215 HPQMRKLEPLLPPTVVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQYSRKLGIVL 274

Query: 354 WQNEKKRTVVVTKQGHSDIKKFW 376
           W++   R + V   GH  +++F+
Sbjct: 275 WEHAGLRKLFVGYDGHLTLREFF 297


>gi|346323394|gb|EGX92992.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Cordyceps militaris CM01]
          Length = 511

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 139/455 (30%), Positives = 213/455 (46%), Gaps = 103/455 (22%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
           L  +AK  V+R+L++  P+  A + +W+     ++  +   TL                 
Sbjct: 41  LPHLAKTIVMRMLYMSGPMSLADLDAWVKPEAKRQKDQMLSTLRSLHIVQISAPAKDKTQ 100

Query: 45  -----TELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
                T  R + ++AL GGG       P S++   E+D+      FL++YA  RW+ IL 
Sbjct: 101 EVQLTTNFRTSMRLALTGGGNHNSFGVPSSLTVPPEIDAP-----FLDRYARRRWDEILH 155

Query: 94  FMVGS--------------QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLL 132
           F+V S                  G    +VR +   G       + +  IT+ GF FLL 
Sbjct: 156 FVVSSVGFKSTATTAAGGEGGGGGGPNKSVRDILIAGRLVDRRPSGSVGITQAGFTFLLQ 215

Query: 133 ETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
           E  AQVW  +L +L+  ++     V+ L+FLF L     G  Y T  +S++ +  L  L 
Sbjct: 216 EANAQVWTLLLLWLDAQDADKNAAVDTLSFLFMLASLEPGRAYDTHALSEQRRDMLPALA 275

Query: 193 EFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-------------------------- 226
           +FGLVY   +K   F+PT+LA  + T G    +R                          
Sbjct: 276 DFGLVYIPPQKRSMFFPTRLATTL-TSGASSGLRSISDGVAAATSAARSSPSSSAHQPAS 334

Query: 227 --------------EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
                         +PG +++ETN+R+YAYT S L++A++ LF  L  RFP++  G LTR
Sbjct: 335 SSAGSGPLGGGSGDQPGAVVIETNYRLYAYTQSTLQIAVLALFARLTMRFPDMVAGRLTR 394

Query: 273 DSVRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTT 324
            S+R A++ GITA QII +L  HA   +       + P LPP V DQIRLW+ E ER+ T
Sbjct: 395 TSIRQAIQFGITADQIIAYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKT 454

Query: 325 SEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
           + G L+  F    ++   R +A ++GVL W ++KK
Sbjct: 455 TPGFLFRDFDDPKEYHDTRRFADEIGVLVWSDDKK 489


>gi|355693901|gb|AER99490.1| proteinral transcription factor IIH, polypeptide 4, 52kDa [Mustela
           putorius furo]
          Length = 265

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 106/219 (48%), Positives = 141/219 (64%), Gaps = 30/219 (13%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR            
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 49  ---------RNAKVALLGGGKPWSM-SAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    +N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG-------AENPVITKDGFQFLLLETPAQVWYFILKYLETVES 151
             +  +S D  + L   G        E P IT  GFQFLLL+TPAQ+WYF+L+YL+T +S
Sbjct: 168 P-SAAVSQDLAQLLSQAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQS 226

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQH 190
           +G+DLVE L+FLFQL FST G DYS EGMSD L  FLQH
Sbjct: 227 RGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQH 265


>gi|171681598|ref|XP_001905742.1| hypothetical protein [Podospora anserina S mat+]
 gi|170940758|emb|CAP66407.1| unnamed protein product [Podospora anserina S mat+]
          Length = 540

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 135/430 (31%), Positives = 214/430 (49%), Gaps = 75/430 (17%)

Query: 25  IASWLSKTHAKEHKEATDTLTELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQ 82
           IA     T AK+ ++   T    +++ ++AL GGG    + + + L  D K   + +L+Q
Sbjct: 115 IAQLSQSTRAKKAQDIQLT-ANFKKSLRLALEGGGSHNSFGVPSTLPTDPKIH-IQYLDQ 172

Query: 83  YALERWECILRFMVGSQQTEG------------ISADAVRTLFN------EGAENPV-IT 123
           +A   W+ IL ++V S   +                 AVR L        EGA   V I+
Sbjct: 173 WAGRIWQDILYYVVNSVPMKANESGGRHGSGGGGPKRAVRELLKMGGLVREGAGGLVQIS 232

Query: 124 KDGFQFLLLETPAQVWYFILKYLET--------------VESKGLDLVECLTFLFQLKFS 169
           + GF FLL E  AQVW  +L +LE               +    +D VE L+FLF L   
Sbjct: 233 EHGFNFLLQEANAQVWTLLLLWLEAADRNKALAKEQGTDITGTAIDNVEMLSFLFMLASL 292

Query: 170 TFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR--- 226
             G  Y T  +++  +  L  L +FGL+Y  + +  ++YPT+LA  + +  T++ +    
Sbjct: 293 ELGRAYDTSALTETRKNMLPALADFGLIYIDRDRPQQYYPTRLATTLTSLSTMRSVSASI 352

Query: 227 -------------------------EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYR 261
                                    E G ++VETN+R+YAYT S L++A++ LFC L  R
Sbjct: 353 DAATKKTPGDAGSLGADSTPTAPADENGGIVVETNYRIYAYTSSPLQIAILKLFCRLHMR 412

Query: 262 FPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--------SCPLPPVVSDQIR 313
           FPN+    LTR+SV+ A++ GITA QII +L  HA P +        +  +PP V DQIR
Sbjct: 413 FPNMVTARLTRESVQEAIKEGITANQIIDYLVAHAHPQMRRAAAARGTTVIPPTVMDQIR 472

Query: 314 LWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIK 373
           LW+ E +R+  + G  +  F S  ++ +L +YA ++GVL W++++K T  V+K     I+
Sbjct: 473 LWQLESQRMQKTPGFQFKDFESVEEYRQLAEYATEIGVLVWKDDRKGTFFVSKV--EQIR 530

Query: 374 KFWVNHQKGS 383
           +F    +KG+
Sbjct: 531 EFLKARKKGN 540


>gi|453082965|gb|EMF11011.1| RNA polymerase II transcription factor B subunit 2 [Mycosphaerella
           populorum SO2202]
          Length = 507

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 140/431 (32%), Positives = 210/431 (48%), Gaps = 66/431 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTH-AKEHKE-ATDTLTELR----------- 48
           L  +AK  V+ LL++ QP   A + +W+   H + + KE A   L  L+           
Sbjct: 38  LPHLAKTMVMALLYLPQPFAAADLDAWVKPDHDSTQAKERALSILQRLKIVYDEPYQGRP 97

Query: 49  ---------RNAKVALLGGGKPWSMSAKL-EVDSKARDLDFLNQYALERWECILRFMVGS 98
                    ++ + AL G G   S      E D     +D+L+ +A  +WE IL ++VGS
Sbjct: 98  AYRLSPAFGKSLRSALTGSGTHNSFGVPCTEPDKHPVTVDYLDTFARTQWEAILYYVVGS 157

Query: 99  QQTEGIS----ADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYL 146
                      +   RTL   G          +  IT DGF FLL +  AQVW  ++ YL
Sbjct: 158 ASAGLGGGVDISSGTRTLLETGKFVVVKGGGRHRAITTDGFTFLLQDVNAQVWSLLIVYL 217

Query: 147 E-TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
           E + +   +D V+ L+FLF L     G  YST  ++      L  L  FGLVY+R   + 
Sbjct: 218 EVSGDLLQMDSVDVLSFLFTLGSLELGVSYSTSNLTPTQLQMLDDLTNFGLVYRRSPSSD 277

Query: 206 RFYPTKLALNMATRG------------TLKQIREP----------GFLIVETNFRVYAYT 243
           R+YPT+LA  + +              T  Q  +P          G++I+ETN+R+YAYT
Sbjct: 278 RYYPTRLATTLTSDAPALPNNSFTTTTTTNQNADPNDPAATASEKGYIILETNYRLYAYT 337

Query: 244 DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--- 300
            S L ++++ LF  L  R+PNL    +T+ S+  A+ SGIT+ QII +L+ HA P +   
Sbjct: 338 SSPLPISILSLFASLNTRYPNLITAKITKTSIHTAIASGITSDQIISYLQTHAHPILRRQ 397

Query: 301 ----SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
               S P LPP V DQIRLW+ E ER+ +++G L     ++ D+ K   YA+ LG+L  +
Sbjct: 398 AALNSAPILPPTVVDQIRLWQIEGERMKSTKGYLIRDVGTEDDYTKAVQYAEALGILVKE 457

Query: 356 NEKKRTVVVTK 366
              +R   VT+
Sbjct: 458 FHHRRAFFVTR 468


>gi|330797174|ref|XP_003286637.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
 gi|325083385|gb|EGC36839.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
          Length = 471

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 134/431 (31%), Positives = 228/431 (52%), Gaps = 62/431 (14%)

Query: 6   AKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR----------------- 48
           AK Y+ ++L V+   P  +   W   +   +HKEA   L +L+                 
Sbjct: 36  AKQYIFKMLMVD-TYPLTIAKDWSQHSSNHQHKEALKKLFDLKIILLNRGKQQSEQSIQL 94

Query: 49  -----RNAKVALLGGGKP-WSMSAKLEVDSKARDLDFLNQYALERWECILRFMV--GSQQ 100
                 N K AL+   K  +S S+ ++  +K  +++ L+ Y+ ++WE +L F+    +Q 
Sbjct: 95  NPLFQENIKSALVEFDKVIFSNSSNIKDTNKIPNINDLDNYSKKQWEQVLYFLSDDAAQP 154

Query: 101 TEGISADAVRTLFNEGAENPV--ITKDGFQFLLLETPAQVWYFILKYLETVESK---GL- 154
           ++ IS   + +      ++    IT DGF+FLL +   Q+W  ++ YL  +E K   GL 
Sbjct: 155 SKFISDLLLSSNLTRRDQDGSLSITSDGFKFLLKDVYTQIWTLLIVYLNDLEKKRREGLT 214

Query: 155 --DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL 212
                + L FLF+L F   G  Y    +++  + +L  L++FGL+Y +   +  FYPT+L
Sbjct: 215 QGSRNDLLGFLFRLSFLQLGKGYLVGELTELQKEYLICLKQFGLIYMKSDASLFFYPTRL 274

Query: 213 ALNMATRGTLKQI-------------------------REPGFLIVETNFRVYAYTDSNL 247
            +++ T  TL  I                         +E G++++ETN+R+YAYT S+L
Sbjct: 275 IISLTTGKTLSLIQTLAVDKINSSSNSANTVASSTIEKKESGYIVLETNYRLYAYTASSL 334

Query: 248 KVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SCPL 304
           +++L+ LF +++YR PNL VG++TR+S+R AL  GITA QII F+R +A P+      P+
Sbjct: 335 QISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGITADQIIDFIRHNAHPNAVNNGQPI 394

Query: 305 PPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVV 364
           P VV++QI LWE ER R+T ++ VLY+ F +   +     +A++     W N++ +T+VV
Sbjct: 395 PDVVAEQILLWEQERNRITYTKSVLYNSFPTTDCYHATLKFAKEQDYYIWSNDQLKTLVV 454

Query: 365 TKQGHSDIKKF 375
            + G+  I+ F
Sbjct: 455 NENGNDPIRNF 465


>gi|346972761|gb|EGY16213.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
           dahliae VdLs.17]
          Length = 476

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 140/435 (32%), Positives = 213/435 (48%), Gaps = 67/435 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATD-TLTELR------------ 48
           LS +AK +V  LL++  P+P A + + +      E+K A D  L +LR            
Sbjct: 41  LSPLAKTFVTMLLYLPGPIPIADLEARVKP----EYKRAKDHALAQLRSLHMLQMSVPTQ 96

Query: 49  -------------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRF 94
                        ++ + AL G G P S         S    ++ + ++A ++W+ IL +
Sbjct: 97  GAPQMIQLTANFSKSYREALEGNGAPGSFGVISPHPRSPEITIESIEKHARQKWDAILHY 156

Query: 95  MVGSQQT----EGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILK 144
           +V S        G   ++V+ L   G        +  IT+ GF FLL E  AQVW  +L 
Sbjct: 157 IVNSVTPGFVDSGGPKNSVKELLLAGQLVKRQGGSTTITQAGFTFLLQEENAQVWTLLLL 216

Query: 145 YLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKA 204
           +L+  + +     + L+FLFQL        Y TE +S + Q  L  L +FGLVY    K 
Sbjct: 217 WLDATDQQTAKHTDMLSFLFQLSILELDEVYDTEALSKERQDMLPSLMDFGLVYIPSHKP 276

Query: 205 GRFYPTKLA-----LNMATR---GTLKQIREP--------GFLIVETNFRVYAYTDSNLK 248
            +++ T  A     ++ ATR     +  + +P          +IVETN+RVYAYT S L+
Sbjct: 277 FQYFVTPQARALTNVSTATRTLTDGMNALSQPQGGPGDQTNGIIVETNYRVYAYTTSQLQ 336

Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------- 300
           +A++ LFC L  +FP +  G LTR SVR A+  GITA QII +L  HA   +        
Sbjct: 337 IAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGITADQIISYLAAHAHEQMHRQAAADG 396

Query: 301 SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKR 360
              LPP V DQIRLW+ E ER+ T+ G L+  F   A++  +  +A ++GVL W+N++KR
Sbjct: 397 KPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFDDDAEYRDIARFADEIGVLAWRNDRKR 456

Query: 361 TVVVTKQGHSDIKKF 375
               +K  H  I+ +
Sbjct: 457 MFFASK--HEQIRDY 469


>gi|302405781|ref|XP_003000727.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
           albo-atrum VaMs.102]
 gi|261360684|gb|EEY23112.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
           albo-atrum VaMs.102]
          Length = 476

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 143/435 (32%), Positives = 214/435 (49%), Gaps = 67/435 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATD-TLTELR------------ 48
           LS +AK +V  LL++  P+P A + + +      E+K A D  L +LR            
Sbjct: 41  LSPLAKTFVTMLLYLPGPIPIADLEARVKP----EYKRAKDHALAQLRSLHMLQMSVPTQ 96

Query: 49  -------------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRF 94
                        ++ + AL G G P S         S    ++ + ++A ++W+ IL +
Sbjct: 97  GAPQMIQLTANFSKSYREALEGNGAPGSFGVISPHPRSPEITIESIEKHARQKWDAILHY 156

Query: 95  MVGSQQT----EGISADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILK 144
           +V S        G   ++V+ L   G        +  IT+ GF FLL E  AQVW  +L 
Sbjct: 157 IVNSVTPGFVDSGGPKNSVKELLLAGQLVKRQGGSTTITQAGFTFLLQEENAQVWTLLLL 216

Query: 145 YLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKA 204
           +L+  + +     + L+FLFQL        Y TE +S + Q  L  L +FGLVY    K 
Sbjct: 217 WLDATDQQTAKHTDMLSFLFQLSILELDEVYDTEALSKERQDMLPSLMDFGLVYIPSHKP 276

Query: 205 GRFYPTKLA-----LNMATR------GTLKQIRE-PG----FLIVETNFRVYAYTDSNLK 248
            +++ T  A     ++ ATR        L Q +  PG     +IVETN+RVYAYT S L+
Sbjct: 277 FQYFVTPQARALTNVSTATRTLTDGMNALSQSQGGPGDQTNGIIVETNYRVYAYTTSQLQ 336

Query: 249 VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------- 300
           +A++ LFC L  +FP +  G LTR SVR A+  GITA QII +L  HA   +        
Sbjct: 337 IAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGITADQIISYLAAHAHEQMHRQAAADG 396

Query: 301 SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKR 360
              LPP V DQIRLW+ E ER+ T+ G L+  F   A++  +  +A ++GVL W+N++KR
Sbjct: 397 KPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFDDDAEYRDIARFADEIGVLAWRNDRKR 456

Query: 361 TVVVTKQGHSDIKKF 375
               +K  H  I+ +
Sbjct: 457 MFFASK--HEQIRDY 469


>gi|384246869|gb|EIE20357.1| Tfb2-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 451

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 136/415 (32%), Positives = 214/415 (51%), Gaps = 41/415 (9%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L+ +AK YVIR+LF++ PV ++V+ SW++++   +H  A + L  L              
Sbjct: 32  LTSLAKQYVIRMLFLDVPVTKSVMDSWIAQSSKSKHATAINRLKGLQLLLPSNVASNGQQ 91

Query: 48  ---------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                    R   + A+  GGK         V  +A   + L+ YA ++WE I   ++  
Sbjct: 92  AAYQLHPVFREQLRWAVSTGGKVGIDEVPAGVLQQAPSREMLSSYAQQQWEVITVPLLNQ 151

Query: 99  QQTE---GISADAVRTLF-----NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVE 150
            +     G S  +++TL      +  AE+  I + GFQFLL +T +Q+W  +  Y+ + E
Sbjct: 152 LKCTCGLGSSFASLQTLMLYLVGSICAESRSIEEQGFQFLLSDTYSQLWRLLRAYIASGE 211

Query: 151 SK-GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY- 208
            + G  L   L FL QL F   G+ ++  G+ D  +     + + GL+     K G  + 
Sbjct: 212 ERSGAPLGTILNFLLQLGFREVGSPFALSGLDDSQRHIAADMAQLGLLMPFTAKDGSVWL 271

Query: 209 -PTKLALNMA--TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
            PT+LAL +A  + G  +     GF++VETN+RVYAYT S L+ AL+ LF       PNL
Sbjct: 272 APTRLALALAGGSSGQAQHDVTDGFVVVETNYRVYAYTSSLLQTALLRLFTRCECILPNL 331

Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC---PLPPVVSDQIRLWEGERERL 322
            VGVLTR+SV  AL  G++A QI+ +LR HA P V+     +P VV+DQ+RLW+ +  R+
Sbjct: 332 FVGVLTRESVTGALACGLSADQIVLYLRQHAHPHVASRTPVVPEVVADQVRLWQADTMRV 391

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRT--VVVTKQGHSDIKKF 375
             +  VLY  F S   F+     A+ LGV  W++ K     + V + GH  ++++
Sbjct: 392 RHNRAVLYDDFPSAQVFQLSAQKARTLGVWLWEDPKAGMGRLAVQEAGHDAMREY 446


>gi|449296697|gb|EMC92716.1| hypothetical protein BAUCODRAFT_259506 [Baudoinia compniacensis
           UAMH 10762]
          Length = 489

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 133/413 (32%), Positives = 197/413 (47%), Gaps = 62/413 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHA--KEHKEATDTLTELR----------- 48
           L  +AK  V+ +L++ +P   A +  W+       +    A   L  LR           
Sbjct: 39  LPHLAKTLVMAMLYMPKPFAVADVDVWIKPDAEGLQARDRALSILQRLRILFDTRDEKDG 98

Query: 49  -----------RNAKVALLGGGKPWSMSAKLEV-DSKARDLDFLNQYALERWECILRFMV 96
                      R+ ++AL GGG   S        D +     +L+ +A ++WE IL ++V
Sbjct: 99  RPAYKLSDGFARSLRMALTGGGNHRSFGVPSNAPDKQPVSGSYLDAFARQQWEAILYYVV 158

Query: 97  GSQQTE-GISADAV----RTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKY 145
           GS     G S DA+    + L  +G           IT+ GF FLL E  AQ+W  ++ Y
Sbjct: 159 GSAHAGLGGSTDAISAGTKQLLQKGEFVSVRGREAHITQRGFTFLLQEINAQIWTLLIVY 218

Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
           LE   +  +D V+ L+FLF L     G  YST  ++   Q  L  L +FGLVY+R     
Sbjct: 219 LEVSGALHMDPVDVLSFLFTLGSLELGISYSTTNLTPTQQQMLDDLSDFGLVYRRSNDEA 278

Query: 206 RFYPTKLALNMATRGTL-----------------------KQIREPGFLIVETNFRVYAY 242
           R+YPT+LA  + +                               E G++I+ETN+R+YAY
Sbjct: 279 RYYPTRLATTLTSDAPALPNTSLTSTTSTTTLASSSTDPSASANEKGYIILETNYRLYAY 338

Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-- 300
           T S L + ++ LF  L  R+PNL    LT+ S+++A+ SGIT+ QII +L  HA P +  
Sbjct: 339 TSSPLPITILSLFATLKTRYPNLITAKLTKSSIQSAIASGITSDQIITYLTTHAHPILRR 398

Query: 301 -SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
            +  LPP V DQIRLW+ E ER+T  +G L     +  +++K   YA  LGVL
Sbjct: 399 QNPVLPPTVVDQIRLWQIEGERMTAWKGFLIRDVGTAEEYDKAVQYADALGVL 451


>gi|336270172|ref|XP_003349845.1| hypothetical protein SMAC_00733 [Sordaria macrospora k-hell]
 gi|380095234|emb|CCC06707.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 496

 Score =  197 bits (500), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 147/471 (31%), Positives = 223/471 (47%), Gaps = 103/471 (21%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWL-----SKTH-------------------AKEH 37
           LS  AK  V  LL++  P P   +  W       K H                    K+ 
Sbjct: 38  LSHHAKVIVRGLLYMPDPFPLTSLDIWFPPESGGKIHRDRTIAALRSLHVLQVSPPGKDR 97

Query: 38  KEATDTLTELRRNAKVALLGGGKPWS--MSAKLEVDSKARDLDFLNQYALERWECILRFM 95
            +     T  ++  ++A  GGG+  S  + + L VD    D+ +L++YA ++W+ IL ++
Sbjct: 98  TQEIQITTSFKKGLQLAFEGGGEDGSFGVPSSLPVDPTI-DIGYLDKYARKKWDDILHYV 156

Query: 96  VGS----QQTEGISAD--AVRTLFNEG---AENP------VITKDGFQFLLLETPAQVWY 140
           V S      +EG+S    +V+ L   G      P       IT+ GF FLL E+ AQVW 
Sbjct: 157 VNSLGVQSSSEGVSGPKASVKELLLAGRLVERRPDTRTGIGITQAGFTFLLQESNAQVWT 216

Query: 141 FILKYLETVE---SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV 197
            +L +LE  +     G+D ++ L+FLF L     GT Y T  +++  +  L  L +FGL+
Sbjct: 217 LLLLWLEAADHAADSGMDSIDMLSFLFMLSSLELGTAYDTSALTETRRNMLPSLVDFGLI 276

Query: 198 YQRKRKAGRFYPTKLALNMA-------------TRGTLKQIREPGF-------------- 230
           Y   R   +F+PT+LA  +              T  T     EPG               
Sbjct: 277 Y-FPRDTRQFFPTRLATTLTSTASALRTVSSAFTAATANPSGEPGTTTSTSGGPAGATPA 335

Query: 231 --------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRS- 281
                   +I+ETN+R+YAYT S L++A++ LFC+L  RFPN+         V AALR+ 
Sbjct: 336 DKTGTSKGIIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNM---------VSAALRAI 386

Query: 282 --GITAAQIIGFLRLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYS 331
             GITA QII +L  HA   +           LPP V DQIRLW+ E ER+ T+ G L+ 
Sbjct: 387 GFGITADQIISYLHAHAHQQMVREATVTGKTVLPPTVVDQIRLWQLENERMQTTAGFLFK 446

Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
            F S  ++  L  YA+++GVL W+N+K++    +K  H  ++ +  + +KG
Sbjct: 447 DFESTEEYAALSRYAEEIGVLVWKNDKRQLFFASK--HEQLRDYLKSRKKG 495


>gi|358385986|gb|EHK23582.1| hypothetical protein TRIVIDRAFT_36912 [Trichoderma virens Gv29-8]
          Length = 492

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 145/453 (32%), Positives = 223/453 (49%), Gaps = 86/453 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------LSKTH-------AKEHKE 39
           L  +AK +V+R+LF+  P+    + +W               L   H       +KE  +
Sbjct: 39  LPHLAKTFVMRMLFLPHPMTLTDLDAWVKPEAKRKKDQSLSILRSLHIVQISAPSKEKPQ 98

Query: 40  ATDTLTELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
               ++  +++ ++AL GGG       P S+    E+D     L FL++YA ++WE +L 
Sbjct: 99  EIQLMSNFKKSLRLALTGGGNHNSFGVPSSLLVPPEID-----LPFLDRYARKKWEDVLH 153

Query: 94  FMVGSQQTEGISADA------VRTLFNEG---AENPV----ITKDGFQFLLLETPAQVWY 140
           F+V S   +    D       V+ L   G      P     IT+ GF FLL E  AQVW 
Sbjct: 154 FVVSSVGYKSSVGDGSGPNKGVKELLIAGRLVDRRPNGSLGITQAGFTFLLQEANAQVWT 213

Query: 141 FILKYLETV---ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV 197
            +L +LE +   +  GLD V+ L+FLF L     G  Y T  ++++ +  L  L +FGL+
Sbjct: 214 LLLLWLEVLGNNKGSGLDPVDMLSFLFMLASLELGRAYDTNALTEERRNMLPSLVDFGLI 273

Query: 198 YQRKRKAGRFYPTKLALNMATRG-TLKQIRE--------------------------PGF 230
           Y  + K   F+PT+LA  + + G +L+ I E                           G 
Sbjct: 274 YIPQHKRSMFFPTRLATTLTSGGNSLRTISEGVTAATQSAQTSQQALGPLGGSGGEQAGS 333

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           +++ETN+R+YAYT S L++A++ LF +L  RFP++  G L+R S+R A+  GITA QII 
Sbjct: 334 VVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQIIS 393

Query: 291 FLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKL 342
           +L  HA   +       + P LPP V DQIRLW+ E ER+ T+ G L+  F    D+   
Sbjct: 394 YLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDDKDYLDT 453

Query: 343 RDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             +++++GVL W+N+  R     K  H  IK F
Sbjct: 454 ARFSEEIGVLVWKNDHARMFFANK--HEQIKDF 484


>gi|115461368|ref|NP_001054284.1| Os04g0680000 [Oryza sativa Japonica Group]
 gi|113565855|dbj|BAF16198.1| Os04g0680000 [Oryza sativa Japonica Group]
          Length = 444

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 136/437 (31%), Positives = 214/437 (48%), Gaps = 92/437 (21%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRR------------ 49
           L  +AK Y +++L+V  PV  A +  W+   +A +H+ A D L +LR             
Sbjct: 40  LPPLAKKYALQMLYVSAPVAAAAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVS 99

Query: 50  ---------NAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWE----- 89
                    N +  L+ GG       P S++A+L        L  L  +ALE+WE     
Sbjct: 100 YKMNQKFQGNMQKYLVDGGSLPREPIPSSVTARLPT------LAELESFALEQWEVRMTK 153

Query: 90  ---CILRFMVGSQQTEGISADAVRTLFNE----GAENPVIT---KDGFQ----------- 128
              CI  + + +      S      L+ +     A + +I+   +D FQ           
Sbjct: 154 AYVCI-SYCLNATNVHSPSKGTHSFLYGKIVLLAAIDQLISSRERDKFQLIHDENFSARS 212

Query: 129 --FLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQV 186
             F L+ET AQ+WY + +Y+ + E +G+D  E ++FL +L F T G  YS   ++D  + 
Sbjct: 213 SEFKLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGEAYSLNTLTDVQRN 272

Query: 187 FLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSN 246
            ++ L E GLV                        L+Q    GF++VETNFR+YAY+ S 
Sbjct: 273 AIRDLAELGLV-----------------------KLQQ----GFVVVETNFRMYAYSTSR 305

Query: 247 LKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPP 306
           L   ++ LF  + Y+ PNL VG +T++S+  A  +GITA QII FL+ +A P V+  +P 
Sbjct: 306 LHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIPA 365

Query: 307 V---VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVV 363
           V   V+DQIRLWE +R R+  +   LY  F S+  F++  DYA+D G L W++ KK  ++
Sbjct: 366 VPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLWEDAKKMRLI 425

Query: 364 VTKQGHSDIKKFWVNHQ 380
           V  + HS++  + V  +
Sbjct: 426 VRVEFHSEMHDYSVQER 442


>gi|66800623|ref|XP_629237.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
           discoideum AX4]
 gi|74850716|sp|Q54C29.1|TF2H4_DICDI RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=TFIIH basal transcription factor complex subunit 4
 gi|60462610|gb|EAL60813.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
           discoideum AX4]
          Length = 483

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 114/335 (34%), Positives = 200/335 (59%), Gaps = 21/335 (6%)

Query: 62  WSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS--QQTEGISADAVRT-LFNEGAE 118
           +S ++ ++ + K   +D L+ Y+  +WE +L F+     Q ++ IS   + + L  +  +
Sbjct: 143 FSNNSSIKDNHKPPSIDDLDSYSKSQWEKVLYFLSDDTVQPSKLISELLLSSNLTKQEGD 202

Query: 119 NPVITKDGFQFLLLETPAQVWYFILKYLETVESK----GLDLVECLTFLFQLKFSTFGTD 174
              IT +GF+FLL +   Q+W  ++ YL+ +E K         + L+FLF+L F   G  
Sbjct: 203 GLSITSEGFKFLLKDVYTQIWTLLIVYLDDLEKKKGKGSGSRNDLLSFLFRLSFLNLGRG 262

Query: 175 YSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQI--------- 225
           Y    +S++ + +L  L++FGL+Y R   +  FYPT+L +++ T  TL  I         
Sbjct: 263 YLVSELSEQQKEYLFALKQFGLIYMRTDSSILFYPTRLIISLTTGKTLSLIQSISSERTQ 322

Query: 226 --REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGI 283
             +E G++++ETN+R+YAYT S+L+++L+ LF +++YR PNL VG++TR+S+R AL  GI
Sbjct: 323 TQKEQGYIVLETNYRLYAYTSSSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGI 382

Query: 284 TAAQIIGFLRLHALPSVSC---PLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
           TA QII F+R ++ P+ +    P+P VV++QI LWE ER R+T ++ VLY+ F +   + 
Sbjct: 383 TADQIIDFVRHNSHPNAANSGQPIPDVVAEQILLWEAERNRITYTKSVLYNSFPTNDCYI 442

Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
               +A++     W ++  +T+VV ++G+  I+ F
Sbjct: 443 ATLKFAKEQDYYIWSHDPLKTLVVKEEGNDPIRNF 477


>gi|358394641|gb|EHK44034.1| hypothetical protein TRIATDRAFT_222970 [Trichoderma atroviride IMI
           206040]
          Length = 491

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 143/455 (31%), Positives = 222/455 (48%), Gaps = 91/455 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLS----------------------KTHAKEHKE 39
           L  +AK +V+R+LF+  P+    +  W+                          +KE  +
Sbjct: 39  LPHLAKTFVMRMLFMPHPMTLNDLDVWVKPEAKRKKDQSLSILRGLHIVQISAPSKEKPQ 98

Query: 40  ATDTLTELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
               ++  +R+ ++AL GGG       P S+    E+D     L FL++YA ++W+ +L 
Sbjct: 99  EIQLMSNFKRSLRLALTGGGNHNSFGVPSSLLIPPEID-----LPFLDRYARKKWDDVLH 153

Query: 94  FMV----------GSQQTEG-----ISADAVRTLFNEGAENPVITKDGFQFLLLETPAQV 138
           F+V          GS   +G     I+   V    N G     IT+ GF FLL E  AQV
Sbjct: 154 FVVSSVGYKSVGDGSGPNKGVKELLIAGRLVDRRPNGGLG---ITQAGFTFLLQEANAQV 210

Query: 139 WYFILKYLETV---ESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFG 195
           W  +L +L+ +   +  GLD V+ L+FLF L     G  Y T  ++++ +  L  L +FG
Sbjct: 211 WTLLLLWLDVLGNNKGSGLDPVDMLSFLFMLASLELGRAYDTNALTEERRNMLPSLVDFG 270

Query: 196 LVYQRKRKAGRFYPTKLALNMATRG-TLKQIRE--------------------------P 228
           L+Y  + K   F+PT+LA  + + G +L+ I E                           
Sbjct: 271 LIYIPQHKRSMFFPTRLATTLTSGGNSLRTISEGVTAATQSAQTSQQSLGPLGGGGEQQS 330

Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
           G +++ETN+R+YAYT S L++A++ LF +L  RFP++  G L+R S+R A+  GITA QI
Sbjct: 331 GSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQI 390

Query: 289 IGFLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
           I +L  HA   +       + P LPP V DQIRLW+ E ER+ T+ G L+  F    D+ 
Sbjct: 391 ISYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDFTDDKDYL 450

Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
               +++++GVL W+N+  R     K  H  IK F
Sbjct: 451 DTARFSEEIGVLVWRNDHARMFFANK--HEQIKDF 483


>gi|322697444|gb|EFY89224.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Metarhizium acridum CQMa 102]
          Length = 541

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 133/402 (33%), Positives = 201/402 (50%), Gaps = 82/402 (20%)

Query: 34  AKEHKEATDTLTELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALER 87
           +KE  +     T  + + ++AL GGG+      P ++    E+D+      FL++YA +R
Sbjct: 122 SKEKPQEMQLTTNFKNSLRLALTGGGEHNSFGVPSTLLVPPEIDTA-----FLDRYARKR 176

Query: 88  WECILRFMV---GSQQTEGISA--DAVRTLFNEG---AENPV----ITKDGFQFLLLETP 135
           WE IL F+V   G + T  +S    +V+ L   G      P     IT+ GF FLL E  
Sbjct: 177 WEDILHFVVSSVGYKSTGDVSGPNKSVKELLIAGRLVDRRPNGSIGITQAGFTFLLQEPN 236

Query: 136 AQVWYFILKYLETVESK---GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
           AQVW  +L +LE  E+    GL+ V+ L+FLF L     G  Y T  ++++ +  L  L 
Sbjct: 237 AQVWTLLLLWLEASETNKAAGLETVDMLSFLFVLASLELGRAYDTNALTEQRRNMLPSLL 296

Query: 193 EFGLVYQRKRKAGRFYPTKLALNMATRG-TLKQIRE------------------------ 227
           +FGL+Y  + K   F+PT+LA  + + G +L+ I E                        
Sbjct: 297 DFGLIYIPQHKRSMFFPTRLATALTSGGGSLRTISEGVAAATAAAGPSSHEGVAAATAAA 356

Query: 228 -----------------------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPN 264
                                   G +IVETN+R+YAYT S L++A++ LFC+L  RFP+
Sbjct: 357 GPSSQSGSGSGGAGGPLGPSADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPD 416

Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIRLWE 316
           +  G +TR S+  A+  GITA QII +L  HA   +       + P LPP V DQIRLW+
Sbjct: 417 MVAGRVTRTSIHQAINFGITADQIISYLAAHAHEQMHRTAALANKPILPPTVVDQIRLWQ 476

Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
            E ER+ T+ G L+  F    +F     +A+++GVL W+++K
Sbjct: 477 LENERMKTTGGFLFRDFEDHKEFLDTARFAEEIGVLVWRSDK 518


>gi|392594381|gb|EIW83705.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
          Length = 393

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 124/338 (36%), Positives = 180/338 (53%), Gaps = 51/338 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR----RNAKVAL-- 55
           L+ ++K  V+ LL+++ PV    +A+W+     K + EA +TL  L       AK+AL  
Sbjct: 50  LAPLSKQLVMNLLWLDSPVLAGTMAAWVVLDGKKLYDEALNTLMRLHILQSSGAKLALNP 109

Query: 56  ----------LGGGKPWSMSAKLEVDSK--ARDLDFLNQYALERWECILRFMVGSQQ--- 100
                        G   S     + D K  A  ++ L+ YA++RWE IL +MV S Q   
Sbjct: 110 TFKASMRQAITCSGTTGSFGVPAQRDDKRDAPSVETLDGYAVQRWETILHYMVSSGQGQY 169

Query: 101 ----TEGISADAVRT--LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGL 154
               T+G+     R+  + +       IT  GFQFLL     Q+W  +L+YL  VE + +
Sbjct: 170 PTKPTQGVLYLLQRSGLMASYHGSTLQITSSGFQFLLYSPRDQLWDLLLQYLHMVEERQM 229

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLAL 214
           DLVE L+F F L     G +YSTE +S+  +  L+ LR++GLV+QRK  + RF PT+LA 
Sbjct: 230 DLVEVLSFFFMLSTMELGREYSTEPLSETQKAMLEDLRDYGLVWQRKASSKRFSPTRLAT 289

Query: 215 NMATRG----TLKQIREP--------------------GFLIVETNFRVYAYTDSNLKVA 250
            + +      T    R P                    GF+++ETN+RVYAYTD+ L++A
Sbjct: 290 TLTSVSPSLPTAGGSRNPGSAPGASSVANSTGINTNDHGFIVLETNYRVYAYTDNPLQIA 349

Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
           ++ LF  L YRFPNL VG+LTR+SVR AL +GI+A QI
Sbjct: 350 VLNLFITLKYRFPNLVVGMLTRESVRRALGNGISAEQI 387


>gi|90399251|emb|CAJ86205.1| B0213E10.4 [Oryza sativa Indica Group]
          Length = 470

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 102/261 (39%), Positives = 156/261 (59%), Gaps = 30/261 (11%)

Query: 118 ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYST 177
           E P +T++GFQFLL+ET AQ+WY + +Y+ + E +G+D  E ++FL +L F T G  YS 
Sbjct: 232 EAPRLTENGFQFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGEAYSL 291

Query: 178 EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNF 237
             ++D  +  ++ L E GLV                        L+Q    GF++VETNF
Sbjct: 292 NTLTDVQRNAIRDLAELGLV-----------------------KLQQ----GFVVVETNF 324

Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
           R+YAY+ S L   ++ LF  + Y+ PNL VG +T++S+  A  +GITA QII FL+ +A 
Sbjct: 325 RMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAH 384

Query: 298 PSVSCPLPPV---VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
           P V+  +P V   V+DQIRLWE +R R+  +   LY  F S+  F++  DYA+D G L W
Sbjct: 385 PRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQCCDYARDHGCLLW 444

Query: 355 QNEKKRTVVVTKQGHSDIKKF 375
           ++ KK  ++V  + HS++++F
Sbjct: 445 EDAKKMRLIVRVEFHSEMREF 465


>gi|378729002|gb|EHY55461.1| transcription initiation factor TFIIH subunit H4 [Exophiala
           dermatitidis NIH/UT8656]
          Length = 508

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 141/455 (30%), Positives = 209/455 (45%), Gaps = 91/455 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
           L+ +AK +V+ L+++  PVP   +  ++  T   E + A D L                 
Sbjct: 38  LTSLAKSFVMMLVYMPAPVPVKQLELFVKDTSRGEREYAIDLLHRYHIFKDVTYNGAKAY 97

Query: 45  ---TELRRNAKVALLGGGKPWS---MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
               +  ++ + AL G G   S   ++  +  D K   +  L++YA +RWE IL +MVGS
Sbjct: 98  ALTPDFAKSLRRALTGAGDSRSFGQVATNVPEDQKV-TIAQLDEYARQRWEGILGYMVGS 156

Query: 99  -----------QQTEGISADAVRTLFNEG-----------AENPVITKDGFQFLLLETPA 136
                       Q     +  V  L   G            +   ITK+GF F+L +   
Sbjct: 157 SSILLESGTTENQASIQPSPGVIELLKAGHLIELTGTYSRGQAAKITKEGFAFVLQDINT 216

Query: 137 QVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGL 196
           Q+W  +  Y++  E   +D V+ L+FLF +     G  YST  + +     L  L  FG+
Sbjct: 217 QIWALLFLYVDNAEVFEMDKVDVLSFLFFVSSLELGLAYSTAPLDETQSRCLSDLVSFGI 276

Query: 197 VYQRKRKAGR----FYPTKLAL-------------NMATRGTLKQIREPG---------- 229
           VYQ   + G     FYPT+LA              N     +L                 
Sbjct: 277 VYQPLLEDGTPVDYFYPTRLATTLTSDSSTTLSATNTTLASSLSTKSAHSSSSSSAGAGK 336

Query: 230 -FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
            F+IVETN+R+YAYT S L++AL+ LF  L  R PNL  G +++ SV+ A+++GITA QI
Sbjct: 337 GFIIVETNYRLYAYTSSPLQIALLSLFVNLRSRHPNLVTGKMSKSSVQRAIQAGITADQI 396

Query: 289 IGFLRLHALPS-----------------VSCPLPPVVSDQIRLWEGERERLTTSEGVLYS 331
           I +L  HA P                  V   LP  + DQI LW+ ER+R+TT+ G L  
Sbjct: 397 ISYLTSHAHPQMRRHAQAEQARNADPNRVVPILPATILDQIHLWQLERDRMTTTPGFLLK 456

Query: 332 QFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
            F +QAD+E    YA ++GVL W+N+KKR   V +
Sbjct: 457 DFPNQADYEAPCRYADEIGVLVWKNDKKRMFFVNR 491


>gi|116201905|ref|XP_001226764.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
 gi|88177355|gb|EAQ84823.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
          Length = 503

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 145/449 (32%), Positives = 215/449 (47%), Gaps = 84/449 (18%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASW---------------LSKTH-------AKEHKE 39
           L  +AK +V  LL++  P+  A +  W               L   H        K+  +
Sbjct: 40  LPPLAKVFVQALLYMPSPLLLAELDVWVRAEGKMQRDRALSILRSLHIIQITPPGKDKPQ 99

Query: 40  ATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGS 98
                T  + + ++AL GGG   S      +    R D+  L+ YA ++WE IL ++V S
Sbjct: 100 EMQLTTNFKNSLRLALEGGGSHNSFGVPSTLPVAPRIDIAHLDGYARKKWEDILHYVVNS 159

Query: 99  --QQTEGISA-------DAVRTLF---------NEGAENPVITKDGFQFLLLETPAQVWY 140
              Q+EG           +V+ L          ++      IT+ GF FLL E  AQVW 
Sbjct: 160 VPAQSEGGGGVSGGGPKTSVKELLLIGRLVERRHDTQAGVGITQAGFTFLLQEANAQVWT 219

Query: 141 FILKYLETV-----ESKG---LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
            +L +LE       E+KG    D ++ L+FLF L     G  Y+T  +++  +  L HL 
Sbjct: 220 LLLLWLEAADRAKGEAKGAANTDSIDMLSFLFMLASLELGRAYNTGALTETRRNMLPHLV 279

Query: 193 EFGLVYQRKRKAGRFYPTKLA----------------LNMATR---------GTLKQIR- 226
           +FGL++  +    +++PT+LA                 N AT          GT  +   
Sbjct: 280 DFGLIFIPREDTRQYFPTRLATTLTSSASSLRSVSSGFNAATANNPGDTSSLGTTPETSA 339

Query: 227 -EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITA 285
              G +I+ETN+R+YAYT S L++A++GLF +L  RF  +  G LTRDS+R A+  GITA
Sbjct: 340 ASKGSVIIETNYRLYAYTSSPLQIAVLGLFTQLNMRFAGMATGRLTRDSIRRAIGFGITA 399

Query: 286 AQIIGFLRLH-------ALPSVSCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
            QII +L  H       A  +   P LPP V DQIRLW+ E ER+ TS G L+  F S  
Sbjct: 400 DQIISYLASHAHDQMVRAAAATGRPVLPPTVVDQIRLWQLENERMRTSAGFLFKDFDSLE 459

Query: 338 DFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
           ++  L  YA ++GVL W+++KKR    +K
Sbjct: 460 EYVSLSSYADEVGVLVWRSDKKRMFYASK 488


>gi|322711210|gb|EFZ02784.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Metarhizium anisopliae ARSEF 23]
          Length = 540

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 132/398 (33%), Positives = 201/398 (50%), Gaps = 78/398 (19%)

Query: 34  AKEHKEATDTLTELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALER 87
           +KE  +     T  + + ++AL GGG+      P ++    E+D+      FL++YA +R
Sbjct: 125 SKEKPQEMQLTTNFKNSLRLALTGGGEHNSFGVPSTLLVPPEIDTP-----FLDRYARKR 179

Query: 88  WECILRFMV---GSQQTEGISA--DAVRTLFNEG---AENPV----ITKDGFQFLLLETP 135
           WE IL F+V   G + T  +S    +V+ L   G      P     IT+ GF FLL E  
Sbjct: 180 WEDILHFVVSSVGYKSTGDVSGPNKSVKELLIAGRLVDRRPNGTIGITQAGFTFLLQEPN 239

Query: 136 AQVWYFILKYLETVESK---GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
           AQVW  +L +LE  E+    GL+ V+ L+FLF L     G  Y T  ++++ +  L  L 
Sbjct: 240 AQVWTLLLLWLEASETNKAAGLETVDMLSFLFVLASLELGRAYDTNALTEQRKNMLPSLL 299

Query: 193 EFGLVYQRKRKAGRFYPTKLALNMATRGT-LKQIRE------------------------ 227
           +FGL+Y  + K   F+PT+LA  + + G+ L+ I E                        
Sbjct: 300 DFGLIYIPQHKRSMFFPTRLATTLTSGGSSLRTISEGVAAATAAATTTAPAGGPSSQSGS 359

Query: 228 ---------------PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
                           G +IVETN+R+YAYT S L++A++ LFC+L  RFP++  G +TR
Sbjct: 360 GSGGTGGPLGPSADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPDMVAGRVTR 419

Query: 273 DSVRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIRL----WEGERE 320
            S+  A+  GI+A QII +L  HA   +       + P LPP V DQIRL    W+ E E
Sbjct: 420 TSIHQAINFGISADQIISYLAAHAHEQMHRSAALANKPILPPTVVDQIRLWQLEWQLENE 479

Query: 321 RLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
           R+ T+ G L+  F    +F     +A+++GVL W+++K
Sbjct: 480 RMKTTGGFLFRDFEDHKEFLDTARFAEEIGVLVWRSDK 517


>gi|400597688|gb|EJP65418.1| RNA polymerase II transcription factor B subunit 2 [Beauveria
           bassiana ARSEF 2860]
          Length = 507

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 136/452 (30%), Positives = 210/452 (46%), Gaps = 103/452 (22%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L ++AK +V+R+L++  P+    + SW+     ++  +   TL  L              
Sbjct: 41  LPQLAKTFVMRMLYMPNPMSLNDLDSWVKPEAKRQKDQMLSTLRSLHIVQISAPSKDKTQ 100

Query: 48  --------RRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
                   R + ++AL GGG       P +++   E+D     L FL++YA  RW+ IL 
Sbjct: 101 EVQLTANFRTSMRLALTGGGNHGSFGVPSTLAVPPEID-----LPFLDRYARRRWDEILH 155

Query: 94  FMVGSQQTEGISADAVRTLFNEGAENPV-------------------ITKDGFQFLLLET 134
           F+V S    G           +G    V                   IT+ GF FLL E 
Sbjct: 156 FVVSS---VGFKNAGGGGAAADGPNKSVRDILVAGRLVDRRPNGSIGITQAGFTFLLQEA 212

Query: 135 PAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREF 194
            AQVW  +L +L+ +++     V+ L+FLF L     G  Y T  +++  +  L  L +F
Sbjct: 213 NAQVWTLLLLWLDALDADKSAAVDMLSFLFVLASLELGRAYDTNALTEHRRDMLPALADF 272

Query: 195 GLVYQRKRKAGRFYPTKLA--LNMATRGTLKQIREP------------------------ 228
           GLVY   +K   F+PT+LA  L      +L+ I E                         
Sbjct: 273 GLVYIPPQKRSMFFPTRLATTLTSGASSSLRSISEGVVAATSAALSASSSSSTHQDQNHQ 332

Query: 229 --------------GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDS 274
                         G +++ETN+R+YAYT S L++A++ LF  L  RFP++  G +TR S
Sbjct: 333 SRAGPLGAGAGDQIGSIVIETNYRLYAYTQSTLQIAVLALFSRLTMRFPDMVAGRITRTS 392

Query: 275 VRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSE 326
           +R A++ GITA QII +L  HA   +       + P LPP V DQIRLW+ E ER+ T+ 
Sbjct: 393 IRQAIQFGITADQIIAYLAAHAHEQMHRSAALAARPVLPPTVVDQIRLWQLENERMKTTG 452

Query: 327 GVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
           G L+  F  Q +++  R +A ++GVL W ++K
Sbjct: 453 GFLFRDFDDQKEYQDTRRFADEIGVLVWSSDK 484


>gi|367040709|ref|XP_003650735.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
 gi|346997996|gb|AEO64399.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
          Length = 513

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 138/461 (29%), Positives = 218/461 (47%), Gaps = 98/461 (21%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
           L  +AK +V  LL++  P+  + +  W+ +  AK H++   ++                 
Sbjct: 40  LPPLAKIFVQALLYMPSPLLLSELDVWV-RPEAKMHRDRALSILRSLHIVQISPPGKDKP 98

Query: 45  ------TELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV 96
                 T  + + ++AL GGG    + + + L VD +  D+  L+QYA ++WE IL ++V
Sbjct: 99  QEIQLTTNFKNSLRLALEGGGTHNSFGVPSTLPVDPRI-DIAHLDQYARKKWEDILHYVV 157

Query: 97  GSQQTEG----------ISADAVRTLFNEG---AENP------VITKDGFQFLLLETPAQ 137
            S                   +V+ L   G      P       IT+ GF FLL E  AQ
Sbjct: 158 NSVPVHAEGGGGGGSGGGPKASVKDLLLAGRLVERRPDTRGGVAITQAGFTFLLQEANAQ 217

Query: 138 VWYFILKYLETVE-----------------SKG-LDLVECLTFLFQLKFSTFGTDYSTEG 179
           VW  +L +LE  +                 S+G  D ++ L+FLF L     G  Y T+ 
Sbjct: 218 VWTLLLLWLEAADHSKAAAAAAGDHAKGAASQGKPDSIDMLSFLFMLASLELGRAYDTDS 277

Query: 180 MSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGF--------- 230
           +++  +  L  L +FGL+Y  +    +++PT+LA  + +  +  +    GF         
Sbjct: 278 LTETRRNMLPALVDFGLIYIPREDTRQYFPTRLATTLTSSASALRSVSSGFSAAAANNPG 337

Query: 231 -----------------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
                            LI+ETN+R+YAYT S L++A++ LF +L  RF  +  G LTRD
Sbjct: 338 DAASLGMTPDTTPTKGSLIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMVTGRLTRD 397

Query: 274 SVRAALRSGITAAQIIGFLRLH-------ALPSVSCP-LPPVVSDQIRLWEGERERLTTS 325
           S+R A+  GITA QII +L  H       A  +   P LPP V DQIRLW+ E ER+ T+
Sbjct: 398 SIRRAISFGITADQIISYLASHAHEQMTRAAAAAGRPVLPPTVVDQIRLWQLENERMRTT 457

Query: 326 EGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
            G L+  F S  ++  L  YA+++GVL W++++KR    +K
Sbjct: 458 AGFLFKDFDSPEEYVSLSGYAEEIGVLVWKSDRKRVFFASK 498


>gi|340939354|gb|EGS19976.1| RNA polymerase II transcription factor B subunit 2-like protein
           [Chaetomium thermophilum var. thermophilum DSM 1495]
          Length = 514

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 140/459 (30%), Positives = 216/459 (47%), Gaps = 95/459 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWL---SKTH-------------------AKEHKE 39
           L  +AK +V  LL++ QP+  + +  W+   +K H                    K+  +
Sbjct: 42  LPPLAKVFVQALLYMPQPMLLSDLDVWVRPEAKMHRDRALSILRSLHIVQITPPGKDRPQ 101

Query: 40  ATDTLTELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVG 97
                T  R + ++AL GG     + + + L VD +  D+ FL+ YA ++WE IL ++V 
Sbjct: 102 EVQLTTNFRNSLRLALEGGAAHNSFGVPSSLPVDPRI-DIAFLDNYARKKWEDILHYVVS 160

Query: 98  SQQTEGISAD----------AVRTLFNEG---AENPV------ITKDGFQFLLLETPAQV 138
           S    G +            +V+ L   G      P       IT+ GF FLL E  AQV
Sbjct: 161 SVPVHGDAGPGGMGGGGPKASVKDLLLAGRLVERRPDTKTGIGITQAGFTFLLQEANAQV 220

Query: 139 WYFILKYLET-------------VESKGL-----DLVECLTFLFQLKFSTFGTDYSTEGM 180
           W  +L +LE              V+ K       D +E L+FLF L     G  Y T+ +
Sbjct: 221 WTLLLLWLEAADQAKAAAAAASGVDPKNAPPTKPDSIEMLSFLFMLASLELGRAYDTDAL 280

Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGF---------- 230
           S+  +  L  L +FGL+Y  +    +++PT+LA  + +  +  +    GF          
Sbjct: 281 SETRRNMLPALVDFGLIYIPREDTRQYFPTRLATTLTSSASALRSVSSGFTAATNNTAND 340

Query: 231 ---------------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
                          +I+ETN+R+YAYT S L++A++ LF  L  RF  +  G LTR+S+
Sbjct: 341 ASSLGGADPSAHKGSIIIETNYRLYAYTSSPLQIAVLALFTHLNMRFAGMVTGRLTRESI 400

Query: 276 RAALRSGITAAQIIGFLRLH-------ALPSVSCP-LPPVVSDQIRLWEGERERLTTSEG 327
           R A+  GITA QII +L  H       A  +   P LPP V DQIRLW+ E ER+ TS G
Sbjct: 401 RRAISFGITADQIISYLASHAHEQMVRAAAAAGRPVLPPTVVDQIRLWQLENERMRTSPG 460

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
            L+  F +  ++  L  YA+++GVL W++++KR    +K
Sbjct: 461 FLFKDFENVEEYMALAGYAEEIGVLVWRSDRKRMFFASK 499


>gi|367030175|ref|XP_003664371.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
           42464]
 gi|347011641|gb|AEO59126.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
           42464]
          Length = 512

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 128/405 (31%), Positives = 196/405 (48%), Gaps = 73/405 (18%)

Query: 34  AKEHKEATDTLTELRRNAKVALLGGG--KPWSMSAKLEVDSKARDLDFLNQYALERWECI 91
            K+  +     T  R + ++AL GGG    + + + L VD +  D+ +L+ YA  +WE I
Sbjct: 94  GKDKPQEIQLTTNFRNSLRLALEGGGTHNSFGVPSTLPVDPRI-DIAYLDSYARRKWEDI 152

Query: 92  LRFMVGSQQTEGISAD---------AVRTLF---------NEGAENPVITKDGFQFLLLE 133
           L ++V S   +G             +V+ L          N+      IT+ GF FLL E
Sbjct: 153 LHYVVNSVPAQGEGGGGGSVGGPKASVKDLLLIGRLVERRNDTKAGVGITQAGFTFLLQE 212

Query: 134 TPAQVWYFILKYLETVE-----------------SKGLDLVECLTFLFQLKFSTFGTDYS 176
             AQVW  +L +LE  +                 +   D ++ L+FLF L     G  Y 
Sbjct: 213 ANAQVWTLLLLWLEAADRAKGAAEQGKGGGAAAAAAKADSIDMLSFLFMLASLELGRAYD 272

Query: 177 TEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGF------ 230
           T+ +S+  +  L  L +FGL+Y  +    +++PT+LA  + +  +  +    GF      
Sbjct: 273 TDALSETRRNMLPALVDFGLIYIPREDTRQYFPTRLATTLTSSASSLRSVSSGFSAAAAN 332

Query: 231 ---------------------LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
                                +I+ETN+R+YAYT S L++A++ LF +L  RF  +  G 
Sbjct: 333 NPGDASSLGTTPETSAASKGSVIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMATGR 392

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLH-------ALPSVSCP-LPPVVSDQIRLWEGERER 321
           LTRDS+R A+  GITA QII +L  H       A  +   P LPP V DQIRLW+ E ER
Sbjct: 393 LTRDSIRRAIGFGITADQIISYLASHAHEQMVRAAAATGRPVLPPTVVDQIRLWQLENER 452

Query: 322 LTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTK 366
           + TS G L+  F S  ++  L  YA+++GVL W+++KKR    +K
Sbjct: 453 MRTSAGFLFKDFDSLDEYLSLSAYAEEIGVLVWKSDKKRMFYASK 497


>gi|440800489|gb|ELR21525.1| transcription factor tfb2 subfamily protein [Acanthamoeba
           castellanii str. Neff]
          Length = 465

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 104/303 (34%), Positives = 181/303 (59%), Gaps = 13/303 (4%)

Query: 88  WECILRFMVGSQQTEGISADAVRTLFNEGA--------ENPVITKDGFQFLLLETPAQVW 139
           W+ I  F++GS   +  S D V +L   G         ++  I   GF FLL +   QVW
Sbjct: 152 WDTIFLFIMGSTVIQPPS-DRVVSLLTRGEFMVVHEEDQSIRIADKGFPFLLKDLRTQVW 210

Query: 140 YFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
             +L YL +++ +  ++ + L+FLF+L F T G  Y  + ++      LQ L++ G++Y+
Sbjct: 211 TLLLLYLRSLQEEKANVHDVLSFLFRLSFLTVGEGYQMDDLAFSESGLLQDLQDLGIIYR 270

Query: 200 RKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
           + + +   YPT+LA+ +++    K+ +E G++IV T++R+YAYT S +K+ L+ LF ++ 
Sbjct: 271 KHKDSKWLYPTQLAIGLSSTEAAKRDQE-GWIIVGTDYRIYAYTSSPVKLLLLSLFTQIE 329

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--SCP-LPPVVSDQIRLWE 316
           Y+ PN+ +G+L R+++R A++ GI+A QI+ FL  +A P +  + P +P  ++DQ+RLWE
Sbjct: 330 YQLPNMVMGILLRENIRQAVQVGISANQILQFLETNAHPQMKQNTPIIPESIADQLRLWE 389

Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
            E  RL+ S G  Y  F S A F+K   YA+D+G L + +  KR + V++ GH  ++++ 
Sbjct: 390 AEDRRLSLSSGYFYDDFASLAAFKKAEKYARDVGALIYSDATKRFLFVSEPGHQLLRRYV 449

Query: 377 VNH 379
             H
Sbjct: 450 KQH 452


>gi|440791255|gb|ELR12502.1| TFIIH basal transcription factor complex subunit 2, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 304

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 97/261 (37%), Positives = 165/261 (63%), Gaps = 4/261 (1%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
           I   GF FLL +   QVW  +L YL +++ +  ++ + L+FLF+L F T G  Y  + ++
Sbjct: 34  IADKGFPFLLKDLRTQVWILLLLYLRSLQEEKANVHDVLSFLFRLSFLTVGEGYQMDDLA 93

Query: 182 DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYA 241
                 LQ L++ G++Y++ R +   YPT+LA+ +++    K+ +E G++IV T++R+YA
Sbjct: 94  FSESGLLQDLQDLGIIYRKHRDSKWLYPTQLAIGLSSTEAAKRDQE-GWIIVGTDYRIYA 152

Query: 242 YTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV- 300
           YT S +K+ L+ LF ++ Y+ PN+ +G+L R+++R A++ GI+A QI+ FL  +A P + 
Sbjct: 153 YTSSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISANQILQFLETNAHPQMK 212

Query: 301 -SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEK 358
            + P +P  ++DQ+RLWE E  RL+ S G  Y  F S A F+K   YA+D+G L + +  
Sbjct: 213 QNTPIIPESIADQLRLWEAEDRRLSLSPGYFYDDFASLAAFKKAEKYARDVGALLYSDAA 272

Query: 359 KRTVVVTKQGHSDIKKFWVNH 379
           KR + VT+QGH  ++++   H
Sbjct: 273 KRFLFVTEQGHQLLRRYVKQH 293


>gi|320587960|gb|EFX00435.1| tfiih and nucleotide excision repair factor 3 complexes subunit
           [Grosmannia clavigera kw1407]
          Length = 518

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 143/480 (29%), Positives = 223/480 (46%), Gaps = 100/480 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL-------------- 47
           L  +AK +V  LL+++ P+    + + +     +E  +A   L  L              
Sbjct: 41  LPSLAKSFVRMLLYMDGPLLLDTLDTMVVPEAKRERDQALSILRALHIVQITTAMRDRPQ 100

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKAR-DLDFLNQYALERWECILRFMVGS 98
                   + + ++AL GGG   S     E    A  D  FL+++A  +WE IL ++V S
Sbjct: 101 EIMLTANFKNSFRLALEGGGSQNSFGVPSEKPVSAETDTAFLDRFARRKWEDILHYVVNS 160

Query: 99  --------QQTEGISADAVRTLFNEG---AENPV------ITKDGFQFLLLETPAQVWYF 141
                     ++G   + V+ L   G      P       IT+ GF FLL    AQVW  
Sbjct: 161 VGLPSGPGHDSQG-PKNTVKELLLAGHLVESRPGRPGGVGITQTGFTFLLQAANAQVWTL 219

Query: 142 ILKYLETVESK-----GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGL 196
           +L++LE V+ +      +D V+ L+FLF L     G  Y T  +++  +  L  L +FGL
Sbjct: 220 LLQWLEAVDQQRQAGGAVDSVDMLSFLFMLSTLELGRAYDTNALTEARRNMLPSLMDFGL 279

Query: 197 VY-QRKRKAGRFYPTKLALNMATRGT--LKQIREP------------------------- 228
           +Y     +A +++PT+LA  + + G   L+ +                            
Sbjct: 280 IYIPPHDRASQYFPTRLATTLTSSGAAALRSLSTGFAAASANSTAAAGLPGTGTGSNNNH 339

Query: 229 ----------------GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTR 272
                           G +I+ETNFR+YAYT S L++A++ LF +L  RFP +  G LTR
Sbjct: 340 VGAASTANSAATTEGRGSVILETNFRLYAYTASPLQIAVLALFSKLSQRFPGMVAGKLTR 399

Query: 273 DSVRAALRSGITAAQIIGFL------RLHALPSVS--CPLPPVVSDQIRLWEGERERLTT 324
           DSVR A+  GIT+ QII +L      ++H   + +    LPP V DQIRLWE E ER+ +
Sbjct: 400 DSVRRAISYGITSDQIISYLAANAHEQMHKYAAANHRPVLPPTVVDQIRLWELENERMKS 459

Query: 325 SEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGSQ 384
            +G L+  F S  +++ L  YA ++GVL ++N+KKR   VTK  H  +  +    +K  Q
Sbjct: 460 HKGFLFKDFDSAKEYDMLAQYADEIGVLVYRNDKKRHFFVTK--HEQLVTYLKARKKADQ 517


>gi|85857462|gb|ABC86267.1| RE43632p [Drosophila melanogaster]
          Length = 280

 Score =  184 bits (466), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 94/204 (46%), Positives = 127/204 (62%), Gaps = 32/204 (15%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ ++IR+LFV+QPVPQAV+ SW ++  AKE  EAT  LT L             
Sbjct: 70  ELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLS 129

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   +++ +  LLGGGKPW M+  L+ DSK RD+ FL+ YA+ RW C+L +MVG+ 
Sbjct: 130 AWELSPTFKKSVRQVLLGGGKPWPMTNTLDKDSKPRDIAFLDTYAMSRWRCVLHYMVGTG 189

Query: 99  ----QQTEGISADAVRTLFN------EGAENPVITKDGFQFLLLETPAQVWYFILKYLET 148
                  E IS DAVR L +      +  +   IT+ GFQFLLL+T AQVW+F+L+YL+T
Sbjct: 190 NRNGTDAEAISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDT 249

Query: 149 VESKGLDLVECLTFLFQLKFSTFG 172
            E +G+ L ECL+ LFQL FST+ 
Sbjct: 250 CEERGISLPECLSMLFQLSFSTWA 273


>gi|310799661|gb|EFQ34554.1| transcription factor tfb2 [Glomerella graminicola M1.001]
          Length = 485

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 135/443 (30%), Positives = 215/443 (48%), Gaps = 76/443 (17%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
           +  +AK  V+ +L++ +P+    +  W+     ++  +A  TL                 
Sbjct: 41  IPPLAKTIVMAILYMPKPMLLEDLDVWVKPESRRQKDQAISTLRSLHILQITVPSKERPQ 100

Query: 45  -----TELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
                T  + + ++AL GGG       P S+ A  E+      + FL++YA  +WE IL 
Sbjct: 101 EMQLTTNFKNSLRLALEGGGTHNSFGVPSSLPAPPEIT-----VPFLDRYARRKWEDILH 155

Query: 94  FMVGSQQTEGIS----ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
           ++V +    G        +V+ L   G           IT+ GF FLL E  AQVW  +L
Sbjct: 156 YIVNTVNPGGADLGGPKSSVKNLLVAGQLVRRQGSAVGITQAGFTFLLQEANAQVWTLLL 215

Query: 144 KYLETVE----SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
            +LE  +    + G++  + L+FLF L     G  Y T  +++  +  L  L +FGL+Y 
Sbjct: 216 LWLEATDHAEDAAGMESTDMLSFLFLLASLELGRPYDTNALTEARRNMLPSLLDFGLIYI 275

Query: 200 RKRKAGRFYPTKLALNMA-------------------TRGTLKQIREPGFLIVETNFRVY 240
              K  +++PT+LA  +                    +  +   IR   F + ETN+RVY
Sbjct: 276 PSHKPQQYFPTRLATTLTSSSSALSYAHCRDSETFFQSYASHLMIRVRPFRLSETNYRVY 335

Query: 241 AYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
           AY  + L++A++ LFC+L  RF ++  G LTR+S+R A+  GITA QII +L  HA   +
Sbjct: 336 AYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITADQIISYLAAHAHEQM 395

Query: 301 -------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
                  S P LPP V DQIRLW+ E ER+TT+ G L+  F S  +++ +  YA ++GVL
Sbjct: 396 HRMAAARSRPILPPTVIDQIRLWQLETERMTTTSGFLFRDFDSPREYDDIAGYAAEIGVL 455

Query: 353 TWQNEKKRTVVVTKQGHSDIKKF 375
            W+N+K      +K  H  I+ +
Sbjct: 456 VWRNDKLGMFFASK--HEQIRDY 476


>gi|398406290|ref|XP_003854611.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
 gi|339474494|gb|EGP89587.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
          Length = 493

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 131/430 (30%), Positives = 208/430 (48%), Gaps = 67/430 (15%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKT--HAKEHKEATDTLTELR----------- 48
           L  +AK  V+ +L++  P+  A + +W       ++E + +   L  L+           
Sbjct: 39  LPHLAKTVVMSMLYLPGPMRAADLEAWFKSDVQSSQERERSVSVLVRLKVLVEEQAGGGA 98

Query: 49  -----------RNAKVALLG-------GGKPWSMSAKLEVDSKARDLDFLNQYALERWEC 90
                      +  + AL G       G   + ++   E D    ++ FL+ +A  +WE 
Sbjct: 99  GLQYRLSEAFAKGLRCALTGAVGKGDAGKGSFGVACSAEKDEDV-NVKFLDDFARRQWEA 157

Query: 91  ILRFMVGSQQTEGISAD-----AVRTLFNEG--------AENPVITKDGFQFLLLETPAQ 137
           IL ++VGS  + G+S D       + L  +G         +  +IT+ GF FLL E  AQ
Sbjct: 158 ILYYVVGSANS-GLSGDLEISAGTKRLLQQGNFVQIKSGGKQRLITQAGFTFLLEEVNAQ 216

Query: 138 VWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV 197
           VW   + YL   E   +D V+ L+FLF L     G  YST  ++   Q  L  L++FGL+
Sbjct: 217 VWSLAVVYLRVCEDLQMDPVDVLSFLFTLGSLELGIPYSTSNLTLTQQHMLDDLQDFGLI 276

Query: 198 YQRKRKAGRFYPTKLALNMATRG-------------TLKQIREPGFLIVETNFRVYAYTD 244
           Y+    A  +YPT+LA  + +                  +  E G++I+ETN+R+YAYT 
Sbjct: 277 YRSSPTATAYYPTRLATTLTSDAPALPNNTSSTTTTAATESNEKGYIILETNYRLYAYTS 336

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS----- 299
           S L+++++ LF  L  R+PNL    +T+ S+ +A+   IT+ QII +L  HA P      
Sbjct: 337 SPLQISILSLFSTLHTRYPNLITAKITKSSIASAISQSITSDQIITYLSTHAHPVLRRQA 396

Query: 300 --VSCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
             +S P LPP V DQIRLW+ E ER+TT +G          ++ K  +YA+ LGVL  + 
Sbjct: 397 AIMSAPILPPTVVDQIRLWQIEGERMTTWKGFYIRDVGGWEEYVKAVEYAEALGVLRQRF 456

Query: 357 EKKRTVVVTK 366
           + KR+  V +
Sbjct: 457 DAKRSFFVGR 466


>gi|298705308|emb|CBJ48998.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 534

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 129/415 (31%), Positives = 196/415 (47%), Gaps = 61/415 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L ++AK +V+RLLFVE PV + ++ SW+ K + + H  +   L  LR             
Sbjct: 66  LPQLAKQHVMRLLFVEGPVGKGMLKSWVKKEYQRVHAASVRKLVSLRVLLPSAGGQEYQL 125

Query: 49  -----RNAKVALLGGGK-PWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
                 N + AL    K PW     L  D     +  + +    +W+ +L F+VGS + E
Sbjct: 126 NPPFRENLQRALCASDKTPWQGDVPLSADRSPPTVAKIERQMHSQWQDVLYFLVGSDE-E 184

Query: 103 GIS---ADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETV-ES 151
           G S   A  V+ +   G   PV       IT  G++F+L E   Q W  +   +     +
Sbjct: 185 GRSTPPARVVKFMEESGLMRPVRGRGALRITDKGYEFMLKEAHVQAWMVVHALINGYGRT 244

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTK 211
           +     E L FLFQL +   G  Y    ++   +   Q   E GL+++RK K+ RFYPT 
Sbjct: 245 QPGCRDELLAFLFQLSYCKVGDAYPLGALTQTQRDLAQDFVELGLLFKRKAKSTRFYPTS 304

Query: 212 LALNM-------------ATRGTLKQIREPGFL-------------IVETNFRVYAYTDS 245
           +A+N+                GT ++ +  G L             IVETNF+V AYT S
Sbjct: 305 IAVNLIFGSSPSGDAGGAGGGGTQRKPQPAGGLGGGKREDDTSIHIIVETNFQVIAYTRS 364

Query: 246 NLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---SC 302
            L  A++ LF EL    PN  VG +TR+S+R AL +GI   Q++ FL+ HA P V   + 
Sbjct: 365 KLHFAMLSLFLELRALLPNAIVGAITRESMRKALSTGIKGRQVLDFLKWHAHPVVRRRTP 424

Query: 303 PLPPVVSDQIRLWEGERERLTTSEGVLYS-QFVSQADFEKLRDYAQDLGVLTWQN 356
            +P  ++DQ+ LWE ER+R+   +GVL    + S+  F  + ++A     L W +
Sbjct: 425 VVPENIADQVLLWERERDRMEHRDGVLVDVSYASRDAFRGMTEFANAKQGLLWSS 479


>gi|76154655|gb|AAX26095.2| SJCHGC05044 protein [Schistosoma japonicum]
          Length = 214

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 85/158 (53%), Positives = 114/158 (72%), Gaps = 4/158 (2%)

Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
           G++++ETNFR+YAYTDS L+ AL+ LF ++  RFPNL V  +TRDSVR AL  GITA QI
Sbjct: 58  GYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITANQI 117

Query: 289 IGFLRLHALPSV--SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDY 345
           + FL  +A P +  S P LPP ++DQIRLWE ER+R    EG LY  F    DFE +RDY
Sbjct: 118 LSFLTSNAHPDMLLSNPILPPTLTDQIRLWELERDRFVFQEGCLYEHFSRNTDFELVRDY 177

Query: 346 AQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKGS 383
           A+++GVL W+N ++R +VV+K GH D++KFW  H++ S
Sbjct: 178 AKNIGVLLWENPERRLMVVSKSGHEDVRKFW-KHKRPS 214


>gi|365758016|gb|EHM99881.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 419

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 114/371 (30%), Positives = 182/371 (49%), Gaps = 73/371 (19%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
           L  +AK +++ ++F E  VP   +  W++     + ++A  ++                 
Sbjct: 39  LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQDAIKSMKSLHLLIPDKSSGTLMI 98

Query: 46  ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
            L    K++L   L GG+  + S  + V+     LD L++Y+  +WE IL FMVG+   +
Sbjct: 99  NLNPTFKISLRNALTGGEVQN-SFGVVVEDNVISLDLLDEYSANKWETILHFMVGTPLAK 157

Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
             S   +  L +      V       IT +GFQFLL E  +Q+W  +L+YL+ +E+  +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMD 217

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           LV+ L F+F L     G  Y  + +S+  ++ LQ +R++GLV+Q+      FYPTKLAL 
Sbjct: 218 LVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGLVFQKHLNDSIFYPTKLALM 277

Query: 216 M------------ATRGTLKQIRE----------------------------------PG 229
           +            A    L+Q RE                                   G
Sbjct: 278 LTSDTKTIISASNAMDSVLRQNREEPSVNEDGANGKSTTDTTASDDLNKAGSKNQDIPDG 337

Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
            LIVETNF++Y+Y++S L++A++ LF  L  RF N+ +G +TR+S+R AL +GITA QII
Sbjct: 338 SLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRNALTNGITADQII 397

Query: 290 GFLRLHALPSV 300
            +L  HA P +
Sbjct: 398 AYLETHAHPQM 408


>gi|158302621|ref|XP_561433.2| Anopheles gambiae str. PEST AGAP012880-PA [Anopheles gambiae str.
           PEST]
 gi|157021114|gb|EAL42401.2| AGAP012880-PA [Anopheles gambiae str. PEST]
          Length = 244

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 87/180 (48%), Positives = 119/180 (66%), Gaps = 33/180 (18%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL EIA+ +VIR+LFVEQP+PQAV++SW ++ +AKE+   +  LTEL             
Sbjct: 65  ELPEIARQFVIRILFVEQPIPQAVVSSWATQVYAKENTSVSQVLTELGVWRSAAYPGGLA 124

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGS- 98
                   ++N K+ALLGGG+PWSMS  L+ D K+RD+DFL+ YA+ RW C+L +MVG+ 
Sbjct: 125 AWELCPTFKKNLKIALLGGGRPWSMSNALDPDQKSRDIDFLDTYAMSRWRCVLHYMVGAG 184

Query: 99  ----QQTEGISADAVRTLF-------NEGAENPVITKDGFQFLLLETPAQVWYFILKYLE 147
                + EGIS DAVR L        +E   +PVIT+ GFQFLLL+T AQVW+F+L+YL+
Sbjct: 185 SSKGMEGEGISPDAVRILLHANLMKRDESDGSPVITRQGFQFLLLDTQAQVWHFMLQYLD 244


>gi|401887601|gb|EJT51582.1| hypothetical protein A1Q1_07170 [Trichosporon asahii var. asahii
           CBS 2479]
 gi|406699646|gb|EKD02845.1| hypothetical protein A1Q2_02789 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 451

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 176/356 (49%), Gaps = 41/356 (11%)

Query: 33  HAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECIL 92
           H K      DT     RNA   L G    + +  K +    A   + L  Y    WE IL
Sbjct: 62  HQKFQWSLNDTFKRGMRNALTGL-GTSNSFGVPYKRQARDPAPTAEELIDYGENTWESIL 120

Query: 93  RFMVGS---QQTEGISADAVRTL----------FNEGAENP-----VITKDGFQFLLLET 134
           ++MV S    +  G   ++V +L          F++   NP      IT  GFQFLL + 
Sbjct: 121 KYMVSSGLGARVTGRPRNSVLSLLHASGLMVDQFDDTGRNPNYDRLTITSQGFQFLLEDR 180

Query: 135 PAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVF------L 188
             Q+W  ++ YL   + +G    + L+  F L     G DYS    S   Q F      L
Sbjct: 181 QTQLWQILMFYLARKQRRGEAAADILSLFFSLGCMQLGQDYSA---SKSFQNFPQGQQCL 237

Query: 189 QHLREFGLVYQRK----RKAGRFYPTKLALNMATRGTL----KQIREPGFLIVETNFRVY 240
             L ++G +Y+R+    RK  +F+PT LA ++ +  T         +  FLI+ETN+RVY
Sbjct: 238 DDLEQYGFIYRRQEVDGRKTDQFFPTHLATSLCSGDTSVNRSATADDKRFLILETNYRVY 297

Query: 241 AYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
           AYT + L++A++ LF  +  R+PNL VG L R  V+AA+  GI+A QII +L  HA P +
Sbjct: 298 AYTSNELEIAILNLFVNITIRYPNLIVGHLDRRHVKAAMDKGISAYQIISYLTTHAHPQM 357

Query: 301 SCPLPPV----VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
               PP+    V+DQ+ LW+ ER R+   + V++ +F S+  FE  RD A+ +G L
Sbjct: 358 YNHPPPLLHPSVTDQLHLWDRERNRVQAQDTVMF-EFYSKELFETCRDEAERMGAL 412


>gi|241654873|ref|XP_002411342.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
 gi|215503972|gb|EEC13466.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
          Length = 218

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 82/176 (46%), Positives = 115/176 (65%), Gaps = 29/176 (16%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  I++HY++RL+FV+QPVPQAV++SW  + + KEH E+ + LT L             
Sbjct: 43  ELPIISRHYIMRLMFVDQPVPQAVVSSWNEQKYVKEHLESLEALTALHIWADSSLPGGLP 102

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   R+N ++ALLGGG+PW++ + LE D   RD  FL++YA+ERWEC+L FMVG  
Sbjct: 103 GWSLSGVFRKNIQIALLGGGQPWAVYSTLEKDKHGRDAQFLDRYAMERWECVLHFMVGCH 162

Query: 100 QTEGISADAVRTLFNEG--------AENPVITKDGFQFLLLETPAQVWYFILKYLE 147
             EGISADAVR L + G           P+IT +GFQFLL++T +QVW+F+L+YL+
Sbjct: 163 TKEGISADAVRILLHAGLMKSEEEEGSAPLITMEGFQFLLMDTASQVWHFVLQYLD 218


>gi|452824378|gb|EME31381.1| transcription initiation factor TFIIH subunit H4 [Galdieria
           sulphuraria]
          Length = 740

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 107/299 (35%), Positives = 155/299 (51%), Gaps = 34/299 (11%)

Query: 76  DLDFLNQYALERWECILRFMVGSQQTEG-ISADAVRTLFNEGAENPV-----ITKDGFQF 129
           D+ FL+QY+   W  IL+F+VG +  E   S   +  L   G    V     IT  GFQF
Sbjct: 141 DVTFLSQYSRHCWNKILQFLVGLEPLESEPSVRIIEALVESGLIEQVEDGFRITTVGFQF 200

Query: 130 LLLETPAQVWYFILKYLETVESK---GLDLV------ECLTFLFQLKFSTFGTDYSTEGM 180
           LL +T  QVW+ +   L + E+    G  L       E + FLF+L F   G  Y  + +
Sbjct: 201 LLKDTREQVWFILENILFSNETNLKIGTKLQKRVMNEEMVEFLFELSFCASGVAYHWKTL 260

Query: 181 SDKLQVFLQHLREFGLVY----QRKRKAGRFYPTKLALNMATRGTL-------------- 222
           S   Q+++ +L + GL+Y        K   FY T L + + TR                 
Sbjct: 261 SKTQQIYIPYLADLGLIYLHISSSLEKDSYFYVTPLGVTL-TRSICWLGDKNNYLLLEDW 319

Query: 223 KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSG 282
             +     +IV+TNFRVY YT+   +++L+ LF + +YR PN+ VGV+TRDS+R AL +G
Sbjct: 320 SNVDNDCRMIVQTNFRVYVYTNCTFQISLLSLFIQFLYRLPNMAVGVITRDSIRTALNNG 379

Query: 283 ITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEK 341
           ITA Q+I +L+ H  P++   LP  + DQIRLWE +R R+TT   +L   F +   FEK
Sbjct: 380 ITAQQMISYLQNHMHPNMKGKLPITIIDQIRLWEAQRFRVTTKHALLLDHFDNMTCFEK 438


>gi|255079788|ref|XP_002503474.1| predicted protein [Micromonas sp. RCC299]
 gi|226518741|gb|ACO64732.1| predicted protein [Micromonas sp. RCC299]
          Length = 458

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 127/426 (29%), Positives = 194/426 (45%), Gaps = 62/426 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWL--SKTHAKEHKEATDTLTELRRNAKVALLGGG 59
           L  +AKHYV+RLL+V++ + +  + +W+   K H   H  +   L  LR    V +  G 
Sbjct: 34  LPSLAKHYVMRLLYVDEGIAREEMDAWVRPGKEHRDRHARSMLALKRLR----VLVPAGD 89

Query: 60  KPWSMSAKL-------------------------EVDSKARDLDFLN----------QYA 84
           + +    K                          EV++     D +           ++A
Sbjct: 90  QEYVADGKELVRLNRRFVKGVRAQIETCFAPEGDEVEATVAPDDAMGGKRPSPEKIEEFA 149

Query: 85  LERWECILRFMVGSQQTEGISADAVRTLFNEG--AENPV------ITKDGFQFLLLETPA 136
             RWE +L  + G  + +G   D        G  AE         IT+ GF+FLL     
Sbjct: 150 KGRWEALLMTLTGRDRRKGPGLDVAALFRGAGLVAEKSNKSNGWGITEKGFRFLLSTARE 209

Query: 137 QVWYFILKYLETVESKG---LDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLRE 193
           Q+W  + +Y+    + G   L     + F+ +L F   G  Y  + +    +   + L  
Sbjct: 210 QIWTLLTEYVRQYSAPGDRTLVAPAVIGFMLRLTFQAVGQPYRVDDLPSAQRAIAEDLAH 269

Query: 194 FGLVY--QRKRKAGRFYPTKL-ALNMATRGTLKQIREPG-FLIVETNFRVYAYTDSNLKV 249
            GL+Y      K G + PT+L A      G      +PG  +I ETNFRVYAYT S+++ 
Sbjct: 270 LGLLYLFAGPGKEGYYVPTQLTAGKDVADGDASLGGDPGGHIIAETNFRVYAYTFSDVEC 329

Query: 250 ALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-----SCPL 304
            ++ LF    YR PNL VG+LTR++V  AL +G+ A QII +++ HA P+         +
Sbjct: 330 EILRLFTRPDYRLPNLYVGMLTREAVHEALDTGVAAEQIIKYIKSHAHPNARKTTNGSGI 389

Query: 305 PPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD-FEKLRDYAQDLGVLTWQNEKKRTVV 363
           PP V+DQI LW  ER R+ ++E VLY  F +  D +      A D GVL W+N ++  + 
Sbjct: 390 PPNVADQIMLWAMERRRVRSAECVLYCDFPTGTDEYAAAVKAASDAGVLLWENREQMKLA 449

Query: 364 VTKQGH 369
           V K GH
Sbjct: 450 VAKSGH 455


>gi|321265552|ref|XP_003197492.1| subunit of TFIIH and nucleotide excision repair factor 3 complexes
           [Cryptococcus gattii WM276]
 gi|317463972|gb|ADV25705.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes,
           putative [Cryptococcus gattii WM276]
          Length = 481

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 119/380 (31%), Positives = 179/380 (47%), Gaps = 36/380 (9%)

Query: 29  LSKTHAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDF-----LNQY 83
           L   H K+ K         +R  + AL G G   S     E   +A DLD      L  Y
Sbjct: 93  LHPMHYKKQKMQWSLNDSFKRGLRNALTGLGASNSFGVPFE-RHQATDLDLPSKDELVAY 151

Query: 84  ALERWECILRFMVGS----------QQTEGISADAVRTLFNEGAE----NP-----VITK 124
             E +E IL++MV S           Q E +       L  +  +    NP      IT 
Sbjct: 152 GEETFESILKYMVSSGLGTEFSGSRPQPEVLQLLHASGLMTDPGDMSRRNPNIYRLTITS 211

Query: 125 DGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKL 184
            GFQFLL +   Q+W  ++ YL   E+      E L+  F L     G DYS        
Sbjct: 212 KGFQFLLEKRQTQLWEILMYYLSAKEANSERSSEVLSMFFSLGCMQLGQDYSASNSFPHA 271

Query: 185 QVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-----EPGFLIVETNFRV 239
           Q  L  L ++G +Y+    + +F+PT LA ++ + G    I+     +  FLI+ETN+++
Sbjct: 272 QEALNDLAQYGFIYKPSPDSDQFWPTHLATSLCS-GDASAIQSQSADDKRFLILETNYKI 330

Query: 240 YAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS 299
           YAYT + L++A++ LF ++  R+PNL VG L R  V+AA+  GI+A QII +L  HA P 
Sbjct: 331 YAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSHAHPQ 390

Query: 300 V-SCPLP---PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
           + + P P   P + DQ+ LW+ ER RL T E V+Y +F S+  F+   + A+    L   
Sbjct: 391 MYNSPPPLLHPTIVDQLHLWDRERNRLQTEETVMY-EFFSKELFDDTVNEAKANAALQHA 449

Query: 356 NEKKRTVVVTKQGHSDIKKF 375
              ++ + +       I +F
Sbjct: 450 ATSQKLLFIEPHTKPAITEF 469


>gi|134118870|ref|XP_771938.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50254542|gb|EAL17291.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 481

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 116/384 (30%), Positives = 181/384 (47%), Gaps = 44/384 (11%)

Query: 29  LSKTHAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQ-----Y 83
           L   H K+ +         ++  + AL G G   S     E   +A DLD  ++     Y
Sbjct: 93  LHPMHYKKQRMQWSLNDSFKKGLRNALTGLGTSNSFGVPFE-RHQATDLDLPSEDELVAY 151

Query: 84  ALERWECILRFMVGSQ------------------QTEGISADAVRTLFNEGAENP----- 120
             + +E IL++MV S                    T G+  D+V    +    NP     
Sbjct: 152 GEDTFESILKYMVSSGLGTEFSGSRPQPEVLQLLHTSGLMTDSV----DMNGRNPNINRL 207

Query: 121 VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM 180
            IT  GFQFLL +   Q+W  ++ YL   E+      E L+  F L     G DYS    
Sbjct: 208 TITSKGFQFLLEKRQTQLWEILMYYLSAKEANSERSSEVLSMFFSLGCMQLGQDYSASNS 267

Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-----EPGFLIVET 235
               Q  L  L ++G +Y+    + +F+PT LA ++ + G    I+     +  FLI+ET
Sbjct: 268 FPGAQEALDDLAQYGFIYKSSPDSDQFWPTHLATSLCS-GDASAIQSQSADDKRFLILET 326

Query: 236 NFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
           N+++YAYT + L++A++ LF ++  R+PNL VG L R  V+AA+  GI+A QII +L  H
Sbjct: 327 NYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSH 386

Query: 296 ALPSV-SCPLP---PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
           A P + + P P   P + DQ+ LW+ ER RL T E V+Y +F S+  F+   + A+    
Sbjct: 387 AHPQMYNSPPPLLHPTIVDQLYLWDRERNRLQTEETVMY-EFFSKELFDDTVNEAKANAA 445

Query: 352 LTWQNEKKRTVVVTKQGHSDIKKF 375
           L      ++ + +       I +F
Sbjct: 446 LQHAATSQKLLFIEPHTKPAITEF 469


>gi|58262306|ref|XP_568563.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|57230737|gb|AAW47046.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 481

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 112/349 (32%), Positives = 170/349 (48%), Gaps = 44/349 (12%)

Query: 29  LSKTHAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQ-----Y 83
           L   H K+ +         ++  + AL G G   S     E   +A DLD  ++     Y
Sbjct: 93  LHPMHYKKQRMQWSLNDSFKKGLRNALTGLGTSNSFGVPFE-RHQATDLDLPSEDELVAY 151

Query: 84  ALERWECILRFMVGSQ------------------QTEGISADAVRTLFNEGAENP----- 120
             + +E IL++MV S                    T G+  D+V    +    NP     
Sbjct: 152 GEDTFESILKYMVSSGLGTEFSGSRPQPEVLQLLHTSGLMTDSV----DMNGRNPNINRL 207

Query: 121 VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM 180
            IT  GFQFLL +   Q+W  ++ YL   E+      E L+  F L     G DYS    
Sbjct: 208 TITSKGFQFLLEKRQTQLWEILMYYLSAKEANSERSSEVLSMFFSLGCMQLGQDYSASNS 267

Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-----EPGFLIVET 235
               Q  L  L ++G +Y+    + +F+PT LA ++ + G    I+     +  FLI+ET
Sbjct: 268 FPGAQEALDDLAQYGFIYKSSPDSDQFWPTHLATSLCS-GDASAIQSQSADDKRFLILET 326

Query: 236 NFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLH 295
           N+++YAYT + L++A++ LF ++  R+PNL VG L R  V+AA+  GI+A QII +L  H
Sbjct: 327 NYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISARQIIAYLSSH 386

Query: 296 ALPSV-SCPLP---PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
           A P + + P P   P + DQ+ LW+ ER RL T E V+Y +F S+  F+
Sbjct: 387 AHPQMYNSPPPLLHPTIVDQLYLWDRERNRLQTEETVMY-EFFSKELFD 434


>gi|405123868|gb|AFR98631.1| transcription factor TFIIH complex subunit Tfb2 [Cryptococcus
           neoformans var. grubii H99]
          Length = 481

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 116/380 (30%), Positives = 180/380 (47%), Gaps = 36/380 (9%)

Query: 29  LSKTHAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQ-----Y 83
           L   H K+ K         ++  + AL G G   S     E   +A DLD  ++     Y
Sbjct: 93  LHPMHYKKQKMQWSMNDSFKKGLRNALTGLGTSNSFGVPFE-RHQATDLDLPSEDELVAY 151

Query: 84  ALERWECILRFMVGS----------QQTEGISADAVRTLFNEGAE----NP-----VITK 124
             + +E IL++MV S           Q E +       L  +  +    NP      IT 
Sbjct: 152 GEDTFESILKYMVSSGLGTEFSGSRPQPEVLQLLHASGLMTDPGDMTGRNPNINRLTITS 211

Query: 125 DGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKL 184
            GFQFLL +   Q+W  ++ YL   E+      E L+  F L     G DYS        
Sbjct: 212 KGFQFLLEKRQTQLWEILMYYLSAKEANSERSSEVLSMFFSLGCMQLGQDYSASKSFPHA 271

Query: 185 QVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-----EPGFLIVETNFRV 239
           Q  L  L ++G +Y+    + +F+PT LA ++ + G    I+     +  FLI+ETN+++
Sbjct: 272 QEALDDLAQYGFIYKSSPGSDQFWPTHLATSLCS-GDASAIQSQSADDKRFLILETNYKI 330

Query: 240 YAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS 299
           YAYT + L++A++ LF ++  R+PNL VG L R  V+AA+  GI+A QII +L  HA P 
Sbjct: 331 YAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISAGQIIAYLSSHAHPQ 390

Query: 300 V-SCPLP---PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQ 355
           + + P P   P + DQ+ LW+ ER RL T E V+Y +F S+  F+   + A+    L   
Sbjct: 391 MYNSPPPLLHPTIVDQLHLWDRERNRLQTEETVMY-EFFSKELFDDTVNEAKANAALQHA 449

Query: 356 NEKKRTVVVTKQGHSDIKKF 375
              ++ + +       I +F
Sbjct: 450 ATSQKLLFIEPHTKPAITEF 469


>gi|355693906|gb|AER99491.1| proteinral transcription factor IIH, polypeptide 4, 52kDa [Mustela
           putorius furo]
          Length = 143

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 78/143 (54%), Positives = 105/143 (73%), Gaps = 3/143 (2%)

Query: 244 DSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV--- 300
           +S L++ALI LF E++YRFPN+ V  +TR+SV+ A+ SGITA QII FLR  A P +   
Sbjct: 1   ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 60

Query: 301 SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKR 360
           +  LPP ++DQIRLWE ER+RL  +EGVLY+QF+SQ DFE L  +A++LGVL ++N  KR
Sbjct: 61  TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLVFENSAKR 120

Query: 361 TVVVTKQGHSDIKKFWVNHQKGS 383
            +VVT  GHSD+K+FW   +  S
Sbjct: 121 LMVVTPAGHSDVKRFWKRQKHSS 143


>gi|303272589|ref|XP_003055656.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463630|gb|EEH60908.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 476

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 127/454 (27%), Positives = 199/454 (43%), Gaps = 91/454 (20%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKV----ALLG 57
           L E+AKHY +RL++VE  VP   +  W++      H E+   + ELR    +       G
Sbjct: 39  LPELAKHYCMRLMYVEGGVPSIDMDEWVTDLGRDAHAESVRRMRELRVMLPLEDVTGAAG 98

Query: 58  GGKPWSMSAKLE-------------------VDSKARDL-------DFLNQYALERWECI 91
            G+   ++ K +                     + A DL       + L  YA  RWE +
Sbjct: 99  DGESLGLNPKFQRGMRSIMEGGGFGDGFDDDDAATASDLGSALPSPEDLESYAKGRWEAL 158

Query: 92  LRFMVGSQ-------------------QTEGISADAVRTLFNEGAENPVITKDGFQFLLL 132
           L  + G+                    +  G+  DA +       E   +T+ GF+FLL 
Sbjct: 159 LLTLTGASDAFAAAGANAADLDVGALFRAAGLIGDASK------GEKEGVTEAGFKFLLS 212

Query: 133 ETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLR 192
               Q+W              LD    L+FL +L F   G  YST+G+    +  ++ + 
Sbjct: 213 TAREQIWAL------------LDAPAVLSFLLKLTFQAPGVAYSTDGLPASQKGVVRDVA 260

Query: 193 EFGLVYQRKRKA-GRFYPTKLALNMATRGTLKQIREPG------------------FLIV 233
           + GL+Y       G + PT L+  ++  G      + G                   +IV
Sbjct: 261 KLGLLYPLAAAGKGYYVPTSLSSGLSGGGGGDDDGDGGVGGGGKKSGDGGGVGARGHIIV 320

Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
           ETNFRVYAYT S ++V ++ LF    Y+ PNL VG++TR++V  ALR GI+A QI+ +LR
Sbjct: 321 ETNFRVYAYTSSAVEVEILRLFTRPDYKLPNLYVGMMTREAVVTALRGGISAEQIVSYLR 380

Query: 294 LHALP----SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD-FEKLRDYAQD 348
            HA P    +    +P  V DQIRLW  +  R+  +  VLY  F +    FEK+ + A++
Sbjct: 381 KHAHPQARKTPGPAIPATVCDQIRLWSKDENRVKYTPCVLYCDFPTGTGMFEKVAEIAKE 440

Query: 349 LGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQKG 382
            G+  W +     + V ++GH  +K  +   + G
Sbjct: 441 RGLYLWGDPVGLKLAVREEGHESMKDVFKKIRAG 474


>gi|145351329|ref|XP_001420034.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580267|gb|ABO98327.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 495

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 127/433 (29%), Positives = 188/433 (43%), Gaps = 66/433 (15%)

Query: 5   IAKHYVIRLLFVEQPVPQAVIASWLSK--THAKEHKEATDTLTELR-------------- 48
           IAK  ++RL +V+        A W     + A+  + A   L  LR              
Sbjct: 52  IAKQIILRLAYVDDGADARECAKWCGDGASDARALERAMARLARLRVTTALGEGEDARVV 111

Query: 49  ------RNAKVAL------LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMV 96
                  N + AL      L  G          V +K  D   LN YA  +WE +L  + 
Sbjct: 112 ANEAFANNLRRALERGFVNLDDGDGADAREDAGVAAKVPDRATLNAYAKAKWEDLLLTLT 171

Query: 97  GSQQTEGISADAVRTLFNEGA-----------------------ENPVITKDGFQFLLLE 133
           G+          V+   +  A                       +N  +T +GF FLL  
Sbjct: 172 GASNAFSRPGAKVKGRLDAQALFRAAGLTTATTAALKKGGQKRLKNAGVTAEGFAFLLQT 231

Query: 134 TPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEG-MSDKLQVFLQHLR 192
              Q+W  + +Y+   +S+       ++FL +L F   G  Y+  G +SD  Q  +  L 
Sbjct: 232 AQEQIWVLLTQYIR--DSQKTSATSAISFLLRLTFQEPGRAYAMTGVLSDTEQDVVLDLT 289

Query: 193 EFGLVYQRKRKAGRFY-PTKLALNMA-----TRGTLKQIRE--PGFLIVETNFRVYAYTD 244
             GL+Y  + K   +Y PT LA  ++       G  K       G +IVETN+RVYAYT 
Sbjct: 290 HLGLLYTFEVKKKFYYVPTLLASGLSGGFTGDDGDTKAAAAAAEGHIIVETNYRVYAYTS 349

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV---S 301
           S +++ ++ LF    YR PNL VG+LTR+SV+ ALR+G+ A QI+G++R HA   V    
Sbjct: 350 SAVEMEILRLFTRADYRLPNLYVGMLTRESVQNALRAGVDAEQIVGYIRAHAHKQVRRKK 409

Query: 302 CPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADF-EKLRDYAQDLGVLTWQNEKKR 360
             +P  V DQIRLW  + ER+   E VLY  F     F   +   A+  G L W+++  R
Sbjct: 410 PSVPSTVCDQIRLWARDMERMEAEECVLYCDFPQAGTFYGAVVSEAEKRGALLWRDDASR 469

Query: 361 TVVVTKQGHSDIK 373
            + V    H ++K
Sbjct: 470 RLTVRASAHDEMK 482


>gi|223993063|ref|XP_002286215.1| transcription factor of TFIIH family [Thalassiosira pseudonana
           CCMP1335]
 gi|220977530|gb|EED95856.1| transcription factor of TFIIH family [Thalassiosira pseudonana
           CCMP1335]
          Length = 461

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 114/345 (33%), Positives = 169/345 (48%), Gaps = 52/345 (15%)

Query: 60  KPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE-----GISADA---VRT 111
            PW +  +  +D  + D   L  Y    W+ +L F+VGS +T+     GI       V+ 
Sbjct: 119 SPWPIITQDHIDEFSMDGGDLETYTQSSWDSVLHFLVGSDETDRGSGYGIEEPPEAMVQF 178

Query: 112 LF-------------NEGAENP-VITKDGFQFLLLETPAQVWYFILKYLETV---ESKGL 154
           L              N+ +  P VIT  G++F+L +T AQVW F+L+YL ++   E K  
Sbjct: 179 LMRIGLMQVDPDFKGNDKSRAPLVITSKGYEFMLRDTNAQVWQFVLQYLNSMAHHEQKDF 238

Query: 155 DLVECLTFLFQLKFSTFGTDYSTEGMSDK-LQVFLQHLREFGLVYQRKRKAGR--FYPTK 211
             +E L+FL  L     G  Y +  +  K  +  ++    FGL++   R AG+  FYPT+
Sbjct: 239 IRMEALSFLICLGSCRVGEGYQSSVLGSKSARALMRDFSRFGLLFV-CRVAGKNSFYPTR 297

Query: 212 LALNMA----------------TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLF 255
            A+N+                 T       R    ++V+TNF+V AYT S L ++ +GLF
Sbjct: 298 AAVNLVASSEKAGRQASDLMGQTLAAPVPSRSHLAVVVQTNFQVVAYTKSKLHISTLGLF 357

Query: 256 CEL--MYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP---SVSCPL-PPVVS 309
           C++    R PN+    +TRDS+R+A R G+TA QI+ FL +HA P   S   P+ P  V 
Sbjct: 358 CDVSSFRRLPNVIFFHITRDSIRSAFRLGVTADQILRFLHVHAHPMLRSGDRPMVPSNVV 417

Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTW 354
           DQI LW+ ER R+   E V   Q    A+F  +  YA D+  L W
Sbjct: 418 DQILLWDRERHRVVMDE-VCVHQCRDAAEFTAVSQYASDVDALAW 461


>gi|159474228|ref|XP_001695231.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276165|gb|EDP01939.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 435

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 120/400 (30%), Positives = 180/400 (45%), Gaps = 42/400 (10%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEAT------DTLTELR------- 48
           L  +A+ YV+RLLFV  P P   + SW +K  A  HK A       D LTE R       
Sbjct: 32  LPPLAQVYVMRLLFVPSPFPADFVDSWAAKGAASGHKAALSALRGLDVLTEERGRSRARA 91

Query: 49  ---RNAKVALLGGGKPWSMSAKLEVDSKAR------DLDFLNQYALERWECILRFMVGSQ 99
              R  +    G   P  +S  +  D  A        L+ L ++A+ +WE +  +++GS 
Sbjct: 92  GQGRGRRATGQGPKGPVELSQLMRGDVPAAAASCCPSLEELGEWAVGQWEALQLYLLGSS 151

Query: 100 QTEG-------ISADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVES- 151
           ++         + A         G  +  +T+  F +LL     Q+W  + +Y+   E+ 
Sbjct: 152 RSPPQLPQLLRVRARTCEDGAAAGRRSLAVTQRSFHWLLQPGDRQLWAVLREYISGAEAA 211

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGL--VYQRKRKAGRFYP 209
            G +L   L+FL QL F   G   +   +    Q    HL + G+  V+Q    A     
Sbjct: 212 SGDELAHTLSFLLQLGFRRVGQPCAWAELRPPEQRMAAHLAQLGVLAVFQGSAGAAGSGG 271

Query: 210 TKLALNMATRGTLKQIREP-------GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
              A      G                F+IVE+N+RVY YT S + +A++ LF +     
Sbjct: 272 ALAARGGGAGGAAAAAAMDGGGAGSDAFVIVESNYRVYVYTRSPVTIAVLELFVKREALL 331

Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVS--CP-LPPVVSDQIRLWEGER 319
           PNL VG + RDS+ AAL  GITA +++ +L     PS++  CP +P VVSDQIRLWE   
Sbjct: 332 PNLFVGAIRRDSILAALARGITADELVAYLSARPHPSIAARCPVVPEVVSDQIRLWEASM 391

Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
            RL     VLY    S+  +E+    A+  G L W++  +
Sbjct: 392 NRLQAHPAVLYENMESRQLYERAVAAARAAGTLQWEDGAR 431


>gi|392574410|gb|EIW67546.1| hypothetical protein TREMEDRAFT_33390 [Tremella mesenterica DSM
           1558]
          Length = 504

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 112/381 (29%), Positives = 175/381 (45%), Gaps = 51/381 (13%)

Query: 47  LRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
            +R  + AL G G       P+ +S   E  S+++    L Q+    WE IL++MV S  
Sbjct: 124 FKRGLRNALTGSGTSNSFGLPYPLSPTDEYPSESQ----LIQFGDRTWESILKYMVSSGL 179

Query: 101 TEGIS-----ADAVRTLFNEGA--------------ENPVITKDGFQFLLLETPAQVWYF 141
             G          +R L + G               +   IT  GFQFLL +   Q+W  
Sbjct: 180 DRGFPVARPETSVLRLLHSSGLMADLSESNGRQQSLDRMTITSQGFQFLLEDRQTQLWQI 239

Query: 142 ILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQV-FLQHLREFGLVYQR 200
           ++ YL ++E   +     L+  F L        YS    S+  Q+  L  L+ +GL+Y+ 
Sbjct: 240 LVFYLTSIEVSCISSAPVLSLFFSLGCMQITQSYSLSTFSNPTQLQALDDLQSYGLIYRP 299

Query: 201 KR----KAGRFYPTKLAL-----NMATRGTLKQIR-------EPGFLIVETNFRVYAYTD 244
                 KA  F+PT L+      N A    L           +  FLI+ETN+++YAYT+
Sbjct: 300 ATPSGAKADYFFPTHLSTSLCSGNSALSAALNTTEIDTSHEDDKKFLILETNYKIYAYTN 359

Query: 245 SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPL 304
           + L++A++ LF ++  ++ NL VG L R  V+AA+  GI+A QII +L+ HA P +    
Sbjct: 360 NELEIAILNLFMDIKVQYRNLVVGKLDRAHVKAAMEKGISAYQIIAYLQSHAHPQMYSSP 419

Query: 305 PPV----VSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKR 360
           PP+    + DQ+ LW+ ER RL T E  ++ +F S+  +E     A+    L     K +
Sbjct: 420 PPILHTTIVDQLHLWDKERNRLRTEESEMF-EFFSKDLYEDTEAEAKRYDGLLLAVPKDK 478

Query: 361 TVVVTKQGHSDIKKFWVNHQK 381
            + V       IK F    Q+
Sbjct: 479 LLFVNPSVKDAIKDFVKGQQR 499


>gi|164660182|ref|XP_001731214.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
 gi|159105114|gb|EDP44000.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
          Length = 231

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 78/184 (42%), Positives = 110/184 (59%), Gaps = 5/184 (2%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
           IT  GFQFLL +  +Q+W  +L YL   + + +DLVE L F F +     G  Y + G S
Sbjct: 36  ITSLGFQFLLQDVNSQLWALLLHYLSMADERNMDLVEVLAFFFTVGGLELGRAYESRGFS 95

Query: 182 DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-----LKQIREPGFLIVETN 236
                 L+ L ++GLVY+  + A  F+PT+LA  + +  +     L    E G+LI+ETN
Sbjct: 96  QTQLQTLEELSDYGLVYRPSKSAKYFFPTRLASTLTSTASPLLSRLNDQEEQGYLILETN 155

Query: 237 FRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA 296
           +RVYAYT + L++A++ LF  L  R PNL +G LTR SV++AL  GITA QII +L  HA
Sbjct: 156 YRVYAYTANLLRIAILNLFVTLKSRLPNLVIGQLTRHSVKSALNKGITADQIITYLTHHA 215

Query: 297 LPSV 300
            P +
Sbjct: 216 HPQM 219


>gi|307106040|gb|EFN54287.1| hypothetical protein CHLNCDRAFT_25123 [Chlorella variabilis]
          Length = 458

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 102/322 (31%), Positives = 154/322 (47%), Gaps = 24/322 (7%)

Query: 80  LNQYALERWECILRFMVGSQQTEGIS---------------ADAVRTLFNEGAENPVITK 124
           L+ YA  +WE +L ++V    T  ++               A A   + +E      IT+
Sbjct: 133 LDAYARRQWEALLLYLVNGNGTPPMAPPVLHATPIDIPSLLAAAGLMVKDECTLEQKITE 192

Query: 125 DGFQFLLLETPAQVWYFILKYLETVESKG-LDLVECLTFLFQLKFSTFGTDYSTEGMSDK 183
            GFQFLL    +Q+W  + +YL  + + G  DL   + FL +L         +   +   
Sbjct: 193 HGFQFLLANLYSQLWSAVRQYLTLLNTAGGADLAVAINFLLRLGLQGAAAAMAHSQLDSA 252

Query: 184 LQVFLQHLREFGLVYQRKRKAGR-FYPTKLALNMATRGTLKQIR---EPGFLIVETNFRV 239
            +    H+ + GL+           +PT+LA  +A  G   +     E G++IVE+NFRV
Sbjct: 253 ERTIAAHMCQLGLLMPVPGANELWLHPTRLAAVLAGGGRAGEAAVAPEEGYVIVESNFRV 312

Query: 240 YAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS 299
           YAYT S ++VA++ +F       PNL VG +TR+S   AL +GI A Q++ FLR HA P 
Sbjct: 313 YAYTTSAVQVAVLRVFVRCDALLPNLFVGTITRESATNALDTGIAADQVVAFLRQHAHPR 372

Query: 300 VSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
            +   P VV+DQIRLW  E +RL      LY +F S+  +     +A+ L  L +  E +
Sbjct: 373 AAAKTPTVVTDQIRLWAQELKRLQEKNATLYDKFESKELYVGAVAHARQLNALLYSCEDR 432

Query: 360 RTVVVTKQGHSDIKKFWVNHQK 381
           R +VV    H       V H K
Sbjct: 433 RQLVVESAFHG----LMVGHLK 450


>gi|303388087|ref|XP_003072278.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon
           intestinalis ATCC 50506]
 gi|303301417|gb|ADM10918.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon
           intestinalis ATCC 50506]
          Length = 414

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 99/302 (32%), Positives = 154/302 (50%), Gaps = 36/302 (11%)

Query: 87  RWECILRFMVGSQQTEGISADAVRTLFNEGA---ENPVITKDGFQFLLLETPAQVWYFIL 143
           R+E ILR +V  +    I    VR +   G     +  IT  GF+FLL     Q+W  +L
Sbjct: 131 RFETILRSIVNKESK--IREFGVREVLMFGGLLGRDENITNRGFEFLLKTKKEQLWCLVL 188

Query: 144 ----KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
                +L++VE +    V  L  +F+L     GT Y      DK    L++L   G++  
Sbjct: 189 LSLKYFLKSVEEE----VSTLEAIFELSTKNVGTVYRHSSSMDKR--LLKYLEVLGIL-- 240

Query: 200 RKRKAGRFYPTKLALNMATRGTLKQIREPG------FLIVETNFRVYAYTDSNLKVALIG 253
                 + Y      N+A R +  Q+ E        F+IVETN ++YAYT+S  + ++I 
Sbjct: 241 ------KLYGE----NLAIRRSFVQLFEASERNRREFIIVETNNKIYAYTNSEYEKSVIH 290

Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIR 313
           LFC + +  PNLT G +T +SV AA   GIT  QII FL   + P     LPP + +QI 
Sbjct: 291 LFCNVSFNLPNLTKGSITEESVNAAFDKGITGRQIIHFLEASSKPG---SLPPAIINQII 347

Query: 314 LWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIK 373
           +WE +R R+  + G LYS F++ +D++K+ ++  +   L   +  +R +VV  +GH  +K
Sbjct: 348 IWESKRNRIFMAPGYLYSNFLNLSDYQKVLEFCSERNYLIESDIDRRMIVVNPKGHVFVK 407

Query: 374 KF 375
           +F
Sbjct: 408 EF 409


>gi|412992641|emb|CCO18621.1| predicted protein [Bathycoccus prasinos]
          Length = 542

 Score =  144 bits (363), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 163/371 (43%), Gaps = 71/371 (19%)

Query: 76  DLDFLNQYALERWECILRFMVGSQQTE---------------GISADA---VRTLFNEG- 116
           D  FL+ YA  RWE +L  +      E               G  A     +R LF    
Sbjct: 171 DAGFLDHYAKSRWETVLLELTSEASVEKKRKKKKMMKKKGDGGAGAQHALHLRKLFYAAR 230

Query: 117 --AENPVITKDGFQFLLLETPAQVWYFILKYL---------------------ETVESKG 153
              ++  IT +GF FLL  T  Q+W  + +Y                      + +E+ G
Sbjct: 231 LITKDGRITSEGFSFLLGATSEQIWILLARYARGGDFEKKMKNKTTNEENGEGDKMETNG 290

Query: 154 LD------------LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK 201
            D                + FL +L F   G  YS   +S+  +    HL   G++Y+ +
Sbjct: 291 EDEKQQHKLKSDESSAAAMAFLVRLSFQHPGRKYSKANLSEAERRVASHLSALGVLYENE 350

Query: 202 RKAGR---FYPTKLALNMATRGTLKQIREP-----GFLIVETNFRVYAYTDSNLKVALIG 253
                   + PT L+  +++  T    +       G +IVETNFRVYAYT S L+  ++ 
Sbjct: 351 DDENDNNWYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLR 410

Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC-------PLPP 306
           LF    Y+ PN  VG++TRDS+  A+R+GI+  QI+ +L+  A P  +         +PP
Sbjct: 411 LFTRPDYKLPNAYVGMITRDSILDAMRAGISPDQIVNYLQTRAHPRCTIGKRPNHPAVPP 470

Query: 307 VVSDQIRLWEGERERLTTSEGVLYSQFVSQAD-FEKLRDYAQDLGV-LTWQNEKKRTVVV 364
            V DQIRLW  +  R+   + V+Y+ F  Q + F+   + A+++G  + W +E KR   V
Sbjct: 471 TVCDQIRLWARDLYRVKADDCVMYTDFPMQGNQFQDAVNNARNVGAQILWMDENKRRFAV 530

Query: 365 TKQGHSDIKKF 375
               H  +K F
Sbjct: 531 DADSHERLKVF 541


>gi|323455511|gb|EGB11379.1| hypothetical protein AURANDRAFT_52511 [Aureococcus anophagefferens]
          Length = 291

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 93/266 (34%), Positives = 144/266 (54%), Gaps = 21/266 (7%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETV-ESKGLDLVECLTFLFQLKFSTFGTDYSTEGM 180
           +T  G  FLL E   QVW  + +Y+    ++KG    E +  +  L ++T G  Y+   +
Sbjct: 12  VTGRGVDFLLKERHEQVWALVDEYIRAAGDAKG----EVVALVLTLAYATPGEGYAIHEL 67

Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMA-----TRGTLK---QIREPGFLI 232
           S+  +  L  L   GLVY+R   + RFYPT L +++A     + G  +     R P  +I
Sbjct: 68  SEAQKAALDVLFALGLVYRRNASSSRFYPTTLGVDVAFGARRSAGGARAGGDFRRPVDVI 127

Query: 233 VETNFRVYAYTD-----SNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQ 287
           V+TNF+V AYTD     S L +A + LF EL  R PNL VG ++RD+++  +  GI   Q
Sbjct: 128 VQTNFQVLAYTDAGVNTSTLVLATLNLFAELTTRLPNLVVGTISRDAIKRCVDRGIRVPQ 187

Query: 288 IIGFLRLHALPSV-SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYA 346
           I+ FLR HA P++ +  +P  V+DQ+ LW GE  R+  ++GVL +   S A ++   D A
Sbjct: 188 IVKFLRAHAHPAMKASGVPQNVTDQMALWAGEGNRVAFTDGVLVA-LDSDALYDAALDAA 246

Query: 347 QDLGV-LTWQNEKKRTVVVTKQGHSD 371
           +DL   + W++ +KR + V      D
Sbjct: 247 RDLDARVHWKSARKRCLFVDADAEPD 272


>gi|322785901|gb|EFZ12520.1| hypothetical protein SINV_14207 [Solenopsis invicta]
          Length = 170

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 69/124 (55%), Positives = 84/124 (67%), Gaps = 21/124 (16%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  IAK+YV+RLLFVEQPVPQAVIASW SK + +EH++    L EL             
Sbjct: 47  ELPVIAKNYVMRLLFVEQPVPQAVIASWCSKLYFEEHQKVVQVLNELYVWKEASIPGGLP 106

Query: 48  --------RRNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                   ++N K+ LLGGGKPW+MS +LE DSK RD+ FL+ YALERWEC+L +MVGSQ
Sbjct: 107 GWILNNTFKKNLKIVLLGGGKPWTMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQ 166

Query: 100 QTEG 103
           Q EG
Sbjct: 167 QQEG 170


>gi|167535774|ref|XP_001749560.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771952|gb|EDQ85611.1| predicted protein [Monosiga brevicollis MX1]
          Length = 417

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 84/235 (35%), Positives = 129/235 (54%), Gaps = 18/235 (7%)

Query: 79  FLNQYALERWECILRFMVGSQQTEGISADAV----RTLFNEGAENPV-ITKDGFQFLLLE 133
           FL +++   WE IL +M  + + +  +   +    + + N+  + P+ +T +GFQFLL +
Sbjct: 128 FLEEHSRVSWEAILSYMTTNAELKNSTMILILQEAQLIVND--QPPISVTPEGFQFLLQD 185

Query: 134 TPAQVWYFILKYL------ETVESKGLDLVECLTFLFQLKFSTFGT----DYSTEGMSDK 183
              QVW  I+ +L      E +  +GL + E L  LF++           ++S E ++  
Sbjct: 186 RANQVWLLIVTFLKLLSREEPIPGEGLPMHEALVLLFRIGLGRLNEVSVLEFSAENLTTM 245

Query: 184 LQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR-EPGFLIVETNFRVYAY 242
              FL+ L   G+VY R RK+ RF+ T  A+ +     L   + E GFL+ ETNF VYAY
Sbjct: 246 QITFLRQLHYMGVVYHRNRKSKRFFLTPYAVMLYHNAELALSQTETGFLLAETNFHVYAY 305

Query: 243 TDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
           TDS +KVAL+  F  L YR P +T  ++TR SVR AL  GITA QI+ F++  +L
Sbjct: 306 TDSAVKVALLSKFATLTYRLPIMTTAIITRTSVRRALSQGITADQILRFMQRSSL 360


>gi|207340606|gb|EDZ68904.1| YPL122Cp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 226

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 72/168 (42%), Positives = 106/168 (63%), Gaps = 21/168 (12%)

Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
           G LIVETNF++Y+Y++S L++A++ LF  L  RF N+ +G +TR+S+R AL +GITA QI
Sbjct: 50  GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 109

Query: 289 IGFLRLHALPSV----------------SCP-----LPPVVSDQIRLWEGERERLTTSEG 327
           I +L  HA P +                +C      LPP V DQIRLW+ E +R+ T EG
Sbjct: 110 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 169

Query: 328 VLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            LYS F +  ++  L  YAQD+GVL W+++KK+   ++K+G+S +  F
Sbjct: 170 SLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFISKEGNSQVLDF 217


>gi|401825125|ref|XP_003886658.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
           ATCC 50504]
 gi|395459803|gb|AFM97677.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
           ATCC 50504]
          Length = 414

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 84/259 (32%), Positives = 133/259 (51%), Gaps = 21/259 (8%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYS-TEGM 180
           IT  GF+FLL     Q+W  +L  L+       + +  L  LF+L     G  Y  T+ M
Sbjct: 167 ITNRGFEFLLKTKKEQLWCLVLLSLKYFLGSVDEEIHALEALFELSTKAVGIAYHRTDRM 226

Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT----RGTLKQIREPGFLIVETN 236
             +L   L++L   G++        + Y   LA+  +       + +  RE  F+IVETN
Sbjct: 227 DARL---LKYLEALGIL--------KLYKESLAIGKSFVQLFEASERNRRE--FIIVETN 273

Query: 237 FRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA 296
            ++YAYT+S  + ++I LFC +    PNL  G +T +SV  A   GIT  QII FL    
Sbjct: 274 NKIYAYTNSEYEKSVIHLFCSVSVNLPNLIKGSITEESVNVAFDKGITGKQIIHFLEASV 333

Query: 297 LPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQN 356
            P     LPP +S QI +WE +R R+  + G +YS F++ +D++++ ++  +   L   +
Sbjct: 334 KPG---SLPPAISSQIMIWESKRNRIFMAPGYIYSNFLNLSDYQRVLEFCTERSYLIESD 390

Query: 357 EKKRTVVVTKQGHSDIKKF 375
             KR +VV  +GH  +K+F
Sbjct: 391 VDKRMIVVKVEGHELVKEF 409


>gi|361130709|gb|EHL02459.1| putative RNA polymerase II transcription factor B subunit 2 [Glarea
           lozoyensis 74030]
          Length = 210

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 75/202 (37%), Positives = 115/202 (56%), Gaps = 28/202 (13%)

Query: 187 FLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-LKQIRE------------PGFLIV 233
            L++L +FGLVY       +F+PT+LA  + +  + L+ I               GF+I+
Sbjct: 1   MLENLVDFGLVYIPPSVKDQFFPTRLATTLTSDASALRSITAGFESALSTGSGTAGFIII 60

Query: 234 ETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLR 293
           ETN+       S L++A++ LF +L  R+PN+  G +TR+SV  A+ SGI++ QII +L 
Sbjct: 61  ETNYH------SPLQIAVLALFTKLTTRYPNMVTGRVTRESVYRAVSSGISSDQIITYLS 114

Query: 294 LHALPSV--------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRD 344
            HA P +          P LPP V DQIRLW+ E ER+ T+ G L+  F S  +++KL  
Sbjct: 115 THAHPELLKASAAKGGGPVLPPTVVDQIRLWQIENERMKTTYGFLFKDFESTDEYKKLLK 174

Query: 345 YAQDLGVLTWQNEKKRTVVVTK 366
           YA ++GVLTW ++ +    V+K
Sbjct: 175 YADEIGVLTWASKTEEKFFVSK 196


>gi|85691005|ref|XP_965902.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi GB-M1]
 gi|19068469|emb|CAD24937.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
 gi|449329766|gb|AGE96035.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi]
          Length = 414

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 84/256 (32%), Positives = 129/256 (50%), Gaps = 15/256 (5%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
           IT  GF+FLL     Q+W  +L  L+       + +  L  LF+L     GT Y      
Sbjct: 167 ITNRGFEFLLKTKKEQLWCLVLLSLKYFLGSVEEEIAVLEALFELSARAVGTVYRQVDQM 226

Query: 182 DKLQVFLQHLREFGLV--YQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRV 239
           D+     ++L   G++  Y++    GR +      +   R      RE  F+IVETN ++
Sbjct: 227 DRR--LFKYLEALGILRLYEKGLAIGRSFVQLFEASERNR------RE--FIIVETNNKI 276

Query: 240 YAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS 299
           YAYT+S  + ++I LFC +    PNL  G++T +SV  A   GIT  QII FL       
Sbjct: 277 YAYTNSEYEKSVIHLFCNVTVNLPNLIKGIITEESVNVAFDKGITGKQIIHFLEASVRQG 336

Query: 300 VSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKK 359
               LPP + +QI +WE +R R+    G LYS F++ +D++K+ ++      L   +  K
Sbjct: 337 ---SLPPAIRNQIVIWESKRNRIFMVPGYLYSNFLNLSDYQKVLEFCIRGNHLVESDVDK 393

Query: 360 RTVVVTKQGHSDIKKF 375
           R +VV  +GHS +K+F
Sbjct: 394 RMIVVKLEGHSLVKEF 409


>gi|396080769|gb|AFN82390.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon
           romaleae SJ-2008]
          Length = 414

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 90/296 (30%), Positives = 147/296 (49%), Gaps = 24/296 (8%)

Query: 87  RWECILRFMVGSQQTEGISADAVRTLFNEGA---ENPVITKDGFQFLLLETPAQVWYFIL 143
           ++E ILR +V  +    I A  +  +   G    ++  IT  GF+FLL     Q+W  +L
Sbjct: 131 KFEAILRSIVNREYR--IQAFGIGEVLMFGGLLRKDRNITNRGFEFLLKTKKEQLWCLVL 188

Query: 144 ----KYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
                +L +VE +    +  L  LF+L   T  T +  +G  D     L++L   G++  
Sbjct: 189 LSLKYFLRSVEEE----IYALEALFELSTKTIETAHHQDGRMDGR--LLKYLEALGIL-- 240

Query: 200 RKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELM 259
             R  G       +       + +  RE  F+IVETN ++YAYT+S  + ++I LFC + 
Sbjct: 241 --RVCGERLAIGRSFVQLFEASERNRRE--FIIVETNNKIYAYTNSEYEKSVIHLFCNVS 296

Query: 260 YRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGER 319
              PNL  G +T +SV  A   GIT  QII FL     P     LPP ++ QI +WE +R
Sbjct: 297 VSLPNLIKGSITEESVNVAFDKGITGKQIIHFLEASVKPG---SLPPAITSQIMIWESKR 353

Query: 320 ERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            R+    G +YS F++ +D++++ ++  +   L   +  KR +VV  +GH  +++F
Sbjct: 354 NRIFMVPGYIYSNFLNLSDYQRVLEFCTERSYLIESDVGKRMIVVKLEGHELVREF 409


>gi|397564488|gb|EJK44232.1| hypothetical protein THAOC_37249 [Thalassiosira oceanica]
          Length = 971

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/318 (32%), Positives = 152/318 (47%), Gaps = 51/318 (16%)

Query: 80  LNQYALERWECILRFMVGSQQTE-----GISADAVR-----TLFNEGAENP--------- 120
           L  +   +W+ +L F+VGS  ++     GI    V      T      E+P         
Sbjct: 225 LEAHTQSQWDSVLHFLVGSDDSDRKGVNGIEEPNVAMVNFLTRIGLMQEDPDFTGKDRSR 284

Query: 121 ---VITKDGFQFLLLETPAQVWYFILKYLETVESKGL-DLV--ECLTFLFQLKFSTFGTD 174
              VIT  G++F+L +T AQVW F+L+YL ++    L D +  E L+FL  L     G  
Sbjct: 285 APLVITSKGYEFMLRDTNAQVWQFVLQYLNSMAHHDLKDTIRKEALSFLICLGSCRIGEG 344

Query: 175 YSTEGMSDK-LQVFLQHLREFGLVYQRKRKAGR--FYPTKLALNMATRGTLKQIREPGFL 231
           Y +  +  K  +V ++    FGL++   R AG+  FYPT++A+N+         R+   L
Sbjct: 345 YFSSVLGSKSARVLMKDFARFGLLFV-CRVAGKTAFYPTRVAVNLVASNEKGGSRQSDAL 403

Query: 232 I--------VETNFRVYAYTDSNLKVALIGLFCELMY-RFPNLTVGVLTRDSVRAALRSG 282
           +        +E        T S+L V        + Y R PN+    LTRDS+++A R G
Sbjct: 404 LPSVAATRSLEEAMNAPDPTRSHLAV--------ISYKRLPNVVFFHLTRDSIKSAFRLG 455

Query: 283 ITAAQIIGFLRLHALP---SVSCPLPPV-VSDQIRLWEGERERLTTSEGVLYSQFVSQAD 338
           +TA QI+ FL++HA P   S + P+PP  V DQI LW+ ER R+   E V   Q    A+
Sbjct: 456 VTADQILRFLQVHAHPMMRSGNQPMPPANVRDQILLWDRERRRVVMDE-VWVHQCRDDAE 514

Query: 339 FEKLRDYAQDLGVLTWQN 356
           F  +  YA D   L W N
Sbjct: 515 FSAVGMYASDSEALAWGN 532


>gi|324515605|gb|ADY46257.1| General transcription factor IIH subunit 4 [Ascaris suum]
          Length = 245

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 126/237 (53%), Gaps = 37/237 (15%)

Query: 64  MSAKLEVDSKARDLDF--LNQYALERWECILRFM-VGSQQTEGISADAVRTLF------- 113
           + A  ++D K+R      L + A ERWECIL ++ + SQ++E   + A + LF       
Sbjct: 9   LKAMTDLDEKSRKSASKDLGKKATERWECILHYLALPSQKSEQGVSGATKQLFRAAGLTS 68

Query: 114 -NEGAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQL------ 166
             E   +  IT  GFQFLLL    Q+W +IL YL   ES G +++  L FL +L      
Sbjct: 69  GGESEGDMEITSAGFQFLLLNQTEQIWMYILHYLRLEESMGKNVMAELDFLLKLTLCVDH 128

Query: 167 ---------------KFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPT- 210
                          K +T+   +  E  S+ +  FL  LRE GLV+ RKRK G F+ T 
Sbjct: 129 THTNGRLRVVDGDREKKATWRAFFIDESWSETITNFLMQLRELGLVFIRKRKDGYFFITP 188

Query: 211 ---KLALNMATRGTLKQIR-EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFP 263
               L ++ ++   + + R   G++IVETN+RVYAYTDS+L++A++  F E++YR+ 
Sbjct: 189 LFAHLTVSSSSETAVAEKRTHSGYIIVETNYRVYAYTDSSLQLAILSTFTEMLYRWA 245


>gi|440493551|gb|ELQ76006.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB2 [Trachipleistophora
           hominis]
          Length = 426

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/281 (29%), Positives = 145/281 (51%), Gaps = 32/281 (11%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTF---LFQ----LKFS---TF 171
           IT  GF+FLL     Q+W+ IL  +  +    +D +  + F   +++    LK +    F
Sbjct: 157 ITHSGFEFLLKSRKEQMWFLILNGILMLGKNYVDWLTMIVFEMGMYERHCWLKINESKIF 216

Query: 172 GT------DYSTEGMSDK-----LQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRG 220
           GT      + + + +SDK     L   L++L   G++   +       PT     + +  
Sbjct: 217 GTARKHNKNTNIDPVSDKEAHSVLVRLLKYLSYLGII---QYSNDMVKPTAEYFLLFSSA 273

Query: 221 TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALR 280
            +    +  FLIVETN+++YAYT S+ ++++I LF +++   PNL    +T +SV AA  
Sbjct: 274 AI----QDSFLIVETNYKLYAYTTSSHELSIIKLFSQIVRELPNLVTAHITEESVNAAFL 329

Query: 281 SGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
            GIT  QI+ +L           LPPVV +QI +WE +R+R+   + V+YS F++  ++E
Sbjct: 330 KGITGQQIVDYLT----EKSKSELPPVVLEQILIWERQRDRMKCIDAVIYSHFMTYNEYE 385

Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
               Y ++ G L   +E +R +VV  + H+++K F  N+ K
Sbjct: 386 ITYRYCREKGALIDHDEFRRLLVVRLECHNEVKNFIKNNIK 426


>gi|422294120|gb|EKU21420.1| transcription initiation factor TFIIH subunit 4, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 213

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 74/212 (34%), Positives = 111/212 (52%), Gaps = 29/212 (13%)

Query: 172 GTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR----- 226
           G  Y    +S   Q  L+    FGL+YQ  R +  FYPT +A+N+   GT+++ R     
Sbjct: 1   GRPYPVRALSPTQQALLEKFISFGLIYQDDRHSRHFYPTAVAVNLIFGGTVQEERLRRGH 60

Query: 227 -----------------EPGFL--IVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTV 267
                            +P  L  IVETN+++ AYT S+L V ++ +F ++  R PN+ +
Sbjct: 61  GHVGPEGRENTKELRLVDPSQLAVIVETNYQLVAYTSSSLHVEMLRIFTDVRCRLPNVVI 120

Query: 268 GVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCP----LPPVVSDQIRLWEGERERLT 323
           G +TR SVR A+ SGITAA I+ FL+ H   +V       LP  V  QI LW  ER R+ 
Sbjct: 121 GFITRASVRRAMASGITAATILSFLKTHTHVAVRAGKGRLLPENVEAQIELWHQERSRVK 180

Query: 324 TSEGVLYS-QFVSQADFEKLRDYAQDLGVLTW 354
             E ++     +   +FE++R YA++L V+ W
Sbjct: 181 FEEVMMIDLSSLMVEEFEEVRTYAENLAVVCW 212


>gi|429965574|gb|ELA47571.1| hypothetical protein VCUG_00894 [Vavraia culicis 'floridensis']
          Length = 424

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 80/273 (29%), Positives = 137/273 (50%), Gaps = 30/273 (10%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTF------------LFQLKFS 169
           IT  GF+FLL     Q+W+ IL  +  +    +D +  + F            + + K  
Sbjct: 157 ITHSGFEFLLKNRKEQMWFLILNGILMLGKSYVDWLIMIVFEMGMYERYRWLKVNEAKIF 216

Query: 170 TFGTDYSTEGMS-------DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTL 222
               ++S    S       D L  FL++L   G++   +       PT+    + +  + 
Sbjct: 217 CDVCEHSNNTGSVGGGEEGDILFRFLKYLSYLGII---QHSNDMVKPTEEYFLLFSSAS- 272

Query: 223 KQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSG 282
               +  FL+VETN+++YAYT S+ ++++I LFC+++   PNL V  +T +SV AA   G
Sbjct: 273 ---TQDSFLVVETNYKLYAYTTSSHELSIIKLFCQIIRELPNLVVAHITEESVNAAFVKG 329

Query: 283 ITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKL 342
           IT  QI+ +L           LP VV +QI +WE +R+R+   + V+YS F++  ++E  
Sbjct: 330 ITGQQIVDYLN----EKSRSELPVVVLEQILIWERKRDRMKCMDAVIYSHFMTYGEYEIT 385

Query: 343 RDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             Y ++ G L   +E +R +VV  + H+D+K F
Sbjct: 386 YKYCKEKGALVDYDEFRRLLVVKLEYHNDVKNF 418


>gi|449017614|dbj|BAM81016.1| similar to TFIIH subunit TFB2 [Cyanidioschyzon merolae strain 10D]
          Length = 693

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 64/157 (40%), Positives = 96/157 (61%), Gaps = 7/157 (4%)

Query: 228 PGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQ 287
           P  +IVETNFR+YAY  S+ +VAL+ LF  ++YR P + +GV+TRDSVR AL+ GITA Q
Sbjct: 535 PFQIIVETNFRLYAYAASSFQVALLSLFTRILYRMPGVAIGVITRDSVRRALKCGITAKQ 594

Query: 288 IIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD----FEKLR 343
           ++ FL +H++   S P    V DQI LWE ER+R+    GVL   F   +D    F++L+
Sbjct: 595 LLHFLGIHSMEGKSVPF--NVHDQILLWELERKRIQAYPGVLLEGFEPTSDGRAFFDQLQ 652

Query: 344 DYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQ 380
           +Y  +LG   W +  ++ +V        +++ W+  Q
Sbjct: 653 EYTVELGAQQWCDRVRQLLVADASSFERLRE-WIRRQ 688



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 43/145 (29%), Positives = 60/145 (41%), Gaps = 26/145 (17%)

Query: 80  LNQYALERWECILRFMVG--------SQQT----------EGISADAVRTLFNEGAENPV 121
           L +Y    WE +L F++G         Q T          E  S + +  L + G    V
Sbjct: 231 LREYTRHAWEHVLHFVIGIPVAEPEIRQPTGPEHTPEHVMEPPSDEVIEALLSLGVSEQV 290

Query: 122 ------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDY 175
                 IT+ GF +LLL+  AQ+W  +  Y+E  +    D +E    LF+L FS  G  Y
Sbjct: 291 GSDGMRITERGFSYLLLDIHAQLWALLEAYMENWDEARGDRLELFDLLFRLGFSVPGHIY 350

Query: 176 S--TEGMSDKLQVFLQHLREFGLVY 198
                 +S      L  L E GLVY
Sbjct: 351 DGGDPSLSAAQCRMLGFLAEIGLVY 375


>gi|428181353|gb|EKX50217.1| TFB2 transcription factor B2 nucleotide excision repair [Guillardia
           theta CCMP2712]
          Length = 463

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 92/297 (30%), Positives = 134/297 (45%), Gaps = 38/297 (12%)

Query: 61  PWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVG---------------SQQTEGI- 104
           PWS   ++  DS     + L Q+A  RWE IL +MV                ++   GI 
Sbjct: 123 PWS-KGRIMGDSPDTTAEELQQWAQNRWESILHYMVSPGLDPMKRPEPGWKPTKWVRGIL 181

Query: 105 -SADAVRTLFNEGAEN--PVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLT 161
              D     F    +N  P IT  GFQFLL     Q+   ++  L    S+  DL + L+
Sbjct: 182 VGEDKNGNSFMMSLQNLAPKITAFGFQFLLRPLRNQILQMLMHVLTKKGSENKDLQDSLS 241

Query: 162 FLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT 221
            +FQL F T G  Y  + +S   Q  +  L + G VY +      F+PT L +N+ T   
Sbjct: 242 CIFQLSFCTLGAGYRLDLLSRSNQELITSLHDLGAVYFKPNDRQYFFPTPLIVNLCTESE 301

Query: 222 LKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRS 281
           +     P      T+  V     +     ++ LF   +Y+FP++ + V+TR+S+R AL  
Sbjct: 302 VPS--GPS-----TDAGVADEFSAEPAAGILRLFTRPIYKFPHMIIAVITRESIRNAL-- 352

Query: 282 GITAAQIIGFLRLHALPSV--SCPL-PPVVSDQIRLWEGERERLTTSEGVLYSQFVS 335
                 II +LR+HA P    + P+ P VV+DQI  WE ER R+     V Y  F S
Sbjct: 353 ------IIEYLRMHAHPQCLENWPIVPEVVTDQICFWEQERCRIRAEPAVAYHNFFS 403


>gi|449521313|ref|XP_004167674.1| PREDICTED: RNA polymerase II transcription factor B subunit 2-like,
           partial [Cucumis sativus]
          Length = 296

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 75/246 (30%), Positives = 123/246 (50%), Gaps = 30/246 (12%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK +V+++L+++ PV    +  W+      ++K A D L +LR             
Sbjct: 40  LPPLAKKFVLQMLYIDGPVSAKSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETT 99

Query: 49  --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                    N +  L+ G           +  +   L+ L  YAL++WEC L  ++ S Q
Sbjct: 100 YRLNPTFQANLQKLLIHGEVLAREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQ 159

Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
            E  S  + +V  +F +G       E P +T+ GFQFLL+ET AQ+WY I +Y+   E +
Sbjct: 160 AEKPSNISSSVMKVFQKGLLSQRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEER 219

Query: 153 GLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTK 211
           G+D  + ++FL +L F   G  Y  + +SD+ +  ++ L + GLV  Q+ RK   F PTK
Sbjct: 220 GVDPADLISFLLELSFHVTGEAYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTK 279

Query: 212 LALNMA 217
           LA N++
Sbjct: 280 LATNLS 285


>gi|308460920|ref|XP_003092758.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
 gi|308252517|gb|EFO96469.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
          Length = 128

 Score =  115 bits (289), Expect = 3e-23,   Method: Composition-based stats.
 Identities = 56/127 (44%), Positives = 86/127 (67%), Gaps = 9/127 (7%)

Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALP-------SVSCPLPPVVSDQIRLWEG 317
           ++VG++TR+SVR AL+ GITAAQII FLR +A P       +++C LP  V+DQIRLWE 
Sbjct: 1   MSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGAINC-LPITVADQIRLWED 59

Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWV 377
           ER R+   +  +YS F S+ +F+ + DYAQ+ G+L W N +++ V+V ++GH  +++ W 
Sbjct: 60  ERRRMDLKDAYIYSHFESEDEFQGVCDYAQERGILLWANAQQKLVIVNEEGHEYVRQ-WY 118

Query: 378 NHQKGSQ 384
              KG +
Sbjct: 119 KRSKGGE 125


>gi|300707677|ref|XP_002996037.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
 gi|239605297|gb|EEQ82366.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
          Length = 403

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 81/298 (27%), Positives = 151/298 (50%), Gaps = 22/298 (7%)

Query: 87  RWECILRFMVGSQQTEGISADAVRTL-FNEGAENPV-ITKDGFQFLLLETPAQVWYFILK 144
           ++E IL+F+V        +   V  L F+   +N   IT  GF+FLL     Q W+ I+ 
Sbjct: 121 KFEEILKFLVTKNNVNH-APGVVNVLKFSNLIDNNYDITNKGFEFLLKPRYDQYWFLIIA 179

Query: 145 YLE--TVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKR 202
            L+   V+    ++++   F+  ++ S     Y  +   D  + F   L   GL+     
Sbjct: 180 ALKFYCVD----EILQISNFMSIMELSNMLPIYKYKLKKDVNKKFYDFLSYLGLIKLEND 235

Query: 203 KAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRF 262
               ++      N+  +   K++R   F+++ETNF++YAYT S  ++++I LF  +  + 
Sbjct: 236 ILTIYH------NLFVKSDTKKLR---FILLETNFKLYAYTSSVYEMSIIQLFSNIYLKM 286

Query: 263 PNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERL 322
           PNL    +T +S+  A   G+T+ QII FL+ ++L      +P  +  QI +WE +R+R+
Sbjct: 287 PNLIKASITEESLSNAFSKGVTSQQIINFLKSYSLFE---DIPVAIISQIIIWETKRKRI 343

Query: 323 TTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQ 380
               G LYS F++  D++K+  +      +  ++++KR +V+ K  H+++ K +V  Q
Sbjct: 344 KIFPGYLYSNFLNLIDYQKVVKFCLSNDCIIEKDDEKRMIVI-KPDHNEVVKKFVKQQ 400


>gi|403348380|gb|EJY73625.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Oxytricha trifallax]
          Length = 532

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 80/254 (31%), Positives = 131/254 (51%), Gaps = 32/254 (12%)

Query: 67  KLEVDSKARDLDFLNQYALERWECILRFMVG---SQQTEGIS-----ADAVRTLFNEGAE 118
           +L+   K ++   L+  +  +WE I +FM+     QQ    S       A    F+  A+
Sbjct: 142 RLKEKKKIKNQAQLDTASQTKWENISKFMLNIIPPQQNTPKSLINLLQQARYIEFDVNAQ 201

Query: 119 NPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTE 178
              +T +GFQ++L++  +QV   +L Y++T E+ G ++VE L F+F L      T+ + +
Sbjct: 202 TYQLTVNGFQYILMDIASQVQNILLNYIQTAETSGRNVVEVLGFIFNLTL----TEPNYD 257

Query: 179 GMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKL--ALNMATRGTLKQI----------- 225
           G ++ +   L+   + GLV   +R+  +F  T L  +   +  G L++            
Sbjct: 258 G-NENVNSILRDFDDMGLVGMERRQ--KFVITSLLQSFLQSQTGQLQKSSDSSSSNVIDA 314

Query: 226 ----REPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRS 281
                +  FLIVETNF+VYAYT S+L  AL+  F  +   FPNL VG LTR S++ A + 
Sbjct: 315 QRSQSQDKFLIVETNFKVYAYTSSDLYRALLRQFIRVECIFPNLVVGTLTRKSLQKAFQR 374

Query: 282 GITAAQIIGFLRLH 295
           GI++ QI+ FL  H
Sbjct: 375 GISSGQILSFLESH 388


>gi|123488965|ref|XP_001325283.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121908180|gb|EAY13060.1| hypothetical protein TVAG_212420 [Trichomonas vaginalis G3]
          Length = 460

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 105/417 (25%), Positives = 177/417 (42%), Gaps = 67/417 (16%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEA-------------TDTLTELR 48
           L  +++  + RLL++E+  P++ +  W  K+ + +   A             TDT++E  
Sbjct: 33  LPPLSQQILFRLLYIERGFPRSEVEKWSPKSKSNDLINAIKKLIKAHLVYIVTDTISENS 92

Query: 49  RNAKV--------ALLGG-----------GKPWSMSAKL--------EVD-SKARDLD-- 78
            N+ +         LLG              P++   +         E+D SK   +D  
Sbjct: 93  YNSSIIINKRVREVLLGTYVEDKKAFVRQKNPFAPDDQPDENFVPFDELDQSKFPTIDEK 152

Query: 79  FLNQYALERWECILRFMVG-----SQQTEGISADAVRTLFNEGAENPVITKDGFQFLLLE 133
            L+ ++  + E IL FM+        + + I +DA    F        +   G +FLLL 
Sbjct: 153 ILDDFSNYQLESILSFMLKLRDNIDNEAKKILSDAKLMEFGGN-----LCPKGHRFLLLS 207

Query: 134 TPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLRE 193
              Q+W  +  YL+  +    DL   L FL ++        Y    ++   +  L   + 
Sbjct: 208 PKEQIWRIVKCYLKFTK----DLHSSLRFLLKIGSMELSKGYPITSLTPTQKELLSPFKT 263

Query: 194 FGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIG 253
            GLVY        FYPTK  LN   +  + Q    G+++++TNF++ A+  S L  AL+ 
Sbjct: 264 IGLVY---IDGDYFYPTKSILNFFGKSNIFQTE--GWMLIDTNFKITAFPKSPLHTALLK 318

Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-SCPLPPVVSDQI 312
            F  + Y FP      ++ +S R AL  G T   IIGFL+ +    + S  +P  V  Q 
Sbjct: 319 KFANVTYEFPGFASAFISPNSFREALNQGTTLDDIIGFLKSNLSHKIGSGQIPSAVMKQF 378

Query: 313 RLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDL-GVLTWQNEKKRT---VVVT 365
            +W  +RERLT +   +  Q+ +  D       A+ L G +    EKK +   +V+T
Sbjct: 379 YVWRDQRERLTVTHECIMRQYTNPNDANLAAQCAKQLAGYVYGPAEKKESHYWIVIT 435


>gi|429962184|gb|ELA41728.1| hypothetical protein VICG_01232 [Vittaforma corneae ATCC 50505]
          Length = 449

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 77/283 (27%), Positives = 127/283 (44%), Gaps = 29/283 (10%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMS 181
           IT  GF+FLLL    Q+W+ I+  ++      L+  E    L ++        Y     S
Sbjct: 173 ITNKGFEFLLLARKDQLWFLIVNAIKYHSRDSLEESEMFISLAEILMKRRCGPYICSSFS 232

Query: 182 ------DKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIRE-------- 227
                 D + V     ++  LV+        +    LA +  T      I +        
Sbjct: 233 SWHSFLDSIGVLFVISQDNNLVFYVNNSV-LYDKVPLADSAMTHHGTAGIDDIYDIDYTK 291

Query: 228 PG---------FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAA 278
           P          F+++ETNF++YAYT +    +++ LF + +Y FPNL       +S+ +A
Sbjct: 292 PNIPINKASNKFIVLETNFKIYAYTSTAYDKSVLSLFSKTVYVFPNLIKACFDEESLLSA 351

Query: 279 LRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD 338
              GITA QII +L+ H     S  +P  + +QI +WE  + R+    G LY  F+  +D
Sbjct: 352 FNKGITAKQIIKYLQEH-----SEEVPKNIVNQISIWEHRQHRIRARNGYLYHDFIHLSD 406

Query: 339 FEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVNHQK 381
           F ++  Y +  G L +++E KR +V  ++ H  +K F    QK
Sbjct: 407 FHRVLRYVESKGGLIYRDEVKRMIVGEERIHESVKNFIKEMQK 449


>gi|403221352|dbj|BAM39485.1| transcription factor IIH subunit [Theileria orientalis strain
           Shintoku]
          Length = 634

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 68/207 (32%), Positives = 107/207 (51%), Gaps = 27/207 (13%)

Query: 156 LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALN 215
           + E +  L  L  S  G  +STE ++      ++ L E G+VY +   A RFY   L+  
Sbjct: 338 VTESVELLLALSQSKCGDCFSTENLTKTQMRLVRLLNELGIVYYKN--ANRFYIYDLSYI 395

Query: 216 MATRG----------TLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
           +                K       +IV++NF+VY YT S L+++++   CEL  R PNL
Sbjct: 396 IGKSNINVDLFKEFDVNKTAGNESRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNL 455

Query: 266 TVGVLTRDSVRAALRSGITAAQIIGFL---RLHALPSVSC----------PLPPVVSDQI 312
            VGVLTR+SV++A +SGIT+ +II FL   +L    + +C           +P  V  Q+
Sbjct: 456 VVGVLTRESVQSAFKSGITSNEIIRFLSPMKLSLSYAGTCNSFMDSFANYKIPENVCRQL 515

Query: 313 RLWEGERERLTTSEGVLYSQFVSQADF 339
           ++WE ER+R+  S  +L+ ++   ADF
Sbjct: 516 KMWESERDRIELSPAILFKRW--DADF 540


>gi|402465832|gb|EJW01474.1| hypothetical protein EDEG_03937 [Edhazardia aedis USNM 41457]
          Length = 465

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 153/340 (45%), Gaps = 35/340 (10%)

Query: 58  GGKPWSMSAKLEVDSKARDLDFLNQY--------ALERWECILRFMVGSQQT----EGIS 105
           G + ++MS    +++  + +   N Y        + +++E IL  +V  Q+T    E + 
Sbjct: 138 GTENYNMSPVKSIENIKKTVTIFNDYESKNTSNKSFQKYEEILHLIVDQQETPKNEEILK 197

Query: 106 ADAVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFIL---KYLET---VESKGLDLVEC 159
                 + N+  E   IT  GF FLL     Q+W  ++   KYL++   +E K   L+  
Sbjct: 198 LLTFSRILNKNLE---ITNTGFDFLLKTRKEQLWILLITTIKYLKSTMKIEEKDFILL-- 252

Query: 160 LTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQR-KRKAGRFYPTKLALNMAT 218
              +F+L        Y        +Q  + ++ + GL+    K        + L +N+  
Sbjct: 253 ---IFELCEKRPYKSYKIINCPSSIQTCINYMSKLGLLENSIKNNTVILSISPLFINLFE 309

Query: 219 RGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAA 278
                 +    F+ +ETNF++YAYT S    +++ LF ++  + PNL   ++  DSV  A
Sbjct: 310 ENL---VLSESFMYIETNFKLYAYTTSKYDFSILSLFSKISCKLPNLISAIINEDSVNTA 366

Query: 279 LRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD 338
               I+A QI  +L+     S    +P  V +Q+ +WE +R R+ TSE  L+  F++  D
Sbjct: 367 FDKKISAKQISYYLK-----SKGKNVPKNVVEQVYIWESKRNRIKTSECTLFKGFLNLMD 421

Query: 339 FEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKFWVN 378
           F+K  D  ++   L    E KR + V      + K++  N
Sbjct: 422 FKKAVDVCKEKHWLVDVYEDKRFIFVNNLYAEEFKEYIKN 461


>gi|84998132|ref|XP_953787.1| TFIIH subunit, Tf2b  [Theileria annulata]
 gi|65304784|emb|CAI73109.1| TFIIH subunit, Tf2b homologue, putative [Theileria annulata]
          Length = 645

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 66/213 (30%), Positives = 110/213 (51%), Gaps = 27/213 (12%)

Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
           LE  E+    + E +  L  L  ++ G  +STEG++      L+ L E G+VY +  K  
Sbjct: 356 LELKEASMKTVTESVELLLSLSQASCGDCFSTEGLTKTQIRLLRLLNELGIVYYKNPK-- 413

Query: 206 RFYPTKLAL---NMATRGTLKQIREPGF---------LIVETNFRVYAYTDSNLKVALIG 253
           +FY   L+       T   L   ++            +IV++NF+VY YT S L+++++ 
Sbjct: 414 KFYLYDLSYIVGKTNTNSVLPNSKDLDISIKAGNDSRIIVQSNFKVYVYTASPLQISVLS 473

Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL-------------PSV 300
             CEL  R PNL VGVLTR+SV++A +SGIT+ +II FL  + +              S 
Sbjct: 474 HLCELQARTPNLVVGVLTRESVQSAFKSGITSKEIIRFLSPNGMNSSIGSQENTLLNSSF 533

Query: 301 SCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           +  +P  V  Q+++WE ER+R+     +++ ++
Sbjct: 534 TYSIPENVCRQLKMWESERDRIELCPSIVFKRW 566


>gi|359497087|ref|XP_002267711.2| PREDICTED: general transcription factor IIH subunit 4-like [Vitis
           vinifera]
          Length = 238

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 73/238 (30%), Positives = 116/238 (48%), Gaps = 44/238 (18%)

Query: 142 ILKYLET---VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV- 197
           IL+YL+    V+ +G+D  + ++FL +L F   G  Y+   +++  +  ++ L + GLV 
Sbjct: 36  ILRYLDLLNLVQERGVDPADLISFLLELSFHVTGEAYNINTLTEFQRNTIKDLVDLGLVK 95

Query: 198 YQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCE 257
            Q+ RK   F PTKLA N++   +    R+ GF++VETNFR+YAY+ S L        CE
Sbjct: 96  LQQGRKESWFIPTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLH-------CE 148

Query: 258 LMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEG 317
           +                                 LRL +  SVS    P     IRLWE 
Sbjct: 149 I---------------------------------LRLFSRSSVSTVATPCHGYLIRLWET 175

Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
           +  R+ T    LY +F S+  FE   D+A++ G L W++ KK  +VV  + H  ++++
Sbjct: 176 DLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRLVVKAEIHLHMREY 233


>gi|296084734|emb|CBI25875.3| unnamed protein product [Vitis vinifera]
          Length = 258

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 125/252 (49%), Gaps = 28/252 (11%)

Query: 152 KGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPT 210
           +G+D  + ++FL +L F   G  Y+   +++  +  ++ L + GLV  Q+ RK   F PT
Sbjct: 2   RGVDPADLISFLLELSFHVTGEAYNINTLTEFQRNTIKDLVDLGLVKLQQGRKESWFIPT 61

Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLF---------CELMYR 261
           KLA N++   +    R+ GF++VETNFR+YAY+ S L   ++ LF         C L   
Sbjct: 62  KLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRYASMLFDCLLCIC 121

Query: 262 FPN-----LTVGVLTRDSVRAALRSGI----------TAAQIIGFLRLHALPSVSCPLPP 306
           + +     L V ++  + +R      I           A    G+L +  +  ++  +P 
Sbjct: 122 YSSQKGTLLDVSLIMYECLRTFSLGKIIFWFFSSVSTVATPCHGYLVIFTISCLAEKIPC 181

Query: 307 VV---SDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVV 363
           +      QIRLWE +  R+ T    LY +F S+  FE   D+A++ G L W++ KK  +V
Sbjct: 182 LQFTNMSQIRLWETDLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLLWEDSKKMRLV 241

Query: 364 VTKQGHSDIKKF 375
           V  + H  ++++
Sbjct: 242 VKAEIHLHMREY 253


>gi|71033737|ref|XP_766510.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68353467|gb|EAN34227.1| hypothetical protein, conserved [Theileria parva]
          Length = 359

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 65/216 (30%), Positives = 109/216 (50%), Gaps = 33/216 (15%)

Query: 146 LETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAG 205
           LE  E+    + E +  L  L  ++ G  +STEG++      L+ L E G+VY +  K  
Sbjct: 69  LELKEASFKTVTESVELLLSLSQASCGDCFSTEGLTKTQLRLLRLLNELGIVYYKNPKKF 128

Query: 206 RFY---------------PTKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVA 250
             Y               PT   L+++ +           +IV++NF+VY YT S L+++
Sbjct: 129 YLYDLSYIVGKTNTNTGLPTSKDLDLSIKAG-----NDSRIIVQSNFKVYVYTASPLQIS 183

Query: 251 LIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPS----------- 299
           ++   CEL  R PNL VGVLTR+SV++A +SGIT+ +II FL  + + S           
Sbjct: 184 VLSHLCELQARTPNLVVGVLTRESVQSAFKSGITSKEIIRFLSPNGMNSSIGNQENTLLN 243

Query: 300 --VSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
              +  +P  V  Q+++WE ER+R+     +++ ++
Sbjct: 244 STFTYSIPENVCRQLKMWESERDRIELCPSIVFKRW 279


>gi|387592902|gb|EIJ87926.1| hypothetical protein NEQG_01998 [Nematocida parisii ERTm3]
          Length = 398

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 79/258 (30%), Positives = 126/258 (48%), Gaps = 24/258 (9%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYL----ETVESKGLDLVECLTFLFQLKFSTFGTDYST 177
           +T  GF FLL     Q+W  +L ++    ET E++ L L E    L +    T+  D S 
Sbjct: 155 LTHKGFNFLLTGKKRQLWTLLLAHIQEDPETAENEILVLCE---LLVKDPKRTYAVDRSQ 211

Query: 178 EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNF 237
           +  S  L +F       GL++  K    +F PT   L     G  K      FL++E+NF
Sbjct: 212 K--SKLLGLF----ESLGLIFFEK-GLVKFSPTFSLLFDDEEGAEK------FLVLESNF 258

Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
           R+Y Y++  L   +I LF      FPN+ V ++  DS+R AL  GITA QI  +L  +++
Sbjct: 259 RLYIYSNRPLDTFIISLFSIKSREFPNMMVAMINEDSIRQALMHGITAGQIRVYLNQNSM 318

Query: 298 PSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
             ++      V +QIRLWE    R+ + E  ++S F++  DF  +  Y ++  +      
Sbjct: 319 YEIN----ENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCENNNIDHRSYR 374

Query: 358 KKRTVVVTKQGHSDIKKF 375
           +KR +VV  + +  +K F
Sbjct: 375 EKRMLVVGIENYESVKSF 392


>gi|387595518|gb|EIJ93142.1| hypothetical protein NEPG_02098 [Nematocida parisii ERTm1]
          Length = 398

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 79/258 (30%), Positives = 126/258 (48%), Gaps = 24/258 (9%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYL----ETVESKGLDLVECLTFLFQLKFSTFGTDYST 177
           +T  GF FLL     Q+W  +L ++    ET E++ L L E    L +    T+  D S 
Sbjct: 155 LTHKGFNFLLTGKKRQLWTLLLAHIQEDPETAENEILVLCE---LLVKDPKRTYAVDRSQ 211

Query: 178 EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETNF 237
           +  S  L +F       GL++  K    +F PT   L     G  K      FL++E+NF
Sbjct: 212 K--SKLLGLF----ESLGLIFFEK-GLVKFSPTFSLLFDDEEGAEK------FLVLESNF 258

Query: 238 RVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHAL 297
           R+Y Y++  L   +I LF      FPN+ V ++  DS+R AL  GITA QI  +L  +++
Sbjct: 259 RLYIYSNRPLDTFIISLFSIKSREFPNMIVAMINEDSIRQALMHGITAGQIRVYLNQNSM 318

Query: 298 PSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNE 357
             ++      V +QIRLWE    R+ + E  ++S F++  DF  +  Y ++  +      
Sbjct: 319 YEIN----ENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCENNNIDHRSYR 374

Query: 358 KKRTVVVTKQGHSDIKKF 375
           +KR +VV  + +  +K F
Sbjct: 375 EKRMLVVGIENYESVKSF 392


>gi|124806778|ref|XP_001350829.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
 gi|23496958|gb|AAN36509.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
          Length = 945

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 60/156 (38%), Positives = 87/156 (55%), Gaps = 11/156 (7%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           LIV++NF+VY YT S LK+ ++   CEL  R PN+ VG+LTR SV  A  S ITA QII 
Sbjct: 766 LIVQSNFKVYLYTSSLLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 825

Query: 291 FLRLHALP---SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQ 347
           FL  ++ P   +    +P  V  Q++LWE ER RLT  + +++  F  + DF     Y Q
Sbjct: 826 FLESYSHPGKNNFKSSIPMNVITQLKLWESERHRLTLEDAIVFKSF--EKDFMP-HLYQQ 882

Query: 348 DLGVLTWQNEKKRTVVVTKQGHSDIKKF--WVNHQK 381
              ++ W N K   +  T    ++ K+F  W+  +K
Sbjct: 883 ---IVIWANSKNYLLYYTPWPKNNTKEFDLWIKAEK 915


>gi|389586531|dbj|GAB69260.1| hypothetical protein PCYB_146880 [Plasmodium cynomolgi strain B]
          Length = 913

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 50/106 (47%), Positives = 67/106 (63%), Gaps = 3/106 (2%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           LIV++NF+VY YT+S LK+ ++   CEL  R PN+ VG+LTR SV  A  S ITA QII 
Sbjct: 733 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 792

Query: 291 FLRLHALPSVS---CPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           FL  +A P  S     +P  V  Q++LWE ER RLT  + +++  F
Sbjct: 793 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 838


>gi|221061889|ref|XP_002262514.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
 gi|193811664|emb|CAQ42392.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
          Length = 943

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 50/106 (47%), Positives = 67/106 (63%), Gaps = 3/106 (2%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           LIV++NF+VY YT+S LK+ ++   CEL  R PN+ VG+LTR SV  A  S ITA QII 
Sbjct: 763 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 822

Query: 291 FLRLHALPSVS---CPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           FL  +A P  S     +P  V  Q++LWE ER RLT  + +++  F
Sbjct: 823 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 868


>gi|412992644|emb|CCO18624.1| predicted protein [Bathycoccus prasinos]
          Length = 464

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 77/276 (27%), Positives = 117/276 (42%), Gaps = 62/276 (22%)

Query: 76  DLDFLNQYALERWECILRFMVGSQQTE---------------GISADA---VRTLFNEG- 116
           D  FL+ YA  RWE +L  +      E               G  A     +R LF    
Sbjct: 186 DAGFLDHYAKSRWETVLLELTSEASVEKKRKKKKMMKKKGDGGAGAQHALHLRKLFYAAR 245

Query: 117 --AENPVITKDGFQFLLLETPAQVWYFILKYL---------------------ETVESKG 153
              ++  IT +GF FLL  T  Q+W  + +Y                      + +E+ G
Sbjct: 246 LITKDGRITSEGFSFLLGATSEQIWILLARYARGGDFEKKMKNKTTNEENGEGDKMETNG 305

Query: 154 LD------------LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK 201
            D                + FL +L F   G  YS   +S+  +    HL   G++Y+ +
Sbjct: 306 EDEKQQHKLKSDESSAAAMAFLVRLSFQHPGRKYSKANLSEAERRVASHLSALGVLYENE 365

Query: 202 RKAGR---FYPTKLALNMATRGTLKQIREP-----GFLIVETNFRVYAYTDSNLKVALIG 253
                   + PT L+  +++  T    +       G +IVETNFRVYAYT S L+  ++ 
Sbjct: 366 DDENDNNWYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLR 425

Query: 254 LFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQII 289
           LF    Y+ PN  VG++TRDS+  A+R+GI+  QI+
Sbjct: 426 LFTRPDYKLPNAYVGMITRDSILDAMRAGISPDQIV 461


>gi|156095867|ref|XP_001613968.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148802842|gb|EDL44241.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 960

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 50/106 (47%), Positives = 67/106 (63%), Gaps = 3/106 (2%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           LIV++NF+VY YT+S LK+ ++   CEL  R PN+ VG+LTR SV  A  S ITA QII 
Sbjct: 780 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 839

Query: 291 FLRLHALPSVS---CPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           FL  +A P  S     +P  V  Q++LWE ER RLT  + +++  F
Sbjct: 840 FLESYAHPGRSTFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 885


>gi|241725008|ref|XP_002412209.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
 gi|215505416|gb|EEC14910.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
          Length = 75

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Composition-based stats.
 Identities = 42/72 (58%), Positives = 55/72 (76%)

Query: 310 DQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGH 369
           DQ+RLWE ER+R    EGVLYSQF+SQ+DF+ LR+YA DLGVL W N  KR +VV + GH
Sbjct: 2   DQLRLWELERDRFNFREGVLYSQFISQSDFQLLRNYASDLGVLIWDNPSKRVMVVNRNGH 61

Query: 370 SDIKKFWVNHQK 381
            ++K+FW  H++
Sbjct: 62  DEVKRFWKRHRQ 73


>gi|378754852|gb|EHY64880.1| hypothetical protein NERG_01936 [Nematocida sp. 1 ERTm2]
          Length = 396

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 80/272 (29%), Positives = 130/272 (47%), Gaps = 27/272 (9%)

Query: 110 RTLFNEGAENPVITKDG-----FQFLLLETPAQVWYFILKYL-ETVESKGLDLVECLTFL 163
           R + N G    +++KDG     F FLL     Q+W  +L ++ E + ++  +++    FL
Sbjct: 140 RLMLNTG----IVSKDGLTHKGFNFLLTGRKNQMWTLVLAHIQEDLATRQEEVLVMCEFL 195

Query: 164 FQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLK 223
            +     +  D S    S  L +F       GL+   +R   RF  T   L     G  K
Sbjct: 196 VKDPKRMYAIDVSYR--SKMLDLF----ESLGLI-TFERGLVRFSSTFSLLFDDEEGGEK 248

Query: 224 QIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGI 283
                 FL +E+NFR+Y Y++  L V +I LF   +  FPN+ V ++  DS+R AL  GI
Sbjct: 249 ------FLTLESNFRLYIYSNRPLDVFIISLFSIKIREFPNMIVAMINEDSIRQALTYGI 302

Query: 284 TAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLR 343
           TA QI  +L  +++  ++      V +QIRLWE    R+   E  ++S F++  DF  + 
Sbjct: 303 TAGQIRVYLNQNSMHKIN----ENVLEQIRLWEKRMNRIHAWESYIFSNFLNYKDFLLVE 358

Query: 344 DYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            Y ++  V       KR +VV  + + ++K F
Sbjct: 359 SYCENNNVEHRSYRDKRVLVVGVENYDNVKSF 390


>gi|428673252|gb|EKX74165.1| conserved hypothetical protein [Babesia equi]
          Length = 668

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 70/223 (31%), Positives = 111/223 (49%), Gaps = 28/223 (12%)

Query: 172 GTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY--------------PTKLALNMA 217
           G  +ST+ ++      L+ L E G+VY +  K G FY              P+   L++ 
Sbjct: 422 GDSFSTKSLTKSQFRILRLLYELGIVYYKSIK-GPFYVLDLSFIVGPKNLVPSNSPLSVH 480

Query: 218 TRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRA 277
           T  +      P  +IV++NF+VY YT +NL+  ++ + CE+  R PN+ VGVLTR+S + 
Sbjct: 481 TSISFTSEYLPSKIIVQSNFKVYVYTVNNLQFDILNILCEVQARTPNMVVGVLTRESAQR 540

Query: 278 ALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
           A +SGIT+ +II F       + +   P  V  Q+R+WE ER R+  S  +L    + + 
Sbjct: 541 AFKSGITSHEIIRFFS----STNTSTFPENVIRQLRMWEAERNRVELSPAIL----IKRW 592

Query: 338 DFEKLRDYAQDLGVLTWQNEKKRTVVVTK---QGHSDIKKFWV 377
           D E L D  Q    + W   K+  +  TK     HS+  + W+
Sbjct: 593 DREFLPDLFQ--RTVRWAQSKRYELFHTKWPQDPHSEEYQEWL 633


>gi|221482097|gb|EEE20458.1| tfiih, polypeptide, putative [Toxoplasma gondii GT1]
          Length = 861

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 54/114 (47%), Positives = 74/114 (64%), Gaps = 8/114 (7%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           ++V++NF+VY YT S L+++++   CEL  R PNL VG+LTR SV AA +SGITA QII 
Sbjct: 675 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 734

Query: 291 FLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
           FL  HA P V       + P LP  V+ Q+R+WE ER RL+    VL  ++ +Q
Sbjct: 735 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPAVLLKKWDAQ 788


>gi|401410644|ref|XP_003884770.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
 gi|325119188|emb|CBZ54742.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
          Length = 868

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 54/114 (47%), Positives = 74/114 (64%), Gaps = 8/114 (7%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           ++V++NF+VY YT S L+++++   CEL  R PNL VG+LTR SV AA +SGITA QII 
Sbjct: 682 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 741

Query: 291 FLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
           FL  HA P V       + P LP  V+ Q+R+WE ER RL+    VL  ++ +Q
Sbjct: 742 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPAVLLKKWDAQ 795


>gi|429854513|gb|ELA29524.1| tfiih and nucleotide excision repair factor 3 complexes subunit
           [Colletotrichum gloeosporioides Nara gc5]
          Length = 509

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 59/155 (38%), Positives = 85/155 (54%), Gaps = 33/155 (21%)

Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
           G +IVETN+RVYAY  + L++A++ LFC+L  RF ++  G LTR+S+R A+  GITA QI
Sbjct: 371 GSIIVETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITADQI 430

Query: 289 IGFLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFE 340
           I +L  HA   +       + P LPP V DQIRL                       ++E
Sbjct: 431 ISYLASHAHEQMHRFAAVRNKPVLPPTVIDQIRL-----------------------EYE 467

Query: 341 KLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            +  YA ++GVLTW+N+K      +K  H  I+ +
Sbjct: 468 DIAGYAAEIGVLTWRNDKLGMFFASK--HEQIRDY 500


>gi|221502519|gb|EEE28246.1| tfiih, polypeptide, putative [Toxoplasma gondii VEG]
          Length = 836

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 54/114 (47%), Positives = 74/114 (64%), Gaps = 8/114 (7%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           ++V++NF+VY YT S L+++++   CEL  R PNL VG+LTR SV AA +SGITA QII 
Sbjct: 650 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 709

Query: 291 FLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
           FL  HA P V       + P LP  V+ Q+R+WE ER RL+    VL  ++ +Q
Sbjct: 710 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPAVLLKKWDAQ 763


>gi|237842971|ref|XP_002370783.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
 gi|211968447|gb|EEB03643.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
          Length = 836

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 54/114 (47%), Positives = 74/114 (64%), Gaps = 8/114 (7%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           ++V++NF+VY YT S L+++++   CEL  R PNL VG+LTR SV AA +SGITA QII 
Sbjct: 650 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 709

Query: 291 FLRLHALPSV-------SCP-LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQ 336
           FL  HA P V       + P LP  V+ Q+R+WE ER RL+    VL  ++ +Q
Sbjct: 710 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPAVLLKKWDAQ 763


>gi|149031821|gb|EDL86756.1| rCG41767 [Rattus norvegicus]
          Length = 187

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 55/138 (39%), Positives = 81/138 (58%), Gaps = 23/138 (16%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTEL------------- 47
           EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ L             
Sbjct: 48  ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQ 107

Query: 48  --------RRNAKVALLGGGKPWS-MSAKLEVDSKARDLDFLNQYALERWECILRFMVGS 98
                   R+N ++ALLGGGK WS  +++L  D  ARD+  L++YA ERWE +L FMVGS
Sbjct: 108 GLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGS 167

Query: 99  QQTEGISADAVRTLFNEG 116
             +  +S D  + L   G
Sbjct: 168 -PSAAVSQDLAQLLSQAG 184


>gi|82704497|ref|XP_726580.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23482045|gb|EAA18145.1| Drosophila melanogaster LD30622p-related [Plasmodium yoelii yoelii]
          Length = 926

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 47/106 (44%), Positives = 66/106 (62%), Gaps = 3/106 (2%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           LI+++NF+VY YT S LK+ ++   CEL  R PN+ VG+LTR SV  A  S ITA QII 
Sbjct: 746 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 805

Query: 291 FLRLHALP---SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           FL  ++ P    +   +P  V  Q++LWE ER RLT  + +++  F
Sbjct: 806 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 851


>gi|70950974|ref|XP_744765.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56524852|emb|CAH80923.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 922

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 47/106 (44%), Positives = 66/106 (62%), Gaps = 3/106 (2%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           LI+++NF+VY YT S LK+ ++   CEL  R PN+ VG+LTR SV  A  S ITA QII 
Sbjct: 742 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 801

Query: 291 FLRLHALP---SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           FL  ++ P    +   +P  V  Q++LWE ER RLT  + +++  F
Sbjct: 802 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 847


>gi|68070707|ref|XP_677265.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56497310|emb|CAH94094.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 929

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 46/106 (43%), Positives = 65/106 (61%), Gaps = 3/106 (2%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           LI+++NF+VY YT S LK+ ++   CEL  R PN+ VG+LTR SV  A  S ITA QII 
Sbjct: 749 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 808

Query: 291 FLRLHALP---SVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQF 333
           FL  ++ P    +   +P  V  Q++LWE ER RL   + +++  F
Sbjct: 809 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLILEDSIVFKNF 854


>gi|296084771|emb|CBI25914.3| unnamed protein product [Vitis vinifera]
          Length = 225

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 57/200 (28%), Positives = 97/200 (48%), Gaps = 29/200 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK Y++++L+++ PV    +  W+      +H+ A D L +LR             
Sbjct: 9   LPPLAKKYILQMLYIDVPVTAKAMEEWVLADGFSKHRVAIDRLIQLRVFTETSDRKKETS 68

Query: 49  --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                    N +  L+ GG          +  +   LD L  YAL +WEC L  ++ S Q
Sbjct: 69  YRLNPTFQTNLQKHLIYGGVLPREPMPSNITVRLPSLDDLEAYALGQWECFLLQLISSTQ 128

Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           TE ++  + ++  +F  G       E P +T+ GFQFLL++T AQ+WY + +Y+   E +
Sbjct: 129 TEKLTNFSSSMMKVFQRGLLTQREKEAPRLTESGFQFLLMDTNAQLWYIMREYISNSEER 188

Query: 153 GLDLVECLTFLFQLKFSTFG 172
           G+D  + ++FL +L F   G
Sbjct: 189 GVDPADLISFLLELSFHVTG 208


>gi|359497414|ref|XP_003635508.1| PREDICTED: general transcription factor IIH subunit 4-like, partial
           [Vitis vinifera]
          Length = 209

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 57/200 (28%), Positives = 97/200 (48%), Gaps = 29/200 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK Y++++L+++ PV    +  W+      +H+ A D L +LR             
Sbjct: 9   LPPLAKKYILQMLYIDVPVTAKAMEEWVLADGFSKHRVAIDRLIQLRVFTETSDRKKETS 68

Query: 49  --------RNAKVALLGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQ 100
                    N +  L+ GG          +  +   LD L  YAL +WEC L  ++ S Q
Sbjct: 69  YRLNPTFQTNLQKHLIYGGVLPREPMPSNITVRLPSLDDLEAYALGQWECFLLQLISSTQ 128

Query: 101 TEGIS--ADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFILKYLETVESK 152
           TE ++  + ++  +F  G       E P +T+ GFQFLL++T AQ+WY + +Y+   E +
Sbjct: 129 TEKLTNFSSSMMKVFQRGLLTQREKEAPRLTESGFQFLLMDTNAQLWYIMREYISNSEER 188

Query: 153 GLDLVECLTFLFQLKFSTFG 172
           G+D  + ++FL +L F   G
Sbjct: 189 GVDPADLISFLLELSFHVTG 208


>gi|399219100|emb|CCF75987.1| unnamed protein product [Babesia microti strain RI]
          Length = 612

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 47/106 (44%), Positives = 64/106 (60%), Gaps = 7/106 (6%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           +IV++NF+VY YT S L++ ++   CEL  R PNL +GVLTR S  AA R+GITA QI  
Sbjct: 432 IIVQSNFKVYVYTASPLQINVLAHLCELQSRTPNLVIGVLTRASAHAAFRAGITAKQICQ 491

Query: 291 FLRLHALPSV-------SCPLPPVVSDQIRLWEGERERLTTSEGVL 329
           FL  H+ P +          LP  V  Q+ +WE ER R++  + VL
Sbjct: 492 FLETHSHPILLQNVREGGSYLPNNVVTQLNMWEAERNRISLEQCVL 537


>gi|258577869|ref|XP_002543116.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237903382|gb|EEP77783.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 311

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 75/252 (29%), Positives = 118/252 (46%), Gaps = 35/252 (13%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------- 48
           L  +AK +V+ LL+++ P+P A +  W+     +E   A   L+ L              
Sbjct: 38  LPHLAKCFVMALLYLKDPLPAAELELWVKTESKRERDNALSILSRLHILSSTTTANHVRA 97

Query: 49  --------RNAKVALLGGGKPWSMSA-KLEVDSKARDLDFLNQYALERWECILRFMVGSQ 99
                    + + AL GG K  S        D +   +  L+ YA  +WE +L +MVG+ 
Sbjct: 98  FMVTNPFSSSLRQALTGGDKQQSFGVISTTPDPQPMTIADLDDYARRQWEGVLGYMVGTN 157

Query: 100 ----QTEGIS-ADAVRTLFNEGAENPV------ITKDGFQFLLLETPAQVWYFILKYLET 148
               Q E ++ +  V++L        V      ITK+GF F+L +   QVW+ ++ Y+E 
Sbjct: 158 TLGIQRESVTLSKGVKSLLQACHLVEVRDRRVEITKEGFAFVLQDVNTQVWHILVLYVEN 217

Query: 149 VESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRK--RKAGR 206
            E+ G+D VE L+FLF L     G  Y  + +S      L  L +FG+VYQ     +A R
Sbjct: 218 AEAIGMDSVEVLSFLFLLSSLELGQSYEKQHLSSTQLRTLADLTDFGIVYQHSPASEATR 277

Query: 207 FYPTKLALNMAT 218
           FYPT+LA  + +
Sbjct: 278 FYPTRLATTLTS 289


>gi|47194594|emb|CAF93785.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 123

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/96 (48%), Positives = 57/96 (59%), Gaps = 23/96 (23%)

Query: 304 LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYA----------------- 346
           LPP ++DQIRLWE ER+RL  +EGVLY+QF+SQ DFE LRD A                 
Sbjct: 24  LPPTITDQIRLWELERDRLQFTEGVLYNQFLSQTDFEVLRDRAQVGLFHCCYTIRQKVHQ 83

Query: 347 ------QDLGVLTWQNEKKRTVVVTKQGHSDIKKFW 376
                 Q LG L WQ+   R +VVT  GHS++KKFW
Sbjct: 84  QLSPVLQSLGCLLWQDAAHRVMVVTLWGHSEVKKFW 119


>gi|380476902|emb|CCF44454.1| RNA polymerase II transcription factor B subunit 2 [Colletotrichum
           higginsianum]
          Length = 126

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 50/119 (42%), Positives = 70/119 (58%), Gaps = 10/119 (8%)

Query: 265 LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSV-------SCP-LPPVVSDQIRLWE 316
           +  G LTR+S+R A+  GITA QII +L  HA   +       S P LPP V DQIRLW+
Sbjct: 1   MVSGRLTRNSIRNAVERGITADQIISYLAAHAHEQMHRMAAVRSRPVLPPTVVDQIRLWQ 60

Query: 317 GERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
            E ER+TT+ G L+  F S  ++E +  YA ++GVL W+N+K      +K  H  I+ +
Sbjct: 61  LETERMTTTSGFLFRDFDSPKEYEVIAGYASEIGVLVWRNDKLGMFFASK--HEQIRDY 117


>gi|337743331|gb|AEI73162.1| GTF2H4 [Kryptolebias marmoratus]
          Length = 82

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 55/81 (67%), Gaps = 11/81 (13%)

Query: 174 DYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIR------- 226
           DYS EGMS+ L  FLQHLREFGLV+QRKRK+ R+YPT+LA+ +A   T            
Sbjct: 2   DYSVEGMSESLLTFLQHLREFGLVFQRKRKSRRYYPTRLAITLAAGVTSNSSSSSNLSST 61

Query: 227 ----EPGFLIVETNFRVYAYT 243
               + GF++VETN+RVYAYT
Sbjct: 62  QGAGDAGFIVVETNYRVYAYT 82


>gi|60603004|gb|AAX27959.1| unknown [Schistosoma japonicum]
          Length = 100

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 41/72 (56%), Positives = 54/72 (75%)

Query: 229 GFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQI 288
           G++++ETNFR+YAYTDS L+ AL+ LF ++  RFPNL V  +TRDSVR AL  GITA QI
Sbjct: 14  GYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITANQI 73

Query: 289 IGFLRLHALPSV 300
           + FL  +A P +
Sbjct: 74  LSFLTSNAHPDM 85


>gi|399949686|gb|AFP65344.1| tfIIB-like protein [Chroomonas mesostigmatica CCMP1168]
          Length = 464

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 38/100 (38%), Positives = 65/100 (65%), Gaps = 3/100 (3%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           +I+E+NFR+Y Y +++L   L+ +F E++Y+ PNL VG +T  S+  A  SG T+  IIG
Sbjct: 329 IIIESNFRIYVYKNAHLGNDLLLVFSEILYQLPNLFVGEITEKSILKAFNSGSTSRNIIG 388

Query: 291 FLRLHALPSVSCP-LPPVVSDQIRLWEGERERLTTSEGVL 329
           FL+ +  P   CP +P  V++QI+ WE ++  +  +E ++
Sbjct: 389 FLKNNLHP--ICPWIPSAVTNQIKAWEFQKTEIKITESLM 426


>gi|156086630|ref|XP_001610724.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797977|gb|EDO07156.1| conserved hypothetical protein [Babesia bovis]
          Length = 586

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/184 (28%), Positives = 89/184 (48%), Gaps = 27/184 (14%)

Query: 171 FGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLA---------------LN 215
           +G  +  +  +   Q  L+ L E GLVY       + Y T L+               ++
Sbjct: 327 YGDTFRLDNPTLSQQRILRFLIELGLVYA-DSDGSKIYVTNLSFLVDVKHANLVEEHQIS 385

Query: 216 MATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSV 275
           M+  G      +   ++V++NF++YAY  S L++ ++   CEL  + PNL +GVLTR S+
Sbjct: 386 MSVCGN-----KGSKMVVQSNFKIYAYIPSALQMNVLNHICELQAKTPNLVIGVLTRSSL 440

Query: 276 RAALRSGITAAQIIGFLRLHA------LPSVSCPLPPVVSDQIRLWEGERERLTTSEGVL 329
           + A +SGITA Q+I F           +      +P  V  Q+++WE ER RL     V+
Sbjct: 441 QTAFKSGITADQLICFFESKGQYDDIQINRNVMNVPENVRRQLKMWEAERNRLELLNAVV 500

Query: 330 YSQF 333
           + ++
Sbjct: 501 FKRW 504


>gi|160331881|ref|XP_001712647.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
 gi|159766096|gb|ABW98322.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
          Length = 432

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 36/107 (33%), Positives = 62/107 (57%), Gaps = 1/107 (0%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           +I+E+N+R+Y Y  ++    L  +F E++Y  PN  VG +T  S+  AL+SGIT   I+G
Sbjct: 300 IIIESNYRIYVYQKNSNNNQLFLIFSEILYILPNFFVGEITETSISRALKSGITIQNILG 359

Query: 291 FLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQA 337
           F+R   L  V   +P  + +Q RLWE +++++      ++   V+Q 
Sbjct: 360 FIR-ENLHCVCRSIPSTILNQFRLWEFQKKKIKIENCFVFFDMVTQG 405


>gi|443919931|gb|ELU39967.1| Tfb2 domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 302

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 74/155 (47%), Gaps = 13/155 (8%)

Query: 72  SKARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTLF--------NEGAENPVIT 123
           S    ++ L++ ALE WE IL +MV +  +          L           G     IT
Sbjct: 133 SSQMSIEQLDRKALEHWETILHYMVDTNSSAQRPGGGALHLLVAGGWLEEGRGGGGHEIT 192

Query: 124 KDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDK 183
             GFQFLL    AQ+W  +L+YL   +++ +D+ E L+FLF L     G +YS + +S  
Sbjct: 193 STGFQFLLQSPRAQLWDILLQYLHMSDARRMDIAEVLSFLFMLSLMKLGQEYSCDNLSPT 252

Query: 184 LQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMAT 218
               +  L+++G+   +      FYPT+LA  + +
Sbjct: 253 QNAMMTDLKDYGINANKS-----FYPTRLATTLTS 282


>gi|167395228|ref|XP_001741280.1| TFIIH basal transcription factor complex p52 subunit [Entamoeba
           dispar SAW760]
 gi|165894177|gb|EDR22239.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Entamoeba dispar SAW760]
          Length = 489

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 83/348 (23%), Positives = 148/348 (42%), Gaps = 37/348 (10%)

Query: 45  TELRRNAKVALLGGGKPWSMSAKL-EVDSKARDL---DFLNQYALERWECILRFMVGSQQ 100
            + + N K+ L+G  +P   + K+ EV+ K   +     L +  ++ +E  +  ++    
Sbjct: 101 NDFQNNFKMNLIGSMEP---AYKIKEVNEKVSQIIKEKCLKENCIKTFEYFMNKLLQFNS 157

Query: 101 TEGISADAVRTLFNE----GAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDL 156
              IS + +R +F +      E   IT+ G+QFL  ET  Q+W  +L  +  ++ +   +
Sbjct: 158 QPNISNNPLR-IFKDLELVKEETRQITRKGYQFLFQETKTQLWVIMLSIIGLIQRR---I 213

Query: 157 VECLTFLFQLKFSTFGTDYSTEGMS-----DKLQVFLQHLREFGL-VYQRKRKAGRFYPT 210
              +  +F++ +      Y+ +        D LQ+F     + G+ VY +++      P 
Sbjct: 214 SPFINDVFEMTYLKENIIYNCDRFKKVYGPDPLQLF----NDLGIIVYYKEQNVMAITPL 269

Query: 211 KLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVL 270
              L          +++P   I E N+ VY YT+S  +V L  LF    ++  NL VG L
Sbjct: 270 MSLLRSNANIPSDLVKKPK-TITEINYSVYIYTESQFQVDLYRLFIRKNFQLANLWVGKL 328

Query: 271 TRDSVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGV-- 328
              SV  A   GIT+  +I FL+ +        LP  +  QI LW+ E  R      V  
Sbjct: 329 NHQSVTEAFAKGITSEMLINFLQPN--------LPRTIQKQIDLWKKEINRFKEEHVVRY 380

Query: 329 -LYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
             Y   + +  F  +++ +  L     + E  R + V  Q    I+ F
Sbjct: 381 KFYDDAIGRQLFIIVKNESDRLKATILEKEDIRLIFVKYQYGETIRSF 428


>gi|67483069|ref|XP_656826.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56474050|gb|EAL51440.1| hypothetical protein EHI_192400 [Entamoeba histolytica HM-1:IMSS]
 gi|449706224|gb|EMD46115.1| tfiih basal transcription factor complex p52 subunit, putative
           [Entamoeba histolytica KU27]
          Length = 484

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 83/345 (24%), Positives = 145/345 (42%), Gaps = 31/345 (8%)

Query: 45  TELRRNAKVALLGGGKP-WSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTEG 103
            + + N K+ L G  +P + +    E  SK      L +  ++ +E  +  ++       
Sbjct: 101 NDFQSNFKMNLTGSMEPAYKIKEVNEKVSKIIQEKCLKENCIKTFEYFMNKLLQFNSQPN 160

Query: 104 ISADAVRTLFNE----GAENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVEC 159
           IS + +R +F +      E   IT+ G+QFL  ET  Q+W  +L  +  ++ +   +   
Sbjct: 161 ISNNPLR-IFKDLELVKEETRQITRKGYQFLFQETKTQLWVIMLSIIGLIQRR---ISPF 216

Query: 160 LTFLFQLKFSTFGTDYSTEGMS-----DKLQVFLQHLREFGL-VYQRKRKAGRFYPTKLA 213
           +  +F++ +      Y+ +        D LQ+F     + G+ VY +++      P    
Sbjct: 217 INDVFEMTYLKEHIIYNCDHFKKVYGPDPLQLF----NDLGIIVYYKEQNVMAITPLMSL 272

Query: 214 LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRD 273
           L          +++P   I E N+ VY YT+S  +V L  LF    ++  NL VG L   
Sbjct: 273 LRSNANIPSDLVKKPK-TITEINYSVYIYTESQFQVDLYRLFIRKNFQLANLWVGKLNHQ 331

Query: 274 SVRAALRSGITAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGV---LY 330
           SV  A   GIT+  +I FL+    P+    LP  +  QI LW+ E  R      V    Y
Sbjct: 332 SVTEAFAKGITSEMLINFLQ----PN----LPRTIQKQIDLWKKEINRFKEEHVVRYRFY 383

Query: 331 SQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQGHSDIKKF 375
              + +  F  +++ +  L     + E  R + V  Q    I+ F
Sbjct: 384 DDAIGRQLFVIVKNESDRLKATILEKEDIRLIFVKYQYGETIRSF 428


>gi|66358470|ref|XP_626413.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
           II]
 gi|46227864|gb|EAK88784.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
           II]
          Length = 709

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 51/193 (26%), Positives = 92/193 (47%), Gaps = 27/193 (13%)

Query: 210 TKLALNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV 269
           + + +N     ++   R    +IV++NFR+Y YT S L+  ++   C++  R PN+  G+
Sbjct: 507 SNVKINEPNCLSMNSTRLEAGIIVQSNFRIYCYTASPLQAKILRHLCQVKVRGPNIICGI 566

Query: 270 LTRDSVRAALRSGITAAQIIGFLRLHALPSV--------SCPLPPVVSDQIRLWEGERER 321
           LTR  + +A   G++A QI+ F   +A P +        +  +P  V  Q++LWE ++ R
Sbjct: 567 LTRKGLLSAYSMGVSAEQILRFFSSNAHPIILRRFMLEGTSIIPVNVETQLKLWEKDKNR 626

Query: 322 LTTSEGVLYSQF-VSQADFEKLRD---YAQDLGVLTWQN---------------EKKRTV 362
           L  S    +S +  S  D +  R    YA+   +L + +               +KK  +
Sbjct: 627 LKISHASTFSDWGASPNDIQLFRQTILYARSKDILLYNSPIELTEKELNLNVELQKKIIL 686

Query: 363 VVTKQGHSDIKKF 375
           V+ ++   DIK F
Sbjct: 687 VIKQEYEDDIKTF 699


>gi|207340600|gb|EDZ68899.1| YPL122Cp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 226

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/189 (28%), Positives = 91/189 (48%), Gaps = 27/189 (14%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------T 45
           L  +AK +++ ++F E  VP   +  W++     + + A  ++                 
Sbjct: 39  LPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSLHLLIPNKSSGTLMI 98

Query: 46  ELRRNAKVAL---LGGGKPWSMSAKLEVDSKARDLDFLNQYALERWECILRFMVGSQQTE 102
            L    K++L   L GG+  + S  + V+     LD L++Y+  +WE IL FMVG+   +
Sbjct: 99  NLNPTFKISLRNALTGGEVQN-SFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAK 157

Query: 103 GISADAVRTLFNEGAENPV-------ITKDGFQFLLLETPAQVWYFILKYLETVESKGLD 155
             S   +  L +      V       IT +GFQFLL E  +Q+W  +L+YL+ +E+  +D
Sbjct: 158 IPSEKVLNLLKHSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMD 217

Query: 156 LVECLTFLF 164
           LV+ L F+F
Sbjct: 218 LVDVLHFIF 226


>gi|330038479|ref|XP_003239608.1| TATA binding protein of transcription factor IIB-like protein
           [Cryptomonas paramecium]
 gi|327206532|gb|AEA38710.1| TATA binding protein of transcription factor IIB-like protein
           [Cryptomonas paramecium]
          Length = 467

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 61/101 (60%), Gaps = 5/101 (4%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           +I+E+N+RVYAY ++     ++  FC+L+Y  P L VG + + S+  A+  GI A  II 
Sbjct: 319 MIIESNYRVYAYNNNTFLNKILIQFCDLIYNLPGLFVGEINKTSIHKAIHKGINAKNIIS 378

Query: 291 FL--RLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVL 329
           F+    H +   SC   P++ +QIR+WE ++ + +T + V+
Sbjct: 379 FILKNSHYIHQNSC--NPII-NQIRIWEFQQRKTSTHDCVM 416


>gi|269861689|ref|XP_002650538.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
 gi|220065990|gb|EED43520.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
          Length = 291

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 56/257 (21%), Positives = 108/257 (42%), Gaps = 41/257 (15%)

Query: 119 NPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTE 178
           N  IT  G +FLL+    Q+WY I++                     ++ S+        
Sbjct: 69  NHEITTKGLEFLLMNKNEQIWYLIVE--------------------DIRLSS-------- 100

Query: 179 GMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPG------FLI 232
            +  KL  + Q L    ++   K+         +  N+  +   K I          F+ 
Sbjct: 101 HLHKKLYSYAQVLNGHKII--DKKILNFLVEINVIDNIVDQNLYKYILYKNNTEIQKFIY 158

Query: 233 VETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFL 292
           +ETNF++Y+YT++    A++ LF  +++  P +   +L    +   L  GI   QI+ ++
Sbjct: 159 LETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTLDRGIKIQQIVEYI 218

Query: 293 RLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVL 352
           R H + +    L       I +W+ +R R+    G LYS F +  ++  + +  +    L
Sbjct: 219 RRHTINNCDQILH-----MIEIWDKQRNRINKQIGYLYSDFTNYNEYRSVLEQIKTDTDL 273

Query: 353 TWQNEKKRTVVVTKQGH 369
            ++NE++R + V  + H
Sbjct: 274 LYKNEEERILFVKNKLH 290


>gi|380492949|emb|CCF34234.1| RNA polymerase II transcription factor B subunit 2 [Colletotrichum
           higginsianum]
          Length = 343

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 66/262 (25%), Positives = 116/262 (44%), Gaps = 47/262 (17%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTL----------------- 44
           +  +AK  V+ +L++ +P+    +  W+     ++  +A  TL                 
Sbjct: 41  IPPLAKTIVMAILYMPKPMLLEDLDVWVKPESRRQKDQAISTLRSLHILQITVPSKERPQ 100

Query: 45  -----TELRRNAKVALLGGGK------PWSMSAKLEVDSKARDLDFLNQYALERWECILR 93
                T  + + ++AL GGG       P S+    E+      + FL++YA  +WE IL 
Sbjct: 101 EMQLTTNFKSSLRLALEGGGTHNSFGVPSSLPVPPEIT-----VPFLDRYARRKWEDILH 155

Query: 94  FMVGSQQTEGI----SADAVRTLFNEG------AENPVITKDGFQFLLLETPAQVWYFIL 143
           ++V +    G        +V+ L   G           IT+ GF FLL E  AQVW  +L
Sbjct: 156 YIVNTVNPGGADLGGPKSSVKNLLVAGQLVRRQGSAVGITQAGFTFLLQEANAQVWTLLL 215

Query: 144 KYLETVE----SKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQ 199
            +LE  +    + G++  + L+FLF L     G  Y T  +++  +  L  L +FGL+Y 
Sbjct: 216 LWLEATDHAEDAAGMESTDMLSFLFLLASLELGRAYDTNALTEARRNMLPSLLDFGLIYI 275

Query: 200 RKRKAGRFYPTKLALNMATRGT 221
              K  +++PT+LA  + +  +
Sbjct: 276 PSHKPQQYFPTRLATTLTSSSS 297


>gi|269860316|ref|XP_002649880.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
 gi|220066720|gb|EED44193.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
          Length = 369

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 56/258 (21%), Positives = 108/258 (41%), Gaps = 41/258 (15%)

Query: 118 ENPVITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYST 177
            N  IT  G +FLL+    Q+WY I++                     ++ S+       
Sbjct: 146 NNHEITTKGLEFLLMNKNEQIWYLIVE--------------------DIRLSS------- 178

Query: 178 EGMSDKLQVFLQHLREFGLVYQRKRKAGRFYPTKLALNMATRGTLKQIREPG------FL 231
             +  KL  + Q L    ++   K+         +  N+  +   K I          F+
Sbjct: 179 -HLHKKLYSYAQVLNGHKII--DKKILNFLVEINVIDNIVDQNLYKYILYKNNTEIQKFI 235

Query: 232 IVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGF 291
            +ETNF++Y+YT++    A++ LF  +++  P +   +L    +   L  GI   QI+ +
Sbjct: 236 YLETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTLDRGIKIQQIVEY 295

Query: 292 LRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
           +R H + +    L       I +W+ +R R+    G LYS F +  ++  + +  +    
Sbjct: 296 IRRHTINNCDQILH-----MIEIWDKQRNRINKQIGYLYSDFTNYNEYRSVLEQIKTDTD 350

Query: 352 LTWQNEKKRTVVVTKQGH 369
           L ++NE++R + V  + H
Sbjct: 351 LLYKNEEERILFVKNKLH 368


>gi|238570605|ref|XP_002386883.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
 gi|215440101|gb|EEB87813.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
          Length = 69

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 35/66 (53%), Positives = 48/66 (72%), Gaps = 5/66 (7%)

Query: 227 EPGFLIVETNFRVYAYTDS-----NLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRS 281
           + GF+++ETN+RVYAYT S      L+ A++ LF  L YRFPNL VG +TR+SV+ AL +
Sbjct: 4   QQGFIVLETNYRVYAYTGSFSLHNPLQTAILHLFVSLKYRFPNLVVGSITRESVKKALSN 63

Query: 282 GITAAQ 287
           GI+A Q
Sbjct: 64  GISADQ 69


>gi|209880858|ref|XP_002141868.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209557474|gb|EEA07519.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 794

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 38/109 (34%), Positives = 61/109 (55%), Gaps = 8/109 (7%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           +IV++NFRVY YT S L+  ++   C++  R PN+  GVLTR  + +A   G+ A QI+ 
Sbjct: 606 IIVQSNFRVYCYTASPLQAKILRHLCQVKVRGPNIICGVLTRRGLLSAYSMGVKAYQILR 665

Query: 291 FLRLHALPSV--------SCPLPPVVSDQIRLWEGERERLTTSEGVLYS 331
           F   +A P +        +  +P  V  Q++LWE +  RL  ++  L+S
Sbjct: 666 FFISNAHPIILKKHITDGTSIIPISVETQLKLWENDHNRLEINKVSLFS 714


>gi|440295536|gb|ELP88449.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Entamoeba invadens IP1]
          Length = 531

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 115/264 (43%), Gaps = 23/264 (8%)

Query: 122 ITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKF----STFGTDYST 177
           IT+ G+QFL  ET  Q+W  +L      + K   + + +   F+L +    + +  D  T
Sbjct: 181 ITRKGYQFLFQETKTQIWILMLSIFGPAQRKR-RMGKLINEFFELTYLAPETCYNADPFT 239

Query: 178 EGMSDKLQVFLQHLREFGLV-YQRKRKAGRFYPTKLALNMATRGTLKQIREPGFLIVETN 236
           +  S    +    +   G++ Y +++ A    P    L          + +P   I ETN
Sbjct: 240 KIYSAAPLILFDMM---GIIAYSKEKNALVVTPLMSLLRSNALVPSDLVTKPR-TITETN 295

Query: 237 FRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGFLRLHA 296
           + VY YT+S  +V L  LF     +  NL VG +T D+V  A   GIT   ++ FL+ + 
Sbjct: 296 YTVYIYTESFFQVKLYSLFIRQNLQLTNLCVGRITYDTVTEAFLKGITNEMLVNFLQPN- 354

Query: 297 LPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEK-----LRDYAQDLGV 351
                  LP  +  QI LW+ E  RL     V +  +  + + +K      +  A+ +  
Sbjct: 355 -------LPKNIQAQIDLWKRELNRLKEVRAVKFRFYEPELEVQKELYHLTKSEAEKMKG 407

Query: 352 LTWQNEKKRTVVVTKQGHSDIKKF 375
           + +  E++ T+ V       IK+F
Sbjct: 408 VVFYKEEELTLFVRYDVAEKIKEF 431


>gi|294950275|ref|XP_002786548.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
 gi|239900840|gb|EER18344.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
          Length = 580

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 105/470 (22%), Positives = 170/470 (36%), Gaps = 150/470 (31%)

Query: 2   LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGK- 60
           L+ I K ++ RL+ V +P+ Q VI  W+S  H  +H +    L++L +   + +   G+ 
Sbjct: 67  LNGIGKVFINRLILVHRPIQQGVIRHWISSLHDSDHSKYRAALSQLTKLHILLMDPKGQH 126

Query: 61  --------------------------PWSMSAKLEVDSKARD-------------LDFLN 81
                                     PW +  + +VD    D             ++ L+
Sbjct: 127 LRMSSNFRQRLHEVFLKNTSDDHQSPPWEILEEAKVDVSKVDGVLTNGGAPRDAGMNRLD 186

Query: 82  QYALERWECILRFMVGSQQTEGISA---DAVRTLFNEGAE---------NPVITKDG--F 127
           +YA  RWE +L  MV     +G++A   DA  +L  EGA          +   T  G  F
Sbjct: 187 EYAALRWEYVLEAMV-----DGMTAAVGDA--SLRPEGARQGLSGIVGMSSTSTSTGQLF 239

Query: 128 QFLLLETPAQVWYFILKYLETVESK--------GLDLVECLT---------FLFQLKFST 170
           QF+L    +Q W  + +  + +E +         + +V  LT          L   +   
Sbjct: 240 QFVLASRTSQYWELVRRAFDMIERQHGIQSLLAPIRMVVALTRSIKHEPERLLPGTRIRV 299

Query: 171 FGTDYSTEGMSDKLQVFLQHLREFGLVY--------------QRKRKAGRFYPTKLA--- 213
                +T G    +   L+ L + G+V               QR+      Y   L    
Sbjct: 300 NPQCMATPGAKGAVTTLLETLADLGIVVPATYDDELRKVAALQRQYGESVKYELPLVSSD 359

Query: 214 ------------------LNMATRGTLKQIREPGFLIVETNFRVYAYTDSNLKVALIGLF 255
                             + M+   T     +   L V++NF V AYT S+L + L+G F
Sbjct: 360 SRMVVSSLVYVMQASRSKMMMSVDDTTAITNKSCRLFVDSNFAVTAYTTSSLDLRLLGTF 419

Query: 256 CELMYRF-------PN---LTVGVLTRDSVRAALRSGITAAQIIGFLRLHALPSVSC--- 302
            +L  +        PN     +G LT+ SV++A + G+T+  II +L+ H  P  +    
Sbjct: 420 VQLQRQLGDGREYDPNDFGCVLGTLTQSSVQSAAQRGVTSEYIISYLKSHVDPRAAHMGS 479

Query: 303 ----------------------PLPPVVSDQIRLWEGE--RERLTTSEGV 328
                                  +P  V  QI LWE E    RL    GV
Sbjct: 480 QGGRSSAATANTGAARGEKFIDGIPANVVTQITLWEREAIHNRLRIDPGV 529


>gi|197107449|pdb|3DOM|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
           C-Terminal Domain Of Tfb2
 gi|197107451|pdb|3DOM|C Chain C, Crystal Structure Of The Complex Between Tfb5 And The
           C-Terminal Domain Of Tfb2
          Length = 108

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 34/72 (47%), Positives = 50/72 (69%)

Query: 304 LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVV 363
           LPP V DQIRLW+ E +R+ T EG LYS F +  ++  L  YAQD+GVL W+++KK+   
Sbjct: 28  LPPTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFF 87

Query: 364 VTKQGHSDIKKF 375
           ++K+G+S +  F
Sbjct: 88  ISKEGNSQVLDF 99


>gi|197107383|pdb|3DGP|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
           C-Terminal Domain Of Tfb2
          Length = 80

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/70 (45%), Positives = 48/70 (68%)

Query: 306 PVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVT 365
           P V DQIRLW+ E +R+ T EG LYS F +  ++  L  YAQD+GVL W+++KK+   ++
Sbjct: 2   PTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLLWKDDKKKKFFIS 61

Query: 366 KQGHSDIKKF 375
           K+G+S +  F
Sbjct: 62  KEGNSQVLDF 71


>gi|145509120|ref|XP_001440504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407721|emb|CAK73107.1| unnamed protein product [Paramecium tetraurelia]
          Length = 400

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 84/185 (45%), Gaps = 32/185 (17%)

Query: 126 GFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGMSDKLQ 185
           GF F+L  T  Q+   +  Y+    S  +  + CL+ L  +K      +   + +     
Sbjct: 177 GFSFILSSTHKQINQILKYYVTKQSSNVVRFILCLSVLDPMKIYQMPNEEWQKNV----- 231

Query: 186 VFLQHLREFGLVY---QRKRKAGRFY-----PTKLALNMATRGTLKQIREPGFLIVETNF 237
             ++ L+EFGL +   Q+ R    F+     P  +++ +              +IVE NF
Sbjct: 232 --IKDLQEFGLTHYQDQQMRITFLFWNFLYEPPSVSIGIQCN-----------IIVEANF 278

Query: 238 RVYAYTDS------NLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIGF 291
           R+YAY +S       +   L+ LF E+  RF  L +  L+  S+R A+R  + A QII F
Sbjct: 279 RIYAYLNSGDQQEEEILCNLLNLFSEIKKRFKILIIADLSESSIRKAVRENLQAKQIIQF 338

Query: 292 LRLHA 296
           L +++
Sbjct: 339 LEMNS 343


>gi|401419872|ref|XP_003874425.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322490661|emb|CBZ25923.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 559

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/71 (49%), Positives = 45/71 (63%), Gaps = 1/71 (1%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFP-NLTVGVLTRDSVRAALRSGITAAQII 289
           +I ETNFR+YAYTD    + ++  F EL      NL    +TRDS  AA+R GITAAQ++
Sbjct: 359 IITETNFRLYAYTDDADLLNILNQFAELEEIVNGNLHCYRVTRDSFAAAMRKGITAAQVL 418

Query: 290 GFLRLHALPSV 300
            FL L A PS+
Sbjct: 419 RFLSLRAHPSM 429


>gi|76156573|gb|AAX27758.2| SJCHGC02916 protein [Schistosoma japonicum]
          Length = 141

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 54/105 (51%), Gaps = 22/105 (20%)

Query: 1   ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR------------ 48
           EL E+AKH V+RLLF+EQP+P+++++ W+ K  +    ++   LT LR            
Sbjct: 37  ELPELAKHIVMRLLFIEQPIPKSIVSGWVEKGSSALLNDSCSALTVLRIWHSTDTNVSGG 96

Query: 49  ---------RNAKVALLGGGKPWSMSAKLEVDSK-ARDLDFLNQY 83
                     + +++L GGGKP      +  + K ++ +DFL  Y
Sbjct: 97  SWSLNKKYQESIRISLFGGGKPLLGDLGIVTNDKYSKSVDFLKSY 141


>gi|146103275|ref|XP_001469523.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134073893|emb|CAM72632.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 557

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 45/71 (63%), Gaps = 1/71 (1%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFP-NLTVGVLTRDSVRAALRSGITAAQII 289
           +I ETNFR+YAYTD    + ++  F +L      NL    +TRDS  +A+R GITAAQ++
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416

Query: 290 GFLRLHALPSV 300
            FL L A PS+
Sbjct: 417 RFLSLRAHPSM 427


>gi|398024082|ref|XP_003865202.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322503439|emb|CBZ38524.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 557

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 45/71 (63%), Gaps = 1/71 (1%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFP-NLTVGVLTRDSVRAALRSGITAAQII 289
           +I ETNFR+YAYTD    + ++  F +L      NL    +TRDS  +A+R GITAAQ++
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416

Query: 290 GFLRLHALPSV 300
            FL L A PS+
Sbjct: 417 RFLSLRAHPSM 427


>gi|154345556|ref|XP_001568715.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134066057|emb|CAM43844.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 561

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 45/71 (63%), Gaps = 1/71 (1%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFP-NLTVGVLTRDSVRAALRSGITAAQII 289
           +I ETNFR+YAYT+ +  + ++  F EL      NL    +TRDS  AA+R GITA Q++
Sbjct: 362 IITETNFRLYAYTEDSDLLNILHQFAELEEVVNGNLHCYRVTRDSFAAAMRKGITATQVL 421

Query: 290 GFLRLHALPSV 300
            FL L A PS+
Sbjct: 422 RFLSLRAHPSM 432


>gi|261332554|emb|CBH15549.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 500

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 81/316 (25%), Positives = 120/316 (37%), Gaps = 63/316 (19%)

Query: 96  VGSQQTEGISAD-AVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFIL----KYLETVE 150
           +G ++ E I+A      L    +E  V+T +G  F +     Q W  +     + L  + 
Sbjct: 156 LGGRRNEPIAALLQFSRLIPPPSEERVVTSEGLAFCMQPLQQQWWTLVSVVLDRALAIMS 215

Query: 151 SKGLD---LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVF--LQHLREFGLVYQ---RKR 202
            KG+    L + L  LF L  S +   Y      D L+ F  L  L E GLVY       
Sbjct: 216 GKGVSRATLWQLLAVLFALNTSDYV--YLFPSKDDDLEAFQLLARLSEVGLVYPLICNGE 273

Query: 203 KAGRFYP--------------TKLALNMATRGTLKQIR--EPGFLIVETNFRVYAYTDSN 246
           K     P                 AL   TRG   ++R  +   +I ETNFR+YAYT + 
Sbjct: 274 KCFVLSPHFHHAICWSSTPSLCTAALLDDTRGPSSRLRREDEDTIITETNFRLYAYTKNP 333

Query: 247 LKVALIGLFCELMYRFPNLTVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV----- 300
             + ++  F         + V   +TR S  +AL  GI A  I+ FL + A PS+     
Sbjct: 334 DMLRILDQFAVKEVDVVGMVVCYRVTRASFASALAKGIGANHILQFLTVKAHPSMIKQSN 393

Query: 301 ------SCPLPPVVS--------------------DQIRLWEGERERLTTSEGVLYSQFV 334
                 SCP+ P  S                    DQ+  WE E  RL     V+  + +
Sbjct: 394 SEAGDPSCPVLPAASAGFGNTSEYRQGNIIPQSFCDQLFTWERECRRLIFRHDVVLLKNI 453

Query: 335 SQADFEKLRDYAQDLG 350
            +   E + +   + G
Sbjct: 454 GKEQMEIILNCLSNSG 469


>gi|302831319|ref|XP_002947225.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
           nagariensis]
 gi|300267632|gb|EFJ51815.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
           nagariensis]
          Length = 90

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 3/90 (3%)

Query: 282 GITAAQIIGFLRLHALPSVSCP---LPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQAD 338
           GI A +++ +L     PS++     +P VVSDQIRLWE    RL     VLY    S+  
Sbjct: 1   GIAADELVSYLAARPHPSIASRTPVVPEVVSDQIRLWEASMNRLRADSVVLYENLASREL 60

Query: 339 FEKLRDYAQDLGVLTWQNEKKRTVVVTKQG 368
           FE+   +++  G L W++  +   V    G
Sbjct: 61  FERALAFSRSSGTLLWEDSGQMRFVALDAG 90


>gi|71747422|ref|XP_822766.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70832434|gb|EAN77938.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 500

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 81/316 (25%), Positives = 120/316 (37%), Gaps = 63/316 (19%)

Query: 96  VGSQQTEGISAD-AVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFIL----KYLETVE 150
           +G ++ E I+A      L    +E  V+T +G  F +     Q W  +     + L  + 
Sbjct: 156 LGGRRNEPIAALLQFSRLIPPPSEERVVTSEGLAFCMQPLQQQWWTLVSVVLDRALAIMS 215

Query: 151 SKGLD---LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVF--LQHLREFGLVYQ---RKR 202
            KG+    L + L  LF L  S +   Y      D L+ F  L  L E GLVY       
Sbjct: 216 GKGVSRATLWQLLAVLFALNTSDYV--YLFPSKDDDLEAFQLLARLSEVGLVYPLICNGE 273

Query: 203 KAGRFYP--------------TKLALNMATRGTLKQIR--EPGFLIVETNFRVYAYTDSN 246
           K     P                 AL   TRG   ++R  +   +I ETNFR+YAYT + 
Sbjct: 274 KCFVLSPHFHHAICWSSTPSLCTAALLDDTRGPSSRLRREDEDTIITETNFRLYAYTKNP 333

Query: 247 LKVALIGLFCELMYRFPNLTVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV----- 300
             + ++  F         + V   +TR S  +AL  GI A  I+ FL + A PS+     
Sbjct: 334 DMLRILDQFAVKEVDVVGMVVCYRVTRASFASALAKGIGANHILQFLTVKAHPSMIKQSN 393

Query: 301 ------SCPLPPVVS--------------------DQIRLWEGERERLTTSEGVLYSQFV 334
                 SCP+ P  S                    DQ+  WE E  RL     V+  + +
Sbjct: 394 SEAGDPSCPVLPAASVGFGNTSEYRQGNIIPQSFCDQLFTWERECRRLIFRHDVVLLKNI 453

Query: 335 SQADFEKLRDYAQDLG 350
            +   E + +   + G
Sbjct: 454 GKEQMEIILNCLSNSG 469


>gi|146161819|ref|XP_001033510.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila]
 gi|146142827|gb|EAR85847.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila
           SB210]
          Length = 484

 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 61/134 (45%), Gaps = 22/134 (16%)

Query: 187 FLQHLREFGLVYQRKRKAGR--FYPTKLALNMATRG-TLKQIREPGFLIVETNFRVYAYT 243
           FL  L   G+V++++    R  FY T +      R   LK +     + +ETNF VYAY 
Sbjct: 277 FLNDLHSLGMVFKKEILKDRIAFYITPVLWQFCYRKIDLKTLNAK--ITIETNFNVYAYI 334

Query: 244 D-----------SNLK------VALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAA 286
           D             LK        L+  FC+L Y+FP+L VG L+    +   ++G+TA 
Sbjct: 335 DQEDPKEIKDSKDTLKDQTEHIEKLLASFCDLHYKFPHLIVGQLSEQKTKDQFKNGLTAR 394

Query: 287 QIIGFLRLHALPSV 300
            +I F    + P +
Sbjct: 395 LLIQFFNKTSDPQM 408


>gi|148691278|gb|EDL23225.1| general transcription factor II H, polypeptide 4 [Mus musculus]
          Length = 116

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 23/48 (47%), Positives = 36/48 (75%)

Query: 1  ELSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR 48
          EL  +AK++V+R+LF+EQP+PQA +A W+ K  +K  +E+T  L+ LR
Sbjct: 48 ELPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLR 95


>gi|157876614|ref|XP_001686652.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68129727|emb|CAJ09033.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 626

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 32/71 (45%), Positives = 44/71 (61%), Gaps = 1/71 (1%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFP-NLTVGVLTRDSVRAALRSGITAAQII 289
           +I ETNFR++AYT+    + ++  F EL      NL    +TR S  AA+R GITAAQ++
Sbjct: 426 IITETNFRLFAYTNDADLLNILNQFAELEEIVNGNLHCYRVTRASFAAAMRKGITAAQVL 485

Query: 290 GFLRLHALPSV 300
            FL L A PS+
Sbjct: 486 RFLSLRAHPSM 496


>gi|340057132|emb|CCC51474.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 502

 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 84/341 (24%), Positives = 130/341 (38%), Gaps = 71/341 (20%)

Query: 82  QYALERWECILRF-----MVGSQQTEGISAD---AVRTLFNEGAENP----VITKDGFQF 129
           Q AL R   +LR      + GS +  G +++   +V   F+     P    V+T +G  F
Sbjct: 131 QNALSRSRNMLRMVTACALAGSLEPLGTNSEQPISVMLQFSNLIPPPPEVGVVTPEGLNF 190

Query: 130 LLLETPAQVWYFILKYLETV----ESKGL---DLVECLTFLFQLKFSTFGTDYSTEGMSD 182
            +     Q W  +   L+ V     ++G+   +L + L  LF L  S F   +       
Sbjct: 191 CMQPIQQQWWTLVSVALDRVLAVAATQGVLRAELWQLLAVLFALDTSEFVYPFPDREKDF 250

Query: 183 KLQVFLQHLREFGLVYQRKRKAGRFYP-----------------TKLALNMATRGTLKQI 225
                L  L E GLVY       R +                    +AL    +G   ++
Sbjct: 251 AAFQLLARLSEVGLVYPLLCDGQRCFVLSPQFHHAVCWNSTAPLCTVALLDDDKGGSYRV 310

Query: 226 R--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV-LTRDSVRAALRSG 282
           R  +   +I ETNFR+YAYT +   +A++  F         + V   +TR S  AALR G
Sbjct: 311 RREDEDTIITETNFRLYAYTRNKHLLAILEQFAIKEAEVDGMIVCFRVTRASFAAALRKG 370

Query: 283 ITAAQIIGFLRLHALPSV-----SCP---------------------------LPPVVSD 310
           I A  I+ FLR+ A  S+     +C                            +P    D
Sbjct: 371 IGAQHIVQFLRVKAHSSMLKHQLTCDPRDSAGLAATSSRVTPTDTPWTHADKIIPRSFCD 430

Query: 311 QIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGV 351
           Q+  WE E  RLT +  ++  + VS +  E + D     G+
Sbjct: 431 QLFTWERECRRLTFNRALVLLRNVSSSQQELILDCLGHYGL 471


>gi|162605818|ref|XP_001713424.1| hypothetical protein GTHECHR3089 [Guillardia theta]
 gi|13794356|gb|AAK39733.1|AF083031_90 hypothetical protein [Guillardia theta]
          Length = 180

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/149 (21%), Positives = 65/149 (43%), Gaps = 14/149 (9%)

Query: 231 LIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGVLTRDSVRAALRSGITAAQIIG 290
           +I+E NFR+Y +   +   +      + +Y  P   VG +  +  R   + G+T+  I+ 
Sbjct: 45  IIIEVNFRIYIFKYFDFSNSFFKFISKKIYSLPYFYVGEINENYSRRLFQLGVTSDNILK 104

Query: 291 FLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLG 350
           F++   L      +P    D++R+WE   ++     G L            +R+Y  ++ 
Sbjct: 105 FIK-KNLHYTCNKIPSTFEDRLRIWEISFKKKYFFGGFL------------MRNYNSNI- 150

Query: 351 VLTWQNEKKRTVVVTKQGHSDIKKFWVNH 379
             + +  K  + V TK+  + I K  +N+
Sbjct: 151 FFSLKKTKSISYVKTKKHLAIISKIPINY 179


>gi|340508252|gb|EGR33999.1| hypothetical protein IMG5_027880 [Ichthyophthirius multifiliis]
          Length = 454

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 85/197 (43%), Gaps = 40/197 (20%)

Query: 128 QFLLLETPAQVWYFILKY------LETVESKGLDLVECLTFLFQLK-----FSTFGTDYS 176
           +F+L +T AQ+   +  Y      +E +E +  D++  +  L  L      F  + ++Y+
Sbjct: 189 EFILQDTNAQIHQILFYYCQVFAQMENIEEE--DIINFIITLSNLDVNKTYFFNYRSEYA 246

Query: 177 TEGMSDKLQVF----------LQHLREFGLVYQRKRKAGRFYPTKLALNMATRGT-LKQI 225
            +   D+   F          L  ++EF L  Q+K     F  T L      R   +K I
Sbjct: 247 KKQNIDQKFTFELKILSDLAKLGMIKEFDLGQQKKV----FGITPLIWQFCYRSIDIKTI 302

Query: 226 REPGFLIVETNFRVYAYTDSNLKVA----------LIGLFCELMYRFPNLTVGVLTRDSV 275
                +IVETNF +YAY D N +            L+  F ++ Y FP+L V  LT   +
Sbjct: 303 NAK--IIVETNFNLYAYLDYNPQNKTFSESKYIRDLLKKFSKIHYTFPHLIVAQLTEAKM 360

Query: 276 RAALRSGITAAQIIGFL 292
           + A   GIT+  +I F 
Sbjct: 361 KQAFNQGITSKLLIEFF 377


>gi|342184190|emb|CCC93671.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 499

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 68/235 (28%), Positives = 93/235 (39%), Gaps = 32/235 (13%)

Query: 96  VGSQQTEGISAD-AVRTLFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV----E 150
           +G ++ E ISA      L    +E   IT +G  F +     Q W  +   L+ V     
Sbjct: 155 LGERRGEPISAMLHFSRLIPPPSEAARITCEGLSFCMQSLQQQWWTLVSVALDRVLTLTS 214

Query: 151 SKGLD---LVECLTFLFQLKFSTFGTDYSTEGMSDKLQVF--LQHLREFGLVYQ---RKR 202
            KG+    L + L  LF L  S +   Y      + L+ F  L  L E GLVY      R
Sbjct: 215 GKGVTRATLWQLLAVLFALDASDYV--YPFPSKEEDLEAFHLLARLSEVGLVYPLLCNGR 272

Query: 203 KAGRFYP--------------TKLALNMATRGTLKQIR--EPGFLIVETNFRVYAYTDSN 246
           K     P                 AL      T   +R  +   +I ETNFR+YAYT + 
Sbjct: 273 KCFVLSPHFHHAICWSSVPSLCAAALLDNPNQTPCPLRREDEDTIITETNFRLYAYTRNP 332

Query: 247 LKVALIGLFCELMYRFPNLTVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
             + ++  F         + V   +TR S   ALR GI A  I+ FL L A PS+
Sbjct: 333 DLLGILDQFAVKEIDIDGIIVCYRVTRSSFALALRKGIDAKHILQFLTLKAHPSM 387


>gi|430746993|ref|YP_007206122.1| hypothetical protein Sinac_6345 [Singulisphaera acidiphila DSM
           18658]
 gi|430018713|gb|AGA30427.1| hypothetical protein Sinac_6345 [Singulisphaera acidiphila DSM
           18658]
          Length = 790

 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 55/118 (46%), Gaps = 15/118 (12%)

Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLT-VGV-----LTRDSVRAALRSGI 283
           FL V+ NF + AY    L   LIG F     RF   + VG      LT +SV   L  G+
Sbjct: 450 FLYVQPNFEIIAYR-QGLTPPLIGQFS----RFALWSQVGAALELKLTPESVYRGLEGGL 504

Query: 284 TAAQIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEK 341
           T  +++  L  H+       LP  VS+ +R W G RER+T        +F ++ D EK
Sbjct: 505 TPPEMLDRLARHS----QRVLPAGVSEAVRTWAGRRERITYYASATLVEFATKEDLEK 558


>gi|407397449|gb|EKF27748.1| DNA repair and transcription factor protein [Trypanosoma cruzi
           marinkellei]
          Length = 501

 Score = 47.4 bits (111), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 69/298 (23%), Positives = 108/298 (36%), Gaps = 59/298 (19%)

Query: 112 LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV----ESKGL---DLVECLTFLF 164
           L    +E   IT +G  F +     Q W  +   L+ V       G+   +L + L  LF
Sbjct: 173 LVPASSETREITSEGLSFCMQPLQQQWWTLVSVALDRVLILTNKSGVARAELWQLLAVLF 232

Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY---------------- 208
            L  S F   +  +         L  L E GLVY       R +                
Sbjct: 233 ALDTSQFVYQFPEKEKDLAAFQILARLSEVGLVYPLISGGHRCFVLSPHFHHAVCWSSTT 292

Query: 209 PTKLALNMATRG-TLKQIR--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
           P   A  + + G +L ++R  +   +I E NFR+YAYT +   + ++  F E       +
Sbjct: 293 PLCTAALLDSEGNSLGRVRREDEDTIITEANFRLYAYTKNPDLLNILNQFAERDAEIDQM 352

Query: 266 TVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPS------------------------- 299
                +TR +  AAL+ GI ++ I+ FL + A PS                         
Sbjct: 353 IACYRVTRKTFAAALKRGIGSSHILQFLAVKAHPSMLRNHRDGDSKKTSGLSVLGAGTRF 412

Query: 300 -------VSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEKLRDYAQDLG 350
                  +   +P    DQ+  WE E  RL  S  ++  + VS    E L ++    G
Sbjct: 413 KNTSDIRIDGIIPQSFCDQLMTWEKECRRLIFSRNLVLLRNVSAMQQELLLNFLSSGG 470


>gi|71650598|ref|XP_813994.1| DNA repair and transcription factor protein [Trypanosoma cruzi
           strain CL Brener]
 gi|70878928|gb|EAN92143.1| DNA repair and transcription factor protein, putative [Trypanosoma
           cruzi]
          Length = 501

 Score = 44.7 bits (104), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 27/216 (12%)

Query: 112 LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV----ESKGL---DLVECLTFLF 164
           L    +E   IT +G  F +     Q W  +   L+ +    ++ G+   +L + L  LF
Sbjct: 173 LVPTSSETREITSEGLSFCMQPLQQQWWTLVSVALDRILVLTKNSGVTRAELWQLLAVLF 232

Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY---------------- 208
            L  S F   +  +         L  L E GLVY       R +                
Sbjct: 233 ALNTSQFVYPFPEKEKDLAAFQILARLSEVGLVYPLISGGRRCFALSPHFHHAVCWSSTA 292

Query: 209 PTKLALNMATRG-TLKQIR--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
           P   A  + + G ++ ++R  +   +I E NFR+YAYT +   + ++  F E       +
Sbjct: 293 PLCTAALLDSGGNSMGRVRREDEDTIITEANFRLYAYTKNPDLLNILNQFAERDAEVDQM 352

Query: 266 TVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
                +TR +  AAL+ GI ++ I+ FL + A PS+
Sbjct: 353 IACYRVTRRTFAAALKRGIGSSHILQFLAVKAHPSM 388


>gi|71425201|ref|XP_813043.1| DNA repair and transcription factor protein [Trypanosoma cruzi
           strain CL Brener]
 gi|70877891|gb|EAN91192.1| DNA repair and transcription factor protein, putative [Trypanosoma
           cruzi]
          Length = 501

 Score = 44.3 bits (103), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 89/216 (41%), Gaps = 27/216 (12%)

Query: 112 LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV----ESKGL---DLVECLTFLF 164
           L    +E   IT +G  F +     Q W  +   L+ +    ++ G+   +L + L  LF
Sbjct: 173 LVPTSSETREITSEGLSFCMQPLQQQWWTLVSVALDRILVLTKNSGVTRAELWQLLAVLF 232

Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY---------------- 208
            L  S F   +  +         L  L E GLVY       R +                
Sbjct: 233 ALDTSQFVYPFPEKEKDLAAFQILARLSEVGLVYPLISGGRRCFALSPHFHHAVCWSSTA 292

Query: 209 PTKLALNMATRG-TLKQIR--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
           P   A  + + G ++ ++R  +   +I E NFR+YAYT +   + ++  F E       +
Sbjct: 293 PLCTAALLDSGGNSMGRVRREDEDTIITEANFRLYAYTRNPDLLNILNQFAERDAEVDQM 352

Query: 266 TVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
                +TR +  AAL+ GI ++ I+ FL + A PS+
Sbjct: 353 IACYRVTRRTFAAALKRGIGSSHILQFLAVKAHPSM 388


>gi|407835017|gb|EKF99108.1| DNA repair and transcription factor protein, putative [Trypanosoma
           cruzi]
          Length = 501

 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 88/216 (40%), Gaps = 27/216 (12%)

Query: 112 LFNEGAENPVITKDGFQFLLLETPAQVWYFILKYLETV----ESKGL---DLVECLTFLF 164
           L    +E   IT +G  F +     Q W  +   L+ +    ++ G+   +L + L  LF
Sbjct: 173 LVPTSSETREITSEGLSFCMQPLQQQWWTLVSVALDRILVLTKNSGVTRAELWQLLAVLF 232

Query: 165 QLKFSTFGTDYSTEGMSDKLQVFLQHLREFGLVYQRKRKAGRFY---------------- 208
            L  S F   +  +         L  L E GLVY       R +                
Sbjct: 233 ALDTSQFVYPFPEKEKDLAAFQILARLSEVGLVYPLISGGRRCFALSPHFHHAVCWSSTA 292

Query: 209 PTKLALNMATRG-TLKQIR--EPGFLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNL 265
           P   A  + + G ++ ++R  +   +I E NFR+YAYT +   + ++  F E       +
Sbjct: 293 PLCTAALLDSGGDSMGRVRREDEDTIITEANFRLYAYTRNPDLLNILNQFAERDAEVDQM 352

Query: 266 TVGV-LTRDSVRAALRSGITAAQIIGFLRLHALPSV 300
                +TR +  AAL+ GI +  I+ FL + A PS+
Sbjct: 353 IACYRVTRRTFAAALKRGIGSLHILQFLAVKAHPSM 388


>gi|300122496|emb|CBK23066.2| unnamed protein product [Blastocystis hominis]
          Length = 339

 Score = 42.0 bits (97), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 40/155 (25%), Positives = 68/155 (43%), Gaps = 23/155 (14%)

Query: 121 VITKDGFQFLLLETPAQVWYFILKYLETVESKGLDLVECLTFLFQLKFSTFGTDYSTEGM 180
            IT  G+QFL+     Q W+ +  ++ +  +     +  L+FLF L+       Y    +
Sbjct: 182 AITATGYQFLMKSYRDQAWWLLHPHITSANA-----LPALSFLFSLRHMIPCRAYDVAAL 236

Query: 181 SDKLQVFLQHLREFGLVYQRKRKAGR-FYPTKLALNM-----ATRGT-LKQIREPGFL-- 231
           S+  +  L+  R  G+VY       R F+ T L   +     A+RG   ++    GF+  
Sbjct: 237 SEPQRQLLEIARIAGVVYTPTTAGKRVFFATPLGAALFTQEGASRGEGGRKSGGEGFMKA 296

Query: 232 ------IVETNFRVYAY---TDSNLKVALIGLFCE 257
                 ++E NF VYA+    ++ L V L+  F E
Sbjct: 297 EGSMRVVIEANFHVYAFGLLAENRLHVQLLRQFLE 331


>gi|2245059|emb|CAB10482.1| transcription factor like protein [Arabidopsis thaliana]
 gi|7268453|emb|CAB80973.1| transcription factor like protein [Arabidopsis thaliana]
          Length = 101

 Score = 41.2 bits (95), Expect = 0.86,   Method: Composition-based stats.
 Identities = 16/47 (34%), Positives = 29/47 (61%)

Query: 2  LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR 48
          L  +AK YV+++L+++ PVP  ++  W+      +H+ A D L +LR
Sbjct: 40 LPPLAKKYVLQMLYIDVPVPATMMEEWVLADGTSKHRVAIDRLIQLR 86


>gi|320103368|ref|YP_004178959.1| hypothetical protein Isop_1828 [Isosphaera pallida ATCC 43644]
 gi|319750650|gb|ADV62410.1| hypothetical protein Isop_1828 [Isosphaera pallida ATCC 43644]
          Length = 961

 Score = 41.2 bits (95), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 49/115 (42%), Gaps = 9/115 (7%)

Query: 230 FLIVETNFRVYAYTDSNLKVALIGLFCELMYRFPNLTVGV---LTRDSVRAALRSGITAA 286
           FL ++ NF + AY    L    IG    L  R+  L       +T DS+   L  G++  
Sbjct: 496 FLNIQPNFEMIAYR-QGLNPGAIGKLT-LFGRWTKLGGAAELKVTEDSLTLGLHGGLSVD 553

Query: 287 QIIGFLRLHALPSVSCPLPPVVSDQIRLWEGERERLTTSEGVLYSQFVSQADFEK 341
            I+  LR H+      PL P +   +R W   RER+T        +F +  D E+
Sbjct: 554 AILERLRRHS----PRPLAPALERTLRGWSARRERITLHTAATLLEFATPEDLEE 604


>gi|90398991|emb|CAJ86263.1| H0801D08.21 [Oryza sativa Indica Group]
          Length = 119

 Score = 40.0 bits (92), Expect = 1.8,   Method: Composition-based stats.
 Identities = 17/47 (36%), Positives = 28/47 (59%)

Query: 2  LSEIAKHYVIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELR 48
          L  +AK Y +++L+V  PV  A +  W+   +A +H+ A D L +LR
Sbjct: 40 LPPLAKKYALQMLYVSAPVAAAAMEEWVLDEYAAKHRVAIDRLLQLR 86


>gi|393786717|ref|ZP_10374849.1| hypothetical protein HMPREF1068_01129 [Bacteroides nordii CL02T12C05]
 gi|392657952|gb|EIY51582.1| hypothetical protein HMPREF1068_01129 [Bacteroides nordii CL02T12C05]
          Length = 1885

 Score = 38.9 bits (89), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 41/148 (27%), Positives = 69/148 (46%), Gaps = 24/148 (16%)

Query: 10   VIRLLFVEQPVPQAVIASWLSKTHAKEHKEATDTLTELRRNAKVALLGGGKPWSMSAKLE 69
            + RL  ++QP       SW         +  T+ + EL  NA++A+L G KP      LE
Sbjct: 1433 LTRLKELQQP-----DGSWAWYKGMNGSRYITNYIVEL--NARLAMLTG-KP------LE 1478

Query: 70   VDS---KARDLDFLNQYALERWECILRFMVGSQQTEGISADAVRTLFNEGAENPVITKDG 126
             D+   +   L FLN+ ALE ++ I +      + +G+SA A++ ++       +I   G
Sbjct: 1479 GDALVMQRTALSFLNKEALEEYKNIRKAEKNGDKVKGLSASALKYIY-------LIAISG 1531

Query: 127  FQFLLLETPAQVWYFILKYLETVESKGL 154
             Q  L E      YF+ K  ET+ ++ +
Sbjct: 1532 EQIPLAENQTAYNYFLSKVTETLTTRSM 1559


>gi|359320939|ref|XP_003431754.2| PREDICTED: uncharacterized protein LOC612235 [Canis lupus
           familiaris]
          Length = 1766

 Score = 38.1 bits (87), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 31/50 (62%)

Query: 318 ERERLTTSEGVLYSQFVSQADFEKLRDYAQDLGVLTWQNEKKRTVVVTKQ 367
           ERE LT   GVL S F    D E+LRD AQ++ +L   ++K +++ + +Q
Sbjct: 133 EREELTPQLGVLLSHFNIPYDLEELRDAAQEIELLNLVSQKFQSIFMEQQ 182


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.136    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,805,996,948
Number of Sequences: 23463169
Number of extensions: 231505562
Number of successful extensions: 513480
Number of sequences better than 100.0: 415
Number of HSP's better than 100.0 without gapping: 398
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 511611
Number of HSP's gapped (non-prelim): 551
length of query: 386
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 242
effective length of database: 8,980,499,031
effective search space: 2173280765502
effective search space used: 2173280765502
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)