BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 036799
         (166 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255556021|ref|XP_002519045.1| tfiih, polypeptide, putative [Ricinus communis]
 gi|223541708|gb|EEF43256.1| tfiih, polypeptide, putative [Ricinus communis]
          Length = 451

 Score =  226 bits (575), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 107/132 (81%), Positives = 118/132 (89%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S  QGFVVVETNFRMYAYSTSKLH EI+RLFS++EYQLPNL+VGA TKESL +AFENGIT
Sbjct: 293 SRKQGFVVVETNFRMYAYSTSKLHCEIMRLFSRVEYQLPNLVVGAMTKESLYSAFENGIT 352

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ      QNAHPRVA+RIPS+PENV DQIRLWESD+NRVEMTPAH YDEFPSRDVFEA
Sbjct: 353 AEQIISFLQQNAHPRVAERIPSVPENVTDQIRLWESDMNRVEMTPAHLYDEFPSRDVFEA 412

Query: 155 ACDYARDRSGLL 166
           AC++ARD +GLL
Sbjct: 413 ACNFARDWNGLL 424



 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 32/93 (34%), Positives = 38/93 (40%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           + S I  RLPT E+L+ YA+GQWE                                    
Sbjct: 125 LASNIAVRLPTLEELDTYALGQWECFLLHLINSGHAERSTNFSSSMMKIFQRGLLTQRDK 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LMDTNAQLWYI+REYISNS+
Sbjct: 185 EAPRLTESGFQFLLMDTNAQLWYIIREYISNSE 217


>gi|449433857|ref|XP_004134713.1| PREDICTED: general transcription factor IIH subunit 4-like [Cucumis
           sativus]
          Length = 451

 Score =  224 bits (571), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 108/132 (81%), Positives = 118/132 (89%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S   GFVVVETNFRMYAYSTSKLH EILRLFS+IEYQLPNLIVGA TKESL NAF+NGIT
Sbjct: 293 SRKLGFVVVETNFRMYAYSTSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGIT 352

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ      QNAHPRVA+RIPS+PENV DQIRLWESDLNRV++TPAH+YDEFPSR+VFEA
Sbjct: 353 AEQIVTFLQQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREVFEA 412

Query: 155 ACDYARDRSGLL 166
           ACDYAR+ +GLL
Sbjct: 413 ACDYAREWNGLL 424



 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 39/93 (41%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           MPS I  RLP+ EDLEAYA+ QWE                                    
Sbjct: 125 MPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLSQRDK 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LM+TNAQLWYI+REYISN++
Sbjct: 185 EAPRLTESGFQFLLMETNAQLWYIIREYISNAE 217


>gi|224077128|ref|XP_002305144.1| predicted protein [Populus trichocarpa]
 gi|222848108|gb|EEE85655.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  218 bits (556), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 102/132 (77%), Positives = 117/132 (88%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S  QG+VVVETNFR+YAYS+SKLH EILRLFSKIEYQLPNLIVGA TKESL  AFENGIT
Sbjct: 291 SRKQGYVVVETNFRLYAYSSSKLHCEILRLFSKIEYQLPNLIVGAITKESLYTAFENGIT 350

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           ++Q      QNAHPRVA+R+PS+PENV DQIRLWE+DLNRVE+TP+H+YDEFPSRD FEA
Sbjct: 351 SDQIISFLQQNAHPRVAERLPSVPENVTDQIRLWEADLNRVEITPSHFYDEFPSRDTFEA 410

Query: 155 ACDYARDRSGLL 166
           ACD+AR+ +GLL
Sbjct: 411 ACDFAREWNGLL 422



 Score = 44.3 bits (103), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 18/23 (78%), Positives = 22/23 (95%)

Query: 22  QWELMDTNAQLWYIVREYISNSQ 44
           Q+ LMDTNAQLWYI+REYI+NS+
Sbjct: 193 QFLLMDTNAQLWYIIREYITNSE 215


>gi|147854406|emb|CAN81292.1| hypothetical protein VITISV_005315 [Vitis vinifera]
          Length = 451

 Score =  218 bits (556), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 103/132 (78%), Positives = 114/132 (86%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S  QGFVVVETNFR+YAYS+SKLH EILRLFS++EYQLPNLIVGA TKESL NAFENGIT
Sbjct: 293 SRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRVEYQLPNLIVGAITKESLYNAFENGIT 352

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ      QNAHPRVA+R P++PENV DQIRLWE+DLNRVE  P+H YDEFPSRDVFEA
Sbjct: 353 AEQIISFLQQNAHPRVAERTPAVPENVTDQIRLWETDLNRVETMPSHLYDEFPSRDVFEA 412

Query: 155 ACDYARDRSGLL 166
           ACD+AR+  GLL
Sbjct: 413 ACDFAREYGGLL 424



 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 40/93 (43%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           MPS I  RLP+ +DLEAYA+GQWE                                    
Sbjct: 125 MPSNITVRLPSLDDLEAYALGQWECFLLQLISSTQTEKLTNFSSSLMKVFQRGLLTQREK 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LMDTNAQLWYI+REYISNS+
Sbjct: 185 EAPRLTESGFQFLLMDTNAQLWYIMREYISNSE 217


>gi|356548733|ref|XP_003542754.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
           max]
          Length = 451

 Score =  217 bits (552), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 105/132 (79%), Positives = 116/132 (87%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S  QGFVVVETNFR+YAYSTSKLH EILRLFS++EYQLPNLIVGA TKESL +AFENGIT
Sbjct: 293 SRKQGFVVVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAITKESLYSAFENGIT 352

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ      QNAHPRVA+RIPS+PENV +QIRLWE+DLNRVEMT A+YYDEFPSRDVFE 
Sbjct: 353 AEQIITFLQQNAHPRVAERIPSVPENVTEQIRLWEADLNRVEMTDAYYYDEFPSRDVFEG 412

Query: 155 ACDYARDRSGLL 166
           ACD AR+ +GLL
Sbjct: 413 ACDCAREWNGLL 424



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 39/93 (41%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           MPS I  RLPT E+LEAYA+ QWE                                    
Sbjct: 125 MPSNITVRLPTLENLEAYALEQWECFLLQLISPSQVEKPLNISSSLMKVFQRRILSYRDK 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LMDTNAQLWYI+REYISNS+
Sbjct: 185 EAPKLTESGFQFLLMDTNAQLWYIIREYISNSE 217


>gi|358346685|ref|XP_003637396.1| General transcription factor IIH subunit [Medicago truncatula]
 gi|355503331|gb|AES84534.1| General transcription factor IIH subunit [Medicago truncatula]
          Length = 451

 Score =  217 bits (552), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 110/164 (67%), Positives = 125/164 (76%), Gaps = 17/164 (10%)

Query: 20  IGQWELMDTNAQLWYI-----------VREYISNSQGFVVVETNFRMYAYSTSKLHYEIL 68
           +G  +L     + W+I           + E  S  +GFVVVETNFR+YAYSTSKLH EIL
Sbjct: 261 LGLVKLQQGRKESWFIPTKLATNLSVSMTESSSRKEGFVVVETNFRVYAYSTSKLHCEIL 320

Query: 69  RLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPENVC 122
           RLFS++EYQLPNLIVGA TKESL NAF+NGITA+Q      QNAHPRVA RIP++PENV 
Sbjct: 321 RLFSRVEYQLPNLIVGAITKESLYNAFDNGITADQIVSFFRQNAHPRVAQRIPAVPENVT 380

Query: 123 DQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           DQIRLWESDLNRVEMT A+YYDEFPSRDVFE ACD AR+ SGLL
Sbjct: 381 DQIRLWESDLNRVEMTEAYYYDEFPSRDVFEGACDCAREWSGLL 424



 Score = 55.1 bits (131), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 38/93 (40%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           MPS I  RLPT EDLE YA+ QWE                                    
Sbjct: 125 MPSNITVRLPTLEDLETYALEQWECFLLQLISPSHVDKTLNISSSLMKVFQRRLLSQRDR 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LMDTNAQLWYI+REYI+NS+
Sbjct: 185 EAPKLTESGFQFLLMDTNAQLWYIIREYITNSE 217


>gi|357461579|ref|XP_003601071.1| General transcription factor IIH subunit [Medicago truncatula]
 gi|355490119|gb|AES71322.1| General transcription factor IIH subunit [Medicago truncatula]
          Length = 452

 Score =  216 bits (551), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 109/164 (66%), Positives = 125/164 (76%), Gaps = 17/164 (10%)

Query: 20  IGQWELMDTNAQLWYI-----------VREYISNSQGFVVVETNFRMYAYSTSKLHYEIL 68
           +G  +L     + W+I           + E  S  +GFVVVETNFR+YAYSTSKLH EIL
Sbjct: 262 LGLVKLQQGRKESWFIPTKLATNLSVSMTESSSRKEGFVVVETNFRVYAYSTSKLHCEIL 321

Query: 69  RLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPENVC 122
           RLFS++EYQLPNLIVGA TKESL NAF+NGITA+Q      QNAHPRVA R+P++PENV 
Sbjct: 322 RLFSRVEYQLPNLIVGAITKESLYNAFDNGITADQIVSFLRQNAHPRVAQRVPAVPENVT 381

Query: 123 DQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           DQIRLWESDLNRVEMT A+YYDEFPSRDVFE ACD AR+ SGLL
Sbjct: 382 DQIRLWESDLNRVEMTEAYYYDEFPSRDVFEGACDCAREWSGLL 425



 Score = 55.5 bits (132), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 38/93 (40%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           MPS I  RLPT EDLE YA+ QWE                                    
Sbjct: 126 MPSNITVRLPTLEDLETYALEQWECFLLQLISPSHVDKTLNISSSLMKVFQRRLLSQRDR 185

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LMDTNAQLWYI+REYI+NS+
Sbjct: 186 EAPKLTESGFQFLLMDTNAQLWYIIREYITNSE 218


>gi|115461368|ref|NP_001054284.1| Os04g0680000 [Oryza sativa Japonica Group]
 gi|113565855|dbj|BAF16198.1| Os04g0680000 [Oryza sativa Japonica Group]
          Length = 444

 Score =  215 bits (548), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 110/203 (54%), Positives = 132/203 (65%), Gaps = 57/203 (28%)

Query: 21  GQWELMDTNAQLWYIVREYISNS------------------------------------- 43
            +++LM+TNAQLWYI+REYIS++                                     
Sbjct: 213 SEFKLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGEAYSLNTLTDVQRN 272

Query: 44  --------------QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKE 89
                         QGFVVVETNFRMYAYSTS+LH EILRLFS++EYQLPNLIVG+ TKE
Sbjct: 273 AIRDLAELGLVKLQQGFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKE 332

Query: 90  SLNNAFENGITAE------QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYY 143
           SL  AFENGITAE      QQNAHPRVAD+IP++PENV DQIRLWE+D NRV+MT +H Y
Sbjct: 333 SLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLY 392

Query: 144 DEFPSRDVFEAACDYARDRSGLL 166
           ++FPS+D+F+  CDYARD   LL
Sbjct: 393 EDFPSKDMFDQCCDYARDHGCLL 415



 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 3/43 (6%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNS 43
           +PS + ARLPT  +LE++A+ QWE+  T A   Y+   Y  N+
Sbjct: 125 IPSSVTARLPTLAELESFALEQWEVRMTKA---YVCISYCLNA 164


>gi|90399251|emb|CAJ86205.1| B0213E10.4 [Oryza sativa Indica Group]
          Length = 470

 Score =  215 bits (547), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 111/202 (54%), Positives = 131/202 (64%), Gaps = 57/202 (28%)

Query: 22  QWELMDTNAQLWYIVREYISNS-------------------------------------- 43
           Q+ LM+TNAQLWYI+REYIS++                                      
Sbjct: 242 QFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGEAYSLNTLTDVQRNA 301

Query: 44  -------------QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKES 90
                        QGFVVVETNFRMYAYSTS+LH EILRLFS++EYQLPNLIVG+ TKES
Sbjct: 302 IRDLAELGLVKLQQGFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKES 361

Query: 91  LNNAFENGITAE------QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYD 144
           L  AFENGITAE      QQNAHPRVAD+IP++PENV DQIRLWE+D NRV+MT +H Y+
Sbjct: 362 LYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYE 421

Query: 145 EFPSRDVFEAACDYARDRSGLL 166
           +FPS+D+F+  CDYARD   LL
Sbjct: 422 DFPSKDMFDQCCDYARDHGCLL 443



 Score = 37.7 bits (86), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 3/43 (6%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNS 43
           +PS + ARLPT  +LE++A+ QWE+  T A   Y+   Y  N+
Sbjct: 125 IPSSVTARLPTLAELESFALEQWEVRMTKA---YVCISYCLNA 164


>gi|356521347|ref|XP_003529318.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
           max]
          Length = 451

 Score =  213 bits (541), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 102/132 (77%), Positives = 115/132 (87%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S  QGFVVVETNFR+YAYSTSKLH EILRLFS++EYQLPNLIVGA TKESL +A+ENGIT
Sbjct: 293 SRKQGFVVVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAITKESLYSAYENGIT 352

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ      QNAHPRVA+R+PS+PENV +QIRLWE+DLNRVEMT  +YYDEFPSRDVFE 
Sbjct: 353 AEQIVTFLQQNAHPRVAERVPSVPENVTEQIRLWEADLNRVEMTDTYYYDEFPSRDVFEG 412

Query: 155 ACDYARDRSGLL 166
           ACD AR+ +GLL
Sbjct: 413 ACDCAREWNGLL 424



 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 39/93 (41%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           MPS I  RLPT E+LEAYA+ QWE                                    
Sbjct: 125 MPSNITVRLPTLENLEAYALEQWECFLLQLISPSQVEKPLNISSSLMKVFQRRILSHRDK 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LMDTNAQLWYI+REYISNS+
Sbjct: 185 EAPKLTESGFQFLLMDTNAQLWYIIREYISNSE 217


>gi|414584751|tpg|DAA35322.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
          Length = 451

 Score =  207 bits (528), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 97/131 (74%), Positives = 111/131 (84%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N +GFVVVETNFRMYAYSTSKLH EILRLF+++EYQLPNLIVGA TKES+  AFENGITA
Sbjct: 294 NKEGFVVVETNFRMYAYSTSKLHCEILRLFARVEYQLPNLIVGAVTKESIYGAFENGITA 353

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           EQ      QNAHPRVAD+IP++PENV DQIRLWE+DLNRVEM P+H Y++FPS++ FE  
Sbjct: 354 EQIISFLRQNAHPRVADKIPAVPENVTDQIRLWETDLNRVEMIPSHLYEDFPSKEWFEQC 413

Query: 156 CDYARDRSGLL 166
           CDYARD   LL
Sbjct: 414 CDYARDNGYLL 424



 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 37/93 (39%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           +P  +  RLPT  DLEAYA+ QWE                                    
Sbjct: 125 LPLSVTGRLPTPADLEAYALDQWECFLLQLINSSQVEKGSSFSSSMMKTFQRGLLSSRDG 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LM+TNAQLWYI+REYIS+++
Sbjct: 185 EASKLTENGFQFLLMETNAQLWYIMREYISSAE 217


>gi|242074798|ref|XP_002447335.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
 gi|241938518|gb|EES11663.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
          Length = 451

 Score =  206 bits (524), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 97/131 (74%), Positives = 110/131 (83%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N +GFVVVETNFRMYAYSTSKLH EILRLFS++EYQLPNLIVGA TKES+  AFENGITA
Sbjct: 294 NKEGFVVVETNFRMYAYSTSKLHCEILRLFSRVEYQLPNLIVGAVTKESIYGAFENGITA 353

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           EQ      QNAHPRVAD+IP++PENV DQIRLWE+D NRVEM P+H Y++FPS++ FE  
Sbjct: 354 EQIISFLRQNAHPRVADKIPTVPENVTDQIRLWETDRNRVEMIPSHLYEDFPSKEWFEQC 413

Query: 156 CDYARDRSGLL 166
           CDYARD   LL
Sbjct: 414 CDYARDNGYLL 424



 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 37/93 (39%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           +P  +  RLPT  DLEAYA+ QWE                                    
Sbjct: 125 LPLSVTGRLPTPADLEAYALDQWECFLLQLINSSQVEKGTSFSSSMMKTFQRGLLSSRDG 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LM+TNAQLWYI+REYIS+++
Sbjct: 185 EASKLTENGFQFLLMETNAQLWYIMREYISSAE 217


>gi|297800414|ref|XP_002868091.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313927|gb|EFH44350.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  206 bits (523), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 101/164 (61%), Positives = 121/164 (73%), Gaps = 17/164 (10%)

Query: 20  IGQWELMDTNAQLWYIVREYISN-----------SQGFVVVETNFRMYAYSTSKLHYEIL 68
           +G  +L       W+I  +  +N            +GFVV+ETNFRMYAYSTSKL  EIL
Sbjct: 261 LGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEIL 320

Query: 69  RLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPENVC 122
           RLF++IEYQLPNLI  A TKESL NAF+NGIT++Q      QN+HPR ADR+PSIPENV 
Sbjct: 321 RLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVT 380

Query: 123 DQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           DQIRLWE+DL R+EMT AH+YDEFPS+DVFEAACD+AR+  GLL
Sbjct: 381 DQIRLWETDLKRIEMTQAHFYDEFPSKDVFEAACDFAREWGGLL 424



 Score = 43.9 bits (102), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 18/23 (78%), Positives = 22/23 (95%)

Query: 22  QWELMDTNAQLWYIVREYISNSQ 44
           Q+ LMDTNAQLWYI+REYISN++
Sbjct: 195 QFLLMDTNAQLWYIIREYISNAE 217


>gi|334186626|ref|NP_001190745.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
 gi|332658440|gb|AEE83840.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
          Length = 482

 Score =  205 bits (522), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 101/164 (61%), Positives = 121/164 (73%), Gaps = 17/164 (10%)

Query: 20  IGQWELMDTNAQLWYIVREYISN-----------SQGFVVVETNFRMYAYSTSKLHYEIL 68
           +G  +L       W+I  +  +N            +GFVV+ETNFRMYAYSTSKL  EIL
Sbjct: 261 LGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVVMETNFRMYAYSTSKLQCEIL 320

Query: 69  RLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPENVC 122
           RLF++IEYQLPNLI  A TKESL NAF+NGIT++Q      QN+HPR ADR+PSIPENV 
Sbjct: 321 RLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFLQQNSHPRCADRVPSIPENVT 380

Query: 123 DQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           DQIRLWE+DL R+EMT AH+YDEFPS+DVFEAACD+AR+  GLL
Sbjct: 381 DQIRLWETDLQRIEMTQAHFYDEFPSKDVFEAACDFAREWRGLL 424



 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 17/23 (73%), Positives = 21/23 (91%)

Query: 22  QWELMDTNAQLWYIVREYISNSQ 44
           Q+ LMDTNAQLWYI+REYI N++
Sbjct: 195 QFLLMDTNAQLWYIIREYILNAE 217


>gi|42566894|ref|NP_193435.2| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
 gi|51969678|dbj|BAD43531.1| unnamed protein product [Arabidopsis thaliana]
 gi|115646777|gb|ABJ17114.1| At4g17020 [Arabidopsis thaliana]
 gi|332658439|gb|AEE83839.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
          Length = 452

 Score =  205 bits (522), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 96/132 (72%), Positives = 112/132 (84%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +  +GFVV+ETNFRMYAYSTSKL  EILRLF++IEYQLPNLI  A TKESL NAF+NGIT
Sbjct: 293 ARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGIT 352

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           ++Q      QN+HPR ADR+PSIPENV DQIRLWE+DL R+EMT AH+YDEFPS+DVFEA
Sbjct: 353 SDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEA 412

Query: 155 ACDYARDRSGLL 166
           ACD+AR+  GLL
Sbjct: 413 ACDFAREWRGLL 424



 Score = 41.6 bits (96), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 17/23 (73%), Positives = 21/23 (91%)

Query: 22  QWELMDTNAQLWYIVREYISNSQ 44
           Q+ LMDTNAQLWYI+REYI N++
Sbjct: 195 QFLLMDTNAQLWYIIREYILNAE 217


>gi|42572935|ref|NP_974564.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
 gi|332658438|gb|AEE83838.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
           thaliana]
          Length = 462

 Score =  205 bits (521), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 96/132 (72%), Positives = 112/132 (84%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +  +GFVV+ETNFRMYAYSTSKL  EILRLF++IEYQLPNLI  A TKESL NAF+NGIT
Sbjct: 293 ARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGIT 352

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           ++Q      QN+HPR ADR+PSIPENV DQIRLWE+DL R+EMT AH+YDEFPS+DVFEA
Sbjct: 353 SDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEA 412

Query: 155 ACDYARDRSGLL 166
           ACD+AR+  GLL
Sbjct: 413 ACDFAREWRGLL 424



 Score = 41.2 bits (95), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 17/23 (73%), Positives = 21/23 (91%)

Query: 22  QWELMDTNAQLWYIVREYISNSQ 44
           Q+ LMDTNAQLWYI+REYI N++
Sbjct: 195 QFLLMDTNAQLWYIIREYILNAE 217


>gi|32487401|emb|CAE05735.1| OSJNBb0017I01.15 [Oryza sativa Japonica Group]
          Length = 451

 Score =  204 bits (520), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 95/131 (72%), Positives = 111/131 (84%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N +GFVVVETNFRMYAYSTS+LH EILRLFS++EYQLPNLIVG+ TKESL  AFENGITA
Sbjct: 294 NKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITA 353

Query: 102 E------QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           E      QQNAHPRVAD+IP++PENV DQIRLWE+D NRV+MT +H Y++FPS+D+F+  
Sbjct: 354 EQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQC 413

Query: 156 CDYARDRSGLL 166
           CDYARD   LL
Sbjct: 414 CDYARDHGCLL 424



 Score = 45.1 bits (105), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           +PS + ARLPT  +LE++A+ QWE                                    
Sbjct: 125 IPSSVTARLPTLAELESFALEQWECFLLQLINSSQVERGTSFSSSMMRTFQRGLLSSRDG 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LM+TNAQLWYI+REYIS+++
Sbjct: 185 EAPRLTENGFQFLLMETNAQLWYIMREYISSAE 217


>gi|218195833|gb|EEC78260.1| hypothetical protein OsI_17944 [Oryza sativa Indica Group]
          Length = 459

 Score =  204 bits (520), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 95/131 (72%), Positives = 111/131 (84%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N +GFVVVETNFRMYAYSTS+LH EILRLFS++EYQLPNLIVG+ TKESL  AFENGITA
Sbjct: 302 NKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITA 361

Query: 102 E------QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           E      QQNAHPRVAD+IP++PENV DQIRLWE+D NRV+MT +H Y++FPS+D+F+  
Sbjct: 362 EQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQC 421

Query: 156 CDYARDRSGLL 166
           CDYARD   LL
Sbjct: 422 CDYARDHGCLL 432



 Score = 45.1 bits (105), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 39/93 (41%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           +PS + ARLPT  +LE++A+ QWE                                    
Sbjct: 125 IPSSVTARLPTLAELESFALEQWECFLLQLINSSQVERGTSFSSSMMRTFQRGLLSSRDG 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LM+TNAQLWYI+REYIS+++
Sbjct: 185 EAPRLTENGFQFLLMETNAQLWYIMREYISSAE 217


>gi|222629781|gb|EEE61913.1| hypothetical protein OsJ_16644 [Oryza sativa Japonica Group]
          Length = 427

 Score =  204 bits (520), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 95/131 (72%), Positives = 111/131 (84%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N +GFVVVETNFRMYAYSTS+LH EILRLFS++EYQLPNLIVG+ TKESL  AFENGITA
Sbjct: 270 NKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITA 329

Query: 102 E------QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           E      QQNAHPRVAD+IP++PENV DQIRLWE+D NRV+MT +H Y++FPS+D+F+  
Sbjct: 330 EQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSKDMFDQC 389

Query: 156 CDYARDRSGLL 166
           CDYARD   LL
Sbjct: 390 CDYARDHGCLL 400



 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 39/69 (56%), Gaps = 25/69 (36%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE-------------------------LMDTNAQLWYI 35
           +PS + ARLPT  +LE++A+ QWE                         LM+TNAQLWYI
Sbjct: 125 IPSSVTARLPTLAELESFALEQWEVRMTKAYVCIDGEAPRLTENGFQFLLMETNAQLWYI 184

Query: 36  VREYISNSQ 44
           +REYIS+++
Sbjct: 185 MREYISSAE 193


>gi|51969958|dbj|BAD43671.1| unnamed protein product [Arabidopsis thaliana]
          Length = 452

 Score =  204 bits (519), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 96/132 (72%), Positives = 111/132 (84%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +  +GFVV+ETNFRMYAYSTSKL  EILRLF++IEYQLPNLI  A TKESL NAF NGIT
Sbjct: 293 ARKEGFVVMETNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFGNGIT 352

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           ++Q      QN+HPR ADR+PSIPENV DQIRLWE+DL R+EMT AH+YDEFPS+DVFEA
Sbjct: 353 SDQIITFLQQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDVFEA 412

Query: 155 ACDYARDRSGLL 166
           ACD+AR+  GLL
Sbjct: 413 ACDFAREWRGLL 424



 Score = 41.6 bits (96), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 17/23 (73%), Positives = 21/23 (91%)

Query: 22  QWELMDTNAQLWYIVREYISNSQ 44
           Q+ LMDTNAQLWYI+REYI N++
Sbjct: 195 QFLLMDTNAQLWYIIREYILNAE 217


>gi|326510735|dbj|BAJ91715.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 451

 Score =  192 bits (487), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 96/164 (58%), Positives = 115/164 (70%), Gaps = 17/164 (10%)

Query: 20  IGQWELMDTNAQLWYIVREYISN-----------SQGFVVVETNFRMYAYSTSKLHYEIL 68
           +G  +L       W+I  +  +N            +G VVVETNFR+YAYS SKLH EIL
Sbjct: 261 LGLVKLQQGRKDSWFIPTKLATNLSSSLSDSAASKEGIVVVETNFRLYAYSASKLHCEIL 320

Query: 69  RLFSKIEYQLPNLIVGATTKESLNNAFENGITAE------QQNAHPRVADRIPSIPENVC 122
           RLFS++EYQLPNLIVGA TKESL  AF+NGITAE      QQNAHPRV D+IP +PENV 
Sbjct: 321 RLFSRVEYQLPNLIVGAITKESLYGAFDNGITAEQIISFLQQNAHPRVIDKIPIVPENVT 380

Query: 123 DQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           DQIRLWE+D NRVEM  +H Y++FPS+D+FE  CD+ARD   LL
Sbjct: 381 DQIRLWENDRNRVEMILSHVYEDFPSKDMFEQCCDHARDNGYLL 424



 Score = 42.4 bits (98), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 37/93 (39%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           +P  + ARLPT  +LE YA+ QWE                                    
Sbjct: 125 LPFNVTARLPTLVELENYALEQWECFLLQLINSSQVEKGTTFSSSMMKTFQRGLLSSRDG 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LM+TNAQLWYI+REYIS+++
Sbjct: 185 EAAKLSENGFQFLLMETNAQLWYIMREYISSAE 217


>gi|357166758|ref|XP_003580836.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Brachypodium distachyon]
          Length = 451

 Score =  191 bits (486), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 90/130 (69%), Positives = 105/130 (80%), Gaps = 6/130 (4%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
            +G VVVETNFR+YAYS S+LH EILRLFS++EYQLPNLIVGA TKESL  AF+NGITAE
Sbjct: 295 KEGIVVVETNFRLYAYSASRLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAE 354

Query: 103 Q------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAAC 156
           Q      QNAHPRVAD+IP +PENV DQIRLWE+D NRV+M  +H Y++FPS+D+FE  C
Sbjct: 355 QIISFLKQNAHPRVADKIPVVPENVTDQIRLWETDRNRVDMVLSHVYEDFPSKDMFEQCC 414

Query: 157 DYARDRSGLL 166
           D ARD   LL
Sbjct: 415 DLARDNGFLL 424



 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 36/93 (38%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           + S +  RLPT  +LE YA+ QWE                                    
Sbjct: 125 ISSSVTGRLPTLTELENYALEQWECFLLQLINSSQVERGTSFSSSMMKTFQRGLLSSRDG 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LM+TNAQLWYI+REYIS+++
Sbjct: 185 DAPKLSENGFQFLLMETNAQLWYIMREYISSAE 217


>gi|357123176|ref|XP_003563288.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
           subunit 4-like [Brachypodium distachyon]
          Length = 452

 Score =  189 bits (480), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 89/130 (68%), Positives = 104/130 (80%), Gaps = 6/130 (4%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
            +G VVVETNFR+YAYS S+LH EILRLFS++EYQLPNLIVGA TKESL  AF+NGITAE
Sbjct: 296 KEGIVVVETNFRLYAYSASRLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAE 355

Query: 103 Q------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAAC 156
           Q      QNAHP VAD+IP +PENV DQIRLWE+D NRV+M  +H Y++FPS+D+FE  C
Sbjct: 356 QIISFLKQNAHPXVADKIPVVPENVTDQIRLWETDRNRVDMVLSHVYEDFPSKDLFEQCC 415

Query: 157 DYARDRSGLL 166
           D ARD   LL
Sbjct: 416 DLARDNGFLL 425



 Score = 42.0 bits (97), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 27/93 (29%), Positives = 36/93 (38%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           +PS +  RLPT  +LE YA+ QWE                                    
Sbjct: 126 IPSSVTGRLPTLTELENYALEQWECFLLQLINSSQVERGTSFSSSMMKTFQRGLLSSRDG 185

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LM+TN QLWYI+REYIS+++
Sbjct: 186 DAPKLSENGFQFLLMETNVQLWYIMREYISSAE 218


>gi|167997877|ref|XP_001751645.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697626|gb|EDQ83962.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 462

 Score =  168 bits (425), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 83/170 (48%), Positives = 113/170 (66%), Gaps = 19/170 (11%)

Query: 16  EAYAIGQWELMDTNAQLWYIVREYISN-------------SQGFVVVETNFRMYAYSTSK 62
           E  A+G  +L     + WYI  +  SN             S+GFVVVETNF++YAY++SK
Sbjct: 258 ELAALGLVKLQQGMKESWYIPTKLASNLSASLSESTDWQSSEGFVVVETNFKVYAYTSSK 317

Query: 63  LHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPS 116
           L  EILR F+++EYQLPNL+V   TKES+N A  +GI+AEQ      ++AHP VA +IP 
Sbjct: 318 LQTEILRCFTRLEYQLPNLVVATLTKESVNKALGSGISAEQIISFLRKHAHPHVAQKIPV 377

Query: 117 IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           +PE V DQ+RLWE+D NRV+  PA++YD+FP+  ++EA   +ARD  GLL
Sbjct: 378 VPETVSDQLRLWETDRNRVQFEPAYFYDDFPTMAIYEAVVAHARDLGGLL 427



 Score = 38.9 bits (89), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 51/131 (38%), Gaps = 62/131 (47%)

Query: 5   IIARLPTSEDLEAYAIGQWE---------------------------------------- 24
           + AR+P S DLE YA+ QWE                                        
Sbjct: 130 VAARVPNSADLENYAMKQWESVLLQLVDCAADGPAGPKNPFIIKVFQRSGLLTPENESPS 189

Query: 25  ---------LMDTNAQLWYIVREYISNSQ----------GFVVVETNFRMY--AYSTSKL 63
                    LMDTN+QLW +VREY+++S+          GF ++E  F +   AYS + L
Sbjct: 190 LTDLGFQFLLMDTNSQLWQLVREYVTSSEDRGTDSGELVGF-LLELGFHLVGEAYSVNSL 248

Query: 64  HYEILRLFSKI 74
              + ++  ++
Sbjct: 249 SPALQKVLDEL 259


>gi|302806060|ref|XP_002984780.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
 gi|302808297|ref|XP_002985843.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
 gi|300146350|gb|EFJ13020.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
 gi|300147366|gb|EFJ14030.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
          Length = 459

 Score =  162 bits (409), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 74/131 (56%), Positives = 101/131 (77%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
            ++GF++VETNFR+YAY++SKLH E L +F + EY LPN++VG+ TKES+N AF +GI+A
Sbjct: 299 QTEGFIMVETNFRVYAYTSSKLHIETLHVFVRTEYVLPNILVGSITKESVNGAFASGISA 358

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           +Q      Q+AHP VA ++PS+PE VCDQIRLWESD  RV+  PA+ Y+ FPS  V+E+ 
Sbjct: 359 DQIIKFLQQHAHPFVAQKVPSVPETVCDQIRLWESDRVRVQYLPAYCYEGFPSTSVYESV 418

Query: 156 CDYARDRSGLL 166
             +ARDR+GLL
Sbjct: 419 VAHARDRNGLL 429


>gi|414584750|tpg|DAA35321.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
          Length = 386

 Score =  146 bits (369), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 69/90 (76%), Positives = 78/90 (86%), Gaps = 6/90 (6%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N +GFVVVETNFRMYAYSTSKLH EILRLF+++EYQLPNLIVGA TKES+  AFENGITA
Sbjct: 294 NKEGFVVVETNFRMYAYSTSKLHCEILRLFARVEYQLPNLIVGAVTKESIYGAFENGITA 353

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQI 125
           EQ      QNAHPRVAD+IP++PENV DQ+
Sbjct: 354 EQIISFLRQNAHPRVADKIPAVPENVTDQV 383



 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/93 (31%), Positives = 37/93 (39%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           +P  +  RLPT  DLEAYA+ QWE                                    
Sbjct: 125 LPLSVTGRLPTPADLEAYALDQWECFLLQLINSSQVEKGSSFSSSMMKTFQRGLLSSRDG 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LM+TNAQLWYI+REYIS+++
Sbjct: 185 EASKLTENGFQFLLMETNAQLWYIMREYISSAE 217


>gi|384246869|gb|EIE20357.1| Tfb2-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 451

 Score =  127 bits (319), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 61/124 (49%), Positives = 82/124 (66%), Gaps = 6/124 (4%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           + GFVVVETN+R+YAY++S L   +LRLF++ E  LPNL VG  T+ES+  A   G++A+
Sbjct: 293 TDGFVVVETNYRVYAYTSSLLQTALLRLFTRCECILPNLFVGVLTRESVTGALACGLSAD 352

Query: 103 Q------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAAC 156
           Q      Q+AHP VA R P +PE V DQ+RLW++D  RV    A  YD+FPS  VF+ + 
Sbjct: 353 QIVLYLRQHAHPHVASRTPVVPEVVADQVRLWQADTMRVRHNRAVLYDDFPSAQVFQLSA 412

Query: 157 DYAR 160
             AR
Sbjct: 413 QKAR 416


>gi|296084734|emb|CBI25875.3| unnamed protein product [Vitis vinifera]
          Length = 258

 Score =  122 bits (307), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 69/156 (44%), Positives = 90/156 (57%), Gaps = 30/156 (19%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVG---ATTKESLNNA--- 94
           S  QGFVVVETNFR+YAYS+SKLH EILRLFS+    L + ++    ++ K +L +    
Sbjct: 76  SRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRYASMLFDCLLCICYSSQKGTLLDVSLI 135

Query: 95  ----------------FENGITAEQQNAH--------PRVADRIPSIPENVCDQIRLWES 130
                           F + ++      H          +A++IP +      QIRLWE+
Sbjct: 136 MYECLRTFSLGKIIFWFFSSVSTVATPCHGYLVIFTISCLAEKIPCLQFTNMSQIRLWET 195

Query: 131 DLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           DLNRVE  P+H YDEFPSRDVFEAACD+AR+  GLL
Sbjct: 196 DLNRVETMPSHLYDEFPSRDVFEAACDFAREYGGLL 231


>gi|440791255|gb|ELR12502.1| TFIIH basal transcription factor complex subunit 2, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 304

 Score =  121 bits (303), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 53/131 (40%), Positives = 89/131 (67%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           + +G+++V T++R+YAY++S +   +L LF++IEYQLPN+++G   +E++  A + GI+A
Sbjct: 137 DQEGWIIVGTDYRIYAYTSSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISA 196

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
            Q       NAHP++    P IPE++ DQ+RLWE++  R+ ++P ++YD+F S   F+ A
Sbjct: 197 NQILQFLETNAHPQMKQNTPIIPESIADQLRLWEAEDRRLSLSPGYFYDDFASLAAFKKA 256

Query: 156 CDYARDRSGLL 166
             YARD   LL
Sbjct: 257 EKYARDVGALL 267


>gi|303272589|ref|XP_003055656.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463630|gb|EEH60908.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 476

 Score =  117 bits (294), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 58/125 (46%), Positives = 83/125 (66%), Gaps = 8/125 (6%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++VETNFR+YAY++S +  EILRLF++ +Y+LPNL VG  T+E++  A   GI+AEQ   
Sbjct: 318 IIVETNFRVYAYTSSAVEVEILRLFTRPDYKLPNLYVGMMTREAVVTALRGGISAEQIVS 377

Query: 104 ---QNAHPRVADRI-PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS-RDVFEAACDY 158
              ++AHP+      P+IP  VCDQIRLW  D NRV+ TP   Y +FP+   +FE   + 
Sbjct: 378 YLRKHAHPQARKTPGPAIPATVCDQIRLWSKDENRVKYTPCVLYCDFPTGTGMFEKVAEI 437

Query: 159 ARDRS 163
           A++R 
Sbjct: 438 AKERG 442


>gi|145351329|ref|XP_001420034.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580267|gb|ABO98327.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 495

 Score =  117 bits (294), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 56/129 (43%), Positives = 80/129 (62%), Gaps = 6/129 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G ++VETN+R+YAY++S +  EILRLF++ +Y+LPNL VG  T+ES+ NA   G+ AEQ
Sbjct: 333 EGHIIVETNYRVYAYTSSAVEMEILRLFTRADYRLPNLYVGMLTRESVQNALRAGVDAEQ 392

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                  +AH +V  + PS+P  VCDQIRLW  D+ R+E      Y +FP    F  A  
Sbjct: 393 IVGYIRAHAHKQVRRKKPSVPSTVCDQIRLWARDMERMEAEECVLYCDFPQAGTFYGAVV 452

Query: 158 YARDRSGLL 166
              ++ G L
Sbjct: 453 SEAEKRGAL 461


>gi|72074363|ref|XP_796111.1| PREDICTED: general transcription factor IIH subunit 4
           [Strongylocentrotus purpuratus]
          Length = 469

 Score =  116 bits (291), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 60/131 (45%), Positives = 80/131 (61%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           +  GF+VVETNFR+YAY+ S L  EIL LF  + Y+ PNL V A T+ES+  A  NGITA
Sbjct: 307 HKDGFIVVETNFRVYAYTESDLQVEILGLFCSMMYRFPNLSVAALTRESVQLAISNGITA 366

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           EQ       +AHP +  + P +P  + DQ+RLWE + +R+  T    Y+EF S   FE  
Sbjct: 367 EQILSFLRTHAHPNMRLKTPIVPPTISDQVRLWELERDRLSFTQGIIYNEFLSLHDFEVL 426

Query: 156 CDYARDRSGLL 166
            DYA+D   L+
Sbjct: 427 RDYAKDLGVLI 437


>gi|390370355|ref|XP_783140.3| PREDICTED: general transcription factor IIH subunit 4-like, partial
           [Strongylocentrotus purpuratus]
          Length = 180

 Score =  116 bits (291), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 60/131 (45%), Positives = 80/131 (61%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           +  GF+VVETNFR+YAY+ S L  EIL LF  + Y+ PNL V A T+ES+  A  NGITA
Sbjct: 27  HKDGFIVVETNFRIYAYTESDLQVEILGLFCSMMYRFPNLSVAALTRESVQLAISNGITA 86

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           EQ       +AHP +  + P +P  + DQ+RLWE + +R+  T    Y+EF S   FE  
Sbjct: 87  EQILSFLRTHAHPNMRLKTPIVPPTISDQVRLWELERDRLSFTQGIIYNEFLSLHDFEVL 146

Query: 156 CDYARDRSGLL 166
            DYA+D   L+
Sbjct: 147 RDYAKDLGVLI 157


>gi|440800489|gb|ELR21525.1| transcription factor tfb2 subfamily protein [Acanthamoeba
           castellanii str. Neff]
          Length = 465

 Score =  116 bits (291), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 51/131 (38%), Positives = 88/131 (67%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           + +G+++V T++R+YAY++S +   +L LF++IEYQLPN+++G   +E++  A + GI+A
Sbjct: 296 DQEGWIIVGTDYRIYAYTSSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISA 355

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
            Q       NAHP++    P IPE++ DQ+RLWE++  R+ ++  ++YD+F S   F+ A
Sbjct: 356 NQILQFLETNAHPQMKQNTPIIPESIADQLRLWEAEDRRLSLSSGYFYDDFASLAAFKKA 415

Query: 156 CDYARDRSGLL 166
             YARD   L+
Sbjct: 416 EKYARDVGALI 426


>gi|307106040|gb|EFN54287.1| hypothetical protein CHLNCDRAFT_25123 [Chlorella variabilis]
          Length = 458

 Score =  116 bits (291), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 56/129 (43%), Positives = 84/129 (65%), Gaps = 9/129 (6%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G+V+VE+NFR+YAY+TS +   +LR+F + +  LPNL VG  T+ES  NA + GI A+Q
Sbjct: 301 EGYVIVESNFRVYAYTTSAVQVAVLRVFVRCDALLPNLFVGTITRESATNALDTGIAADQ 360

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                 Q+AHPR A + P++   V DQIRLW  +L R++   A  YD+F S++++  A  
Sbjct: 361 VVAFLRQHAHPRAAAKTPTV---VTDQIRLWAQELKRLQEKNATLYDKFESKELYVGAVA 417

Query: 158 YARDRSGLL 166
           +AR  + LL
Sbjct: 418 HARQLNALL 426


>gi|384484453|gb|EIE76633.1| hypothetical protein RO3G_01337 [Rhizopus delemar RA 99-880]
          Length = 284

 Score =  116 bits (290), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 56/132 (42%), Positives = 84/132 (63%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S  QGF+++ETN+++YAY+ S L   +L LF +++ +  N++ G  T++S+ NA   GIT
Sbjct: 123 SVDQGFIILETNYKLYAYTDSPLQIAVLNLFVQLQSRFRNMVTGVITRDSIRNALMKGIT 182

Query: 101 AE------QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AE      Q +AHP++    P +P  V DQIRLWE + NR++ TP++ Y EF  +  F+A
Sbjct: 183 AEQIIYYLQSHAHPQMRKETPVLPLTVVDQIRLWEMERNRLKPTPSYLYHEFNVQADFDA 242

Query: 155 ACDYARDRSGLL 166
           A  YARD   LL
Sbjct: 243 AEKYARDLGVLL 254


>gi|348683175|gb|EGZ22990.1| hypothetical protein PHYSODRAFT_479459 [Phytophthora sojae]
          Length = 490

 Score =  114 bits (285), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 56/126 (44%), Positives = 84/126 (66%), Gaps = 6/126 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           ++Q  VVVETNF++YAY+TS LH  +L +F  I  +LPNL +G  T+ESL +A  +GI+A
Sbjct: 329 SAQLLVVVETNFKIYAYTTSTLHVAMLSVFVDIVARLPNLAIGFITRESLRSALIHGISA 388

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           +Q      ++AHP++    P IPEN+ DQI LWE + NRV+      +D F +++ +E+ 
Sbjct: 389 QQIYDFLMKHAHPKMRRNSPVIPENIADQIYLWERERNRVQFMEGILFDGFNTKEDYESV 448

Query: 156 CDYARD 161
            DYA+D
Sbjct: 449 RDYAKD 454


>gi|328873270|gb|EGG21637.1| general transcription factor IIH [Dictyostelium fasciculatum]
          Length = 453

 Score =  114 bits (284), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 56/125 (44%), Positives = 81/125 (64%), Gaps = 8/125 (6%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           QG++++ETNFR+YAY+ S L   ++ LF K+ Y+LPNL VG  T+ES+ +A  +GITA+Q
Sbjct: 299 QGYLILETNFRIYAYTQSSLQISLISLFVKMLYRLPNLAVGILTRESVRSALLHGITADQ 358

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                 QNAHP +   I   PE V +QIRLWES+ NR+    A  +D FP+ + F     
Sbjct: 359 IVDFVKQNAHPNMV--ISGYPEVVFEQIRLWESERNRITYKKAVLFDSFPNAESFSKTVT 416

Query: 158 YARDR 162
           +A+D+
Sbjct: 417 FAKDQ 421


>gi|147899535|ref|NP_001085110.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
           laevis]
 gi|47939830|gb|AAH72322.1| MGC83106 protein [Xenopus laevis]
          Length = 455

 Score =  113 bits (283), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 55/127 (43%), Positives = 83/127 (65%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S+ QGF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PNL+V   T+E++  A  NGIT
Sbjct: 294 SHKQGFIVVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQVTRENVQQAIGNGIT 353

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ        AHP +  + P++P  + DQIRLWE + +R+  +    Y++F S+  FE 
Sbjct: 354 AEQIIHFLRTRAHPVMLQQNPALPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFEL 413

Query: 155 ACDYARD 161
             +YARD
Sbjct: 414 LRNYARD 420


>gi|290987279|ref|XP_002676350.1| predicted protein [Naegleria gruberi]
 gi|284089952|gb|EFC43606.1| predicted protein [Naegleria gruberi]
          Length = 532

 Score =  112 bits (281), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 60/147 (40%), Positives = 87/147 (59%), Gaps = 6/147 (4%)

Query: 26  MDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGA 85
           M T+  L      + S + G+++VETN+R+YAY+ S L   +L LF   EY+LPN++VG 
Sbjct: 357 MSTSYDLISSFGSHNSINNGYIIVETNYRVYAYTNSPLQIALLSLFIFPEYRLPNMVVGL 416

Query: 86  TTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTP 139
            T+ ++  A +NGI+A Q       NAHP++  + P IP+ V DQI LWE + NRV  T 
Sbjct: 417 ITRSTIREALKNGISAHQILQFLRLNAHPQMRLKKPVIPDTVSDQILLWEKERNRVIKTT 476

Query: 140 AHYYDEFPSRDVFEAACDYARDRSGLL 166
           +  YD+F +     A  DYAR +  LL
Sbjct: 477 SVVYDKFTNVSELNATVDYARRQGALL 503


>gi|62751795|ref|NP_001015845.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
           (Silurana) tropicalis]
 gi|58475911|gb|AAH90134.1| general transcription factor II H, polypeptide 4 [Xenopus
           (Silurana) tropicalis]
 gi|89272862|emb|CAJ82116.1| transcription factor tfb2 [Xenopus (Silurana) tropicalis]
 gi|114107989|gb|AAI22900.1| gtf2h4 protein [Xenopus (Silurana) tropicalis]
          Length = 455

 Score =  112 bits (281), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 55/127 (43%), Positives = 82/127 (64%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S+ QGF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PNL+V   T+E++  A  NGIT
Sbjct: 294 SHKQGFIVVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQVTRENVQQAIGNGIT 353

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ        AHP +  + P +P  + DQIRLWE + +R+  +    Y++F S+  FE 
Sbjct: 354 AEQIIHFLRTRAHPVMLQQNPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFEL 413

Query: 155 ACDYARD 161
             +YARD
Sbjct: 414 LRNYARD 420


>gi|308452336|ref|XP_003089005.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
 gi|308243592|gb|EFO87544.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
          Length = 468

 Score =  112 bits (281), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 57/133 (42%), Positives = 83/133 (62%), Gaps = 9/133 (6%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S G V+VETNFR+YAY++S L   I+ LF+++ Y+  ++ VG  T+ES+  A ++GITA 
Sbjct: 302 SNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAA 361

Query: 103 Q------QNAHPR---VADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           Q       NAHP+    +  I  +P  V DQIRLWE +  R+++  A+ Y  F S D F+
Sbjct: 362 QIISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERRRMDLKDAYIYSHFESEDEFQ 421

Query: 154 AACDYARDRSGLL 166
             CDYA++R  LL
Sbjct: 422 GVCDYAQERGILL 434


>gi|308460924|ref|XP_003092760.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
 gi|308252519|gb|EFO96471.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
          Length = 470

 Score =  112 bits (280), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 57/133 (42%), Positives = 83/133 (62%), Gaps = 9/133 (6%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S G V+VETNFR+YAY++S L   I+ LF+++ Y+  ++ VG  T+ES+  A ++GITA 
Sbjct: 305 SNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAA 364

Query: 103 Q------QNAHPR---VADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           Q       NAHP+    +  I  +P  V DQIRLWE +  R+++  A+ Y  F S D F+
Sbjct: 365 QIISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERRRMDLKDAYIYSHFESEDEFQ 424

Query: 154 AACDYARDRSGLL 166
             CDYA++R  LL
Sbjct: 425 GVCDYAQERGILL 437


>gi|359497087|ref|XP_002267711.2| PREDICTED: general transcription factor IIH subunit 4-like [Vitis
           vinifera]
          Length = 238

 Score =  112 bits (280), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 61/126 (48%), Positives = 72/126 (57%), Gaps = 37/126 (29%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S  QGFVVVETNFR+YAYS+SKLH EILRLFS+                       + ++
Sbjct: 123 SRKQGFVVVETNFRLYAYSSSKLHCEILRLFSR-----------------------SSVS 159

Query: 101 AEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYAR 160
                 H  +              IRLWE+DLNRVE  P+H YDEFPSRDVFEAACD+AR
Sbjct: 160 TVATPCHGYL--------------IRLWETDLNRVETMPSHLYDEFPSRDVFEAACDFAR 205

Query: 161 DRSGLL 166
           +  GLL
Sbjct: 206 EYGGLL 211


>gi|395332344|gb|EJF64723.1| transcription factor Tfb2 [Dichomitus squalens LYAD-421 SS1]
          Length = 506

 Score =  111 bits (278), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 54/127 (42%), Positives = 79/127 (62%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S+ QGF+V+ETN+R+YAY+ + L   +L LF  ++Y+ PNL+VG  T++S+  A  NGIT
Sbjct: 344 SHVQGFIVLETNYRLYAYTDNPLQIAVLNLFVTLKYRFPNLVVGQITRDSVKKALANGIT 403

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q       +AHP++    P +P  V DQIRLWE + NRV+    + Y  F S   +E 
Sbjct: 404 ADQIISYLITHAHPQMRKNKPLLPVTVQDQIRLWELERNRVKSQEGYLYTAFASHADYEW 463

Query: 155 ACDYARD 161
             DYA+ 
Sbjct: 464 VLDYAKK 470


>gi|330918142|ref|XP_003298106.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
 gi|311328891|gb|EFQ93800.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
          Length = 482

 Score =  111 bits (277), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 52/122 (42%), Positives = 77/122 (63%), Gaps = 6/122 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +GF+++ETN+R+YAY+ S +   IL LF+K++++ PNL+ G  TKES++ A ++GIT+ Q
Sbjct: 320 KGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITSAQ 379

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                   AHP++   +P IP  V DQIRLWE +  RVE TP +   EF S   +     
Sbjct: 380 IISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDTEYRDVMG 439

Query: 158 YA 159
           YA
Sbjct: 440 YA 441


>gi|358057347|dbj|GAA96696.1| hypothetical protein E5Q_03367 [Mixia osmundae IAM 14324]
          Length = 467

 Score =  111 bits (277), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 50/129 (38%), Positives = 80/129 (62%), Gaps = 6/129 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           QGF+++ETN+R+YAY+ + L   +L LF  ++ + P L+VG  T+ES+     NGI ++Q
Sbjct: 307 QGFLILETNYRVYAYTQNPLQIAVLNLFVALKSRFPGLVVGMITRESIKAGLANGIKSDQ 366

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                  +AHP++  + P +P  V DQI+LWE + NRV+  P   YD+F S+  ++  CD
Sbjct: 367 IIAFLTAHAHPQMRKQEPLLPPTVVDQIKLWEREKNRVKTEPCFLYDDFRSQADYDLVCD 426

Query: 158 YARDRSGLL 166
           YA+    +L
Sbjct: 427 YAKQIGAVL 435


>gi|189205068|ref|XP_001938869.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
 gi|187985968|gb|EDU51456.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
          Length = 482

 Score =  111 bits (277), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 52/122 (42%), Positives = 77/122 (63%), Gaps = 6/122 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +GF+++ETN+R+YAY+ S +   IL LF+K++++ PNL+ G  TKES++ A ++GIT+ Q
Sbjct: 320 KGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITSAQ 379

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                   AHP++   +P IP  V DQIRLWE +  RVE TP +   EF S   +     
Sbjct: 380 IISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTPGYLMREFSSDAEYRDVMG 439

Query: 158 YA 159
           YA
Sbjct: 440 YA 441


>gi|260810774|ref|XP_002600123.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
 gi|229285409|gb|EEN56135.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
          Length = 457

 Score =  110 bits (276), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 53/131 (40%), Positives = 84/131 (64%), Gaps = 6/131 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           + +G++VVE+N+R+YAY+ S+L   ++ LFS+I Y+ PN++V   T++S+  A   GITA
Sbjct: 296 HKEGYLVVESNYRIYAYTDSELQVALIGLFSEILYRFPNMVVANLTRDSVQEAVVRGITA 355

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           +Q       NAHP+   R+P +P  + DQIRLWE + +R+  T    Y++F S+  FE  
Sbjct: 356 DQILHFLRVNAHPKALHRVPIVPPTISDQIRLWEMERDRLTFTEGVLYNQFLSQPDFEML 415

Query: 156 CDYARDRSGLL 166
            +YA+D   LL
Sbjct: 416 RNYAKDLGVLL 426


>gi|327266348|ref|XP_003217968.1| PREDICTED: general transcription factor IIH subunit 4-like [Anolis
           carolinensis]
          Length = 460

 Score =  110 bits (276), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 53/132 (40%), Positives = 84/132 (63%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           + +QGF++VETN+R+YAY+ S+L   ++ LFS++ Y+ PNL+V   T+ES+  A  NGIT
Sbjct: 299 TRNQGFIIVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQVTRESVQQAIANGIT 358

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP +  + P +P  + DQIRLWE + +R+  +    Y++F S+  FE 
Sbjct: 359 ADQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQVDFEL 418

Query: 155 ACDYARDRSGLL 166
             D+AR+   L+
Sbjct: 419 LRDHARELGVLI 430


>gi|403180193|ref|XP_003338499.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375166015|gb|EFP94080.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 679

 Score =  110 bits (276), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 51/123 (41%), Positives = 79/123 (64%), Gaps = 6/123 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
            GF+V+ETN+R+YAY+++ L   +L LF  + Y+ PNL+VGA T+ES+ +A  NGITA+Q
Sbjct: 146 HGFLVLETNYRIYAYTSNPLQIAVLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQ 205

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                  +AHP++    P +P  V DQIRLWE + NR+     + Y++F S   +++   
Sbjct: 206 VIMYLHTHAHPQMRKLEPLLPPTVVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQ 265

Query: 158 YAR 160
           Y+R
Sbjct: 266 YSR 268


>gi|403163300|ref|XP_003323393.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375163998|gb|EFP78974.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 887

 Score =  110 bits (275), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 51/123 (41%), Positives = 79/123 (64%), Gaps = 6/123 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
            GF+V+ETN+R+YAY+++ L   +L LF  + Y+ PNL+VGA T+ES+ +A  NGITA+Q
Sbjct: 351 HGFLVLETNYRIYAYTSNPLQIAVLNLFLSLRYRFPNLVVGAVTRESIKSALSNGITADQ 410

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                  +AHP++    P +P  V DQIRLWE + NR+     + Y++F S   +++   
Sbjct: 411 VIMYLHTHAHPQMRKLEPLLPPTVVDQIRLWELEKNRIRAQEGYLYEDFKSAAEYDSVIQ 470

Query: 158 YAR 160
           Y+R
Sbjct: 471 YSR 473


>gi|336370852|gb|EGN99192.1| hypothetical protein SERLA73DRAFT_90464 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336383609|gb|EGO24758.1| hypothetical protein SERLADRAFT_361686 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 469

 Score =  110 bits (274), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 53/124 (42%), Positives = 78/124 (62%), Gaps = 6/124 (4%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           SQGF+V+ETN+R+YAY+   L   +L LF  ++Y+ PNL+VG  T+ES+  A  NGI+AE
Sbjct: 309 SQGFIVLETNYRIYAYTDKPLQTAVLNLFITMKYRFPNLVVGMLTRESVKKALSNGISAE 368

Query: 103 Q------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAAC 156
           Q       +AHP++    P +P  V DQIRLWE + NR++    + Y +F S   +E   
Sbjct: 369 QIISYLTTHAHPQMRKNNPLLPVTVQDQIRLWELERNRLKSEEGYLYKDFGSHADYEYVL 428

Query: 157 DYAR 160
           +YA+
Sbjct: 429 NYAK 432


>gi|325181847|emb|CCA16302.1| general transcription factor IIH subunit putative [Albugo laibachii
           Nc14]
          Length = 473

 Score =  110 bits (274), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 55/131 (41%), Positives = 82/131 (62%), Gaps = 6/131 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S S    +VETNF++YAY+TS LH  +L +F  I  +LPNL +G  T+ESL +A  +GI+
Sbjct: 312 SMSDLLTIVETNFKVYAYTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGIS 371

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q      ++AHPR+    P +PEN+ DQI LWE + NR++      +D F S++ FE 
Sbjct: 372 AQQIYDFLLKHAHPRMKKNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQ 431

Query: 155 ACDYARDRSGL 165
             +YA+ +  L
Sbjct: 432 VREYAQKQRFL 442


>gi|390594544|gb|EIN03954.1| transcription factor Tfb2 [Punctularia strigosozonata HHB-11173
           SS5]
          Length = 467

 Score =  110 bits (274), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 53/125 (42%), Positives = 80/125 (64%), Gaps = 6/125 (4%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S+GF+V+ETN+R+YAY+ + L   +L LF+ +  + PNL+VG  T+ES+  A ++GI+AE
Sbjct: 307 SEGFIVLETNYRLYAYTDNPLQTAVLALFTTLRSRFPNLVVGQITRESVKRALQSGISAE 366

Query: 103 Q------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAAC 156
           Q       +AHP++    P IP  V DQIRLWE + NR++      Y EF S+  +E   
Sbjct: 367 QIISYLSTHAHPQMRKNNPLIPVTVQDQIRLWELEKNRLKSQEGFLYTEFASQGDYEYVL 426

Query: 157 DYARD 161
           +YAR+
Sbjct: 427 NYARE 431


>gi|325181222|emb|CCA15636.1| general transcription factor IIH subunit putative [Albugo laibachii
           Nc14]
          Length = 1033

 Score =  109 bits (273), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 55/131 (41%), Positives = 82/131 (62%), Gaps = 6/131 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S S    +VETNF++YAY+TS LH  +L +F  I  +LPNL +G  T+ESL +A  +GI+
Sbjct: 359 SMSDLLTIVETNFKVYAYTTSTLHIAMLSVFVDIVARLPNLAIGFITRESLRSALVHGIS 418

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q      ++AHPR+    P +PEN+ DQI LWE + NR++      +D F S++ FE 
Sbjct: 419 AQQIYDFLLKHAHPRMKKNNPIVPENIADQIYLWEKERNRIKFLEGILFDGFNSKEEFEQ 478

Query: 155 ACDYARDRSGL 165
             +YA+ +  L
Sbjct: 479 VREYAQKQRFL 489


>gi|393246879|gb|EJD54387.1| transcription factor Tfb2 [Auricularia delicata TFB-10046 SS5]
          Length = 480

 Score =  109 bits (273), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 53/126 (42%), Positives = 77/126 (61%), Gaps = 6/126 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           + S GF+++ETN+R+YAY+ + L   +L LF   + + PNL+VG  T++S+  A  NGIT
Sbjct: 314 ATSGGFIILETNYRLYAYTDNPLQIAVLNLFVSFKSRFPNLVVGMVTRDSVKKALANGIT 373

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ       +AHP++    P +P  V DQ+RLWE + NRV+      Y +F S+  FE 
Sbjct: 374 AEQIITYLSAHAHPQMRKNNPLLPVTVQDQVRLWELEKNRVKAEEGFLYMDFTSQADFEL 433

Query: 155 ACDYAR 160
             DYAR
Sbjct: 434 VLDYAR 439


>gi|213402795|ref|XP_002172170.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces japonicus yFS275]
 gi|212000217|gb|EEB05877.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces japonicus yFS275]
          Length = 444

 Score =  109 bits (273), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 52/125 (41%), Positives = 78/125 (62%), Gaps = 6/125 (4%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
            +GF++VETN+R+YAY+ S L   IL LF+ +  +  NLIVG  T++S+  A ++GITAE
Sbjct: 288 GKGFIIVETNYRLYAYTDSPLQIAILGLFTNLRARFSNLIVGVITRDSIRRALQSGITAE 347

Query: 103 Q------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAAC 156
           Q       +AHP++   IP +P  + DQI LWE + NR+  TP   + +F +   FE A 
Sbjct: 348 QIITYLTTHAHPQMRKEIPLLPPTLVDQIYLWELERNRLRATPGILFRDFLTDKDFEKAV 407

Query: 157 DYARD 161
            YA++
Sbjct: 408 QYAKE 412


>gi|301098671|ref|XP_002898428.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
 gi|262105199|gb|EEY63251.1| general transcription factor IIH subunit, putative [Phytophthora
           infestans T30-4]
          Length = 483

 Score =  109 bits (272), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 53/121 (43%), Positives = 80/121 (66%), Gaps = 6/121 (4%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ-- 103
            VVVETNF++YAY++S LH  +L +F  I  +LPNL +G  T+ESL +A  +GI+A+Q  
Sbjct: 326 LVVVETNFKIYAYTSSTLHVAMLSVFVDIVARLPNLAIGFITRESLRSALIHGISAQQIY 385

Query: 104 ----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYA 159
               ++AHP++    P IPEN+ DQI LWE + NRV+      +D F +++ +E+  DYA
Sbjct: 386 DFLMKHAHPKMRRNSPVIPENIADQIYLWERERNRVQFLEGILFDGFNTKEDYESVRDYA 445

Query: 160 R 160
           +
Sbjct: 446 K 446


>gi|320167307|gb|EFW44206.1| transcription factor tfb2 [Capsaspora owczarzaki ATCC 30864]
          Length = 406

 Score =  108 bits (271), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 79/127 (62%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ +G++VVETNFR+YAY+ S L   +L LF  + Y+ PN++ G  +++S+  A   G+T
Sbjct: 222 ADVKGYIVVETNFRIYAYTDSPLQLALLSLFVDLRYRFPNMVCGLLSRDSVRKALVKGLT 281

Query: 101 AEQQ------NAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ       +AHP++  R P +PE + DQ+RLWE + NR+ + PA  Y+ F ++   + 
Sbjct: 282 AEQMIRFLRTHAHPQMRSRTPVLPETISDQLRLWELERNRLRVLPAVLYERFSNQREHDL 341

Query: 155 ACDYARD 161
              Y RD
Sbjct: 342 LHHYGRD 348


>gi|242021693|ref|XP_002431278.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Pediculus humanus corporis]
 gi|212516535|gb|EEB18540.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Pediculus humanus corporis]
          Length = 467

 Score =  108 bits (271), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 55/127 (43%), Positives = 84/127 (66%), Gaps = 7/127 (5%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N +G++++ETN+R+YAY+ S L   +L LF ++ Y+ PNL VG  T++S+  AF++GITA
Sbjct: 305 NKEGYIIIETNYRVYAYTDSNLQVALLGLFCEMLYRFPNLSVGLITRDSVRQAFKSGITA 364

Query: 102 EQ------QNAHPR-VADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           EQ       +AHPR +A   P++P  V DQI+LWE++LNR+  +    Y +F S+  FEA
Sbjct: 365 EQIVGFLRLHAHPRMIAVGPPTLPPTVVDQIKLWENELNRLVYSDGVLYSQFLSQADFEA 424

Query: 155 ACDYARD 161
             D A +
Sbjct: 425 LRDRANE 431


>gi|393221241|gb|EJD06726.1| transcription factor Tfb2 [Fomitiporia mediterranea MF3/22]
          Length = 464

 Score =  108 bits (271), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 51/127 (40%), Positives = 80/127 (62%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           + +QGF+++ETN+R+YAY+ + L   +L LF  ++Y+ PNL+VG+ T++S+  A  NGIT
Sbjct: 301 AQAQGFIILETNYRIYAYTDNPLQTAVLNLFVTLKYRFPNLVVGSLTRDSVRKALANGIT 360

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q       +AHP++    P IP  V DQIRLWE + NR++    + Y  F S+  +E 
Sbjct: 361 ADQIIKYLTTHAHPQMRKNDPLIPVTVQDQIRLWELERNRLKSQEGYLYTAFASQADYEF 420

Query: 155 ACDYARD 161
              YA+ 
Sbjct: 421 VLKYAKQ 427


>gi|41053973|ref|NP_956221.1| general transcription factor IIH subunit 4 [Danio rerio]
 gi|39794736|gb|AAH64301.1| General transcription factor IIH, polypeptide 4 [Danio rerio]
          Length = 466

 Score =  108 bits (270), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 52/126 (41%), Positives = 79/126 (62%), Gaps = 6/126 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +   GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PNL+V   T+ES+  A  NGIT
Sbjct: 305 TGDTGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNLVVAQVTRESVQQAISNGIT 364

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP +  + P +P  + DQIRLWE + +R++ T    Y++F S+  FE 
Sbjct: 365 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELEKDRLQFTEGVLYNQFLSQADFEV 424

Query: 155 ACDYAR 160
             D A+
Sbjct: 425 LRDRAQ 430


>gi|453232318|ref|NP_001263825.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
 gi|412976516|emb|CCO25635.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
          Length = 435

 Score =  108 bits (270), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 61/150 (40%), Positives = 90/150 (60%), Gaps = 11/150 (7%)

Query: 26  MDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGA 85
           + TN  +  +  E +SN  G V+VETNFR+YAY++S L   I+ LF+++ Y+  ++ VG 
Sbjct: 245 LATNETIDDVSAEKVSN--GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGM 302

Query: 86  TTKESLNNAFENGITAEQ------QNAHPR-VADRIP--SIPENVCDQIRLWESDLNRVE 136
            T+ES+  A ++GITA Q       NAHP+ +A   P   +P  V DQIRLWE +  R+ 
Sbjct: 303 ITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMN 362

Query: 137 MTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           +  A+ Y  F S D F   C+YAR ++ LL
Sbjct: 363 LKDAYIYSHFESEDEFHGVCEYARQQNILL 392


>gi|410911638|ref|XP_003969297.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Takifugu rubripes]
          Length = 465

 Score =  108 bits (269), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 53/132 (40%), Positives = 81/132 (61%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S   GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  NGIT
Sbjct: 304 SGDSGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAQVTRESVQQAIANGIT 363

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP +  + P +P  + DQIRLWE + +R++ T    Y++F S+  FE 
Sbjct: 364 AQQIIHFLRTRAHPVMLRQTPFLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQTDFEV 423

Query: 155 ACDYARDRSGLL 166
             D A+    L+
Sbjct: 424 LRDRAKSLGCLV 435


>gi|392901992|ref|NP_001255864.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
 gi|290457493|emb|CBK19514.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
          Length = 431

 Score =  108 bits (269), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/150 (40%), Positives = 90/150 (60%), Gaps = 11/150 (7%)

Query: 26  MDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGA 85
           + TN  +  +  E +SN  G V+VETNFR+YAY++S L   I+ LF+++ Y+  ++ VG 
Sbjct: 241 LATNETIDDVSAEKVSN--GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGM 298

Query: 86  TTKESLNNAFENGITAEQ------QNAHPR-VADRIP--SIPENVCDQIRLWESDLNRVE 136
            T+ES+  A ++GITA Q       NAHP+ +A   P   +P  V DQIRLWE +  R+ 
Sbjct: 299 ITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMN 358

Query: 137 MTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           +  A+ Y  F S D F   C+YAR ++ LL
Sbjct: 359 LKDAYIYSHFESEDEFHGVCEYARQQNILL 388


>gi|392901990|ref|NP_001255863.1| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
 gi|26985915|emb|CAB70230.2| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
          Length = 481

 Score =  108 bits (269), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/150 (40%), Positives = 90/150 (60%), Gaps = 11/150 (7%)

Query: 26  MDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGA 85
           + TN  +  +  E +SN  G V+VETNFR+YAY++S L   I+ LF+++ Y+  ++ VG 
Sbjct: 291 LATNETIDDVSAEKVSN--GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGM 348

Query: 86  TTKESLNNAFENGITAEQ------QNAHPR-VADRIP--SIPENVCDQIRLWESDLNRVE 136
            T+ES+  A ++GITA Q       NAHP+ +A   P   +P  V DQIRLWE +  R+ 
Sbjct: 349 ITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMN 408

Query: 137 MTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           +  A+ Y  F S D F   C+YAR ++ LL
Sbjct: 409 LKDAYIYSHFESEDEFHGVCEYARQQNILL 438


>gi|453232316|ref|NP_001263824.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
 gi|412976517|emb|CCO25636.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
          Length = 485

 Score =  108 bits (269), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/150 (40%), Positives = 90/150 (60%), Gaps = 11/150 (7%)

Query: 26  MDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGA 85
           + TN  +  +  E +SN  G V+VETNFR+YAY++S L   I+ LF+++ Y+  ++ VG 
Sbjct: 295 LATNETIDDVSAEKVSN--GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGM 352

Query: 86  TTKESLNNAFENGITAEQ------QNAHPR-VADRIP--SIPENVCDQIRLWESDLNRVE 136
            T+ES+  A ++GITA Q       NAHP+ +A   P   +P  V DQIRLWE +  R+ 
Sbjct: 353 ITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMN 412

Query: 137 MTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           +  A+ Y  F S D F   C+YAR ++ LL
Sbjct: 413 LKDAYIYSHFESEDEFHGVCEYARQQNILL 442


>gi|451847848|gb|EMD61155.1| hypothetical protein COCSADRAFT_97372 [Cochliobolus sativus ND90Pr]
          Length = 482

 Score =  107 bits (267), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 76/122 (62%), Gaps = 6/122 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +GF+++ETN+R+YAY+ S +   IL LF+K++++ PNL+ G  TKES++ A + GIT+ Q
Sbjct: 320 KGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITSAQ 379

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                   AHP++   +P IP  V DQIRLWE +  RVE T  +   EF S   +    +
Sbjct: 380 IISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAEYRDVLN 439

Query: 158 YA 159
           YA
Sbjct: 440 YA 441


>gi|443688956|gb|ELT91478.1| hypothetical protein CAPTEDRAFT_18660 [Capitella teleta]
          Length = 463

 Score =  107 bits (267), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 52/127 (40%), Positives = 75/127 (59%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  GF+VVETN+R+YAY+ S L   ++ LF  + Y+ PN+ VG  +++S+  A   GIT
Sbjct: 303 ASEDGFIVVETNYRVYAYTNSSLKVALISLFCSMMYRFPNMAVGVVSRDSVREALSRGIT 362

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ       +AHP    R P +P  V DQIRLWE + +R        Y++F S+  FE 
Sbjct: 363 AEQIINFLRNHAHPHTQKRKPILPPTVSDQIRLWELERDRFHFNEGVLYNQFLSQTDFEV 422

Query: 155 ACDYARD 161
             DYA+D
Sbjct: 423 LRDYAKD 429


>gi|403417620|emb|CCM04320.1| predicted protein [Fibroporia radiculosa]
          Length = 472

 Score =  107 bits (267), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 51/125 (40%), Positives = 79/125 (63%), Gaps = 6/125 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           + +GF+V+ETN+R+YAY+ + L   +L LF  ++ + PNL++GA T+ES+  A  NGITA
Sbjct: 311 HGEGFIVLETNYRIYAYTDNPLQIAVLNLFVTLKSRFPNLVIGAITRESVKKALANGITA 370

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           +Q       +AHP++    P +P  V DQIRLWE + NRV+    + Y  F S+  +E  
Sbjct: 371 DQIISYLTAHAHPQMHKNKPLLPVTVQDQIRLWELERNRVKSEEGYLYTSFASQADYEYV 430

Query: 156 CDYAR 160
            +YA+
Sbjct: 431 LNYAK 435


>gi|451996931|gb|EMD89397.1| hypothetical protein COCHEDRAFT_1141584 [Cochliobolus
           heterostrophus C5]
          Length = 482

 Score =  107 bits (267), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 76/122 (62%), Gaps = 6/122 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +GF+++ETN+R+YAY+ S +   IL LF+K++++ PNL+ G  TKES++ A + GIT+ Q
Sbjct: 320 KGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITSAQ 379

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                   AHP++   +P IP  V DQIRLWE +  RVE T  +   EF S   +    +
Sbjct: 380 IISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTTGYLMREFGSDAEYRDVLN 439

Query: 158 YA 159
           YA
Sbjct: 440 YA 441


>gi|341886092|gb|EGT42027.1| hypothetical protein CAEBREN_00374 [Caenorhabditis brenneri]
          Length = 460

 Score =  107 bits (266), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 56/133 (42%), Positives = 83/133 (62%), Gaps = 9/133 (6%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S G V+VETNFR+YAY++S L   I+ LF+++ Y+  ++ VG  T+ES+  A ++GITA 
Sbjct: 293 SNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAA 352

Query: 103 Q------QNAHPR-VADRIP--SIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           Q       NAHP+ +A   P   +P  V DQIRLWE +  R+++  ++ Y  F S + F 
Sbjct: 353 QIISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMDLKDSYIYSNFESDEEFN 412

Query: 154 AACDYARDRSGLL 166
             CDYA++R  LL
Sbjct: 413 GVCDYAKERKILL 425


>gi|409048320|gb|EKM57798.1| hypothetical protein PHACADRAFT_171015 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 472

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 51/131 (38%), Positives = 80/131 (61%), Gaps = 10/131 (7%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ QGF+++ETN+R+YAY+ + L   +L LF  ++ + PNL+VGA T+ES+  A  NGIT
Sbjct: 306 AHGQGFIILETNYRLYAYTDNPLQIAVLNLFVTLKSRYPNLVVGAITRESVKKALTNGIT 365

Query: 101 AEQ----------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
           A+Q           +AHP++    P +P  V DQIRLWE + NR++    + Y  F S+ 
Sbjct: 366 ADQASRQIISYLTTHAHPQMRKNKPLLPVTVQDQIRLWELEKNRMKSQEGYLYTAFASQA 425

Query: 151 VFEAACDYARD 161
            +E   +YA+ 
Sbjct: 426 DYEYVLNYAKQ 436


>gi|432883135|ref|XP_004074222.1| PREDICTED: general transcription factor IIH subunit 4-like [Oryzias
           latipes]
          Length = 460

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 82/132 (62%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +   GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  NGIT
Sbjct: 299 TGDAGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAHLTRESVQQAIANGIT 358

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP +  + P++P  + DQIRLWE + +R++ T    Y++F S+  FE 
Sbjct: 359 AQQIIHFLRTRAHPVMLKQSPALPPTITDQIRLWELERDRLQFTEGVLYNQFLSQADFEV 418

Query: 155 ACDYARDRSGLL 166
             D A+    L+
Sbjct: 419 LRDRAQGLGCLV 430


>gi|392564310|gb|EIW57488.1| transcription factor Tfb2 [Trametes versicolor FP-101664 SS1]
          Length = 477

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 50/126 (39%), Positives = 81/126 (64%), Gaps = 6/126 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S++QGF+V+ETN+R+YAY+ + L   +L LF  ++ + PNL++GA T++S+  A  +GIT
Sbjct: 314 SHAQGFIVLETNYRLYAYTDNPLQIAVLNLFVTLKSRFPNLVIGAITRDSVKKALASGIT 373

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q       +AHP++    P +P  V DQIRLWE + NR++    + Y  F S+  +E 
Sbjct: 374 ADQIISYLVTHAHPQMRKNQPLLPVTVQDQIRLWELEKNRMKSQEGYLYTAFASQADYEY 433

Query: 155 ACDYAR 160
             +YA+
Sbjct: 434 VLNYAK 439


>gi|74143475|dbj|BAE28812.1| unnamed protein product [Mus musculus]
          Length = 463

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+     +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQVIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+ VFE    +
Sbjct: 366 IHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVVFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|348542188|ref|XP_003458568.1| PREDICTED: general transcription factor IIH subunit 4 [Oreochromis
           niloticus]
          Length = 461

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 81/132 (61%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +   GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  NGIT
Sbjct: 300 TGDAGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAQLTRESVQQAIANGIT 359

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP +  + P +P  + DQIRLWE + +R++ T    Y++F S+  FE 
Sbjct: 360 AQQIIHFLRTRAHPVMLTQTPVLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQADFEV 419

Query: 155 ACDYARDRSGLL 166
             D A+    L+
Sbjct: 420 LRDRAQGLGCLV 431


>gi|19113355|ref|NP_596563.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces pombe 972h-]
 gi|62901133|sp|P87303.2|TFB2_SCHPO RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|3080538|emb|CAA18666.1| transcription factor TFIIH complex subunit Tfb2
           [Schizosaccharomyces pombe]
          Length = 447

 Score =  106 bits (265), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 79/126 (62%), Gaps = 6/126 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           + +GF++VETN+R+YAY++S L   I+ LF+ +  +  NL+VG  T++S+  A  NGI A
Sbjct: 288 DDKGFIIVETNYRLYAYTSSPLQIAIIGLFANLRARFSNLVVGVITRDSIRRALMNGIAA 347

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           EQ       +AHP++   +P +P  + DQI LWE + NR+  TP   + +F +   F+ A
Sbjct: 348 EQIITYLTTHAHPQMRSNVPLLPPTLVDQIYLWELEKNRLRATPGILFRDFLTDSGFDQA 407

Query: 156 CDYARD 161
            +YA++
Sbjct: 408 VEYAKE 413


>gi|149031822|gb|EDL86757.1| rCG41794 [Rattus norvegicus]
          Length = 209

 Score =  106 bits (264), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 52  GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 111

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 112 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 171

Query: 159 ARD 161
           AR+
Sbjct: 172 ARE 174


>gi|155371845|ref|NP_001094527.1| general transcription factor IIH subunit 4 [Bos taurus]
 gi|148878127|gb|AAI46241.1| GTF2H4 protein [Bos taurus]
 gi|296474219|tpg|DAA16334.1| TPA: general transcription factor IIH, polypeptide 4, 52kDa [Bos
           taurus]
          Length = 463

 Score =  106 bits (264), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 51/128 (39%), Positives = 80/128 (62%), Gaps = 6/128 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARDRSGLL 166
           AR+   L+
Sbjct: 426 ARELGVLM 433


>gi|178056954|ref|NP_001116592.1| general transcription factor IIH subunit 4 [Sus scrofa]
 gi|41529162|dbj|BAD08424.1| general transcription factor IIH, polypeptide 4 [Sus scrofa]
 gi|47496805|dbj|BAD08426.2| general transcription factor IIH, polypeptide 4 [Sus scrofa]
          Length = 463

 Score =  106 bits (264), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|417401351|gb|JAA47565.1| Putative rna polymer [Desmodus rotundus]
          Length = 463

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/127 (39%), Positives = 80/127 (62%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE 
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421

Query: 155 ACDYARD 161
              +AR+
Sbjct: 422 LLAHARE 428


>gi|338718559|ref|XP_003363845.1| PREDICTED: general transcription factor IIH subunit 4 [Equus
           caballus]
          Length = 463

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|395831929|ref|XP_003789035.1| PREDICTED: general transcription factor IIH subunit 4 [Otolemur
           garnettii]
          Length = 463

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|344252968|gb|EGW09072.1| General transcription factor IIH subunit 4 [Cricetulus griseus]
          Length = 463

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|281337500|gb|EFB13084.1| hypothetical protein PANDA_018954 [Ailuropoda melanoleuca]
          Length = 463

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/127 (39%), Positives = 80/127 (62%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE 
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421

Query: 155 ACDYARD 161
              +AR+
Sbjct: 422 LLAHARE 428


>gi|169610043|ref|XP_001798440.1| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
 gi|160701967|gb|EAT84391.2| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
          Length = 426

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 58/155 (37%), Positives = 86/155 (55%), Gaps = 17/155 (10%)

Query: 12  SEDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLF 71
           S  L    +G  E  D+ AQ           ++GF++VETN+R+YAY+ S +   IL LF
Sbjct: 246 SGALPGSDVGTTEKPDSKAQ-----------NKGFIIVETNYRLYAYTNSLIQIAILSLF 294

Query: 72  SKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPENVCDQI 125
           +K++++ PNL+ G  TKES++ A + GIT+ Q        AHP++    P +P  V DQI
Sbjct: 295 TKLQHRFPNLVSGKLTKESVHRAVQAGITSAQIISYLTTYAHPQMQKSNPPLPPTVMDQI 354

Query: 126 RLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYAR 160
           RLWE +  RVE+T  +   EF S   +     YA+
Sbjct: 355 RLWEYEGERVEVTTGYLMREFGSESEYRDVMGYAQ 389


>gi|4504201|ref|NP_001508.1| general transcription factor IIH subunit 4 [Homo sapiens]
 gi|113865855|ref|NP_001038968.1| general transcription factor IIH subunit 4 [Pan troglodytes]
 gi|114050799|ref|NP_001040607.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|297677636|ref|XP_002816695.1| PREDICTED: general transcription factor IIH subunit 4 isoform 2
           [Pongo abelii]
 gi|332245940|ref|XP_003272109.1| PREDICTED: general transcription factor IIH subunit 4 [Nomascus
           leucogenys]
 gi|426352309|ref|XP_004043656.1| PREDICTED: general transcription factor IIH subunit 4 [Gorilla
           gorilla gorilla]
 gi|17380328|sp|Q92759.1|TF2H4_HUMAN RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=Basic transcription factor 2 52 kDa subunit;
           Short=BTF2 p52; AltName: Full=General transcription
           factor IIH polypeptide 4; AltName: Full=TFIIH basal
           transcription factor complex p52 subunit
 gi|38503276|sp|P60027.1|TF2H4_PANTR RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=Basic transcription factor 2 52 kDa subunit;
           Short=BTF2 p52; AltName: Full=General transcription
           factor IIH polypeptide 4; AltName: Full=TFIIH basal
           transcription factor complex p52 subunit
 gi|1514597|emb|CAA68870.1| transcription factor TFIIH [Homo sapiens]
 gi|13436278|gb|AAH04935.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|15277224|dbj|BAB63317.1| Transcription factor II H [Homo sapiens]
 gi|16740884|gb|AAH16302.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|21655317|gb|AAM64222.1| general transcription factor IIH, polypeptide 4 (52kD subunit)
           [Homo sapiens]
 gi|27544403|dbj|BAC54936.1| transcription factor II H [Homo sapiens]
 gi|30583481|gb|AAP35985.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|32127781|dbj|BAC78171.1| transcription factor II H [Pan troglodytes]
 gi|55700798|dbj|BAD69753.1| general transcription factor IIH, polypeptide 4, 52kDa [Macaca
           mulatta]
 gi|60655839|gb|AAX32483.1| general transcription factor IIH polypeptide 4 [synthetic
           construct]
 gi|60655841|gb|AAX32484.1| general transcription factor IIH polypeptide 4 [synthetic
           construct]
 gi|86197966|dbj|BAE78622.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|90960934|dbj|BAE92819.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
 gi|90960936|dbj|BAE92820.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
 gi|114306784|dbj|BAF31271.1| TFIIH protein [Homo sapiens]
 gi|119623752|gb|EAX03347.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
           sapiens]
 gi|123992979|gb|ABM84091.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
           construct]
 gi|123999907|gb|ABM87462.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
           construct]
 gi|355561507|gb|EHH18139.1| General transcription factor IIH polypeptide 4, partial [Macaca
           mulatta]
 gi|355748409|gb|EHH52892.1| General transcription factor IIH polypeptide 4 [Macaca
           fascicularis]
 gi|380812138|gb|AFE77944.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|380812140|gb|AFE77945.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|383408563|gb|AFH27495.1| general transcription factor IIH subunit 4 [Macaca mulatta]
 gi|410214852|gb|JAA04645.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
 gi|410250640|gb|JAA13287.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
 gi|410290944|gb|JAA24072.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
 gi|410352807|gb|JAA43007.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
           troglodytes]
          Length = 462

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 305 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 364

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 365 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 424

Query: 159 ARD 161
           AR+
Sbjct: 425 ARE 427


>gi|47059175|ref|NP_997666.1| general transcription factor II H, polypeptide 4 [Rattus
           norvegicus]
 gi|46237651|emb|CAE84027.1| general transcription factor II H, polypeptide 4 [Rattus
           norvegicus]
 gi|117558335|gb|AAI27470.1| General transcription factor II H, polypeptide 4 [Rattus
           norvegicus]
          Length = 463

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|440896165|gb|ELR48176.1| General transcription factor IIH subunit 4 [Bos grunniens mutus]
          Length = 463

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|426250586|ref|XP_004019016.1| PREDICTED: general transcription factor IIH subunit 4 [Ovis aries]
          Length = 463

 Score =  106 bits (264), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|261187837|ref|XP_002620336.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Ajellomyces dermatitidis SLH14081]
 gi|239593453|gb|EEQ76034.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Ajellomyces dermatitidis SLH14081]
 gi|239608433|gb|EEQ85420.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Ajellomyces dermatitidis ER-3]
 gi|327356043|gb|EGE84900.1| RNA pol II transcription initiation subunit [Ajellomyces
           dermatitidis ATCC 18188]
          Length = 485

 Score =  105 bits (263), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 52/140 (37%), Positives = 83/140 (59%), Gaps = 18/140 (12%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+Q 
Sbjct: 318 GFIIIETNYRLYAYTSSPLQISLISLFTTLKYRFPNLITGKLTRQSIRRAVEMGITADQI 377

Query: 104 -----QNAHPRVAD-----------RIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDEF 146
                 +AHP++              +P++ P  V DQIRLW+ + +R++ TP   + +F
Sbjct: 378 ISYLTTHAHPQMRKYHASKSGANPAGVPAVLPPTVVDQIRLWQLERDRIKATPGFLFKDF 437

Query: 147 PSRDVFEAACDYARDRSGLL 166
            S   FE  C YA +   LL
Sbjct: 438 VSLAEFEGPCRYAEEIGVLL 457


>gi|444727354|gb|ELW67853.1| Valyl-tRNA synthetase, mitochondrial [Tupaia chinensis]
          Length = 1382

 Score =  105 bits (263), Expect = 5e-21,   Method: Composition-based stats.
 Identities = 50/127 (39%), Positives = 80/127 (62%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GIT
Sbjct: 259 AHQPGFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 318

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE 
Sbjct: 319 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 378

Query: 155 ACDYARD 161
              +AR+
Sbjct: 379 LLAHARE 385


>gi|325089533|gb|EGC42843.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
           capsulatus H88]
          Length = 485

 Score =  105 bits (263), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 82/137 (59%), Gaps = 18/137 (13%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+
Sbjct: 316 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 375

Query: 103 Q------QNAHPRVAD-----------RIPSI-PENVCDQIRLWESDLNRVEMTPAHYYD 144
           Q       +AHP++              +P++ P  V DQIRLW+ + +R++ TP   + 
Sbjct: 376 QIISYLTTHAHPQMRKYHATKSGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFK 435

Query: 145 EFPSRDVFEAACDYARD 161
           +F S   FE  C YA +
Sbjct: 436 DFVSLAEFEGPCRYAEE 452


>gi|240279262|gb|EER42767.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
           capsulatus H143]
          Length = 485

 Score =  105 bits (263), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 82/137 (59%), Gaps = 18/137 (13%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+
Sbjct: 316 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 375

Query: 103 Q------QNAHPRVAD-----------RIPSI-PENVCDQIRLWESDLNRVEMTPAHYYD 144
           Q       +AHP++              +P++ P  V DQIRLW+ + +R++ TP   + 
Sbjct: 376 QIISYLTTHAHPQMRKYHATKSGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFK 435

Query: 145 EFPSRDVFEAACDYARD 161
           +F S   FE  C YA +
Sbjct: 436 DFVSLAEFEGPCRYAEE 452


>gi|359320969|ref|XP_003639474.1| PREDICTED: general transcription factor IIH subunit 4-like [Canis
           lupus familiaris]
          Length = 463

 Score =  105 bits (263), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 80/127 (62%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  GF++VETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GIT
Sbjct: 302 AHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE 
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421

Query: 155 ACDYARD 161
              +AR+
Sbjct: 422 LLAHARE 428


>gi|344307658|ref|XP_003422497.1| PREDICTED: general transcription factor IIH subunit 4 [Loxodonta
           africana]
          Length = 463

 Score =  105 bits (263), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 49/127 (38%), Positives = 80/127 (62%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  GF++VETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GIT
Sbjct: 302 AHQPGFIIVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 361

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE 
Sbjct: 362 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421

Query: 155 ACDYARD 161
              +AR+
Sbjct: 422 LLAHARE 428


>gi|449540295|gb|EMD31288.1| hypothetical protein CERSUDRAFT_119839 [Ceriporiopsis subvermispora
           B]
          Length = 412

 Score =  105 bits (262), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 51/123 (41%), Positives = 77/123 (62%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+V+ETN+R+YAY+ + L   +L LF  ++ + PNL++GA T+ES+  A  NGITA+Q 
Sbjct: 254 GFIVLETNYRVYAYTDNPLQIAVLNLFVTMKSRFPNLVIGAVTRESVKKALANGITADQI 313

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                 +AHP++    P +P  V DQIRLWE + NRV+    + Y  F S+  +E   +Y
Sbjct: 314 ISYLTAHAHPQMRRYKPLLPVTVQDQIRLWELEKNRVKSQEGYLYTAFASQADYEYVLNY 373

Query: 159 ARD 161
           A+ 
Sbjct: 374 AKQ 376


>gi|169784197|ref|XP_001826560.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus oryzae RIB40]
 gi|238508739|ref|XP_002385554.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus flavus NRRL3357]
 gi|83775305|dbj|BAE65427.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|220688446|gb|EED44799.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus flavus NRRL3357]
 gi|391868531|gb|EIT77745.1| RNA polymerase II transcription initiation protein [Aspergillus
           oryzae 3.042]
          Length = 480

 Score =  105 bits (262), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 54/136 (39%), Positives = 82/136 (60%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF++VETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+Q 
Sbjct: 315 GFIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAIEMGITADQI 374

Query: 104 -----QNAHPRV------------ADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++            A   PS+ P  V DQIRLW+ + +RV+ TP   + +
Sbjct: 375 ISYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVTDQIRLWQLERDRVKATPGFLFKD 434

Query: 146 FPSRDVFEAACDYARD 161
           F S   +EA C YA +
Sbjct: 435 FVSLAEYEAPCRYAEE 450


>gi|396459783|ref|XP_003834504.1| similar to RNA polymerase II transcription factor B subunit 2
           [Leptosphaeria maculans JN3]
 gi|312211053|emb|CBX91139.1| similar to RNA polymerase II transcription factor B subunit 2
           [Leptosphaeria maculans JN3]
          Length = 483

 Score =  105 bits (262), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 53/129 (41%), Positives = 78/129 (60%), Gaps = 7/129 (5%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +GF+++ETN+R+YAY+ S +   IL LF+K++++ PNL+ G  TKES++ A   GIT+ Q
Sbjct: 322 KGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVSAGITSAQ 381

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                   AHP++   +  IP  V DQIRLWE +  RVE+TP     +F S   +     
Sbjct: 382 IISYLSTYAHPQMQKNVSFIPPTVMDQIRLWEYEGERVEVTPGFLMKDFGSDAEYRDVLG 441

Query: 158 YARDRSGLL 166
           YA D  G+L
Sbjct: 442 YA-DALGVL 449


>gi|62896773|dbj|BAD96327.1| general transcription factor IIH, polypeptide 4, 52kDa variant
           [Homo sapiens]
          Length = 462

 Score =  105 bits (262), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 77/123 (62%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 305 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 364

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE   +R+  T    Y++F S+  FE    +
Sbjct: 365 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELGRDRLRFTEGVLYNQFLSQVDFELLLAH 424

Query: 159 ARD 161
           AR+
Sbjct: 425 ARE 427


>gi|198428509|ref|XP_002131018.1| PREDICTED: similar to General transcription factor II H,
           polypeptide 4 [Ciona intestinalis]
          Length = 463

 Score =  105 bits (262), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 52/133 (39%), Positives = 82/133 (61%), Gaps = 6/133 (4%)

Query: 40  ISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGI 99
           ++NS+GF++ ETNFR+YAY+ S+L Y IL LF K+ Y+ PN+ V   T++S+ +A  NGI
Sbjct: 299 VTNSRGFILAETNFRVYAYTDSELQYSILSLFCKMLYRFPNVCVLQLTRKSIQSAVVNGI 358

Query: 100 TAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           TA+Q       NAHP +    P I   + DQ+RLW  + +R+       Y++F ++  FE
Sbjct: 359 TADQILHYIKANAHPDMLKNDPIIAPTLADQVRLWAMERDRLTYRDGVLYNQFLAQKDFE 418

Query: 154 AACDYARDRSGLL 166
              +YA++   L+
Sbjct: 419 VLRNYAKELGALI 431


>gi|351713119|gb|EHB16038.1| General transcription factor IIH subunit 4 [Heterocephalus glaber]
          Length = 463

 Score =  105 bits (262), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQVALIALFSEMLYRFPNVVVAQVTRESVQQAIGSGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|410958682|ref|XP_003985944.1| PREDICTED: general transcription factor IIH subunit 4 [Felis catus]
          Length = 463

 Score =  105 bits (262), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQSPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|392594388|gb|EIW83712.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
          Length = 486

 Score =  105 bits (262), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 50/127 (39%), Positives = 79/127 (62%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +N  GF+V+ETN+R+YAY+ + L   +L LF  ++Y+ PNL+VG  T+ES+  A  NGI+
Sbjct: 324 TNDHGFIVLETNYRVYAYTDNPLQIAVLNLFITLKYRFPNLVVGMLTRESVRRALGNGIS 383

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ       +AHP++    P +P  V DQ+RLWE + NR++      Y +F ++  +E 
Sbjct: 384 AEQVISYLTAHAHPQMRKNNPLLPVTVQDQVRLWELERNRLKSDDGFLYKDFATQADYEY 443

Query: 155 ACDYARD 161
              YA++
Sbjct: 444 VLAYAKE 450


>gi|121714341|ref|XP_001274781.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus clavatus NRRL 1]
 gi|119402935|gb|EAW13355.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus clavatus NRRL 1]
          Length = 478

 Score =  105 bits (261), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 82/136 (60%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+Q 
Sbjct: 313 GFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITADQI 372

Query: 104 -----QNAHPRV----ADR---------IPSIPENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++    A R         +  +P  V DQIRLW+ + +RV+ TP   + +
Sbjct: 373 ISYLSTHAHPQMRKHNASRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATPGFLFKD 432

Query: 146 FPSRDVFEAACDYARD 161
           F S   +EA C YA +
Sbjct: 433 FVSLAEYEAPCRYAEE 448


>gi|119179887|ref|XP_001241468.1| hypothetical protein CIMG_08631 [Coccidioides immitis RS]
 gi|392866653|gb|EAS30172.2| transcription factor Tfb2 [Coccidioides immitis RS]
          Length = 487

 Score =  105 bits (261), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 51/136 (37%), Positives = 82/136 (60%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G  T++S+  A E GITA+Q 
Sbjct: 319 GFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGITADQI 378

Query: 104 -----QNAHPRV------------ADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++            A  + S+ P  V DQIRLW+ + +R++ TP   + +
Sbjct: 379 ISYLTTHAHPQMRKVNASKSTSTTAGLLASVLPPTVVDQIRLWQLERDRLKATPGFLFKD 438

Query: 146 FPSRDVFEAACDYARD 161
           F S   FEA C YA +
Sbjct: 439 FASSAEFEAPCRYAEE 454


>gi|242794214|ref|XP_002482325.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces stipitatus ATCC 10500]
 gi|218718913|gb|EED18333.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces stipitatus ATCC 10500]
          Length = 481

 Score =  105 bits (261), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 51/136 (37%), Positives = 80/136 (58%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G  T++S+  A E GITA+Q 
Sbjct: 315 GFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQI 374

Query: 104 -----QNAHP--RVADRIPS-----------IPENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP  R  D + S           +P  V DQIRLW+ + +R++ TP   + E
Sbjct: 375 ISYLSTHAHPQMRKEDAVKSTSNTAGLPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKE 434

Query: 146 FPSRDVFEAACDYARD 161
           F S   +E  C YA +
Sbjct: 435 FASLAEYEGPCRYAEE 450


>gi|298705308|emb|CBJ48998.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 534

 Score =  105 bits (261), Expect = 9e-21,   Method: Composition-based stats.
 Identities = 53/127 (41%), Positives = 75/127 (59%), Gaps = 7/127 (5%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++VETNF++ AY+ SKLH+ +L LF ++   LPN IVGA T+ES+  A   GI   Q   
Sbjct: 350 IIVETNFQVIAYTRSKLHFAMLSLFLELRALLPNAIVGAITRESMRKALSTGIKGRQVLD 409

Query: 104 ---QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYD-EFPSRDVFEAACDYA 159
               +AHP V  R P +PEN+ DQ+ LWE + +R+E       D  + SRD F    ++A
Sbjct: 410 FLKWHAHPVVRRRTPVVPENIADQVLLWERERDRMEHRDGVLVDVSYASRDAFRGMTEFA 469

Query: 160 RDRSGLL 166
             + GLL
Sbjct: 470 NAKQGLL 476


>gi|409077631|gb|EKM77996.1| hypothetical protein AGABI1DRAFT_114851 [Agaricus bisporus var.
           burnettii JB137-S8]
 gi|426198975|gb|EKV48900.1| hypothetical protein AGABI2DRAFT_191077 [Agaricus bisporus var.
           bisporus H97]
          Length = 467

 Score =  105 bits (261), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 49/122 (40%), Positives = 78/122 (63%), Gaps = 6/122 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+V+ETN+R+YAY+ + L   +L LF  ++Y+ PNL+VG  T++S+  A  NGITA+Q 
Sbjct: 308 GFIVLETNYRIYAYTDNPLQTAVLNLFVSLKYRFPNLVVGMITRDSVRRALMNGITADQI 367

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                 +AHP++    P +P  V DQIRLWE + NR++ +  + Y  F S+  ++   +Y
Sbjct: 368 ISYLITHAHPQMRKNNPLLPVTVQDQIRLWELEKNRLKSSEGYLYTAFTSQADYDLVLNY 427

Query: 159 AR 160
           A+
Sbjct: 428 AK 429


>gi|6754094|ref|NP_034494.1| general transcription factor IIH subunit 4 [Mus musculus]
 gi|18202124|sp|O70422.1|TF2H4_MOUSE RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=Basic transcription factor 2 52 kDa subunit;
           Short=BTF2 p52; AltName: Full=General transcription
           factor IIH polypeptide 4; AltName: Full=TFIIH basal
           transcription factor complex p52 subunit
 gi|2997755|gb|AAC08594.1| TFIIH transcription/DNA repair factor p52 subunit [Mus musculus]
 gi|15215027|gb|AAH12638.1| Gtf2h4 protein [Mus musculus]
 gi|26348003|dbj|BAC37650.1| unnamed protein product [Mus musculus]
 gi|74194835|dbj|BAE26009.1| unnamed protein product [Mus musculus]
 gi|74222199|dbj|BAE26910.1| unnamed protein product [Mus musculus]
          Length = 463

 Score =  105 bits (261), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 365

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +
Sbjct: 366 IHFLRTRAHPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAH 425

Query: 159 ARD 161
           AR+
Sbjct: 426 ARE 428


>gi|302667863|ref|XP_003025510.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
 gi|291189624|gb|EFE44899.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
          Length = 386

 Score =  104 bits (260), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 50/131 (38%), Positives = 79/131 (60%), Gaps = 14/131 (10%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF++VETN+R+YAY++S L   ++ LF+ ++++ PNLI G  T++S+  A E GITA+Q 
Sbjct: 222 GFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQI 281

Query: 104 -----QNAHPRVADRIPS--------IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
                 +AHP++     S        +P  V DQIRLW+ + +R++ T    + EF +  
Sbjct: 282 ISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFA 341

Query: 151 VFEAACDYARD 161
            FEA C YA +
Sbjct: 342 EFEAPCKYAEE 352


>gi|302495702|ref|XP_003009866.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
 gi|291173385|gb|EFE29221.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
          Length = 386

 Score =  104 bits (260), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 50/131 (38%), Positives = 79/131 (60%), Gaps = 14/131 (10%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF++VETN+R+YAY++S L   ++ LF+ ++++ PNLI G  T++S+  A E GITA+Q 
Sbjct: 222 GFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQI 281

Query: 104 -----QNAHPRVADRIPS--------IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
                 +AHP++     S        +P  V DQIRLW+ + +R++ T    + EF +  
Sbjct: 282 ISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFA 341

Query: 151 VFEAACDYARD 161
            FEA C YA +
Sbjct: 342 EFEAPCKYAEE 352


>gi|299750117|ref|XP_001836551.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
           cinerea okayama7#130]
 gi|298408751|gb|EAU85259.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
           cinerea okayama7#130]
          Length = 403

 Score =  104 bits (260), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 49/124 (39%), Positives = 78/124 (62%), Gaps = 6/124 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +GF+++ETN+R+YAY+ + L   +L LF+ ++Y+ PNL+VG+ T+ES+  A  NGI+A+Q
Sbjct: 244 EGFIILETNYRIYAYTDNPLQTAVLSLFASLKYRFPNLVVGSITRESVKKALLNGISADQ 303

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                  +AHP +    P +P  V DQIRLWE + NR++      Y  F S+  +E    
Sbjct: 304 IISYLITHAHPNMRKNNPLLPVTVQDQIRLWELEKNRLKSREGFLYTAFASQADYELVLQ 363

Query: 158 YARD 161
           YA++
Sbjct: 364 YAKE 367


>gi|295662078|ref|XP_002791593.1| RNA polymerase II transcription factor B subunit 2
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226279719|gb|EEH35285.1| RNA polymerase II transcription factor B subunit 2
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 484

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 51/136 (37%), Positives = 80/136 (58%), Gaps = 17/136 (12%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S GF+V+ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+
Sbjct: 316 STGFIVIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITAD 375

Query: 103 Q------QNAHPRVADRIPS-----------IPENVCDQIRLWESDLNRVEMTPAHYYDE 145
           Q       +AHP++     S           +P  V DQIRLW+ + +R++ TP   + +
Sbjct: 376 QIISYLTTHAHPQMRKYHASKSSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKD 435

Query: 146 FPSRDVFEAACDYARD 161
           F S   F+  C YA +
Sbjct: 436 FVSLAEFDGPCRYAEE 451


>gi|440636738|gb|ELR06657.1| hypothetical protein GMDG_00274 [Geomyces destructans 20631-21]
          Length = 470

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 50/130 (38%), Positives = 81/130 (62%), Gaps = 7/130 (5%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           + GF+V+ETN+R+YAY++S L   +L LF+K+  + PN++ G  T++S+  A E+GITA+
Sbjct: 314 TTGFIVIETNYRLYAYTSSPLQIAVLALFAKLSTRYPNMVSGRVTRDSVRGAIEHGITAD 373

Query: 103 Q------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAAC 156
           Q       +AHP++  + P +P  V DQIRLW+ +  R++ T    + +F S   +E   
Sbjct: 374 QIISYLTTHAHPQLRRKSPVLPPTVVDQIRLWQIENERMKATAGFLFKDFASLAEYEGCV 433

Query: 157 DYARDRSGLL 166
            YA D  G+L
Sbjct: 434 RYA-DEIGVL 442


>gi|225682227|gb|EEH20511.1| RNA polymerase II transcription factor B subunit 2
           [Paracoccidioides brasiliensis Pb03]
 gi|226289599|gb|EEH45083.1| TFIIH basal transcription factor complex p52 subunit
           [Paracoccidioides brasiliensis Pb18]
          Length = 484

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 51/136 (37%), Positives = 80/136 (58%), Gaps = 17/136 (12%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S GF+V+ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+
Sbjct: 316 STGFIVIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVEMGITAD 375

Query: 103 Q------QNAHPRVADRIPS-----------IPENVCDQIRLWESDLNRVEMTPAHYYDE 145
           Q       +AHP++     S           +P  V DQIRLW+ + +R++ TP   + +
Sbjct: 376 QIISYLTTHAHPQMRKYHASKSSHTVGAPAVLPPTVVDQIRLWQLERDRIKATPGFLFKD 435

Query: 146 FPSRDVFEAACDYARD 161
           F S   F+  C YA +
Sbjct: 436 FVSLAEFDGPCRYAEE 451


>gi|303321033|ref|XP_003070511.1| Transcription factor tfb2 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|240110207|gb|EER28366.1| Transcription factor tfb2 family protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|320036051|gb|EFW17991.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Coccidioides posadasii str. Silveira]
          Length = 487

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 82/136 (60%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G  T++S+  A E GITA+Q 
Sbjct: 319 GFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGITADQI 378

Query: 104 -----QNAHPRV-----------ADRIPS--IPENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++              +P+  +P  V DQIRLW+ + +R++ TP   + +
Sbjct: 379 ISYLTTHAHPQMRKVNASKSTSTTAGLPASVLPPTVVDQIRLWQLERDRLKATPGFLFKD 438

Query: 146 FPSRDVFEAACDYARD 161
           F S   FEA C YA +
Sbjct: 439 FASSAEFEAPCRYAEE 454


>gi|327298217|ref|XP_003233802.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           Tfb2 [Trichophyton rubrum CBS 118892]
 gi|326463980|gb|EGD89433.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           Tfb2 [Trichophyton rubrum CBS 118892]
          Length = 484

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 50/131 (38%), Positives = 79/131 (60%), Gaps = 14/131 (10%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF++VETN+R+YAY++S L   ++ LF+ ++++ PNLI G  T++S+  A E GITA+Q 
Sbjct: 320 GFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQI 379

Query: 104 -----QNAHPRVADRIPS--------IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
                 +AHP++     S        +P  V DQIRLW+ + +R++ T    + EF +  
Sbjct: 380 ISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFA 439

Query: 151 VFEAACDYARD 161
            FEA C YA +
Sbjct: 440 EFEAPCKYAEE 450


>gi|343428936|emb|CBQ72481.1| probable TFB2-TFIIH subunit (transcription/repair factor)
           [Sporisorium reilianum SRZ2]
          Length = 493

 Score =  104 bits (260), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 49/123 (39%), Positives = 80/123 (65%), Gaps = 6/123 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G++++ETN+R+YAY++++L   +L LF  I+ + PNL+VG+ T++S+ +A  NGITAEQ
Sbjct: 337 RGYIILETNYRLYAYTSNRLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQ 396

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                  +AH ++    P +P  V DQIRLWE + NRV+      + +F S+  FE   +
Sbjct: 397 IITYLTHHAHVQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQFDFEEVRN 456

Query: 158 YAR 160
           YA+
Sbjct: 457 YAK 459


>gi|296818081|ref|XP_002849377.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
           otae CBS 113480]
 gi|238839830|gb|EEQ29492.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
           otae CBS 113480]
          Length = 484

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 50/134 (37%), Positives = 79/134 (58%), Gaps = 14/134 (10%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
              GF++VETN+R+YAY++S L   ++ LF+ ++++ PNLI G  T++S+  A E GITA
Sbjct: 317 GGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITA 376

Query: 102 EQ------QNAHPRVADRIPS--------IPENVCDQIRLWESDLNRVEMTPAHYYDEFP 147
           +Q       +AHP++     S        +P  V DQIRLW+ + +R++ T    + EF 
Sbjct: 377 DQIVSYLTTHAHPQMRKTKKSTSNVAASILPPTVVDQIRLWQLERDRIKATSGFLFKEFD 436

Query: 148 SRDVFEAACDYARD 161
           +   FEA C YA +
Sbjct: 437 TFAEFEAPCKYAEE 450


>gi|225555033|gb|EEH03326.1| RNA pol II transcription initiation subunit [Ajellomyces capsulatus
           G186AR]
          Length = 485

 Score =  104 bits (259), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 82/137 (59%), Gaps = 18/137 (13%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+
Sbjct: 316 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 375

Query: 103 Q------QNAHPRVAD-----------RIPSI-PENVCDQIRLWESDLNRVEMTPAHYYD 144
           Q       +AHP++              +P++ P  V DQIRLW+ + +R++ TP   + 
Sbjct: 376 QIISYLTTHAHPQMRKYHATKAGANPVGVPTVLPPTVVDQIRLWQLERDRIKATPGFLFK 435

Query: 145 EFPSRDVFEAACDYARD 161
           +F +   FE  C YA +
Sbjct: 436 DFVNLAEFEGPCRYAEE 452


>gi|442757633|gb|JAA70975.1| Putative rna polymer [Ixodes ricinus]
          Length = 459

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 78/132 (59%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S   G++VVETN+R+YAY+ S+L   +L LF ++ Y+LPNL+VG  T+ES+  A  +GIT
Sbjct: 298 SYESGYIVVETNYRVYAYTDSQLQVALLALFCELMYRLPNLVVGVLTRESVRQALRSGIT 357

Query: 101 AE------QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           +       Q +AHP    + P IP  + DQ+RLWE + +R        Y +F S+  F+ 
Sbjct: 358 SNQIIKFLQMHAHPEAQKQSPVIPPTIMDQLRLWELERDRFNFREGVLYSQFISQSDFQL 417

Query: 155 ACDYARDRSGLL 166
             +YA D   L+
Sbjct: 418 LRNYASDLGVLI 429


>gi|212535854|ref|XP_002148083.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
 gi|210070482|gb|EEA24572.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
          Length = 481

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 80/136 (58%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G  T++S+  A E GITA+Q 
Sbjct: 315 GFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQI 374

Query: 104 -----QNAHPRVADR-----------IPS--IPENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++              IP   +P  V DQIRLW+ + +R++ TP   + E
Sbjct: 375 ISYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKE 434

Query: 146 FPSRDVFEAACDYARD 161
           F S   +E  C YA +
Sbjct: 435 FASLAEYEGPCRYAEE 450


>gi|357618296|gb|EHJ71332.1| putative TFIIH basal transcription factor complex p52 subunit
           [Danaus plexippus]
          Length = 603

 Score =  103 bits (258), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 80/137 (58%), Gaps = 13/137 (9%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S G+++ ETN+R+YAY+TS L   +L LF+++ Y+ PN++VG  T+ES+  A   GI+A+
Sbjct: 437 SSGYIIAETNYRVYAYTTSALQVALLGLFTELVYRFPNVVVGVLTRESVRAALRGGISAQ 496

Query: 103 Q------QNAHPRVAD-------RIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 149
           Q      Q++HP++            S+P  V DQIRLWES+ NR   T    Y++F S+
Sbjct: 497 QIITYLEQHSHPQMLKSDQGGIRSSSSLPPTVLDQIRLWESERNRFTYTEGVVYNQFLSQ 556

Query: 150 DVFEAACDYARDRSGLL 166
             F    DY R    L+
Sbjct: 557 AEFNVLRDYGRSSGALV 573


>gi|353240543|emb|CCA72408.1| probable TFB2-TFIIH subunit (transcription/repair factor)
           [Piriformospora indica DSM 11827]
          Length = 462

 Score =  103 bits (257), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 52/125 (41%), Positives = 77/125 (61%), Gaps = 6/125 (4%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           SQ FVV+ETN+ +YAY+ + L   +L LF   + + PN+I+G+ T++S+  A  NGITA+
Sbjct: 304 SQEFVVLETNYHVYAYTNNPLQIAVLNLFVSFKARFPNMIMGSLTRDSVKKALVNGITAD 363

Query: 103 Q------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAAC 156
           Q       +AHP++    P IP  V DQIRLWE + +RV+    + Y EF S + +E   
Sbjct: 364 QILSYLVTHAHPQMRKNNPIIPVTVQDQIRLWELERHRVKGQDGYLYKEFASMNDYEVVV 423

Query: 157 DYARD 161
            YAR+
Sbjct: 424 QYARE 428


>gi|159474228|ref|XP_001695231.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158276165|gb|EDP01939.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 435

 Score =  103 bits (257), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 51/114 (44%), Positives = 72/114 (63%), Gaps = 6/114 (5%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQN 105
           FV+VE+N+R+Y Y+ S +   +L LF K E  LPNL VGA  ++S+  A   GITA++  
Sbjct: 299 FVIVESNYRVYVYTRSPVTIAVLELFVKREALLPNLFVGAIRRDSILAALARGITADELV 358

Query: 106 A------HPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           A      HP +A R P +PE V DQIRLWE+ +NR++  PA  Y+   SR ++E
Sbjct: 359 AYLSARPHPSIAARCPVVPEVVSDQIRLWEASMNRLQAHPAVLYENMESRQLYE 412


>gi|157113167|ref|XP_001651923.1| TFIIH basal transcription factor complex p52 subunit [Aedes
           aegypti]
 gi|108877858|gb|EAT42083.1| AAEL006356-PA [Aedes aegypti]
          Length = 487

 Score =  103 bits (257), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 53/129 (41%), Positives = 80/129 (62%), Gaps = 12/129 (9%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G+++VETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  AF  GITAEQ
Sbjct: 322 KGYIIVETNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLTRDSVRQAFRGGITAEQ 381

Query: 104 ------QNAHPRVADRIPSI------PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
                 Q+AHP + +   +I      P  V DQI+LWE++ NR   T    Y++F S+  
Sbjct: 382 IISYLEQHAHPTMLNMEQAINSKSPLPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQGD 441

Query: 152 FEAACDYAR 160
           F    DYA+
Sbjct: 442 FNTLRDYAQ 450


>gi|154273196|ref|XP_001537450.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150415962|gb|EDN11306.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 455

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 80/137 (58%), Gaps = 18/137 (13%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+
Sbjct: 291 STGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVEMGITAD 350

Query: 103 Q------QNAHPRVADRIPS------------IPENVCDQIRLWESDLNRVEMTPAHYYD 144
           Q       +AHP++     +            +P  V DQIRLW+ + +R++ TP   + 
Sbjct: 351 QIISYLTTHAHPQMRKYHATKPGANPVGVHTVLPPTVVDQIRLWQLERDRIKATPGFLFK 410

Query: 145 EFPSRDVFEAACDYARD 161
           +F S   FE  C YA +
Sbjct: 411 DFVSLAEFEGPCRYAEE 427


>gi|156378083|ref|XP_001630974.1| predicted protein [Nematostella vectensis]
 gi|156218005|gb|EDO38911.1| predicted protein [Nematostella vectensis]
          Length = 471

 Score =  103 bits (257), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 51/128 (39%), Positives = 76/128 (59%), Gaps = 6/128 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE-- 102
           GF+++ETN+R+YAY+ S L   ++ LF +I  + PNL V + T+ES   A  +GI+AE  
Sbjct: 314 GFIIIETNYRVYAYTESCLQVSLIGLFCEILCRFPNLCVASLTRESCQQALASGISAEQI 373

Query: 103 ----QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
               Q  AHP +  R P IP  + DQ+RLWE + +R++ T    Y++F S+  FE    Y
Sbjct: 374 LNFLQTRAHPEMLKRTPIIPSTISDQVRLWEMERSRMKFTEGVLYNQFLSQADFEMLRKY 433

Query: 159 ARDRSGLL 166
           A D   L+
Sbjct: 434 AEDLGVLI 441


>gi|170039509|ref|XP_001847575.1| TFIIH basal transcription factor complex p52 subunit [Culex
           quinquefasciatus]
 gi|167863052|gb|EDS26435.1| TFIIH basal transcription factor complex p52 subunit [Culex
           quinquefasciatus]
          Length = 492

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 81/132 (61%), Gaps = 12/132 (9%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +  +G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  AF  GIT
Sbjct: 324 AQDKGYIVVETNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLTRDSVRQAFRGGIT 383

Query: 101 AEQ------QNAHP------RVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
           A+Q      Q+AHP      +  +   ++P  V DQI+LWE++ NR   T    Y++F S
Sbjct: 384 ADQIISYLEQHAHPTMHNVEQTINTKSALPPTVVDQIKLWENERNRFTYTEGVVYNQFLS 443

Query: 149 RDVFEAACDYAR 160
           +  F    DYA+
Sbjct: 444 QGDFNTLRDYAQ 455


>gi|212535856|ref|XP_002148084.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
 gi|210070483|gb|EEA24573.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Talaromyces marneffei ATCC 18224]
          Length = 422

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 80/136 (58%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G  T++S+  A E GITA+Q 
Sbjct: 256 GFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQI 315

Query: 104 -----QNAHPRVADR-----------IPS--IPENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++              IP   +P  V DQIRLW+ + +R++ TP   + E
Sbjct: 316 ISYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKE 375

Query: 146 FPSRDVFEAACDYARD 161
           F S   +E  C YA +
Sbjct: 376 FASLAEYEGPCRYAEE 391


>gi|281208616|gb|EFA82792.1| general transcription factor IIH [Polysphondylium pallidum PN500]
          Length = 458

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 56/126 (44%), Positives = 80/126 (63%), Gaps = 8/126 (6%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           QG++++ETNFR+YAY+ S L   +L LF K+ Y+LPNL VG  T+ES+  AF +GITA+Q
Sbjct: 304 QGYIILETNFRIYAYTASSLQISLLSLFVKMLYRLPNLSVGILTRESIRTAFLHGITADQ 363

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                 QN HP +       PE V +QIR+WE++ NR+    A  +D FP+++ F     
Sbjct: 364 IVDFIKQNGHPNMLK--VGAPEIVFEQIRIWENERNRILYKKAVLFDSFPTQESFNMTLQ 421

Query: 158 YARDRS 163
           YA+D S
Sbjct: 422 YAKDLS 427


>gi|326483225|gb|EGE07235.1| transcription factor Tfb2 [Trichophyton equinum CBS 127.97]
          Length = 484

 Score =  103 bits (256), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 50/131 (38%), Positives = 78/131 (59%), Gaps = 14/131 (10%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF++VETN R+YAY++S L   ++ LF+ ++++ PNLI G  T++S+  A E GITA+Q 
Sbjct: 320 GFIIVETNHRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQI 379

Query: 104 -----QNAHPRVADRIPS--------IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
                 +AHP++     S        +P  V DQIRLW+ + +R++ T    + EF +  
Sbjct: 380 ISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFA 439

Query: 151 VFEAACDYARD 161
            FEA C YA +
Sbjct: 440 EFEAPCKYAEE 450


>gi|71024427|ref|XP_762443.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
 gi|46097692|gb|EAK82925.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
          Length = 491

 Score =  103 bits (256), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 49/123 (39%), Positives = 79/123 (64%), Gaps = 6/123 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G++++ETN+R+YAY+++ L   +L LF  I+ + PNL+VG+ T++S+ +A  NGITAEQ
Sbjct: 335 RGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQ 394

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                  +AH ++    P +P  V DQIRLWE + NRV+      + +F S+  FE   +
Sbjct: 395 IITYLTHHAHLQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQFDFEEVRN 454

Query: 158 YAR 160
           YA+
Sbjct: 455 YAK 457


>gi|326474968|gb|EGD98977.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           Tfb2 [Trichophyton tonsurans CBS 112818]
          Length = 489

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 50/131 (38%), Positives = 78/131 (59%), Gaps = 14/131 (10%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF++VETN R+YAY++S L   ++ LF+ ++++ PNLI G  T++S+  A E GITA+Q 
Sbjct: 320 GFIIVETNHRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQI 379

Query: 104 -----QNAHPRVADRIPS--------IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
                 +AHP++     S        +P  V DQIRLW+ + +R++ T    + EF +  
Sbjct: 380 ISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFA 439

Query: 151 VFEAACDYARD 161
            FEA C YA +
Sbjct: 440 EFEAPCKYAEE 450


>gi|302684027|ref|XP_003031694.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
 gi|300105387|gb|EFI96791.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
          Length = 476

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 52/129 (40%), Positives = 78/129 (60%), Gaps = 11/129 (8%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +GF+V+ETN+R+YAY+ + L   +L LF   +Y+ PNL+VG+ T+ES+  A  NGITA+Q
Sbjct: 313 EGFIVLETNYRVYAYTDNALQTAVLNLFITPKYRFPNLVVGSITRESVKRAMSNGITADQ 372

Query: 104 ------QNAHPRVADRI-----PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                  +AHP++   +     P +P  V DQIRLWE + NR +      Y EF S+  +
Sbjct: 373 IISYLITHAHPQMHKNVCLVANPLLPVTVQDQIRLWELERNRTKSQEGFLYTEFASQADY 432

Query: 153 EAACDYARD 161
           E   +YA+ 
Sbjct: 433 EYVLNYAKQ 441


>gi|407927340|gb|EKG20235.1| Transcription factor Tfb2 [Macrophomina phaseolina MS6]
          Length = 466

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 51/132 (38%), Positives = 80/132 (60%), Gaps = 7/132 (5%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S S+G++++ETN+R+YAY++S L   +L LF+ +E + PNL+ G  TKES+  A   GIT
Sbjct: 307 SGSKGYIIIETNYRLYAYTSSPLQIAVLALFTDLETRFPNLVSGKLTKESVQRAIGLGIT 366

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        AHP++  + P +P  V DQIRLW+ + +R+  T    + +F   D ++ 
Sbjct: 367 ADQIISYLDTYAHPQMQRKTPVLPPTVVDQIRLWQIEGDRMRATEGFLFKDFRDWDEYQD 426

Query: 155 ACDYARDRSGLL 166
              YA D  G+L
Sbjct: 427 LKKYA-DSIGVL 437


>gi|31201525|ref|XP_309710.1| AGAP010984-PA [Anopheles gambiae str. PEST]
 gi|21293295|gb|EAA05440.1| AGAP010984-PA [Anopheles gambiae str. PEST]
          Length = 501

 Score =  102 bits (254), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 81/132 (61%), Gaps = 12/132 (9%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +  +G+++VETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  +++S+  AF  GIT
Sbjct: 333 TKDKGYIIVETNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLSRDSVRQAFRGGIT 392

Query: 101 AEQ------QNAHP------RVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
           AEQ      Q+AHP      +  +   S+P  V DQI+LWE++ NR   T    Y++F S
Sbjct: 393 AEQIISYLEQHAHPTMLTVEQAINSKSSLPPTVVDQIKLWENERNRFTYTEGVVYNQFLS 452

Query: 149 RDVFEAACDYAR 160
           +  F    DYA+
Sbjct: 453 QADFITLRDYAQ 464


>gi|66800623|ref|XP_629237.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
           discoideum AX4]
 gi|74850716|sp|Q54C29.1|TF2H4_DICDI RecName: Full=General transcription factor IIH subunit 4; AltName:
           Full=TFIIH basal transcription factor complex subunit 4
 gi|60462610|gb|EAL60813.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
           discoideum AX4]
          Length = 483

 Score =  102 bits (254), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 54/132 (40%), Positives = 83/132 (62%), Gaps = 6/132 (4%)

Query: 37  REYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFE 96
           R      QG++V+ETN+R+YAY++S L   +L LF K+ Y+LPNL VG  T+ES+  A  
Sbjct: 320 RTQTQKEQGYIVLETNYRLYAYTSSSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALI 379

Query: 97  NGITAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
           +GITA+Q       N+HP  A+    IP+ V +QI LWE++ NR+  T +  Y+ FP+ D
Sbjct: 380 HGITADQIIDFVRHNSHPNAANSGQPIPDVVAEQILLWEAERNRITYTKSVLYNSFPTND 439

Query: 151 VFEAACDYARDR 162
            + A   +A+++
Sbjct: 440 CYIATLKFAKEQ 451


>gi|268534706|ref|XP_002632484.1| Hypothetical protein CBG13719 [Caenorhabditis briggsae]
          Length = 482

 Score =  102 bits (254), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 56/133 (42%), Positives = 81/133 (60%), Gaps = 9/133 (6%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S G V+VETNFR+YAY++S L   I+ LF+++ Y+  ++ VG  T+ES+  A ++GITA 
Sbjct: 309 SNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFTDMSVGMITRESVRGALQHGITAA 368

Query: 103 Q------QNAHPR-VADRIP--SIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           Q       NAHP+ VA   P   +P  V DQIRLWE +  R+++  A+ Y  F S D + 
Sbjct: 369 QIISFLRANAHPQCVATSGPVNCLPITVADQIRLWEDERRRMDLKDAYIYSHFESDDEYH 428

Query: 154 AACDYARDRSGLL 166
               YA++R  LL
Sbjct: 429 GVVRYAQERGILL 441


>gi|67541703|ref|XP_664619.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
 gi|40742471|gb|EAA61661.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
 gi|259483674|tpe|CBF79257.1| TPA: TFIIH and nucleotide excision repair factor 3 complexes
           subunit (Tfb2), putative (AFU_orthologue; AFUA_4G04360)
           [Aspergillus nidulans FGSC A4]
          Length = 482

 Score =  102 bits (254), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 51/136 (37%), Positives = 81/136 (59%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G  T++S+  A E GITA+Q 
Sbjct: 317 GFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKVTRQSIRRAIEMGITADQI 376

Query: 104 -----QNAHPRV------------ADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++            A   PS+ P  V DQIRLW+ + +RV+ T    + +
Sbjct: 377 ISYLATHAHPQMRKHNVARSTSNQAGMPPSVLPPTVVDQIRLWQLERDRVKATAGFLFKD 436

Query: 146 FPSRDVFEAACDYARD 161
           F S   +EA C YA +
Sbjct: 437 FVSLAEYEAPCRYAEE 452


>gi|412992641|emb|CCO18621.1| predicted protein [Bathycoccus prasinos]
          Length = 542

 Score =  102 bits (253), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 81/128 (63%), Gaps = 11/128 (8%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE-- 102
           G ++VETNFR+YAY+ S+L  E+LRLF++ +Y+LPN  VG  T++S+ +A   GI+ +  
Sbjct: 386 GHIIVETNFRVYAYTHSELETEVLRLFTRPDYKLPNAYVGMITRDSILDAMRAGISPDQI 445

Query: 103 ----QQNAHPR--VADR--IPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR-DVFE 153
               Q  AHPR  +  R   P++P  VCDQIRLW  DL RV+      Y +FP + + F+
Sbjct: 446 VNYLQTRAHPRCTIGKRPNHPAVPPTVCDQIRLWARDLYRVKADDCVMYTDFPMQGNQFQ 505

Query: 154 AACDYARD 161
            A + AR+
Sbjct: 506 DAVNNARN 513


>gi|170120252|ref|XP_001891159.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164633568|gb|EDQ98188.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 475

 Score =  102 bits (253), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 50/130 (38%), Positives = 81/130 (62%), Gaps = 12/130 (9%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +GF+V+ETN+R+YAY+ + L   IL LF  ++Y+ PNL+VGA T++S+  A  NGI+A+Q
Sbjct: 311 EGFIVLETNYRIYAYTDNPLQTAILNLFVSLKYRFPNLVVGAITRDSVKKALLNGISADQ 370

Query: 104 ------QNAHPRVADRI------PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
                  +AHP++   +      P +P  V DQIRLWE + NR++    + Y  F S+  
Sbjct: 371 IISYLMSHAHPQMRKNVRLRSCNPLLPVTVQDQIRLWELEKNRLKSQEGYLYTSFASQAD 430

Query: 152 FEAACDYARD 161
           ++   +YA++
Sbjct: 431 YDFVLNYAKE 440


>gi|315041383|ref|XP_003170068.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
           gypseum CBS 118893]
 gi|311345102|gb|EFR04305.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
           gypseum CBS 118893]
          Length = 484

 Score =  102 bits (253), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 49/130 (37%), Positives = 78/130 (60%), Gaps = 14/130 (10%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ-- 103
           F++VETN+R+YAY++S L   ++ LF+ ++++ PNLI G  T++S+  A E GITA+Q  
Sbjct: 321 FIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIV 380

Query: 104 ----QNAHPRVADRIPS--------IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
                +AHP++     S        +P  V DQIRLW+ + +R++ T    + EF +   
Sbjct: 381 SYLTTHAHPQMRKTKKSTTNVSSTVLPPTVVDQIRLWQLERDRIKATSGFLFKEFDTFAE 440

Query: 152 FEAACDYARD 161
           FEA C YA +
Sbjct: 441 FEAPCKYAEE 450


>gi|340371925|ref|XP_003384495.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Amphimedon queenslandica]
          Length = 446

 Score =  101 bits (252), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 76/132 (57%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  GF++VETN+R+ AY+ SKLH   L LF +++Y+ PN+ VG  T+ES+  A  +GI 
Sbjct: 286 TSQAGFIIVETNYRVIAYTDSKLHIATLALFCQMQYRFPNVAVGTITRESIQQALVHGIK 345

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q      Q+AH  +  +   +P  V DQI+LWE + NR+       Y EF S   +E 
Sbjct: 346 ADQIISFLTQHAHHNMLSKAHILPPTVTDQIKLWEIERNRLSFQEGILYSEFLSVTDYEK 405

Query: 155 ACDYARDRSGLL 166
              YA D   LL
Sbjct: 406 VKKYAEDLGVLL 417


>gi|406862938|gb|EKD15987.1| tfiih and nucleotide excision repair factor 3 complexes subunit
           [Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
          Length = 421

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/137 (35%), Positives = 83/137 (60%), Gaps = 12/137 (8%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S ++GF+++ETN+R+YAY+ S L   +L LF+K+  + PN++ G  T++S+ NA  +GIT
Sbjct: 254 SGTKGFIIIETNYRLYAYTNSPLQIAVLALFTKMGVRYPNMVTGRVTRKSVANAISHGIT 313

Query: 101 AEQ------QNAHPRVADRI-----PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 149
           ++Q       +AHP++         P +P  V DQIRLW+ +  R++ T    + EF ++
Sbjct: 314 SDQIISYLRAHAHPQLVKAAAVNGNPVLPPTVVDQIRLWQIENERMKATAGFLFKEFETQ 373

Query: 150 DVFEAACDYARDRSGLL 166
             +E    YA D +G+L
Sbjct: 374 KEYEGCARYA-DENGVL 389


>gi|350639203|gb|EHA27557.1| transcription factor Tfb2 [Aspergillus niger ATCC 1015]
          Length = 478

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 81/136 (59%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+Q 
Sbjct: 313 GFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 372

Query: 104 -----QNAHPRV------------ADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++            A   PS+ P  V DQIRLW+ + +RV+ T    + +
Sbjct: 373 ISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKD 432

Query: 146 FPSRDVFEAACDYARD 161
           F S   +EA C YA +
Sbjct: 433 FVSLAEYEAPCRYAEE 448


>gi|145248676|ref|XP_001400677.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus niger CBS 513.88]
 gi|134081344|emb|CAK41847.1| unnamed protein product [Aspergillus niger]
          Length = 478

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 81/136 (59%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+Q 
Sbjct: 313 GFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 372

Query: 104 -----QNAHPRV------------ADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++            A   PS+ P  V DQIRLW+ + +RV+ T    + +
Sbjct: 373 ISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKD 432

Query: 146 FPSRDVFEAACDYARD 161
           F S   +EA C YA +
Sbjct: 433 FVSLAEYEAPCRYAEE 448


>gi|388857860|emb|CCF48525.1| probable TFB2-TFIIH subunit (transcription/repair factor) [Ustilago
           hordei]
          Length = 492

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 49/123 (39%), Positives = 78/123 (63%), Gaps = 6/123 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G++++ETN+R+YAY+++ L   +L LF  I+ + PNL+VG+ T++S+ +A  NGITAEQ
Sbjct: 336 RGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQ 395

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                  +AH ++    P +P  V DQIRLWE + NRV       + +F S+  FE   +
Sbjct: 396 IITYLTHHAHVQMHRNDPLLPVTVSDQIRLWEREKNRVVQNLGSLFTDFTSQFDFEEVRN 455

Query: 158 YAR 160
           YA+
Sbjct: 456 YAK 458


>gi|358370616|dbj|GAA87227.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Aspergillus kawachii IFO 4308]
          Length = 478

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 81/136 (59%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+Q 
Sbjct: 313 GFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 372

Query: 104 -----QNAHPRV------------ADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++            A   PS+ P  V DQIRLW+ + +RV+ T    + +
Sbjct: 373 ISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATSGFLFKD 432

Query: 146 FPSRDVFEAACDYARD 161
           F S   +EA C YA +
Sbjct: 433 FVSLAEYEAPCRYAEE 448


>gi|195126587|ref|XP_002007752.1| GI12214 [Drosophila mojavensis]
 gi|193919361|gb|EDW18228.1| GI12214 [Drosophila mojavensis]
          Length = 492

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 56/135 (41%), Positives = 83/135 (61%), Gaps = 12/135 (8%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G++VVETN+R+YAY+ S+L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ
Sbjct: 327 RGYIVVETNYRVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQ 386

Query: 104 ------QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
                 Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S++ 
Sbjct: 387 IVSYLEQYAHPNMKLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQND 446

Query: 152 FEAACDYARDRSGLL 166
           F    DYA+ ++ L+
Sbjct: 447 FVTLRDYAQSQNVLV 461


>gi|321470926|gb|EFX81900.1| hypothetical protein DAPPUDRAFT_210868 [Daphnia pulex]
          Length = 463

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 52/132 (39%), Positives = 80/132 (60%), Gaps = 7/132 (5%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S + G++VVETN+R+YAY+ S L   ++ LF ++ Y+ PNL+VG  T+ES+  A + GIT
Sbjct: 300 SQTTGYIVVETNYRVYAYTDSSLQVALIALFCELIYRFPNLVVGIITRESVREALKRGIT 359

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q      Q+AH       P +P  + DQI+LW  + +R        Y++F S+  FE 
Sbjct: 360 ADQIVSFLRQHAHSECYKEPPVLPSTIADQIKLWAIERDRFLYKDGVLYNQFDSQADFEI 419

Query: 155 ACDYARDRSGLL 166
             +YA++R GLL
Sbjct: 420 LRNYAQER-GLL 430


>gi|195999048|ref|XP_002109392.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
 gi|190587516|gb|EDV27558.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
          Length = 452

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 52/134 (38%), Positives = 77/134 (57%), Gaps = 8/134 (5%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S S G+++VETN+R+YAY+ S L   ++ LF ++ Y+LP L VG  T+ES+  AF NGIT
Sbjct: 289 SKSSGYIIVETNYRLYAYTESPLDIALVALFCEMMYRLPGLCVGLITRESVQQAFTNGIT 348

Query: 101 AEQ------QNAHPRVADRI--PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
           A +       +AHP    ++  P +P  + DQ+ LWE +  R+  +    Y++  S   F
Sbjct: 349 ANKIINFIRTHAHPEAKKKVTTPIVPSTIIDQLYLWELERGRLSCSDGVLYNQILSSSDF 408

Query: 153 EAACDYARDRSGLL 166
           EA   YA D   LL
Sbjct: 409 EALRKYADDMGVLL 422


>gi|115388809|ref|XP_001211910.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114195994|gb|EAU37694.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 460

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 52/136 (38%), Positives = 80/136 (58%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF++VETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+Q 
Sbjct: 295 GFIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAIEMGITADQI 354

Query: 104 -----QNAHPRV------------ADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++            A   PS+ P  V DQIRLW+ + +RV+ T    + +
Sbjct: 355 ISYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATAGFLFKD 414

Query: 146 FPSRDVFEAACDYARD 161
           F S   +E  C YA +
Sbjct: 415 FASLAEYEDPCRYAEE 430


>gi|328766914|gb|EGF76966.1| hypothetical protein BATDEDRAFT_92242 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 471

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 45/131 (34%), Positives = 74/131 (56%), Gaps = 6/131 (4%)

Query: 37  REYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFE 96
           R   ++S GF+++ETN+R+YAY+ S L   +L LF ++  +  N+++G  T++S+  A  
Sbjct: 301 RSLEADSDGFIIIETNYRVYAYTDSPLQIAVLSLFIEMRVRFANMVIGIITRDSVREALA 360

Query: 97  NGITAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
            GI+AEQ       +AHP +    P +P  + DQ+RLWE + NR+ ++  H Y  F    
Sbjct: 361 KGISAEQIIAYLTTHAHPEMKKGSPVLPTTIVDQVRLWEMERNRLRISRGHLYQMFSGEQ 420

Query: 151 VFEAACDYARD 161
            +     YA D
Sbjct: 421 EYREILKYATD 431


>gi|402224480|gb|EJU04542.1| transcription factor Tfb2 [Dacryopinax sp. DJM-731 SS1]
          Length = 466

 Score =  100 bits (248), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 48/129 (37%), Positives = 80/129 (62%), Gaps = 6/129 (4%)

Query: 38  EYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFEN 97
           E  ++S+GF+++ETN+R+YAY+ + L   +L LF  +  +  NL++G  T+ES+  A  N
Sbjct: 303 EKAASSEGFIILETNYRVYAYTENPLQIAVLNLFVALHSRFENLVIGRLTRESIKAALAN 362

Query: 98  GITAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
           GITA+Q       +AHP +    P +P  V DQIRLW+ + NR++    + Y++F S+  
Sbjct: 363 GITADQIISYLTVHAHPMMHKNNPVLPVTVQDQIRLWQLEKNRLKSENGYLYEDFNSQGD 422

Query: 152 FEAACDYAR 160
           F+   +YA+
Sbjct: 423 FDLVLNYAK 431


>gi|443894209|dbj|GAC71558.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB2 [Pseudozyma antarctica
           T-34]
          Length = 491

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 48/122 (39%), Positives = 77/122 (63%), Gaps = 6/122 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G++++ETN+R+YAY+++ L   +L LF  I+ + PNL+VG+ T++S+ +A  NGITAEQ
Sbjct: 335 RGYIILETNYRLYAYTSNPLRVAVLSLFVTIKARFPNLLVGSITRDSVKSALANGITAEQ 394

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                  +AH ++    P +P  V DQIRLWE + NRV+      + +F S+  F    +
Sbjct: 395 IITYLTHHAHLQMHRNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQPDFAEVRN 454

Query: 158 YA 159
           YA
Sbjct: 455 YA 456


>gi|330797174|ref|XP_003286637.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
 gi|325083385|gb|EGC36839.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
          Length = 471

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 52/124 (41%), Positives = 78/124 (62%), Gaps = 6/124 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G++V+ETN+R+YAY+ S L   +L LF K+ Y+LPNL VG  T+ES+  A  +GITA+Q 
Sbjct: 316 GYIVLETNYRLYAYTASSLQISLLSLFVKMLYRLPNLAVGIITRESIRTALIHGITADQI 375

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                 NAHP   +    IP+ V +QI LWE + NR+  T +  Y+ FP+ D + A   +
Sbjct: 376 IDFIRHNAHPNAVNNGQPIPDVVAEQILLWEQERNRITYTKSVLYNSFPTTDCYHATLKF 435

Query: 159 ARDR 162
           A+++
Sbjct: 436 AKEQ 439


>gi|342319113|gb|EGU11064.1| RNA polymerase II transcription factor B subunit 2 [Rhodotorula
           glutinis ATCC 204091]
          Length = 496

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 49/133 (36%), Positives = 78/133 (58%), Gaps = 10/133 (7%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +GF+V+ETN+++YAY+++ L   +L LF+ ++ +  N + G  T+ES+     NGITA Q
Sbjct: 321 KGFIVLETNYKLYAYTSNPLQIAVLGLFAHLKTRFANFVTGHITRESIRRGLANGITANQ 380

Query: 104 ------QNAHPRVADRIPS----IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
                   AHP++  +  S    +P  V DQIRLWE +  R++ T  + YDEF S   +E
Sbjct: 381 IISYLASRAHPQMRAQAGSDDKLLPITVVDQIRLWEHERRRIQTTEGYLYDEFSSTHDYE 440

Query: 154 AACDYARDRSGLL 166
              +YAR+   +L
Sbjct: 441 LVVNYAREIGSVL 453


>gi|388581217|gb|EIM21527.1| transcription factor Tfb2 [Wallemia sebi CBS 633.66]
          Length = 449

 Score = 99.8 bits (247), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 48/125 (38%), Positives = 79/125 (63%), Gaps = 6/125 (4%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           +++ F+++ETN+R+YAY+++ L   IL LF  ++ + PNL++G  T++S+ +A  NGITA
Sbjct: 276 DNKRFIILETNYRLYAYTSNPLQISILNLFVTLKARYPNLVIGVITRDSIRSALSNGITA 335

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
           EQ       +AH ++    P +P  V DQIRLWE + NR++      Y EF S+  +E  
Sbjct: 336 EQIIGYLTSHAHTQMHRNNPLLPVTVSDQIRLWELEKNRLKADDGVLYAEFRSQPDYEIL 395

Query: 156 CDYAR 160
            +YA+
Sbjct: 396 LNYAK 400


>gi|195378984|ref|XP_002048261.1| GJ11446 [Drosophila virilis]
 gi|194155419|gb|EDW70603.1| GJ11446 [Drosophila virilis]
          Length = 492

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 54/129 (41%), Positives = 79/129 (61%), Gaps = 12/129 (9%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G++VVETN+R+YAY+ S+L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ
Sbjct: 327 RGYIVVETNYRVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQ 386

Query: 104 ------QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
                 Q AHP    V   I S   +P  + DQI+LWE + NR   T    Y++F S++ 
Sbjct: 387 IVSYLEQYAHPNMKLVESAIQSKSCLPPTIVDQIKLWEMERNRFTYTEGVVYNQFLSQND 446

Query: 152 FEAACDYAR 160
           F    DYA+
Sbjct: 447 FVTLRDYAQ 455


>gi|76154655|gb|AAX26095.2| SJCHGC05044 protein [Schistosoma japonicum]
          Length = 214

 Score = 99.4 bits (246), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/154 (38%), Positives = 82/154 (53%), Gaps = 7/154 (4%)

Query: 19  AIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQL 78
            + Q +  DTN  L  I     S+  G++++ETNFR+YAY+ S L   +L LFSKI  + 
Sbjct: 33  GVSQSDSSDTNKMLAQISVSS-SSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARF 91

Query: 79  PNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDL 132
           PNL+V   T++S+  A   GITA Q       NAHP +    P +P  + DQIRLWE + 
Sbjct: 92  PNLVVADITRDSVREALIRGITANQILSFLTSNAHPDMLLSNPILPPTLTDQIRLWELER 151

Query: 133 NRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           +R        Y+ F     FE   DYA++   LL
Sbjct: 152 DRFVFQEGCLYEHFSRNTDFELVRDYAKNIGVLL 185


>gi|50557260|ref|XP_506038.1| YALI0F30173p [Yarrowia lipolytica]
 gi|62901293|sp|Q6BZX4.1|TFB2_YARLI RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|49651908|emb|CAG78851.1| YALI0F30173p [Yarrowia lipolytica CLIB122]
          Length = 467

 Score = 99.0 bits (245), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 78/133 (58%), Gaps = 6/133 (4%)

Query: 40  ISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGI 99
           ++   GF+++ETNFR+YAY+ S L   +L LF  ++ +  N++ G   ++S+  A  NGI
Sbjct: 300 VAADSGFIILETNFRLYAYTESPLQIAVLNLFVNLKTRFANMVTGQINRDSVRFALSNGI 359

Query: 100 TAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           TAEQ       +AHPR+      +P  V DQI+LW+ +++R+  T  + + EF + D ++
Sbjct: 360 TAEQIITYLSVHAHPRMKGMEHVLPPTVVDQIKLWQLEMDRIRATDGYLFSEFKNFDEYK 419

Query: 154 AACDYARDRSGLL 166
               YA++   LL
Sbjct: 420 DVSTYAKELGVLL 432


>gi|195019521|ref|XP_001984999.1| GH16811 [Drosophila grimshawi]
 gi|193898481|gb|EDV97347.1| GH16811 [Drosophila grimshawi]
          Length = 493

 Score = 99.0 bits (245), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 55/129 (42%), Positives = 78/129 (60%), Gaps = 12/129 (9%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G++VVETN+R+YAY+ S+L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ
Sbjct: 328 RGYIVVETNYRVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQ 387

Query: 104 ------QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
                 Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S+  
Sbjct: 388 IVSYLEQYAHPNMKLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSQTD 447

Query: 152 FEAACDYAR 160
           F    DYA+
Sbjct: 448 FVTLRDYAQ 456


>gi|156057729|ref|XP_001594788.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980]
 gi|154702381|gb|EDO02120.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 420

 Score = 99.0 bits (245), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 78/133 (58%), Gaps = 12/133 (9%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +N +GF+++ETN+R+YAY+ S L   +L LF+K+  + PN++ G  +++S+  A  +GIT
Sbjct: 256 ANQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAISHGIT 315

Query: 101 AEQ------QNAHPRV------ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
           ++Q       +AHP++      A   P +P  V DQIRLW+ +  R++  P   + EF  
Sbjct: 316 SDQIITYLSTHAHPQLVKASTAAHGGPVLPPTVVDQIRLWQLENERMKAVPGFLFKEFDG 375

Query: 149 RDVFEAACDYARD 161
           +  +E    YA +
Sbjct: 376 QKEYEGCAKYAEE 388


>gi|119486893|ref|XP_001262366.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Neosartorya fischeri NRRL 181]
 gi|119410523|gb|EAW20469.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Neosartorya fischeri NRRL 181]
          Length = 468

 Score = 99.0 bits (245), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 48/136 (35%), Positives = 79/136 (58%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+Q 
Sbjct: 302 GFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 361

Query: 104 -----QNAHPRVADR-------------IPSIPENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++                +  +P  V DQIRLW+ + +RV+ T    + +
Sbjct: 362 ISYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATHGFLFRD 421

Query: 146 FPSRDVFEAACDYARD 161
           F +   +EA C YA +
Sbjct: 422 FNTLAEYEAPCRYAEE 437


>gi|70982185|ref|XP_746621.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2) [Aspergillus fumigatus Af293]
 gi|66844244|gb|EAL84583.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus fumigatus Af293]
 gi|159122143|gb|EDP47265.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Aspergillus fumigatus A1163]
          Length = 479

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 48/136 (35%), Positives = 79/136 (58%), Gaps = 19/136 (13%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNLI G  T++S+  A E GITA+Q 
Sbjct: 313 GFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMGITADQI 372

Query: 104 -----QNAHPRVADR-------------IPSIPENVCDQIRLWESDLNRVEMTPAHYYDE 145
                 +AHP++                +  +P  V DQIRLW+ + +RV+ T    + +
Sbjct: 373 ISYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKATHGFLFRD 432

Query: 146 FPSRDVFEAACDYARD 161
           F +   +EA C YA +
Sbjct: 433 FNTLAEYEAPCRYAEE 448


>gi|256086358|ref|XP_002579367.1| tfiih polypeptide [Schistosoma mansoni]
          Length = 486

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 77/135 (57%), Gaps = 8/135 (5%)

Query: 40  ISNSQ--GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFEN 97
           +SNS   G++++ETNFR+YAY+ S L   +L LFSKI  + PNL+V   T++S+  A   
Sbjct: 323 VSNSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIR 382

Query: 98  GITAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
           GITA Q       NAHP +  + P +P  + DQIRLWE + +R        Y++F     
Sbjct: 383 GITANQILSFLTANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTD 442

Query: 152 FEAACDYARDRSGLL 166
           FE   DYA+    LL
Sbjct: 443 FEMVRDYAKSIGVLL 457


>gi|195162678|ref|XP_002022181.1| GL25529 [Drosophila persimilis]
 gi|198464237|ref|XP_001353141.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
 gi|194104142|gb|EDW26185.1| GL25529 [Drosophila persimilis]
 gi|198149631|gb|EAL30642.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
          Length = 494

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 77/128 (60%), Gaps = 12/128 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE-- 102
           G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAE  
Sbjct: 330 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 389

Query: 103 ----QQNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
               QQ AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S++ F
Sbjct: 390 VSYLQQYAHPNMKLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQNDF 449

Query: 153 EAACDYAR 160
               DYA+
Sbjct: 450 VTLRDYAQ 457


>gi|353231086|emb|CCD77504.1| putative tfiih, polypeptide [Schistosoma mansoni]
          Length = 486

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 55/135 (40%), Positives = 77/135 (57%), Gaps = 8/135 (5%)

Query: 40  ISNSQ--GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFEN 97
           +SNS   G++++ETNFR+YAY+ S L   +L LFSKI  + PNL+V   T++S+  A   
Sbjct: 323 VSNSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIR 382

Query: 98  GITAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
           GITA Q       NAHP +  + P +P  + DQIRLWE + +R        Y++F     
Sbjct: 383 GITANQILSFLTANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYEQFSRNTD 442

Query: 152 FEAACDYARDRSGLL 166
           FE   DYA+    LL
Sbjct: 443 FEMVRDYAKSIGVLL 457


>gi|194749905|ref|XP_001957376.1| GF10392 [Drosophila ananassae]
 gi|190624658|gb|EDV40182.1| GF10392 [Drosophila ananassae]
          Length = 497

 Score = 98.6 bits (244), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 77/128 (60%), Gaps = 12/128 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ 
Sbjct: 333 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 392

Query: 104 -----QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S++ F
Sbjct: 393 VSYLEQYAHPNMRLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQNDF 452

Query: 153 EAACDYAR 160
               DYA+
Sbjct: 453 VTLRDYAQ 460


>gi|307177222|gb|EFN66420.1| General transcription factor IIH subunit 4 [Camponotus floridanus]
          Length = 467

 Score = 98.6 bits (244), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 79/126 (62%), Gaps = 7/126 (5%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ +G++VVETN+R+YAY+ S L   +L LF ++ Y+ PN++V   T++S+  A ++GIT
Sbjct: 304 TDKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNVVVSILTRDSIRQALKSGIT 363

Query: 101 AE------QQNAHPRVADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           A       QQ+ H ++ +R P I P  + DQI+LWE++ NR   +    Y +F S+  FE
Sbjct: 364 ASQIVGYLQQHVHSKMIERGPPILPPTIVDQIKLWENERNRFLFSEGVLYSQFLSQTDFE 423

Query: 154 AACDYA 159
              D+A
Sbjct: 424 VLRDHA 429


>gi|425777809|gb|EKV15965.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Penicillium digitatum PHI26]
 gi|425782577|gb|EKV20476.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Penicillium digitatum Pd1]
          Length = 382

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 47/131 (35%), Positives = 78/131 (59%), Gaps = 17/131 (12%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ-- 103
           F+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G  T++S+  A E GITA+Q  
Sbjct: 216 FIIIETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQII 275

Query: 104 ----QNAHPRV---------ADRIPS--IPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
                +AHP++            IP+  +P  V DQIRLW+ + +R+  T    + +F S
Sbjct: 276 SYLLSHAHPQLRKHSAAQSNGKGIPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDFTS 335

Query: 149 RDVFEAACDYA 159
              ++A C YA
Sbjct: 336 LAEYQAPCQYA 346


>gi|91094191|ref|XP_971121.1| PREDICTED: similar to TFIIH basal transcription factor complex p52
           subunit [Tribolium castaneum]
 gi|270010854|gb|EFA07302.1| hypothetical protein TcasGA2_TC015892 [Tribolium castaneum]
          Length = 472

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 48/131 (36%), Positives = 81/131 (61%), Gaps = 8/131 (6%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G+++VETN+R+YAY+ S L   ++ LF+++ Y+ PNL+VG  T++S+  A + GITA+Q
Sbjct: 312 KGYIIVETNYRVYAYTDSNLQVALIGLFTELMYRFPNLVVGVITRDSIRQALKGGITADQ 371

Query: 104 ------QNAHPRV--ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
                 Q+AHP++   +    +P  V DQI+LWE + NR+  +    Y +F S+  F   
Sbjct: 372 IIGYLKQHAHPQMLEGEAKHPLPPTVVDQIKLWEIERNRLTYSEGVLYSQFLSQADFNIL 431

Query: 156 CDYARDRSGLL 166
            +YA+    L+
Sbjct: 432 KEYAQSNGHLI 442


>gi|431907067|gb|ELK11185.1| General transcription factor IIH subunit 4 [Pteropus alecto]
          Length = 447

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 47/117 (40%), Positives = 73/117 (62%), Gaps = 10/117 (8%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQ 104
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 306 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQ- 364

Query: 105 NAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
                     P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +AR+
Sbjct: 365 ---------TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARE 412


>gi|195477962|ref|XP_002086438.1| GE22864 [Drosophila yakuba]
 gi|194186228|gb|EDW99839.1| GE22864 [Drosophila yakuba]
          Length = 241

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 75/128 (58%), Gaps = 12/128 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ 
Sbjct: 77  GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 136

Query: 104 -----QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S   F
Sbjct: 137 VSYLEQYAHPNMRLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHTDF 196

Query: 153 EAACDYAR 160
               DYA+
Sbjct: 197 VTLRDYAQ 204


>gi|255936123|ref|XP_002559088.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211583708|emb|CAP91723.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 483

 Score = 97.8 bits (242), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 78/131 (59%), Gaps = 17/131 (12%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ-- 103
           F+++ETN+R+YAY++S L   ++ LF+ ++Y+ PNL+ G  T++S+  A E GITA+Q  
Sbjct: 317 FIIIETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITADQII 376

Query: 104 ----QNAHPRV---------ADRIPS--IPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
                +AHP++            +P+  +P  V DQIRLW+ + +R+  T    + +F S
Sbjct: 377 SYLLSHAHPQLRKHSAAQPNGKGVPASVLPPTVTDQIRLWQLERDRLRATAGFLFKDFTS 436

Query: 149 RDVFEAACDYA 159
              ++A C YA
Sbjct: 437 LAEYQAPCQYA 447


>gi|358333989|dbj|GAA34589.2| transcription initiation factor TFIIH subunit 4 [Clonorchis
           sinensis]
          Length = 456

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 49/122 (40%), Positives = 71/122 (58%), Gaps = 6/122 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G++++ETNFR+YAY+ S L   +L LFSKI  + PNL+V   T++S+  A   GITA+Q 
Sbjct: 300 GYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGITADQI 359

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                 NAHP +    P +P  + DQIRLWE + +R        Y++F     FE   D+
Sbjct: 360 ISFLTTNAHPDMLREPPILPPTLVDQIRLWELERDRFVFQEGCLYEQFSKSADFEMVRDF 419

Query: 159 AR 160
           A+
Sbjct: 420 AK 421


>gi|195441209|ref|XP_002068409.1| GK20438 [Drosophila willistoni]
 gi|194164494|gb|EDW79395.1| GK20438 [Drosophila willistoni]
          Length = 512

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 52/129 (40%), Positives = 75/129 (58%), Gaps = 12/129 (9%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ
Sbjct: 347 RGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQ 406

Query: 104 ------QNAHPRVA------DRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
                 Q AHP +       +    +P  V DQI+LWE + NR   T    Y++F S   
Sbjct: 407 IISYLEQYAHPNMKLVESAINSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSTTD 466

Query: 152 FEAACDYAR 160
           F    DYA+
Sbjct: 467 FVTLRDYAQ 475


>gi|328863850|gb|EGG12949.1| hypothetical protein MELLADRAFT_46475 [Melampsora larici-populina
           98AG31]
          Length = 454

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 47/126 (37%), Positives = 78/126 (61%), Gaps = 6/126 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S+   F+++ETN+R+YAY+++ L   IL LF   + + PNL++G+ T+ES+  AF+NGI 
Sbjct: 291 SDPNEFLILETNYRIYAYTSNPLQIAILNLFISFKSRFPNLVIGSITRESIKMAFKNGIR 350

Query: 101 AEQQ------NAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q       ++H ++    P +P  V DQIRLWE + NRV     + Y++F S   +E 
Sbjct: 351 ADQIISYLEFHSHSQMKLLKPILPNTVVDQIRLWELEKNRVRDQEGYLYEDFKSVSDYEI 410

Query: 155 ACDYAR 160
             +Y++
Sbjct: 411 VINYSK 416


>gi|195477975|ref|XP_002086439.1| GE22863 [Drosophila yakuba]
 gi|194186229|gb|EDW99840.1| GE22863 [Drosophila yakuba]
          Length = 499

 Score = 97.4 bits (241), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 75/128 (58%), Gaps = 12/128 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ 
Sbjct: 335 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 394

Query: 104 -----QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S   F
Sbjct: 395 VSYLEQYAHPNMRLVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHTDF 454

Query: 153 EAACDYAR 160
               DYA+
Sbjct: 455 VTLRDYAQ 462


>gi|194872905|ref|XP_001973103.1| GG13537 [Drosophila erecta]
 gi|190654886|gb|EDV52129.1| GG13537 [Drosophila erecta]
          Length = 499

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 75/128 (58%), Gaps = 12/128 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ 
Sbjct: 335 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 394

Query: 104 -----QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S   F
Sbjct: 395 VSYLEQYAHPNMRMVESAIQSKSCLPPTVVDQIKLWELERNRFTYTEGVVYNQFLSHTDF 454

Query: 153 EAACDYAR 160
               DYA+
Sbjct: 455 VTLRDYAQ 462


>gi|16198089|gb|AAL13841.1| LD30622p [Drosophila melanogaster]
          Length = 350

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 75/128 (58%), Gaps = 12/128 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ 
Sbjct: 186 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 245

Query: 104 -----QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S   F
Sbjct: 246 VSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDF 305

Query: 153 EAACDYAR 160
               DYA+
Sbjct: 306 VTLRDYAQ 313


>gi|346464903|gb|AEO32296.1| hypothetical protein [Amblyomma maculatum]
          Length = 436

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 49/132 (37%), Positives = 75/132 (56%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S   G+++VETN+R+YAY+ S+L   +L LF ++ Y+ PNL+V   T+ES+  A  +GIT
Sbjct: 297 SYEAGYIMVETNYRVYAYTNSQLQVALLALFCELLYRFPNLVVARLTRESVRQALRSGIT 356

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           + Q        AHP    + P IP  + DQ+RLWE + +R        Y +F S+  F+ 
Sbjct: 357 SNQIIKFLRMYAHPEALKQTPVIPATIMDQLRLWELERDRFVYREGVLYSQFISQSDFQL 416

Query: 155 ACDYARDRSGLL 166
             +YA D   L+
Sbjct: 417 LRNYASDLGVLI 428


>gi|195590491|ref|XP_002084979.1| GD12553 [Drosophila simulans]
 gi|194196988|gb|EDX10564.1| GD12553 [Drosophila simulans]
          Length = 500

 Score = 97.1 bits (240), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 75/128 (58%), Gaps = 12/128 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ 
Sbjct: 336 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 395

Query: 104 -----QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S   F
Sbjct: 396 VSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDF 455

Query: 153 EAACDYAR 160
               DYA+
Sbjct: 456 VTLRDYAQ 463


>gi|195327811|ref|XP_002030611.1| GM24480 [Drosophila sechellia]
 gi|194119554|gb|EDW41597.1| GM24480 [Drosophila sechellia]
          Length = 499

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 75/128 (58%), Gaps = 12/128 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ 
Sbjct: 335 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 394

Query: 104 -----QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S   F
Sbjct: 395 VSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDF 454

Query: 153 EAACDYAR 160
               DYA+
Sbjct: 455 VTLRDYAQ 462


>gi|21357821|ref|NP_648780.1| marionette [Drosophila melanogaster]
 gi|7294263|gb|AAF49614.1| marionette [Drosophila melanogaster]
 gi|220942562|gb|ACL83824.1| Tfb2-PA [synthetic construct]
          Length = 499

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 55/128 (42%), Positives = 75/128 (58%), Gaps = 12/128 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G++VVETN+R+YAY+ S L   +L LF+++ Y+ PNL+VG  T++S+  A   GITAEQ 
Sbjct: 335 GYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQI 394

Query: 104 -----QNAHPR---VADRIPS---IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                Q AHP    V   I S   +P  V DQI+LWE + NR   T    Y++F S   F
Sbjct: 395 VSYLEQYAHPNMRMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLSHTDF 454

Query: 153 EAACDYAR 160
               DYA+
Sbjct: 455 VTLRDYAQ 462


>gi|383852611|ref|XP_003701820.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Megachile rotundata]
          Length = 467

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 48/125 (38%), Positives = 78/125 (62%), Gaps = 7/125 (5%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           + +G++VVETN+R+YAY+ S L   +L LF ++ Y+ PNL+V   T++S+  A ++GITA
Sbjct: 305 DKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITA 364

Query: 102 E------QQNAHPRVADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
                  QQ+AH ++ +  P + P  + DQI+LWE++ NR   +    Y +F S+  FE 
Sbjct: 365 SQIVGYLQQHAHSKMIEAGPPVLPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEV 424

Query: 155 ACDYA 159
             D+A
Sbjct: 425 LRDHA 429


>gi|380014187|ref|XP_003691121.1| PREDICTED: general transcription factor IIH subunit 4-like [Apis
           florea]
          Length = 467

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 49/124 (39%), Positives = 77/124 (62%), Gaps = 7/124 (5%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
            +G++VVETN+R+YAY+ S L   +L LF ++ Y+ PNL+V   T++S+  A ++GITA 
Sbjct: 306 KEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAT 365

Query: 103 ------QQNAHPRVADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
                 QQ+AH ++ +  P I P  + DQI+LWE++ NR   +    Y +F S+  FE  
Sbjct: 366 QIVGYLQQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVL 425

Query: 156 CDYA 159
            D+A
Sbjct: 426 RDHA 429


>gi|48102983|ref|XP_395476.1| PREDICTED: general transcription factor IIH subunit 4 [Apis
           mellifera]
          Length = 467

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 49/124 (39%), Positives = 77/124 (62%), Gaps = 7/124 (5%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
            +G++VVETN+R+YAY+ S L   +L LF ++ Y+ PNL+V   T++S+  A ++GITA 
Sbjct: 306 KEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAT 365

Query: 103 ------QQNAHPRVADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
                 QQ+AH ++ +  P I P  + DQI+LWE++ NR   +    Y +F S+  FE  
Sbjct: 366 QIVGYLQQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVL 425

Query: 156 CDYA 159
            D+A
Sbjct: 426 RDHA 429


>gi|332030486|gb|EGI70174.1| General transcription factor IIH subunit 4 [Acromyrmex echinatior]
          Length = 550

 Score = 96.7 bits (239), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 47/126 (37%), Positives = 79/126 (62%), Gaps = 7/126 (5%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ +G+++VETN+R+YAY+ S L   +L LF ++ Y+ PN++V   T++S+  A ++GIT
Sbjct: 304 TDKEGYIIVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNVVVSILTRDSVRQALKSGIT 363

Query: 101 AEQ------QNAHPRVADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           A Q      Q+AH ++ +  P I P  + DQI+LWE++ NR   +    Y +F S+  FE
Sbjct: 364 ASQIVGYLRQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFLFSEGVLYSQFLSQTDFE 423

Query: 154 AACDYA 159
              D+A
Sbjct: 424 VLRDHA 429


>gi|296422658|ref|XP_002840876.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295637102|emb|CAZ85067.1| unnamed protein product [Tuber melanosporum]
          Length = 460

 Score = 96.3 bits (238), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 45/125 (36%), Positives = 73/125 (58%), Gaps = 6/125 (4%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
            +GF+++ETN+R+YAY+ S L   +L LF K+  + PNL+ G  ++ S+  A + GITA+
Sbjct: 296 GKGFIILETNYRVYAYTDSPLQIAVLNLFVKLSTRYPNLVSGRISRRSIQEAIKMGITAD 355

Query: 103 Q------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAAC 156
           Q       +AHP++   + ++P  V DQIRLW+ +  R+  T    + +F S   FE   
Sbjct: 356 QVIDYLSAHAHPQMRKSLVTLPPTVVDQIRLWQIEGERMRTTTGFLFKDFHSTQEFEEVA 415

Query: 157 DYARD 161
            YA +
Sbjct: 416 KYAEE 420


>gi|50310215|ref|XP_455127.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|62901296|sp|Q6CLR2.1|TFB2_KLULA RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|49644263|emb|CAG97834.1| KLLA0F01056p [Kluyveromyces lactis]
          Length = 496

 Score = 95.9 bits (237), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 52/146 (35%), Positives = 78/146 (53%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G +++ETNF++Y+YS S L   IL LF  ++ +  N++ G  T+ES+ NA  NGITAEQ 
Sbjct: 322 GALIIETNFKLYSYSNSPLQIAILSLFVHLKSRFANMVTGQLTRESVRNALLNGITAEQI 381

Query: 104 -----QNAHPR------------------VADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHPR                  V + +  +P  V DQIRLW+ +L+R+     
Sbjct: 382 IAYLETHAHPRMRRLAEENLSKKLELDPTVKETLQVLPPTVVDQIRLWQLELDRIISYDG 441

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
           + Y +F S   ++   DYA+D   LL
Sbjct: 442 YLYTDFESYQEYQTVADYAKDIGVLL 467


>gi|427785675|gb|JAA58289.1| Putative rna polymer [Rhipicephalus pulchellus]
          Length = 457

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 47/128 (36%), Positives = 74/128 (57%), Gaps = 6/128 (4%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G+++VETN+R+YAY+ S+L   +L LF ++ Y+ PNL+V   T+ES+  A  +GIT+ Q 
Sbjct: 300 GYIMVETNYRVYAYTNSQLQVALLALFCELLYRFPNLVVARLTRESVRQALRSGITSNQI 359

Query: 104 -----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
                  AHP    + P IP  + DQ+RLWE + +R        Y +F S+  F+   +Y
Sbjct: 360 IKFLRMYAHPEALKQTPVIPATIMDQLRLWEMERDRFVFREGVLYSQFISQSDFQLLRNY 419

Query: 159 ARDRSGLL 166
           A +   L+
Sbjct: 420 ASELGVLI 427


>gi|350413829|ref|XP_003490125.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
           impatiens]
          Length = 467

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 48/124 (38%), Positives = 77/124 (62%), Gaps = 7/124 (5%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
            +G++VVETN+R+YAY+ S L   +L LF ++ Y+ PNL+V   T++S+  A ++GITA 
Sbjct: 306 KEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAA 365

Query: 103 ------QQNAHPRVADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
                 QQ+AH ++ +  P + P  + DQI+LWE++ NR   +    Y +F S+  FE  
Sbjct: 366 QIVGYLQQHAHGKMIEAGPPVLPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQTDFEVL 425

Query: 156 CDYA 159
            D+A
Sbjct: 426 RDHA 429


>gi|307212060|gb|EFN87943.1| General transcription factor IIH subunit 4 [Harpegnathos saltator]
          Length = 467

 Score = 95.9 bits (237), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 49/126 (38%), Positives = 77/126 (61%), Gaps = 7/126 (5%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ +G++VVETN+R+YAY+ S L   +L LF ++ Y+ PNL+V   T++S+  A ++GIT
Sbjct: 304 TDKEGYIVVETNYRVYAYTNSNLQVALLGLFCEMIYRFPNLVVSILTRDSVRQALKSGIT 363

Query: 101 AE------QQNAHPRVADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           A       QQ+AH +  +  P I P  + DQI LWE++ NR   +    Y +F S+  FE
Sbjct: 364 ATQIVGYLQQHAHSKTIEAGPPILPPTIVDQIMLWENERNRFLFSEGVLYSQFLSQTDFE 423

Query: 154 AACDYA 159
              D+A
Sbjct: 424 VLRDHA 429


>gi|221120480|ref|XP_002159945.1| PREDICTED: general transcription factor IIH subunit 4-like [Hydra
           magnipapillata]
          Length = 458

 Score = 95.5 bits (236), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 49/132 (37%), Positives = 77/132 (58%), Gaps = 6/132 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S  QGF+VVETN+R+YAY+ S LH  +L LF+ I+ + P+  V   ++ES+  A   GI+
Sbjct: 298 SQQQGFLVVETNYRVYAYTDSVLHIALLSLFTDIKARFPSFTVALLSRESVQQALACGIS 357

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ        AH ++    P I   + DQI+LWE + +R+  +    Y++F S+  FE 
Sbjct: 358 AEQIIDFLKTRAHSQMTTSSPIIASTITDQIKLWEMERDRLRYSQGVLYNQFLSQSDFEM 417

Query: 155 ACDYARDRSGLL 166
              +A +++ LL
Sbjct: 418 LRKFADEKNHLL 429


>gi|339237219|ref|XP_003380164.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
 gi|316977049|gb|EFV60220.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
          Length = 674

 Score = 95.1 bits (235), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 50/129 (38%), Positives = 72/129 (55%), Gaps = 6/129 (4%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +GF++VETNFR+Y Y+ S L   I+  F +  Y+ PNL+     +ES+  AF+  I+AEQ
Sbjct: 292 EGFLIVETNFRVYCYTDSVLDLAIVSTFCEPLYRFPNLVACILNRESVRRAFQVNISAEQ 351

Query: 104 ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACD 157
                  NAH  +  + P+IP  V DQI+LWE + +R +  P   Y  F S   +    D
Sbjct: 352 IIQYLFSNAHKNMQKQTPTIPSTVTDQIKLWEMERDRFKFDPGVMYSNFFSDTDYITIRD 411

Query: 158 YARDRSGLL 166
           YA+D   LL
Sbjct: 412 YAKDLGVLL 420


>gi|340710070|ref|XP_003393621.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
           terrestris]
          Length = 467

 Score = 95.1 bits (235), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 48/124 (38%), Positives = 76/124 (61%), Gaps = 7/124 (5%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
            +G++VVETN+R+YAY+ S L   +L LF ++ Y+ PNL+V   T++S+  A ++GITA 
Sbjct: 306 KEGYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAA 365

Query: 103 ------QQNAHPRVADRIPSI-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
                 QQ+AH ++ +  P + P  + DQI+LWE++ NR        Y +F S+  FE  
Sbjct: 366 QIVGYLQQHAHGKMIEAGPPVLPPTIVDQIKLWENERNRFIFNEGVLYSQFLSQTDFEVL 425

Query: 156 CDYA 159
            D+A
Sbjct: 426 RDHA 429


>gi|391335225|ref|XP_003741996.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Metaseiulus occidentalis]
          Length = 457

 Score = 95.1 bits (235), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 49/128 (38%), Positives = 75/128 (58%), Gaps = 7/128 (5%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
            + QG+++VETN+R+YAY+ S L   +L LF ++ Y+ PNL+V   T+ES+  A + GIT
Sbjct: 294 KDEQGYIIVETNYRVYAYTDSPLQVALLSLFCELLYRFPNLLVAVLTRESVRQALKGGIT 353

Query: 101 AEQ------QNAHPRVADRIPS-IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           + Q        +H  V +R    IP  V DQ+RLWE + +R +   +  Y +F +   FE
Sbjct: 354 SNQITHFLKSRSHRVVLEREEGIIPMTVTDQLRLWELERDRFKFGESVLYSQFQTTGDFE 413

Query: 154 AACDYARD 161
              +YARD
Sbjct: 414 MLRNYARD 421


>gi|339521903|gb|AEJ84116.1| general transcription factor IIH subunit 4 [Capra hircus]
          Length = 463

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 47/127 (37%), Positives = 76/127 (59%), Gaps = 6/127 (4%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  GF+VVETN+R+YAY+  +L    + LFS++ Y+ PN++V   T+ S+  A  +GIT
Sbjct: 302 AHQPGFIVVETNYRLYAYTEPELQIARIALFSEMLYRFPNMVVPQVTRGSVQQAIASGIT 361

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           A+Q        A P +  + P +P  + DQIRLWE + +R+  T    Y++F S+  FE 
Sbjct: 362 AQQIVHFLRTRADPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFEL 421

Query: 155 ACDYARD 161
              +AR+
Sbjct: 422 LLAHARE 428


>gi|378729002|gb|EHY55461.1| transcription initiation factor TFIIH subunit H4 [Exophiala
           dermatitidis NIH/UT8656]
          Length = 508

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 52/141 (36%), Positives = 79/141 (56%), Gaps = 21/141 (14%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ-- 103
           F++VETN+R+YAY++S L   +L LF  +  + PNL+ G  +K S+  A + GITA+Q  
Sbjct: 338 FIIVETNYRLYAYTSSPLQIALLSLFVNLRSRHPNLVTGKMSKSSVQRAIQAGITADQII 397

Query: 104 ----QNAHP-----------RVAD---RIPSIPENVCDQIRLWESDLNRVEMTPAHYYDE 145
                +AHP           R AD    +P +P  + DQI LW+ + +R+  TP     +
Sbjct: 398 SYLTSHAHPQMRRHAQAEQARNADPNRVVPILPATILDQIHLWQLERDRMTTTPGFLLKD 457

Query: 146 FPSRDVFEAACDYARDRSGLL 166
           FP++  +EA C YA D  G+L
Sbjct: 458 FPNQADYEAPCRYA-DEIGVL 477


>gi|452839401|gb|EME41340.1| hypothetical protein DOTSEDRAFT_156149 [Dothistroma septosporum
           NZE10]
          Length = 497

 Score = 94.4 bits (233), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 81/137 (59%), Gaps = 12/137 (8%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S+ +GF+++ETN+R+YAY++S L   IL LF+ +  + PNLI    TK S  NA   GIT
Sbjct: 318 SDEKGFIILETNYRLYAYTSSPLLISILSLFASLNTRYPNLITAKITKASTQNAIAAGIT 377

Query: 101 AEQ------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 149
           ++Q       +AHP +     A+  P +P  V DQI+LW+ +  R+E T  +   +  S+
Sbjct: 378 SDQIIDYLTTHAHPVLRRQAAANDAPILPPTVVDQIKLWQMEGERMEATKGYLIRDIGSQ 437

Query: 150 DVFEAACDYARDRSGLL 166
           + ++ A +YA D  G+L
Sbjct: 438 EEYDKAVNYA-DAIGVL 453


>gi|345492819|ref|XP_001602702.2| PREDICTED: general transcription factor IIH subunit 4-like [Nasonia
           vitripennis]
          Length = 428

 Score = 94.4 bits (233), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 48/121 (39%), Positives = 76/121 (62%), Gaps = 7/121 (5%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ-- 103
           FV+VETN+R+YAY+ S L   ++ LF ++ Y+ PNL+V   T++S+  A ++GITA Q  
Sbjct: 270 FVIVETNYRVYAYTNSNLQVALIGLFCELLYRFPNLVVAILTRDSVRAALKSGITAVQIV 329

Query: 104 ----QNAHPRVAD-RIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDY 158
               Q+AH ++ D   P++P  + DQI+LWE++ NR   +    Y +F S+  FE   D+
Sbjct: 330 GYLNQHAHNKMIDPGPPTLPPTIVDQIKLWENERNRFIFSEGVLYSQFHSQIDFEVLRDH 389

Query: 159 A 159
           A
Sbjct: 390 A 390


>gi|402465832|gb|EJW01474.1| hypothetical protein EDEG_03937 [Edhazardia aedis USNM 41457]
          Length = 465

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 79/124 (63%), Gaps = 2/124 (1%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S+ F+ +ETNF++YAY+TSK  + IL LFSKI  +LPNLI     ++S+N AF+  I+A+
Sbjct: 315 SESFMYIETNFKLYAYTTSKYDFSILSLFSKISCKLPNLISAIINEDSVNTAFDKKISAK 374

Query: 103 QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDR 162
           Q + + +   +  ++P+NV +Q+ +WES  NR++ +    +  F +   F+ A D  +++
Sbjct: 375 QISYYLKSKGK--NVPKNVVEQVYIWESKRNRIKTSECTLFKGFLNLMDFKKAVDVCKEK 432

Query: 163 SGLL 166
             L+
Sbjct: 433 HWLV 436


>gi|405952750|gb|EKC20525.1| General transcription factor IIH subunit 4 [Crassostrea gigas]
          Length = 380

 Score = 93.6 bits (231), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 48/127 (37%), Positives = 73/127 (57%), Gaps = 10/127 (7%)

Query: 40  ISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGI 99
           +  + G+++VETN+R+YAY+ S L   ++ LF ++ Y+ P   VG  T+ S+ +A   GI
Sbjct: 233 VGKNTGYLMVETNYRVYAYTDSPLQVALVALFCEMLYRFPTFSVGNLTRVSVRDALIRGI 292

Query: 100 TAEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYA 159
           TA+Q           P IP  V DQ+RLWE + +R +      YD+F S++ FE   DYA
Sbjct: 293 TADQ----------TPVIPSTVTDQVRLWELERDRFKFMEGVLYDQFLSQNDFELLRDYA 342

Query: 160 RDRSGLL 166
           +D   LL
Sbjct: 343 KDLGVLL 349


>gi|347830304|emb|CCD46001.1| similar to RNA polymerase II transcription factor B subunit 2
           [Botryotinia fuckeliana]
          Length = 471

 Score = 93.6 bits (231), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 77/133 (57%), Gaps = 12/133 (9%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ +GF+++ETN+R+YAY+ S L   +L LF+K+  + PN++ G  +++S+  A  +GIT
Sbjct: 307 ASQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAIAHGIT 366

Query: 101 AEQ------QNAHPRVADRI------PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
           ++Q       +AHP++          P +P  V DQIRLW+ +  R++  P     +F +
Sbjct: 367 SDQIITYLSTHAHPQLVKASSASHGGPVLPPTVVDQIRLWQLENERMKAVPGFLMKDFET 426

Query: 149 RDVFEAACDYARD 161
           +  +E    YA +
Sbjct: 427 QKEYEGCAKYAEE 439


>gi|154308354|ref|XP_001553513.1| hypothetical protein BC1G_08237 [Botryotinia fuckeliana B05.10]
          Length = 410

 Score = 93.2 bits (230), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 77/133 (57%), Gaps = 12/133 (9%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ +GF+++ETN+R+YAY+ S L   +L LF+K+  + PN++ G  +++S+  A  +GIT
Sbjct: 246 ASQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAIAHGIT 305

Query: 101 AEQ------QNAHPRVADRI------PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
           ++Q       +AHP++          P +P  V DQIRLW+ +  R++  P     +F +
Sbjct: 306 SDQIITYLSTHAHPQLVKASSASHGGPVLPPTVVDQIRLWQLENERMKAVPGFLMKDFET 365

Query: 149 RDVFEAACDYARD 161
           +  +E    YA +
Sbjct: 366 QKEYEGCAKYAEE 378


>gi|255079788|ref|XP_002503474.1| predicted protein [Micromonas sp. RCC299]
 gi|226518741|gb|ACO64732.1| predicted protein [Micromonas sp. RCC299]
          Length = 458

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 8/116 (6%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
            +  G ++ ETNFR+YAY+ S +  EILRLF++ +Y+LPNL VG  T+E+++ A + G+ 
Sbjct: 305 GDPGGHIIAETNFRVYAYTFSDVECEILRLFTRPDYRLPNLYVGMLTREAVHEALDTGVA 364

Query: 101 AEQ------QNAHP--RVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
           AEQ       +AHP  R       IP NV DQI LW  +  RV       Y +FP+
Sbjct: 365 AEQIIKYIKSHAHPNARKTTNGSGIPPNVADQIMLWAMERRRVRSAECVLYCDFPT 420


>gi|171681598|ref|XP_001905742.1| hypothetical protein [Podospora anserina S mat+]
 gi|170940758|emb|CAP66407.1| unnamed protein product [Podospora anserina S mat+]
          Length = 540

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 47/130 (36%), Positives = 75/130 (57%), Gaps = 11/130 (8%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  G +VVETN+R+YAY++S L   IL+LF ++  + PN++    T+ES+  A + GIT
Sbjct: 376 ADENGGIVVETNYRIYAYTSSPLQIAILKLFCRLHMRFPNMVTARLTRESVQEAIKEGIT 435

Query: 101 AEQ------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 149
           A Q       +AHP++     A     IP  V DQIRLW+ +  R++ TP   + +F S 
Sbjct: 436 ANQIIDYLVAHAHPQMRRAAAARGTTVIPPTVMDQIRLWQLESQRMQKTPGFQFKDFESV 495

Query: 150 DVFEAACDYA 159
           + +    +YA
Sbjct: 496 EEYRQLAEYA 505


>gi|334323685|ref|XP_003340422.1| PREDICTED: valyl-tRNA synthetase, mitochondrial-like [Monodelphis
           domestica]
          Length = 1641

 Score = 92.8 bits (229), Expect = 4e-17,   Method: Composition-based stats.
 Identities = 51/158 (32%), Positives = 81/158 (51%), Gaps = 37/158 (23%)

Query: 41  SNSQGFVVVETNFRMYAYS-------------------------------TSKLHYEILR 69
           ++  GF+VVETN+R+YAY+                                S+L   ++ 
Sbjct: 302 AHQPGFIVVETNYRLYAYTGEVGGMRMGIGMLEGRNCGFFGIILFFSLATESELQIALIA 361

Query: 70  LFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPENVCD 123
           LFS++ Y+ PN++V   T+ES+  A  NGITA+Q        AHP +  +IP +P  + D
Sbjct: 362 LFSEMLYRFPNMVVAQITRESVQQAIANGITAQQIIHFLKTRAHPVMLKQIPVLPPTITD 421

Query: 124 QIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           QIRLWE + +R+  +    Y++F S+  FE    +AR+
Sbjct: 422 QIRLWELERDRLRFSEGVLYNQFLSQVDFELLLSHARE 459


>gi|345569654|gb|EGX52519.1| hypothetical protein AOL_s00043g13 [Arthrobotrys oligospora ATCC
           24927]
          Length = 452

 Score = 92.4 bits (228), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 46/127 (36%), Positives = 71/127 (55%), Gaps = 12/127 (9%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  GF+++ETN+       S L   +L LF+++  + PNL+    T+ES+  A   GIT
Sbjct: 298 NDENGFLIIETNYH------SPLQIAVLNLFARLISRFPNLVTARITRESIRQAIGKGIT 351

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           AEQ       NAHP++    P +P  V DQIRLW+ +  R+++T  + + EF +   F A
Sbjct: 352 AEQIIDYMTANAHPQLRKNSPVLPPTVVDQIRLWQIEGERMKVTSGYLFKEFATNAEFNA 411

Query: 155 ACDYARD 161
            C YA D
Sbjct: 412 VCKYADD 418


>gi|389749775|gb|EIM90946.1| transcription factor Tfb2 [Stereum hirsutum FP-91666 SS1]
          Length = 494

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 47/129 (36%), Positives = 74/129 (57%), Gaps = 13/129 (10%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ-- 103
           F+++ETN+R+YAY+ + L   +L LF  ++ + PNL+VG+ T++S+  A  NGITA+Q  
Sbjct: 318 FIILETNYRLYAYTDNPLQTAVLNLFVTMKSRFPNLVVGSITRDSVRKALSNGITADQII 377

Query: 104 ----QNAHPRVADRI-------PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                 AHP++   +       P +P  V DQIRLWE + NR++      Y  F S+  +
Sbjct: 378 SYLTTYAHPQMRKNVSRCRSFTPLLPVTVQDQIRLWELERNRLKSQEGFLYTAFASQADY 437

Query: 153 EAACDYARD 161
           E    YA+ 
Sbjct: 438 EYVLTYAKQ 446


>gi|396080769|gb|AFN82390.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon
           romaleae SJ-2008]
          Length = 414

 Score = 91.7 bits (226), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 44/125 (35%), Positives = 73/125 (58%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N + F++VETN ++YAY+ S+    ++ LF  +   LPNLI G+ T+ES+N AF+ GIT 
Sbjct: 263 NRREFIIVETNNKIYAYTNSEYEKSVIHLFCNVSVSLPNLIKGSITEESVNVAFDKGITG 322

Query: 102 EQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           +Q       + +  S+P  +  QI +WES  NR+ M P + Y  F +   ++   ++  +
Sbjct: 323 KQIIHFLEASVKPGSLPPAITSQIMIWESKRNRIFMVPGYIYSNFLNLSDYQRVLEFCTE 382

Query: 162 RSGLL 166
           RS L+
Sbjct: 383 RSYLI 387


>gi|401825125|ref|XP_003886658.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
           ATCC 50504]
 gi|395459803|gb|AFM97677.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
           ATCC 50504]
          Length = 414

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/125 (35%), Positives = 73/125 (58%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N + F++VETN ++YAY+ S+    ++ LF  +   LPNLI G+ T+ES+N AF+ GIT 
Sbjct: 263 NRREFIIVETNNKIYAYTNSEYEKSVIHLFCSVSVNLPNLIKGSITEESVNVAFDKGITG 322

Query: 102 EQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           +Q       + +  S+P  +  QI +WES  NR+ M P + Y  F +   ++   ++  +
Sbjct: 323 KQIIHFLEASVKPGSLPPAISSQIMIWESKRNRIFMAPGYIYSNFLNLSDYQRVLEFCTE 382

Query: 162 RSGLL 166
           RS L+
Sbjct: 383 RSYLI 387


>gi|303388087|ref|XP_003072278.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon
           intestinalis ATCC 50506]
 gi|303301417|gb|ADM10918.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH subunit TFB2 [Encephalitozoon
           intestinalis ATCC 50506]
          Length = 414

 Score = 90.9 bits (224), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 42/125 (33%), Positives = 73/125 (58%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N + F++VETN ++YAY+ S+    ++ LF  + + LPNL  G+ T+ES+N AF+ GIT 
Sbjct: 263 NRREFIIVETNNKIYAYTNSEYEKSVIHLFCNVSFNLPNLTKGSITEESVNAAFDKGITG 322

Query: 102 EQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
            Q       + +  S+P  + +QI +WES  NR+ M P + Y  F +   ++   ++  +
Sbjct: 323 RQIIHFLEASSKPGSLPPAIINQIIIWESKRNRIFMAPGYLYSNFLNLSDYQKVLEFCSE 382

Query: 162 RSGLL 166
           R+ L+
Sbjct: 383 RNYLI 387


>gi|389642093|ref|XP_003718679.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae 70-15]
 gi|351641232|gb|EHA49095.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae 70-15]
 gi|440469172|gb|ELQ38293.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae Y34]
 gi|440481925|gb|ELQ62459.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
           oryzae P131]
          Length = 490

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 83/138 (60%), Gaps = 14/138 (10%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +N++G +++ETNFR+YAY+++ L   IL LF+ ++ +   ++ G  ++ S+  A  +GIT
Sbjct: 322 NNTKGAIILETNFRIYAYTSTPLQIAILALFANLKMRFAGMVTGQLSRHSIKRAISHGIT 381

Query: 101 AEQ------QNAHPRVADRI------PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
           A+Q       +AH ++  RI      P +P  V DQIRLW+ +  R+++   + + +F S
Sbjct: 382 ADQIIEYLASHAHEQM-HRIAAIRNKPVLPPTVVDQIRLWQLETERMQVQRGYLFKDFES 440

Query: 149 RDVFEAACDYARDRSGLL 166
           +  F+A  DYA D  G+L
Sbjct: 441 QAEFKAIADYA-DEVGVL 457


>gi|452977455|gb|EME77221.1| hypothetical protein MYCFIDRAFT_146475 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 476

 Score = 90.5 bits (223), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 49/130 (37%), Positives = 73/130 (56%), Gaps = 11/130 (8%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +N +GF+++ETN+R+YAY+ S L   IL LF+ +  + PNL+    TK S+  A  +GIT
Sbjct: 300 ANEKGFIILETNYRLYAYTNSPLLISILSLFASLHTRYPNLVTAKITKTSVQAAISSGIT 359

Query: 101 AE------QQNAHP---RVAD--RIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 149
           +       Q +AHP   R A     P +P  V DQIRLW+ +  R+  T  +   E  S+
Sbjct: 360 SNQIISYLQTHAHPILRRTASMHNAPILPPTVVDQIRLWQIEGERMTSTKGYLIREVGSK 419

Query: 150 DVFEAACDYA 159
           + +E A  YA
Sbjct: 420 EDYEKAVQYA 429


>gi|146414792|ref|XP_001483366.1| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 498

 Score = 89.7 bits (221), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 78/156 (50%), Gaps = 37/156 (23%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
            QG +++ETNF++YAY+ S L   IL LF  +  +  N++ G  T+ES+ NA  NGIT++
Sbjct: 309 GQGSIILETNFKLYAYTNSPLEIAILNLFVNLRTRFSNMVSGQITRESIRNALYNGITSD 368

Query: 103 Q------QNAHPRV-------------------------------ADRIPSIPENVCDQI 125
           Q       +AHP++                                 ++  +P  V DQI
Sbjct: 369 QIINFLETHAHPQMRALAKERLDKKVEFDASNNINTAGGGQNSSAQHKLEILPPTVVDQI 428

Query: 126 RLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           +LW+ +L+R++    + + EF S+  F+  C+YA D
Sbjct: 429 KLWQLELDRIQAFEGYLFKEFASQQEFDVLCNYASD 464


>gi|190347675|gb|EDK39997.2| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 498

 Score = 89.7 bits (221), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 47/156 (30%), Positives = 78/156 (50%), Gaps = 37/156 (23%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
            QG +++ETNF++YAY+ S L   IL LF  +  +  N++ G  T+ES+ NA  NGIT++
Sbjct: 309 GQGSIILETNFKLYAYTNSPLEIAILNLFVNLRTRFSNMVSGQITRESIRNALYNGITSD 368

Query: 103 Q------QNAHPRV-------------------------------ADRIPSIPENVCDQI 125
           Q       +AHP++                                 ++  +P  V DQI
Sbjct: 369 QIINFLETHAHPQMRALAKERLDKKVEFDASNNINTAGGGQNSSAQHKLEILPPTVVDQI 428

Query: 126 RLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           +LW+ +L+R++    + + EF S+  F+  C+YA D
Sbjct: 429 KLWQLELDRIQAFEGYLFKEFASQQEFDVLCNYASD 464


>gi|402866404|ref|XP_003897374.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial,
           partial [Papio anubis]
          Length = 1647

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 51/156 (32%), Positives = 78/156 (50%), Gaps = 39/156 (25%)

Query: 45  GFVVVETNFRMYAY---------------------------------STSKLHYEILRLF 71
           GF+VVETN+R+YAY                                 S S+L   ++ LF
Sbjct: 351 GFIVVETNYRLYAYTGEVGQSAPEEKKEGLEGVWVWGWPPHPLSIPGSESELQIALIALF 410

Query: 72  SKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPENVCDQI 125
           S++ Y+ PN++V   T+ES+  A  +GITA+Q        AHP +  + P +P  + DQI
Sbjct: 411 SEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQI 470

Query: 126 RLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           RLWE + +R+  T    Y++F S+  FE    +AR+
Sbjct: 471 RLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARE 506


>gi|393910939|gb|EJD76097.1| transcription factor Tfb2 family protein, variant [Loa loa]
          Length = 389

 Score = 89.0 bits (219), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 51/138 (36%), Positives = 71/138 (51%), Gaps = 13/138 (9%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N  GFV+VETN+R+YAY+ S L   IL  F+++ Y+  ++ VG  ++E++  AF+ GITA
Sbjct: 220 NQNGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITA 279

Query: 102 EQ------QNAHPRV-------ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
            Q       NAHP            I S+P  V DQIRLWE +  R+    A  Y  F S
Sbjct: 280 AQIIAFLRANAHPTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFES 339

Query: 149 RDVFEAACDYARDRSGLL 166
              +     Y   +  LL
Sbjct: 340 EKEYFGLKGYVSSQGILL 357


>gi|393910938|gb|EFO26137.2| transcription factor Tfb2 family protein [Loa loa]
          Length = 486

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/138 (36%), Positives = 71/138 (51%), Gaps = 13/138 (9%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N  GFV+VETN+R+YAY+ S L   IL  F+++ Y+  ++ VG  ++E++  AF+ GITA
Sbjct: 317 NQNGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITA 376

Query: 102 EQ------QNAHPRV-------ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
            Q       NAHP            I S+P  V DQIRLWE +  R+    A  Y  F S
Sbjct: 377 AQIIAFLRANAHPTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFES 436

Query: 149 RDVFEAACDYARDRSGLL 166
              +     Y   +  LL
Sbjct: 437 EKEYFGLKGYVSSQGILL 454


>gi|312069992|ref|XP_003137940.1| transcription factor Tfb2 family protein [Loa loa]
          Length = 470

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 51/138 (36%), Positives = 71/138 (51%), Gaps = 13/138 (9%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N  GFV+VETN+R+YAY+ S L   IL  F+++ Y+  ++ VG  ++E++  AF+ GITA
Sbjct: 301 NQNGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITA 360

Query: 102 EQ------QNAHPRV-------ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
            Q       NAHP            I S+P  V DQIRLWE +  R+    A  Y  F S
Sbjct: 361 AQIIAFLRANAHPTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFES 420

Query: 149 RDVFEAACDYARDRSGLL 166
              +     Y   +  LL
Sbjct: 421 EKEYFGLKGYVSSQGILL 438


>gi|449296697|gb|EMC92716.1| hypothetical protein BAUCODRAFT_259506 [Baudoinia compniacensis
           UAMH 10762]
          Length = 489

 Score = 89.0 bits (219), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 46/132 (34%), Positives = 78/132 (59%), Gaps = 7/132 (5%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +N +G++++ETN+R+YAY++S L   IL LF+ ++ + PNLI    TK S+ +A  +GIT
Sbjct: 321 ANEKGYIILETNYRLYAYTSSPLPITILSLFATLKTRYPNLITAKLTKSSIQSAIASGIT 380

Query: 101 AEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
           ++Q       +AHP +  + P +P  V DQIRLW+ +  R+         +  + + ++ 
Sbjct: 381 SDQIITYLTTHAHPILRRQNPVLPPTVVDQIRLWQIEGERMTAWKGFLIRDVGTAEEYDK 440

Query: 155 ACDYARDRSGLL 166
           A  YA D  G+L
Sbjct: 441 AVQYA-DALGVL 451


>gi|313224499|emb|CBY20289.1| unnamed protein product [Oikopleura dioica]
          Length = 411

 Score = 88.6 bits (218), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 43/127 (33%), Positives = 75/127 (59%), Gaps = 6/127 (4%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ-- 103
           F++VETN+R+YAY+ S+LHY ++ LF++++Y+ P +IV   +++S+  + + GI+AEQ  
Sbjct: 254 FILVETNYRIYAYTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQSADYGISAEQIL 313

Query: 104 ----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYA 159
                +AHP        +P+ V D I LW  +  R++      Y +F  ++ FE    YA
Sbjct: 314 NYLRSSAHPIARKNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLDQEAFEMLKSYA 373

Query: 160 RDRSGLL 166
           +D   L+
Sbjct: 374 QDIRALV 380


>gi|395533817|ref|XP_003768949.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Sarcophilus harrisii]
          Length = 1638

 Score = 88.6 bits (218), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 80/161 (49%), Gaps = 40/161 (24%)

Query: 41  SNSQGFVVVETNFRMYAYST----------------------------------SKLHYE 66
           ++  GF+VVETN+R+YAY+                                   S+L   
Sbjct: 335 AHQPGFIVVETNYRLYAYTGEVGGIKMRIEIRGGGIGGELNLLWSSIVFSLETESELQIA 394

Query: 67  ILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPEN 120
           ++ LFS++ Y+ PN++V   T+ES+  A  NGITA+Q        AHP +  + P +P  
Sbjct: 395 LIALFSEMLYRFPNMVVAQVTRESVQQAIANGITAQQIIHFMKTRAHPVMLKQTPVLPPT 454

Query: 121 VCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           + DQIRLWE + +R+  +    Y++F S+  FE    +AR+
Sbjct: 455 ITDQIRLWELERDRLRFSEGVLYNQFLSQVDFELLLSHARE 495


>gi|313245784|emb|CBY34779.1| unnamed protein product [Oikopleura dioica]
          Length = 456

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 43/127 (33%), Positives = 75/127 (59%), Gaps = 6/127 (4%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ-- 103
           F++VETN+R+YAY+ S+LHY ++ LF++++Y+ P +IV   +++S+  + + GI+AEQ  
Sbjct: 299 FILVETNYRIYAYTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQSADYGISAEQIL 358

Query: 104 ----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYA 159
                +AHP        +P+ V D I LW  +  R++      Y +F  ++ FE    YA
Sbjct: 359 NYLRSSAHPIARKNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQFLDQEAFEMLKSYA 418

Query: 160 RDRSGLL 166
           +D   L+
Sbjct: 419 QDIRALV 425


>gi|344303892|gb|EGW34141.1| RNA polymerase II transcription factor B subunit 2 [Spathaspora
           passalidarum NRRL Y-27907]
          Length = 519

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 51/171 (29%), Positives = 83/171 (48%), Gaps = 45/171 (26%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
            N QG V++ETNF++YAY+ S L   IL LF  +  +  N++ G  T+ES+ NA  NGIT
Sbjct: 317 GNIQGTVIIETNFKLYAYTNSPLEIAILNLFVHLRTRFANMVCGQITRESIRNALYNGIT 376

Query: 101 AEQ------QNAHPRV---------------------------------AD------RIP 115
           A+Q       +AHP++                                 AD      ++ 
Sbjct: 377 ADQIIRFLETHAHPQMKVLAKEKLDKKIEFDTSHNINTAGGAPQSQALRADGSVAQHKLE 436

Query: 116 SIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
            +P NV DQI+LW+ +L+R++    + + +FP++  F+    YA +   L+
Sbjct: 437 ILPPNVVDQIKLWQLELDRIQTFDGYLFKDFPNQQEFDKLSSYASELGVLI 487


>gi|50291141|ref|XP_448003.1| hypothetical protein [Candida glabrata CBS 138]
 gi|62901305|sp|Q6FP41.1|TFB2_CANGA RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|49527314|emb|CAG60954.1| unnamed protein product [Candida glabrata]
          Length = 504

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 74/146 (50%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G +++ETNF++Y+Y  S L   IL LF  ++ +  N++ G  T+ES+  A  NGITA+Q 
Sbjct: 327 GALIIETNFKLYSYCNSPLQIAILSLFVHLKSRFANMVAGQITRESIRRALINGITADQV 386

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   D +  +P  V DQI+LW+ +L+RV     
Sbjct: 387 IAYLESHAHPQMRRLAEEKLQKKLELDPNCKDPLQVLPPTVVDQIKLWQLELDRVLTYEG 446

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F +   F   C YA+D   LL
Sbjct: 447 SLYIDFDTAQDFNMLCKYAQDIGALL 472


>gi|365986883|ref|XP_003670273.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
 gi|343769043|emb|CCD25030.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
          Length = 522

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 50/146 (34%), Positives = 75/146 (51%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   IL LF  ++++  N++ G  T+ES+  A  NGITAEQ 
Sbjct: 344 GSLIVETNFKLYSYSNSPLQIAILSLFVHLKFRFVNMVTGQITRESIRRALINGITAEQI 403

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   D +  +P  V DQIRLW+ +L+RV     
Sbjct: 404 IAYLETHAHPQMRRLAEEKLEKKLELDANCKDSLQILPPTVVDQIRLWQLELDRVIAYEG 463

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F +   +     YA+D   LL
Sbjct: 464 SLYSDFENNTEYTTLYKYAQDIGVLL 489


>gi|156839389|ref|XP_001643386.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156113994|gb|EDO15528.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 504

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/141 (33%), Positives = 75/141 (53%), Gaps = 24/141 (17%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G +VVETNF++Y+YS S L   IL LF  ++ +  N++ G  T+ES+ NA  NGITAEQ 
Sbjct: 329 GALVVETNFKLYSYSNSPLQIAILSLFVHLKTRFSNMVTGQITRESIRNALSNGITAEQI 388

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   + +  +P  V DQI+LW+ +L+R+     
Sbjct: 389 IAYLETHAHPQMRRLAEERLKKKLELDQNSKETLQILPPTVVDQIKLWQLELDRIISYEG 448

Query: 141 HYYDEFPSRDVFEAACDYARD 161
             Y +F ++  ++    YA+D
Sbjct: 449 SLYSDFDNKQEYDMLYSYAQD 469


>gi|170589001|ref|XP_001899262.1| Transcription factor Tfb2 family protein [Brugia malayi]
 gi|158593475|gb|EDP32070.1| Transcription factor Tfb2 family protein [Brugia malayi]
          Length = 465

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 47/120 (39%), Positives = 67/120 (55%), Gaps = 13/120 (10%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N  GFV+VETN+R+YAY+ S L   IL  F+++ Y+  ++ VG  ++E++  AF+ GITA
Sbjct: 296 NQNGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFSDMSVGVLSREAVRRAFQVGITA 355

Query: 102 EQ------QNAHP-------RVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
            Q      +NAHP            I S+P  V DQI+LWE +  R+    A  Y  F S
Sbjct: 356 AQIIAFLSRNAHPITCAVTTEYGRMIQSVPITVVDQIQLWEDERRRLTFCSAAVYSSFES 415


>gi|403308524|ref|XP_003944708.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Saimiri boliviensis boliviensis]
          Length = 1752

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 51/161 (31%), Positives = 78/161 (48%), Gaps = 44/161 (27%)

Query: 45  GFVVVETNFRMYAY--------------------------------------STSKLHYE 66
           GF+VVETN+R+YAY                                      S S+L   
Sbjct: 450 GFIVVETNYRLYAYTGEVGQRAPGRAGWGRGNARLEGVWVWGWPPHPLSIPGSESELQIA 509

Query: 67  ILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPEN 120
           ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q        AHP +  + P +P  
Sbjct: 510 LIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPT 569

Query: 121 VCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           + DQIRLWE + +R+  T    Y++F S+  FE    +AR+
Sbjct: 570 ITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARE 610


>gi|62088538|dbj|BAD92716.1| VARS2L protein variant [Homo sapiens]
          Length = 1653

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 51/161 (31%), Positives = 78/161 (48%), Gaps = 44/161 (27%)

Query: 45  GFVVVETNFRMYAY--------------------------------------STSKLHYE 66
           GF+VVETN+R+YAY                                      S S+L   
Sbjct: 352 GFIVVETNYRLYAYTGEAGQRAPGRGGWGHLKEGLEGVWVWGWPPLPLSIPGSESELQIA 411

Query: 67  ILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPEN 120
           ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q        AHP +  + P +P  
Sbjct: 412 LIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPT 471

Query: 121 VCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           + DQIRLWE + +R+  T    Y++F S+  FE    +AR+
Sbjct: 472 ITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARE 512


>gi|85691005|ref|XP_965902.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi GB-M1]
 gi|19068469|emb|CAD24937.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
 gi|449329766|gb|AGE96035.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi]
          Length = 414

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 42/118 (35%), Positives = 68/118 (57%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N + F++VETN ++YAY+ S+    ++ LF  +   LPNLI G  T+ES+N AF+ GIT 
Sbjct: 263 NRREFIIVETNNKIYAYTNSEYEKSVIHLFCNVTVNLPNLIKGIITEESVNVAFDKGITG 322

Query: 102 EQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYA 159
           +Q       + R  S+P  + +QI +WES  NR+ M P + Y  F +   ++   ++ 
Sbjct: 323 KQIIHFLEASVRQGSLPPAIRNQIVIWESKRNRIFMVPGYLYSNFLNLSDYQKVLEFC 380


>gi|390461382|ref|XP_002746360.2| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Callithrix jacchus]
          Length = 1675

 Score = 87.0 bits (214), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 51/162 (31%), Positives = 78/162 (48%), Gaps = 45/162 (27%)

Query: 45  GFVVVETNFRMYAY---------------------------------------STSKLHY 65
           GF+VVETN+R+YAY                                       S S+L  
Sbjct: 372 GFIVVETNYRLYAYTGEAGQRAPGRAGWGXHLKERFEGVWVWGWPPHPLSIPGSESELQI 431

Query: 66  EILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPE 119
            ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q        AHP +  + P +P 
Sbjct: 432 ALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPP 491

Query: 120 NVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
            + DQIRLWE + +R+  T    Y++F S+  FE    +AR+
Sbjct: 492 TITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARE 533


>gi|320587960|gb|EFX00435.1| tfiih and nucleotide excision repair factor 3 complexes subunit
           [Grosmannia clavigera kw1407]
          Length = 518

 Score = 87.0 bits (214), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 48/134 (35%), Positives = 75/134 (55%), Gaps = 12/134 (8%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G V++ETNFR+YAY+ S L   +L LFSK+  + P ++ G  T++S+  A   GIT++Q
Sbjct: 355 RGSVILETNFRLYAYTASPLQIAVLALFSKLSQRFPGMVAGKLTRDSVRRAISYGITSDQ 414

Query: 104 ------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                  NAH ++     A+  P +P  V DQIRLWE +  R++      + +F S   +
Sbjct: 415 IISYLAANAHEQMHKYAAANHRPVLPPTVVDQIRLWELENERMKSHKGFLFKDFDSAKEY 474

Query: 153 EAACDYARDRSGLL 166
           +    YA D  G+L
Sbjct: 475 DMLAQYA-DEIGVL 487


>gi|366997767|ref|XP_003683620.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
 gi|357521915|emb|CCE61186.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
          Length = 504

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Composition-based stats.
 Identities = 48/146 (32%), Positives = 76/146 (52%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE-- 102
           G +VVETNF++Y+YS S L   IL LF  +  +  N++ G  T+ES+ NA  NG+TA+  
Sbjct: 328 GALVVETNFKLYSYSNSPLQIAILSLFVHLRTRFSNMVTGQITRESIRNALANGVTADQI 387

Query: 103 ----QQNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
               Q +AHP++                   D++  +P  V DQI+LW+ +L+R+     
Sbjct: 388 IAYLQSHAHPQMRRLAETTLKKKLELDPNTKDQLQILPPTVVDQIKLWQLELDRILSYDG 447

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F +   +    +YA+D   LL
Sbjct: 448 SLYSDFDNNQEYNMLYNYAKDIGVLL 473


>gi|453082965|gb|EMF11011.1| RNA polymerase II transcription factor B subunit 2 [Mycosphaerella
           populorum SO2202]
          Length = 507

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/130 (33%), Positives = 74/130 (56%), Gaps = 11/130 (8%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ +G++++ETN+R+YAY++S L   IL LF+ +  + PNLI    TK S++ A  +GIT
Sbjct: 319 ASEKGYIILETNYRLYAYTSSPLPISILSLFASLNTRYPNLITAKITKTSIHTAIASGIT 378

Query: 101 AE------QQNAHP-----RVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 149
           ++      Q +AHP        +  P +P  V DQIRLW+ +  R++ T  +   +  + 
Sbjct: 379 SDQIISYLQTHAHPILRRQAALNSAPILPPTVVDQIRLWQIEGERMKSTKGYLIRDVGTE 438

Query: 150 DVFEAACDYA 159
           D +  A  YA
Sbjct: 439 DDYTKAVQYA 448


>gi|402592023|gb|EJW85952.1| transcription factor Tfb2 family protein [Wuchereria bancrofti]
          Length = 470

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 50/138 (36%), Positives = 72/138 (52%), Gaps = 13/138 (9%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N  GFV+VETN+R+YAY+ S L   IL  F+++ Y+  ++ VG  ++E++  AF+ GITA
Sbjct: 301 NQHGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITA 360

Query: 102 EQ------QNAHP-------RVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
            Q       NAHP            I S+P  V DQI+LWE +  R+    A  Y  F S
Sbjct: 361 AQIIAFLSTNAHPITYAVTTEYGRMIQSVPITVADQIQLWEDERRRLTFCNATVYSSFES 420

Query: 149 RDVFEAACDYARDRSGLL 166
              +     Y   ++ LL
Sbjct: 421 EKEYFGLKGYVISQNILL 438


>gi|45187599|ref|NP_983822.1| ADL274Wp [Ashbya gossypii ATCC 10895]
 gi|62901343|sp|Q75B51.1|TFB2_ASHGO RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|44982337|gb|AAS51646.1| ADL274Wp [Ashbya gossypii ATCC 10895]
 gi|374107034|gb|AEY95942.1| FADL274Wp [Ashbya gossypii FDAG1]
          Length = 514

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 76/146 (52%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   IL LF  ++ +  N++ G  T+ES+  A  NGITA+Q 
Sbjct: 339 GALIVETNFKLYSYSNSPLQIAILSLFIHLKTRFQNMVTGQITRESIRRALHNGITADQI 398

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   D +  +P  V DQI+LW+ +L+R+     
Sbjct: 399 IAYMETHAHPQMRRLAGDNLEKKLELDPNCRDTLQVLPPTVVDQIKLWQLELDRIISYDG 458

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
           + + +F +   ++    YARD   LL
Sbjct: 459 YLFRDFDNLQEYQVLAQYARDIGVLL 484


>gi|193633122|ref|XP_001945987.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Acyrthosiphon pisum]
          Length = 471

 Score = 85.5 bits (210), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 46/135 (34%), Positives = 76/135 (56%), Gaps = 8/135 (5%)

Query: 38  EYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFEN 97
           E + N + ++VVETNFR+YA + S L   ++ LF+ + Y+ PN+  G  T++S+  A  +
Sbjct: 307 ERMKNKERYIVVETNFRIYAMTDSDLKVALVALFTHMLYRFPNMSAGILTRDSVRTALRS 366

Query: 98  GITAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
           GITA Q       + HP++ +    +P+ V DQI LWE++ NR+  T    Y    + + 
Sbjct: 367 GITAAQIVRFLTVHTHPQMQE--CGMPQTVIDQIYLWENERNRLTYTDGVLYSNINTPND 424

Query: 152 FEAACDYARDRSGLL 166
           +E   +YA D   L+
Sbjct: 425 YETIKNYAADIGALV 439


>gi|301786953|ref|XP_002928911.1| PREDICTED: LOW QUALITY PROTEIN: valyl-tRNA synthetase,
           mitochondrial-like [Ailuropoda melanoleuca]
          Length = 1637

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 50/169 (29%), Positives = 80/169 (47%), Gaps = 48/169 (28%)

Query: 41  SNSQGFVVVETNFRMYAYS----------------------------------------- 59
           ++  GF+VVETN+R+YAY+                                         
Sbjct: 325 AHQPGFIVVETNYRLYAYTGEARQRAPNSRXGVGTFLVTLEGLEGVWVWGXPSSSLPIPG 384

Query: 60  -TSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVAD 112
             S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q        AHP +  
Sbjct: 385 PESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLK 444

Query: 113 RIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           + P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +AR+
Sbjct: 445 QTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARE 493


>gi|401623322|gb|EJS41426.1| tfb2p [Saccharomyces arboricola H-6]
          Length = 513

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 48/146 (32%), Positives = 75/146 (51%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G  T+ES+  A  NGITAEQ 
Sbjct: 337 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITAEQI 396

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   + +  +P  V DQIRLW+ +L+RV     
Sbjct: 397 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVMTYEG 456

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F +   +     YA+D   LL
Sbjct: 457 SLYSDFETSQEYNVLNKYAQDIGVLL 482


>gi|320581720|gb|EFW95939.1| TFIIH basal transcription factor complex subunit 2 [Ogataea
           parapolymorpha DL-1]
          Length = 544

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 48/152 (31%), Positives = 77/152 (50%), Gaps = 31/152 (20%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           SN+QG +VVETNF++Y Y++S L   IL LF  ++ +  N++ G  T+ES+  A  NGIT
Sbjct: 357 SNNQGTIVVETNFKIYCYTSSPLQIAILNLFVHLKARFANMVTGVITRESVRRALINGIT 416

Query: 101 AEQ------QNAHP----RVADRIPS---------------------IPENVCDQIRLWE 129
           A+Q       +AHP    +  D++                       +P  V DQI+LW+
Sbjct: 417 ADQMINYLGSHAHPWMVKQAEDKLARKLELESSIGNAAGQRQIDLEILPPTVVDQIKLWQ 476

Query: 130 SDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
            +L+R++    + Y +F +   FE    Y  +
Sbjct: 477 LELDRIQSFKGYLYKDFSTDLEFEKLLTYGEE 508


>gi|429965574|gb|ELA47571.1| hypothetical protein VCUG_00894 [Vavraia culicis 'floridensis']
          Length = 424

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 75/134 (55%), Gaps = 1/134 (0%)

Query: 33  WYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLN 92
           ++++    S    F+VVETN+++YAY+TS     I++LF +I  +LPNL+V   T+ES+N
Sbjct: 264 YFLLFSSASTQDSFLVVETNYKLYAYTTSSHELSIIKLFCQIIRELPNLVVAHITEESVN 323

Query: 93  NAFENGITAEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
            AF  GIT +Q   +     R   +P  V +QI +WE   +R++   A  Y  F +   +
Sbjct: 324 AAFVKGITGQQIVDYLNEKSR-SELPVVVLEQILIWERKRDRMKCMDAVIYSHFMTYGEY 382

Query: 153 EAACDYARDRSGLL 166
           E    Y +++  L+
Sbjct: 383 EITYKYCKEKGALV 396


>gi|193654859|ref|XP_001944632.1| PREDICTED: general transcription factor IIH subunit 4-like
           [Acyrthosiphon pisum]
          Length = 470

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 74/133 (55%), Gaps = 8/133 (6%)

Query: 40  ISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGI 99
           +   + ++VVETNFR+YA + S L   ++ LF+ + Y+ PN+  G  T++S+  A  NGI
Sbjct: 308 MKKKERYIVVETNFRIYAMTDSDLKVALVALFTHMLYRFPNMSAGILTRDSVQTALRNGI 367

Query: 100 TAEQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           TA Q       + HP++ +    +P+ V DQI LWE + NR+       Y +  + + +E
Sbjct: 368 TAAQIVRFLTVHTHPQMQE--CGMPQTVIDQIYLWEYERNRLTYRDGVLYSDINTPNDYE 425

Query: 154 AACDYARDRSGLL 166
           A  +YA D   LL
Sbjct: 426 AIKNYAADIGALL 438


>gi|397471696|ref|XP_003807420.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
           [Pan paniscus]
          Length = 1673

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 51/168 (30%), Positives = 78/168 (46%), Gaps = 51/168 (30%)

Query: 45  GFVVVETNFRMYAY---------------------------------------------S 59
           GF+VVETN+R+YAY                                             S
Sbjct: 365 GFIVVETNYRLYAYTGEAGQRAPGRGGWGALXHFSXHLKEGLEGVWVWGCPPHPLSIPGS 424

Query: 60  TSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADR 113
            S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q        AHP +  +
Sbjct: 425 ESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQ 484

Query: 114 IPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
            P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +AR+
Sbjct: 485 TPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARE 532


>gi|410082501|ref|XP_003958829.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
 gi|372465418|emb|CCF59694.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
          Length = 514

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 46/141 (32%), Positives = 72/141 (51%), Gaps = 24/141 (17%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G +++ETNF++Y+YS S L   IL LF  ++ +  N++ G  T+ES+  A  NGITA+Q 
Sbjct: 338 GSLIIETNFKLYSYSNSPLQIAILSLFVHLKSRFVNMVTGQITRESIRRALVNGITADQI 397

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHPR+                   D +  +P  V DQI+LW+ +L+RV     
Sbjct: 398 IAYLETHAHPRMRRLAEEKLEKKLELDPNSKDTLQILPPTVVDQIKLWQLELDRVITYDG 457

Query: 141 HYYDEFPSRDVFEAACDYARD 161
             Y +F +   +     YA+D
Sbjct: 458 SLYSDFENSTEYNMLSKYAQD 478


>gi|403217068|emb|CCK71563.1| hypothetical protein KNAG_0H01490 [Kazachstania naganishii CBS
           8797]
          Length = 519

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 49/146 (33%), Positives = 74/146 (50%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   IL LF  ++ +  N++ G  TK S+ NA +NGITAEQ 
Sbjct: 343 GVLIVETNFKVYSYSNSPLQIAILSLFVHLKSRFINMVTGQITKRSIINALDNGITAEQI 402

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   D +  +P  V DQI+LW+ + +RV     
Sbjct: 403 IAYLETHAHPQMRRLAEAQLEKKRELDQNCKDPLQILPPTVVDQIKLWQLEKDRVMAFEG 462

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F S   +     YA+D   +L
Sbjct: 463 SLYSDFESNAEYNILKKYAQDIGVML 488


>gi|46127843|ref|XP_388475.1| hypothetical protein FG08299.1 [Gibberella zeae PH-1]
 gi|408390907|gb|EKJ70292.1| hypothetical protein FPSE_09509 [Fusarium pseudograminearum CS3096]
          Length = 494

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 75/133 (56%), Gaps = 12/133 (9%)

Query: 41  SNSQ-GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGI 99
           SN Q G V++ETN+R+YAY+ S L   +L LF+K++ + P+++ G  +++S+  A   GI
Sbjct: 329 SNEQRGSVIIETNYRIYAYTQSTLQIAVLALFTKLQMRFPDMVAGRISRQSIRQAINFGI 388

Query: 100 TAEQ------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
           TAEQ       +AH ++      +  P +P  V DQIRLW+ +  R++ T    + +F  
Sbjct: 389 TAEQIISYLSAHAHDQMRRTASLNNKPVLPPTVVDQIRLWQLENERMKTTGGFLFKDFED 448

Query: 149 RDVFEAACDYARD 161
              + A   +A +
Sbjct: 449 HKEYMAVAGFAEE 461


>gi|300707677|ref|XP_002996037.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
 gi|239605297|gb|EEQ82366.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
          Length = 403

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 38/101 (37%), Positives = 63/101 (62%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQN 105
           F+++ETNF++YAY++S     I++LFS I  ++PNLI  + T+ESL+NAF  G+T++Q  
Sbjct: 254 FILLETNFKLYAYTSSVYEMSIIQLFSNIYLKMPNLIKASITEESLSNAFSKGVTSQQII 313

Query: 106 AHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
              +       IP  +  QI +WE+   R+++ P + Y  F
Sbjct: 314 NFLKSYSLFEDIPVAIISQIIIWETKRKRIKIFPGYLYSNF 354


>gi|349581695|dbj|GAA26852.1| K7_Tfb2p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 513

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 75/146 (51%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G  T+ES+  A  NGITA+Q 
Sbjct: 337 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 396

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   + +  +P  V DQIRLW+ +L+RV     
Sbjct: 397 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 456

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F +   +     YA+D   LL
Sbjct: 457 SLYSDFETSQEYNLLSKYAQDIGVLL 482


>gi|6325135|ref|NP_015203.1| Tfb2p [Saccharomyces cerevisiae S288c]
 gi|62901134|sp|Q02939.1|TFB2_YEAST RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=General transcription and DNA repair
           factor IIH subunit TFB2; Short=TFIIH subunit TFB2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|1163092|gb|AAB68240.1| Tfb2p: TFIIH subunit [Saccharomyces cerevisiae]
 gi|1778059|gb|AAB40628.1| transcription/repair factor TFIIH subunit Tfb2, partial
           [Saccharomyces cerevisiae]
 gi|151942676|gb|EDN61022.1| transcription initiation factor TFIIH subunit [Saccharomyces
           cerevisiae YJM789]
 gi|190407835|gb|EDV11100.1| RNA polymerase II transcription factor B subunit 2 [Saccharomyces
           cerevisiae RM11-1a]
 gi|256274256|gb|EEU09164.1| Tfb2p [Saccharomyces cerevisiae JAY291]
 gi|259150035|emb|CAY86838.1| Tfb2p [Saccharomyces cerevisiae EC1118]
 gi|285815419|tpg|DAA11311.1| TPA: Tfb2p [Saccharomyces cerevisiae S288c]
 gi|323331200|gb|EGA72618.1| Tfb2p [Saccharomyces cerevisiae AWRI796]
 gi|323335024|gb|EGA76314.1| Tfb2p [Saccharomyces cerevisiae Vin13]
 gi|323346175|gb|EGA80465.1| Tfb2p [Saccharomyces cerevisiae Lalvin QA23]
 gi|323352000|gb|EGA84539.1| Tfb2p [Saccharomyces cerevisiae VL3]
 gi|365762780|gb|EHN04313.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|392295887|gb|EIW06990.1| Tfb2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 513

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 75/146 (51%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G  T+ES+  A  NGITA+Q 
Sbjct: 337 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 396

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   + +  +P  V DQIRLW+ +L+RV     
Sbjct: 397 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 456

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F +   +     YA+D   LL
Sbjct: 457 SLYSDFETSQEYNLLSKYAQDIGVLL 482


>gi|164425579|ref|XP_960485.2| hypothetical protein NCU05523 [Neurospora crassa OR74A]
 gi|157070982|gb|EAA31249.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 475

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 71/126 (56%), Gaps = 11/126 (8%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           +++ETN+R+YAY+TS L   +L LF ++  + PN++ G  T++S+  A   GITA+Q   
Sbjct: 317 IIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRLTRDSIRQAIGFGITADQIIS 376

Query: 104 ---QNAHPRVADRI-----PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
               +AH ++         P +P  V DQIRLW+ +  R++ T    + +F S + + A 
Sbjct: 377 YLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERMQTTAGFLFKDFESTEEYAAL 436

Query: 156 CDYARD 161
             YA +
Sbjct: 437 SRYAEE 442


>gi|324512881|gb|ADY45318.1| General transcription factor IIH subunit 4 [Ascaris suum]
          Length = 481

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/135 (34%), Positives = 71/135 (52%), Gaps = 13/135 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G+++VETN+R+YAY+ S L   IL  F+++ Y+  ++ VG  T++S+  A + GITA Q 
Sbjct: 317 GYIIVETNYRVYAYTDSSLQLAILSTFTEMLYRFNDMSVGMLTRDSVRRALQVGITASQI 376

Query: 104 -----QNAHPRVA-------DRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
                 NAHP            I  +P  V DQIRLWE + +R+    +  Y  F S   
Sbjct: 377 ISFLRANAHPETVAAANASPGVIHCVPVTVADQIRLWEDERHRLVFFDSALYSTFESERE 436

Query: 152 FEAACDYARDRSGLL 166
           +    +Y R +  LL
Sbjct: 437 YVGVKEYTRSQDILL 451


>gi|448527133|ref|XP_003869438.1| TFIIH and nucleotide excision repair factor 3 complex subunit
           [Candida orthopsilosis Co 90-125]
 gi|380353791|emb|CCG23303.1| TFIIH and nucleotide excision repair factor 3 complex subunit
           [Candida orthopsilosis]
          Length = 500

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 46/168 (27%), Positives = 84/168 (50%), Gaps = 42/168 (25%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           SNS+  +++ETNF++YAY+ S L   IL LF +++ +  N++ G  T+ES+ NA  NGIT
Sbjct: 304 SNSRESIIIETNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCGQITRESIRNALYNGIT 363

Query: 101 AEQ------QNAHPRV------------------------------------ADRIPSIP 118
           ++Q       +AHP++                                      ++  IP
Sbjct: 364 SDQIIKFLETHAHPQMRALAKEKLDKKVEFDASHNINTAGGAPQSKTDGAISQHKLEVIP 423

Query: 119 ENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
            NV DQI+LW+ +L+R++    + + +F ++  ++   +YA +   L+
Sbjct: 424 PNVVDQIKLWQLELDRIQTVEGYLFKDFANQQEYDTLSNYATELGVLV 471


>gi|323302721|gb|EGA56527.1| Tfb2p [Saccharomyces cerevisiae FostersB]
          Length = 505

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 75/146 (51%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G  T+ES+  A  NGITA+Q 
Sbjct: 337 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 396

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   + +  +P  V DQIRLW+ +L+RV     
Sbjct: 397 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 456

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F +   +     YA+D   LL
Sbjct: 457 SLYSDFETSQEYNLLSKYAQDIGVLL 482


>gi|336466164|gb|EGO54329.1| hypothetical protein NEUTE1DRAFT_87562 [Neurospora tetrasperma FGSC
           2508]
 gi|350286986|gb|EGZ68233.1| transcription factor Tfb2 [Neurospora tetrasperma FGSC 2509]
          Length = 475

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 71/126 (56%), Gaps = 11/126 (8%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           +++ETN+R+YAY+TS L   +L LF ++  + PN++ G  T++S+  A   GITA+Q   
Sbjct: 317 IIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSGRLTRDSIRQAIGFGITADQIIS 376

Query: 104 ---QNAHPRVADRI-----PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAA 155
               +AH ++         P +P  V DQIRLW+ +  R++ T    + +F S + + A 
Sbjct: 377 YLHAHAHQQMVREATVTGKPVLPPTVVDQIRLWQLENERMQTTAGFLFKDFESTEEYAAL 436

Query: 156 CDYARD 161
             YA +
Sbjct: 437 SRYAEE 442


>gi|452824378|gb|EME31381.1| transcription initiation factor TFIIH subunit H4 [Galdieria
           sulphuraria]
          Length = 740

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 65/124 (52%), Gaps = 11/124 (8%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE---- 102
           ++V+TNFR+Y Y+       +L LF +  Y+LPN+ VG  T++S+  A  NGITA+    
Sbjct: 328 MIVQTNFRVYVYTNCTFQISLLSLFIQFLYRLPNMAVGVITRDSIRTALNNGITAQQMIS 387

Query: 103 --QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYAR 160
             Q + HP +  ++P     + DQIRLWE+   RV    A   D F +   FE    Y  
Sbjct: 388 YLQNHMHPNMKGKLPI---TIIDQIRLWEAQRFRVTTKHALLLDHFDNMTCFEKT--YGT 442

Query: 161 DRSG 164
            R G
Sbjct: 443 TRIG 446


>gi|323306981|gb|EGA60265.1| Tfb2p [Saccharomyces cerevisiae FostersO]
          Length = 448

 Score = 83.6 bits (205), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 75/146 (51%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G  T+ES+  A  NGITA+Q 
Sbjct: 288 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 347

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   + +  +P  V DQIRLW+ +L+RV     
Sbjct: 348 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 407

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F +   +     YA+D   LL
Sbjct: 408 SLYSDFETSQEYNLLSKYAQDIGVLL 433


>gi|440493551|gb|ELQ76006.1| RNA polymerase II transcription initiation/nucleotide excision
           repair factor TFIIH, subunit TFB2 [Trachipleistophora
           hominis]
          Length = 426

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 43/122 (35%), Positives = 71/122 (58%), Gaps = 1/122 (0%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQ 104
            F++VETN+++YAY+TS     I++LFS+I  +LPNL+    T+ES+N AF  GIT +Q 
Sbjct: 278 SFLIVETNYKLYAYTTSSHELSIIKLFSQIVRELPNLVTAHITEESVNAAFLKGITGQQI 337

Query: 105 NAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSG 164
             +     +   +P  V +QI +WE   +R++   A  Y  F + + +E    Y R++  
Sbjct: 338 VDYLTEKSK-SELPPVVLEQILIWERQRDRMKCIDAVIYSHFMTYNEYEITYRYCREKGA 396

Query: 165 LL 166
           L+
Sbjct: 397 LI 398


>gi|260945437|ref|XP_002617016.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
 gi|238848870|gb|EEQ38334.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
          Length = 511

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 46/176 (26%), Positives = 85/176 (48%), Gaps = 43/176 (24%)

Query: 34  YIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNN 93
           ++  E   +  G +++ETNF++YAY+ S L   IL LF  ++ +  N+++G  T+ES+  
Sbjct: 308 HVTSEAGESQSGQIILETNFKLYAYTNSPLEIAILNLFVNLKVRFANMVIGQITRESIRK 367

Query: 94  AFENGITAEQ------QNAHPRV------------------------------------- 110
           A  NGITA+Q       +AH ++                                     
Sbjct: 368 ALYNGITADQIIKFLETHAHSQMRTSAKEKLDKKIEFDASHNINTAGGAPQSKSSDSGVA 427

Query: 111 ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
             R+  +P NV DQI+LW+ +L+R++    + + +F +++ ++  C+YA +   LL
Sbjct: 428 QHRLEILPPNVVDQIKLWQLELDRIQTFEGYLFKDFKNQNEYDVLCNYATEVGVLL 483


>gi|429962184|gb|ELA41728.1| hypothetical protein VICG_01232 [Vittaforma corneae ATCC 50505]
          Length = 449

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 68/124 (54%), Gaps = 2/124 (1%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S  F+V+ETNF++YAY+++     +L LFSK  Y  PNLI     +ESL +AF  GITA+
Sbjct: 300 SNKFIVLETNFKIYAYTSTAYDKSVLSLFSKTVYVFPNLIKACFDEESLLSAFNKGITAK 359

Query: 103 QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDR 162
           Q   +  + +    +P+N+ +QI +WE   +R+     + Y +F     F     Y   +
Sbjct: 360 QIIKY--LQEHSEEVPKNIVNQISIWEHRQHRIRARNGYLYHDFIHLSDFHRVLRYVESK 417

Query: 163 SGLL 166
            GL+
Sbjct: 418 GGLI 421


>gi|430812024|emb|CCJ30551.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 319

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 44/107 (41%), Positives = 59/107 (55%), Gaps = 6/107 (5%)

Query: 61  SKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRI 114
           S L   +L LF  +  +  NL+VG  T+ S+  AF NGITAEQ        AHP++    
Sbjct: 178 SPLQIAVLNLFVHLYLRFSNLVVGVITRNSVRQAFMNGITAEQIISYLTSYAHPQMKKNT 237

Query: 115 PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           P IP  V DQIRLWE + NR++ T  + + +F S   FE A  YA +
Sbjct: 238 PIIPPTVNDQIRLWEMERNRLKATEGYLFRDFNSNSDFELALKYANE 284


>gi|207340606|gb|EDZ68904.1| YPL122Cp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 226

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 75/146 (51%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G  T+ES+  A  NGITA+Q 
Sbjct: 50  GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQI 109

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   + +  +P  V DQIRLW+ +L+RV     
Sbjct: 110 IAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEG 169

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F +   +     YA+D   LL
Sbjct: 170 SLYSDFETSQEYNLLSKYAQDIGVLL 195


>gi|294660113|ref|XP_002777731.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
 gi|218511847|sp|Q6BGW8.2|TFB2_DEBHA RecName: Full=RNA polymerase II transcription factor B subunit 2;
           AltName: Full=RNA polymerase II transcription factor B
           52 kDa subunit; AltName: Full=RNA polymerase II
           transcription factor B p52 subunit
 gi|199434471|emb|CAR66042.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
          Length = 515

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 83/174 (47%), Gaps = 48/174 (27%)

Query: 41  SNS-QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGI 99
           SNS QG +++ETNF++YAY+ S L   IL LF  ++ +  N++ G  T+ES+ NA  NGI
Sbjct: 313 SNSNQGTIILETNFKIYAYTNSPLEIAILNLFVHLKTRFSNMVCGQITRESIRNALYNGI 372

Query: 100 TAEQ------QNAHPRVA-----------------------------------------D 112
           TA+Q       +AH ++                                           
Sbjct: 373 TADQIIKFLETHAHSQMKILAKERLDKKIEFDTSNNINTAGGAPQSQMMTNENGTTVAQH 432

Query: 113 RIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           ++  +P  V DQI+LW+ +L+R++    + + +F S+  FEA  +YA +   LL
Sbjct: 433 KLEVLPPTVVDQIKLWQLELDRIQTFEGYLFKDFASQQEFEALSNYASEIGVLL 486


>gi|302917488|ref|XP_003052448.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256733388|gb|EEU46735.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 496

 Score = 82.4 bits (202), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 72/129 (55%), Gaps = 11/129 (8%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           +G V++ETN+R+YAY+ S L   +L LFSK+  + P+++ G  +++S+  A   GITA+Q
Sbjct: 335 RGSVIIETNYRIYAYTQSTLQIAVLALFSKLSMRFPDMVAGRLSRQSIRQAINFGITADQ 394

Query: 104 ------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                  +AH ++      +  P +P  V DQIRLW+ +  R++ T    + +F     +
Sbjct: 395 IISYLAAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTSGFLFKDFEDHKEY 454

Query: 153 EAACDYARD 161
            A   +A +
Sbjct: 455 MAVAGFAEE 463


>gi|444320663|ref|XP_004180988.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
 gi|387514031|emb|CCH61469.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
          Length = 518

 Score = 82.4 bits (202), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 47/154 (30%), Positives = 77/154 (50%), Gaps = 28/154 (18%)

Query: 41  SNSQ----GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFE 96
           SN+Q    G +++ETNF++Y+YS S L   +L LF  ++ +  N++ G  T++S+  A  
Sbjct: 321 SNTQATADGALIIETNFKLYSYSNSPLQIAVLSLFVHLKTRFSNMVTGQITRDSIRRALR 380

Query: 97  NGITAEQ------QNAHPRV------------------ADRIPSIPENVCDQIRLWESDL 132
           NGITAEQ       ++HP++                   D +  +P  V DQI+LW+ +L
Sbjct: 381 NGITAEQIIAYLLTHSHPQMRRLAEGNLEKKLELDPNAKDSLQILPPTVVDQIKLWQLEL 440

Query: 133 NRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           +R+       Y +F +   +     YA D   LL
Sbjct: 441 DRIISHEGSLYSDFENNQEYNLLSTYAEDIGVLL 474


>gi|149238950|ref|XP_001525351.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
           elongisporus NRRL YB-4239]
 gi|146450844|gb|EDK45100.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
           elongisporus NRRL YB-4239]
          Length = 503

 Score = 82.0 bits (201), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 76/155 (49%), Gaps = 42/155 (27%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           +++ETNF+MYAY+ S L   IL LF  ++ +  N+I G  T+ES+ NA  NGITA+Q   
Sbjct: 314 IIIETNFKMYAYTNSPLEIAILNLFVHMKTRFSNMICGQITRESIRNALYNGITADQIIK 373

Query: 104 ---QNAHPRV------------------------------------ADRIPSIPENVCDQ 124
               +AHP++                                      R+  +P NV DQ
Sbjct: 374 FLETHAHPQMRALAKERLDKKVEFDASHNINTAGGAPQSKTDGAISQHRLEVLPPNVVDQ 433

Query: 125 IRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYA 159
           I+LW+ +L+R++    + + +F ++  ++   +YA
Sbjct: 434 IKLWQLELDRIQTFDGYLFKDFANQQEYDMLSNYA 468


>gi|340939354|gb|EGS19976.1| RNA polymerase II transcription factor B subunit 2-like protein
           [Chaetomium thermophilum var. thermophilum DSM 1495]
          Length = 514

 Score = 82.0 bits (201), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 73/132 (55%), Gaps = 11/132 (8%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S  +G +++ETN+R+YAY++S L   +L LF+ +  +   ++ G  T+ES+  A   GIT
Sbjct: 350 SAHKGSIIIETNYRLYAYTSSPLQIAVLALFTHLNMRFAGMVTGRLTRESIRRAISFGIT 409

Query: 101 AEQ------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 149
           A+Q       +AH ++     A   P +P  V DQIRLW+ +  R+  +P   + +F + 
Sbjct: 410 ADQIISYLASHAHEQMVRAAAAAGRPVLPPTVVDQIRLWQLENERMRTSPGFLFKDFENV 469

Query: 150 DVFEAACDYARD 161
           + + A   YA +
Sbjct: 470 EEYMALAGYAEE 481


>gi|255713706|ref|XP_002553135.1| KLTH0D09790p [Lachancea thermotolerans]
 gi|238934515|emb|CAR22697.1| KLTH0D09790p [Lachancea thermotolerans CBS 6340]
          Length = 506

 Score = 82.0 bits (201), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 77/146 (52%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G V++ETNF++Y+YS S L   +L LF  ++ +  N++ G  T+ES+  A  NGITA+Q 
Sbjct: 331 GAVILETNFKLYSYSNSPLQIAVLSLFVHLKSRFSNMVTGQITRESIRRALHNGITADQI 390

Query: 104 -----QNAHP---RVADR---------------IPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP   R+A++               +  +P  V DQI+LW+ +L+R+     
Sbjct: 391 IAYMETHAHPQMRRLAEQTLDKKMELDPNCNEGLQILPPTVVDQIKLWQLELDRIISYDG 450

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
           + + +F +   +     YA+D   LL
Sbjct: 451 YLFTDFENFQEYNMLSSYAKDIGVLL 476


>gi|302405781|ref|XP_003000727.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
           albo-atrum VaMs.102]
 gi|261360684|gb|EEY23112.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
           albo-atrum VaMs.102]
          Length = 476

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 44/125 (35%), Positives = 71/125 (56%), Gaps = 18/125 (14%)

Query: 40  ISNSQGF-------VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLN 92
           +S SQG        ++VETN+R+YAY+TS+L   +L LF  +  + P ++ G  T++S+ 
Sbjct: 305 LSQSQGGPGDQTNGIIVETNYRVYAYTTSQLQIAVLSLFCHLSVKFPGMVSGRLTRQSVR 364

Query: 93  NAFENGITAEQ------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAH 141
            A + GITA+Q       +AH ++     AD  P +P  V DQIRLW+ +  R++ T   
Sbjct: 365 QAIDFGITADQIISYLAAHAHEQMHRQAAADGKPVLPPTVVDQIRLWQLENERMKTTSGF 424

Query: 142 YYDEF 146
            + +F
Sbjct: 425 LFRDF 429


>gi|254584224|ref|XP_002497680.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
 gi|238940573|emb|CAR28747.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
          Length = 509

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 46/141 (32%), Positives = 72/141 (51%), Gaps = 24/141 (17%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   IL LF  ++ +  N++ G  T+ES+  A  NGITA+Q 
Sbjct: 334 GALIVETNFKLYSYSNSPLQIAILSLFVHMKSRFSNMVTGQLTRESIRQALLNGITADQI 393

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   D +  +P  V DQI+LW+ +L+R+     
Sbjct: 394 IAYLETHAHPQMRRMAEEQLEKRLELDPNSKDPLQILPPTVVDQIKLWQLELDRIISFEG 453

Query: 141 HYYDEFPSRDVFEAACDYARD 161
             Y +F +   F     YA+D
Sbjct: 454 SLYSDFENHQEFTLLSSYAQD 474


>gi|448103836|ref|XP_004200137.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
 gi|359381559|emb|CCE82018.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
          Length = 512

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/174 (27%), Positives = 82/174 (47%), Gaps = 45/174 (25%)

Query: 38  EYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFEN 97
           +  SN+ G +++ETNF++YAY+ S L   IL LF  ++ + PN++ G  T+ES+  A  N
Sbjct: 310 QMFSNTGGSIILETNFKIYAYTNSPLEIAILNLFVHLKTRFPNMVTGQITRESIRKALFN 369

Query: 98  GITAEQ------QNAHPRVAD--------------------------------------- 112
           GITA+Q       +AH ++                                         
Sbjct: 370 GITADQIIKFLESHAHSQMKQMAKENLDKKIEFDANNNINTATDNQQLSVGEFGSANSYH 429

Query: 113 RIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           ++  +P  V DQI+LW+ +L+R++    + + +F S+  ++A   YA +   LL
Sbjct: 430 KLEVLPPTVVDQIKLWQLELDRIQTFEGYLFKDFSSQQEYDALRSYASELGVLL 483


>gi|354546081|emb|CCE42810.1| hypothetical protein CPAR2_204530 [Candida parapsilosis]
          Length = 451

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 44/163 (26%), Positives = 81/163 (49%), Gaps = 42/163 (25%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S S+  +++ETNF++YAY+ S L   IL LF +++ +  N++ G  T+ES+ NA  NGIT
Sbjct: 255 STSRESIIIETNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCGQITRESIRNALYNGIT 314

Query: 101 AEQ------QNAHPRVA------------------------------------DRIPSIP 118
           ++Q       +AHP++                                      ++  IP
Sbjct: 315 SDQIIKFLETHAHPQMKALAKEKLDKKVEFDASHNINTAGGAPQSKTDGTISQHKLEVIP 374

Query: 119 ENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
            NV DQI+LW+ +L+R++    + + +F ++  ++   +YA +
Sbjct: 375 PNVVDQIKLWQLELDRIQTVEGYLFKDFANQQEYDMLSNYATE 417


>gi|367009216|ref|XP_003679109.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
 gi|359746766|emb|CCE89898.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
          Length = 513

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/146 (32%), Positives = 73/146 (50%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE-- 102
           G ++VETNF++Y+YS S L   IL LF  ++ +  N++ G  T+ES+  A  NGITA+  
Sbjct: 334 GALIVETNFKLYSYSNSPLQIAILSLFVHLKSRFTNMVTGQITRESIRRALINGITADQI 393

Query: 103 ----QQNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
               Q +AHP++                   D +  +P  V DQI+LW+ +L+R+     
Sbjct: 394 IAYLQTHAHPQMRRLAEEKLEKKLELDANCKDTLQILPPTVVDQIKLWQLELDRIISYDG 453

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +  S   +     YA+D   LL
Sbjct: 454 SLYSDIDSHQEYILLSTYAQDIGVLL 479


>gi|346972761|gb|EGY16213.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
           dahliae VdLs.17]
          Length = 476

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/111 (36%), Positives = 66/111 (59%), Gaps = 11/111 (9%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++VETN+R+YAY+TS+L   +L LF  +  + P ++ G  T++S+  A + GITA+Q   
Sbjct: 319 IIVETNYRVYAYTTSQLQIAVLSLFCHLSVKFPGMVSGRLTRQSVRQAIDFGITADQIIS 378

Query: 104 ---QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
               +AH ++     AD  P +P  V DQIRLW+ +  R++ T    + +F
Sbjct: 379 YLAAHAHEQMHRQAAADGKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDF 429


>gi|428673252|gb|EKX74165.1| conserved hypothetical protein [Babesia equi]
          Length = 668

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 38/119 (31%), Positives = 74/119 (62%), Gaps = 4/119 (3%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNA 106
           ++V++NF++Y Y+ + L ++IL +  +++ + PN++VG  T+ES   AF++GIT+  +  
Sbjct: 494 IIVQSNFKVYVYTVNNLQFDILNILCEVQARTPNMVVGVLTRESAQRAFKSGITS-HEII 552

Query: 107 HPRVADRIPSIPENVCDQIRLWESDLNRVEMTPA---HYYDEFPSRDVFEAACDYARDR 162
               +    + PENV  Q+R+WE++ NRVE++PA     +D     D+F+    +A+ +
Sbjct: 553 RFFSSTNTSTFPENVIRQLRMWEAERNRVELSPAILIKRWDREFLPDLFQRTVRWAQSK 611


>gi|366991461|ref|XP_003675496.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
 gi|342301361|emb|CCC69129.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
          Length = 516

 Score = 81.3 bits (199), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 73/146 (50%), Gaps = 24/146 (16%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G +++ETNF++Y+YS S L    L LF  ++ +  N++ G  T+ES+  A  NGITA+Q 
Sbjct: 341 GSLIIETNFKLYSYSNSPLQIATLSLFVHLKSRFANMVTGQITRESIRRALINGITADQI 400

Query: 104 -----QNAHPRV------------------ADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
                 +AHP++                   D +  +P  V DQI+LW+ +L+RV     
Sbjct: 401 IAYLETHAHPQMRRLAEERLEKKLELDPNSKDPLQVLPPTVVDQIKLWQLELDRVITYEG 460

Query: 141 HYYDEFPSRDVFEAACDYARDRSGLL 166
             Y +F +   +     YA+D   LL
Sbjct: 461 SLYSDFETIAEYTTLSKYAQDIGVLL 486


>gi|448100135|ref|XP_004199281.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
 gi|359380703|emb|CCE82944.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
          Length = 509

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 47/174 (27%), Positives = 83/174 (47%), Gaps = 45/174 (25%)

Query: 38  EYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFEN 97
           +  SN+ G +++ETNF++YAY+ S L   IL LF  ++ + PN++ G  T+ES+  A  N
Sbjct: 310 QMFSNTGGSIILETNFKIYAYTNSPLEIAILNLFVHLKTRFPNMVTGQITRESIRKALFN 369

Query: 98  GITAEQ------QNAHPRVAD--------------------------------------- 112
           GITA+Q       +AH ++                                         
Sbjct: 370 GITADQIIKFLESHAHSQMKQMAKENLDKKIEFDANNNINTATDSQQLSVGEFGSASSYH 429

Query: 113 RIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           ++  +P  V DQI+LW+ +L+R++    + + +F S+  ++A  +YA +   LL
Sbjct: 430 KLEILPPTVVDQIKLWQLELDRIQTFEGYLFKDFSSQQEYDALRNYASELGVLL 483


>gi|342884444|gb|EGU84659.1| hypothetical protein FOXB_04847 [Fusarium oxysporum Fo5176]
          Length = 496

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 68/117 (58%), Gaps = 11/117 (9%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +  +G V++ETN+R+YAY+ S L   +L LF+K+  + P+++ G  +++S+  A + GIT
Sbjct: 332 TEQRGSVIIETNYRIYAYTQSTLQIAVLALFTKLAMRFPDMVAGRISRQSIRQAIQFGIT 391

Query: 101 AEQ------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
           AEQ       +AH ++      +  P +P  V DQIRLW+ +  R++ T    +  F
Sbjct: 392 AEQIISYLSAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTSGFLFKSF 448


>gi|150863860|ref|XP_001382481.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
           II transcription factor B p52 subunit) (RNA polymerase
           II transcription factor B 52 kDa subunit)
           [Scheffersomyces stipitis CBS 6054]
 gi|149385116|gb|ABN64452.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
           II transcription factor B p52 subunit) (RNA polymerase
           II transcription factor B 52 kDa subunit)
           [Scheffersomyces stipitis CBS 6054]
          Length = 515

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/165 (26%), Positives = 80/165 (48%), Gaps = 44/165 (26%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           + +QG V++ETNF++YAY+ S L   IL LF  ++ +  N++ G  T+ES+ +A  NGIT
Sbjct: 314 TGNQGTVIIETNFKLYAYTNSPLEIAILNLFVHLKSRFTNMVCGQITRESIRSALYNGIT 373

Query: 101 AEQ------QNAHPRVA--------------------------------------DRIPS 116
           A+Q       +AHP++                                        ++  
Sbjct: 374 ADQIIKFLETHAHPQMKLLAKEKLDKKIEFDTSHNINTAGGAPQSKVSGEGNVAQHKLEI 433

Query: 117 IPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           +P  V DQI+LW+ +L+R++    + + +F ++  F+   +YA +
Sbjct: 434 LPPTVVDQIKLWQLELDRIQTFDGYLFKDFSNQQEFDTLSNYASE 478


>gi|223993063|ref|XP_002286215.1| transcription factor of TFIIH family [Thalassiosira pseudonana
           CCMP1335]
 gi|220977530|gb|EED95856.1| transcription factor of TFIIH family [Thalassiosira pseudonana
           CCMP1335]
          Length = 461

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 55/137 (40%), Positives = 74/137 (54%), Gaps = 16/137 (11%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIE--YQLPNLIVGATTKESLNNAFENG 98
           S S   VVV+TNF++ AY+ SKLH   L LF  +    +LPN+I    T++S+ +AF  G
Sbjct: 327 SRSHLAVVVQTNFQVVAYTKSKLHISTLGLFCDVSSFRRLPNVIFFHITRDSIRSAFRLG 386

Query: 99  ITAEQ------QNAHP--RVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
           +TA+Q       +AHP  R  DR P +P NV DQI LW+ + +RV M     +     RD
Sbjct: 387 VTADQILRFLHVHAHPMLRSGDR-PMVPSNVVDQILLWDRERHRVVMDEVCVHQ---CRD 442

Query: 151 V--FEAACDYARDRSGL 165
              F A   YA D   L
Sbjct: 443 AAEFTAVSQYASDVDAL 459


>gi|355693906|gb|AER99491.1| proteinral transcription factor IIH, polypeptide 4, 52kDa [Mustela
           putorius furo]
          Length = 143

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/112 (35%), Positives = 65/112 (58%), Gaps = 6/112 (5%)

Query: 61  SKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRI 114
           S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q        AHP +  + 
Sbjct: 2   SELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQT 61

Query: 115 PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           P +P  + DQIRLWE + +R+  T    Y++F S+  FE    +AR+   L+
Sbjct: 62  PVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVDFELLLAHARELGVLV 113


>gi|449017614|dbj|BAM81016.1| similar to TFIIH subunit TFB2 [Cyanidioschyzon merolae strain 10D]
          Length = 693

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/119 (36%), Positives = 67/119 (56%), Gaps = 8/119 (6%)

Query: 37  REYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFE 96
           R  +S S   ++VETNFR+YAY+ S     +L LF++I Y++P + +G  T++S+  A +
Sbjct: 528 RSILSGSPFQIIVETNFRLYAYAASSFQVALLSLFTRILYRMPGVAIGVITRDSVRRALK 587

Query: 97  NGITAEQ----QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF-PSRD 150
            GITA+Q       H        S+P NV DQI LWE +  R++  P    + F P+ D
Sbjct: 588 CGITAKQLLHFLGIHSMEGK---SVPFNVHDQILLWELERKRIQAYPGVLLEGFEPTSD 643


>gi|221502519|gb|EEE28246.1| tfiih, polypeptide, putative [Toxoplasma gondii VEG]
          Length = 836

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 39/105 (37%), Positives = 67/105 (63%), Gaps = 11/105 (10%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NF++Y Y+ S L   +L    +++ ++PNLIVG  T+ S+  A+++GITA+Q   
Sbjct: 650 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 709

Query: 104 ---QNAHPRVADR-----IPSIPENVCDQIRLWESDLNRVEMTPA 140
               +AHP V +R      P +PENV  Q+R+WE++  R+ + PA
Sbjct: 710 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 754


>gi|402083674|gb|EJT78692.1| RNA polymerase II transcription factor B subunit 2 [Gaeumannomyces
           graminis var. tritici R3-111a-1]
          Length = 496

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 77/135 (57%), Gaps = 12/135 (8%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           ++G +++ETN+R+YAY+++ L   IL LF+++ ++   ++ G   +ES+  A   GITA+
Sbjct: 333 AKGSIIIETNYRLYAYTSTPLQIAILGLFAELRFRFAGMVTGRLDRESIKRAISYGITAD 392

Query: 103 Q------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
           Q       +AH ++       + P +P  V DQIRLW+ +  R+++     + +F S+  
Sbjct: 393 QVIEYLAAHAHEQMHRTATLRKKPVLPPTVVDQIRLWQLETERMKIMRGFLFRDFDSQAE 452

Query: 152 FEAACDYARDRSGLL 166
           ++    YA D  G+L
Sbjct: 453 YDDLAKYA-DEIGVL 466


>gi|237842971|ref|XP_002370783.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
 gi|211968447|gb|EEB03643.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
          Length = 836

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 39/105 (37%), Positives = 67/105 (63%), Gaps = 11/105 (10%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NF++Y Y+ S L   +L    +++ ++PNLIVG  T+ S+  A+++GITA+Q   
Sbjct: 650 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 709

Query: 104 ---QNAHPRVADR-----IPSIPENVCDQIRLWESDLNRVEMTPA 140
               +AHP V +R      P +PENV  Q+R+WE++  R+ + PA
Sbjct: 710 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 754


>gi|322697444|gb|EFY89224.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Metarhizium acridum CQMa 102]
          Length = 541

 Score = 79.7 bits (195), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 46/147 (31%), Positives = 78/147 (53%), Gaps = 22/147 (14%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ +G V+VETN+R+YAY+ S L   +L LF K+  + P+++ G  T+ S++ A   GIT
Sbjct: 377 ADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPDMVAGRVTRTSIHQAINFGIT 436

Query: 101 AEQ------QNAHPR------VADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
           A+Q       +AH +      +A++ P +P  V DQIRLW+ +  R++ T    + +F  
Sbjct: 437 ADQIISYLAAHAHEQMHRTAALANK-PILPPTVVDQIRLWQLENERMKTTGGFLFRDFED 495

Query: 149 RDVFEAACDYAR---------DRSGLL 166
              F     +A          D+SG+ 
Sbjct: 496 HKEFLDTARFAEEIGVLVWRSDKSGMF 522


>gi|401887601|gb|EJT51582.1| hypothetical protein A1Q1_07170 [Trichosporon asahii var. asahii
           CBS 2479]
 gi|406699646|gb|EKD02845.1| hypothetical protein A1Q2_02789 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 451

 Score = 79.7 bits (195), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/139 (34%), Positives = 81/139 (58%), Gaps = 9/139 (6%)

Query: 35  IVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNA 94
           + R   ++ + F+++ETN+R+YAY++++L   IL LF  I  + PNLIVG   +  +  A
Sbjct: 276 VNRSATADDKRFLILETNYRVYAYTSNELEIAILNLFVNITIRYPNLIVGHLDRRHVKAA 335

Query: 95  FENGITAEQ------QNAHPRVADRIPSIPE-NVCDQIRLWESDLNRVEMTPAHYYDEFP 147
            + GI+A Q       +AHP++ +  P +   +V DQ+ LW+ + NRV+      + EF 
Sbjct: 336 MDKGISAYQIISYLTTHAHPQMYNHPPPLLHPSVTDQLHLWDRERNRVQAQDTVMF-EFY 394

Query: 148 SRDVFEAACDYARDRSGLL 166
           S+++FE   D A +R G L
Sbjct: 395 SKELFETCRDEA-ERMGAL 412


>gi|326435722|gb|EGD81292.1| hypothetical protein PTSG_11329 [Salpingoeca sp. ATCC 50818]
          Length = 545

 Score = 79.7 bits (195), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 44/132 (33%), Positives = 71/132 (53%), Gaps = 11/132 (8%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQ 104
           GF+V+ETNFR+YAY+ S L   +L+LF+K  Y+LP ++V + T+ S+  A   GITA Q 
Sbjct: 291 GFLVIETNFRIYAYTKSDLKISLLKLFAKPLYKLPTMLVASITRSSVRQAMAQGITARQI 350

Query: 105 NAHPRV--ADRI--------PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
               +V  + R+          +P  V DQI LWE + +R+  T    +   P + +F  
Sbjct: 351 LHFLKVHCSSRMRHQPHSVFSPVPPTVSDQILLWEQETSRLSKTEGVLFRHLP-KPIFPK 409

Query: 155 ACDYARDRSGLL 166
              + ++R   +
Sbjct: 410 LLAFVKERQACI 421


>gi|367030175|ref|XP_003664371.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
           42464]
 gi|347011641|gb|AEO59126.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
           42464]
          Length = 512

 Score = 79.7 bits (195), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 74/135 (54%), Gaps = 11/135 (8%)

Query: 38  EYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFEN 97
           E  + S+G V++ETN+R+YAY++S L   +L LF+++  +   +  G  T++S+  A   
Sbjct: 345 ETSAASKGSVIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMATGRLTRDSIRRAIGF 404

Query: 98  GITAEQ------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
           GITA+Q       +AH ++     A   P +P  V DQIRLW+ +  R+  +    + +F
Sbjct: 405 GITADQIISYLASHAHEQMVRAAAATGRPVLPPTVVDQIRLWQLENERMRTSAGFLFKDF 464

Query: 147 PSRDVFEAACDYARD 161
            S D + +   YA +
Sbjct: 465 DSLDEYLSLSAYAEE 479


>gi|254573074|ref|XP_002493646.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
           [Komagataella pastoris GS115]
 gi|238033445|emb|CAY71467.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
           [Komagataella pastoris GS115]
 gi|328354526|emb|CCA40923.1| RNA polymerase II transcription factor B subunit 2 [Komagataella
           pastoris CBS 7435]
          Length = 547

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 75/155 (48%), Gaps = 33/155 (21%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
            G +V+ETNF++Y Y+TS L   IL LF  +  +  N++ G  T+ES+  A  NGITA+Q
Sbjct: 362 HGSIVLETNFKLYCYTTSPLQIAILNLFVHLRTRFANMVTGMITRESVREALRNGITADQ 421

Query: 104 ------QNAHPRVAD--------------------------RIPSIPENVCDQIRLWESD 131
                  +AH ++                            ++  IP  V DQI+LW+ +
Sbjct: 422 IIKYLETHAHSQMKKLAEEKLLKKLEFDSNATETQQIGSQVKLEVIPPTVVDQIKLWQLE 481

Query: 132 LNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           ++R++      + +F +   FE   +YA D  G++
Sbjct: 482 MDRLQTFAGFLFKDFANAQEFEQLANYA-DEVGVM 515


>gi|346323394|gb|EGX92992.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Cordyceps militaris CM01]
          Length = 511

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 40/113 (35%), Positives = 66/113 (58%), Gaps = 11/113 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G VV+ETN+R+YAY+ S L   +L LF+++  + P+++ G  T+ S+  A + GITA+Q 
Sbjct: 351 GAVVIETNYRLYAYTQSTLQIAVLALFARLTMRFPDMVAGRLTRTSIRQAIQFGITADQI 410

Query: 104 -----QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
                 +AH ++         P +P  V DQIRLW+ +  R++ TP   + +F
Sbjct: 411 IAYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTPGFLFRDF 463


>gi|116201905|ref|XP_001226764.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
 gi|88177355|gb|EAQ84823.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
          Length = 503

 Score = 79.3 bits (194), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 45/140 (32%), Positives = 77/140 (55%), Gaps = 12/140 (8%)

Query: 38  EYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFEN 97
           E  + S+G V++ETN+R+YAY++S L   +L LF+++  +   +  G  T++S+  A   
Sbjct: 336 ETSAASKGSVIIETNYRLYAYTSSPLQIAVLGLFTQLNMRFAGMATGRLTRDSIRRAIGF 395

Query: 98  GITAEQ------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
           GITA+Q       +AH ++     A   P +P  V DQIRLW+ +  R+  +    + +F
Sbjct: 396 GITADQIISYLASHAHDQMVRAAAATGRPVLPPTVVDQIRLWQLENERMRTSAGFLFKDF 455

Query: 147 PSRDVFEAACDYARDRSGLL 166
            S + + +   YA D  G+L
Sbjct: 456 DSLEEYVSLSSYA-DEVGVL 474


>gi|71033737|ref|XP_766510.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68353467|gb|EAN34227.1| hypothetical protein, conserved [Theileria parva]
          Length = 359

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 39/135 (28%), Positives = 79/135 (58%), Gaps = 19/135 (14%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NF++Y Y+ S L   +L    +++ + PNL+VG  T+ES+ +AF++GIT+++   
Sbjct: 164 IIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGVLTRESVQSAFKSGITSKEIIR 223

Query: 104 ----QNAHPRVADR---------IPSIPENVCDQIRLWESDLNRVEMTPA---HYYDEFP 147
                  +  + ++           SIPENVC Q+++WES+ +R+E+ P+     +D+  
Sbjct: 224 FLSPNGMNSSIGNQENTLLNSTFTYSIPENVCRQLKMWESERDRIELCPSIVFKRWDQDF 283

Query: 148 SRDVFEAACDYARDR 162
             D+F+    +A+ +
Sbjct: 284 MPDLFQRTVRWAQSK 298


>gi|406604717|emb|CCH43852.1| RNA polymerase II transcription factor B subunit 2 [Wickerhamomyces
           ciferrii]
          Length = 488

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 45/152 (29%), Positives = 78/152 (51%), Gaps = 26/152 (17%)

Query: 40  ISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGI 99
           I NS G +++ETNF++YAY+ S L   IL LF  ++ +  NL+ G  T+ES+  A  +GI
Sbjct: 307 IENS-GTIIIETNFKVYAYTNSPLQIAILNLFVHLKARFSNLVTGQITRESIRKALVSGI 365

Query: 100 TAEQ------QNAHPRV-------------------ADRIPSIPENVCDQIRLWESDLNR 134
           T+EQ       +AHP++                    ++I  +   + DQI+LW+ +L+R
Sbjct: 366 TSEQIISYLESHAHPQLRRAAEEELNKKNGFESNNHGEKIQILQPTIADQIKLWQLELDR 425

Query: 135 VEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           +     + + +F S   ++    Y+ +   LL
Sbjct: 426 IMSFDGYLFTDFSSDQEYQVLSKYSEEIGVLL 457


>gi|221482097|gb|EEE20458.1| tfiih, polypeptide, putative [Toxoplasma gondii GT1]
          Length = 861

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Composition-based stats.
 Identities = 39/105 (37%), Positives = 67/105 (63%), Gaps = 11/105 (10%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NF++Y Y+ S L   +L    +++ ++PNLIVG  T+ S+  A+++GITA+Q   
Sbjct: 675 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 734

Query: 104 ---QNAHPRVADR-----IPSIPENVCDQIRLWESDLNRVEMTPA 140
               +AHP V +R      P +PENV  Q+R+WE++  R+ + PA
Sbjct: 735 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 779


>gi|367040709|ref|XP_003650735.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
 gi|346997996|gb|AEO64399.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
          Length = 513

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 40/130 (30%), Positives = 73/130 (56%), Gaps = 11/130 (8%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           ++G +++ETN+R+YAY++S L   +L LF+++  +   ++ G  T++S+  A   GITA+
Sbjct: 351 TKGSLIIETNYRLYAYTSSPLQIAVLALFTQLNMRFAGMVTGRLTRDSIRRAISFGITAD 410

Query: 103 Q------QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDV 151
           Q       +AH ++     A   P +P  V DQIRLW+ +  R+  T    + +F S + 
Sbjct: 411 QIISYLASHAHEQMTRAAAAAGRPVLPPTVVDQIRLWQLENERMRTTAGFLFKDFDSPEE 470

Query: 152 FEAACDYARD 161
           + +   YA +
Sbjct: 471 YVSLSGYAEE 480


>gi|401410644|ref|XP_003884770.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
 gi|325119188|emb|CBZ54742.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
          Length = 868

 Score = 79.0 bits (193), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 39/105 (37%), Positives = 67/105 (63%), Gaps = 11/105 (10%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NF++Y Y+ S L   +L    +++ ++PNLIVG  T+ S+  A+++GITA+Q   
Sbjct: 682 MLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQIIR 741

Query: 104 ---QNAHPRVADR-----IPSIPENVCDQIRLWESDLNRVEMTPA 140
               +AHP V +R      P +PENV  Q+R+WE++  R+ + PA
Sbjct: 742 FLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 786


>gi|321265552|ref|XP_003197492.1| subunit of TFIIH and nucleotide excision repair factor 3 complexes
           [Cryptococcus gattii WM276]
 gi|317463972|gb|ADV25705.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes,
           putative [Cryptococcus gattii WM276]
          Length = 481

 Score = 78.6 bits (192), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 78/132 (59%), Gaps = 8/132 (6%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ + F+++ETN+++YAY++++L   IL LF  I  + PNL+VG   ++ +  A E GI+
Sbjct: 316 ADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGIS 375

Query: 101 AEQ------QNAHPRVADRIPSIPE-NVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           A Q       +AHP++ +  P +    + DQ+ LW+ + NR++      Y EF S+++F+
Sbjct: 376 ARQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLHLWDRERNRLQTEETVMY-EFFSKELFD 434

Query: 154 AACDYARDRSGL 165
              + A+  + L
Sbjct: 435 DTVNEAKANAAL 446


>gi|400597688|gb|EJP65418.1| RNA polymerase II transcription factor B subunit 2 [Beauveria
           bassiana ARSEF 2860]
          Length = 507

 Score = 78.6 bits (192), Expect = 9e-13,   Method: Composition-based stats.
 Identities = 45/134 (33%), Positives = 76/134 (56%), Gaps = 14/134 (10%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G +V+ETN+R+YAY+ S L   +L LFS++  + P+++ G  T+ S+  A + GITA+Q 
Sbjct: 347 GSIVIETNYRLYAYTQSTLQIAVLALFSRLTMRFPDMVAGRITRTSIRQAIQFGITADQI 406

Query: 104 -----QNAHPR------VADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF 152
                 +AH +      +A R P +P  V DQIRLW+ +  R++ T    + +F  +  +
Sbjct: 407 IAYLAAHAHEQMHRSAALAAR-PVLPPTVVDQIRLWQLENERMKTTGGFLFRDFDDQKEY 465

Query: 153 EAACDYARDRSGLL 166
           +    +A D  G+L
Sbjct: 466 QDTRRFA-DEIGVL 478


>gi|241958748|ref|XP_002422093.1| TFIIH subunit, putative; tanscription initiation factor IIB,
           putative [Candida dubliniensis CD36]
 gi|223645438|emb|CAX40094.1| TFIIH subunit, putative [Candida dubliniensis CD36]
          Length = 494

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 44/166 (26%), Positives = 80/166 (48%), Gaps = 42/166 (25%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           S+  +++ETNF++YAY+ S L   IL LF   + +  N++ G  T+ES+ NA  NGITA+
Sbjct: 300 SKESIIIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCGQITRESIRNALYNGITAD 359

Query: 103 Q------QNAHPRV------------------------------------ADRIPSIPEN 120
           Q       +AHP++                                      ++  +P N
Sbjct: 360 QIIKFLETHAHPQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGMISQHKLEILPPN 419

Query: 121 VCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           V DQI+LW+ +L+R++    + + +F ++  ++   +YA +   L+
Sbjct: 420 VVDQIKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLI 465


>gi|405123868|gb|AFR98631.1| transcription factor TFIIH complex subunit Tfb2 [Cryptococcus
           neoformans var. grubii H99]
          Length = 481

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 78/132 (59%), Gaps = 8/132 (6%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ + F+++ETN+++YAY++++L   IL LF  I  + PNL+VG   ++ +  A E GI+
Sbjct: 316 ADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGIS 375

Query: 101 AEQ------QNAHPRVADRIPSIPE-NVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           A Q       +AHP++ +  P +    + DQ+ LW+ + NR++      Y EF S+++F+
Sbjct: 376 AGQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLHLWDRERNRLQTEETVMY-EFFSKELFD 434

Query: 154 AACDYARDRSGL 165
              + A+  + L
Sbjct: 435 DTVNEAKANAAL 446


>gi|68490740|ref|XP_710812.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
 gi|68490763|ref|XP_710801.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
 gi|46432046|gb|EAK91553.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
 gi|46432058|gb|EAK91564.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
 gi|238879997|gb|EEQ43635.1| TFIIH basal transcription factor complex subunit 2 [Candida
           albicans WO-1]
          Length = 494

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 43/162 (26%), Positives = 78/162 (48%), Gaps = 42/162 (25%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           +++ETNF++YAY+ S L   IL LF   + +  N++ G  T+ES+ NA  NGITA+Q   
Sbjct: 304 IIIETNFKIYAYTKSPLEIAILNLFVHFKTRFANMVCGQITRESIRNALYNGITADQIIK 363

Query: 104 ---QNAHPRV------------------------------------ADRIPSIPENVCDQ 124
               +AHP++                                      ++  +P NV DQ
Sbjct: 364 FLETHAHPQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGMVSQHKLEILPPNVVDQ 423

Query: 125 IRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           I+LW+ +L+R++    + + +F ++  ++   +YA +   L+
Sbjct: 424 IKLWQLELDRIQTFDGYLFKDFSNQQEYDILSNYASELGVLI 465


>gi|66358470|ref|XP_626413.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
           II]
 gi|46227864|gb|EAK88784.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
           II]
          Length = 709

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 45/135 (33%), Positives = 76/135 (56%), Gaps = 15/135 (11%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NFR+Y Y+ S L  +ILR   +++ + PN+I G  T++ L +A+  G++AEQ   
Sbjct: 528 IIVQSNFRIYCYTASPLQAKILRHLCQVKVRGPNIICGILTRKGLLSAYSMGVSAEQILR 587

Query: 104 ---QNAHPRVADRI-----PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR----DV 151
               NAHP +  R        IP NV  Q++LWE D NR++++ A  + ++ +      +
Sbjct: 588 FFSSNAHPIILRRFMLEGTSIIPVNVETQLKLWEKDKNRLKISHASTFSDWGASPNDIQL 647

Query: 152 FEAACDYARDRSGLL 166
           F     YAR +  LL
Sbjct: 648 FRQTILYARSKDILL 662


>gi|134118870|ref|XP_771938.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50254542|gb|EAL17291.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 481

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 78/132 (59%), Gaps = 8/132 (6%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ + F+++ETN+++YAY++++L   IL LF  I  + PNL+VG   ++ +  A E GI+
Sbjct: 316 ADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGIS 375

Query: 101 AEQ------QNAHPRVADRIPSIPE-NVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           A Q       +AHP++ +  P +    + DQ+ LW+ + NR++      Y EF S+++F+
Sbjct: 376 ARQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLYLWDRERNRLQTEETVMY-EFFSKELFD 434

Query: 154 AACDYARDRSGL 165
              + A+  + L
Sbjct: 435 DTVNEAKANAAL 446


>gi|58262306|ref|XP_568563.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|57230737|gb|AAW47046.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 481

 Score = 77.8 bits (190), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 78/132 (59%), Gaps = 8/132 (6%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ + F+++ETN+++YAY++++L   IL LF  I  + PNL+VG   ++ +  A E GI+
Sbjct: 316 ADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGIS 375

Query: 101 AEQ------QNAHPRVADRIPSIPE-NVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
           A Q       +AHP++ +  P +    + DQ+ LW+ + NR++      Y EF S+++F+
Sbjct: 376 ARQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLYLWDRERNRLQTEETVMY-EFFSKELFD 434

Query: 154 AACDYARDRSGL 165
              + A+  + L
Sbjct: 435 DTVNEAKANAAL 446


>gi|422294120|gb|EKU21420.1| transcription initiation factor TFIIH subunit 4, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 213

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/101 (40%), Positives = 62/101 (61%), Gaps = 7/101 (6%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           SQ  V+VETN+++ AY++S LH E+LR+F+ +  +LPN+++G  T+ S+  A  +GITA 
Sbjct: 80  SQLAVIVETNYQLVAYTSSSLHVEMLRIFTDVRCRLPNVVIGFITRASVRRAMASGITAA 139

Query: 103 Q----QNAHPRVADRIPS---IPENVCDQIRLWESDLNRVE 136
                   H  VA R      +PENV  QI LW  + +RV+
Sbjct: 140 TILSFLKTHTHVAVRAGKGRLLPENVEAQIELWHQERSRVK 180


>gi|344229873|gb|EGV61758.1| transcription factor Tfb2 [Candida tenuis ATCC 10573]
          Length = 498

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 77/156 (49%), Gaps = 38/156 (24%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           Q  +++ETNF++Y+Y+ S L   IL LF  +  +  N++ G  T+ES+ NA  NGITA Q
Sbjct: 311 QESIIIETNFKVYSYTQSPLEIAILNLFVHLRTRFANMVTGQITRESIRNALYNGITAGQ 370

Query: 104 ------QNAHPRVA--------------------------------DRIPSIPENVCDQI 125
                  +AHP++                                  ++  +P NV DQI
Sbjct: 371 VIKFLETHAHPQMKMLAQEKLDKKIEFDASNNINTATGKSTDAPMQHKLEILPPNVVDQI 430

Query: 126 RLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARD 161
           +LW+ +L+R++    + + EF ++  +E+  +YA +
Sbjct: 431 KLWQLELDRIQTFDGYLFREFNNQIDYESLRNYASE 466


>gi|84998132|ref|XP_953787.1| TFIIH subunit, Tf2b  [Theileria annulata]
 gi|65304784|emb|CAI73109.1| TFIIH subunit, Tf2b homologue, putative [Theileria annulata]
          Length = 645

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 40/135 (29%), Positives = 77/135 (57%), Gaps = 19/135 (14%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NF++Y Y+ S L   +L    +++ + PNL+VG  T+ES+ +AF++GIT+++   
Sbjct: 451 IIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGVLTRESVQSAFKSGITSKEIIR 510

Query: 104 -------------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPA---HYYDEFP 147
                        Q      +    SIPENVC Q+++WES+ +R+E+ P+     +D+  
Sbjct: 511 FLSPNGMNSSIGSQENTLLNSSFTYSIPENVCRQLKMWESERDRIELCPSIVFKRWDQDF 570

Query: 148 SRDVFEAACDYARDR 162
             D+F+    +A+ +
Sbjct: 571 MPDLFQRTVRWAQSK 585


>gi|361130709|gb|EHL02459.1| putative RNA polymerase II transcription factor B subunit 2 [Glarea
           lozoyensis 74030]
          Length = 210

 Score = 77.0 bits (188), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 73/138 (52%), Gaps = 19/138 (13%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S + GF+++ETN+       S L   +L LF+K+  + PN++ G  T+ES+  A  +GI+
Sbjct: 52  SGTAGFIIIETNYH------SPLQIAVLALFTKLTTRYPNMVTGRVTRESVYRAVSSGIS 105

Query: 101 AEQ------QNAHPRVADRI------PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
           ++Q       +AHP +          P +P  V DQIRLW+ +  R++ T    + +F S
Sbjct: 106 SDQIITYLSTHAHPELLKASAAKGGGPVLPPTVVDQIRLWQIENERMKTTYGFLFKDFES 165

Query: 149 RDVFEAACDYARDRSGLL 166
            D ++    YA D  G+L
Sbjct: 166 TDEYKKLLKYA-DEIGVL 182


>gi|255721741|ref|XP_002545805.1| TFIIH basal transcription factor complex subunit 2 [Candida
           tropicalis MYA-3404]
 gi|240136294|gb|EER35847.1| TFIIH basal transcription factor complex subunit 2 [Candida
           tropicalis MYA-3404]
          Length = 494

 Score = 77.0 bits (188), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 42/162 (25%), Positives = 79/162 (48%), Gaps = 42/162 (25%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           +++ETNF++YAY+ S L   IL LF +++ +  N++ G  T+ES+ NA  NGITA+Q   
Sbjct: 304 IIIETNFKLYAYTKSPLEIAILNLFVQLKTRFANMVCGQITRESIRNALYNGITADQIIK 363

Query: 104 ---QNAHPRV------------------------------------ADRIPSIPENVCDQ 124
               +AH ++                                      ++  +P NV DQ
Sbjct: 364 FLETHAHSQMRILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGSISQHKLEILPPNVVDQ 423

Query: 125 IRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           I+LW+ +L+R++    + + +F ++  ++   +YA +   L+
Sbjct: 424 IKLWQLELDRIQTFDGYLFKDFANQQEYDILSNYASELGVLI 465


>gi|358385986|gb|EHK23582.1| hypothetical protein TRIVIDRAFT_36912 [Trichoderma virens Gv29-8]
          Length = 492

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 64/113 (56%), Gaps = 11/113 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G VV+ETN+R+YAY+ S L   +L LF+K+  + P+++ G  ++ S+  A   GITA+Q 
Sbjct: 332 GSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQI 391

Query: 104 -----QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
                 +AH ++         P +P  V DQIRLW+ +  R++ T    + +F
Sbjct: 392 ISYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDF 444


>gi|392574410|gb|EIW67546.1| hypothetical protein TREMEDRAFT_33390 [Tremella mesenterica DSM
           1558]
          Length = 504

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/132 (33%), Positives = 74/132 (56%), Gaps = 8/132 (6%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           + + F+++ETN+++YAY+ ++L   IL LF  I+ Q  NL+VG   +  +  A E GI+A
Sbjct: 341 DDKKFLILETNYKIYAYTNNELEIAILNLFMDIKVQYRNLVVGKLDRAHVKAAMEKGISA 400

Query: 102 E------QQNAHPRVADRIPSIPE-NVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
                  Q +AHP++    P I    + DQ+ LW+ + NR+    +  + EF S+D++E 
Sbjct: 401 YQIIAYLQSHAHPQMYSSPPPILHTTIVDQLHLWDKERNRLRTEESEMF-EFFSKDLYED 459

Query: 155 ACDYARDRSGLL 166
               A+   GLL
Sbjct: 460 TEAEAKRYDGLL 471


>gi|340518979|gb|EGR49219.1| trancription factor tfb2-like protein [Trichoderma reesei QM6a]
          Length = 480

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 64/113 (56%), Gaps = 11/113 (9%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G VV+ETN+R+YAY+ S L   +L LF+K+  + P+++ G  ++ S+  A   GITA+Q 
Sbjct: 320 GSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQI 379

Query: 104 -----QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
                 +AH ++         P +P  V DQIRLW+ +  R++ T    + +F
Sbjct: 380 ISYLSAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDF 432


>gi|310799661|gb|EFQ34554.1| transcription factor tfb2 [Glomerella graminicola M1.001]
          Length = 485

 Score = 75.9 bits (185), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 42/125 (33%), Positives = 67/125 (53%), Gaps = 11/125 (8%)

Query: 46  FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ-- 103
           F + ETN+R+YAY  + L   +L LF K++ +  +++ G  T+ S+ NA E GITA+Q  
Sbjct: 325 FRLSETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITADQII 384

Query: 104 ----QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEA 154
                +AH ++     A   P +P  V DQIRLW+ +  R+  T    + +F S   ++ 
Sbjct: 385 SYLAAHAHEQMHRMAAARSRPILPPTVIDQIRLWQLETERMTTTSGFLFRDFDSPREYDD 444

Query: 155 ACDYA 159
              YA
Sbjct: 445 IAGYA 449


>gi|403221352|dbj|BAM39485.1| transcription factor IIH subunit [Theileria orientalis strain
           Shintoku]
          Length = 634

 Score = 75.9 bits (185), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 65/110 (59%), Gaps = 16/110 (14%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NF++Y Y+ S L   +L    +++ + PNL+VG  T+ES+ +AF++GIT+ +   
Sbjct: 421 IIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGVLTRESVQSAFKSGITSNEIIR 480

Query: 104 -----------QNAHPRVADRIPS--IPENVCDQIRLWESDLNRVEMTPA 140
                              D   +  IPENVC Q+++WES+ +R+E++PA
Sbjct: 481 FLSPMKLSLSYAGTCNSFMDSFANYKIPENVCRQLKMWESERDRIELSPA 530


>gi|156086630|ref|XP_001610724.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797977|gb|EDO07156.1| conserved hypothetical protein [Babesia bovis]
          Length = 586

 Score = 75.1 bits (183), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 37/109 (33%), Positives = 65/109 (59%), Gaps = 9/109 (8%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
            N    +VV++NF++YAY  S L   +L    +++ + PNL++G  T+ SL  AF++GIT
Sbjct: 390 GNKGSKMVVQSNFKIYAYIPSALQMNVLNHICELQAKTPNLVIGVLTRSSLQTAFKSGIT 449

Query: 101 AEQ---------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
           A+Q         Q    ++   + ++PENV  Q+++WE++ NR+E+  A
Sbjct: 450 ADQLICFFESKGQYDDIQINRNVMNVPENVRRQLKMWEAERNRLELLNA 498


>gi|209880858|ref|XP_002141868.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209557474|gb|EEA07519.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 794

 Score = 75.1 bits (183), Expect = 9e-12,   Method: Composition-based stats.
 Identities = 43/135 (31%), Positives = 74/135 (54%), Gaps = 15/135 (11%)

Query: 40  ISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGI 99
           ISN +  ++V++NFR+Y Y+ S L  +ILR   +++ + PN+I G  T+  L +A+  G+
Sbjct: 599 ISNLEAGIIVQSNFRVYCYTASPLQAKILRHLCQVKVRGPNIICGVLTRRGLLSAYSMGV 658

Query: 100 TAEQ------QNAHP-----RVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 148
            A Q       NAHP      + D    IP +V  Q++LWE+D NR+E+     + ++ +
Sbjct: 659 KAYQILRFFISNAHPIILKKHITDGTSIIPISVETQLKLWENDHNRLEINKVSLFSDWGN 718

Query: 149 R----DVFEAACDYA 159
                ++F+    YA
Sbjct: 719 NKEDIELFKQTVTYA 733


>gi|358394641|gb|EHK44034.1| hypothetical protein TRIATDRAFT_222970 [Trichoderma atroviride IMI
           206040]
          Length = 491

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 63/111 (56%), Gaps = 11/111 (9%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           VV+ETN+R+YAY+ S L   +L LF+K+  + P+++ G  ++ S+  A   GITA+Q   
Sbjct: 333 VVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQAINFGITADQIIS 392

Query: 104 ---QNAHPRV-----ADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
               +AH ++         P +P  V DQIRLW+ +  R++ T    + +F
Sbjct: 393 YLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFLFRDF 443


>gi|308460920|ref|XP_003092758.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
 gi|308252517|gb|EFO96469.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
          Length = 128

 Score = 74.7 bits (182), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 54/93 (58%), Gaps = 9/93 (9%)

Query: 83  VGATTKESLNNAFENGITAEQ------QNAHPRV---ADRIPSIPENVCDQIRLWESDLN 133
           VG  T+ES+  A ++GITA Q       NAHP+    +  I  +P  V DQIRLWE +  
Sbjct: 3   VGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERR 62

Query: 134 RVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           R+++  A+ Y  F S D F+  CDYA++R  LL
Sbjct: 63  RMDLKDAYIYSHFESEDEFQGVCDYAQERGILL 95


>gi|398406290|ref|XP_003854611.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
 gi|339474494|gb|EGP89587.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
          Length = 493

 Score = 74.3 bits (181), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/130 (34%), Positives = 71/130 (54%), Gaps = 11/130 (8%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           SN +G++++ETN+R+YAY++S L   IL LFS +  + PNLI    TK S+ +A    IT
Sbjct: 317 SNEKGYIILETNYRLYAYTSSPLQISILSLFSTLHTRYPNLITAKITKSSIASAISQSIT 376

Query: 101 AEQ------QNAHPRVADR-----IPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 149
           ++Q       +AHP +  +      P +P  V DQIRLW+ +  R+      Y  +    
Sbjct: 377 SDQIITYLSTHAHPVLRRQAAIMSAPILPPTVVDQIRLWQIEGERMTTWKGFYIRDVGGW 436

Query: 150 DVFEAACDYA 159
           + +  A +YA
Sbjct: 437 EEYVKAVEYA 446


>gi|322711210|gb|EFZ02784.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           (Tfb2), putative [Metarhizium anisopliae ARSEF 23]
          Length = 540

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 78/151 (51%), Gaps = 26/151 (17%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++ +G V+VETN+R+YAY+ S L   +L LF K+  + P+++ G  T+ S++ A   GI+
Sbjct: 372 ADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPDMVAGRVTRTSIHQAINFGIS 431

Query: 101 AEQ------QNAHPR------VADRIPSIPENVCDQIRLWESDLN----RVEMTPAHYYD 144
           A+Q       +AH +      +A++ P +P  V DQIRLW+ +      R++ T    + 
Sbjct: 432 ADQIISYLAAHAHEQMHRSAALANK-PILPPTVVDQIRLWQLEWQLENERMKTTGGFLFR 490

Query: 145 EFPSRDVFEAACDYAR---------DRSGLL 166
           +F     F     +A          D+SG+ 
Sbjct: 491 DFEDHKEFLDTARFAEEIGVLVWRSDKSGMF 521


>gi|392594381|gb|EIW83705.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
          Length = 393

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 31/63 (49%), Positives = 46/63 (73%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           +N  GF+V+ETN+R+YAY+ + L   +L LF  ++Y+ PNL+VG  T+ES+  A  NGI+
Sbjct: 324 TNDHGFIVLETNYRVYAYTDNPLQIAVLNLFITLKYRFPNLVVGMLTRESVRRALGNGIS 383

Query: 101 AEQ 103
           AEQ
Sbjct: 384 AEQ 386


>gi|387595518|gb|EIJ93142.1| hypothetical protein NEPG_02098 [Nematocida parisii ERTm1]
          Length = 398

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 41/118 (34%), Positives = 66/118 (55%), Gaps = 4/118 (3%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           ++ F+V+E+NFR+Y YS   L   I+ LFS    + PN+IV    ++S+  A  +GITA 
Sbjct: 248 AEKFLVLESNFRLYIYSNRPLDTFIISLFSIKSREFPNMIVAMINEDSIRQALMHGITAG 307

Query: 103 QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF---EAACD 157
           Q   +    + +  I ENV +QIRLWE  +NR+    ++ +  F +   F   E+ C+
Sbjct: 308 QIRVYLN-QNSMYEINENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCE 364


>gi|221061889|ref|XP_002262514.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
 gi|193811664|emb|CAQ42392.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
           knowlesi strain H]
          Length = 943

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 78/147 (53%), Gaps = 17/147 (11%)

Query: 6   IARLPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHY 65
           + +   + +L + A+ Q + M+ N  L           +  ++V++NF++Y Y+ S L  
Sbjct: 733 MDKTKLTHELASLALSQEKKMNGNNNL-----------EMGLIVQSNFKVYLYTNSTLKI 781

Query: 66  EILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ------QNAHPRVADRIPSIPE 119
            IL    +++ + PN++VG  T+ S+ NA+ + ITA Q        AHP  ++   SIP 
Sbjct: 782 NILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIKFLESYAHPGRSNFKSSIPV 841

Query: 120 NVCDQIRLWESDLNRVEMTPAHYYDEF 146
           NV  Q++LWES+ +R+ +  A  +  F
Sbjct: 842 NVITQLKLWESERHRLTLDDAIVFKSF 868


>gi|47212202|emb|CAF90416.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 471

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 69/128 (53%), Gaps = 9/128 (7%)

Query: 22  QWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEIL--RLFSKIE--YQ 77
           Q+ L+DT +QLWY   +Y+  +Q + V   +  +  +      + ++  R  S  E  Y+
Sbjct: 224 QFLLLDTASQLWYFTLQYLKTAQDYSVEGMSDSLLTFLQHLREFGLVFQRKASGREMLYR 283

Query: 78  LPNLIVGATTKESLNNAFENGITAEQQN-----AHPRVADRIPSIPENVCDQIRLWESDL 132
            PN++V   T+ES+  A  NGITA+Q+      + P    + P +P  + DQIRLWE + 
Sbjct: 284 FPNVVVAQVTRESVQQAIANGITAQQRLTFDLWSSPTPLSQTPVLPPTITDQIRLWELER 343

Query: 133 NRVEMTPA 140
           +R++ T A
Sbjct: 344 DRLQFTEA 351


>gi|399949686|gb|AFP65344.1| tfIIB-like protein [Chroomonas mesostigmatica CCMP1168]
          Length = 464

 Score = 70.9 bits (172), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 59/100 (59%), Gaps = 10/100 (10%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE---- 102
           +++E+NFR+Y Y  + L  ++L +FS+I YQLPNL VG  T++S+  AF +G T+     
Sbjct: 329 IIIESNFRIYVYKNAHLGNDLLLVFSEILYQLPNLFVGEITEKSILKAFNSGSTSRNIIG 388

Query: 103 --QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPA 140
             + N HP      P IP  V +QI+ WE     +++T +
Sbjct: 389 FLKNNLHPIC----PWIPSAVTNQIKAWEFQKTEIKITES 424


>gi|194388768|dbj|BAG60352.1| unnamed protein product [Homo sapiens]
          Length = 386

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 51/76 (67%), Gaps = 6/76 (7%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           GF+VVETN+R+YAY+ S+L   ++ LFS++ Y+ PN++V   T+ES+  A  +GITA+Q 
Sbjct: 311 GFIVVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQI 370

Query: 104 -----QNAHPRVADRI 114
                  AHP +  ++
Sbjct: 371 IHFLRTRAHPVMLKQV 386


>gi|389586531|dbj|GAB69260.1| hypothetical protein PCYB_146880 [Plasmodium cynomolgi strain B]
          Length = 913

 Score = 70.5 bits (171), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 63/106 (59%), Gaps = 6/106 (5%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NF++Y Y+ S L   IL    +++ + PN++VG  T+ S+ NA+ + ITA Q   
Sbjct: 733 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 792

Query: 104 ---QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
                AHP  ++   SIP NV  Q++LWES+ +R+ +  A  +  F
Sbjct: 793 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 838


>gi|387592902|gb|EIJ87926.1| hypothetical protein NEQG_01998 [Nematocida parisii ERTm3]
          Length = 398

 Score = 70.1 bits (170), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 40/118 (33%), Positives = 66/118 (55%), Gaps = 4/118 (3%)

Query: 43  SQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE 102
           ++ F+V+E+NFR+Y YS   L   I+ LFS    + PN++V    ++S+  A  +GITA 
Sbjct: 248 AEKFLVLESNFRLYIYSNRPLDTFIISLFSIKSREFPNMMVAMINEDSIRQALMHGITAG 307

Query: 103 QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF---EAACD 157
           Q   +    + +  I ENV +QIRLWE  +NR+    ++ +  F +   F   E+ C+
Sbjct: 308 QIRVYLN-QNSMYEINENVIEQIRLWEKRMNRIHSWESYIFSNFLNYKDFLLVESYCE 364


>gi|323455511|gb|EGB11379.1| hypothetical protein AURANDRAFT_52511 [Aureococcus anophagefferens]
          Length = 291

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 37/103 (35%), Positives = 59/103 (57%), Gaps = 13/103 (12%)

Query: 47  VVVETNFRMYAYS-----TSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           V+V+TNF++ AY+     TS L    L LF+++  +LPNL+VG  +++++    + GI  
Sbjct: 126 VIVQTNFQVLAYTDAGVNTSTLVLATLNLFAELTTRLPNLVVGTISRDAIKRCVDRGIRV 185

Query: 102 EQ------QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMT 138
            Q       +AHP  A +   +P+NV DQ+ LW  + NRV  T
Sbjct: 186 PQIVKFLRAHAHP--AMKASGVPQNVTDQMALWAGEGNRVAFT 226


>gi|156095867|ref|XP_001613968.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148802842|gb|EDL44241.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 960

 Score = 69.7 bits (169), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 37/106 (34%), Positives = 62/106 (58%), Gaps = 6/106 (5%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NF++Y Y+ S L   IL    +++ + PN++VG  T+ S+ NA+ + ITA Q   
Sbjct: 780 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 839

Query: 104 ---QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
                AHP  +    SIP NV  Q++LWES+ +R+ +  A  +  F
Sbjct: 840 FLESYAHPGRSTFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 885


>gi|60603004|gb|AAX27959.1| unknown [Schistosoma japonicum]
          Length = 100

 Score = 69.3 bits (168), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 34/74 (45%), Positives = 47/74 (63%), Gaps = 6/74 (8%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           S+  G++++ETNFR+YAY+ S L   +L LFSKI  + PNL+V   T++S+  A   GIT
Sbjct: 10  SSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRGIT 69

Query: 101 AEQ------QNAHP 108
           A Q       NAHP
Sbjct: 70  ANQILSFLTSNAHP 83


>gi|378754852|gb|EHY64880.1| hypothetical protein NERG_01936 [Nematocida sp. 1 ERTm2]
          Length = 396

 Score = 69.3 bits (168), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 40/117 (34%), Positives = 63/117 (53%), Gaps = 4/117 (3%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           + F+ +E+NFR+Y YS   L   I+ LFS    + PN+IV    ++S+  A   GITA Q
Sbjct: 247 EKFLTLESNFRLYIYSNRPLDVFIISLFSIKIREFPNMIVAMINEDSIRQALTYGITAGQ 306

Query: 104 QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVF---EAACD 157
              +    + +  I ENV +QIRLWE  +NR+    ++ +  F +   F   E+ C+
Sbjct: 307 IRVYLN-QNSMHKINENVLEQIRLWEKRMNRIHAWESYIFSNFLNYKDFLLVESYCE 362


>gi|124806778|ref|XP_001350829.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
 gi|23496958|gb|AAN36509.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
          Length = 945

 Score = 68.9 bits (167), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 36/106 (33%), Positives = 63/106 (59%), Gaps = 6/106 (5%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           ++V++NF++Y Y++S L   IL    +++ + PN++VG  T+ S+ NA+ + ITA Q   
Sbjct: 766 LIVQSNFKVYLYTSSLLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 825

Query: 104 ---QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
                +HP   +   SIP NV  Q++LWES+ +R+ +  A  +  F
Sbjct: 826 FLESYSHPGKNNFKSSIPMNVITQLKLWESERHRLTLEDAIVFKSF 871


>gi|238570605|ref|XP_002386883.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
 gi|215440101|gb|EEB87813.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
          Length = 69

 Score = 68.9 bits (167), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 32/65 (49%), Positives = 46/65 (70%), Gaps = 5/65 (7%)

Query: 44  QGFVVVETNFRMYAYSTS-----KLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENG 98
           QGF+V+ETN+R+YAY+ S      L   IL LF  ++Y+ PNL+VG+ T+ES+  A  NG
Sbjct: 5   QGFIVLETNYRVYAYTGSFSLHNPLQTAILHLFVSLKYRFPNLVVGSITRESVKKALSNG 64

Query: 99  ITAEQ 103
           I+A+Q
Sbjct: 65  ISADQ 69


>gi|336270172|ref|XP_003349845.1| hypothetical protein SMAC_00733 [Sordaria macrospora k-hell]
 gi|380095234|emb|CCC06707.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 496

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 68/132 (51%), Gaps = 18/132 (13%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
             S+G +++ETN+R+YAY+TS L   +L LF ++  + PN++  A        A   GIT
Sbjct: 339 GTSKG-IIIETNYRIYAYTTSPLQIAVLALFCQLNMRFPNMVSAAL------RAIGFGIT 391

Query: 101 AEQ------QNAHPRVADRI-----PSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 149
           A+Q       +AH ++           +P  V DQIRLW+ +  R++ T    + +F S 
Sbjct: 392 ADQIISYLHAHAHQQMVREATVTGKTVLPPTVVDQIRLWQLENERMQTTAGFLFKDFEST 451

Query: 150 DVFEAACDYARD 161
           + + A   YA +
Sbjct: 452 EEYAALSRYAEE 463


>gi|399219100|emb|CCF75987.1| unnamed protein product [Babesia microti strain RI]
          Length = 612

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 61/106 (57%), Gaps = 10/106 (9%)

Query: 42  NSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITA 101
           N +  ++V++NF++Y Y+ S L   +L    +++ + PNL++G  T+ S + AF  GITA
Sbjct: 427 NLESGIIVQSNFKVYVYTASPLQINVLAHLCELQSRTPNLVIGVLTRASAHAAFRAGITA 486

Query: 102 EQ------QNAHPRVADRIPS----IPENVCDQIRLWESDLNRVEM 137
           +Q       ++HP +   +      +P NV  Q+ +WE++ NR+ +
Sbjct: 487 KQICQFLETHSHPILLQNVREGGSYLPNNVVTQLNMWEAERNRISL 532


>gi|164660182|ref|XP_001731214.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
 gi|159105114|gb|EDP44000.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
          Length = 231

 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 50/73 (68%), Gaps = 6/73 (8%)

Query: 44  QGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           QG++++ETN+R+YAY+ + L   IL LF  ++ +LPNL++G  T+ S+ +A   GITA+Q
Sbjct: 147 QGYLILETNYRVYAYTANLLRIAILNLFVTLKSRLPNLVIGQLTRHSVKSALNKGITADQ 206

Query: 104 ------QNAHPRV 110
                  +AHP++
Sbjct: 207 IITYLTHHAHPQM 219


>gi|429854513|gb|ELA29524.1| tfiih and nucleotide excision repair factor 3 complexes subunit
           [Colletotrichum gloeosporioides Nara gc5]
          Length = 509

 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 56/99 (56%), Gaps = 11/99 (11%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETN+R+YAY  + L   +L LF K++ +  +++ G  T+ S+ NA E GITA+Q 
Sbjct: 371 GSIIVETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITADQI 430

Query: 104 -----QNAHPRV-----ADRIPSIPENVCDQIRLWESDL 132
                 +AH ++         P +P  V DQIRL   D+
Sbjct: 431 ISYLASHAHEQMHRFAAVRNKPVLPPTVIDQIRLEYEDI 469


>gi|432098035|gb|ELK27922.1| General transcription factor IIH subunit 4 [Myotis davidii]
          Length = 377

 Score = 66.6 bits (161), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 35/113 (30%), Positives = 59/113 (52%), Gaps = 28/113 (24%)

Query: 41  SNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGIT 100
           ++  GF+VVETN+++YAY+ S+L   ++ LFS++ Y+ PN++V  T              
Sbjct: 250 AHQPGFIVVETNYQLYAYTESELQIALIALFSEMLYRFPNMVVAQT-------------- 295

Query: 101 AEQQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFE 153
                         P +P  + DQIRLWE + +R+  T    Y++F S+  F+
Sbjct: 296 --------------PVLPPTITDQIRLWELERDRLLFTEGVLYNQFLSQVDFD 334


>gi|412992644|emb|CCO18624.1| predicted protein [Bathycoccus prasinos]
          Length = 464

 Score = 66.6 bits (161), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 29/59 (49%), Positives = 45/59 (76%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           G ++VETNFR+YAY+ S+L  E+LRLF++ +Y+LPN  VG  T++S+ +A   GI+ +Q
Sbjct: 401 GHIIVETNFRVYAYTHSELETEVLRLFTRPDYKLPNAYVGMITRDSILDAMRAGISPDQ 459


>gi|403348380|gb|EJY73625.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
           [Oxytricha trifallax]
          Length = 532

 Score = 66.2 bits (160), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 31/64 (48%), Positives = 47/64 (73%), Gaps = 1/64 (1%)

Query: 41  SNSQG-FVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGI 99
           S SQ  F++VETNF++YAY++S L+  +LR F ++E   PNL+VG  T++SL  AF+ GI
Sbjct: 317 SQSQDKFLIVETNFKVYAYTSSDLYRALLRQFIRVECIFPNLVVGTLTRKSLQKAFQRGI 376

Query: 100 TAEQ 103
           ++ Q
Sbjct: 377 SSGQ 380


>gi|365758016|gb|EHM99881.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 419

 Score = 65.1 bits (157), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 48/72 (66%), Gaps = 6/72 (8%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ- 103
           G ++VETNF++Y+YS S L   +L LF  ++ +  N+++G  T+ES+ NA  NGITA+Q 
Sbjct: 337 GSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRNALTNGITADQI 396

Query: 104 -----QNAHPRV 110
                 +AHP++
Sbjct: 397 IAYLETHAHPQM 408


>gi|70950974|ref|XP_744765.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56524852|emb|CAH80923.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 922

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 62/106 (58%), Gaps = 6/106 (5%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           +++++NF++Y Y++S L   IL    +++ + PN++VG  T+ S+ NA+ + ITA+Q   
Sbjct: 742 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 801

Query: 104 ---QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
                +HP        IP NV  Q++LWE++ +R+ +  +  +  F
Sbjct: 802 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 847


>gi|82704497|ref|XP_726580.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23482045|gb|EAA18145.1| Drosophila melanogaster LD30622p-related [Plasmodium yoelii yoelii]
          Length = 926

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 32/106 (30%), Positives = 62/106 (58%), Gaps = 6/106 (5%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           +++++NF++Y Y++S L   IL    +++ + PN++VG  T+ S+ NA+ + ITA+Q   
Sbjct: 746 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 805

Query: 104 ---QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
                +HP        IP NV  Q++LWE++ +R+ +  +  +  F
Sbjct: 806 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 851


>gi|428181353|gb|EKX50217.1| TFB2 transcription factor B2 nucleotide excision repair [Guillardia
           theta CCMP2712]
          Length = 463

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 35/94 (37%), Positives = 51/94 (54%), Gaps = 2/94 (2%)

Query: 67  ILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAHPRVADRIPSIPENVCDQIR 126
           ILRLF++  Y+ P++I+   T+ES+ NA    I   + +AHP+  +  P +PE V DQI 
Sbjct: 324 ILRLFTRPIYKFPHMIIAVITRESIRNAL--IIEYLRMHAHPQCLENWPIVPEVVTDQIC 381

Query: 127 LWESDLNRVEMTPAHYYDEFPSRDVFEAACDYAR 160
            WE +  R+   PA  Y  F S +   A    AR
Sbjct: 382 FWEQERCRIRAEPAVAYHNFFSGEAHRACEKEAR 415


>gi|68070707|ref|XP_677265.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56497310|emb|CAH94094.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 929

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Composition-based stats.
 Identities = 32/106 (30%), Positives = 62/106 (58%), Gaps = 6/106 (5%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ--- 103
           +++++NF++Y Y++S L   IL    +++ + PN++VG  T+ S+ NA+ + ITA+Q   
Sbjct: 749 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 808

Query: 104 ---QNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEF 146
                +HP        IP NV  Q++LWE++ +R+ +  +  +  F
Sbjct: 809 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLILEDSIVFKNF 854


>gi|67483069|ref|XP_656826.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56474050|gb|EAL51440.1| hypothetical protein EHI_192400 [Entamoeba histolytica HM-1:IMSS]
 gi|449706224|gb|EMD46115.1| tfiih basal transcription factor complex p52 subunit, putative
           [Entamoeba histolytica KU27]
          Length = 484

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 57/108 (52%), Gaps = 8/108 (7%)

Query: 48  VVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAH 107
           + E N+ +Y Y+ S+   ++ RLF +  +QL NL VG    +S+  AF  GIT+E     
Sbjct: 290 ITEINYSVYIYTESQFQVDLYRLFIRKNFQLANLWVGKLNHQSVTEAFAKGITSEM---- 345

Query: 108 PRVADRIPSIPENVCDQIRLWESDLNRVE---MTPAHYYDEFPSRDVF 152
             +    P++P  +  QI LW+ ++NR +   +    +YD+   R +F
Sbjct: 346 -LINFLQPNLPRTIQKQIDLWKKEINRFKEEHVVRYRFYDDAIGRQLF 392


>gi|269861689|ref|XP_002650538.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
 gi|220065990|gb|EED43520.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
          Length = 291

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/137 (25%), Positives = 68/137 (49%), Gaps = 10/137 (7%)

Query: 34  YIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNN 93
           YI+ +  +  Q F+ +ETNF++Y+Y+ +     IL LFS I +++P +I     ++ + N
Sbjct: 144 YILYKNNTEIQKFIYLETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMN 203

Query: 94  AFENGITAEQQNAHPRVADRIPSIPENVCDQ----IRLWESDLNRVEMTPAHYYDEFPSR 149
             + GI  +Q      + + I     N CDQ    I +W+   NR+     + Y +F + 
Sbjct: 204 TLDRGIKIQQ------IVEYIRRHTINNCDQILHMIEIWDKQRNRINKQIGYLYSDFTNY 257

Query: 150 DVFEAACDYARDRSGLL 166
           + + +  +  +  + LL
Sbjct: 258 NEYRSVLEQIKTDTDLL 274


>gi|167395228|ref|XP_001741280.1| TFIIH basal transcription factor complex p52 subunit [Entamoeba
           dispar SAW760]
 gi|165894177|gb|EDR22239.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Entamoeba dispar SAW760]
          Length = 489

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 57/108 (52%), Gaps = 8/108 (7%)

Query: 48  VVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAH 107
           + E N+ +Y Y+ S+   ++ RLF +  +QL NL VG    +S+  AF  GIT+E     
Sbjct: 290 ITEINYSVYIYTESQFQVDLYRLFIRKNFQLANLWVGKLNHQSVTEAFAKGITSEM---- 345

Query: 108 PRVADRIPSIPENVCDQIRLWESDLNRVE---MTPAHYYDEFPSRDVF 152
             +    P++P  +  QI LW+ ++NR +   +    +YD+   R +F
Sbjct: 346 -LINFLQPNLPRTIQKQIDLWKKEINRFKEEHVVRYKFYDDAIGRQLF 392


>gi|160331881|ref|XP_001712647.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
 gi|159766096|gb|ABW98322.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
          Length = 432

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 56/97 (57%), Gaps = 10/97 (10%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE---- 102
           +++E+N+R+Y Y  +  + ++  +FS+I Y LPN  VG  T+ S++ A ++GIT +    
Sbjct: 300 IIIESNYRIYVYQKNSNNNQLFLIFSEILYILPNFFVGEITETSISRALKSGITIQNILG 359

Query: 103 --QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEM 137
             ++N H        SIP  + +Q RLWE    ++++
Sbjct: 360 FIRENLHCVCR----SIPSTILNQFRLWEFQKKKIKI 392


>gi|302831319|ref|XP_002947225.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
           nagariensis]
 gi|300267632|gb|EFJ51815.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
           nagariensis]
          Length = 90

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 36/60 (60%)

Query: 107 HPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           HP +A R P +PE V DQIRLWE+ +NR+       Y+   SR++FE A  ++R    LL
Sbjct: 16  HPSIASRTPVVPEVVSDQIRLWEASMNRLRADSVVLYENLASRELFERALAFSRSSGTLL 75


>gi|269860316|ref|XP_002649880.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
 gi|220066720|gb|EED44193.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
           H348]
          Length = 369

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 35/137 (25%), Positives = 68/137 (49%), Gaps = 10/137 (7%)

Query: 34  YIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNN 93
           YI+ +  +  Q F+ +ETNF++Y+Y+ +     IL LFS I +++P +I     ++ + N
Sbjct: 222 YILYKNNTEIQKFIYLETNFKLYSYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMN 281

Query: 94  AFENGITAEQQNAHPRVADRIPSIPENVCDQ----IRLWESDLNRVEMTPAHYYDEFPSR 149
             + GI  +Q      + + I     N CDQ    I +W+   NR+     + Y +F + 
Sbjct: 282 TLDRGIKIQQ------IVEYIRRHTINNCDQILHMIEIWDKQRNRINKQIGYLYSDFTNY 335

Query: 150 DVFEAACDYARDRSGLL 166
           + + +  +  +  + LL
Sbjct: 336 NEYRSVLEQIKTDTDLL 352


>gi|296084771|emb|CBI25914.3| unnamed protein product [Vitis vinifera]
          Length = 225

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 40/93 (43%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           MPS I  RLP+ +DLEAYA+GQWE                                    
Sbjct: 94  MPSNITVRLPSLDDLEAYALGQWECFLLQLISSTQTEKLTNFSSSMMKVFQRGLLTQREK 153

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LMDTNAQLWYI+REYISNS+
Sbjct: 154 EAPRLTESGFQFLLMDTNAQLWYIMREYISNSE 186


>gi|359497414|ref|XP_003635508.1| PREDICTED: general transcription factor IIH subunit 4-like, partial
           [Vitis vinifera]
          Length = 209

 Score = 57.8 bits (138), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 40/93 (43%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           MPS I  RLP+ +DLEAYA+GQWE                                    
Sbjct: 94  MPSNITVRLPSLDDLEAYALGQWECFLLQLISSTQTEKLTNFSSSMMKVFQRGLLTQREK 153

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LMDTNAQLWYI+REYISNS+
Sbjct: 154 EAPRLTESGFQFLLMDTNAQLWYIMREYISNSE 186


>gi|440295536|gb|ELP88449.1| TFIIH basal transcription factor complex p52 subunit, putative
           [Entamoeba invadens IP1]
          Length = 531

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 32/89 (35%), Positives = 50/89 (56%), Gaps = 5/89 (5%)

Query: 48  VVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNAH 107
           + ETN+ +Y Y+ S    ++  LF +   QL NL VG  T +++  AF  GIT E     
Sbjct: 291 ITETNYTVYIYTESFFQVKLYSLFIRQNLQLTNLCVGRITYDTVTEAFLKGITNEM---- 346

Query: 108 PRVADRIPSIPENVCDQIRLWESDLNRVE 136
             V    P++P+N+  QI LW+ +LNR++
Sbjct: 347 -LVNFLQPNLPKNIQAQIDLWKRELNRLK 374


>gi|449521313|ref|XP_004167674.1| PREDICTED: RNA polymerase II transcription factor B subunit 2-like,
           partial [Cucumis sativus]
          Length = 296

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 33/93 (35%), Positives = 39/93 (41%), Gaps = 49/93 (52%)

Query: 1   MPSGIIARLPTSEDLEAYAIGQWE------------------------------------ 24
           MPS I  RLP+ EDLEAYA+ QWE                                    
Sbjct: 125 MPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLSQRDK 184

Query: 25  -------------LMDTNAQLWYIVREYISNSQ 44
                        LM+TNAQLWYI+REYISN++
Sbjct: 185 EAPRLTESGFQFLLMETNAQLWYIIREYISNAE 217


>gi|167535774|ref|XP_001749560.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771952|gb|EDQ85611.1| predicted protein [Monosiga brevicollis MX1]
          Length = 417

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 36/59 (61%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
           GF++ ETNF +YAY+ S +   +L  F+ + Y+LP +     T+ S+  A   GITA+Q
Sbjct: 292 GFLLAETNFHVYAYTDSAVKVALLSKFATLTYRLPIMTTAIITRTSVRRALSQGITADQ 350


>gi|330038479|ref|XP_003239608.1| TATA binding protein of transcription factor IIB-like protein
           [Cryptomonas paramecium]
 gi|327206532|gb|AEA38710.1| TATA binding protein of transcription factor IIB-like protein
           [Cryptomonas paramecium]
          Length = 467

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 30/87 (34%), Positives = 47/87 (54%), Gaps = 6/87 (6%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQQNA 106
           +++E+N+R+YAY+ +    +IL  F  + Y LP L VG   K S++ A   GI A  +N 
Sbjct: 319 MIIESNYRVYAYNNNTFLNKILIQFCDLIYNLPGLFVGEINKTSIHKAIHKGINA--KNI 376

Query: 107 HPRVADRIPSIPENVC----DQIRLWE 129
              +      I +N C    +QIR+WE
Sbjct: 377 ISFILKNSHYIHQNSCNPIINQIRIWE 403


>gi|340508252|gb|EGR33999.1| hypothetical protein IMG5_027880 [Ichthyophthirius multifiliis]
          Length = 454

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 10/66 (15%)

Query: 47  VVVETNFRMYAY----------STSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFE 96
           ++VETNF +YAY          S SK   ++L+ FSKI Y  P+LIV   T+  +  AF 
Sbjct: 306 IIVETNFNLYAYLDYNPQNKTFSESKYIRDLLKKFSKIHYTFPHLIVAQLTEAKMKQAFN 365

Query: 97  NGITAE 102
            GIT++
Sbjct: 366 QGITSK 371


>gi|380476902|emb|CCF44454.1| RNA polymerase II transcription factor B subunit 2 [Colletotrichum
           higginsianum]
          Length = 126

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 43/90 (47%), Gaps = 11/90 (12%)

Query: 81  LIVGATTKESLNNAFENGITAEQ----------QNAHPRVADRI-PSIPENVCDQIRLWE 129
           ++ G  T+ S+ NA E GITA+Q          +  H   A R  P +P  V DQIRLW+
Sbjct: 1   MVSGRLTRNSIRNAVERGITADQIISYLAAHAHEQMHRMAAVRSRPVLPPTVVDQIRLWQ 60

Query: 130 SDLNRVEMTPAHYYDEFPSRDVFEAACDYA 159
            +  R+  T    + +F S   +E    YA
Sbjct: 61  LETERMTTTSGFLFRDFDSPKEYEVIAGYA 90


>gi|294950275|ref|XP_002786548.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
 gi|239900840|gb|EER18344.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
          Length = 580

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/141 (27%), Positives = 62/141 (43%), Gaps = 40/141 (28%)

Query: 40  ISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQL-------PN---LIVGATTKE 89
           I+N    + V++NF + AY+TS L   +L  F +++ QL       PN    ++G  T+ 
Sbjct: 388 ITNKSCRLFVDSNFAVTAYTTSSLDLRLLGTFVQLQRQLGDGREYDPNDFGCVLGTLTQS 447

Query: 90  SLNNAFENGITAE------QQNAHPRVADR----------------------IPSIPENV 121
           S+ +A + G+T+E      + +  PR A                        I  IP NV
Sbjct: 448 SVQSAAQRGVTSEYIISYLKSHVDPRAAHMGSQGGRSSAATANTGAARGEKFIDGIPANV 507

Query: 122 CDQIRLWESDL--NRVEMTPA 140
             QI LWE +   NR+ + P 
Sbjct: 508 VTQITLWEREAIHNRLRIDPG 528


>gi|123488965|ref|XP_001325283.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121908180|gb|EAY13060.1| hypothetical protein TVAG_212420 [Trichomonas vaginalis G3]
          Length = 460

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/115 (24%), Positives = 55/115 (47%), Gaps = 4/115 (3%)

Query: 28  TNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATT 87
           T + L +  +  I  ++G+++++TNF++ A+  S LH  +L+ F+ + Y+ P       +
Sbjct: 277 TKSILNFFGKSNIFQTEGWMLIDTNFKITAFPKSPLHTALLKKFANVTYEFPGFASAFIS 336

Query: 88  KESLNNAFENGITAEQQNAHPR--VADRIPS--IPENVCDQIRLWESDLNRVEMT 138
             S   A   G T +      +  ++ +I S  IP  V  Q  +W     R+ +T
Sbjct: 337 PNSFREALNQGTTLDDIIGFLKSNLSHKIGSGQIPSAVMKQFYVWRDQRERLTVT 391


>gi|401419872|ref|XP_003874425.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322490661|emb|CBZ25923.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 559

 Score = 45.1 bits (105), Expect = 0.011,   Method: Composition-based stats.
 Identities = 38/144 (26%), Positives = 56/144 (38%), Gaps = 55/144 (38%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLP-NLIVGATTKESLNNAFENGITAEQ-- 103
           ++ ETNFR+YAY+       IL  F+++E  +  NL     T++S   A   GITA Q  
Sbjct: 359 IITETNFRLYAYTDDADLLNILNQFAELEEIVNGNLHCYRVTRDSFAAAMRKGITAAQVL 418

Query: 104 ----QNAHPRVADR-----------IPS-------------------------------- 116
                 AHP +  R            PS                                
Sbjct: 419 RFLSLRAHPSMLRRHGEREESGDTAAPSSAGGVSSSLTFSAPHRRGAVPGTCGGTTAIDA 478

Query: 117 -----IPENVCDQIRLWESDLNRV 135
                +P++ CDQ+R+WES+ +RV
Sbjct: 479 VSTIVVPQSFCDQMRMWESECHRV 502


>gi|397564488|gb|EJK44232.1| hypothetical protein THAOC_37249 [Thalassiosira oceanica]
          Length = 971

 Score = 44.7 bits (104), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 7/68 (10%)

Query: 77  QLPNLIVGATTKESLNNAFENGITAE------QQNAHPRV-ADRIPSIPENVCDQIRLWE 129
           +LPN++    T++S+ +AF  G+TA+      Q +AHP + +   P  P NV DQI LW+
Sbjct: 434 RLPNVVFFHLTRDSIKSAFRLGVTADQILRFLQVHAHPMMRSGNQPMPPANVRDQILLWD 493

Query: 130 SDLNRVEM 137
            +  RV M
Sbjct: 494 RERRRVVM 501


>gi|162605818|ref|XP_001713424.1| hypothetical protein GTHECHR3089 [Guillardia theta]
 gi|13794356|gb|AAK39733.1|AF083031_90 hypothetical protein [Guillardia theta]
          Length = 180

 Score = 44.7 bits (104), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 27/110 (24%), Positives = 50/110 (45%), Gaps = 15/110 (13%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAE---- 102
           +++E NFR+Y +          +  SK  Y LP   VG   +      F+ G+T++    
Sbjct: 45  IIIEVNFRIYIFKYFDFSNSFFKFISKKIYSLPYFYVGEINENYSRRLFQLGVTSDNILK 104

Query: 103 --QQNAHPRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRD 150
             ++N H    ++IPS  E   D++R+WE     +     +++  F  R+
Sbjct: 105 FIKKNLH-YTCNKIPSTFE---DRLRIWE-----ISFKKKYFFGGFLMRN 145


>gi|154345556|ref|XP_001568715.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134066057|emb|CAM43844.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 561

 Score = 42.7 bits (99), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 53/143 (37%), Gaps = 54/143 (37%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQL-PNLIVGATTKESLNNAFENGITAEQ-- 103
           ++ ETNFR+YAY+       IL  F+++E  +  NL     T++S   A   GITA Q  
Sbjct: 362 IITETNFRLYAYTEDSDLLNILHQFAELEEVVNGNLHCYRVTRDSFAAAMRKGITATQVL 421

Query: 104 ----QNAHPRVADR------------------IPS------------------------- 116
                 AHP +  R                   PS                         
Sbjct: 422 RFLSLRAHPSMLRRYGEKEENGDTKSPSPAGGAPSSTFSAPPRRAAVAGTRGSTTFIDAV 481

Query: 117 ----IPENVCDQIRLWESDLNRV 135
               +P + CDQ+R+WE +  RV
Sbjct: 482 TTLVVPHSFCDQLRMWERECRRV 504


>gi|146103275|ref|XP_001469523.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|134073893|emb|CAM72632.1| conserved hypothetical protein [Leishmania infantum JPCM5]
          Length = 557

 Score = 42.4 bits (98), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 36/144 (25%), Positives = 53/144 (36%), Gaps = 55/144 (38%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQL-PNLIVGATTKESLNNAFENGITAEQ-- 103
           ++ ETNFR+YAY+       IL  F+K+E  +  NL     T++S  +A   GITA Q  
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416

Query: 104 ----QNAHPRVADR---------------------------------------------- 113
                 AHP +  R                                              
Sbjct: 417 RFLSLRAHPSMLRRHGEREENGDTTSPSSAGGASSSFTFSASPRRRAVAGTCGSTTSIDA 476

Query: 114 --IPSIPENVCDQIRLWESDLNRV 135
                +P++ CDQ+ +WES+  RV
Sbjct: 477 VSTLVVPQSFCDQMHMWESECRRV 500


>gi|398024082|ref|XP_003865202.1| hypothetical protein, conserved [Leishmania donovani]
 gi|322503439|emb|CBZ38524.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 557

 Score = 42.4 bits (98), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 36/144 (25%), Positives = 53/144 (36%), Gaps = 55/144 (38%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQL-PNLIVGATTKESLNNAFENGITAEQ-- 103
           ++ ETNFR+YAY+       IL  F+K+E  +  NL     T++S  +A   GITA Q  
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416

Query: 104 ----QNAHPRVADR---------------------------------------------- 113
                 AHP +  R                                              
Sbjct: 417 RFLSLRAHPSMLRRHGEREENGDTTSPSSAGGASSSFTFSASPRRRAVAGTCGSTTSIDA 476

Query: 114 --IPSIPENVCDQIRLWESDLNRV 135
                +P++ CDQ+ +WES+  RV
Sbjct: 477 VSTLVVPQSFCDQMHMWESECRRV 500


>gi|340057132|emb|CCC51474.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 502

 Score = 42.4 bits (98), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 51/127 (40%), Gaps = 40/127 (31%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVG-ATTKESLNNAFENGITAE--- 102
           ++ ETNFR+YAY+ +K    IL  F+  E ++  +IV    T+ S   A   GI A+   
Sbjct: 318 IITETNFRLYAYTRNKHLLAILEQFAIKEAEVDGMIVCFRVTRASFAAALRKGIGAQHIV 377

Query: 103 --------------QQNAHPR--------------------VADRIPSIPENVCDQIRLW 128
                         Q    PR                     AD+I  IP + CDQ+  W
Sbjct: 378 QFLRVKAHSSMLKHQLTCDPRDSAGLAATSSRVTPTDTPWTHADKI--IPRSFCDQLFTW 435

Query: 129 ESDLNRV 135
           E +  R+
Sbjct: 436 ERECRRL 442


>gi|342184190|emb|CCC93671.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 499

 Score = 41.2 bits (95), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 52/124 (41%), Gaps = 35/124 (28%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLPNLIVG-ATTKESLNNAFENGITAEQ-- 103
           ++ ETNFR+YAY+ +     IL  F+  E  +  +IV    T+ S   A   GI A+   
Sbjct: 317 IITETNFRLYAYTRNPDLLGILDQFAVKEIDIDGIIVCYRVTRSSFALALRKGIDAKHIL 376

Query: 104 ----QNAHPR-----------VADRI----------PS-------IPENVCDQIRLWESD 131
                 AHP            V+D +          PS       IP++ CDQ+  WE +
Sbjct: 377 QFLTLKAHPSMVRKDGGASRDVSDNLALNSVAGFGKPSEVHQSTVIPQSFCDQLMTWERE 436

Query: 132 LNRV 135
             R+
Sbjct: 437 CRRL 440


>gi|157876614|ref|XP_001686652.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68129727|emb|CAJ09033.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 626

 Score = 40.8 bits (94), Expect = 0.20,   Method: Composition-based stats.
 Identities = 36/144 (25%), Positives = 53/144 (36%), Gaps = 55/144 (38%)

Query: 47  VVVETNFRMYAYSTSKLHYEILRLFSKIEYQLP-NLIVGATTKESLNNAFENGITAEQ-- 103
           ++ ETNFR++AY+       IL  F+++E  +  NL     T+ S   A   GITA Q  
Sbjct: 426 IITETNFRLFAYTNDADLLNILNQFAELEEIVNGNLHCYRVTRASFAAAMRKGITAAQVL 485

Query: 104 ----QNAHPRVADR---------------------------IPS---------------- 116
                 AHP +  R                            P                 
Sbjct: 486 RFLSLRAHPSMLRRHGEREENGDTASPSSAGGASSSFTFSASPRRGAVAGTCGSTTSIDA 545

Query: 117 -----IPENVCDQIRLWESDLNRV 135
                +P++ CDQ+R+WES+  RV
Sbjct: 546 VSTLVVPQSFCDQMRMWESECRRV 569


>gi|47194594|emb|CAF93785.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 123

 Score = 40.4 bits (93), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 30/53 (56%)

Query: 108 PRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYAR 160
           P    + P +P  + DQIRLWE + +R++ T    Y++F S+  FE   D A+
Sbjct: 15  PTPLTQTPVLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQTDFEVLRDRAQ 67


>gi|146161819|ref|XP_001033510.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila]
 gi|146142827|gb|EAR85847.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila
           SB210]
          Length = 484

 Score = 39.3 bits (90), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 37/72 (51%), Gaps = 17/72 (23%)

Query: 47  VVVETNFRMYAY-----------STSKL-----HYE-ILRLFSKIEYQLPNLIVGATTKE 89
           + +ETNF +YAY           S   L     H E +L  F  + Y+ P+LIVG  +++
Sbjct: 322 ITIETNFNVYAYIDQEDPKEIKDSKDTLKDQTEHIEKLLASFCDLHYKFPHLIVGQLSEQ 381

Query: 90  SLNNAFENGITA 101
              + F+NG+TA
Sbjct: 382 KTKDQFKNGLTA 393


>gi|150863777|ref|XP_001382371.2| hypothetical protein PICST_55332 [Scheffersomyces stipitis CBS
           6054]
 gi|149385033|gb|ABN64342.2| predicted protein, partial [Scheffersomyces stipitis CBS 6054]
          Length = 430

 Score = 38.1 bits (87), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 31/65 (47%)

Query: 4   GIIARLPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKL 63
           G +   P S+ L+  +   W    T+  L++  ++ I  + GFV V  N  +Y Y  SKL
Sbjct: 160 GSLTATPISDSLDVESANGWGFTTTSGSLFHNPKQPIVKNYGFVDVNENLDLYKYQVSKL 219

Query: 64  HYEIL 68
              +L
Sbjct: 220 GVTVL 224


>gi|238055347|sp|A3LNJ3.3|CLP1_PICST RecName: Full=Protein CLP1
          Length = 453

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/65 (29%), Positives = 31/65 (47%)

Query: 4   GIIARLPTSEDLEAYAIGQWELMDTNAQLWYIVREYISNSQGFVVVETNFRMYAYSTSKL 63
           G +   P S+ L+  +   W    T+  L++  ++ I  + GFV V  N  +Y Y  SKL
Sbjct: 183 GSLTATPISDSLDVESANGWGFTTTSGSLFHNPKQPIVKNYGFVDVNENLDLYKYQVSKL 242

Query: 64  HYEIL 68
              +L
Sbjct: 243 GVTVL 247


>gi|145509120|ref|XP_001440504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124407721|emb|CAK73107.1| unnamed protein product [Paramecium tetraurelia]
          Length = 400

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 11/102 (10%)

Query: 13  EDLEAYAIGQWE--LMDTNAQLWYIVREYISNSQGF---VVVETNFRMYAYSTSKLHYE- 66
           +DL+ + +  ++   M      W  + E  S S G    ++VE NFR+YAY  S    E 
Sbjct: 233 KDLQEFGLTHYQDQQMRITFLFWNFLYEPPSVSIGIQCNIIVEANFRIYAYLNSGDQQEE 292

Query: 67  -----ILRLFSKIEYQLPNLIVGATTKESLNNAFENGITAEQ 103
                +L LFS+I+ +   LI+   ++ S+  A    + A+Q
Sbjct: 293 EILCNLLNLFSEIKKRFKILIIADLSESSIRKAVRENLQAKQ 334


>gi|197107449|pdb|3DOM|A Chain A, Crystal Structure Of The Complex Between Tfb5 And The
           C-Terminal Domain Of Tfb2
 gi|197107451|pdb|3DOM|C Chain C, Crystal Structure Of The Complex Between Tfb5 And The
           C-Terminal Domain Of Tfb2
          Length = 108

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 29/59 (49%)

Query: 108 PRVADRIPSIPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRDVFEAACDYARDRSGLL 166
           P   + +  +P  V DQIRLW+ +L+RV       Y +F +   +     YA+D   LL
Sbjct: 19  PNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGSLYSDFETSQEYNLLSKYAQDIGVLL 77


>gi|324515605|gb|ADY46257.1| General transcription factor IIH subunit 4 [Ascaris suum]
          Length = 245

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 15/33 (45%), Positives = 25/33 (75%)

Query: 45  GFVVVETNFRMYAYSTSKLHYEILRLFSKIEYQ 77
           G+++VETN+R+YAY+ S L   IL  F+++ Y+
Sbjct: 211 GYIIVETNYRVYAYTDSSLQLAILSTFTEMLYR 243


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.134    0.400 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,662,976,144
Number of Sequences: 23463169
Number of extensions: 104098706
Number of successful extensions: 223180
Number of sequences better than 100.0: 387
Number of HSP's better than 100.0 without gapping: 372
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 222161
Number of HSP's gapped (non-prelim): 695
length of query: 166
length of database: 8,064,228,071
effective HSP length: 127
effective length of query: 39
effective length of database: 9,379,372,904
effective search space: 365795543256
effective search space used: 365795543256
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 71 (32.0 bits)