BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015409
(407 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|147854406|emb|CAN81292.1| hypothetical protein VITISV_005315 [Vitis vinifera]
Length = 451
Score = 712 bits (1839), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 334/407 (82%), Positives = 375/407 (92%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP VKIIAKNFMDMVASL A LD LY N FICEAILRSLPPLAKKY++QM YID + A
Sbjct: 1 MPQVKIIAKNFMDMVASLPAMKLDALYHNTFICEAILRSLPPLAKKYILQMLYIDVPVTA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
K MEEWVL DGF+KHRVAIDRL+QLR+F+E ++KKET+YRLN TFQ+NL+KHLIYGG L
Sbjct: 61 KAMEEWVLADGFSKHRVAIDRLIQLRVFTETSDRKKETSYRLNPTFQTNLQKHLIYGGVL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREPMPS IT RLP+L+DLEAYA+GQWECFLLQLISSTQ E+ TNFSSS+MKVFQRGLL+
Sbjct: 121 PREPMPSNITVRLPSLDDLEAYALGQWECFLLQLISSTQTEKLTNFSSSLMKVFQRGLLT 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+R+KEAPRLTESGFQFLLMDTNAQLWYI+REYISNS+ERG++ ADLISFLLELSFHV GE
Sbjct: 181 QREKEAPRLTESGFQFLLMDTNAQLWYIMREYISNSEERGVDPADLISFLLELSFHVTGE 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AYN+NTL+E Q++ IKD DLGLVKLQQGRKE+WFIPTKLATNLSMSL+D+S+RK+GF+V
Sbjct: 241 AYNINTLTEFQRNTIKDLVDLGLVKLQQGRKESWFIPTKLATNLSMSLSDTSSRKQGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFR+YAYS+SKLHCEILRLFS+VEYQLPNLIVGAITKESLYNAFENGITAEQIISFL
Sbjct: 301 VETNFRLYAYSSSKLHCEILRLFSRVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
QQNAHPRVA+R P+VPENV DQIRLWE+DLNRVE P+H YDEFPSR
Sbjct: 361 QQNAHPRVAERTPAVPENVTDQIRLWETDLNRVETMPSHLYDEFPSR 407
>gi|255556021|ref|XP_002519045.1| tfiih, polypeptide, putative [Ricinus communis]
gi|223541708|gb|EEF43256.1| tfiih, polypeptide, putative [Ricinus communis]
Length = 451
Score = 696 bits (1797), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 325/407 (79%), Positives = 371/407 (91%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V+IIAKNFMDMVASL A LD+LY NPFIC+AILRSLPPLAKKYV+Q+ YIDG +
Sbjct: 1 MPQVRIIAKNFMDMVASLPAMKLDKLYTNPFICQAILRSLPPLAKKYVVQILYIDGPVAD 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
K +EEW+LPDG +KHRVAIDRL QLR+F+E ++KKE +Y+LN TFQ+NL+KHLI GG L
Sbjct: 61 KTLEEWLLPDGSSKHRVAIDRLTQLRIFTEIVDRKKEISYKLNPTFQTNLQKHLIDGGVL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
P EP+ S I RLPTLE+L+ YA+GQWECFLL LI+S AER TNFSSSMMK+FQRGLL+
Sbjct: 121 PGEPLASNIAVRLPTLEELDTYALGQWECFLLHLINSGHAERSTNFSSSMMKIFQRGLLT 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+RDKEAPRLTESGFQFLLMDTNAQLWYI+REYISNS+ERG++ ADLISFLLELSFH+ GE
Sbjct: 181 QRDKEAPRLTESGFQFLLMDTNAQLWYIIREYISNSEERGLDSADLISFLLELSFHITGE 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AYN+ L+E Q++MIKD ADLGLVKLQQGRKE+WFIPTKLATNLSMSLTDSS+RK+GF+V
Sbjct: 241 AYNMIMLTEFQRNMIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLTDSSSRKQGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFRMYAYSTSKLHCEI+RLFS+VEYQLPNL+VGA+TKESLY+AFENGITAEQIISFL
Sbjct: 301 VETNFRMYAYSTSKLHCEIMRLFSRVEYQLPNLVVGAMTKESLYSAFENGITAEQIISFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
QQNAHPRVA+R+PSVPENV DQIRLWESD+NRVEMTPAH YDEFPSR
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDMNRVEMTPAHLYDEFPSR 407
>gi|449433857|ref|XP_004134713.1| PREDICTED: general transcription factor IIH subunit 4-like [Cucumis
sativus]
Length = 451
Score = 695 bits (1794), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/409 (79%), Positives = 376/409 (91%), Gaps = 2/409 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP VKIIAKNFMDMVASL A LD+LYEN FICEAILRSLPPLAKK+V+QM YIDG + A
Sbjct: 1 MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDGPVSA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
K MEEWVLPDG +K++VA+DRL+QLR+F E ++K+ETTYRLN TFQ+NL+K LI+G L
Sbjct: 61 KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
REPMPS IT RLP+LEDLEAYA+ QWECFLLQLI+S QAE+P+N SSS+MKVFQ+GLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+RDKEAPRLTESGFQFLLM+TNAQLWYI+REYISN++ERG++ ADLISFLLELSFHV GE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AY+++TLS+ Q+ IKD ADLGLVKLQQGRKE+WFIPTKLATNLSMSL DSS+RK GF+V
Sbjct: 241 AYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLADSSSRKLGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFRMYAYSTSKLHCEILRLFS++EYQLPNLIVGAITKESLYNAF+NGITAEQI++FL
Sbjct: 301 VETNFRMYAYSTSKLHCEILRLFSRIEYQLPNLIVGAITKESLYNAFKNGITAEQIVTFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRVI 407
QQNAHPRVA+R+PSVPENV DQIRLWESDLNRV++TPAH+YDEFPSR +
Sbjct: 361 QQNAHPRVAERIPSVPENVTDQIRLWESDLNRVDITPAHFYDEFPSREV 409
>gi|357461579|ref|XP_003601071.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355490119|gb|AES71322.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 452
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/408 (76%), Positives = 368/408 (90%), Gaps = 3/408 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V+IIAKNFMDMVAS+ A LD+LY+N FICEAILRSLPPLAKKYV+Q+ +IDG +PA
Sbjct: 1 MPEVRIIAKNFMDMVASMPAIKLDKLYQNGFICEAILRSLPPLAKKYVLQLLHIDGPVPA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE---EKKKETTYRLNSTFQSNLRKHLIYGGA 117
K++ EWVLPDG +KH+VAIDRLVQLR+F E +K E TY++NST+Q +L+ L++GG
Sbjct: 61 KLLAEWVLPDGLSKHKVAIDRLVQLRVFVEAFDNRKNEKTYQVNSTYQKSLQNLLVHGGT 120
Query: 118 LPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL 177
LPRE MPS IT RLPTLEDLE YA+ QWECFLLQLIS + ++ N SSS+MKVFQR LL
Sbjct: 121 LPRESMPSNITVRLPTLEDLETYALEQWECFLLQLISPSHVDKTLNISSSLMKVFQRRLL 180
Query: 178 SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
S+RD+EAP+LTESGFQFLLMDTNAQLWYI+REYI+NS+ERG++ DLISF+LELSFHV G
Sbjct: 181 SQRDREAPKLTESGFQFLLMDTNAQLWYIIREYITNSEERGVDAGDLISFMLELSFHVIG 240
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFI 297
EAYN+NTL+E Q+++IKD ADLGLVKLQQGRKE+WFIPTKLATNLS+S+T+SS+RKEGF+
Sbjct: 241 EAYNINTLTEFQRNIIKDLADLGLVKLQQGRKESWFIPTKLATNLSVSMTESSSRKEGFV 300
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETNFR+YAYSTSKLHCEILRLFS+VEYQLPNLIVGAITKESLYNAF+NGITA+QI+SF
Sbjct: 301 VVETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAITKESLYNAFDNGITADQIVSF 360
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
L+QNAHPRVA R+P+VPENV DQIRLWESDLNRVEMT A+YYDEFPSR
Sbjct: 361 LRQNAHPRVAQRVPAVPENVTDQIRLWESDLNRVEMTEAYYYDEFPSR 408
>gi|224077128|ref|XP_002305144.1| predicted protein [Populus trichocarpa]
gi|222848108|gb|EEE85655.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/407 (76%), Positives = 368/407 (90%), Gaps = 4/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V+IIAKNFMDMVASL A LD LY+N FICEAILRSLPPLAKKYV+Q+ + DG +
Sbjct: 1 MPQVRIIAKNFMDMVASLPAMKLDTLYQNAFICEAILRSLPPLAKKYVVQILFFDGPVSG 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
K++E+WVL DG TKHRVAIDRL+QLR+F E +KK+E++Y+LN TFQ+NLRKHL GG L
Sbjct: 61 KLLEQWVLADGVTKHRVAIDRLIQLRIFIEVSDKKRESSYKLNQTFQANLRKHLTNGGVL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PRE M + + +LP+LE+L+ YA+ QWECFLL LISS QAE+PT+ SSSMMK+FQRGLLS
Sbjct: 121 PRETMAAVV--KLPSLEELDTYALEQWECFLLLLISSGQAEKPTSLSSSMMKIFQRGLLS 178
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+RD++APRLTE GFQFLLMDTNAQLWYI+REYI+NS+ERG ADLISFLLELSFHV GE
Sbjct: 179 QRDRDAPRLTEGGFQFLLMDTNAQLWYIIREYITNSEERGTEPADLISFLLELSFHVTGE 238
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AYN+NTL+EIQ++ IKD A+LGLVKLQQGRKE+WFIPTKLATNLS+SLTDSS+RK+G++V
Sbjct: 239 AYNMNTLTEIQRNTIKDLAELGLVKLQQGRKESWFIPTKLATNLSVSLTDSSSRKQGYVV 298
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFR+YAYS+SKLHCEILRLFSK+EYQLPNLIVGAITKESLY AFENGIT++QIISFL
Sbjct: 299 VETNFRLYAYSSSKLHCEILRLFSKIEYQLPNLIVGAITKESLYTAFENGITSDQIISFL 358
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
QQNAHPRVA+R+PSVPENV DQIRLWE+DLNRVE+TP+H+YDEFPSR
Sbjct: 359 QQNAHPRVAERLPSVPENVTDQIRLWEADLNRVEITPSHFYDEFPSR 405
>gi|358346685|ref|XP_003637396.1| General transcription factor IIH subunit [Medicago truncatula]
gi|355503331|gb|AES84534.1| General transcription factor IIH subunit [Medicago truncatula]
Length = 451
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/407 (76%), Positives = 365/407 (89%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V+IIAKNFMDMVAS+ LD+LYEN FICEAILRSLPPLAKKYV+Q+F+IDG +PA
Sbjct: 1 MPEVRIIAKNFMDMVASMPTIKLDKLYENGFICEAILRSLPPLAKKYVLQLFHIDGPVPA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
K++ EWVLPDG +KH+VAIDRLVQL +F E + K TY++N T+Q +L+K L++GG L
Sbjct: 61 KLLAEWVLPDGLSKHKVAIDRLVQLGVFVEALDSKNGKTYKVNPTYQRSLQKLLVHGGTL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PRE MPS IT RLPTLEDLE YA+ QWECFLLQLIS + ++ N SSS+MKVFQR LLS
Sbjct: 121 PRESMPSNITVRLPTLEDLETYALEQWECFLLQLISPSHVDKTLNISSSLMKVFQRRLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+RD+EAP+LTESGFQFLLMDTNAQLWYI+REYI+NS+ERG++ DLISF+LELSFHV GE
Sbjct: 181 QRDREAPKLTESGFQFLLMDTNAQLWYIIREYITNSEERGVDAGDLISFMLELSFHVIGE 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AYN+NTL+E Q+++IKD ADLGLVKLQQGRKE+WFIPTKLATNLS+S+T+SS+RKEGF+V
Sbjct: 241 AYNINTLTEFQRNIIKDLADLGLVKLQQGRKESWFIPTKLATNLSVSMTESSSRKEGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFR+YAYSTSKLHCEILRLFS+VEYQLPNLIVGAITKESLYNAF+NGITA+QI+SF
Sbjct: 301 VETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAITKESLYNAFDNGITADQIVSFF 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+QNAHPRVA R+P+VPENV DQIRLWESDLNRVEMT A+YYDEFPSR
Sbjct: 361 RQNAHPRVAQRIPAVPENVTDQIRLWESDLNRVEMTEAYYYDEFPSR 407
>gi|356521347|ref|XP_003529318.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
max]
Length = 451
Score = 664 bits (1713), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/407 (76%), Positives = 365/407 (89%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V+IIAKNFMDMVAS+ A LD+LYEN FICEAILRS+PPLAKKYV+QM +ID + A
Sbjct: 1 MPEVRIIAKNFMDMVASMPAMKLDKLYENGFICEAILRSVPPLAKKYVIQMLHIDVPVAA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
K++EEWVLP G +KHRVAIDRLVQLR+F E ++K E TY++N TFQ +L+K L+ GG L
Sbjct: 61 KLLEEWVLPAGVSKHRVAIDRLVQLRVFLEAVDRKNEKTYKVNPTFQRSLQKLLVQGGTL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PRE MPS IT RLPTLE+LEAYA+ QWECFLLQLIS +Q E+P N SSS+MKVFQR +LS
Sbjct: 121 PRESMPSNITVRLPTLENLEAYALEQWECFLLQLISPSQVEKPLNISSSLMKVFQRRILS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
RDKEAP+LTESGFQFLLMDTNAQLWYI+REYISNS++RG++ ADLISF+LELSFHV GE
Sbjct: 181 HRDKEAPKLTESGFQFLLMDTNAQLWYIIREYISNSEDRGVDAADLISFMLELSFHVIGE 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AY+++TL+ Q+++I D ADLGLVK+QQGRK +WFIPTKLATNLSMSL DSS+RK+GF+V
Sbjct: 241 AYSVDTLTSFQRTIINDLADLGLVKIQQGRKGSWFIPTKLATNLSMSLADSSSRKQGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFR+YAYSTSKLHCEILRLFS+VEYQLPNLIVGAITKESLY+A+ENGITAEQI++FL
Sbjct: 301 VETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAITKESLYSAYENGITAEQIVTFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
QQNAHPRVA+R+PSVPENV +QIRLWE+DLNRVEMT +YYDEFPSR
Sbjct: 361 QQNAHPRVAERVPSVPENVTEQIRLWEADLNRVEMTDTYYYDEFPSR 407
>gi|356548733|ref|XP_003542754.1| PREDICTED: general transcription factor IIH subunit 4-like [Glycine
max]
Length = 451
Score = 662 bits (1708), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/407 (76%), Positives = 365/407 (89%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V+IIAKNFMDMVAS+ A LD+LYEN FICEAILRSL PLAKKYV+QM +ID ++ A
Sbjct: 1 MPEVRIIAKNFMDMVASMPAMKLDKLYENGFICEAILRSLLPLAKKYVIQMLHIDVSVAA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
K++EEWVLP G +KHRVAIDRLVQLR+F E ++K E TY++N TFQ +L+K L+ GG L
Sbjct: 61 KLLEEWVLPGGVSKHRVAIDRLVQLRVFLEAVDRKNEKTYKVNPTFQRSLQKLLVQGGTL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PRE MPS IT RLPTLE+LEAYA+ QWECFLLQLIS +Q E+P N SSS+MKVFQR +LS
Sbjct: 121 PRESMPSNITVRLPTLENLEAYALEQWECFLLQLISPSQVEKPLNISSSLMKVFQRRILS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
RDKEAP+LTESGFQFLLMDTNAQLWYI+REYISNS++RG++ ADLISF+LELSFH GE
Sbjct: 181 YRDKEAPKLTESGFQFLLMDTNAQLWYIIREYISNSEDRGVDAADLISFMLELSFHDIGE 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AY+++TL+ Q+++I D ADLGLVK+QQGRK +WFIPTKLATNLSMSL DSS+RK+GF+V
Sbjct: 241 AYSVDTLTSFQRTIINDLADLGLVKIQQGRKGSWFIPTKLATNLSMSLADSSSRKQGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFR+YAYSTSKLHCEILRLFS+VEYQLPNLIVGAITKESLY+AFENGITAEQII+FL
Sbjct: 301 VETNFRVYAYSTSKLHCEILRLFSRVEYQLPNLIVGAITKESLYSAFENGITAEQIITFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
QQNAHPRVA+R+PSVPENV +QIRLWE+DLNRVEMT A+YYDEFPSR
Sbjct: 361 QQNAHPRVAERIPSVPENVTEQIRLWEADLNRVEMTDAYYYDEFPSR 407
>gi|42566894|ref|NP_193435.2| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|51969678|dbj|BAD43531.1| unnamed protein product [Arabidopsis thaliana]
gi|115646777|gb|ABJ17114.1| At4g17020 [Arabidopsis thaliana]
gi|332658439|gb|AEE83839.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 452
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/407 (75%), Positives = 360/407 (88%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP VKIIAKNFMDMVASL A LD+LY N FICEAILRSLPPLAKKYV+QM YID +PA
Sbjct: 1 MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MMEEWVL DG +KHRVAIDRL+QLR+FSE ++K+ T+Y LN TFQ+NL+KH+I GG L
Sbjct: 61 TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREPM S +LP+L++LE YA+ QWECFLLQLI+S Q E+ T SSSMMK+FQRGLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+RDK+ PRLTESGFQFLLMDTNAQLWYI+REYI N++ER ++ ADLISFLLELSFHV G+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AYNLNTL+E+Q + +KD ADLGLVKLQQGRK++WFIPTKLATNLS+SL DSSARKEGF+V
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
+ETNFRMYAYSTSKL CEILRLF+++EYQLPNLI AITKESLYNAF+NGIT++QII+FL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
QQN+HPR ADR+PS+PENV DQIRLWE+DL R+EMT AH+YDEFPS+
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSK 407
>gi|334186626|ref|NP_001190745.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|332658440|gb|AEE83840.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 482
Score = 659 bits (1701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/409 (75%), Positives = 361/409 (88%), Gaps = 2/409 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP VKIIAKNFMDMVASL A LD+LY N FICEAILRSLPPLAKKYV+QM YID +PA
Sbjct: 1 MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MMEEWVL DG +KHRVAIDRL+QLR+FSE ++K+ T+Y LN TFQ+NL+KH+I GG L
Sbjct: 61 TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREPM S +LP+L++LE YA+ QWECFLLQLI+S Q E+ T SSSMMK+FQRGLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+RDK+ PRLTESGFQFLLMDTNAQLWYI+REYI N++ER ++ ADLISFLLELSFHV G+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AYNLNTL+E+Q + +KD ADLGLVKLQQGRK++WFIPTKLATNLS+SL DSSARKEGF+V
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
+ETNFRMYAYSTSKL CEILRLF+++EYQLPNLI AITKESLYNAF+NGIT++QII+FL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRVI 407
QQN+HPR ADR+PS+PENV DQIRLWE+DL R+EMT AH+YDEFPS+ +
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDV 409
>gi|42572935|ref|NP_974564.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
gi|332658438|gb|AEE83838.1| transcription initiation factor TFIIH subunit H4 [Arabidopsis
thaliana]
Length = 462
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/409 (75%), Positives = 361/409 (88%), Gaps = 2/409 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP VKIIAKNFMDMVASL A LD+LY N FICEAILRSLPPLAKKYV+QM YID +PA
Sbjct: 1 MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MMEEWVL DG +KHRVAIDRL+QLR+FSE ++K+ T+Y LN TFQ+NL+KH+I GG L
Sbjct: 61 TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREPM S +LP+L++LE YA+ QWECFLLQLI+S Q E+ T SSSMMK+FQRGLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+RDK+ PRLTESGFQFLLMDTNAQLWYI+REYI N++ER ++ ADLISFLLELSFHV G+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AYNLNTL+E+Q + +KD ADLGLVKLQQGRK++WFIPTKLATNLS+SL DSSARKEGF+V
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
+ETNFRMYAYSTSKL CEILRLF+++EYQLPNLI AITKESLYNAF+NGIT++QII+FL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRVI 407
QQN+HPR ADR+PS+PENV DQIRLWE+DL R+EMT AH+YDEFPS+ +
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSKDV 409
>gi|51969958|dbj|BAD43671.1| unnamed protein product [Arabidopsis thaliana]
Length = 452
Score = 659 bits (1699), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/407 (75%), Positives = 359/407 (88%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP VKIIAKNFMDMVASL A LD+LY N FICEAILRSLPPLAKKYV+QM YID +PA
Sbjct: 1 MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MMEEWVL DG +KHRVAIDRL+QLR+FSE ++K+ T+Y LN TFQ+NL+KH+I GG L
Sbjct: 61 TMMEEWVLADGTSKHRVAIDRLIQLRIFSEISDRKRGTSYSLNPTFQNNLQKHIISGGVL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREPM S +LP+L++LE YA+ QWECFLLQLI+S Q E+ T SSSMMK+FQRGLLS
Sbjct: 121 PREPMNSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMKIFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+RDK+ PRLTESGFQFLLMDTNAQLWYI+REYI N++ER ++ ADLISFLLELSFHV G+
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYILNAEERDVDPADLISFLLELSFHVTGQ 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AYNLNTL+E+Q + +KD ADLGLVKLQQGRK++WFIPTKLATNLS+SL DSSARKEGF+V
Sbjct: 241 AYNLNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
+ETNFRMYAYSTSKL CEILRLF+++EYQLPNLI AITKESLYNAF NGIT++QII+FL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFGNGITSDQIITFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
QQN+HPR ADR+PS+PENV DQIRLWE+DL R+EMT AH+YDEFPS+
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLQRIEMTQAHFYDEFPSK 407
>gi|297800414|ref|XP_002868091.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
lyrata]
gi|297313927|gb|EFH44350.1| hypothetical protein ARALYDRAFT_493174 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 306/409 (74%), Positives = 359/409 (87%), Gaps = 2/409 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP VKIIAKNFMDMVASL A LD+LY N FICEAILRSLPPLAKKYV+QM YI+ +PA
Sbjct: 1 MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYINVPVPA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MMEEWVL DG +KHRVAIDRL+QLR+FSE ++K+ +Y LN TFQ+NL+KH+I GG L
Sbjct: 61 AMMEEWVLADGASKHRVAIDRLIQLRIFSETSDRKRGISYSLNPTFQNNLQKHIISGGVL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREPM S +LP+L++LE YA+ QWECFLLQLI+S Q E+ T SSSMM++FQRGLLS
Sbjct: 121 PREPMHSDNAIKLPSLQELETYALKQWECFLLQLINSGQGEKLTGISSSMMRIFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+RDK+ PRLTESGFQFLLMDTNAQLWYI+REYISN++ER + ADLISFLLELSFHV GE
Sbjct: 181 QRDKDGPRLTESGFQFLLMDTNAQLWYIIREYISNAEERDVEPADLISFLLELSFHVTGE 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AYN NTL+E+Q + +KD ADLGLVKLQQGRK++WFIPTKLATNLS+SL DSSARKEGF+V
Sbjct: 241 AYNSNTLTEVQNNTLKDLADLGLVKLQQGRKDSWFIPTKLATNLSVSLADSSARKEGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
+ETNFRMYAYSTSKL CEILRLF+++EYQLPNLI AITKESLYNAF+NGIT++QII+FL
Sbjct: 301 METNFRMYAYSTSKLQCEILRLFARIEYQLPNLIACAITKESLYNAFDNGITSDQIITFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRVI 407
QQN+HPR ADR+PS+PENV DQIRLWE+DL R+EMT AH+YDEFPS+ +
Sbjct: 361 QQNSHPRCADRVPSIPENVTDQIRLWETDLKRIEMTQAHFYDEFPSKDV 409
>gi|32487401|emb|CAE05735.1| OSJNBb0017I01.15 [Oryza sativa Japonica Group]
Length = 451
Score = 617 bits (1591), Expect = e-174, Method: Compositional matrix adjust.
Identities = 291/407 (71%), Positives = 359/407 (88%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRSLPPLAKKY +QM Y+ + A
Sbjct: 1 MPQVMVVARNFMDMVAALPAAKLDMLYDSAFICEAVLRSLPPLAKKYALQMLYVSAPVAA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MEEWVL + KHRVAIDRL+QLR+F E +++KE +Y++N FQ N++K+L+ GG+L
Sbjct: 61 AAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVSYKMNQKFQGNMQKYLVDGGSL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREP+PS +TARLPTL +LE++A+ QWECFLLQLI+S+Q ER T+FSSSMM+ FQRGLLS
Sbjct: 121 PREPIPSSVTARLPTLAELESFALEQWECFLLQLINSSQVERGTSFSSSMMRTFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
RD EAPRLTE+GFQFLLM+TNAQLWYI+REYIS+++ERG++ +LISFLLELSFH GE
Sbjct: 181 SRDGEAPRLTENGFQFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGE 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AY+LNTL+++Q++ I+D A+LGLVKLQQGRK++WFIPTKLATNLS SL+DSS+ KEGF+V
Sbjct: 241 AYSLNTLTDVQRNAIRDLAELGLVKLQQGRKDSWFIPTKLATNLSASLSDSSSNKEGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFRMYAYSTS+LHCEILRLFS+VEYQLPNLIVG+ITKESLY AFENGITAEQIISFL
Sbjct: 301 VETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
QQNAHPRVAD++P+VPENV DQIRLWE+D NRV+MT +H Y++FPS+
Sbjct: 361 QQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSK 407
>gi|218195833|gb|EEC78260.1| hypothetical protein OsI_17944 [Oryza sativa Indica Group]
Length = 459
Score = 610 bits (1572), Expect = e-172, Method: Compositional matrix adjust.
Identities = 291/415 (70%), Positives = 359/415 (86%), Gaps = 10/415 (2%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRSLPPLAKKY +QM Y+ + A
Sbjct: 1 MPQVMVVARNFMDMVAALPAAKLDMLYDSAFICEAVLRSLPPLAKKYALQMLYVSAPVAA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MEEWVL + KHRVAIDRL+QLR+F E +++KE +Y++N FQ N++K+L+ GG+L
Sbjct: 61 AAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVSYKMNQKFQGNMQKYLVDGGSL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREP+PS +TARLPTL +LE++A+ QWECFLLQLI+S+Q ER T+FSSSMM+ FQRGLLS
Sbjct: 121 PREPIPSSVTARLPTLAELESFALEQWECFLLQLINSSQVERGTSFSSSMMRTFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
RD EAPRLTE+GFQFLLM+TNAQLWYI+REYIS+++ERG++ +LISFLLELSFH GE
Sbjct: 181 SRDGEAPRLTENGFQFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGE 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQ--------QGRKENWFIPTKLATNLSMSLTDSS 290
AY+LNTL+++Q++ I+D A+LGLVKLQ QGRK++WFIPTKLATNLS SL+DSS
Sbjct: 241 AYSLNTLTDVQRNAIRDLAELGLVKLQQMSRVKPWQGRKDSWFIPTKLATNLSASLSDSS 300
Query: 291 ARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGIT 350
+ KEGF+VVETNFRMYAYSTS+LHCEILRLFS+VEYQLPNLIVG+ITKESLY AFENGIT
Sbjct: 301 SNKEGFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGIT 360
Query: 351 AEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
AEQIISFLQQNAHPRVAD++P+VPENV DQIRLWE+D NRV+MT +H Y++FPS+
Sbjct: 361 AEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSK 415
>gi|414584751|tpg|DAA35322.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
Length = 451
Score = 597 bits (1538), Expect = e-168, Method: Compositional matrix adjust.
Identities = 281/407 (69%), Positives = 353/407 (86%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRSLPPLAKKY +QM Y+ + A
Sbjct: 1 MPQVMVVARNFMDMVAALPAGKLDMLYDSAFICEAVLRSLPPLAKKYALQMLYVLAPVTA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MEEWVL + KH+VAID+L+QLR+F E +++++ +Y++N FQ N++K+L+ GG+L
Sbjct: 61 AAMEEWVLDEYAAKHKVAIDKLLQLRVFVEVRDRRRDVSYKMNQKFQGNMQKYLVDGGSL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREP+P +T RLPT DLEAYA+ QWECFLLQLI+S+Q E+ ++FSSSMMK FQRGLLS
Sbjct: 121 PREPLPLSVTGRLPTPADLEAYALDQWECFLLQLINSSQVEKGSSFSSSMMKTFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
RD EA +LTE+GFQFLLM+TNAQLWYI+REYIS+++ERG++ +LISFLLELSFH G
Sbjct: 181 SRDGEASKLTENGFQFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHKLGA 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AY+LNTL+++Q+ I+D A+LGLVK QQGRK++WFIPT+LATNLS SL+DSS+ KEGF+V
Sbjct: 241 AYSLNTLTDVQRIAIRDLAELGLVKQQQGRKDSWFIPTQLATNLSASLSDSSSNKEGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFRMYAYSTSKLHCEILRLF++VEYQLPNLIVGA+TKES+Y AFENGITAEQIISFL
Sbjct: 301 VETNFRMYAYSTSKLHCEILRLFARVEYQLPNLIVGAVTKESIYGAFENGITAEQIISFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+QNAHPRVAD++P+VPENV DQIRLWE+DLNRVEM P+H Y++FPS+
Sbjct: 361 RQNAHPRVADKIPAVPENVTDQIRLWETDLNRVEMIPSHLYEDFPSK 407
>gi|326510735|dbj|BAJ91715.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 451
Score = 594 bits (1531), Expect = e-167, Method: Compositional matrix adjust.
Identities = 282/407 (69%), Positives = 349/407 (85%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRS PPLAKKYV+QM Y+ +PA
Sbjct: 1 MPQVMVVARNFMDMVAALPAAKLDMLYDSAFICEAVLRSFPPLAKKYVIQMLYVSAPMPA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
M+EWVL + +KH+VAIDRL+QLR+F E +++KE +Y++N+ FQ+N++K+L+ GG L
Sbjct: 61 AAMQEWVLDEYASKHKVAIDRLLQLRVFVEVRDRRKEVSYKMNNKFQANMQKYLVSGGCL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREP+P +TARLPTL +LE YA+ QWECFLLQLI+S+Q E+ T FSSSMMK FQRGLLS
Sbjct: 121 PREPLPFNVTARLPTLVELENYALEQWECFLLQLINSSQVEKGTTFSSSMMKTFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
RD EA +L+E+GFQFLLM+TNAQLWYI+REYIS+++ERG++ DLISFLLELSFH G
Sbjct: 181 SRDGEAAKLSENGFQFLLMETNAQLWYIMREYISSAEERGVDPTDLISFLLELSFHTQGA 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AY+L+TL+E+Q+ + D +LGLVKLQQGRK++WFIPTKLATNLS SL+DS+A KEG +V
Sbjct: 241 AYSLSTLTEVQRIAVMDLMELGLVKLQQGRKDSWFIPTKLATNLSSSLSDSAASKEGIVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFR+YAYS SKLHCEILRLFS+VEYQLPNLIVGAITKESLY AF+NGITAEQIISFL
Sbjct: 301 VETNFRLYAYSASKLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAEQIISFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
QQNAHPRV D++P VPENV DQIRLWE+D NRVEM +H Y++FPS+
Sbjct: 361 QQNAHPRVIDKIPIVPENVTDQIRLWENDRNRVEMILSHVYEDFPSK 407
>gi|357166758|ref|XP_003580836.1| PREDICTED: general transcription factor IIH subunit 4-like
[Brachypodium distachyon]
Length = 451
Score = 594 bits (1531), Expect = e-167, Method: Compositional matrix adjust.
Identities = 280/407 (68%), Positives = 350/407 (85%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRS PPLAKKY +QM Y+ +PA
Sbjct: 1 MPQVMVVARNFMDMVAALPAAKLDMLYDSAFICEAVLRSFPPLAKKYALQMLYVSAPMPA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MEEWVL + +KH+VAIDRL+QLR+F E +++KE +Y++N FQ N++K+L+ GG+L
Sbjct: 61 AAMEEWVLDEYASKHKVAIDRLLQLRVFVEVRDRRKEVSYKMNDKFQGNMQKYLVDGGSL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREP+ S +T RLPTL +LE YA+ QWECFLLQLI+S+Q ER T+FSSSMMK FQRGLLS
Sbjct: 121 PREPISSSVTGRLPTLTELENYALEQWECFLLQLINSSQVERGTSFSSSMMKTFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
RD +AP+L+E+GFQFLLM+TNAQLWYI+REYIS+++ERG++ +LISFLLELSFH G
Sbjct: 181 SRDGDAPKLSENGFQFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGA 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AY+ NTL+++Q+ I+D A+LGLVK+QQGRK++WFIPTKLATNLS SL+DSSA KEG +V
Sbjct: 241 AYSFNTLTDVQRIAIRDLAELGLVKVQQGRKDSWFIPTKLATNLSSSLSDSSASKEGIVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFR+YAYS S+LHCEILRLFS+VEYQLPNLIVGAITKESLY AF+NGITAEQIISFL
Sbjct: 301 VETNFRLYAYSASRLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAEQIISFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+QNAHPRVAD++P VPENV DQIRLWE+D NRV+M +H Y++FPS+
Sbjct: 361 KQNAHPRVADKIPVVPENVTDQIRLWETDRNRVDMVLSHVYEDFPSK 407
>gi|242074798|ref|XP_002447335.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
gi|241938518|gb|EES11663.1| hypothetical protein SORBIDRAFT_06g033130 [Sorghum bicolor]
Length = 451
Score = 593 bits (1529), Expect = e-167, Method: Compositional matrix adjust.
Identities = 281/407 (69%), Positives = 350/407 (85%), Gaps = 2/407 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRSLPPLAKKY +QM Y+ + A
Sbjct: 1 MPQVMVVARNFMDMVAALPAGKLDMLYDSAFICEAVLRSLPPLAKKYALQMLYVLAPVTA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MEEWVL + KH+VAID+L+QLR+F E +++++ +Y++N FQ N++K+L+ GG+L
Sbjct: 61 AAMEEWVLNEYAAKHKVAIDKLLQLRVFVEVRDRRRDVSYKMNQKFQGNMQKYLVDGGSL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREP+P +T RLPT DLEAYA+ QWECFLLQLI+S+Q E+ T+FSSSMMK FQRGLLS
Sbjct: 121 PREPLPLSVTGRLPTPADLEAYALDQWECFLLQLINSSQVEKGTSFSSSMMKTFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
RD EA +LTE+GFQFLLM+TNAQLWYI+REYIS+++ERG++ +LISFLLELSFH G
Sbjct: 181 SRDGEASKLTENGFQFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHKLGA 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AY+LNTL+++Q+ I+D A+LGLVK QQGR + WFIPT+LATNLS SL+DSS+ KEGF+V
Sbjct: 241 AYSLNTLTDVQRIAIRDLAELGLVKQQQGRTDRWFIPTQLATNLSASLSDSSSNKEGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFRMYAYSTSKLHCEILRLFS+VEYQLPNLIVGA+TKES+Y AFENGITAEQIISFL
Sbjct: 301 VETNFRMYAYSTSKLHCEILRLFSRVEYQLPNLIVGAVTKESIYGAFENGITAEQIISFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+QNAHPRVAD++P+VPENV DQIRLWE+D NRVEM P+H Y++FPS+
Sbjct: 361 RQNAHPRVADKIPTVPENVTDQIRLWETDRNRVEMIPSHLYEDFPSK 407
>gi|357123176|ref|XP_003563288.1| PREDICTED: LOW QUALITY PROTEIN: general transcription factor IIH
subunit 4-like [Brachypodium distachyon]
Length = 452
Score = 588 bits (1516), Expect = e-165, Method: Compositional matrix adjust.
Identities = 279/408 (68%), Positives = 348/408 (85%), Gaps = 3/408 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRS PPL KKY +QM Y+ +PA
Sbjct: 1 MPQVMVVARNFMDMVAALPATKLDMLYDSAFICEAVLRSFPPLPKKYTLQMLYVSAPMPA 60
Query: 61 KMMEEWVLPDGF-TKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGA 117
MEEWVL D + +KH+VAIDRL+QLR+F E + +KE +Y++N FQ N++K+L+ GG+
Sbjct: 61 AAMEEWVLLDEYASKHKVAIDRLLQLRVFVEVRDXRKEVSYKMNDKFQGNMQKYLVDGGS 120
Query: 118 LPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL 177
LPREP+PS +T RLPTL +LE YA+ QWECFLLQLI+S+Q ER T+FSSSMMK FQRGLL
Sbjct: 121 LPREPIPSSVTGRLPTLTELENYALEQWECFLLQLINSSQVERGTSFSSSMMKTFQRGLL 180
Query: 178 SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
S RD +AP+L+E+GFQFLLM+TN QLWYI+REYIS+++ERG++ +LISFLLELSFH G
Sbjct: 181 SSRDGDAPKLSENGFQFLLMETNVQLWYIMREYISSAEERGVDPTELISFLLELSFHTLG 240
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFI 297
AY+ NTL+++Q+ I+D A+LGLVK+QQGRK++WFIPTKLATNLS SL+DSSA KEG +
Sbjct: 241 AAYSFNTLTDVQRIAIRDLAELGLVKVQQGRKDSWFIPTKLATNLSSSLSDSSASKEGIV 300
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETNFR+YAYS S+LHCEILRLFS+VEYQLPNLIVGAITKESLY AF+NGITAEQIISF
Sbjct: 301 VVETNFRLYAYSASRLHCEILRLFSRVEYQLPNLIVGAITKESLYGAFDNGITAEQIISF 360
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
L+QNAHP VAD++P VPENV DQIRLWE+D NRV+M +H Y++FPS+
Sbjct: 361 LKQNAHPXVADKIPVVPENVTDQIRLWETDRNRVDMVLSHVYEDFPSK 408
>gi|414584750|tpg|DAA35321.1| TPA: hypothetical protein ZEAMMB73_386928 [Zea mays]
Length = 386
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 263/383 (68%), Positives = 331/383 (86%), Gaps = 2/383 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRSLPPLAKKY +QM Y+ + A
Sbjct: 1 MPQVMVVARNFMDMVAALPAGKLDMLYDSAFICEAVLRSLPPLAKKYALQMLYVLAPVTA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MEEWVL + KH+VAID+L+QLR+F E +++++ +Y++N FQ N++K+L+ GG+L
Sbjct: 61 AAMEEWVLDEYAAKHKVAIDKLLQLRVFVEVRDRRRDVSYKMNQKFQGNMQKYLVDGGSL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREP+P +T RLPT DLEAYA+ QWECFLLQLI+S+Q E+ ++FSSSMMK FQRGLLS
Sbjct: 121 PREPLPLSVTGRLPTPADLEAYALDQWECFLLQLINSSQVEKGSSFSSSMMKTFQRGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
RD EA +LTE+GFQFLLM+TNAQLWYI+REYIS+++ERG++ +LISFLLELSFH G
Sbjct: 181 SRDGEASKLTENGFQFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHKLGA 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AY+LNTL+++Q+ I+D A+LGLVK QQGRK++WFIPT+LATNLS SL+DSS+ KEGF+V
Sbjct: 241 AYSLNTLTDVQRIAIRDLAELGLVKQQQGRKDSWFIPTQLATNLSASLSDSSSNKEGFVV 300
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFRMYAYSTSKLHCEILRLF++VEYQLPNLIVGA+TKES+Y AFENGITAEQIISFL
Sbjct: 301 VETNFRMYAYSTSKLHCEILRLFARVEYQLPNLIVGAVTKESIYGAFENGITAEQIISFL 360
Query: 359 QQNAHPRVADRMPSVPENVCDQI 381
+QNAHPRVAD++P+VPENV DQ+
Sbjct: 361 RQNAHPRVADKIPAVPENVTDQV 383
>gi|222629781|gb|EEE61913.1| hypothetical protein OsJ_16644 [Oryza sativa Japonica Group]
Length = 427
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 265/407 (65%), Positives = 331/407 (81%), Gaps = 26/407 (6%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRSLPPLAKKY +QM Y+ + A
Sbjct: 1 MPQVMVVARNFMDMVAALPAAKLDMLYDSAFICEAVLRSLPPLAKKYALQMLYVSAPVAA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MEEWVL + KHRVAIDRL+QLR+F E +++KE +Y++N FQ N++K+L+ GG+L
Sbjct: 61 AAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVSYKMNQKFQGNMQKYLVDGGSL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
PREP+PS +TARLPTL +LE++A+ QWE + +
Sbjct: 121 PREPIPSSVTARLPTLAELESFALEQWEVRMTKAYVCI---------------------- 158
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
D EAPRLTE+GFQFLLM+TNAQLWYI+REYIS+++ERG++ +LISFLLELSFH GE
Sbjct: 159 --DGEAPRLTENGFQFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGE 216
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIV 298
AY+LNTL+++Q++ I+D A+LGLVKLQQGRK++WFIPTKLATNLS SL+DSS+ KEGF+V
Sbjct: 217 AYSLNTLTDVQRNAIRDLAELGLVKLQQGRKDSWFIPTKLATNLSASLSDSSSNKEGFVV 276
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETNFRMYAYSTS+LHCEILRLFS+VEYQLPNLIVG+ITKESLY AFENGITAEQIISFL
Sbjct: 277 VETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFL 336
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
QQNAHPRVAD++P+VPENV DQIRLWE+D NRV+MT +H Y++FPS+
Sbjct: 337 QQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSK 383
>gi|90399251|emb|CAJ86205.1| B0213E10.4 [Oryza sativa Indica Group]
Length = 470
Score = 500 bits (1288), Expect = e-139, Method: Compositional matrix adjust.
Identities = 258/454 (56%), Positives = 322/454 (70%), Gaps = 77/454 (16%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRSLPPLAKKY +QM Y+ + A
Sbjct: 1 MPQVMVVARNFMDMVAALPAAKLDMLYDSAFICEAVLRSLPPLAKKYALQMLYVSAPVAA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MEEWVL + KHRVAIDRL+QLR+F E +++KE +Y++N FQ N++K+L+ GG+L
Sbjct: 61 AAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVSYKMNQKFQGNMQKYLVDGGSL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWEC---------------------------FLL- 150
PREP+PS +TARLPTL +LE++A+ QWE FL
Sbjct: 121 PREPIPSSVTARLPTLAELESFALEQWEVRMTKAYVCISYCLNATNVHSPSKGTHSFLYG 180
Query: 151 ---------QLISSTQAER-----PTNFS--SSMMKVFQRGLLS---RRDKEAPRLTESG 191
QLISS + ++ NFS SS KV L+ D EAPRLTE+G
Sbjct: 181 KIVLLAAIDQLISSRERDKFQLIHDENFSARSSEFKVHLISSLAFGIYLDGEAPRLTENG 240
Query: 192 FQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKS 251
FQFLLM+TNAQLWYI+REYIS+++ERG++ +LISFLLELSFH GEAY+LNTL+++Q++
Sbjct: 241 FQFLLMETNAQLWYIMREYISSAEERGVDPTELISFLLELSFHTLGEAYSLNTLTDVQRN 300
Query: 252 MIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRMYAYSTS 311
I+D A+LGLVKLQQ GF+VVETNFRMYAYSTS
Sbjct: 301 AIRDLAELGLVKLQQ----------------------------GFVVVETNFRMYAYSTS 332
Query: 312 KLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMP 371
+LHCEILRLFS+VEYQLPNLIVG+ITKESLY AFENGITAEQIISFLQQNAHPRVAD++P
Sbjct: 333 RLHCEILRLFSRVEYQLPNLIVGSITKESLYGAFENGITAEQIISFLQQNAHPRVADKIP 392
Query: 372 SVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+VPENV DQIRLWE+D NRV+MT +H Y++FPS+
Sbjct: 393 AVPENVTDQIRLWETDRNRVDMTLSHLYEDFPSK 426
>gi|115461368|ref|NP_001054284.1| Os04g0680000 [Oryza sativa Japonica Group]
gi|113565855|dbj|BAF16198.1| Os04g0680000 [Oryza sativa Japonica Group]
Length = 444
Score = 484 bits (1246), Expect = e-134, Method: Compositional matrix adjust.
Identities = 239/426 (56%), Positives = 314/426 (73%), Gaps = 49/426 (11%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRSLPPLAKKY +QM Y+ + A
Sbjct: 1 MPQVMVVARNFMDMVAALPAAKLDMLYDSAFICEAVLRSLPPLAKKYALQMLYVSAPVAA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
MEEWVL + KHRVAIDRL+QLR+F E +++KE +Y++N FQ N++K+L+ GG+L
Sbjct: 61 AAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVSYKMNQKFQGNMQKYLVDGGSL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQ-------LISSTQAERPTNFSSSMM-- 169
PREP+PS +TARLPTL +LE++A+ QWE + + +++T P+ + S +
Sbjct: 121 PREPIPSSVTARLPTLAELESFALEQWEVRMTKAYVCISYCLNATNVHSPSKGTHSFLYG 180
Query: 170 KVFQRG----LLSRRDKEAPRL------TESGFQFLLMDTNAQLWYIVREYISNSQERGI 219
K+ L+S R+++ +L + +F LM+TNAQLWYI+REYIS+++ERG+
Sbjct: 181 KIVLLAAIDQLISSRERDKFQLIHDENFSARSSEFKLMETNAQLWYIMREYISSAEERGV 240
Query: 220 NQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLA 279
+ +LISFLLELSFH GEAY+LNTL+++Q++ I+D A+LGLVKLQQ
Sbjct: 241 DPTELISFLLELSFHTLGEAYSLNTLTDVQRNAIRDLAELGLVKLQQ------------- 287
Query: 280 TNLSMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKE 339
GF+VVETNFRMYAYSTS+LHCEILRLFS+VEYQLPNLIVG+ITKE
Sbjct: 288 ---------------GFVVVETNFRMYAYSTSRLHCEILRLFSRVEYQLPNLIVGSITKE 332
Query: 340 SLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYY 399
SLY AFENGITAEQIISFLQQNAHPRVAD++P+VPENV DQIRLWE+D NRV+MT +H Y
Sbjct: 333 SLYGAFENGITAEQIISFLQQNAHPRVADKIPAVPENVTDQIRLWETDRNRVDMTLSHLY 392
Query: 400 DEFPSR 405
++FPS+
Sbjct: 393 EDFPSK 398
>gi|449521313|ref|XP_004167674.1| PREDICTED: RNA polymerase II transcription factor B subunit 2-like,
partial [Cucumis sativus]
Length = 296
Score = 479 bits (1234), Expect = e-133, Method: Compositional matrix adjust.
Identities = 228/295 (77%), Positives = 265/295 (89%), Gaps = 2/295 (0%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP VKIIAKNFMDMVASL A LD+LYEN FICEAILRSLPPLAKK+V+QM YIDG + A
Sbjct: 1 MPQVKIIAKNFMDMVASLPAMKLDQLYENAFICEAILRSLPPLAKKFVLQMLYIDGPVSA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGAL 118
K MEEWVLPDG +K++VA+DRL+QLR+F E ++K+ETTYRLN TFQ+NL+K LI+G L
Sbjct: 61 KSMEEWVLPDGVSKYKVAVDRLIQLRVFIETADRKRETTYRLNPTFQANLQKLLIHGEVL 120
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
REPMPS IT RLP+LEDLEAYA+ QWECFLLQLI+S QAE+P+N SSS+MKVFQ+GLLS
Sbjct: 121 AREPMPSNITVRLPSLEDLEAYALDQWECFLLQLINSGQAEKPSNISSSVMKVFQKGLLS 180
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+RDKEAPRLTESGFQFLLM+TNAQLWYI+REYISN++ERG++ ADLISFLLELSFHV GE
Sbjct: 181 QRDKEAPRLTESGFQFLLMETNAQLWYIIREYISNAEERGVDPADLISFLLELSFHVTGE 240
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARK 293
AY+++TLS+ Q+ IKD ADLGLVKLQQGRKE+WFIPTKLATNLSMSL DSS+RK
Sbjct: 241 AYDIDTLSDEQRYAIKDLADLGLVKLQQGRKESWFIPTKLATNLSMSLADSSSRK 295
>gi|167997877|ref|XP_001751645.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162697626|gb|EDQ83962.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 462
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 212/413 (51%), Positives = 302/413 (73%), Gaps = 7/413 (1%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V+++++NFM+MVA+L A LD LY+ + C+A+ RSLPPLAK+YV+++ +++ A+P
Sbjct: 1 MPRVRVVSRNFMEMVAALPAAKLDLLYDRHWTCQAVFRSLPPLAKQYVLRLLWVENAVPL 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEKK---KETTYRLNSTFQSNLRKHLIYGGA 117
K +EW P+ +KH++AIDRL QLR+ E+ T Y +N Q LR+ L GG
Sbjct: 61 KSWQEWARPEALSKHQIAIDRLEQLRVILPERSDLSNSTHYLMNPKLQKQLRQALSTGGG 120
Query: 118 LPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSS-MMKVFQRGL 176
PR+ + + AR+P DLE YA+ QWE LLQL+ A+ P + ++KVFQR
Sbjct: 121 PPRDVVSESVAARVPNSADLENYAMKQWESVLLQLVDCA-ADGPAGPKNPFIIKVFQRSG 179
Query: 177 LSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVA 236
L + E+P LT+ GFQFLLMDTN+QLW +VREY+++S++RG + +L+ FLLEL FH+
Sbjct: 180 LLTPENESPSLTDLGFQFLLMDTNSQLWQLVREYVTSSEDRGTDSGELVGFLLELGFHLV 239
Query: 237 GEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSS--ARKE 294
GEAY++N+LS + ++ + A LGLVKLQQG KE+W+IPTKLA+NLS SL++S+ E
Sbjct: 240 GEAYSVNSLSPALQKVLDELAALGLVKLQQGMKESWYIPTKLASNLSASLSESTDWQSSE 299
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
GF+VVETNF++YAY++SKL EILR F+++EYQLPNL+V +TKES+ A +GI+AEQI
Sbjct: 300 GFVVVETNFKVYAYTSSKLQTEILRCFTRLEYQLPNLVVATLTKESVNKALGSGISAEQI 359
Query: 355 ISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRVI 407
ISFL+++AHP VA ++P VPE V DQ+RLWE+D NRV+ PA++YD+FP+ I
Sbjct: 360 ISFLRKHAHPHVAQKIPVVPETVSDQLRLWETDRNRVQFEPAYFYDDFPTMAI 412
>gi|302806060|ref|XP_002984780.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
gi|302808297|ref|XP_002985843.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
gi|300146350|gb|EFJ13020.1| hypothetical protein SELMODRAFT_123271 [Selaginella moellendorffii]
gi|300147366|gb|EFJ14030.1| hypothetical protein SELMODRAFT_121159 [Selaginella moellendorffii]
Length = 459
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 204/412 (49%), Positives = 297/412 (72%), Gaps = 9/412 (2%)
Query: 1 MPLVKIIAKNFMDMVAS-LTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIP 59
MP V+ ++FMD+VAS LTA +D LY + + C+A+LRSLPPLAK YV+++ Y+D A+P
Sbjct: 1 MPNVRRATRDFMDLVASCLTAARIDALYASKWTCQAVLRSLPPLAKLYVLRLLYLDAAVP 60
Query: 60 AKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEE-----KKKETTYRLNSTFQSNLRKHLIY 114
KM+ EW+ D KH+ A+DRLVQLR+F E +KKET YR+N FQ LR L
Sbjct: 61 DKMLLEWIKGDAVAKHKNAVDRLVQLRVFLPEEKILSRKKETHYRVNPKFQEQLRLALST 120
Query: 115 GGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSS-MMKVFQ 173
G PR+P+P I R+P+ ++L+ YA +WE + L++ + P+ ++ ++++FQ
Sbjct: 121 GLGTPRDPLPPDIAVRMPSAKELDDYATEKWEAWFSSLVNFASPDGPSASTNGFIVQLFQ 180
Query: 174 RGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSF 233
+ L ++ P++T +GFQFLL+D N+QLW ++REY+ ++ R I+ +LI FLLE+ F
Sbjct: 181 KADLLTSGQD-PKITPAGFQFLLLDRNSQLWRVIREYVQYAEARQIDTGELIRFLLEIGF 239
Query: 234 HVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA-R 292
+ GE Y++++L Q++ ++ A LG+++LQ+G K+ WFIPT+LAT LS SL++SSA +
Sbjct: 240 YSVGEPYSMDSLPNSQRNFAEELAMLGVLQLQKGMKDRWFIPTRLATGLSASLSESSAWQ 299
Query: 293 KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAE 352
EGFI+VETNFR+YAY++SKLH E L +F + EY LPN++VG+ITKES+ AF +GI+A+
Sbjct: 300 TEGFIMVETNFRVYAYTSSKLHIETLHVFVRTEYVLPNILVGSITKESVNGAFASGISAD 359
Query: 353 QIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
QII FLQQ+AHP VA ++PSVPE VCDQIRLWESD RV+ PA+ Y+ FPS
Sbjct: 360 QIIKFLQQHAHPFVAQKVPSVPETVCDQIRLWESDRVRVQYLPAYCYEGFPS 411
>gi|359497414|ref|XP_003635508.1| PREDICTED: general transcription factor IIH subunit 4-like, partial
[Vitis vinifera]
Length = 209
Score = 354 bits (909), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 165/205 (80%), Positives = 189/205 (92%), Gaps = 2/205 (0%)
Query: 36 ILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKK 93
+LRSLPPLAKKY++QM YID + AK MEEWVL DGF+KHRVAIDRL+QLR+F+E ++K
Sbjct: 5 VLRSLPPLAKKYILQMLYIDVPVTAKAMEEWVLADGFSKHRVAIDRLIQLRVFTETSDRK 64
Query: 94 KETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLI 153
KET+YRLN TFQ+NL+KHLIYGG LPREPMPS IT RLP+L+DLEAYA+GQWECFLLQLI
Sbjct: 65 KETSYRLNPTFQTNLQKHLIYGGVLPREPMPSNITVRLPSLDDLEAYALGQWECFLLQLI 124
Query: 154 SSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN 213
SSTQ E+ TNFSSSMMKVFQRGLL++R+KEAPRLTESGFQFLLMDTNAQLWYI+REYISN
Sbjct: 125 SSTQTEKLTNFSSSMMKVFQRGLLTQREKEAPRLTESGFQFLLMDTNAQLWYIMREYISN 184
Query: 214 SQERGINQADLISFLLELSFHVAGE 238
S+ERG++ ADLISFLLELSFHV GE
Sbjct: 185 SEERGVDPADLISFLLELSFHVTGE 209
>gi|296084771|emb|CBI25914.3| unnamed protein product [Vitis vinifera]
Length = 225
Score = 353 bits (907), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 165/205 (80%), Positives = 189/205 (92%), Gaps = 2/205 (0%)
Query: 36 ILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKK 93
+LRSLPPLAKKY++QM YID + AK MEEWVL DGF+KHRVAIDRL+QLR+F+E ++K
Sbjct: 5 VLRSLPPLAKKYILQMLYIDVPVTAKAMEEWVLADGFSKHRVAIDRLIQLRVFTETSDRK 64
Query: 94 KETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLI 153
KET+YRLN TFQ+NL+KHLIYGG LPREPMPS IT RLP+L+DLEAYA+GQWECFLLQLI
Sbjct: 65 KETSYRLNPTFQTNLQKHLIYGGVLPREPMPSNITVRLPSLDDLEAYALGQWECFLLQLI 124
Query: 154 SSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN 213
SSTQ E+ TNFSSSMMKVFQRGLL++R+KEAPRLTESGFQFLLMDTNAQLWYI+REYISN
Sbjct: 125 SSTQTEKLTNFSSSMMKVFQRGLLTQREKEAPRLTESGFQFLLMDTNAQLWYIMREYISN 184
Query: 214 SQERGINQADLISFLLELSFHVAGE 238
S+ERG++ ADLISFLLELSFHV GE
Sbjct: 185 SEERGVDPADLISFLLELSFHVTGE 209
>gi|384246869|gb|EIE20357.1| Tfb2-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 451
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 145/403 (35%), Positives = 236/403 (58%), Gaps = 8/403 (1%)
Query: 10 NFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP 69
+F+ + +L A + +LY + + C+A+LR L LAK+YV++M ++D + +M+ W+
Sbjct: 2 DFISWLETLPADTVGKLYSSHWACQAVLRGLTSLAKQYVIRMLFLDVPVTKSVMDSWIAQ 61
Query: 70 DGFTKHRVAIDRLVQLRLF----SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
+KH AI+RL L+L ++ Y+L+ F+ LR + GG + + +P+
Sbjct: 62 SSKSKHATAINRLKGLQLLLPSNVASNGQQAAYQLHPVFREQLRWAVSTGGKVGIDEVPA 121
Query: 126 GITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP 185
G+ + P+ E L +YA QWE + L++ + S + ++ L+ E+
Sbjct: 122 GVLQQAPSREMLSSYAQQQWEVITVPLLNQLKCTCGLGSSFASLQTLMLYLVGSICAESR 181
Query: 186 RLTESGFQFLLMDTNAQLWYIVREYISNSQER-GINQADLISFLLELSFHVAGEAYNLNT 244
+ E GFQFLL DT +QLW ++R YI++ +ER G +++FLL+L F G + L+
Sbjct: 182 SIEEQGFQFLLSDTYSQLWRLLRAYIASGEERSGAPLGTILNFLLQLGFREVGSPFALSG 241
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKEN-WFIPTKLATNLSMSLTDSSAR--KEGFIVVET 301
L + Q+ + D A LGL+ + + W PT+LA L+ + + +GF+VVET
Sbjct: 242 LDDSQRHIAADMAQLGLLMPFTAKDGSVWLAPTRLALALAGGSSGQAQHDVTDGFVVVET 301
Query: 302 NFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 361
N+R+YAY++S L +LRLF++ E LPNL VG +T+ES+ A G++A+QI+ +L+Q+
Sbjct: 302 NYRVYAYTSSLLQTALLRLFTRCECILPNLFVGVLTRESVTGALACGLSADQIVLYLRQH 361
Query: 362 AHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
AHP VA R P VPE V DQ+RLW++D RV A YD+FPS
Sbjct: 362 AHPHVASRTPVVPEVVADQVRLWQADTMRVRHNRAVLYDDFPS 404
>gi|41053973|ref|NP_956221.1| general transcription factor IIH subunit 4 [Danio rerio]
gi|39794736|gb|AAH64301.1| General transcription factor IIH, polypeptide 4 [Danio rerio]
Length = 466
Score = 273 bits (699), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 152/419 (36%), Positives = 233/419 (55%), Gaps = 21/419 (5%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V++ KN + + LT LDRLY +P C A+ R LPPLAK YVM+M ++D +P +
Sbjct: 5 VQLQCKNLHEYLKELTPEILDRLYNHPATCLAVYRELPPLAKNYVMRMLFLDHPLPQAAV 64
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKET--TYRLNSTFQSNLRKHLIYGGA-LPR 120
WV H + L LRL+ + + + LN F+ NLR L+ GG
Sbjct: 65 ALWVKKGSQKDHDQCVSVLTGLRLWHSQHLQGGLQGFVLNPVFKDNLRIALLGGGKPWAD 124
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR 180
E G +E L+ YA+ +WE L ++ S A + + ++ Q GL+
Sbjct: 125 EGANLGPDRHARDVESLDRYAMERWEVILHFMVGSPSAAVSQDLAQLLI---QAGLMKSE 181
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
EAP +T +GFQFLL+DT +QLWY +Y+ +Q RG++ +++SFL +LSF G Y
Sbjct: 182 TGEAPCITSAGFQFLLLDTASQLWYFTLQYLKTAQSRGMDLVEILSFLFQLSFSTLGRDY 241
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKE------ 294
++ +SE + ++ + GLV Q+ RK + PT+LA L+ +T + A
Sbjct: 242 SVEGMSESLLTFLQHLREFGLV-FQRKRKSRRYYPTRLAITLAAGVTANPASGSASSALG 300
Query: 295 --------GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFE 346
GFIVVETN+R+YAY+ S+L ++ LFS++ Y+ PNL+V +T+ES+ A
Sbjct: 301 AIPGTGDTGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNLVVAQVTRESVQQAIS 360
Query: 347 NGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
NGITA+QII FL+ AHP + + P +P + DQIRLWE + +R++ T Y++F S+
Sbjct: 361 NGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELEKDRLQFTEGVLYNQFLSQ 419
>gi|432883135|ref|XP_004074222.1| PREDICTED: general transcription factor IIH subunit 4-like [Oryzias
latipes]
Length = 460
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 151/415 (36%), Positives = 239/415 (57%), Gaps = 19/415 (4%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V++ KN + + L LDRLY +P C A+ R LP LAK YVM+M +++ +P +
Sbjct: 5 VQLQCKNLHEYLRELGPDVLDRLYNHPATCLAVCRELPLLAKNYVMRMLFLEQPLPQAAV 64
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKET--TYRLNSTFQSNLRKHLIYGGAL-PR 120
WV DG H + L LRL+ ++ + Y LN F+ NLR L+ GG +
Sbjct: 65 ALWVRKDGQRDHDECVSVLTGLRLWHSQQLQGGLQGYILNPVFKDNLRIALLGGGKVWAD 124
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVF-QRGLLSR 179
E + G +E L+ YA+ +WE L ++ S P+ S + ++ Q GL+
Sbjct: 125 EGIILGPDRHARDIESLDRYAMERWEVILQFMVGS-----PSAVSQDLAQLLVQAGLMKS 179
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
EAP +T +GFQFLL+DT +QLWY+ +Y++ +Q RG+ +++SFL +LSF G
Sbjct: 180 EAGEAPYITSAGFQFLLLDTASQLWYLTLQYLNTAQSRGMELVEILSFLFQLSFSTLGRD 239
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSS--------- 290
Y++ +SE + ++ + GLV Q+ RK + PT+LA L+ ++ +S
Sbjct: 240 YSVEGMSESLLTFLQHLREFGLV-FQRKRKSRRYYPTRLAITLAAGVSSNSPSNMTNTPG 298
Query: 291 ARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGIT 350
GFIVVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A NGIT
Sbjct: 299 TGDAGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAHLTRESVQQAIANGIT 358
Query: 351 AEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
A+QII FL+ AHP + + P++P + DQIRLWE + +R++ T Y++F S+
Sbjct: 359 AQQIIHFLRTRAHPVMLKQSPALPPTITDQIRLWELERDRLQFTEGVLYNQFLSQ 413
>gi|348542188|ref|XP_003458568.1| PREDICTED: general transcription factor IIH subunit 4 [Oreochromis
niloticus]
Length = 461
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 153/416 (36%), Positives = 235/416 (56%), Gaps = 20/416 (4%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V++ KN + + L+ LDRLY +P C A+ R LP LAK YVM+M ++D +P +
Sbjct: 5 VQLQCKNLHEYLKELSPEILDRLYNHPATCLAVYRELPSLAKNYVMRMLFLDQPLPQAAV 64
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKET--TYRLNSTFQSNLRKHLIYGG-ALPR 120
WV D H I L LRL+ ++ + Y LN F+ NL+ L+ GG A
Sbjct: 65 ALWVKKDSQKDHDECISVLAGLRLWHSQQLQGGLQGYILNPVFKDNLKTALLGGGRAWAD 124
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVF-QRGLLSR 179
E G +E L+ YA+ +WE L ++ S P+ S + ++ Q GL+
Sbjct: 125 EGSTLGPDRHARDIESLDRYAMERWEIILQFMVGS-----PSAVSQDLAQLLVQAGLMKS 179
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
EAP +T +GFQFLL+DT +QLWY +Y+ +Q RG++ +++SFL +LSF G
Sbjct: 180 EAGEAPYITSAGFQFLLLDTASQLWYFTLQYLKTAQSRGMDLVEILSFLFQLSFSSLGRD 239
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS---------- 289
Y++ +SE + ++ + GLV Q+ RK + PT+LA L+ +T S
Sbjct: 240 YSVEGMSESLLTFLQHLREFGLV-FQRKRKSRRYYPTRLAITLAAGVTSSSSVSNLSSSP 298
Query: 290 SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
GFIVVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A NGI
Sbjct: 299 GTGDAGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAQLTRESVQQAIANGI 358
Query: 350 TAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
TA+QII FL+ AHP + + P +P + DQIRLWE + +R++ T Y++F S+
Sbjct: 359 TAQQIIHFLRTRAHPVMLTQTPVLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQ 414
>gi|147899535|ref|NP_001085110.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
laevis]
gi|47939830|gb|AAH72322.1| MGC83106 protein [Xenopus laevis]
Length = 455
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 146/408 (35%), Positives = 238/408 (58%), Gaps = 9/408 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + KN + + L+ LDRLY +P C A+ R LP LAK YVM+M +++ +P +
Sbjct: 6 VHLKCKNLHEFLQELSPGVLDRLYNHPATCLAVFRELPGLAKNYVMRMLFLEQPLPQAAV 65
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + F +H L LRL+ ++ LN F+ NL+ L+ GG A
Sbjct: 66 ALWVKKENFNEHEENTQVLTGLRLWHTQQLPGGLQGLILNPIFRENLKIALLGGGKAWAD 125
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR 180
+ G + L+ YA +WE L ++ S A + + ++ Q GL+
Sbjct: 126 DGCLLGPDKHGKDVVSLDKYAEERWEVILHFMVGSPSAAVSQDLAQLLI---QAGLMKNE 182
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
EAP ++ +GFQFLL+DT +QLWY + +Y+ +++ RG+N +++SF+ +LSF G+ Y
Sbjct: 183 SGEAPCISSAGFQFLLLDTPSQLWYFMLQYLKSAESRGMNLVEILSFMFQLSFSTLGKDY 242
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS--SARKEGFIV 298
++ +S+ + ++ + GLV Q+ RK + PT+LA NL+ ++ S + K+GFIV
Sbjct: 243 SVEGMSDSLLTFLQHLREFGLV-FQRKRKSRRYYPTRLAINLASGISGSVVDSHKQGFIV 301
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETN+R+YAY+ S+L ++ LFS++ Y+ PNL+V +T+E++ A NGITAEQII FL
Sbjct: 302 VETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQVTRENVQQAIGNGITAEQIIHFL 361
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
+ AHP + + P++P + DQIRLWE + +R+ + Y++F S+V
Sbjct: 362 RTRAHPVMLQQNPALPPTITDQIRLWELERDRLRFSEGVLYNQFLSQV 409
>gi|72074363|ref|XP_796111.1| PREDICTED: general transcription factor IIH subunit 4
[Strongylocentrotus purpuratus]
Length = 469
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 155/409 (37%), Positives = 233/409 (56%), Gaps = 12/409 (2%)
Query: 5 KIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMME 64
K+ K+ + +L LDRLY +P C A+ R LP L K +M++ +ID IP +
Sbjct: 14 KLDCKDLHGYLRTLQGSVLDRLYNHPATCLAVFRELPVLGKHCIMRVLFIDTPIPQAAVT 73
Query: 65 EWVLPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGG---ALP 119
W+ + + A+ L LRLF ++ + LN TF++NL+ LI GG A+
Sbjct: 74 SWIQSNQQEQLHAALKTLTDLRLFRDQSLPGGLPGWLLNLTFRTNLKTALIGGGKPWAIS 133
Query: 120 REPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
+ G ++ L+ Y+ +WEC L L+ S++A S + V L R
Sbjct: 134 GKGGKGGKDKKVKETATLDKYSSERWECVLHFLVGSSKA--VDTLSRDIATVLTHSGLMR 191
Query: 180 RDKEA--PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
+ P +T SGFQFLL+DT +Q+W+ + +Y+ SQ RG++ D +SFL +LSF G
Sbjct: 192 LGESGGTPVITPSGFQFLLLDTPSQVWFFMLQYLETSQARGLDIVDALSFLFQLSFSTLG 251
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEG 295
+ Y+ ++E Q ++ +LGLV Q+ RK + PT+LA NL+ ++ + K+G
Sbjct: 252 KDYSSEGMTEQQLHFLQHLRELGLV-FQRKRKSMRYYPTRLAINLASGVSSMAKDDHKDG 310
Query: 296 FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQII 355
FIVVETNFR+YAY+ S L EIL LF + Y+ PNL V A+T+ES+ A NGITAEQI+
Sbjct: 311 FIVVETNFRVYAYTESDLQVEILGLFCSMMYRFPNLSVAALTRESVQLAISNGITAEQIL 370
Query: 356 SFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
SFL+ +AHP + + P VP + DQ+RLWE + +R+ T Y+EF S
Sbjct: 371 SFLRTHAHPNMRLKTPIVPPTISDQVRLWELERDRLSFTQGIIYNEFLS 419
>gi|260810774|ref|XP_002600123.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
gi|229285409|gb|EEN56135.1| hypothetical protein BRAFLDRAFT_57175 [Branchiostoma floridae]
Length = 457
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 142/414 (34%), Positives = 242/414 (58%), Gaps = 25/414 (6%)
Query: 5 KIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMME 64
K+ K+ + +L + LD LY P C A+ R L LAK YVM++ +++ A+P ++
Sbjct: 8 KLDCKDLHGYLKTLGSSLLDELYGRPATCLAVFRELTDLAKHYVMRLLFLEQAVPQAVVG 67
Query: 65 EWVLPDGFTKHRVAIDRLVQLRLFSEEKKKE--TTYRLNSTFQSNLRKHLIYGGA----- 117
WV ++H A+ +L +LR++ E + + L+ TF++NLR L+ GG
Sbjct: 68 SWVTSGNHSEHLRAVSQLSELRVWHEHQISGGLAGWILSPTFRANLRTALLGGGKPWFSV 127
Query: 118 --LPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRG 175
LP + + A L+ Y++ +WE L ++ S +A+ S +M++ +
Sbjct: 128 VQLPPDKHAKDVAA-------LDTYSLERWEVLLNFIVGSGEAQ----VSKDIMEILIKS 176
Query: 176 LLSRRDKEA--PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSF 233
L + ++ + P +T +GFQFLLMDT +Q+WYI+ +Y+ Q RG+N + + FL ++SF
Sbjct: 177 GLMKSEEGSLHPTITPAGFQFLLMDTPSQVWYIILQYLDTMQSRGLNLVEALQFLFQISF 236
Query: 234 HVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA-- 291
G+ Y +S+ + ++ +LGLV Q+ RK F PT+LA +L+ ++D
Sbjct: 237 STLGKDYPTEGMSDSMQQFLQHLRELGLVN-QRKRKSGRFYPTRLAIHLASGISDVEKDF 295
Query: 292 RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITA 351
KEG++VVE+N+R+YAY+ S+L ++ LFS++ Y+ PN++V +T++S+ A GITA
Sbjct: 296 HKEGYLVVESNYRIYAYTDSELQVALIGLFSEILYRFPNMVVANLTRDSVQEAVVRGITA 355
Query: 352 EQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+QI+ FL+ NAHP+ R+P VP + DQIRLWE + +R+ T Y++F S+
Sbjct: 356 DQILHFLRVNAHPKALHRVPIVPPTISDQIRLWEMERDRLTFTEGVLYNQFLSQ 409
>gi|62751795|ref|NP_001015845.1| general transcription factor IIH, polypeptide 4, 52kDa [Xenopus
(Silurana) tropicalis]
gi|58475911|gb|AAH90134.1| general transcription factor II H, polypeptide 4 [Xenopus
(Silurana) tropicalis]
gi|89272862|emb|CAJ82116.1| transcription factor tfb2 [Xenopus (Silurana) tropicalis]
gi|114107989|gb|AAI22900.1| gtf2h4 protein [Xenopus (Silurana) tropicalis]
Length = 455
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 143/408 (35%), Positives = 236/408 (57%), Gaps = 9/408 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V++ KN + + L+ LD+LY +P C A+ R LP LAK YVM+M +++ +P +
Sbjct: 6 VQLKCKNLHEFLQELSPGVLDQLYNHPATCLAVFRELPGLAKNYVMRMLFLEQPLPQAAV 65
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGA-LPR 120
WV + F +H L LRL+ ++ LN F+ NL+ L+ GG
Sbjct: 66 ALWVKKENFKEHEENTQVLTGLRLWHTQQLPGGLQGLILNPIFRENLKIALLGGGKPWAD 125
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR 180
+ G + L+ YA +WE L ++ S A + + ++ Q GL+
Sbjct: 126 DGCLLGPDKHGKDVVSLDKYAEERWEVILHFMVGSPSAAVSQDLAQLLI---QAGLMKSE 182
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
EAP ++ +GFQFLL+DT +QLWY + +Y+ +++ RG+ +++SF+ +LSF G+ Y
Sbjct: 183 SGEAPCISSAGFQFLLLDTPSQLWYFMLQYLKSAESRGMILVEILSFMFQLSFSTLGKDY 242
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS--SARKEGFIV 298
++ +S+ + ++ + GLV Q+ RK + PT+LA NL+ ++ S + K+GFIV
Sbjct: 243 SVEGMSDSLLTFLQHLREFGLV-FQRKRKSRRYYPTRLAINLASGISGSVVDSHKQGFIV 301
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
VETN+R+YAY+ S+L ++ LFS++ Y+ PNL+V +T+E++ A NGITAEQII FL
Sbjct: 302 VETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQVTRENVQQAIGNGITAEQIIHFL 361
Query: 359 QQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
+ AHP + + P +P + DQIRLWE + +R+ + Y++F S+V
Sbjct: 362 RTRAHPVMLQQNPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQV 409
>gi|327266348|ref|XP_003217968.1| PREDICTED: general transcription factor IIH subunit 4-like [Anolis
carolinensis]
Length = 460
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 147/409 (35%), Positives = 236/409 (57%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + KN + + L+ LDRLY +P C A+ R LP LAK YVM+M ++ +P +
Sbjct: 10 VHLKCKNLHEFLRGLSPGVLDRLYNHPATCLAVFRELPGLAKNYVMRMLLLEQPLPQAAV 69
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + + D L+ LRL+ ++ LN F+ NLR L+ GG A
Sbjct: 70 ASWVKKEYTKEQEESSDILLGLRLWHKQLLPGGLQGIVLNPIFKENLRVALLGGGKAWSD 129
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + + GL+ S
Sbjct: 130 DTSQLGPDKHARDVPSLDKYAEERWEVILDFMVGSPSAAVSQDLAQLLTEA---GLMKSS 186
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T SGFQFLL+DT++QLWY + +Y+ +++ RG++ +++SFL +LSF G+
Sbjct: 187 EPGEPPCITSSGFQFLLLDTSSQLWYFMLQYLQSAETRGMDLVEILSFLFQLSFSTLGKD 246
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSS--ARKEGFI 297
Y++ +SE + ++ + GLV Q+ RK + PT+LA NLS ++ + R +GFI
Sbjct: 247 YSVEGMSESLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGISGITIDTRNQGFI 305
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
+VETN+R+YAY+ S+L ++ LFS++ Y+ PNL+V +T+ES+ A NGITA+QII F
Sbjct: 306 IVETNYRIYAYTDSELQIALIALFSEMLYRFPNLVVAQVTRESVQQAIANGITADQIIHF 365
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ + Y++F S+V
Sbjct: 366 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFSEGVLYNQFLSQV 414
>gi|410911638|ref|XP_003969297.1| PREDICTED: general transcription factor IIH subunit 4-like
[Takifugu rubripes]
Length = 465
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 146/419 (34%), Positives = 229/419 (54%), Gaps = 20/419 (4%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V++ KN + L+ LDRLY +P C A+ R LP LAK +VM+M ++D +P +
Sbjct: 5 VQLQCKNLHQYLKELSPDILDRLYNHPATCLAVYRELPSLAKNFVMRMLFLDQPLPQAAV 64
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKET--TYRLNSTFQSNLRKHLIYGG-ALPR 120
WV D H + L LRL+ + + Y LN F+ NL+ L+ GG A
Sbjct: 65 SLWVNKDNQKDHDECVSVLAGLRLWHSQHLQGGLQGYILNPVFKDNLKIALLGGGRAWAD 124
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR 180
E G + L+ YA +WE L ++ S A + + ++ GL+
Sbjct: 125 EGGTLGPDRHARDIGSLDRYATERWEVILHFMVGSPCAAVSQDLAQLLVHA---GLMKSE 181
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
E P +T +GFQFLL+DT +QLWY +Y+ +Q RG++ +++SFL +LSF G Y
Sbjct: 182 AGEPPYITSAGFQFLLLDTASQLWYFTLQYLKTAQSRGMDLVEILSFLFQLSFSTLGRDY 241
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSS---------- 290
++ +S+ + ++ + GLV Q+ RK + PT+LA L+ + SS
Sbjct: 242 SVEGMSDSLLTFLQHLREFGLV-FQRKRKSRRYYPTRLAITLATGDSSSSLHTPTASLAS 300
Query: 291 ---ARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFEN 347
+ GFIVVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A N
Sbjct: 301 TPGSGDSGFIVVETNYRIYAYTNSELQIALVALFSEMLYRFPNVVVAQVTRESVQQAIAN 360
Query: 348 GITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
GITA+QII FL+ AHP + + P +P + DQIRLWE + +R++ T Y++F S+
Sbjct: 361 GITAQQIIHFLRTRAHPVMLRQTPFLPPTITDQIRLWELERDRLQFTEGVLYNQFLSQT 419
>gi|321470926|gb|EFX81900.1| hypothetical protein DAPPUDRAFT_210868 [Daphnia pulex]
Length = 463
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 141/417 (33%), Positives = 238/417 (57%), Gaps = 29/417 (6%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
K+ + +L A LDRLY +P C A+ R LP L++ Y+M++ ++D A+P +M WV
Sbjct: 10 CKDLHSYLKTLPAAVLDRLYNHPATCLAVFRELPELSRIYIMRILFVDQAVPKAIMGSWV 69
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEKKKE--TTYRLNSTFQSNLRKHLIYGGA-------L 118
P+ + + L LR++ E + + + LN TF+ NL+ L+ GG
Sbjct: 70 SPNSARELEDIVKLLTDLRVWQEVEMQGGLKGWLLNPTFRRNLKGALLGGGNEWSMKPNT 129
Query: 119 PREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
+P GI L+ YA+G+WEC L ++ S Q E SS +++ Q L
Sbjct: 130 DADPKARGIAV-------LDEYAMGRWECVLHFMVGSHQHE---AISSDALQILQHAGLM 179
Query: 179 RRDKEAPRL--TESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVA 236
+++ +L T+ GFQFLLMDT+AQ+WY + +Y+ + R ++ D + FL +LSF
Sbjct: 180 KKEPGENQLSITKDGFQFLLMDTSAQVWYFLLQYLDTANSRNLDLIDCLGFLFQLSFSTL 239
Query: 237 GEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS-------MSLTDS 289
G+ Y+ +++++ + ++ + GLV Q+ RK+ F PT+LA +++ ++ ++
Sbjct: 240 GQDYSTDSMNDGLQKFLQHLREFGLV-YQRKRKDGRFYPTRLALDIAAGPKKSMLNSLNA 298
Query: 290 SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
S++ G+IVVETN+R+YAY+ S L ++ LF ++ Y+ PNL+VG IT+ES+ A + GI
Sbjct: 299 SSQTTGYIVVETNYRVYAYTDSSLQVALIALFCELIYRFPNLVVGIITRESVREALKRGI 358
Query: 350 TAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
TA+QI+SFL+Q+AH P +P + DQI+LW + +R Y++F S+
Sbjct: 359 TADQIVSFLRQHAHSECYKEPPVLPSTIADQIKLWAIERDRFLYKDGVLYNQFDSQA 415
>gi|336370852|gb|EGN99192.1| hypothetical protein SERLA73DRAFT_90464 [Serpula lacrymans var.
lacrymans S7.3]
gi|336383609|gb|EGO24758.1| hypothetical protein SERLADRAFT_361686 [Serpula lacrymans var.
lacrymans S7.9]
Length = 469
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 146/411 (35%), Positives = 231/411 (56%), Gaps = 15/411 (3%)
Query: 7 IAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEW 66
I + + S + L RLY+ P C +I R PL ++ +M + +++ AIP M W
Sbjct: 16 IPHTLLPFLQSQSQNTLTRLYQRPSSCFSIFRLTGPLERQIIMNLLWLESAIPIATMSAW 75
Query: 67 VLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSG 126
V+ +G + A+ L L + + K LN TF+++ R+ + GG +P+
Sbjct: 76 VIREGKKLYDAALTTLANLHILPQSNVK---LALNPTFKASFRQAITGGGNSSSFGVPAE 132
Query: 127 ITAR--LPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRDKE 183
+ T+E L+AYA+ +WE L ++SS Q PT S ++ + QR GL+++
Sbjct: 133 KDDKRSTNTVETLDAYALERWETILHYMVSSGQGSLPTKPSQGVLYLLQRSGLMTQNHGS 192
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
A ++T +GFQFLL + QLW ++ +Y+ ++ER ++ +++SFL LS G Y+
Sbjct: 193 ALQITSAGFQFLLHTPHDQLWDLLLQYLHMAEERQMDLVEVLSFLFMLSTMDLGREYSTE 252
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSL--------TDSSARKEG 295
LSE QK+M++D D GLV Q+ F PT+LAT L+ S T + +G
Sbjct: 253 GLSETQKAMLEDLRDYGLV-WQRKATSKRFSPTRLATTLTSSCPPLPTSTGTSGGPQSQG 311
Query: 296 FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQII 355
FIV+ETN+R+YAY+ L +L LF ++Y+ PNL+VG +T+ES+ A NGI+AEQII
Sbjct: 312 FIVLETNYRIYAYTDKPLQTAVLNLFITMKYRFPNLVVGMLTRESVKKALSNGISAEQII 371
Query: 356 SFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
S+L +AHP++ P +P V DQIRLWE + NR++ + Y +F S
Sbjct: 372 SYLTTHAHPQMRKNNPLLPVTVQDQIRLWELERNRLKSEEGYLYKDFGSHA 422
>gi|338718559|ref|XP_003363845.1| PREDICTED: general transcription factor IIH subunit 4 [Equus
caballus]
Length = 463
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 146/409 (35%), Positives = 232/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + SL+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGSLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPVFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKST 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|417401351|gb|JAA47565.1| Putative rna polymer [Desmodus rotundus]
Length = 463
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 144/405 (35%), Positives = 230/405 (56%), Gaps = 10/405 (2%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
+N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P + WV
Sbjct: 17 CRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKSWVMRMLFLEQPLPQAAVALWV 76
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMP 124
+ + L LR++ + LN F+ NLR L+ GG A +
Sbjct: 77 KKEFSKAQEESTGLLSSLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSDDTSQ 136
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRRDKE 183
G + L+ YA +WE L ++ S A + + ++ Q GL+ S E
Sbjct: 137 LGPDKHTRDVPSLDKYAEERWEVVLHFMVGSPDAAVSQDLAQLLI---QAGLMKSAEPGE 193
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+ Y++
Sbjct: 194 PPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVE 253
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFIVVET 301
+S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFIVVET
Sbjct: 254 GMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVVET 312
Query: 302 NFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 361
N+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+
Sbjct: 313 NYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTR 372
Query: 362 AHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 373 AHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|395831929|ref|XP_003789035.1| PREDICTED: general transcription factor IIH subunit 4 [Otolemur
garnettii]
Length = 463
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 144/409 (35%), Positives = 231/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + SL+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGSLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKST 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS ++ + + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|390594544|gb|EIN03954.1| transcription factor Tfb2 [Punctularia strigosozonata HHB-11173
SS5]
Length = 467
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 144/395 (36%), Positives = 234/395 (59%), Gaps = 18/395 (4%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTK--HRVAID 80
L RL+ P C ++ R L PL ++ VM + +++ A+PA+ M++W++P G + + +++
Sbjct: 31 LTRLWGRPSACLSVFRLLRPLDQQLVMNLLWLESAVPAQTMQQWIVPGGNNRKLYHASLE 90
Query: 81 RLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED---- 136
L +L + S + +LNST++++LR I GG + GI A +
Sbjct: 91 ILSRLHILS--GNTQVRLQLNSTWKASLRAA-ITGGTVATSSGSFGIPAERGAKDSMDKD 147
Query: 137 -LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRDKEAPRLTESGFQF 194
L+A+A+ +WE L ++SS A P S+ ++ + QR GL+ P++T +GFQF
Sbjct: 148 GLDAFALERWETILHFMVSSGTAHNPPRPSAGVLFLLQRSGLMGGGGN--PQITSAGFQF 205
Query: 195 LLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIK 254
LL + +AQLW ++ +Y+ ++ER ++ +++SF+ LS G Y+ LS+ QK+M++
Sbjct: 206 LLHEPHAQLWELLLQYLRMAEERQMDLVEVLSFIFMLSTTELGREYSTENLSDTQKAMLE 265
Query: 255 DFADLGLVKLQQGRKENWFIPTKLATNLSMSL----TDSSARKEGFIVVETNFRMYAYST 310
D D GL+ QQ F PT+LAT L+ S T S EGFIV+ETN+R+YAY+
Sbjct: 266 DLRDYGLI-WQQKPTSKRFSPTRLATTLTSSSPPLPTSGSGVSEGFIVLETNYRLYAYTD 324
Query: 311 SKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRM 370
+ L +L LF+ + + PNL+VG IT+ES+ A ++GI+AEQIIS+L +AHP++
Sbjct: 325 NPLQTAVLALFTTLRSRFPNLVVGQITRESVKRALQSGISAEQIISYLSTHAHPQMRKNN 384
Query: 371 PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
P +P V DQIRLWE + NR++ Y EF S+
Sbjct: 385 PLIPVTVQDQIRLWELEKNRLKSQEGFLYTEFASQ 419
>gi|426250586|ref|XP_004019016.1| PREDICTED: general transcription factor IIH subunit 4 [Ovis aries]
Length = 463
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/409 (35%), Positives = 231/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGILDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSA 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|281337500|gb|EFB13084.1| hypothetical protein PANDA_018954 [Ailuropoda melanoleuca]
Length = 463
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/409 (35%), Positives = 231/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSA 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|410958682|ref|XP_003985944.1| PREDICTED: general transcription factor IIH subunit 4 [Felis catus]
Length = 463
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/409 (35%), Positives = 231/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGILDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKST 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQSPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|344307658|ref|XP_003422497.1| PREDICTED: general transcription factor IIH subunit 4 [Loxodonta
africana]
Length = 463
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/409 (35%), Positives = 231/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSA 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
+VETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 IVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|178056954|ref|NP_001116592.1| general transcription factor IIH subunit 4 [Sus scrofa]
gi|41529162|dbj|BAD08424.1| general transcription factor IIH, polypeptide 4 [Sus scrofa]
gi|47496805|dbj|BAD08426.2| general transcription factor IIH, polypeptide 4 [Sus scrofa]
Length = 463
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 144/409 (35%), Positives = 231/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPVVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSA 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEGFI 297
Y++ +S+ + ++ +LGLV Q+ RK + PT+LA NLS ++ + + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLRELGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|383852611|ref|XP_003701820.1| PREDICTED: general transcription factor IIH subunit 4-like
[Megachile rotundata]
Length = 467
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 139/412 (33%), Positives = 239/412 (58%), Gaps = 24/412 (5%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN + + S L++LY NP IC A+ R LP +AK YVM++ +++ +P ++ W
Sbjct: 18 CKNLQEYLKSRPPEILNKLYHNPPICLAVFRELPVIAKHYVMRLLFVEQPVPQAVIASWC 77
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
F +H+ + L +L ++ E + LN+TF+ NL+ L+ GG P
Sbjct: 78 SKLHFEEHQKVVSILNELNVWKEASIPGGLPGWILNTTFKKNLKIVLLGGGK------PW 131
Query: 126 GITARLPT------LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
++ +L T + L++YA+ +WEC L ++ S Q E S+ +++ L +
Sbjct: 132 TMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQQQE---GISADAVRILLHAGLMK 188
Query: 180 RDKE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
RD+ +P +T++GFQFLL++T +Q+WY + +Y+ + RG++ + ++FL +L+F G
Sbjct: 189 RDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEARGLDLVECLTFLFQLNFSTLG 248
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR---KE 294
+ Y+ +SE + ++ + GLV Q+ RK F PT+LA N++ +R KE
Sbjct: 249 KDYSTEGMSEGLLTFLQHLREFGLV-YQRKRKAGRFYPTRLALNIATGQNKPLSRDPDKE 307
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
G+IVVETN+R+YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITASQI 367
Query: 355 ISFLQQNAHPRVADRMPSV-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+ +LQQ+AH ++ + P V P + DQI+LWE++ NR + Y +F S+
Sbjct: 368 VGYLQQHAHSKMIEAGPPVLPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQ 419
>gi|440896165|gb|ELR48176.1| General transcription factor IIH subunit 4 [Bos grunniens mutus]
Length = 463
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 145/409 (35%), Positives = 231/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGILDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVLSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSA 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|155371845|ref|NP_001094527.1| general transcription factor IIH subunit 4 [Bos taurus]
gi|148878127|gb|AAI46241.1| GTF2H4 protein [Bos taurus]
gi|296474219|tpg|DAA16334.1| TPA: general transcription factor IIH, polypeptide 4, 52kDa [Bos
taurus]
Length = 463
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 145/409 (35%), Positives = 231/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGILDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVLSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSA 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|48102983|ref|XP_395476.1| PREDICTED: general transcription factor IIH subunit 4 [Apis
mellifera]
Length = 467
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 138/412 (33%), Positives = 241/412 (58%), Gaps = 24/412 (5%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN + + S + L++LY NP IC A+ R LP +AK YVM++ +++ +P ++ W
Sbjct: 18 CKNLQEYLKSRPSEVLNKLYHNPPICLAVFRELPVIAKHYVMRLLFVEQPVPQAVIASWC 77
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
F +H+ + L +L +++E + LN+TF+ NL+ L+ GG P
Sbjct: 78 SKLHFEEHQKVVLVLNELNVWNEASIPGGLPGWILNTTFKKNLKIVLLGGGK------PW 131
Query: 126 GITARLPT------LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
++ +L T + L++YA+ +WEC L ++ S Q E S+ +++ L +
Sbjct: 132 TMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQQQE---GISADAVRILLHAGLMK 188
Query: 180 RDKE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
RD+ +P +T++GFQFLL++T +Q+WY + +Y+ + RG++ + ++FL +L+F G
Sbjct: 189 RDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEARGLDLVECLTFLFQLNFSTLG 248
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR---KE 294
+ Y+ +SE + ++ + GLV Q+ RK F PT+LA N++ +R KE
Sbjct: 249 KDYSTEGMSEGLLTFLQHLREFGLV-YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKE 307
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
G+IVVETN+R+YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITATQI 367
Query: 355 ISFLQQNAHPRVADRMPSV-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+ +LQQ+AH ++ + P + P + DQI+LWE++ NR + Y +F S+
Sbjct: 368 VGYLQQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQ 419
>gi|350413829|ref|XP_003490125.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
impatiens]
Length = 467
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 139/412 (33%), Positives = 239/412 (58%), Gaps = 24/412 (5%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN + + S L++LY NP IC A+ R LP +AK YVM++ +++ +P ++ W
Sbjct: 18 CKNLQEYLKSRPPEVLNKLYHNPPICLAVFRELPVIAKHYVMRLLFVEQPVPQAVIASWC 77
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
F +H+ + L +L ++ E + LN+TF+ NL+ L+ GG P
Sbjct: 78 SKLHFEEHQKVVLILNELNVWKEASIPGGLPGWILNTTFKKNLKIVLLGGGK------PW 131
Query: 126 GITARLPT------LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
++ +L T + L++YA+ +WEC L ++ S Q E S+ +++ L +
Sbjct: 132 TMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQQQE---GISADAVRILLHAGLMK 188
Query: 180 RDKE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
RD+ +P +T++GFQFLL++T +Q+WY + +Y+ + RG++ + ++FL +L+F G
Sbjct: 189 RDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEARGLDLVECLTFLFQLNFSTLG 248
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR---KE 294
+ Y+ +SE + ++ + GLV Q+ RK F PT+LA N++ +R KE
Sbjct: 249 KDYSTEGMSEGLSTFLQHLREFGLV-YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKE 307
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
G+IVVETN+R+YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAAQI 367
Query: 355 ISFLQQNAHPRVADRMPSV-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+ +LQQ+AH ++ + P V P + DQI+LWE++ NR + Y +F S+
Sbjct: 368 VGYLQQHAHGKMIEAGPPVLPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQ 419
>gi|4504201|ref|NP_001508.1| general transcription factor IIH subunit 4 [Homo sapiens]
gi|113865855|ref|NP_001038968.1| general transcription factor IIH subunit 4 [Pan troglodytes]
gi|114050799|ref|NP_001040607.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|297677636|ref|XP_002816695.1| PREDICTED: general transcription factor IIH subunit 4 isoform 2
[Pongo abelii]
gi|332245940|ref|XP_003272109.1| PREDICTED: general transcription factor IIH subunit 4 [Nomascus
leucogenys]
gi|426352309|ref|XP_004043656.1| PREDICTED: general transcription factor IIH subunit 4 [Gorilla
gorilla gorilla]
gi|17380328|sp|Q92759.1|TF2H4_HUMAN RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|38503276|sp|P60027.1|TF2H4_PANTR RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|1514597|emb|CAA68870.1| transcription factor TFIIH [Homo sapiens]
gi|13436278|gb|AAH04935.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|15277224|dbj|BAB63317.1| Transcription factor II H [Homo sapiens]
gi|16740884|gb|AAH16302.1| General transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|21655317|gb|AAM64222.1| general transcription factor IIH, polypeptide 4 (52kD subunit)
[Homo sapiens]
gi|27544403|dbj|BAC54936.1| transcription factor II H [Homo sapiens]
gi|30583481|gb|AAP35985.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|32127781|dbj|BAC78171.1| transcription factor II H [Pan troglodytes]
gi|55700798|dbj|BAD69753.1| general transcription factor IIH, polypeptide 4, 52kDa [Macaca
mulatta]
gi|60655839|gb|AAX32483.1| general transcription factor IIH polypeptide 4 [synthetic
construct]
gi|60655841|gb|AAX32484.1| general transcription factor IIH polypeptide 4 [synthetic
construct]
gi|86197966|dbj|BAE78622.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|90960934|dbj|BAE92819.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
gi|90960936|dbj|BAE92820.1| general transcription factor IIH, polypeptide 4 [Pan troglodytes]
gi|114306784|dbj|BAF31271.1| TFIIH protein [Homo sapiens]
gi|119623752|gb|EAX03347.1| general transcription factor IIH, polypeptide 4, 52kDa [Homo
sapiens]
gi|123992979|gb|ABM84091.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
construct]
gi|123999907|gb|ABM87462.1| general transcription factor IIH, polypeptide 4, 52kDa [synthetic
construct]
gi|355561507|gb|EHH18139.1| General transcription factor IIH polypeptide 4, partial [Macaca
mulatta]
gi|355748409|gb|EHH52892.1| General transcription factor IIH polypeptide 4 [Macaca
fascicularis]
gi|380812138|gb|AFE77944.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|380812140|gb|AFE77945.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|383408563|gb|AFH27495.1| general transcription factor IIH subunit 4 [Macaca mulatta]
gi|410214852|gb|JAA04645.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410250640|gb|JAA13287.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410290944|gb|JAA24072.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
gi|410352807|gb|JAA43007.1| general transcription factor IIH, polypeptide 4, 52kDa [Pan
troglodytes]
Length = 462
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 143/409 (34%), Positives = 230/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 12 VHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 71
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 72 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 131
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 132 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKST 188
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 189 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 248
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS ++ + + GFI
Sbjct: 249 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFI 307
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 308 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 367
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 368 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 416
>gi|380014187|ref|XP_003691121.1| PREDICTED: general transcription factor IIH subunit 4-like [Apis
florea]
Length = 467
Score = 251 bits (641), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 138/412 (33%), Positives = 241/412 (58%), Gaps = 24/412 (5%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN + + S + L++LY NP IC A+ R LP +AK YVM++ +++ +P ++ W
Sbjct: 18 CKNLQEYLKSRPSEVLNKLYHNPPICLAVFRELPVIAKHYVMRLLFVEQPVPQAVIASWC 77
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
F +H+ + L +L +++E + LN+TF+ NL+ L+ GG P
Sbjct: 78 SKLHFEEHQKVVLILNELNVWNEASIPGGLPGWILNTTFKKNLKIVLLGGGK------PW 131
Query: 126 GITARLPT------LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
++ +L T + L++YA+ +WEC L ++ S Q E S+ +++ L +
Sbjct: 132 TMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQQQE---GISADAVRILLHAGLMK 188
Query: 180 RDKE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
RD+ +P +T++GFQFLL++T +Q+WY + +Y+ + RG++ + ++FL +L+F G
Sbjct: 189 RDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEARGLDLVECLTFLFQLNFSTLG 248
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR---KE 294
+ Y+ +SE + ++ + GLV Q+ RK F PT+LA N++ +R KE
Sbjct: 249 KDYSTEGMSEGLLTFLQHLREFGLV-YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKE 307
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
G+IVVETN+R+YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITATQI 367
Query: 355 ISFLQQNAHPRVADRMPSV-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+ +LQQ+AH ++ + P + P + DQI+LWE++ NR + Y +F S+
Sbjct: 368 VGYLQQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFIFSEGVLYSQFLSQ 419
>gi|340710070|ref|XP_003393621.1| PREDICTED: general transcription factor IIH subunit 4-like [Bombus
terrestris]
Length = 467
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 139/412 (33%), Positives = 238/412 (57%), Gaps = 24/412 (5%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN + + S L++LY NP IC A+ R LP +AK YVM++ +++ +P ++ W
Sbjct: 18 CKNLQEYLKSRPPEVLNKLYHNPPICLAVFRELPVIAKHYVMRLLFVEQPVPQAVIASWC 77
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
F +H+ + L +L ++ E + LN+TF+ NL+ L+ GG P
Sbjct: 78 SKLHFEEHQKVVLILNELNVWKEASIPGGLPGWILNTTFKKNLKIVLLGGGK------PW 131
Query: 126 GITARLPT------LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
++ +L T + L++YA+ +WEC L ++ S Q E S+ +++ L +
Sbjct: 132 TMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQQQE---GISADAVRILLHAGLMK 188
Query: 180 RDKE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
RD+ +P +T++GFQFLL++T +Q+WY + +Y+ + RG++ + ++FL +L+F G
Sbjct: 189 RDEADGSPVITQAGFQFLLLETASQVWYFILQYLDTIEARGLDLVECLTFLFQLNFSTLG 248
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR---KE 294
+ Y+ +SE + ++ + GLV Q+ RK F PT+LA N++ +R KE
Sbjct: 249 KDYSTEGMSEGLSTFLQHLREFGLV-YQRKRKAGRFYPTRLALNIATGQNKPLSRDPEKE 307
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
G+IVVETN+R+YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNLVVSILTRDSVRQALKSGITAAQI 367
Query: 355 ISFLQQNAHPRVADRMPSV-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+ +LQQ+AH ++ + P V P + DQI+LWE++ NR Y +F S+
Sbjct: 368 VGYLQQHAHGKMIEAGPPVLPPTIVDQIKLWENERNRFIFNEGVLYSQFLSQ 419
>gi|359320969|ref|XP_003639474.1| PREDICTED: general transcription factor IIH subunit 4-like [Canis
lupus familiaris]
Length = 463
Score = 251 bits (640), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 144/409 (35%), Positives = 231/409 (56%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPFLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKST 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
+VETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 IVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|62896773|dbj|BAD96327.1| general transcription factor IIH, polypeptide 4, 52kDa variant
[Homo sapiens]
Length = 462
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 143/409 (34%), Positives = 229/409 (55%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 12 VHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 71
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 72 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 131
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 132 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKST 188
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 189 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 248
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS ++ + + GFI
Sbjct: 249 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFI 307
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 308 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 367
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE +R+ T Y++F S+V
Sbjct: 368 LRTRAHPVMLKQTPVLPPTITDQIRLWELGRDRLRFTEGVLYNQFLSQV 416
>gi|47059175|ref|NP_997666.1| general transcription factor II H, polypeptide 4 [Rattus
norvegicus]
gi|46237651|emb|CAE84027.1| general transcription factor II H, polypeptide 4 [Rattus
norvegicus]
gi|117558335|gb|AAI27470.1| General transcription factor II H, polypeptide 4 [Rattus
norvegicus]
Length = 463
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 142/405 (35%), Positives = 228/405 (56%), Gaps = 10/405 (2%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
+N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P + WV
Sbjct: 17 CRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVALWV 76
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMP 124
+ + L LR++ + LN F+ NLR L+ GG A +
Sbjct: 77 KKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPVFRQNLRIALLGGGKAWSDDTSQ 136
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRRDKE 183
G + L+ YA +WE L ++ S A + + + Q GL+ S E
Sbjct: 137 LGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSTEPGE 193
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+ Y++
Sbjct: 194 PPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVE 253
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEGFIVVET 301
+S+ + ++ + GLV Q+ RK + PT+LA NLS ++ + + GFIVVET
Sbjct: 254 GMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVET 312
Query: 302 NFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 361
N+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+
Sbjct: 313 NYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTR 372
Query: 362 AHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 373 AHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|307106040|gb|EFN54287.1| hypothetical protein CHLNCDRAFT_25123 [Chlorella variabilis]
Length = 458
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 145/413 (35%), Positives = 229/413 (55%), Gaps = 33/413 (7%)
Query: 14 MVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA---KMMEEWVLPD 70
+ S+ L+RL+ P+ AILRSLPPLA+ ++++ G +PA + + W D
Sbjct: 9 ITTSVRGEQLERLFVQPYAVVAILRSLPPLARHMLLRLASTAGTVPAGKRTLADSWATSD 68
Query: 71 GFTKHRVAIDRLVQLRLFSEE-KKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITA 129
G +K A+ L +L L ++E + TY +N+ FQ+ LR+ + G A P +P A
Sbjct: 69 GSSKLSAALCELAELGLLAKEPHDGQVTYTVNAGFQAQLRRTMCSGCACP---LPGHPAA 125
Query: 130 RLPTLEDLEAYAIGQWECFLLQLIS-------------STQAERPTNFSSSMMKVFQRGL 176
+P+ +L+AYA QWE LL L++ +T + P+ +++ + V
Sbjct: 126 PVPSAAELDAYARRQWEALLLYLVNGNGTPPMAPPVLHATPIDIPSLLAAAGLMVKDECT 185
Query: 177 LSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN-SQERGINQADLISFLLELSFHV 235
L ++ +TE GFQFLL + +QLW VR+Y++ + G + A I+FLL L
Sbjct: 186 LEQK------ITEHGFQFLLANLYSQLWSAVRQYLTLLNTAGGADLAVAINFLLRLGLQG 239
Query: 236 AGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR--- 292
A A + L ++++ LGL+ G E W PT+LA L+ A
Sbjct: 240 AAAAMAHSQLDSAERTIAAHMCQLGLLMPVPGANELWLHPTRLAAVLAGGGRAGEAAVAP 299
Query: 293 KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAE 352
+EG+++VE+NFR+YAY+TS + +LR+F + + LPNL VG IT+ES NA + GI A+
Sbjct: 300 EEGYVIVESNFRVYAYTTSAVQVAVLRVFVRCDALLPNLFVGTITRESATNALDTGIAAD 359
Query: 353 QIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
Q+++FL+Q+AHPR A + P+V V DQIRLW +L R++ A YD+F S+
Sbjct: 360 QVVAFLRQHAHPRAAAKTPTV---VTDQIRLWAQELKRLQEKNATLYDKFESK 409
>gi|6754094|ref|NP_034494.1| general transcription factor IIH subunit 4 [Mus musculus]
gi|18202124|sp|O70422.1|TF2H4_MOUSE RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=Basic transcription factor 2 52 kDa subunit;
Short=BTF2 p52; AltName: Full=General transcription
factor IIH polypeptide 4; AltName: Full=TFIIH basal
transcription factor complex p52 subunit
gi|2997755|gb|AAC08594.1| TFIIH transcription/DNA repair factor p52 subunit [Mus musculus]
gi|15215027|gb|AAH12638.1| Gtf2h4 protein [Mus musculus]
gi|26348003|dbj|BAC37650.1| unnamed protein product [Mus musculus]
gi|74194835|dbj|BAE26009.1| unnamed protein product [Mus musculus]
gi|74222199|dbj|BAE26910.1| unnamed protein product [Mus musculus]
Length = 463
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 142/405 (35%), Positives = 228/405 (56%), Gaps = 10/405 (2%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
+N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P + WV
Sbjct: 17 CRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVALWV 76
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMP 124
+ + L LR++ + LN F+ NLR L+ GG A +
Sbjct: 77 KKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPVFRQNLRIALLGGGKAWSDDTSQ 136
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRRDKE 183
G + L+ YA +WE L ++ S A + + + Q GL+ S E
Sbjct: 137 LGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSTEPGE 193
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+ Y++
Sbjct: 194 PPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVE 253
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEGFIVVET 301
+S+ + ++ + GLV Q+ RK + PT+LA NLS ++ + + GFIVVET
Sbjct: 254 GMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVET 312
Query: 302 NFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 361
N+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+
Sbjct: 313 NYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTR 372
Query: 362 AHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 373 AHPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|392594388|gb|EIW83712.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
Length = 486
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 145/431 (33%), Positives = 236/431 (54%), Gaps = 35/431 (8%)
Query: 7 IAKN----FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKM 62
IA+N + + S + L RLY+ P C +I R L PL+++ VM + ++D + A
Sbjct: 13 IAQNSPHVLLPFLQSQSQNALTRLYQRPSSCLSIFRLLAPLSRQLVMNLLWLDSPVLAGT 72
Query: 63 MEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREP 122
M WV+PDG + A++ L +L + K LN TF++++R+ + G
Sbjct: 73 MAAWVVPDGKKLYDEALNTLTRLHILQSSGAK---LALNPTFKASMRQAITCSGTTGSFG 129
Query: 123 MPSGITAR--LPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSR 179
+P+ R P++E L+ YA+ +WE L ++SS Q + PT + ++ + QR GL++
Sbjct: 130 VPAQRDDRRDAPSVETLDGYAVQRWETILHYMVSSGQGQYPTKPTQGVLYLLQRSGLMAS 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
++T SGFQFLL + QLW ++ +Y+ +ER ++ +++SF LS G
Sbjct: 190 YHGSTLQITSSGFQFLLYSPHDQLWDLLLQYLHMVEERQMDLVEVLSFFFMLSTMELGRE 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM--------------- 284
Y+ LSE QK+M++D D GLV Q+ F PT+LAT L+
Sbjct: 250 YSTEPLSETQKAMLEDLRDYGLV-WQRKASSKRFSPTRLATTLTSVSPSLPTAGGSRNPG 308
Query: 285 ------SLTDSSA---RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
S+ +S+ GFIV+ETN+R+YAY+ + L +L LF ++Y+ PNL+VG
Sbjct: 309 SAPGASSVANSTGINTNDHGFIVLETNYRVYAYTDNPLQIAVLNLFITLKYRFPNLVVGM 368
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTP 395
+T+ES+ A NGI+AEQ+IS+L +AHP++ P +P V DQ+RLWE + NR++
Sbjct: 369 LTRESVRRALGNGISAEQVISYLTAHAHPQMRKNNPLLPVTVQDQVRLWELERNRLKSDD 428
Query: 396 AHYYDEFPSRV 406
Y +F ++
Sbjct: 429 GFLYKDFATQA 439
>gi|303272589|ref|XP_003055656.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463630|gb|EEH60908.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 476
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 150/438 (34%), Positives = 234/438 (53%), Gaps = 47/438 (10%)
Query: 2 PLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAK 61
P + +A F+ ++ L R +DRLY +P C A+LRSLP LAK Y M++ Y++G +P+
Sbjct: 1 PHTRAMASAFLGFISQLPPRHVDRLYNSPHACLAVLRSLPELAKHYCMRLMYVEGGVPSI 60
Query: 62 MMEEWVLPDGFTKHRVAIDRLVQLRLF------SEEKKKETTYRLNSTFQSNLRKHL--- 112
M+EWV G H ++ R+ +LR+ + + LN FQ +R +
Sbjct: 61 DMDEWVTDLGRDAHAESVRRMRELRVMLPLEDVTGAAGDGESLGLNPKFQRGMRSIMEGG 120
Query: 113 --IYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMK 170
G S + + LP+ EDLE+YA G+WE LL L ++ A +++ +
Sbjct: 121 GFGDGFDDDDAATASDLGSALPSPEDLESYAKGRWEALLLTLTGASDAFAAAGANAADLD 180
Query: 171 VFQ----RGLLSRRDK-EAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLI 225
V GL+ K E +TE+GF+FLL Q+W + ++ ++
Sbjct: 181 VGALFRAAGLIGDASKGEKEGVTEAGFKFLLSTAREQIWAL------------LDAPAVL 228
Query: 226 SFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS 285
SFLL+L+F G AY+ + L QK +++D A LGL+ + +++PT L++ LS
Sbjct: 229 SFLLKLTFQAPGVAYSTDGLPASQKGVVRDVAKLGLLYPLAAAGKGYYVPTSLSSGLSGG 288
Query: 286 LTDSSARKEG------------------FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQ 327
G I+VETNFR+YAY++S + EILRLF++ +Y+
Sbjct: 289 GGGDDDGDGGVGGGGKKSGDGGGVGARGHIIVETNFRVYAYTSSAVEVEILRLFTRPDYK 348
Query: 328 LPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR-MPSVPENVCDQIRLWES 386
LPNL VG +T+E++ A GI+AEQI+S+L+++AHP+ P++P VCDQIRLW
Sbjct: 349 LPNLYVGMMTREAVVTALRGGISAEQIVSYLRKHAHPQARKTPGPAIPATVCDQIRLWSK 408
Query: 387 DLNRVEMTPAHYYDEFPS 404
D NRV+ TP Y +FP+
Sbjct: 409 DENRVKYTPCVLYCDFPT 426
>gi|307177222|gb|EFN66420.1| General transcription factor IIH subunit 4 [Camponotus floridanus]
Length = 467
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 136/412 (33%), Positives = 238/412 (57%), Gaps = 24/412 (5%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN + + S ++ L++LY P IC A+ R LP +AK YVM++ +++ +P ++ W
Sbjct: 18 CKNLHEYLKSRSSDILNKLYHKPPICLAVFRELPIIAKHYVMRLLFVEQPVPQAVIASWC 77
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
F +H+ + L L ++ E + LN+ F+ NL+ L+ GG P
Sbjct: 78 SKLYFEEHQKVVQVLNDLYVWKEASIPGGLPGWILNNIFKKNLKIVLLGGGK------PW 131
Query: 126 GITARLPT------LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
++ +L T + L++YA+ +WEC L ++ S Q E S+ +++ L +
Sbjct: 132 TMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQQQE---GISADAVRILLHAGLMK 188
Query: 180 RDKE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
RD+ +P +T++GFQFLL+DT +Q+WY + +Y+ + RG++ + ++FL +L+F G
Sbjct: 189 RDEADGSPIITQAGFQFLLLDTASQVWYFILQYLDTIEARGLDLVECLTFLFQLNFSTLG 248
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR---KE 294
+ Y+ +SE + ++ + GLV Q+ RK F PT+LA N++ A+ KE
Sbjct: 249 KDYSTEGMSEGLLTFLQHLREFGLV-YQRKRKAGRFYPTRLALNIATGENKPLAKDTDKE 307
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
G+IVVETN+R+YAY+ S L +L LF ++ Y+ PN++V +T++S+ A ++GITA QI
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNVVVSILTRDSIRQALKSGITASQI 367
Query: 355 ISFLQQNAHPRVADRMPSV-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+ +LQQ+ H ++ +R P + P + DQI+LWE++ NR + Y +F S+
Sbjct: 368 VGYLQQHVHSKMIERGPPILPPTIVDQIKLWENERNRFLFSEGVLYSQFLSQ 419
>gi|74143475|dbj|BAE28812.1| unnamed protein product [Mus musculus]
Length = 463
Score = 247 bits (630), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 140/406 (34%), Positives = 227/406 (55%), Gaps = 10/406 (2%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
+N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P + WV
Sbjct: 17 CRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVALWV 76
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMP 124
+ + L LR++ + LN F+ NLR L+ GG A +
Sbjct: 77 KKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPVFRQNLRIALLGGGKAWSDDTSQ 136
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRRDKE 183
G + L+ YA +WE L ++ S A + + + Q GL+ S E
Sbjct: 137 LGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSTEPGE 193
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
P +T +GFQFLL+D AQLWY + +Y+ +Q RG++ +++SFL +LSF G+ Y++
Sbjct: 194 PPCITSAGFQFLLLDMPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVE 253
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEGFIVVET 301
+S+ + ++ + GLV Q+ RK + PT+LA NLS ++ + + GFIVVET
Sbjct: 254 GMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVET 312
Query: 302 NFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 361
N+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ +GITA+QII FL+
Sbjct: 313 NYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQVIASGITAQQIIHFLRTR 372
Query: 362 AHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRVI 407
AHP + + P +P + DQIRLWE + +R+ T Y++F S+V+
Sbjct: 373 AHPVMLKQNPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQVV 418
>gi|393221241|gb|EJD06726.1| transcription factor Tfb2 [Fomitiporia mediterranea MF3/22]
Length = 464
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/395 (34%), Positives = 223/395 (56%), Gaps = 15/395 (3%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
L RLY P C ++ R L P+ ++ VM + +++ +IP M W+ G + A+ L
Sbjct: 26 LTRLYGRPSACLSVFRLLAPMERQLVMNLLWLESSIPTATMSSWINRKGIKLYDSALSTL 85
Query: 83 VQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPS--GITARLP-TLEDLEA 139
+L + +K N TF+++LR+ + GG +P+ G R TLE L+A
Sbjct: 86 SRLHIIGNPSQK---LNFNPTFKASLRQAITGGGTHRSFGVPAVDGEEKRSSVTLEMLDA 142
Query: 140 YAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDT 199
YA+ +WE L ++SS + P S ++ + +R L + ++T GFQFLL
Sbjct: 143 YAVERWETILHYMVSSGTGQMPARPSQGVLFLLERSGLMSGNGGNMKITSFGFQFLLHSP 202
Query: 200 NAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADL 259
+AQLW ++ +Y+ ++ER ++ +++ FL LS G Y+ L E QK+M++D D
Sbjct: 203 HAQLWELLLQYLHMAEERQMDLIEVLGFLFMLSTMELGRGYSTENLGETQKAMLEDLRDY 262
Query: 260 GLVKLQQGRKENWFIPTKLATNLSMSL--------TDSSARKEGFIVVETNFRMYAYSTS 311
GL+ Q+ F PT+LAT L+ SL ++A+ +GFI++ETN+R+YAY+ +
Sbjct: 263 GLI-WQKKASSRRFYPTRLATTLTSSLPPLPTSGSGGANAQAQGFIILETNYRIYAYTDN 321
Query: 312 KLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMP 371
L +L LF ++Y+ PNL+VG++T++S+ A NGITA+QII +L +AHP++ P
Sbjct: 322 PLQTAVLNLFVTLKYRFPNLVVGSLTRDSVRKALANGITADQIIKYLTTHAHPQMRKNDP 381
Query: 372 SVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
+P V DQIRLWE + NR++ + Y F S+
Sbjct: 382 LIPVTVQDQIRLWELERNRLKSQEGYLYTAFASQA 416
>gi|307212060|gb|EFN87943.1| General transcription factor IIH subunit 4 [Harpegnathos saltator]
Length = 467
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/412 (33%), Positives = 236/412 (57%), Gaps = 24/412 (5%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN + + S L++LY P IC A+ R LP +AK YVM++ +++ +P ++ W
Sbjct: 18 CKNLHEYLKSRPPDILNKLYHKPPICLAVFRELPIIAKHYVMRLLFVEQPVPQAVIASWC 77
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
F +H+ + L +L ++ E + LN+TF+ NL+ L+ GG P
Sbjct: 78 SKLYFEEHQKVVQILNELYVWKEASIPGGLPGWILNNTFKKNLKIVLLGGGR------PW 131
Query: 126 GITARLPT------LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
++ +L T + L++YA+ +WEC L ++ S Q E S+ +++ L +
Sbjct: 132 TMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQQQE---GISADAVRILLHAGLMK 188
Query: 180 RDKE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
RD+ +P +T++GFQFLL+DT +Q+WY + +Y+ + RG++ + ++FL +L+F G
Sbjct: 189 RDETDGSPIITQAGFQFLLLDTASQVWYFILQYLDTIEARGLDLIECLTFLFQLNFSTLG 248
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR---KE 294
+ Y+ +SE + ++ + GLV Q+ RK F PT+LA N++ T R KE
Sbjct: 249 KDYSTEGMSEGLLTFLQHLREFGLV-YQRKRKAGRFYPTRLALNIATGETKPLTRDTDKE 307
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
G+IVVETN+R+YAY+ S L +L LF ++ Y+ PNL+V +T++S+ A ++GITA QI
Sbjct: 308 GYIVVETNYRVYAYTNSNLQVALLGLFCEMIYRFPNLVVSILTRDSVRQALKSGITATQI 367
Query: 355 ISFLQQNAHPRVADRMPSV-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+ +LQQ+AH + + P + P + DQI LWE++ NR + Y +F S+
Sbjct: 368 VGYLQQHAHSKTIEAGPPILPPTIVDQIMLWENERNRFLFSEGVLYSQFLSQ 419
>gi|332030486|gb|EGI70174.1| General transcription factor IIH subunit 4 [Acromyrmex echinatior]
Length = 550
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 135/410 (32%), Positives = 237/410 (57%), Gaps = 24/410 (5%)
Query: 10 NFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP 69
N + + S + L++LY P IC A+ LP +AK YVM++ +++ +P ++ W
Sbjct: 20 NLQEYLKSRSPDTLNKLYHKPPICLAVFCELPVIAKHYVMRLLFVEQPVPQAVIASWCSK 79
Query: 70 DGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGI 127
F +H+ + L +L ++ E + LNSTF+ NL+ L+ GG P +
Sbjct: 80 LYFEEHQKVVQVLNELYVWKEASIPGGLPGWTLNSTFKKNLKIVLLGGGK------PWTM 133
Query: 128 TARLPT------LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRD 181
+ +L T + L++YA+ +WEC L ++ S Q E S+ +++ L +RD
Sbjct: 134 SNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQQQE---GISADAVRILLHAGLMKRD 190
Query: 182 KE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
+ +P +T++GFQFLL+DT +Q+WY + +Y+ + RG++ + ++FL +L+F G+
Sbjct: 191 EADGSPIITQAGFQFLLLDTASQVWYFILQYLDTIEARGLDLVECLTFLFQLNFSTLGKD 250
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR---KEGF 296
Y+ +SE + ++ + GLV Q+ RK F PT+LA N++ T R KEG+
Sbjct: 251 YSTEGMSEGLLTFLQHLREFGLV-YQRKRKAGRFYPTRLALNIATGETKPLTRDTDKEGY 309
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
I+VETN+R+YAY+ S L +L LF ++ Y+ PN++V +T++S+ A ++GITA QI+
Sbjct: 310 IIVETNYRVYAYTNSNLQVALLGLFCEMLYRFPNVVVSILTRDSVRQALKSGITASQIVG 369
Query: 357 FLQQNAHPRVADRMPSV-PENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+L+Q+AH ++ + P + P + DQI+LWE++ NR + Y +F S+
Sbjct: 370 YLRQHAHSKMIEAGPPILPPTIVDQIKLWENERNRFLFSEGVLYSQFLSQ 419
>gi|344252968|gb|EGW09072.1| General transcription factor IIH subunit 4 [Cricetulus griseus]
Length = 463
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 142/405 (35%), Positives = 229/405 (56%), Gaps = 10/405 (2%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
+N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P + WV
Sbjct: 17 CRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVALWV 76
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMP 124
+ + L LR++ + LN F+ NLR L+ GG A +
Sbjct: 77 KKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSDDTSQ 136
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRRDKE 183
G + L+ YA +WE L ++ S A + + + Q GL+ S E
Sbjct: 137 LGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSAEPGE 193
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
P +T +GFQFLL+DT+AQLWY + +Y+ +Q RG++ +++SFL +LSF G+ Y++
Sbjct: 194 PPCITSAGFQFLLLDTSAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVE 253
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEGFIVVET 301
+S+ + ++ + GLV Q+ RK + PT+LA NLS ++ + + GFIVVET
Sbjct: 254 GMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGSVHQPGFIVVET 312
Query: 302 NFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 361
N+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+
Sbjct: 313 NYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTR 372
Query: 362 AHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 373 AHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|392564310|gb|EIW57488.1| transcription factor Tfb2 [Trametes versicolor FP-101664 SS1]
Length = 477
Score = 245 bits (626), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 142/395 (35%), Positives = 225/395 (56%), Gaps = 15/395 (3%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
L RLY+ P C +I R L PL ++ VM + ++D IP M WV +G ++ A+ L
Sbjct: 39 LTRLYQKPSACLSIFRLLRPLERQLVMNLLWLDLPIPPSTMAAWVTREGKKPYQDALSTL 98
Query: 83 VQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREP--MPSGITARLPTLEDLEAY 140
+L + K + LN TF+S LR+ + GG + + P ++ L+ Y
Sbjct: 99 ARLHILPPSTSKHS---LNPTFKSGLRQAITGGGTSGSFGVLIENDDKNPPPEIDALDGY 155
Query: 141 AIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRDKEAPRLTESGFQFLLMDT 199
A+ +WE L ++SS E P+ S ++ + +R GL++R A ++T GFQFLL
Sbjct: 156 ALERWETILYYMVSSGTGEFPSQPSKGVLYLLERSGLMARVHSGALQITSGGFQFLLHPP 215
Query: 200 NAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADL 259
+ QLW ++ +Y+ ++ER ++ ++ISFLL LS G+ Y+ LS QK+M+ D D
Sbjct: 216 HVQLWELLLQYLQMAEERQMDLVEVISFLLMLSTTELGKNYSTENLSPTQKTMLDDLRDY 275
Query: 260 GLVKLQQGRKENWFIPTKLATNLSMSL--------TDSSARKEGFIVVETNFRMYAYSTS 311
GL+K Q+ F PT+LAT L+ S + + +GFIV+ETN+R+YAY+ +
Sbjct: 276 GLIK-QRTPTSRRFSPTRLATTLTSSSPPLPTSAGSGDGSHAQGFIVLETNYRLYAYTDN 334
Query: 312 KLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMP 371
L +L LF ++ + PNL++GAIT++S+ A +GITA+QIIS+L +AHP++ P
Sbjct: 335 PLQIAVLNLFVTLKSRFPNLVIGAITRDSVKKALASGITADQIISYLVTHAHPQMRKNQP 394
Query: 372 SVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
+P V DQIRLWE + NR++ + Y F S+
Sbjct: 395 LLPVTVQDQIRLWELEKNRMKSQEGYLYTAFASQA 429
>gi|351713119|gb|EHB16038.1| General transcription factor IIH subunit 4 [Heterocephalus glaber]
Length = 463
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 143/409 (34%), Positives = 229/409 (55%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSA 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGRD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS ++ + + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGSVHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 309 VVETNYRLYAYTESELQVALIALFSEMLYRFPNVVVAQVTRESVQQAIGSGITAQQIIHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|393246879|gb|EJD54387.1| transcription factor Tfb2 [Auricularia delicata TFB-10046 SS5]
Length = 480
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 142/404 (35%), Positives = 227/404 (56%), Gaps = 20/404 (4%)
Query: 21 RDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAID 80
R L RLY P C A+ R L P+ ++ VM + ++D I + WV DG ++ ++
Sbjct: 26 RSLTRLYSKPSACLAVFRLLSPVERQIVMTLLWLDNNIENATLTPWVTRDGLEIYKASVM 85
Query: 81 RLVQLRLF----SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP--TL 134
L +L + S TT LN F++NLR L+ GG P +A LP T+
Sbjct: 86 ELCKLHIVPATSSTPNLLPTTLDLNGVFKANLRLALVGGGDHNSFGKPVKRSAELPPVTV 145
Query: 135 EDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSR---RDKEAPRLTES 190
L+AYA+ +WE L ++SS + P S ++ + ++ GL+ R R + ++T
Sbjct: 146 SALDAYAVERWETILHFMVSSGTDQSPATPSGAVCNLLRKSGLMIRLDPRSDSSMKITSR 205
Query: 191 GFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQK 250
GFQFLL +AQLW ++ Y+ ++ERG+ +++SFL LS G+ Y+ + L++ Q
Sbjct: 206 GFQFLLSSPHAQLWELLLHYLELAEERGLGLMEVVSFLFMLSTMELGQEYSTDNLTKDQA 265
Query: 251 SMIKDFADLGLV--KLQQGRKENWFIPTKLATNLSMSLTD-------SSARKEGFIVVET 301
+++ + D GL+ + G + +F PT+LAT L SL + +SA GFI++ET
Sbjct: 266 TVLGELLDYGLIYQRALPGMSKRFF-PTRLATTLMSSLPELPRTAGVASATSGGFIILET 324
Query: 302 NFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 361
N+R+YAY+ + L +L LF + + PNL+VG +T++S+ A NGITAEQII++L +
Sbjct: 325 NYRLYAYTDNPLQIAVLNLFVSFKSRFPNLVVGMVTRDSVKKALANGITAEQIITYLSAH 384
Query: 362 AHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
AHP++ P +P V DQ+RLWE + NRV+ Y +F S+
Sbjct: 385 AHPQMRKNNPLLPVTVQDQVRLWELEKNRVKAEEGFLYMDFTSQ 428
>gi|242021693|ref|XP_002431278.1| TFIIH basal transcription factor complex p52 subunit, putative
[Pediculus humanus corporis]
gi|212516535|gb|EEB18540.1| TFIIH basal transcription factor complex p52 subunit, putative
[Pediculus humanus corporis]
Length = 467
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 136/408 (33%), Positives = 231/408 (56%), Gaps = 17/408 (4%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
K + + +L L++LY +P C A+ R LP +A+ YV ++ +++ +P ++ W+
Sbjct: 19 CKYLHEYLQTLPLATLEQLYNHPTTCLAVFRELPQIARHYVSRLLFVEQPVPQAVIASWI 78
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGG---ALPREP 122
+H A L LR++ E + LNS F+ N++ L+ GG + +
Sbjct: 79 SQTHSKEHLTATKVLTDLRVWQEAAIPGGLPGWILNSVFKKNMKISLLGGGRPWTMSAQL 138
Query: 123 MPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDK 182
P +P L++ YA+ +WEC L ++ S E S+ +++ L +RD+
Sbjct: 139 EPDTKPRDIPYLDN---YAMERWECVLHYMVGSQAQE---GISADAVRILLHANLMKRDE 192
Query: 183 E--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
E + +T GFQFLL+DT +Q+WY + +Y+ R ++ + ++FL +LSF G+ Y
Sbjct: 193 EDGSCVITREGFQFLLLDTASQVWYFMLQYLDTVSARNLDLVECLTFLFQLSFSTLGKDY 252
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS--SARKEGFIV 298
+ +SE ++ + GL+ Q+ R+ F PT+LA N++ S KEG+I+
Sbjct: 253 STIGMSEGLLVFLQHLREFGLI-YQRKRRGGRFYPTRLALNIACGENKSLQQMNKEGYII 311
Query: 299 VETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFL 358
+ETN+R+YAY+ S L +L LF ++ Y+ PNL VG IT++S+ AF++GITAEQI+ FL
Sbjct: 312 IETNYRVYAYTDSNLQVALLGLFCEMLYRFPNLSVGLITRDSVRQAFKSGITAEQIVGFL 371
Query: 359 QQNAHPR-VADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+ +AHPR +A P++P V DQI+LWE++LNR+ + Y +F S+
Sbjct: 372 RLHAHPRMIAVGPPTLPPTVVDQIKLWENELNRLVYSDGVLYSQFLSQ 419
>gi|91094191|ref|XP_971121.1| PREDICTED: similar to TFIIH basal transcription factor complex p52
subunit [Tribolium castaneum]
gi|270010854|gb|EFA07302.1| hypothetical protein TcasGA2_TC015892 [Tribolium castaneum]
Length = 472
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 134/410 (32%), Positives = 237/410 (57%), Gaps = 17/410 (4%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN D + + ++ L++LY +P IC A+ R LP LA++YV+++ +++ +P ++ W
Sbjct: 21 CKNLYDYLKTRSSATLEKLYNHPTICLAVYRELPELARQYVIRILFVEQPVPQAVVASWG 80
Query: 68 LPDGFTKHRVAIDR-LVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMP 124
F+K V + + L +L ++ E + L+ TF+ NL+ L+ GG
Sbjct: 81 -AQAFSKEHVYVSKVLSELSVWQEAAIPGGLLGWILSVTFKKNLKIALLGGGKPWSMSSA 139
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKE- 183
+ ++ + L+AY++ +WEC L ++ S Q E S+ +++ L +RD+E
Sbjct: 140 LEVDSKARDVSFLDAYSLERWECVLHYMVGSQQQE---GISADAVRILLHAGLMKRDEED 196
Query: 184 -APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNL 242
+P +T GFQFLL+D AQ+W+ + +Y+ ++RG++ + ++FL +LSF G+ Y+
Sbjct: 197 GSPVITRQGFQFLLLDRQAQVWHFLLQYLDTVEQRGLSLVECLTFLFQLSFSTLGKDYST 256
Query: 243 NTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKE-----GFI 297
+S ++ + GLV Q+ RK F PT+LA N++ S + E G+I
Sbjct: 257 EGMSPGLLIFLQHLREFGLV-YQRKRKAGRFYPTRLALNITCSQGPETRILEEDTPKGYI 315
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
+VETN+R+YAY+ S L ++ LF+++ Y+ PNL+VG IT++S+ A + GITA+QII +
Sbjct: 316 IVETNYRVYAYTDSNLQVALIGLFTELMYRFPNLVVGVITRDSIRQALKGGITADQIIGY 375
Query: 358 LQQNAHPRV--ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
L+Q+AHP++ + +P V DQI+LWE + NR+ + Y +F S+
Sbjct: 376 LKQHAHPQMLEGEAKHPLPPTVVDQIKLWEIERNRLTYSEGVLYSQFLSQ 425
>gi|145351329|ref|XP_001420034.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580267|gb|ABO98327.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 495
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 152/437 (34%), Positives = 233/437 (53%), Gaps = 57/437 (13%)
Query: 11 FMDMVASLTARDLDRLY-ENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP 69
F+ + +L R+LD LY +P+ A+LR+ +AK+ ++++ Y+D A+ +W
Sbjct: 19 FVAFLCALAPRELDALYASSPWSALAVLRACDSIAKQIILRLAYVDDGADARECAKWC-G 77
Query: 70 DGFTKHRV---AIDRLVQLRLFSEEKKKETTYRL-NSTFQSNLRKHLIYG--------GA 117
DG + R A+ RL +LR+ + + E + N F +NLR+ L G GA
Sbjct: 78 DGASDARALERAMARLARLRVTTALGEGEDARVVANEAFANNLRRALERGFVNLDDGDGA 137
Query: 118 LPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL 177
RE +G+ A++P L AYA +WE LL L ++ A FS KV +G L
Sbjct: 138 DARED--AGVAAKVPDRATLNAYAKAKWEDLLLTLTGASNA-----FSRPGAKV--KGRL 188
Query: 178 S-----------------------RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNS 214
+R K A +T GF FLL Q+W ++ +YI +S
Sbjct: 189 DAQALFRAAGLTTATTAALKKGGQKRLKNAG-VTAEGFAFLLQTAQEQIWVLLTQYIRDS 247
Query: 215 QERGINQADLISFLLELSFHVAGEAYNL-NTLSEIQKSMIKDFADLGLVKLQQGRKENWF 273
Q+ A ISFLL L+F G AY + LS+ ++ ++ D LGL+ + +K+ ++
Sbjct: 248 QKTSATSA--ISFLLRLTFQEPGRAYAMTGVLSDTEQDVVLDLTHLGLLYTFEVKKKFYY 305
Query: 274 IPTKLATNLSMSLTDSSA-------RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEY 326
+PT LA+ LS T EG I+VETN+R+YAY++S + EILRLF++ +Y
Sbjct: 306 VPTLLASGLSGGFTGDDGDTKAAAAAAEGHIIVETNYRVYAYTSSAVEMEILRLFTRADY 365
Query: 327 QLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWES 386
+LPNL VG +T+ES+ NA G+ AEQI+ +++ +AH +V + PSVP VCDQIRLW
Sbjct: 366 RLPNLYVGMLTRESVQNALRAGVDAEQIVGYIRAHAHKQVRRKKPSVPSTVCDQIRLWAR 425
Query: 387 DLNRVEMTPAHYYDEFP 403
D+ R+E Y +FP
Sbjct: 426 DMERMEAEECVLYCDFP 442
>gi|442757633|gb|JAA70975.1| Putative rna polymer [Ixodes ricinus]
Length = 459
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/417 (31%), Positives = 231/417 (55%), Gaps = 23/417 (5%)
Query: 2 PLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAK 61
P + + + +L+ LD+LY +P C A+ R LP +++ Y+M++ ++D +P
Sbjct: 6 PSSSLKCNDLHAYLKTLSPATLDQLYTHPATCLAVFRELPIISRHYIMRLMFVDQPVPQA 65
Query: 62 MMEEWVLPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALP 119
++ W +H +++ L L ++++ + L+ F+ N++ L+ GG
Sbjct: 66 VVSSWNEQKYVKEHLESLEALTALHIWADSSLPGGLPGWSLSVVFRKNIQIALLGGG--- 122
Query: 120 REPMPSGITARLPT------LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVF- 172
P + + L + L+ YA+ +WEC L ++ E S+ +++
Sbjct: 123 ---QPWAVYSTLEKDKHGRDAQFLDRYAMERWECVLHFMVGCHTKE---GISADAVRILL 176
Query: 173 QRGLLSRRDKE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLE 230
GL+ ++E AP +T GFQFLLMDT +Q+W+ V +Y+ + RG+N + ++FL +
Sbjct: 177 HAGLMKSEEEEGSAPLITMEGFQFLLMDTASQVWHFVLQYLDTLESRGLNLVECLTFLFQ 236
Query: 231 LSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSS 290
LSF G+ Y+ +SE ++ + GLV Q+ R+ F PT+LA NL+ L +++
Sbjct: 237 LSFLTLGKDYSTEGMSESLLVFLQHLREFGLV-YQRKRRSGRFYPTRLAINLASGLKETN 295
Query: 291 AR--KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENG 348
R + G+IVVETN+R+YAY+ S+L +L LF ++ Y+LPNL+VG +T+ES+ A +G
Sbjct: 296 LRSYESGYIVVETNYRVYAYTDSQLQVALLALFCELMYRLPNLVVGVLTRESVRQALRSG 355
Query: 349 ITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
IT+ QII FLQ +AHP + P +P + DQ+RLWE + +R Y +F S+
Sbjct: 356 ITSNQIIKFLQMHAHPEAQKQSPVIPPTIMDQLRLWELERDRFNFREGVLYSQFISQ 412
>gi|157113167|ref|XP_001651923.1| TFIIH basal transcription factor complex p52 subunit [Aedes
aegypti]
gi|108877858|gb|EAT42083.1| AAEL006356-PA [Aedes aegypti]
Length = 487
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 139/418 (33%), Positives = 234/418 (55%), Gaps = 22/418 (5%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN + + S L++LY P IC A+ R LP +A+++V+++ +++ AIP ++ W
Sbjct: 24 CKNLEEYLKSRPPDALEKLYNYPAICLAVYRELPEIARQFVIRILFVEQAIPQAVVSSWG 83
Query: 68 LPDGFTKHRVAIDR-LVQLRLFSEEKKKET--TYRLNSTFQSNLRKHLIYGGALPREPMP 124
+ K V + R L +L ++ + L TF+ NL+ L+ GG
Sbjct: 84 -SQVYAKENVLVSRVLTELGVWRTAGLTGGLEAWELCPTFKKNLKIALLGGGRPWSMSNA 142
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLIS--STQAERPTNFSSSMMKVFQRGLLSRRDK 182
+ +E L+AYA+ +W C L ++ S++ S +++ L +RD+
Sbjct: 143 LDPDQKSRDIEFLDAYAMSRWRCVLHYMVGAGSSKGMEGEGISPDAVRILLHANLMKRDE 202
Query: 183 E--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
+P +T GFQFLL+DT AQ+W+ + +Y+ + RG++ A+ +S L +LSF G Y
Sbjct: 203 TDGSPVITRQGFQFLLLDTQAQVWHFMLQYLDTCEARGLDLAECLSMLFQLSFSTLGRDY 262
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS-------MSLTDSSARK 293
+ LS+ + ++ + GLV Q+ RKE F PT+LA N++ + SS +
Sbjct: 263 SSEGLSQGLLTFLQHLREFGLV-YQRKRKEGRFYPTRLAHNITSKNAVQTIQEDGSSVQD 321
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
+G+I+VETN+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ AF GITAEQ
Sbjct: 322 KGYIIVETNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLTRDSVRQAFRGGITAEQ 381
Query: 354 IISFLQQNAHPRVADRMPSV------PENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
IIS+L+Q+AHP + + ++ P V DQI+LWE++ NR T Y++F S+
Sbjct: 382 IISYLEQHAHPTMLNMEQAINSKSPLPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQ 439
>gi|391335225|ref|XP_003741996.1| PREDICTED: general transcription factor IIH subunit 4-like
[Metaseiulus occidentalis]
Length = 457
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 139/416 (33%), Positives = 230/416 (55%), Gaps = 25/416 (6%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V I ++ + + +L + L++LY++P C AI R LPPLA++YV+++ +I A+P ++
Sbjct: 3 VGIQYRDLISYLRTLESAVLNQLYKHPATCLAIFRRLPPLARQYVIRLLHIRQAVPQAVI 62
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPRE 121
W P+ H + D L+ LRL+ E ++LN TFQ LR+ L GG
Sbjct: 63 NSWYDPEAARDHTSSEDALLSLRLWHETTLPGGLPGWQLNPTFQEMLRQALAGGGD---- 118
Query: 122 PMPSGITARLPTLED------LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRG 175
P + L + L+ YA +W+C L ++ S E + S +++
Sbjct: 119 --PWIVYGELDKDKHGRDTKFLDQYAQERWDCVLHYMVGS---EVESGISQDAVRILLHS 173
Query: 176 LLSRRDKEAPR----LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLEL 231
L +++ ++P +T GFQFLLMDT Q+WY + +Y+S + RG++ D + F+ +L
Sbjct: 174 GLMKKEDDSPNSKNLITMEGFQFLLMDTEDQVWYFILQYLSTVETRGVSLVDCLQFIFQL 233
Query: 232 SFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA 291
SF G+ Y++ +SE ++ + GLV Q+ RK F PT+LA L+ L + A
Sbjct: 234 SFLTLGKDYSIKGVSENLLVFLQHLREFGLV-YQRKRKSGRFYPTRLAIGLASGLKELQA 292
Query: 292 RKE--GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
K+ G+I+VETN+R+YAY+ S L +L LF ++ Y+ PNL+V +T+ES+ A + GI
Sbjct: 293 TKDEQGYIIVETNYRVYAYTDSPLQVALLSLFCELLYRFPNLLVAVLTRESVRQALKGGI 352
Query: 350 TAEQIISFLQQNAHPRVADRMPS-VPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
T+ QI FL+ +H V +R +P V DQ+RLWE + +R + + Y +F +
Sbjct: 353 TSNQITHFLKSRSHRVVLEREEGIIPMTVTDQLRLWELERDRFKFGESVLYSQFQT 408
>gi|334323685|ref|XP_003340422.1| PREDICTED: valyl-tRNA synthetase, mitochondrial-like [Monodelphis
domestica]
Length = 1641
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 145/439 (33%), Positives = 233/439 (53%), Gaps = 41/439 (9%)
Query: 5 KIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMME 64
++ +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 14 QLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVA 73
Query: 65 EWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPRE 121
WV + + L LR++ + LN F+ NLR L+ GG A +
Sbjct: 74 LWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPVFRQNLRVALLGGGKAWSDD 133
Query: 122 PMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRR 180
G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 134 TSQLGPDKHARDVPSLDKYAEERWEVILHFMVGSPSAAVSQDLAQLLS---QAGLMKSSE 190
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+ Y
Sbjct: 191 PGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDY 250
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS--SARKEGFIV 298
++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +T + +A + GFIV
Sbjct: 251 SVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVTGAGGTAHQPGFIV 309
Query: 299 VETNFRMYAYST-------------------------------SKLHCEILRLFSKVEYQ 327
VETN+R+YAY+ S+L ++ LFS++ Y+
Sbjct: 310 VETNYRLYAYTGEVGGMRMGIGMLEGRNCGFFGIILFFSLATESELQIALIALFSEMLYR 369
Query: 328 LPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESD 387
PN++V IT+ES+ A NGITA+QII FL+ AHP + ++P +P + DQIRLWE +
Sbjct: 370 FPNMVVAQITRESVQQAIANGITAQQIIHFLKTRAHPVMLKQIPVLPPTITDQIRLWELE 429
Query: 388 LNRVEMTPAHYYDEFPSRV 406
+R+ + Y++F S+V
Sbjct: 430 RDRLRFSEGVLYNQFLSQV 448
>gi|339521903|gb|AEJ84116.1| general transcription factor IIH subunit 4 [Capra hircus]
Length = 463
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 139/409 (33%), Positives = 225/409 (55%), Gaps = 10/409 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGILDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHAQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSVAVSQDLAQLLS---QAGLMKSA 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT A LWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAHLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ +L + LFS++ Y+ PN++V +T+ S+ A +GITA+QI+ F
Sbjct: 309 VVETNYRLYAYTEPELQIARIALFSEMLYRFPNMVVPQVTRGSVQQAIASGITAQQIVHF 368
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
L+ A P + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 369 LRTRADPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 417
>gi|255079788|ref|XP_002503474.1| predicted protein [Micromonas sp. RCC299]
gi|226518741|gb|ACO64732.1| predicted protein [Micromonas sp. RCC299]
Length = 458
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 135/426 (31%), Positives = 232/426 (54%), Gaps = 34/426 (7%)
Query: 7 IAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEW 66
+ F+ ++ +++R ++ LY++P+ C ++LRSLP LAK YVM++ Y+D I + M+ W
Sbjct: 1 MGSAFVGFLSDMSSRQINHLYQSPWACLSVLRSLPSLAKHYVMRLLYVDEGIAREEMDAW 60
Query: 67 VLP--DGFTKHRVAIDRLVQLRLFSEEKKKETT------YRLNSTFQSNLRKHL------ 112
V P + +H ++ L +LR+ +E RLN F +R +
Sbjct: 61 VRPGKEHRDRHARSMLALKRLRVLVPAGDQEYVADGKELVRLNRRFVKGVRAQIETCFAP 120
Query: 113 ----IYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSM 168
+ P + M + P+ E +E +A G+WE L+ L + + P +++
Sbjct: 121 EGDEVEATVAPDDAM----GGKRPSPEKIEEFAKGRWEALLMTLTGRDRRKGPGLDVAAL 176
Query: 169 MKVFQRGLLSRRDKEAPR--LTESGFQFLLMDTNAQLWYIVREYI---SNSQERGINQAD 223
+ GL++ + ++ +TE GF+FLL Q+W ++ EY+ S +R +
Sbjct: 177 FR--GAGLVAEKSNKSNGWGITEKGFRFLLSTAREQIWTLLTEYVRQYSAPGDRTLVAPA 234
Query: 224 LISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQG-RKENWFIPTKLATNL 282
+I F+L L+F G+ Y ++ L Q+++ +D A LGL+ L G KE +++PT+L
Sbjct: 235 VIGFMLRLTFQAVGQPYRVDDLPSAQRAIAEDLAHLGLLYLFAGPGKEGYYVPTQLTAGK 294
Query: 283 SMSLTDSSARKE--GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKES 340
++ D+S + G I+ ETNFR+YAY+ S + CEILRLF++ +Y+LPNL VG +T+E+
Sbjct: 295 DVADGDASLGGDPGGHIIAETNFRVYAYTFSDVECEILRLFTRPDYRLPNLYVGMLTREA 354
Query: 341 LYNAFENGITAEQIISFLQQNAHP--RVADRMPSVPENVCDQIRLWESDLNRVEMTPAHY 398
++ A + G+ AEQII +++ +AHP R +P NV DQI LW + RV
Sbjct: 355 VHEALDTGVAAEQIIKYIKSHAHPNARKTTNGSGIPPNVADQIMLWAMERRRVRSAECVL 414
Query: 399 YDEFPS 404
Y +FP+
Sbjct: 415 YCDFPT 420
>gi|403417620|emb|CCM04320.1| predicted protein [Fibroporia radiculosa]
Length = 472
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 146/413 (35%), Positives = 231/413 (55%), Gaps = 27/413 (6%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
+ + S + L RLY+ P C ++ R L PL ++ VM + +++ A+P M WV +
Sbjct: 23 LLTFLQSQSQNALTRLYQRPSSCLSVFRLLQPLQRQLVMNLLWLETAVPVSTMFAWVTRE 82
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGA-----LPREP--- 122
G + + ++ L +L + K LN TF+++LR+ + GG+ +P +P
Sbjct: 83 GRSVYSDSLSTLARLHILPSSPVK---LALNPTFKASLRQAVTGGGSSRSFGVPVDPDRQ 139
Query: 123 MPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRD 181
P+ P + L+AYA+ +WE L ++SS PT S ++ + QR GL++
Sbjct: 140 YPA------PNINTLDAYALERWETILHYMVSSGTGHYPTRPSQEVLFLLQRSGLMTNAH 193
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYN 241
A ++T SGFQFLL +AQLW ++ +Y+ +ER ++ D++SFLL LS G Y+
Sbjct: 194 GGALQITSSGFQFLLHSPHAQLWELLVQYLYMVEERQMDLVDVLSFLLMLSTMELGREYS 253
Query: 242 LNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSL--------TDSSARK 293
L+ Q +M+ D + GLV QQ F P++LAT L+ S + +
Sbjct: 254 TENLTATQTAMLTDLRNYGLV-WQQSPSSRRFSPSRLATTLTSSSSPLPTSQNSGTGPHG 312
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
EGFIV+ETN+R+YAY+ + L +L LF ++ + PNL++GAIT+ES+ A NGITA+Q
Sbjct: 313 EGFIVLETNYRIYAYTDNPLQIAVLNLFVTLKSRFPNLVIGAITRESVKKALANGITADQ 372
Query: 354 IISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
IIS+L +AHP++ P +P V DQIRLWE + NRV+ + Y F S+
Sbjct: 373 IISYLTAHAHPQMHKNKPLLPVTVQDQIRLWELERNRVKSEEGYLYTSFASQA 425
>gi|409077631|gb|EKM77996.1| hypothetical protein AGABI1DRAFT_114851 [Agaricus bisporus var.
burnettii JB137-S8]
gi|426198975|gb|EKV48900.1| hypothetical protein AGABI2DRAFT_191077 [Agaricus bisporus var.
bisporus H97]
Length = 467
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 140/393 (35%), Positives = 221/393 (56%), Gaps = 13/393 (3%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
L RLY+ P C ++ R L PL ++ VM + +++ AI M WV DG + A+ L
Sbjct: 31 LTRLYQRPSSCLSVFRLLAPLERQIVMNLLWLESAIATSTMVAWVQRDGRKLYEEALGNL 90
Query: 83 VQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARL-PTLEDLEAYA 141
+L + K LN TF+++LR+ L GG +P+ P+++ L+++A
Sbjct: 91 SRLHILPNSAAK---LALNFTFKTSLRQALTGGGTSGSFGVPAEKDEHSGPSIKTLDSFA 147
Query: 142 IGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDTN 200
+ +WE L ++SS ++P S ++ + QR L P ++T SGFQFLL +
Sbjct: 148 LERWETILHYMVSSGTGQQPAKPSQGVLFLLQRSSLMSSFHGGPLQITSSGFQFLLHSPH 207
Query: 201 AQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLG 260
AQLW ++ +Y+ +QER ++ +++ FL LS G Y LS Q +++D D G
Sbjct: 208 AQLWDLLLQYLHLAQERQMDLVEVLGFLFMLSTMELGREYLTENLSTTQGVLLEDLRDYG 267
Query: 261 LVKLQQGRKENWFIPTKLATNLSMS-------LTDSSARKEGFIVVETNFRMYAYSTSKL 313
L+ Q K F PT+L T L+ S ++ SS +GFIV+ETN+R+YAY+ + L
Sbjct: 268 LI-WQSKPKSRRFSPTRLGTTLTSSSPPLPTTISASSGPLDGFIVLETNYRIYAYTDNPL 326
Query: 314 HCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSV 373
+L LF ++Y+ PNL+VG IT++S+ A NGITA+QIIS+L +AHP++ P +
Sbjct: 327 QTAVLNLFVSLKYRFPNLVVGMITRDSVRRALMNGITADQIISYLITHAHPQMRKNNPLL 386
Query: 374 PENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
P V DQIRLWE + NR++ + + Y F S+
Sbjct: 387 PVTVQDQIRLWELEKNRLKSSEGYLYTAFTSQA 419
>gi|170039509|ref|XP_001847575.1| TFIIH basal transcription factor complex p52 subunit [Culex
quinquefasciatus]
gi|167863052|gb|EDS26435.1| TFIIH basal transcription factor complex p52 subunit [Culex
quinquefasciatus]
Length = 492
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 137/418 (32%), Positives = 234/418 (55%), Gaps = 22/418 (5%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN + + S L++LY P IC A+ R LP +A+++V+++ +++ IP ++ W
Sbjct: 29 CKNLEEYLRSRPPDALEKLYNYPAICLAVYRELPEIARQFVIRILFVEQPIPHAVVSSWG 88
Query: 68 LPDGFTKHRVAIDR-LVQLRLFSEEKKKET--TYRLNSTFQSNLRKHLIYGGALPREPMP 124
+ K A+ + L +L ++ + L STF+ NL+ L+ GG
Sbjct: 89 -SQIYAKENTAVSKVLTELGVWRSTGLTGGLEGWELCSTFKKNLKIALLGGGRPWSMSNA 147
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLIS--STQAERPTNFSSSMMKVFQRGLLSRRDK 182
+ +E L+ YA+ +W C L ++ S++ S +++ L +RD+
Sbjct: 148 LDPDQKSRDIEFLDGYAMSRWRCVLHYMVGAGSSKGMEGEGISPDAVRILLHANLMKRDE 207
Query: 183 E--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
+P +T GFQFLL+DT AQ+W+ + +Y+ + RG++ A+ +S L +LSF G Y
Sbjct: 208 TDGSPVITRQGFQFLLLDTQAQVWHFMLQYLDTCEARGLDLAECLSMLFQLSFSTLGRDY 267
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS-------MSLTDSSARK 293
+ LS+ + ++ + GLV Q+ RKE F PT+LA N++ + S+A+
Sbjct: 268 SSEGLSQGLLTFLQHLREFGLV-YQRKRKEGRFYPTRLAHNITSRNAVPTIQEDGSAAQD 326
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
+G+IVVETN+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ AF GITA+Q
Sbjct: 327 KGYIVVETNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLTRDSVRQAFRGGITADQ 386
Query: 354 IISFLQQNAHP------RVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
IIS+L+Q+AHP + + ++P V DQI+LWE++ NR T Y++F S+
Sbjct: 387 IISYLEQHAHPTMHNVEQTINTKSALPPTVVDQIKLWENERNRFTYTEGVVYNQFLSQ 444
>gi|340371925|ref|XP_003384495.1| PREDICTED: general transcription factor IIH subunit 4-like
[Amphimedon queenslandica]
Length = 446
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 138/401 (34%), Positives = 221/401 (55%), Gaps = 13/401 (3%)
Query: 10 NFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP 69
+ + + SL + L RLY++P C A+ RSL L + Y+M++ +I+ +P + W LP
Sbjct: 6 DLLTYLQSLPSAVLTRLYQHPPSCLAVYRSLSELGQLYIMRVLFIERPLPLADINSWALP 65
Query: 70 DGFTKHRVAIDRLVQLRLFSE---EKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSG 126
G H + L +L +F ++ Y+LN ++Q+NLR + GG L P G
Sbjct: 66 -GNNLHAETLSLLNKLHIFEPVLIRTGQQQGYQLNQSYQTNLRILICGGGQLWAVPGHRG 124
Query: 127 ITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQ-RGLLSRRDKEAP 185
+ L YA +WE L ++ ++ S +++V Q GL+S+ P
Sbjct: 125 DDKHSRDIPYLNDYADKKWENILNFMVGGSE-----KLDSEIVQVLQYAGLMSQASSRHP 179
Query: 186 R-LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+T GFQFLLMDT +Q+W + +Y++ +Q+RG++ D + F+ +LSF G+ Y+
Sbjct: 180 SFITTLGFQFLLMDTQSQIWQFILQYLNTAQDRGMDLIDCLKFIFQLSFSTLGKDYSTKG 239
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT-DSSARKEGFIVVETNF 303
L+E Q + + GLV ++ + ++ PTKL L+ T + + GFI+VETN+
Sbjct: 240 LTESQLTFMYHLCQFGLVYQRKSSSKRYY-PTKLVIQLTAGETIGMNTSQAGFIIVETNY 298
Query: 304 RMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 363
R+ AY+ SKLH L LF +++Y+ PN+ VG IT+ES+ A +GI A+QIISFL Q+AH
Sbjct: 299 RVIAYTDSKLHIATLALFCQMQYRFPNVAVGTITRESIQQALVHGIKADQIISFLTQHAH 358
Query: 364 PRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+ + +P V DQI+LWE + NR+ Y EF S
Sbjct: 359 HNMLSKAHILPPTVTDQIKLWEIERNRLSFQEGILYSEFLS 399
>gi|221120480|ref|XP_002159945.1| PREDICTED: general transcription factor IIH subunit 4-like [Hydra
magnipapillata]
Length = 458
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 135/396 (34%), Positives = 226/396 (57%), Gaps = 11/396 (2%)
Query: 17 SLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHR 76
+L+ L+ LY +PF C A+ R LP LAK Y+M+ + + ++ + W + H
Sbjct: 21 TLSPSSLNALYNDPFTCLAVFRGLPELAKHYIMRTLFTNQSLSDAFVLSWCKKEWAKDHI 80
Query: 77 VAIDRLVQLRLFS--EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTL 134
A+++L L ++ E Y N TFQ+NLR L GG + P+ + +
Sbjct: 81 EAVNKLKGLHIWVSFEIGTPMLRYEFNITFQTNLRIGLCGGGPVQATPLKAVEEKHARDV 140
Query: 135 EDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVF-QRGLLSRRDKE-APRLTESGF 192
L+ Y+ +WEC +L ++ +Q S + +V GLL +E A +T SGF
Sbjct: 141 AFLDNYSKERWEC-ILYYMTGSQVTGNAGVSQDVARVLINAGLLIFDHQEQATCITSSGF 199
Query: 193 QFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSM 252
QFLL+DT++Q+WY + +++++ + + Q +SFL + SF V G+ Y +N L+E Q S
Sbjct: 200 QFLLLDTSSQVWYFMVQHLNSMEPEILVQC--LSFLFQTSFSVLGKDYPVNDLTEAQFSF 257
Query: 253 IKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT---DSSARKEGFIVVETNFRMYAYS 309
++ ++GL Q+ RK F PT+LA NL ++T DSS++++GF+VVETN+R+YAY+
Sbjct: 258 LQLLREIGLA-FQRKRKSKRFYPTRLAINLGSAVTGNSDSSSQQQGFLVVETNYRVYAYT 316
Query: 310 TSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 369
S LH +L LF+ ++ + P+ V +++ES+ A GI+AEQII FL+ AH ++
Sbjct: 317 DSVLHIALLSLFTDIKARFPSFTVALLSRESVQQALACGISAEQIIDFLKTRAHSQMTTS 376
Query: 370 MPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
P + + DQI+LWE + +R+ + Y++F S+
Sbjct: 377 SPIIASTITDQIKLWEMERDRLRYSQGVLYNQFLSQ 412
>gi|395533817|ref|XP_003768949.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Sarcophilus harrisii]
Length = 1638
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 143/442 (32%), Positives = 233/442 (52%), Gaps = 44/442 (9%)
Query: 5 KIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMME 64
++ +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 47 QLQCRNLQEFLRGLSPVVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVA 106
Query: 65 EWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPRE 121
WV + + L LR++ + + LN F+ NLR L+ GG A +
Sbjct: 107 LWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGFILNPVFRQNLRVALLGGGKAWSDD 166
Query: 122 PMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRR 180
G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 167 TSQLGPDKHARDVPSLDKYAEERWEVILHFMVGSPSAAVSQDLAQLLS---QAGLMKSSE 223
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+ Y
Sbjct: 224 PGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDY 283
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS--SARKEGFIV 298
++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +T + +A + GFIV
Sbjct: 284 SVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVTGAGGTAHQPGFIV 342
Query: 299 VETNFRMYAYST----------------------------------SKLHCEILRLFSKV 324
VETN+R+YAY+ S+L ++ LFS++
Sbjct: 343 VETNYRLYAYTGEVGGIKMRIEIRGGGIGGELNLLWSSIVFSLETESELQIALIALFSEM 402
Query: 325 EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLW 384
Y+ PN++V +T+ES+ A NGITA+QII F++ AHP + + P +P + DQIRLW
Sbjct: 403 LYRFPNMVVAQVTRESVQQAIANGITAQQIIHFMKTRAHPVMLKQTPVLPPTITDQIRLW 462
Query: 385 ESDLNRVEMTPAHYYDEFPSRV 406
E + +R+ + Y++F S+V
Sbjct: 463 ELERDRLRFSEGVLYNQFLSQV 484
>gi|409048320|gb|EKM57798.1| hypothetical protein PHACADRAFT_171015 [Phanerochaete carnosa
HHB-10118-sp]
Length = 472
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 144/411 (35%), Positives = 233/411 (56%), Gaps = 22/411 (5%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
+ + S + L RLY+ P C +I R L PL ++ VM + +++ +IPA M WV +
Sbjct: 22 LLAFLQSQSQNTLTRLYQRPSSCLSIFRLLGPLERQIVMNLLWLESSIPASTMATWVPHE 81
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITAR 130
+ ++ L +L + + K LN TF+++LR +I GG+ +P+ +
Sbjct: 82 VRKTYENSLSILSRLHILPQSAVK---LALNPTFKTSLRHAIIGGGSSGSFGVPATRDEK 138
Query: 131 L--PTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRDKEAPRL 187
P++E L+AYA+ +WE L ++SS A RP S ++ + QR GL++ + ++
Sbjct: 139 HSSPSIEVLDAYALERWETILHYMVSSGIATRP---SQGVLFLLQRSGLMASIHGGSLQI 195
Query: 188 TESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSE 247
T GFQFLL +AQLW ++ +Y+ +ER ++ +++SFL LS G Y++ LS
Sbjct: 196 TSLGFQFLLHSPHAQLWELLLQYLHMVEERQMDLVEVLSFLFMLSTMELGREYSVEHLSP 255
Query: 248 IQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSL--------TDSSARKEGFIVV 299
Q +M++D D G++ Q+ F PT+LAT L+ S +SA +GFI++
Sbjct: 256 TQTAMLEDLRDYGII-WQRRATSKRFCPTRLATTLTSSSPPLPAAGGVSASAHGQGFIIL 314
Query: 300 ETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAE----QII 355
ETN+R+YAY+ + L +L LF ++ + PNL+VGAIT+ES+ A NGITA+ QII
Sbjct: 315 ETNYRLYAYTDNPLQIAVLNLFVTLKSRYPNLVVGAITRESVKKALTNGITADQASRQII 374
Query: 356 SFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
S+L +AHP++ P +P V DQIRLWE + NR++ + Y F S+
Sbjct: 375 SYLTTHAHPQMRKNKPLLPVTVQDQIRLWELEKNRMKSQEGYLYTAFASQA 425
>gi|346464903|gb|AEO32296.1| hypothetical protein [Amblyomma maculatum]
Length = 436
Score = 235 bits (599), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 130/408 (31%), Positives = 228/408 (55%), Gaps = 17/408 (4%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
++ + +L+A LD+LY +P C A+ R LP ++++Y+M++ +++ +P ++ W
Sbjct: 11 CRDLHAYLKTLSAATLDKLYTHPATCLAVFRELPIISRQYIMRLLFVEQPVPQAVISSWN 70
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALP---REP 122
H A++ L L ++++ + L++ F+ N++ L+ GG P P
Sbjct: 71 EQKYVKDHLEALEALTALHIWTDASLPGGLPGWSLSAVFRKNIQIALL-GGGKPWAVYNP 129
Query: 123 MPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVF-QRGLLSRRD 181
+ R T L+ YA+ +WEC L ++ E S+ +++ GL+ +
Sbjct: 130 LEKDKHGRDATF--LDQYAVERWECVLHFMVGCHTTE---GISADAVRILLHAGLMKSEE 184
Query: 182 KE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E +P +T GFQFLLMDT +Q+W+ V +Y+ + RG+N + ++FL +LSF G+
Sbjct: 185 GEGSSPLITMEGFQFLLMDTPSQVWHFVLQYLDTIESRGLNLVECLTFLFQLSFLTLGKD 244
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR--KEGFI 297
Y+ +SE ++ + GLV Q+ R+ F PT+LA NL+ L +++ R + G+I
Sbjct: 245 YSTEGMSESLLVFLQHLREFGLV-YQRKRRSGRFYPTRLAINLASGLRETNLRSYEAGYI 303
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
+VETN+R+YAY+ S+L +L LF ++ Y+ PNL+V +T+ES+ A +GIT+ QII F
Sbjct: 304 MVETNYRVYAYTNSQLQVALLALFCELLYRFPNLVVARLTRESVRQALRSGITSNQIIKF 363
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
L+ AHP + P +P + DQ+RLWE + +R Y +F S+
Sbjct: 364 LRMYAHPEALKQTPVIPATIMDQLRLWELERDRFVYREGVLYSQFISQ 411
>gi|195126587|ref|XP_002007752.1| GI12214 [Drosophila mojavensis]
gi|193919361|gb|EDW18228.1| GI12214 [Drosophila mojavensis]
Length = 492
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 138/431 (32%), Positives = 233/431 (54%), Gaps = 47/431 (10%)
Query: 8 AKNFMDMVASL-TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEW 66
KNF + + + T L++LY P IC A+ R LP +A+++V+++ ++D +P ++ W
Sbjct: 28 CKNFQEYLRTRQTPESLEKLYSYPPICLAVFRELPEIARQFVIRILFVDQPVPQAVVSSW 87
Query: 67 VLPDGFTKHRVAIDRLVQLRLFSEEKKKETT--------YRLNSTFQSNLRKHLIYGG-- 116
RVA ++L S T + L+ TF+ ++R+ L+ GG
Sbjct: 88 ------GAQRVAKEQLEATNCLSALSVWRVTAIPGGLAAWELSPTFKKSVRQVLLGGGKP 141
Query: 117 -----ALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISS---TQAERPTNFSSSM 168
+L R+ P + L+AYA+ +W C L ++ + + E ++
Sbjct: 142 WAITNSLERDSKPRDVAF-------LDAYAMSRWRCVLHYMVGTGNRSNTETEAISPDAV 194
Query: 169 MKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFL 228
+ L+ R ++E +T GFQFLL+DT AQ+W+ + +Y+ QERG+ A+ +S L
Sbjct: 195 RILLHANLMKRDEREGITITRQGFQFLLLDTRAQVWHFMLQYLDTCQERGLVLAECLSML 254
Query: 229 LELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL------ 282
+LSF G Y+ +++ ++ + GLV Q+ RKE+ F PT+LA N+
Sbjct: 255 FQLSFSTLGRDYSSEGMNKQMLMFLQHLREFGLV-FQRKRKEHRFYPTRLALNVTNKEAA 313
Query: 283 --SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKES 340
S+S+ + ++ G+IVVETN+R+YAY+ S+L +L LF+++ Y+ PNL+VG +T++S
Sbjct: 314 EASISVDEERMQERGYIVVETNYRVYAYTDSQLQVAVLGLFTELLYRFPNLVVGVLTRDS 373
Query: 341 LYNAFENGITAEQIISFLQQNAHPRVA------DRMPSVPENVCDQIRLWESDLNRVEMT 394
+ A GITAEQI+S+L+Q AHP + +P V DQI+LWE + NR T
Sbjct: 374 VRQALRGGITAEQIVSYLEQYAHPNMKLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYT 433
Query: 395 PAHYYDEFPSR 405
Y++F S+
Sbjct: 434 EGVVYNQFLSQ 444
>gi|357618296|gb|EHJ71332.1| putative TFIIH basal transcription factor complex p52 subunit
[Danaus plexippus]
Length = 603
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 136/424 (32%), Positives = 239/424 (56%), Gaps = 27/424 (6%)
Query: 2 PLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAK 61
P + K+ + + S + + L+ LY P IC A+ R LP LA+ +V+++ +++ +P
Sbjct: 142 PSPTLQCKDLHEYLKSRSPQFLETLYNYPTICLAVYRELPELARHFVIRLLFVEQPVPQA 201
Query: 62 MMEEWVLPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGG--- 116
++ WV + A + L +L ++ E + L+ +F+ NL+ L+ GG
Sbjct: 202 VVASWVTQTHAKEQHKACEALSELSVWQEAPIPGGLPGWMLSQSFKKNLKVALLGGGRPW 261
Query: 117 ALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGL 176
++ P G + + L+AYA+ +WEC L ++ STQ E S+ +++ +
Sbjct: 262 SMSSSLEPDG---KARDVSFLDAYALERWECVLHYMVGSTQTE---GISADAVRILLQAG 315
Query: 177 LSRRDKE--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFH 234
L RD E +T +GFQFLL+ T Q+W ++ Y+ +++R ++ A+ ++FL +LSF
Sbjct: 316 LMNRDAEDGTAVITRAGFQFLLLSTAKQVWLFLQHYLHTAEKRSLSAAECLAFLYQLSFS 375
Query: 235 VAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS----- 289
G+ Y+ +S ++ + GLV Q+ RK F PT+LA N++ + D
Sbjct: 376 TLGKDYSTEGMSNNMLVFLQHLREFGLV-YQRKRKAGRFYPTRLALNIT-CVKDGVAPLQ 433
Query: 290 SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
+A G+I+ ETN+R+YAY+TS L +L LF+++ Y+ PN++VG +T+ES+ A GI
Sbjct: 434 TAASSGYIIAETNYRVYAYTTSALQVALLGLFTELVYRFPNVVVGVLTRESVRAALRGGI 493
Query: 350 TAEQIISFLQQNAHPRV--ADR-----MPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
+A+QII++L+Q++HP++ +D+ S+P V DQIRLWES+ NR T Y++F
Sbjct: 494 SAQQIITYLEQHSHPQMLKSDQGGIRSSSSLPPTVLDQIRLWESERNRFTYTEGVVYNQF 553
Query: 403 PSRV 406
S+
Sbjct: 554 LSQA 557
>gi|348683175|gb|EGZ22990.1| hypothetical protein PHYSODRAFT_479459 [Phytophthora sojae]
Length = 490
Score = 234 bits (596), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 147/451 (32%), Positives = 241/451 (53%), Gaps = 71/451 (15%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGF 72
D + +L L+RLY++P+ C+AI ++LPPLA+++VM++ + A+P ++E+WV+P+
Sbjct: 5 DFLETLPPATLERLYQDPWACQAIFQALPPLAQQFVMRLLPSNAAVPRSLLEQWVVPEPG 64
Query: 73 TKHRV------AIDRLVQLRLFSEEKKKETTYRLNSTFQSNL---RKHLIY-----GGA- 117
+R+ A+ +L L++F + + YR + TFQ L + L Y GG+
Sbjct: 65 DANRMPPQFHAALHKLEGLKVFVHQ---DGAYRPHPTFQKQLMVEKWLLTYALSNLGGSP 121
Query: 118 -------LPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMK 170
LP++P + A DLE YA +W+ L ++ ST + P S++
Sbjct: 122 WERGRLQLPKDPENTFAAA------DLERYARARWDSVLHYMVGSTAVQEPPQ---SVVD 172
Query: 171 VFQRGLL---SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERG-INQADLIS 226
+ R L S D A +T++G++F+L D + Q+W + EYI G + Q D++
Sbjct: 173 ILLRTRLLQASGADSRALHITDTGYEFMLKDIHVQMWIFMLEYIRTLDNTGTLKQEDILQ 232
Query: 227 FLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL---- 282
FL ++S+ GE Y + L+E Q+ ++ DF D GL+ ++ + F T LA NL
Sbjct: 233 FLFQISYCQTGEYYAVADLTETQRLLLGDFIDFGLL-FRKRPNSDRFYTTSLAVNLIFGG 291
Query: 283 -------SMSLTDSSA---------------------RKEGFIVVETNFRMYAYSTSKLH 314
+SLT S A + +VVETNF++YAY+TS LH
Sbjct: 292 STGQKRSHVSLTSSFAGVRAGLKSQVADPRQAPTADHSAQLLVVVETNFKIYAYTTSTLH 351
Query: 315 CEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVP 374
+L +F + +LPNL +G IT+ESL +A +GI+A+QI FL ++AHP++ P +P
Sbjct: 352 VAMLSVFVDIVARLPNLAIGFITRESLRSALIHGISAQQIYDFLMKHAHPKMRRNSPVIP 411
Query: 375 ENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
EN+ DQI LWE + NRV+ +D F ++
Sbjct: 412 ENIADQIYLWERERNRVQFMEGILFDGFNTK 442
>gi|156378083|ref|XP_001630974.1| predicted protein [Nematostella vectensis]
gi|156218005|gb|EDO38911.1| predicted protein [Nematostella vectensis]
Length = 471
Score = 233 bits (595), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 134/410 (32%), Positives = 229/410 (55%), Gaps = 16/410 (3%)
Query: 9 KNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVL 68
K+ + ++ L LDRL+ +P C + R LP LAK +VM+ + D +P ++ WV
Sbjct: 20 KDLYEYLSGLPVGILDRLFCHPATCLTVFRELPELAKHFVMRTLFADQPVPESIVSTWVK 79
Query: 69 PDGFTKHRVAIDRLVQLRLFSE-EKKKETTYRLNSTFQSNLRKHLIYGGAL---PREPMP 124
+ + +++ L QLR++ E Y +N+TF++N++ L GG + +
Sbjct: 80 SAYYKYNIGSLNTLKQLRIWREVPSGVHKRYEMNATFRTNMKAALCGGGKSWMGSTQHLG 139
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLISSTQ-AERPTNFSSSMMKVFQRGLLSRRDKE 183
R P L+ YAI +WE L + ST+ A+ S ++KV L + +
Sbjct: 140 PDKHTREPDF--LDKYAIERWESVLHFMTGSTEMADNAGGVSQDVVKVLVLSGLMKCESP 197
Query: 184 A--PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYN 241
P ++ +GFQFLL+D +Q+WY + + + + RG++ + +S L +LSF G+ Y
Sbjct: 198 GSNPIISPAGFQFLLLDRPSQVWYFMLQCLETVEARGMDLVECLSLLFQLSFSSPGKDYP 257
Query: 242 LNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS-----MSLTDSSARKEGF 296
+ L++ Q ++ ++GLV Q+ RK + PTKL+ NL+ ++ TDS + GF
Sbjct: 258 TDGLTDSQMKFLQQLREIGLV-FQRKRKSRRYYPTKLSVNLTAAGKGINTTDSQI-EAGF 315
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
I++ETN+R+YAY+ S L ++ LF ++ + PNL V ++T+ES A +GI+AEQI++
Sbjct: 316 IIIETNYRVYAYTESCLQVSLIGLFCEILCRFPNLCVASLTRESCQQALASGISAEQILN 375
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
FLQ AHP + R P +P + DQ+RLWE + +R++ T Y++F S+
Sbjct: 376 FLQTRAHPEMLKRTPIIPSTISDQVRLWEMERSRMKFTEGVLYNQFLSQA 425
>gi|395332344|gb|EJF64723.1| transcription factor Tfb2 [Dichomitus squalens LYAD-421 SS1]
Length = 506
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 139/420 (33%), Positives = 229/420 (54%), Gaps = 37/420 (8%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
++RLY+ P C +I R L PL ++ VM + +++ IP M WV +G ++ A+ L
Sbjct: 41 MNRLYQKPSSCLSIFRLLQPLERQIVMNLLWLETPIPGATMAAWVTREGKRPYQEALSTL 100
Query: 83 VQLRLFSEEKKKE-----TTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPT--LE 135
L + ++ + + LN TF+++LR+ + GG + + + P +E
Sbjct: 101 QSLHILPQQSTRPGPPQPVKHNLNPTFKASLRQAITGGGTSGSFGILAENDDKHPPPDIE 160
Query: 136 DLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRDKEAPRLTESGFQF 194
L+A+A+ +WE L ++SS + P S ++ + QR GL++ ++T +GFQF
Sbjct: 161 ALDAFALERWETILYYMVSSGTGQFPAKPSQGVLYLLQRSGLMATVHGATLQITSAGFQF 220
Query: 195 LLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG----------------- 237
LL + QLW ++ +Y+ ++ER ++ +++SFLL LS G
Sbjct: 221 LLHPPHEQLWELLLQYLQMAEERQMDLVEVLSFLLMLSTTELGKVGVHTIQFIAQVTQAI 280
Query: 238 ---EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSL-------- 286
++Y+ LS QK+M++D D GL++ Q+ F PT+LAT L+ S
Sbjct: 281 PHAQSYSTENLSATQKAMLEDLRDYGLIR-QRTATSRKFSPTRLATTLTSSSPPLPTSAG 339
Query: 287 TDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFE 346
T + +GFIV+ETN+R+YAY+ + L +L LF ++Y+ PNL+VG IT++S+ A
Sbjct: 340 TGDGSHVQGFIVLETNYRLYAYTDNPLQIAVLNLFVTLKYRFPNLVVGQITRDSVKKALA 399
Query: 347 NGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
NGITA+QIIS+L +AHP++ P +P V DQIRLWE + NRV+ + Y F S
Sbjct: 400 NGITADQIISYLITHAHPQMRKNKPLLPVTVQDQIRLWELERNRVKSQEGYLYTAFASHA 459
>gi|195999048|ref|XP_002109392.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
gi|190587516|gb|EDV27558.1| hypothetical protein TRIADDRAFT_20428 [Trichoplax adhaerens]
Length = 452
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 133/397 (33%), Positives = 224/397 (56%), Gaps = 15/397 (3%)
Query: 15 VASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTK 74
+ SL A LD+LY +P C A R L LAK YVM+ + AIP + WV +
Sbjct: 16 LKSLPAIILDKLYNHPAACLATFRELSQLAKHYVMRTLFTIQAIPTSTISAWVNSQHQKE 75
Query: 75 HRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP 132
H+ A+++L ++R++ +LN +F++NL+ L GG P S
Sbjct: 76 HQAALNQLTEIRIWQSHTLPGGMQGIKLNESFRANLKTALCGGGESWYGPQLSKADKYQF 135
Query: 133 TLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGF 192
T E L+ + +WEC L L+ + + + S++ GL+S+ +T +GF
Sbjct: 136 TSEALDLKSKERWECVLHYLVGAGEDLALSGEIQSLLN--NCGLVSKNG-----ITAAGF 188
Query: 193 QFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSM 252
QFLL+D +Q+WYI+ +Y+ +++ G++ +++SFL +LS+ G+ Y+ L++ Q +
Sbjct: 189 QFLLLDRPSQVWYILLQYLDSAESLGMDLVEILSFLFQLSYSTFGQNYSTEGLTQTQLTC 248
Query: 253 IKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTD---SSARKEGFIVVETNFRMYAYS 309
++ ++GLV +Q+ RK+ F PT+LA NL+ ++ G+I+VETN+R+YAY+
Sbjct: 249 LQHMREIGLV-VQRKRKDMKFYPTQLAINLASGAKQEELDHSKSSGYIIVETNYRLYAYT 307
Query: 310 TSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 369
S L ++ LF ++ Y+LP L VG IT+ES+ AF NGITA +II+F++ +AHP +
Sbjct: 308 ESPLDIALVALFCEMMYRLPGLCVGLITRESVQQAFTNGITANKIINFIRTHAHPEAKKK 367
Query: 370 M--PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+ P VP + DQ+ LWE + R+ + Y++ S
Sbjct: 368 VTTPIVPSTIIDQLYLWELERGRLSCSDGVLYNQILS 404
>gi|431907067|gb|ELK11185.1| General transcription factor IIH subunit 4 [Pteropus alecto]
Length = 447
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 139/409 (33%), Positives = 221/409 (54%), Gaps = 26/409 (6%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M ++D +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLDQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSSLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKST 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFI
Sbjct: 250 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFI 308
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+Q
Sbjct: 309 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQT--- 365
Query: 358 LQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 366 -------------PVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 401
>gi|195162678|ref|XP_002022181.1| GL25529 [Drosophila persimilis]
gi|198464237|ref|XP_001353141.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
gi|194104142|gb|EDW26185.1| GL25529 [Drosophila persimilis]
gi|198149631|gb|EAL30642.2| GA20571 [Drosophila pseudoobscura pseudoobscura]
Length = 494
Score = 231 bits (588), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 141/436 (32%), Positives = 243/436 (55%), Gaps = 39/436 (8%)
Query: 1 MPLVK----IIAKNFMDMVASL-TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYID 55
+PLV+ + K+F + + + TA L++LY P IC A+ R LP LA+++++++ ++D
Sbjct: 19 VPLVQGPANLECKDFQEYLRTRQTAETLEKLYNYPPICLAVFRELPELARQFIIRILFVD 78
Query: 56 GAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLI 113
+P ++ W + A L L ++ + + + L+ TF+ ++R+ L+
Sbjct: 79 QPVPQAVVTSWGAQRFAKEQTEATSCLSDLNVWRVTAIPGGLSAWELSPTFKKSVRQVLL 138
Query: 114 YGGALPREPMPSGIT----ARLPTLEDLEAYAIGQWECFLLQLISS-----TQAERPTNF 164
GG +P P T ++ + L++YA+ +W C L ++ + T AE
Sbjct: 139 GGG----KPWPMTNTLEKDSKPRDIAFLDSYAMSRWRCVLHYMVGTGNRNGTDAE---AI 191
Query: 165 SSSMMKVFQRGLLSRRD-KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQAD 223
S +++ L +RD ++ +T GFQFLL+DT AQ+W+ + +Y+ +ERG++ +
Sbjct: 192 SPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGVSLPE 251
Query: 224 LISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS 283
+S L +LSF G Y+ ++ S ++ + GLV Q+ RKE F PT+LA N++
Sbjct: 252 CLSMLFQLSFSTLGRDYSSEGMNSQMLSFLQHLREFGLV-FQRKRKEGRFYPTRLALNVT 310
Query: 284 -------MSLTDSSARKE-GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
M+++D A ++ G+IVVETN+R+YAY+ S L +L LF+++ Y+ PNL+VG
Sbjct: 311 SKEAATTMTVSDEEAMQDSGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGV 370
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHPRVA------DRMPSVPENVCDQIRLWESDLN 389
+T++S+ A GITAEQI+S+LQQ AHP + +P V DQI+LWE + N
Sbjct: 371 LTRDSVRQALRGGITAEQIVSYLQQYAHPNMKLVESAIQSKSCLPPTVVDQIKLWEMERN 430
Query: 390 RVEMTPAHYYDEFPSR 405
R T Y++F S+
Sbjct: 431 RFTYTEGVVYNQFLSQ 446
>gi|449540295|gb|EMD31288.1| hypothetical protein CERSUDRAFT_119839 [Ceriporiopsis subvermispora
B]
Length = 412
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 133/369 (36%), Positives = 214/369 (57%), Gaps = 15/369 (4%)
Query: 49 MQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNL 108
M + +++ AIP M WV +G T + ++ L +L + K LN TF+S+L
Sbjct: 1 MNLLWLESAIPVSTMAAWVTREGRTTYHASLSTLARLHILPTSATK---LALNPTFKSSL 57
Query: 109 RKHLIYGGALPRE--PMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSS 166
R+ + GG+ P P P++E L++YA+ +WE L ++SS + PT S
Sbjct: 58 RQAITGGGSSGSFGVPAPYDEINPGPSIETLDSYALERWETILYYMVSSGNGQYPTKPSD 117
Query: 167 SMMKVFQR-GLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLI 225
+++ + +R GL++ A ++T SGFQFLL + QLW ++ +Y+ ++ER ++ D++
Sbjct: 118 AVLYLLKRSGLMTSVRGAALQITSSGFQFLLHPPHEQLWELLLQYLHLTEERQMDLVDVL 177
Query: 226 SFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS 285
FLL LS G Y+ + LS QK+M++D D GL+ Q+ F PT+LAT L+ S
Sbjct: 178 GFLLMLSTMELGREYSTDGLSPTQKAMLEDLRDYGLL-WQRSATSQRFSPTRLATTLTAS 236
Query: 286 --------LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAIT 337
+ ++ +GFIV+ETN+R+YAY+ + L +L LF ++ + PNL++GA+T
Sbjct: 237 SNPLPTSSSASADSQSQGFIVLETNYRVYAYTDNPLQIAVLNLFVTMKSRFPNLVIGAVT 296
Query: 338 KESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAH 397
+ES+ A NGITA+QIIS+L +AHP++ P +P V DQIRLWE + NRV+ +
Sbjct: 297 RESVKKALANGITADQIISYLTAHAHPQMRRYKPLLPVTVQDQIRLWELEKNRVKSQEGY 356
Query: 398 YYDEFPSRV 406
Y F S+
Sbjct: 357 LYTAFASQA 365
>gi|427785675|gb|JAA58289.1| Putative rna polymer [Rhipicephalus pulchellus]
Length = 457
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 126/411 (30%), Positives = 224/411 (54%), Gaps = 23/411 (5%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
++ + +L+ LD+LY +P C A+ R LP +++ Y+M++ +++ +P ++ W
Sbjct: 10 CRDLHAYLKTLSPATLDKLYTHPATCLAVFRELPKISRHYIMRLLFVEQPVPQAVVSSWN 69
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
H A++ L L ++++ + L++ F+ N++ L+ GG P
Sbjct: 70 EQKYVKDHLEALEALTALHIWTDASLPGGLPGWSLSAVFRKNIQIALLGGGK------PW 123
Query: 126 GITARLPTLED------LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
+ + L + L+ YA +WEC L ++ E S+ +++ L +
Sbjct: 124 AVYSALEKDKHGRDAAFLDQYAAERWECVLHFMVGCHTTE---GISADAVRILLHAGLMK 180
Query: 180 RDK---EAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVA 236
D+ +P +T GFQFLLMDT +Q+W+ V +Y+ + RG+N + ++FL +LSF
Sbjct: 181 SDEGEGSSPLITMEGFQFLLMDTASQVWHFVLQYLDTLESRGLNLVECLTFLFQLSFLTL 240
Query: 237 GEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR--KE 294
G+ Y+ +SE ++ + GLV Q+ R+ F PT+LA NL+ L +++ R +
Sbjct: 241 GKDYSTEGMSESLLVFLQHLREFGLV-YQRKRRSGRFYPTRLAINLASGLKETTLRSFEA 299
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
G+I+VETN+R+YAY+ S+L +L LF ++ Y+ PNL+V +T+ES+ A +GIT+ QI
Sbjct: 300 GYIMVETNYRVYAYTNSQLQVALLALFCELLYRFPNLVVARLTRESVRQALRSGITSNQI 359
Query: 355 ISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
I FL+ AHP + P +P + DQ+RLWE + +R Y +F S+
Sbjct: 360 IKFLRMYAHPEALKQTPVIPATIMDQLRLWEMERDRFVFREGVLYSQFISQ 410
>gi|198428509|ref|XP_002131018.1| PREDICTED: similar to General transcription factor II H,
polypeptide 4 [Ciona intestinalis]
Length = 463
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 140/414 (33%), Positives = 224/414 (54%), Gaps = 24/414 (5%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
VK+ + + SL LD LYE+P C A+ R LP LAK Y+M++ +++ I +
Sbjct: 7 VKLKRLGLHEYLCSLPPAILDSLYEHPATCMAVFRELPELAKYYIMRILFVEQPISKAAV 66
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSE---EKKKETTYRLNSTFQSNLRKHLIYGGALPR 120
WV + H A+ + LR++ E + T + ++S F+ NL+K L+ GG
Sbjct: 67 SAWVKVNAKQDHNEAVKSMCSLRVWMESNLQGSASTAFIMSSIFRRNLQKALVGGG---- 122
Query: 121 EPMPS----GITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGL 176
EP S G +E L+ YA +WE L L+ S E + S + + +
Sbjct: 123 EPWSSTAHLGPDKHGKDIESLDKYASERWEMLLHYLVGS---ETNSTISQDIKDLINQAG 179
Query: 177 LSRRDKEAPR----LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELS 232
L ++ E R +T GFQFLL+D +Q+WY V EY+ +ERG+N ++ F+ ELS
Sbjct: 180 LMKQGSEGGRSYSIITAKGFQFLLLDNASQVWYFVLEYLDWVRERGMNLVPILRFIFELS 239
Query: 233 FHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS----MSLTD 288
F G+ + + ++ F ++GLV +Q+ RK F PT LA NL+ S+T+
Sbjct: 240 FSSPGKDLPTDGRDDHVLICLQHFREMGLV-MQRKRKSRRFYPTYLAINLANRANASVTN 298
Query: 289 SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENG 348
+ + GFI+ ETNFR+YAY+ S+L IL LF K+ Y+ PN+ V +T++S+ +A NG
Sbjct: 299 VTNSR-GFILAETNFRVYAYTDSELQYSILSLFCKMLYRFPNVCVLQLTRKSIQSAVVNG 357
Query: 349 ITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
ITA+QI+ +++ NAHP + P + + DQ+RLW + +R+ Y++F
Sbjct: 358 ITADQILHYIKANAHPDMLKNDPIIAPTLADQVRLWAMERDRLTYRDGVLYNQF 411
>gi|328873270|gb|EGG21637.1| general transcription factor IIH [Dictyostelium fasciculatum]
Length = 453
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 133/410 (32%), Positives = 231/410 (56%), Gaps = 29/410 (7%)
Query: 15 VASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTK 74
++SL ++ LY + F C A+LRSLPP AK+Y+M++ +D +I + + +W
Sbjct: 7 LSSLPENEIQDLYSDAFTCRAVLRSLPPRAKQYLMRLLVVD-SISMQSVYQWTTQSSIQA 65
Query: 75 HRVAIDRLVQLRLFSEEKKKETTYR-----LNSTFQSNLRKHLIYGGALPREPMPSGITA 129
H+ AI +L+ L++ + + LN F+ N++ L ++ + + +A
Sbjct: 66 HKDAIRKLLDLKILTRVPNPKGVVSQDQLFLNQRFKDNIKDSLTDLDSVVLKSVKDNQSA 125
Query: 130 RLP----TLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP 185
+ T+E+L+ Y+ GQWE L L S + + P S ++ L++++ +
Sbjct: 126 QQHIKPMTVEELDKYSKGQWEKVLYYL--SDEGKAPPELVSDLL---LSSNLTKKNGTSI 180
Query: 186 RLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTL 245
+T GF+F+L D Q+W ++ Y++ + RG + + + FL +LSF G++Y L L
Sbjct: 181 AITSEGFKFMLKDIYTQIWTLIIVYLNTLESRGRPRKEALIFLFKLSFLTLGKSYYLADL 240
Query: 246 SEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT-----------DSSARKE 294
+ Q+ M+ D + GLV ++ K + F PT+LA +L+ T +S + +
Sbjct: 241 NPNQRDMLFDLKEFGLVYVR-SEKSDVFYPTRLAISLATGRTVSLMNDLAQEISTSQKDQ 299
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
G++++ETNFR+YAY+ S L ++ LF K+ Y+LPNL VG +T+ES+ +A +GITA+QI
Sbjct: 300 GYLILETNFRIYAYTQSSLQISLISLFVKMLYRLPNLAVGILTRESVRSALLHGITADQI 359
Query: 355 ISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+ F++QNAHP + + PE V +QIRLWES+ NR+ A +D FP+
Sbjct: 360 VDFVKQNAHPNMV--ISGYPEVVFEQIRLWESERNRITYKKAVLFDSFPN 407
>gi|301098671|ref|XP_002898428.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
gi|262105199|gb|EEY63251.1| general transcription factor IIH subunit, putative [Phytophthora
infestans T30-4]
Length = 483
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 145/444 (32%), Positives = 236/444 (53%), Gaps = 64/444 (14%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP-DG 71
D + +L L+RLY++P+ C+AI ++LP LA+++VM++ + A+P ++E+WV+P G
Sbjct: 5 DFLETLPPATLERLYQDPWACQAIFQALPSLAQQFVMRLLPTNSAVPRDLLEQWVVPIPG 64
Query: 72 FTKH-----RVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLI-YGGA-------- 117
TK A+ +L LR+F ++ + Y+ + TFQ L L GG+
Sbjct: 65 ETKRMPPQFNAALQKLEGLRVFVQQ---DGGYKPHPTFQKQLMYALSNLGGSPWERGRLH 121
Query: 118 LPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL 177
LPR+P + DLE YA +W+ L ++ ST + P S++ + R L
Sbjct: 122 LPRDPENTFAAV------DLERYARARWDLVLHYMVGSTAVQEP---PQSVVDILLRTKL 172
Query: 178 ---SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERG-INQADLISFLLELSF 233
S D A +T++G++F+L D + Q+W + EYI G + Q D++ FL ++S+
Sbjct: 173 LQASGADSRALHITDTGYEFMLKDIHVQMWIFMLEYIRTLDNTGTLKQEDILQFLFQISY 232
Query: 234 HVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL----------- 282
G Y + L+E Q+ ++ DF D GL+ ++ + + F T LA NL
Sbjct: 233 CQTGGYYAVADLTETQRLLLGDFIDFGLL-FRKRQNSDRFYTTSLAVNLIFGGSTGQKRS 291
Query: 283 SMSLTDSSARKEG---------------------FIVVETNFRMYAYSTSKLHCEILRLF 321
+SLT S A +VVETNF++YAY++S LH +L +F
Sbjct: 292 HVSLTSSFAGVRAGMKSQVADPRQAPTVDHGARLLVVVETNFKIYAYTSSTLHVAMLSVF 351
Query: 322 SKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQI 381
+ +LPNL +G IT+ESL +A +GI+A+QI FL ++AHP++ P +PEN+ DQI
Sbjct: 352 VDIVARLPNLAIGFITRESLRSALIHGISAQQIYDFLMKHAHPKMRRNSPVIPENIADQI 411
Query: 382 RLWESDLNRVEMTPAHYYDEFPSR 405
LWE + NRV+ +D F ++
Sbjct: 412 YLWERERNRVQFLEGILFDGFNTK 435
>gi|330918142|ref|XP_003298106.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
gi|311328891|gb|EFQ93800.1| hypothetical protein PTT_08708 [Pyrenophora teres f. teres 0-1]
Length = 482
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 141/410 (34%), Positives = 218/410 (53%), Gaps = 32/410 (7%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LYE P A+ R LP LAK VM M Y+ PA ++ W P + A L
Sbjct: 23 KLYEQPSTVLAVFRCMLPHLAKSIVMAMLYMPSPFPAADLDAWFKPSARKEKERATFTLD 82
Query: 84 QLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPRE---PMPSGITARLPTLEDLE 138
+L + + ++ T + L+ FQ +LR + I G R P + + ++E L+
Sbjct: 83 RLHIITSVRQDNGTLSWTLSPGFQRSLR-NAIEGSGTHRSFGVPATKEESGKRVSVEFLD 141
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSS-----SMMKVFQRGLLSRRDKEAPRLTESGFQ 193
Y+ QWE L L+S N S S + G L R +PR+T+ GF
Sbjct: 142 EYSRSQWEGILYYLVSGAAGLGKDNISRAEVSPSTKTLLNTGDLVRTIHGSPRITKDGFS 201
Query: 194 FLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMI 253
F+L +TNAQ+W ++ Y+ + E G+++ ++++FL L G+ Y+ +TLS Q M+
Sbjct: 202 FVLQETNAQVWSLLIIYLKVTNELGMSETEVLAFLFMLGSLELGQDYSTSTLSPTQLRML 261
Query: 254 KDFADLGLVKLQQGRKENWFIPTKLATNL--------SMSLTD-----------SSARKE 294
D + +GL+ + + F PT+LAT L S S D S+A +
Sbjct: 262 DDLSSMGLI-YRSDKNARTFYPTRLATTLTSDSGSAMSASSNDIAQAGQGNAGPSAAANK 320
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
GFI++ETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A ++GIT+ QI
Sbjct: 321 GFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITSAQI 380
Query: 355 ISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
IS+L AHP++ +P +P V DQIRLWE + RVE TP + EF S
Sbjct: 381 ISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTPGYLMREFSS 430
>gi|195019521|ref|XP_001984999.1| GH16811 [Drosophila grimshawi]
gi|193898481|gb|EDV97347.1| GH16811 [Drosophila grimshawi]
Length = 493
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 139/437 (31%), Positives = 231/437 (52%), Gaps = 40/437 (9%)
Query: 2 PLVK----IIAKNFMDMVASL-TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDG 56
PLV+ + KNF D + + T L++LY P IC A+ R LP +A+++V+++ ++D
Sbjct: 18 PLVQSPENLECKNFQDYLRTRQTPESLEKLYNYPPICLAVFRELPEIARQFVIRILFVDQ 77
Query: 57 AIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIY 114
+P ++ W + A L L ++ + + L+ TF+ ++R+ L+
Sbjct: 78 PVPQAVVSSWGAQRIAKEQMEATSCLTALNVWRVTAIPGGLAAWELSPTFKKSVRQVLLG 137
Query: 115 GG-------ALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTN---F 164
GG L ++ P I L+ YA+ +W C L ++ + T+
Sbjct: 138 GGKPWAMTNTLDKDSKPRDIAF-------LDTYAMARWRCVLHYMVGTGSRNGGTDAEAI 190
Query: 165 SSSMMKVFQRGLLSRRD-KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQAD 223
S +++ L +RD ++ +T GFQFLL+DT AQ+W+ + +Y+ +ERG + +
Sbjct: 191 SPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLETCEERGFSLPE 250
Query: 224 LISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL- 282
+S L +LSF G Y+ +S ++ + GLV Q+ RKE F PT+LA N+
Sbjct: 251 CLSMLFQLSFSTLGRDYSSEGMSHQMLMFLQHLREFGLV-FQRKRKEGRFYPTRLALNVT 309
Query: 283 -------SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
S S D ++ G+IVVETN+R+YAY+ S+L +L LF+++ Y+ PNL+VG
Sbjct: 310 NKDAAQASTSADDERMQERGYIVVETNYRVYAYTDSQLQVAVLGLFTELLYRFPNLVVGV 369
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHPRVA------DRMPSVPENVCDQIRLWESDLN 389
+T++S+ A GITAEQI+S+L+Q AHP + +P V DQI+LWE + N
Sbjct: 370 LTRDSVRQALRGGITAEQIVSYLEQYAHPNMKLVESAIQSKSCLPPTVVDQIKLWELERN 429
Query: 390 RVEMTPAHYYDEFPSRV 406
R T Y++F S+
Sbjct: 430 RFTYTEGVVYNQFLSQT 446
>gi|451847848|gb|EMD61155.1| hypothetical protein COCSADRAFT_97372 [Cochliobolus sativus ND90Pr]
Length = 482
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 141/410 (34%), Positives = 219/410 (53%), Gaps = 32/410 (7%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LYE P A+ R LP LAK VM M Y+ PA ++ W P + A L
Sbjct: 23 KLYEQPSTVLAVFRCMLPHLAKSIVMAMLYMPTPFPAADLDTWFKPTARKEKERATFTLD 82
Query: 84 QLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPRE---PMPSGITARLPTLEDLE 138
+L + + ++ T + LN FQ +LR + I G R P + + ++E L+
Sbjct: 83 RLHIITSARQDNGTLSWTLNPGFQRSLR-NAIEGSGTHRSFGVPATKEESGKRVSIEFLD 141
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSSSMM-----KVFQRGLLSRRDKEAPRLTESGFQ 193
Y+ QWE L L+S + S + + K+ G L R +PR+T+ GF
Sbjct: 142 EYSRSQWEGILYYLVSGAAGLSKDSISRAEVGPGTKKLLHTGDLVRTIHGSPRITKDGFS 201
Query: 194 FLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMI 253
F+L +TNAQ+W ++ Y+ + E G+++ +++SFL L G+ Y+ +TLS Q M+
Sbjct: 202 FVLQETNAQVWSLLIVYLKMTNELGMSETEVLSFLFMLGSLELGQDYSTSTLSATQLQML 261
Query: 254 KDFADLGLVKLQQGRKENWFIPTKLATNLS------MSLTDS-------------SARKE 294
+D + +GLV + R F PT+LAT L+ MS + +A +
Sbjct: 262 EDLSAMGLVYRSE-RNARTFYPTRLATTLTSDSGSAMSASSKDIAQASTSTTGPPTAANK 320
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
GFI++ETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A + GIT+ QI
Sbjct: 321 GFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITSAQI 380
Query: 355 ISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
IS+L AHP++ +P +P V DQIRLWE + RVE T + EF S
Sbjct: 381 ISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTTGYLMREFGS 430
>gi|443688956|gb|ELT91478.1| hypothetical protein CAPTEDRAFT_18660 [Capitella teleta]
Length = 463
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 131/406 (32%), Positives = 221/406 (54%), Gaps = 10/406 (2%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
K + + + L +LY +P C AI R LP L+K YV+++ +++ + ++ WV
Sbjct: 16 CKTLHEYLKGMPQNTLTKLYNHPATCLAIFRELPSLSKHYVLRILFVEQPVSHSVVSSWV 75
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
T+H A+ L L ++ + Y L+ F++NL+ L+ GG+
Sbjct: 76 NSSNQTEHTAAVKSLSDLCVWQDHCLPGGLPGYLLSDVFRTNLKVALLGGGSPWAGSGAL 135
Query: 126 GITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP 185
G + LE Y++ +WEC L + S E + S ++ + L + ++ +P
Sbjct: 136 GDDKHSKDVAFLEQYSMERWECVLHYMAGS--KEGVSGVSRDVVHILLHSGLMKTEQSSP 193
Query: 186 R--LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
+T +GFQFLL+DT+ Q+W+ + EY++ + R ++ + +SFL +LSF G+ Y+
Sbjct: 194 DPCITPAGFQFLLLDTSTQVWFFMIEYLNTVESRKMDLVECLSFLFQLSFSTLGKDYSTE 253
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL---SMSLTDSSARKEGFIVVE 300
+++ Q+ ++ + GLV Q+ R F PT+LA NL S +A ++GFIVVE
Sbjct: 254 SMTSNQQRFLQHLREFGLV-YQRKRSSQRFYPTRLAVNLATGSKGGDSETASEDGFIVVE 312
Query: 301 TNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 360
TN+R+YAY+ S L ++ LF + Y+ PN+ VG ++++S+ A GITAEQII+FL+
Sbjct: 313 TNYRVYAYTNSSLKVALISLFCSMMYRFPNMAVGVVSRDSVREALSRGITAEQIINFLRN 372
Query: 361 NAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
+AHP R P +P V DQIRLWE + +R Y++F S+
Sbjct: 373 HAHPHTQKRKPILPPTVSDQIRLWELERDRFHFNEGVLYNQFLSQT 418
>gi|189205068|ref|XP_001938869.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187985968|gb|EDU51456.1| RNA polymerase II transcription factor B subunit 2 [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 482
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 140/410 (34%), Positives = 217/410 (52%), Gaps = 32/410 (7%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LYE P A+ R LP LAK VM M Y+ PA ++ W P + A L
Sbjct: 23 KLYEQPSTVLAVFRCMLPHLAKSIVMAMLYMPSPFPAADLDAWFKPSARKEKERATFTLD 82
Query: 84 QLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPRE---PMPSGITARLPTLEDLE 138
+L + + ++ T + L+ FQ +LR + I G R P + + ++E L+
Sbjct: 83 RLHIITSSRQDNGTLSWTLSPGFQRSLR-NAIEGSGTHRSFGVPATKEESGKRVSVEFLD 141
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSS-----SMMKVFQRGLLSRRDKEAPRLTESGFQ 193
Y+ QWE L L+S N S S + G L R +PR+T+ GF
Sbjct: 142 EYSRAQWEGILYYLVSGAAGLGKDNISRAEVSPSTKTLLNTGDLVRTIHGSPRITKDGFS 201
Query: 194 FLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMI 253
F+L +TNAQ+W ++ Y+ + E G+++ ++++FL L G+ Y+ +TLS Q M+
Sbjct: 202 FVLQETNAQVWSLLIIYLKVTNELGMSETEVLAFLFMLGSLELGQDYSTSTLSPTQLRML 261
Query: 254 KDFADLGLVKLQQGRKENWFIPTKLATNL--------SMSLTD-----------SSARKE 294
D + +GL+ + + F PT+LAT L S S D S+ +
Sbjct: 262 DDLSSMGLI-YRSDKNARTFYPTRLATTLTSDSGSAMSASSNDIAQAGQGNAGPSATANK 320
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
GFI++ETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A ++GIT+ QI
Sbjct: 321 GFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQSGITSAQI 380
Query: 355 ISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
IS+L AHP++ +P +P V DQIRLWE + RVE TP + EF S
Sbjct: 381 ISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTPGYLMREFSS 430
>gi|451996931|gb|EMD89397.1| hypothetical protein COCHEDRAFT_1141584 [Cochliobolus
heterostrophus C5]
Length = 482
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 141/410 (34%), Positives = 219/410 (53%), Gaps = 32/410 (7%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LYE P A+ R LP LAK VM M Y+ PA ++ W P + A L
Sbjct: 23 KLYEQPSTVLAVFRCMLPHLAKSIVMAMLYMPTPFPAADLDTWFKPTARKEKERATFTLD 82
Query: 84 QLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPRE---PMPSGITARLPTLEDLE 138
+L + + ++ T + LN FQ +LR + I G R P + ++E L+
Sbjct: 83 RLHIITSARQDNGTLSWTLNPGFQRSLR-NAIEGSGTHRSFGVPATKEESGNRVSIEFLD 141
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSSSMM-----KVFQRGLLSRRDKEAPRLTESGFQ 193
Y+ QWE L L+S + S + + K+ G L R +PR+T+ GF
Sbjct: 142 EYSRSQWEGILYYLVSGAAGLSKDSISRAEVGPGTKKLLHTGDLVRTIHGSPRITKDGFS 201
Query: 194 FLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMI 253
F+L +TNAQ+W ++ Y+ + E G+++ +++SFL L G+ Y+ +TLS Q M+
Sbjct: 202 FVLQETNAQVWSLLIVYLKMTNELGMSETEVLSFLFMLGSLELGQDYSTSTLSATQLQML 261
Query: 254 KDFADLGLVKLQQGRKENWFIPTKLATNLS------MSLTDS-------------SARKE 294
+D + +GLV + R F PT+LAT L+ MS + + +A +
Sbjct: 262 EDLSAMGLV-YRSDRNARTFYPTRLATTLTSDSGSAMSTSSNDIAQASTSTTGPPTAANK 320
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
GFI++ETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A + GIT+ QI
Sbjct: 321 GFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVQAGITSAQI 380
Query: 355 ISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
IS+L AHP++ +P +P V DQIRLWE + RVE T + EF S
Sbjct: 381 ISYLTTYAHPQMQKTVPYIPPTVMDQIRLWEYEGERVETTTGYLMREFGS 430
>gi|345492819|ref|XP_001602702.2| PREDICTED: general transcription factor IIH subunit 4-like [Nasonia
vitripennis]
Length = 428
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 122/380 (32%), Positives = 221/380 (58%), Gaps = 12/380 (3%)
Query: 35 AILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEK-- 92
++ R LP +A+ YVM++ +++ +P ++ W H + + +L ++ E
Sbjct: 6 SVYRELPEIARHYVMRLLFVEQPVPQAVIASWCSKLHVENHLNVVQVMNELNIWKEAAIP 65
Query: 93 KKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQL 152
+ LN+TF+ NL+ L+ GGA I ++ + L++YA+ +WEC L +
Sbjct: 66 GGLPGWILNATFRKNLKIVLLGGGAPWTMSKQLEIDSKPRDIAFLDSYALERWECVLHYM 125
Query: 153 ISSTQAERPTNFSSSMMKVFQRGLLSRRDKE--APRLTESGFQFLLMDTNAQLWYIVREY 210
+ S Q E S+ +++ L +RD+E +P +T++GFQFLL+DT+AQ+WY + +Y
Sbjct: 126 VGSQQQE---GISADAVRILLHAGLMKRDEEDGSPVITQAGFQFLLLDTSAQVWYFILQY 182
Query: 211 ISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKE 270
+ + RG++ + ++FL +L+F G+ Y+ +S+ ++ + GLV Q+ RK
Sbjct: 183 LDTVEARGLDLVECLTFLFQLNFSTLGKDYSTQGMSDGLLMFLQHLREFGLV-YQRKRKA 241
Query: 271 NWFIPTKLATNLSMSLTDSSAR---KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQ 327
F PT+LA N++ +R KE F++VETN+R+YAY+ S L ++ LF ++ Y+
Sbjct: 242 GRFYPTRLALNIATGQDKPISRDLEKERFVIVETNYRVYAYTNSNLQVALIGLFCELLYR 301
Query: 328 LPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD-RMPSVPENVCDQIRLWES 386
PNL+V +T++S+ A ++GITA QI+ +L Q+AH ++ D P++P + DQI+LWE+
Sbjct: 302 FPNLVVAILTRDSVRAALKSGITAVQIVGYLNQHAHNKMIDPGPPTLPPTIVDQIKLWEN 361
Query: 387 DLNRVEMTPAHYYDEFPSRV 406
+ NR + Y +F S++
Sbjct: 362 ERNRFIFSEGVLYSQFHSQI 381
>gi|170120252|ref|XP_001891159.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164633568|gb|EDQ98188.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 475
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 139/414 (33%), Positives = 233/414 (56%), Gaps = 20/414 (4%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
A + + S + L +LY P C +I R L P++++ +M + +++ AIPA M+ WV
Sbjct: 19 AHALLPFLQSQSQNTLTKLYSKPSSCLSIFRLLDPISRQLIMNLLWLESAIPASTMQAWV 78
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGI 127
+G + A+ L +L +F K TF+++ R+ + GG +PS
Sbjct: 79 TREGKPVYTEALATLTKLHIFPNSAIKLAVV---PTFKTSFRQAITGGGTSGSFGVPSEK 135
Query: 128 TARLPT--LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRDKEA 184
+ +E L+ +A+ +WE L ++SS + PT S ++ + QR GL++ +
Sbjct: 136 DDKRSDVDIEFLDVFALERWETILHYMVSSGSGQNPTKPSVRVLFLLQRSGLMTTVGQGP 195
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
++T +GFQFLL + QLW ++ +Y+ ++ER ++ D++ FL LS G Y+
Sbjct: 196 LQITSAGFQFLLHSPHEQLWELLLQYLHLTEERQMDLVDVLGFLFMLSTMELGREYSTER 255
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS-------LTDSSARKEGFI 297
LS+ Q +M++D D GL+ ++ + F PT+LAT L+ S + + S +EGFI
Sbjct: 256 LSKTQTAMLEDLVDYGLIWRRKPSSKG-FHPTRLATTLTSSSPPLPSSIGNRSGPQEGFI 314
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
V+ETN+R+YAY+ + L IL LF ++Y+ PNL+VGAIT++S+ A NGI+A+QIIS+
Sbjct: 315 VLETNYRIYAYTDNPLQTAILNLFVSLKYRFPNLVVGAITRDSVKKALLNGISADQIISY 374
Query: 358 LQQNAHPRVADRM------PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
L +AHP++ + P +P V DQIRLWE + NR++ + Y F S+
Sbjct: 375 LMSHAHPQMRKNVRLRSCNPLLPVTVQDQIRLWELEKNRLKSQEGYLYTSFASQ 428
>gi|296084734|emb|CBI25875.3| unnamed protein product [Vitis vinifera]
Length = 258
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 120/215 (55%), Positives = 145/215 (67%), Gaps = 26/215 (12%)
Query: 216 ERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIP 275
ERG++ ADLISFLLELSFHV GEAYN+NTL+E Q++ IKD DLGLVKLQQGRKE+WFIP
Sbjct: 1 ERGVDPADLISFLLELSFHVTGEAYNINTLTEFQRNTIKDLVDLGLVKLQQGRKESWFIP 60
Query: 276 TKLATNLSMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLI--- 332
TKLATNLSMSL+D+S+RK+GF+VVETNFR+YAYS+SKLHCEILRLFS+ L + +
Sbjct: 61 TKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRYASMLFDCLLCI 120
Query: 333 -----------VGAITKESLYNAFENGITAEQIISFLQQNAHP-----------RVADRM 370
V I E L F G S + A P +A+++
Sbjct: 121 CYSSQKGTLLDVSLIMYECL-RTFSLGKIIFWFFSSVSTVATPCHGYLVIFTISCLAEKI 179
Query: 371 PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
P + QIRLWE+DLNRVE P+H YDEFPSR
Sbjct: 180 PCLQFTNMSQIRLWETDLNRVETMPSHLYDEFPSR 214
>gi|31201525|ref|XP_309710.1| AGAP010984-PA [Anopheles gambiae str. PEST]
gi|21293295|gb|EAA05440.1| AGAP010984-PA [Anopheles gambiae str. PEST]
Length = 501
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 132/423 (31%), Positives = 229/423 (54%), Gaps = 26/423 (6%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
K+ + + S L++LY P IC A+ R LP +A+++V+++ +++ IP ++ W
Sbjct: 34 CKDLEEYLKSRPPEVLEKLYNYPAICLAVYRELPEIARQFVIRILFVEQPIPQAVVSSWA 93
Query: 68 LPDGFTKHRVAIDR-LVQLRLFSEEKKKE--TTYRLNSTFQSNLRKHLIYGGALPREPMP 124
+ K ++ + L +L ++ + L TF+ NL+ L+ GG
Sbjct: 94 -TQVYAKENTSVSQVLTELGVWRSAAYPGGLAAWELCPTFKKNLKIALLGGGRPWSMSNA 152
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLIS--STQAERPTNFSSSMMKVFQRGLLSRRDK 182
+ ++ L+ YA+ +W C L ++ S++ S +++ L +RD+
Sbjct: 153 LDPDQKSRDIDFLDTYAMSRWRCVLHYMVGAGSSKGMEGEGISPDAVRILLHANLMKRDE 212
Query: 183 E--APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
+P +T GFQFLL+DT AQ+W+ + +Y+ + RG+N + +S L +LSF G Y
Sbjct: 213 SDGSPVITRQGFQFLLLDTQAQVWHFMLQYLDTCEARGLNLPECLSMLFQLSFSTLGRDY 272
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLA-----------TNLSMSLTDS 289
+ LS + ++ + GLV Q+ RKE F PT+LA T L+ +
Sbjct: 273 SSEGLSPGLLTFLQHLREFGLV-YQRKRKEGRFYPTRLAHNITSKNATHATTLAQDQESN 331
Query: 290 SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
+ + +G+I+VETN+R+YAY+ S L +L LF+++ Y+ PNL+VG ++++S+ AF GI
Sbjct: 332 ATKDKGYIIVETNYRVYAYTDSNLQVALLGLFTELLYRFPNLVVGVLSRDSVRQAFRGGI 391
Query: 350 TAEQIISFLQQNAHP------RVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFP 403
TAEQIIS+L+Q+AHP + + S+P V DQI+LWE++ NR T Y++F
Sbjct: 392 TAEQIISYLEQHAHPTMLTVEQAINSKSSLPPTVVDQIKLWENERNRFTYTEGVVYNQFL 451
Query: 404 SRV 406
S+
Sbjct: 452 SQA 454
>gi|195378984|ref|XP_002048261.1| GJ11446 [Drosophila virilis]
gi|194155419|gb|EDW70603.1| GJ11446 [Drosophila virilis]
Length = 492
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 140/438 (31%), Positives = 234/438 (53%), Gaps = 45/438 (10%)
Query: 2 PLVK----IIAKNFMDMVASL-TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDG 56
PLV+ + KNF + + + T L++LY P IC A+ R LP +A+++V+++ +ID
Sbjct: 18 PLVQSPENLECKNFQEYLRTRQTPETLEKLYNYPPICLAVYRELPEIARQFVIRILFIDQ 77
Query: 57 AIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIY 114
+P ++ W + A L L ++ + + L+ TF+ ++R+ L+
Sbjct: 78 PVPQAVVTSWGAQRVAKEQMEATSCLTALCVWRVTAIPGGLAAWELSPTFKKSVRQVLLG 137
Query: 115 GG-------ALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISS-----TQAERPT 162
GG L R+ P I L++YA+ +W C L ++ + T AE
Sbjct: 138 GGKPWAMTNTLDRDSKPRDIAF-------LDSYAMSRWRCVLHYMVGTGSRGGTDAE--- 187
Query: 163 NFSSSMMKVFQRGLLSRRD-KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQ 221
S +++ L +RD ++ +T GFQFLL+DT AQ+W+ + +Y+ +ERGI
Sbjct: 188 AISPDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLETCEERGICL 247
Query: 222 ADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATN 281
+ +S L +LSF G Y+ ++ + ++ + GLV Q+ RKE F PT+LA N
Sbjct: 248 PECLSMLFQLSFSTLGRDYSSEGMNNQMLTFLQHLREFGLV-FQRKRKEGRFYPTRLALN 306
Query: 282 L--------SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIV 333
+ S S + ++ G+IVVETN+R+YAY+ S+L +L LF+++ Y+ PNL+V
Sbjct: 307 VTNKDAAQASTSADEERMQERGYIVVETNYRVYAYTDSQLQVAVLGLFTELLYRFPNLVV 366
Query: 334 GAITKESLYNAFENGITAEQIISFLQQNAHPRVA------DRMPSVPENVCDQIRLWESD 387
G +T++S+ A GITAEQI+S+L+Q AHP + +P + DQI+LWE +
Sbjct: 367 GVLTRDSVRQALRGGITAEQIVSYLEQYAHPNMKLVESAIQSKSCLPPTIVDQIKLWEME 426
Query: 388 LNRVEMTPAHYYDEFPSR 405
NR T Y++F S+
Sbjct: 427 RNRFTYTEGVVYNQFLSQ 444
>gi|195441209|ref|XP_002068409.1| GK20438 [Drosophila willistoni]
gi|194164494|gb|EDW79395.1| GK20438 [Drosophila willistoni]
Length = 512
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 135/419 (32%), Positives = 228/419 (54%), Gaps = 41/419 (9%)
Query: 19 TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVA 78
T L++LY P IC A+ R LP +A+++++++ ++D +P ++ W + + A
Sbjct: 53 TPETLEKLYNYPPICLAVYRDLPEIARQFIIRILFVDQPVPQAVVSSWGAQRCAKEQQEA 112
Query: 79 IDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT----ARLP 132
L L ++ + + L+ TF+ ++R+ L+ GG +P P T ++
Sbjct: 113 TSCLTSLNVWRVTAIPGGLAAWELSPTFKKSVRQILLGGG----KPWPMTNTLEKDSKPR 168
Query: 133 TLEDLEAYAIGQWECFLLQLISS-----TQAERPTNFSSSMMKVFQRGLLSRRD-KEAPR 186
+ L++YA+ +W C L ++ + T AE S +++ L +RD ++
Sbjct: 169 DIAFLDSYAMSRWRCVLHYMVGTGNRNGTDAE---AISPDAVRILLHANLMKRDERDGIT 225
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
+T GFQFLL+DT AQ+W+ + +Y+ +ERGI+ + +S L +LSF G Y+ +S
Sbjct: 226 ITRQGFQFLLLDTRAQVWHFMMQYLDTCEERGISLPECLSMLFQLSFSTLGRDYSSEGMS 285
Query: 247 EIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS-------------MSLTDSSARK 293
+ ++ + GLV Q+ RKE F PT+LA N++ MS ++ R
Sbjct: 286 HQMLAFLQHLREFGLV-FQRKRKEGRFYPTRLALNVTSKESADATTAIVTMSASEEEERM 344
Query: 294 E--GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITA 351
+ G+IVVETN+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITA
Sbjct: 345 QDRGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITA 404
Query: 352 EQIISFLQQNAHPRVA------DRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
EQIIS+L+Q AHP + + +P V DQI+LWE + NR T Y++F S
Sbjct: 405 EQIISYLEQYAHPNMKLVESAINSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLS 463
>gi|407927340|gb|EKG20235.1| Transcription factor Tfb2 [Macrophomina phaseolina MS6]
Length = 466
Score = 224 bits (570), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 138/414 (33%), Positives = 222/414 (53%), Gaps = 27/414 (6%)
Query: 12 MDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
++ + L+ RLY+ P A+ R LP LAK VM M Y+ G PA ++ W+
Sbjct: 9 LEYLEGLSGATHKRLYQQPSAVLAVFRRMLPHLAKMIVMAMLYMPGPFPATELDTWIRDG 68
Query: 71 GFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
+ AI L L + ++ E YR++S FQ++LR+ L GG +P T
Sbjct: 69 SSGEKDKAISVLESLHIITQAQETVGARAYRISSGFQTSLRQALTGGGNHRSFGVPCMNT 128
Query: 129 -ARLPTLEDLEAYAIGQWECFLLQLISSTQ-----AERPTNFSSSMMKVFQRGLLSRRDK 182
++ ++E L+ A +WE L ++ ST T + ++++V Q + R
Sbjct: 129 GSKRVSIEALDEIARAKWETILQYMVGSTGTALGVGSEITTGTKTLLEVGQ--FIEMRGG 186
Query: 183 EAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNL 242
P +T+ GF FLL + NAQ+W ++ Y+ NS ++ D++SFL L G+ Y+
Sbjct: 187 R-PSITQGGFSFLLQEVNAQVWSLLIVYLENSHRLNMDNVDVLSFLFMLGSLELGQDYST 245
Query: 243 NTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL-------------SMSLTDS 289
++L+ Q+ M+ D D G+V + K N F PT+LAT L S +L +
Sbjct: 246 DSLTATQQHMLDDLNDFGIV-YRSPPKSNRFYPTRLATTLTSDAAALNNSDSFSSALNST 304
Query: 290 SAR-KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENG 348
SA +G+I++ETN+R+YAY++S L +L LF+ +E + PNL+ G +TKES+ A G
Sbjct: 305 SASGSKGYIIIETNYRLYAYTSSPLQIAVLALFTDLETRFPNLVSGKLTKESVQRAIGLG 364
Query: 349 ITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
ITA+QIIS+L AHP++ + P +P V DQIRLW+ + +R+ T + +F
Sbjct: 365 ITADQIISYLDTYAHPQMQRKTPVLPPTVVDQIRLWQIEGDRMRATEGFLFKDF 418
>gi|194872905|ref|XP_001973103.1| GG13537 [Drosophila erecta]
gi|190654886|gb|EDV52129.1| GG13537 [Drosophila erecta]
Length = 499
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 138/437 (31%), Positives = 239/437 (54%), Gaps = 44/437 (10%)
Query: 2 PLVK----IIAKNFMDMVASL-TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDG 56
PLV+ + K+F + + + T L++LY P IC A+ R LP +A+++++++ ++D
Sbjct: 24 PLVQGPANLECKDFQEYLRTRQTPETLEKLYNYPPICLAVFRELPEIARQFIIRILFVDQ 83
Query: 57 AIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIY 114
+P ++ W + A L L ++ + + + L+ TF+ ++R+ L+
Sbjct: 84 PVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLSAWELSPTFKKSVRQVLLG 143
Query: 115 GGALPREPMPSGIT----ARLPTLEDLEAYAIGQWECFLLQLISS-----TQAERPTNFS 165
GG +P P T ++ + L+ YA+ +W C L ++ + T AE S
Sbjct: 144 GG----KPWPMTNTLEKDSKPRDIAFLDTYAMSRWRCVLHYMVGTGNRNGTDAE---AIS 196
Query: 166 SSMMKVFQRGLLSRRD-KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADL 224
+++ L +RD ++ +T GFQFLL+DT AQ+W+ + +Y+ +ERGI+ +
Sbjct: 197 PDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGISLPEC 256
Query: 225 ISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL-- 282
+S L +LSF G Y+ ++ + ++ + GLV Q+ RKE F PT+LA N+
Sbjct: 257 LSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLV-FQRKRKEGRFYPTRLALNVTS 315
Query: 283 -------SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
S+++ + + + G+IVVETN+R+YAY+ S L +L LF+++ Y+ PNL+VG
Sbjct: 316 KEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGV 375
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHPRVADRM--------PSVPENVCDQIRLWESD 387
+T++S+ A GITAEQI+S+L+Q AHP + RM +P V DQI+LWE +
Sbjct: 376 LTRDSVRQALRGGITAEQIVSYLEQYAHPNM--RMVESAIQSKSCLPPTVVDQIKLWELE 433
Query: 388 LNRVEMTPAHYYDEFPS 404
NR T Y++F S
Sbjct: 434 RNRFTYTEGVVYNQFLS 450
>gi|195327811|ref|XP_002030611.1| GM24480 [Drosophila sechellia]
gi|194119554|gb|EDW41597.1| GM24480 [Drosophila sechellia]
Length = 499
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 133/415 (32%), Positives = 228/415 (54%), Gaps = 39/415 (9%)
Query: 19 TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVA 78
T L++LY P IC A+ R LP +A+++++++ ++D +P ++ W + A
Sbjct: 46 TPESLEKLYNYPPICLAVFRELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEA 105
Query: 79 IDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT----ARLP 132
L L ++ + + + L+ TF+ ++R+ L+ GG +P P T ++
Sbjct: 106 TSCLTALNVWRVTAIPGGLSAWELSPTFKKSVRQVLLGGG----KPWPMTNTLDKDSKPR 161
Query: 133 TLEDLEAYAIGQWECFLLQLISS-----TQAERPTNFSSSMMKVFQRGLLSRRD-KEAPR 186
+ L+ YA+ +W C L ++ + T AE S +++ L +RD ++
Sbjct: 162 DIAFLDTYAMSRWRCVLHYMVGTGNRNGTDAE---AISPDAVRILLHANLMKRDERDGIT 218
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
+T GFQFLL+DT AQ+W+ + +Y+ +ERGI+ + +S L +LSF G Y+ ++
Sbjct: 219 ITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGISLPECLSMLFQLSFSTLGRDYSSEGMN 278
Query: 247 EIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL---------SMSLTDSSARKEGFI 297
+ ++ + GLV Q+ RKE F PT+LA N+ S+++ + + + G+I
Sbjct: 279 SQMLTFLQHLREFGLV-FQRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYI 337
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+
Sbjct: 338 VVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSY 397
Query: 358 LQQNAHPRVADRM--------PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
L+Q AHP + RM +P V DQI+LWE + NR T Y++F S
Sbjct: 398 LEQYAHPNM--RMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLS 450
>gi|21357821|ref|NP_648780.1| marionette [Drosophila melanogaster]
gi|7294263|gb|AAF49614.1| marionette [Drosophila melanogaster]
gi|220942562|gb|ACL83824.1| Tfb2-PA [synthetic construct]
Length = 499
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 133/415 (32%), Positives = 228/415 (54%), Gaps = 39/415 (9%)
Query: 19 TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVA 78
T L++LY P IC A+ R LP +A+++++++ ++D +P ++ W + A
Sbjct: 46 TPESLEKLYNYPPICLAVFRELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEA 105
Query: 79 IDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT----ARLP 132
L L ++ + + + L+ TF+ ++R+ L+ GG +P P T ++
Sbjct: 106 TSCLTALNVWRVTAIPGGLSAWELSPTFKKSVRQVLLGGG----KPWPMTNTLDKDSKPR 161
Query: 133 TLEDLEAYAIGQWECFLLQLISS-----TQAERPTNFSSSMMKVFQRGLLSRRD-KEAPR 186
+ L+ YA+ +W C L ++ + T AE S +++ L +RD ++
Sbjct: 162 DIAFLDTYAMSRWRCVLHYMVGTGNRNGTDAE---AISPDAVRILLHANLMKRDERDGIT 218
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
+T GFQFLL+DT AQ+W+ + +Y+ +ERGI+ + +S L +LSF G Y+ ++
Sbjct: 219 ITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGISLPECLSMLFQLSFSTLGRDYSSEGMN 278
Query: 247 EIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL---------SMSLTDSSARKEGFI 297
+ ++ + GLV Q+ RKE F PT+LA N+ S+++ + + + G+I
Sbjct: 279 SQMLTFLQHLREFGLV-FQRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYI 337
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+
Sbjct: 338 VVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSY 397
Query: 358 LQQNAHPRVADRM--------PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
L+Q AHP + RM +P V DQI+LWE + NR T Y++F S
Sbjct: 398 LEQYAHPNM--RMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLS 450
>gi|195477975|ref|XP_002086439.1| GE22863 [Drosophila yakuba]
gi|194186229|gb|EDW99840.1| GE22863 [Drosophila yakuba]
Length = 499
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 136/435 (31%), Positives = 237/435 (54%), Gaps = 40/435 (9%)
Query: 2 PLVK----IIAKNFMDMVASL-TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDG 56
PLV+ + K+F + + + T L++LY P IC A+ R LP +A+++++++ ++D
Sbjct: 24 PLVQGPANLECKDFQEYLRTRQTPETLEKLYNYPPICLAVFRELPEIARQFIIRILFVDQ 83
Query: 57 AIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIY 114
+P ++ W + A L L ++ + + + L+ TF+ ++R+ L+
Sbjct: 84 PVPQAVVTSWGAQRCAKEQAEATSCLTALNVWRVTAIPGGLSAWELSPTFKKSVRQVLLG 143
Query: 115 GGALPREPMPSGIT----ARLPTLEDLEAYAIGQWECFLLQLISS-----TQAERPTNFS 165
GG +P P T ++ + L+ YA+ +W C L ++ + T AE S
Sbjct: 144 GG----KPWPMTNTLEKDSKPRDIAFLDTYAMSRWRCVLHYMVGTGNRNGTDAE---AIS 196
Query: 166 SSMMKVFQRGLLSRRD-KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADL 224
+++ L +RD ++ +T GFQFLL+DT AQ+W+ + +Y+ +ERGI+ +
Sbjct: 197 PDAVRILLHANLMKRDERDGITITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGISLPEC 256
Query: 225 ISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL-- 282
+S L +LSF G Y+ ++ + ++ + GLV Q+ RKE F PT+LA N+
Sbjct: 257 LSMLFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLV-FQRKRKEGRFYPTRLALNVTS 315
Query: 283 -------SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
S+++ + + + G+IVVETN+R+YAY+ S L +L LF+++ Y+ PNL+VG
Sbjct: 316 KEAAATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGV 375
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHPRV------ADRMPSVPENVCDQIRLWESDLN 389
+T++S+ A GITAEQI+S+L+Q AHP + +P V DQI+LWE + N
Sbjct: 376 LTRDSVRQALRGGITAEQIVSYLEQYAHPNMRLVESAIQSKSCLPPTVVDQIKLWELERN 435
Query: 390 RVEMTPAHYYDEFPS 404
R T Y++F S
Sbjct: 436 RFTYTEGVVYNQFLS 450
>gi|328766914|gb|EGF76966.1| hypothetical protein BATDEDRAFT_92242 [Batrachochytrium
dendrobatidis JAM81]
Length = 471
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 129/413 (31%), Positives = 218/413 (52%), Gaps = 22/413 (5%)
Query: 7 IAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEW 66
I+ N + SL LDRL+ P C AILR LP LAK ++++ Y + +E W
Sbjct: 9 ISGNVGQYLESLPRPVLDRLFAQPATCLAILRLLPDLAKHTILRLLYTSMTVKISDIETW 68
Query: 67 VLPDGFTKHRVAIDRLVQLRLFS-EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
+ + A+ L +L + S + + + +NS FQ ++ L+ G+ MP
Sbjct: 69 ETHEYSGQFSHAVTGLAKLHILSIKNEGSHKSVHINSVFQKSIHNALVGSGSHTSFGMPC 128
Query: 126 -GITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRDKE 183
I P++ LE YA WE L L+ + +RP +++K+ ++ GL++
Sbjct: 129 ETIDKHRPSIAFLETYAKECWESVLHYLVGTPSDKRP----KAIVKLLEKSGLMAPISST 184
Query: 184 AP------RLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
R+T GFQFLL D N Q+W + +Y+ +++ + ++++F +L G
Sbjct: 185 NAAHNGDLRITSKGFQFLLQDVNVQIWAFLLQYLEMAEQLNMELVEVLNFFFQLGSLELG 244
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSS------- 290
+ Y++ L+ QK M+ D LGL+ Q+ +K F PT LAT+L+ + +
Sbjct: 245 QDYSVEVLTPTQKHMLDDLKHLGLI-YQRKKKSTRFYPTHLATSLTSGASAGAVLSPRSL 303
Query: 291 -ARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
A +GFI++ETN+R+YAY+ S L +L LF ++ + N+++G IT++S+ A GI
Sbjct: 304 EADSDGFIIIETNYRVYAYTDSPLQIAVLSLFIEMRVRFANMVIGIITRDSVREALAKGI 363
Query: 350 TAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
+AEQII++L +AHP + P +P + DQ+RLWE + NR+ ++ H Y F
Sbjct: 364 SAEQIIAYLTTHAHPEMKKGSPVLPTTIVDQVRLWEMERNRLRISRGHLYQMF 416
>gi|389749775|gb|EIM90946.1| transcription factor Tfb2 [Stereum hirsutum FP-91666 SS1]
Length = 494
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 141/408 (34%), Positives = 223/408 (54%), Gaps = 28/408 (6%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
L RLY P C +I R L P+ ++ VM + +++ AIP M WV+ + A+ +L
Sbjct: 32 LTRLYNRPSSCLSIFRLLAPVERQLVMNLLWLESAIPTNTMSSWVVKENKKFFDEALAQL 91
Query: 83 VQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARL--PTLEDLEAY 140
+L + K LN TF+++LR + GG +P+ + P++E L+ Y
Sbjct: 92 TRLHILPNPSAK---LGLNPTFKTSLRHAITGGGTSGSFGVPAEADDKRGPPSVEILDTY 148
Query: 141 AIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTN 200
A+ +WE L ++SS + PT S ++ + QR L + ++T SGFQFLL +
Sbjct: 149 ALERWETILHFMVSSGTGQMPTKPSQGVLYLLQRSKLMAPTQGTLQITSSGFQFLLHTPH 208
Query: 201 AQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA-----YNLNTLSEIQKSMIKD 255
AQLW ++ +Y++ +ER ++ +++SFL LS G Y+ +LS+ QK+M++D
Sbjct: 209 AQLWELLLQYLAMVEERQMDLVEVLSFLFMLSTMELGRVRQFHDYSAESLSDTQKAMLED 268
Query: 256 FADLGLVKLQQGRKENWFIPTKLATNLSMSL----------TDSSARKEGFIVVETNFRM 305
D GL+ Q+ F PT+LAT L+ SL T + + +GFI++ETN+R+
Sbjct: 269 LRDYGLI-WQRKATSRRFSPTRLATTLTSSLPPLPTASGTGTSGTGQTQGFIILETNYRL 327
Query: 306 YAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP- 364
YAY+ + L +L LF ++ + PNL+VG+IT++S+ A NGITA+QIIS+L AHP
Sbjct: 328 YAYTDNPLQTAVLNLFVTMKSRFPNLVVGSITRDSVRKALSNGITADQIISYLTTYAHPQ 387
Query: 365 ------RVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
R P +P V DQIRLWE + NR++ Y F S+
Sbjct: 388 MRKNVSRCRSFTPLLPVTVQDQIRLWELERNRLKSQEGFLYTAFASQA 435
>gi|330797174|ref|XP_003286637.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
gi|325083385|gb|EGC36839.1| hypothetical protein DICPUDRAFT_31198 [Dictyostelium purpureum]
Length = 471
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 140/426 (32%), Positives = 235/426 (55%), Gaps = 43/426 (10%)
Query: 15 VASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTK 74
+ +L ARDL+ LY++ + C+AI+RSLPP AK+Y+ +M +D P + ++W +
Sbjct: 7 LTTLDARDLEDLYKDSWTCQAIVRSLPPRAKQYIFKMLMVD-TYPLTIAKDWSQHSSNHQ 65
Query: 75 HRVAIDRLVQLRLF---SEEKKKETTYRLNSTFQSNLRKHLI-YGGALPREPMPSGITAR 130
H+ A+ +L L++ +++ E + +LN FQ N++ L+ + + T +
Sbjct: 66 HKEALKKLFDLKIILLNRGKQQSEQSIQLNPLFQENIKSALVEFDKVIFSNSSNIKDTNK 125
Query: 131 LPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTE 189
+P + DL+ Y+ QWE L L S A +P+ F S ++ L+RRD++ +T
Sbjct: 126 IPNINDLDNYSKKQWEQVLYFL--SDDAAQPSKFISDLL---LSSNLTRRDQDGSLSITS 180
Query: 190 SGFQFLLMDTNAQLWYIVREYISNSQER---GINQA---DLISFLLELSFHVAGEAYNLN 243
GF+FLL D Q+W ++ Y+++ +++ G+ Q DL+ FL LSF G+ Y +
Sbjct: 181 DGFKFLLKDVYTQIWTLLIVYLNDLEKKRREGLTQGSRNDLLGFLFRLSFLQLGKGYLVG 240
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT------------DSSA 291
L+E+QK + GL+ ++ +F PT+L +L+ T +SS+
Sbjct: 241 ELTELQKEYLICLKQFGLIYMKSDASL-FFYPTRLIISLTTGKTLSLIQTLAVDKINSSS 299
Query: 292 -------------RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITK 338
++ G+IV+ETN+R+YAY+ S L +L LF K+ Y+LPNL VG IT+
Sbjct: 300 NSANTVASSTIEKKESGYIVLETNYRLYAYTASSLQISLLSLFVKMLYRLPNLAVGIITR 359
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHY 398
ES+ A +GITA+QII F++ NAHP + +P+ V +QI LWE + NR+ T +
Sbjct: 360 ESIRTALIHGITADQIIDFIRHNAHPNAVNNGQPIPDVVAEQILLWEQERNRITYTKSVL 419
Query: 399 YDEFPS 404
Y+ FP+
Sbjct: 420 YNSFPT 425
>gi|396459783|ref|XP_003834504.1| similar to RNA polymerase II transcription factor B subunit 2
[Leptosphaeria maculans JN3]
gi|312211053|emb|CBX91139.1| similar to RNA polymerase II transcription factor B subunit 2
[Leptosphaeria maculans JN3]
Length = 483
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 137/413 (33%), Positives = 221/413 (53%), Gaps = 36/413 (8%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LYE P A+ R LP LAK VM + Y+ ++P ++ W + A+ L
Sbjct: 23 KLYEQPSTVLAVFRCMLPHLAKSIVMALLYMPTSLPTTDLDAWFKSTARKEKEQALFILD 82
Query: 84 QLRLFSEEKK---KETTYRLNSTFQSNLRKHLIYGGA-----LPREPMPSGITARLPTLE 135
L + ++++ K +++ L FQ +LR L G+ +P SG + ++E
Sbjct: 83 CLHIITQQRNDDGKTSSWALTPGFQRSLRNALEGAGSHCSFGVPATREESG--GKKVSVE 140
Query: 136 DLEAYAIGQWECFLLQLISSTQAERPTNFSSSMM-----KVFQRGLLSRRDKEAPRLTES 190
L+ Y+ GQWE L L+S R + S + + K+ G L R PR+T+
Sbjct: 141 FLDEYSRGQWEGILYYLVSGAAGLRSDSISRAEVGPGTKKLLMEGDLVRVIHGTPRITKD 200
Query: 191 GFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQK 250
GF F+L +TNAQ+W ++ Y+ E G+++ +++SFL L G Y+++TLS+ Q
Sbjct: 201 GFSFVLQETNAQVWSLLIVYLKMVHELGMSETEVLSFLFMLGSLDLGRDYSISTLSDTQL 260
Query: 251 SMIKDFADLGLVKLQQGRKENWFIPTKLATNLS------MSLTDS-------------SA 291
M+ D + +GLV + + F PT+LAT L+ MS + + +
Sbjct: 261 QMLDDLSAMGLV-YRTSKDARTFYPTRLATTLTSDSGSAMSASSNDIAQANQGNAGTLAT 319
Query: 292 RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITA 351
+GFI++ETN+R+YAY+ S + IL LF+K++++ PNL+ G +TKES++ A GIT+
Sbjct: 320 ANKGFIIIETNYRLYAYTNSLIQIAILSLFTKLQHRFPNLVSGKLTKESVHKAVSAGITS 379
Query: 352 EQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
QIIS+L AHP++ + +P V DQIRLWE + RVE+TP +F S
Sbjct: 380 AQIISYLSTYAHPQMQKNVSFIPPTVMDQIRLWEYEGERVEVTPGFLMKDFGS 432
>gi|402224480|gb|EJU04542.1| transcription factor Tfb2 [Dacryopinax sp. DJM-731 SS1]
Length = 466
Score = 222 bits (565), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 142/402 (35%), Positives = 222/402 (55%), Gaps = 25/402 (6%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP--DGFTKHRVAID 80
L RLY P C A+ R L +A++ VM + ++D + WV+P D + A+
Sbjct: 25 LSRLYTRPSACLAVFRLLDDVARQIVMSLLWLDSKTGVDLTA-WVVPNEDSRKHYAAALQ 83
Query: 81 RLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPRE---PMPSGITARLPTLEDL 137
LV+L + E + L FQS LR+ L GG R P P + E L
Sbjct: 84 DLVRLHIAKEANGR---ILLLPAFQSGLRRALT-GGGEHRSFGVPCPGARGKEVMATEQL 139
Query: 138 EAYAIGQWECFLLQLISS-TQAERPTNFSSSMMKVFQR-GLL----SRRDKEAPRLTESG 191
+ YA +WE L ++SS + A R S +++ + +R GL+ + R E+ R+T G
Sbjct: 140 DDYASERWESILHFMVSSGSSAGRVPPPSIAVIFLLRRSGLMVPIGAERHPES-RITSKG 198
Query: 192 FQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKS 251
FQFLL D++ QLW ++ +Y++ S+++G + ++I FL L G+ Y+ LSE Q
Sbjct: 199 FQFLLEDSHTQLWELLLQYLAMSEDQGRDLVEVIGFLFMLGSMQLGQEYSTENLSETQDV 258
Query: 252 MIKDFADLGLVKLQQGRKENWFIPTKLATNL----SMSLTDS----SARKEGFIVVETNF 303
M++DF D GL+ + F PT+LAT L S++ T S +A EGFI++ETN+
Sbjct: 259 MLQDFLDYGLIYRRNPDDHTRFYPTRLATTLTSTSSLAFTSSKHEKAASSEGFIILETNY 318
Query: 304 RMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 363
R+YAY+ + L +L LF + + NL++G +T+ES+ A NGITA+QIIS+L +AH
Sbjct: 319 RVYAYTENPLQIAVLNLFVALHSRFENLVIGRLTRESIKAALANGITADQIISYLTVHAH 378
Query: 364 PRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
P + P +P V DQIRLW+ + NR++ + Y++F S+
Sbjct: 379 PMMHKNNPVLPVTVQDQIRLWQLEKNRLKSENGYLYEDFNSQ 420
>gi|194749905|ref|XP_001957376.1| GF10392 [Drosophila ananassae]
gi|190624658|gb|EDV40182.1| GF10392 [Drosophila ananassae]
Length = 497
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 127/411 (30%), Positives = 222/411 (54%), Gaps = 29/411 (7%)
Query: 19 TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVA 78
T L++LY P IC A+ R LP +A+++++++ ++D +P ++ W + A
Sbjct: 44 TPETLEKLYNYPPICLAVFRELPEIARQFIIRILFVDQPVPQAVVSSWGAQRCAKEQTDA 103
Query: 79 IDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT----ARLP 132
L L ++ + + + L+ TF+ ++R+ L+ GG +P P T ++
Sbjct: 104 TSCLTALNVWRVTAIPGGLSAWELSPTFKKSVRQVLLGGG----KPWPMTNTLEKDSKPR 159
Query: 133 TLEDLEAYAIGQWECFLLQLI---SSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTE 189
+ L+ YA+ +W C L ++ S AE ++ + L+ R +++ +T
Sbjct: 160 DINFLDTYAMSRWRCVLHYMVGTGSRNGAETEAISPDAVRILLHANLMKRDERDGITITR 219
Query: 190 SGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQ 249
GFQFLL+DT AQ+W+ + Y+ +ERG+ + +S L +LSF G Y+ +++
Sbjct: 220 QGFQFLLLDTRAQVWHFMLHYLDTCEERGMPLPECLSMLFQLSFSTLGRDYSSEGMNKQM 279
Query: 250 KSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS---------MSLTDSSARKEGFIVVE 300
++ + GLV Q+ RKE F PT+LA N++ ++ + ++ G+IVVE
Sbjct: 280 LDFLQHLREFGLV-YQRKRKEGRFYPTRLALNVTSKEAAVAATLTTDEEGVQESGYIVVE 338
Query: 301 TNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 360
TN+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q
Sbjct: 339 TNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQ 398
Query: 361 NAHPRV------ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
AHP + +P V DQI+LWE + NR T Y++F S+
Sbjct: 399 YAHPNMRLVESAIQSKSCLPPTVVDQIKLWEMERNRFTYTEGVVYNQFLSQ 449
>gi|19113355|ref|NP_596563.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces pombe 972h-]
gi|62901133|sp|P87303.2|TFB2_SCHPO RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|3080538|emb|CAA18666.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces pombe]
Length = 447
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 128/385 (33%), Positives = 212/385 (55%), Gaps = 14/385 (3%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLY+ P C A+ R LP LA++YVM M + + + W+ + ++LV+
Sbjct: 21 RLYQKPAACLAVFRLLPILARQYVMSMLFNPMPVALSDFDLWIKLSSKVYQSESFNKLVR 80
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMP-SGITARLPTLEDLEAYAIG 143
+ +F + + T LNS F+ L GG +P + L T++ L+AYA
Sbjct: 81 MHIFQFDGQYIT---LNSEFRKQFITALTGGGNHNSFGVPCTDEDKHLVTVDFLDAYAKE 137
Query: 144 QWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQL 203
WE L ++ + +A+ P ++ + +RG L K R+T +GFQFLL D NAQ+
Sbjct: 138 TWETILHFMVGTPEAKFP---GEGVLSLLKRGGLMSGPKNQLRITRAGFQFLLQDINAQI 194
Query: 204 WYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVK 263
W ++ +Y+ S++ ++ ++ FL L G AY+++ L++ Q+ M++D + GLV
Sbjct: 195 WTLLLDYLKLSEDTHMDPVQVLHFLFMLGSLDLGRAYSVDFLTDTQQIMLEDLREYGLV- 253
Query: 264 LQQGRKENWFIPTKLATNLSMSLTDSSARK------EGFIVVETNFRMYAYSTSKLHCEI 317
Q+ F PT+LAT L+ ++ +GFI+VETN+R+YAY++S L I
Sbjct: 254 YQRKITSKRFYPTRLATGLTTDYRSLHGKQSENDDDKGFIIVETNYRLYAYTSSPLQIAI 313
Query: 318 LRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENV 377
+ LF+ + + NL+VG IT++S+ A NGI AEQII++L +AHP++ +P +P +
Sbjct: 314 IGLFANLRARFSNLVVGVITRDSIRRALMNGIAAEQIITYLTTHAHPQMRSNVPLLPPTL 373
Query: 378 CDQIRLWESDLNRVEMTPAHYYDEF 402
DQI LWE + NR+ TP + +F
Sbjct: 374 VDQIYLWELEKNRLRATPGILFRDF 398
>gi|443894209|dbj|GAC71558.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB2 [Pseudozyma antarctica
T-34]
Length = 491
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 131/395 (33%), Positives = 214/395 (54%), Gaps = 15/395 (3%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
L RLYE P C AI R LP LA++ +M M ++ + A W+ D + VA+ +L
Sbjct: 55 LIRLYEKPASCLAIFRLLPMLARQLIMHMLFLHAPLAADDFLAWLRKDVRNEFDVAVAKL 114
Query: 83 VQLRLFS-EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPT-LEDLEAY 140
+L + + + LN+ F +R+ L GG+ +P + + L+ Y
Sbjct: 115 ARLSIVQLKPASAKQILLLNAVFTEGMRRALTGGGSHRSFGVPCDTEDKNAVDIAFLDDY 174
Query: 141 AIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP---RLTESGFQFLLM 197
A +WE L ++ S ++ P +++ + L+ R A +T GFQFLL
Sbjct: 175 ARTKWETILHYMVGSDKSSTPREPVLYLLR--RSNLMQSRSAAATASLNITSRGFQFLLE 232
Query: 198 DTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFA 257
D N QLW+++ +Y+ ++ER ++ ++++FL L G Y+ L E Q M++DF
Sbjct: 233 DVNTQLWHLLLQYLDMAEERNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLEDFR 292
Query: 258 DLGLVKLQQGRKENWFIPTKLATNLS-------MSLTDSSARKEGFIVVETNFRMYAYST 310
D GLV Q+ F PT+LAT L+ +S S + G+I++ETN+R+YAY++
Sbjct: 293 DYGLV-YQRKASSRRFYPTRLATTLTSSAAAPLLSTNGSEPEERGYIILETNYRLYAYTS 351
Query: 311 SKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRM 370
+ L +L LF ++ + PNL+VG+IT++S+ +A NGITAEQII++L +AH ++
Sbjct: 352 NPLRVAVLSLFVTIKARFPNLLVGSITRDSVKSALANGITAEQIITYLTHHAHLQMHRND 411
Query: 371 PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
P +P V DQIRLWE + NRV+ + +F S+
Sbjct: 412 PLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQ 446
>gi|440800489|gb|ELR21525.1| transcription factor tfb2 subfamily protein [Acanthamoeba
castellanii str. Neff]
Length = 465
Score = 220 bits (561), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 123/408 (30%), Positives = 224/408 (54%), Gaps = 41/408 (10%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIP-AKMMEEWVLP----DGFTKHRV 77
LD LY +P+ C+A+ RSLP LAK++V+++ + +I +++++ W P G R
Sbjct: 16 LDNLYADPWTCQAVFRSLPALAKQWVLRLAFASPSITRSRLLQHWTSPRFQSQGEAALRR 75
Query: 78 AIDRLVQLRLFSEEKKKE------------------TTYRLNSTFQSNLRKHLIYGGALP 119
+ + +++ Y N F SN +K
Sbjct: 76 LLALRILRERLPADRRDHGGGDEADVMEVMEPLGGGVAYEFNKHFGSNFQK--------- 126
Query: 120 REPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRG--LL 177
+ +PT E+L A+ W+ L ++ ST + P S ++ + RG ++
Sbjct: 127 MDGEGEEEEEAMPTREELAAFTTSCWDTIFLFIMGSTVIQPP---SDRVVSLLTRGEFMV 183
Query: 178 SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
+ ++ R+ + GF FLL D Q+W ++ Y+ + QE N D++SFL LSF G
Sbjct: 184 VHEEDQSIRIADKGFPFLLKDLRTQVWTLLLLYLRSLQEEKANVHDVLSFLFRLSFLTVG 243
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR-KEGF 296
E Y ++ L+ + +++D DLG++ ++ + W PT+LA LS T+++ R +EG+
Sbjct: 244 EGYQMDDLAFSESGLLQDLQDLGII-YRKHKDSKWLYPTQLAIGLSS--TEAAKRDQEGW 300
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
I+V T++R+YAY++S + +L LF+++EYQLPN+++G + +E++ A + GI+A QI+
Sbjct: 301 IIVGTDYRIYAYTSSPVKLLLLSLFTQIEYQLPNMVMGILLRENIRQAVQVGISANQILQ 360
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
FL+ NAHP++ P +PE++ DQ+RLWE++ R+ ++ ++YD+F S
Sbjct: 361 FLETNAHPQMKQNTPIIPESIADQLRLWEAEDRRLSLSSGYFYDDFAS 408
>gi|50557260|ref|XP_506038.1| YALI0F30173p [Yarrowia lipolytica]
gi|62901293|sp|Q6BZX4.1|TFB2_YARLI RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49651908|emb|CAG78851.1| YALI0F30173p [Yarrowia lipolytica CLIB122]
Length = 467
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 129/409 (31%), Positives = 217/409 (53%), Gaps = 26/409 (6%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGF 72
D + L L RLY++P C A+ R LP LA+ +M M + I ++ V P
Sbjct: 11 DYLEGLPEAVLTRLYQSPATCLAVFRLLPALARTLIMGMIFNPDPIAVADVDALVKP--- 67
Query: 73 TKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP 132
+ R+ ++ +LRL + + +N TF+ NLR L+ G +P +
Sbjct: 68 SSQRLKLETQKKLRLLHIFTETQAHIIINPTFKKNLRAALVGGDQNISFGVPCDTEDKHK 127
Query: 133 T-LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESG 191
+ L+A+A+ QWE L ++ ++ P S ++ + + L ++ R+T +G
Sbjct: 128 VDVAFLDAHAVSQWEMILHFMVGTSIGRTP---SDGVLNLLKHSGLMEPERGGLRITNAG 184
Query: 192 FQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKS 251
FQFLL D NAQ+W ++ +Y++ S+ ++ D+++F+ L G+ Y+L+ LSE QK
Sbjct: 185 FQFLLQDVNAQIWTLLLQYLNMSEYLQMDPVDVLNFIFMLGSLELGQDYSLSALSETQKH 244
Query: 252 MIKDFADLGLVKLQQGRKENWFIPTKLATNLSM------------------SLTDSSARK 293
M++D D G+V Q+ F PT+LAT L+ +++ S A
Sbjct: 245 MLEDLRDYGIV-YQRKASSRRFYPTRLATTLTSETAALRTASQSMEAATQDTISSSVAAD 303
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
GFI++ETNFR+YAY+ S L +L LF ++ + N++ G I ++S+ A NGITAEQ
Sbjct: 304 SGFIILETNFRLYAYTESPLQIAVLNLFVNLKTRFANMVTGQINRDSVRFALSNGITAEQ 363
Query: 354 IISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
II++L +AHPR+ +P V DQI+LW+ +++R+ T + + EF
Sbjct: 364 IITYLSVHAHPRMKGMEHVLPPTVVDQIKLWQLEMDRIRATDGYLFSEF 412
>gi|194388768|dbj|BAG60352.1| unnamed protein product [Homo sapiens]
Length = 386
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 128/367 (34%), Positives = 205/367 (55%), Gaps = 10/367 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 18 VHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 77
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 78 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 137
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 138 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKST 194
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 195 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 254
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA--RKEGFI 297
Y++ +S+ + ++ + GLV Q+ RK + PT+LA NLS ++ + + GFI
Sbjct: 255 YSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFI 313
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
VVETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII F
Sbjct: 314 VVETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHF 373
Query: 358 LQQNAHP 364
L+ AHP
Sbjct: 374 LRTRAHP 380
>gi|195590491|ref|XP_002084979.1| GD12553 [Drosophila simulans]
gi|194196988|gb|EDX10564.1| GD12553 [Drosophila simulans]
Length = 500
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 133/416 (31%), Positives = 228/416 (54%), Gaps = 40/416 (9%)
Query: 19 TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVA 78
T L++LY P IC A+ R LP +A+++++++ ++D +P ++ W + A
Sbjct: 46 TPESLEKLYNYPPICLAVFRELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEA 105
Query: 79 IDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT----ARLP 132
L L ++ + + + L+ TF+ ++R+ L+ GG +P P T ++
Sbjct: 106 TSCLTALNVWRVTAIPGGLSAWELSPTFKKSVRQVLLGGG----KPWPMTNTLDKDSKPR 161
Query: 133 TLEDLEAYAIGQWECFLLQLI-----SSTQAERPTNFSSSMMKVFQRGLLSRRD-KEAPR 186
+ L+ YA+ +W C L ++ S T AE S +++ L +RD ++
Sbjct: 162 DIAFLDTYAMSRWRCVLHYMVGTGNRSGTDAE---AISPDAVRILLHANLMKRDERDGIT 218
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELS-FHVAGEAYNLNTL 245
+T GFQFLL+DT AQ+W+ + +Y+ +ERGI+ + +S L +L+ F G Y+ +
Sbjct: 219 ITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGISLPECLSMLFQLTQFSTLGRDYSSEGM 278
Query: 246 SEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL---------SMSLTDSSARKEGF 296
+ + ++ + GLV Q+ RKE F PT+LA N+ S+++ + + + G+
Sbjct: 279 NNQMLTFLQHLREFGLV-FQRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGY 337
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
IVVETN+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S
Sbjct: 338 IVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVS 397
Query: 357 FLQQNAHPRVADRM--------PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+L+Q AHP + RM +P V DQI+LWE + NR T Y++F S
Sbjct: 398 YLEQYAHPNM--RMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLS 451
>gi|290987279|ref|XP_002676350.1| predicted protein [Naegleria gruberi]
gi|284089952|gb|EFC43606.1| predicted protein [Naegleria gruberi]
Length = 532
Score = 217 bits (553), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 145/466 (31%), Positives = 237/466 (50%), Gaps = 75/466 (16%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFY--------IDGAIP 59
+ N + +LT +L+ LY + + C+A+ R LP +AK+YVM+M I IP
Sbjct: 22 SDNIFSYLLTLTPDELNLLYGHTWTCKAVFRFLPDIAKQYVMRMLLCGSINSQQIAMDIP 81
Query: 60 AKMM------------------EEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLN 101
K+ +E +L + H A +L++ R+ S K + + LN
Sbjct: 82 KKLTNLSGYLDSNGAKKGREQDQEAILQEVRDAHENAFSQLLKYRIIS--KTSNSDFTLN 139
Query: 102 STFQSNLRK--------------HLIYGGALPREPMPSGITARLPTLED----------- 136
FQ R H +EP S + LE
Sbjct: 140 RYFQDTFRNLVTSVPSAQAPTTSHSSSHKRKEQEPETSISEENMDELEPKKKKRKKHKTK 199
Query: 137 ------LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTES 190
L+ Y+ +WE LL L+ T AE FS ++ ++ + L ++ + R+T
Sbjct: 200 EIGAEFLQEYSKRKWENVLLYLVG-TAAESEQTFSGNVNELLKYSQLIKKS-DTVRITNE 257
Query: 191 GFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQK 250
GFQFLL +T Q+W +++ Y+ S +R + ++++F+ EL F G+ Y+ L+ QK
Sbjct: 258 GFQFLLQETKVQVWKLLKHYLETSGQRNQVKNEILNFIFELGFLDVGKEYSSKDLTSTQK 317
Query: 251 SMIKDFADLGLVKLQQGRKEN----WFIPTKLATNLSMSLTDS----------SARKEGF 296
S++ DF DLG++ L + +K+ +F PT LA +L++S++ S ++ G+
Sbjct: 318 SLLVDFNDLGIIYLHRDKKKKIKDKYFFPTPLAKSLTVSMSTSYDLISSFGSHNSINNGY 377
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
I+VETN+R+YAY+ S L +L LF EY+LPN++VG IT+ ++ A +NGI+A QI+
Sbjct: 378 IIVETNYRVYAYTNSPLQIALLSLFIFPEYRLPNMVVGLITRSTIREALKNGISAHQILQ 437
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
FL+ NAHP++ + P +P+ V DQI LWE + NRV T + YD+F
Sbjct: 438 FLRLNAHPQMRLKKPVIPDTVSDQILLWEKERNRVIKTTSVVYDKF 483
>gi|169610043|ref|XP_001798440.1| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
gi|160701967|gb|EAT84391.2| hypothetical protein SNOG_08115 [Phaeosphaeria nodorum SN15]
Length = 426
Score = 217 bits (553), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 132/379 (34%), Positives = 204/379 (53%), Gaps = 25/379 (6%)
Query: 49 MQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQS 106
M M Y+ + PA +E W P ++ A+D L +L + E K +Y L FQ
Sbjct: 1 MAMLYMPSSFPAADLEAWFKPSARSERVQALDILERLHILGSKREDDKSLSYSLIPGFQR 60
Query: 107 NLRKHLIYGGALPRE-PMPSGITARLP---TLEDLEAYAIGQWECFLLQLISSTQAERPT 162
+LR H I G R +P+ R ++E L+A+A QWE L ++S +P
Sbjct: 61 SLR-HAIEGSGTHRTFGVPASKAERGEKRLSIEFLDAHAREQWEGILFFMVSGAAGFQPG 119
Query: 163 ----NFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERG 218
+ K+ G L R PR+T+ GF F+L +TNAQ+W ++ Y+ + G
Sbjct: 120 SVKMDVGPGTKKLLHAGDLVRTVHGTPRITKEGFSFVLQETNAQVWNLLIVYLKMVDDLG 179
Query: 219 INQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKL 278
+++ +++SFL L G+ Y+ +TLS Q M+ D + +G+V + + F PT+L
Sbjct: 180 MSETEVLSFLFMLGSLELGQDYSTSTLSPSQLHMLDDLSAMGIV-YRSSKDSPTFYPTRL 238
Query: 279 ATNLSMSLT-------------DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVE 325
AT L+ DS A+ +GFI+VETN+R+YAY+ S + IL LF+K++
Sbjct: 239 ATTLTSDSGALPGSDVGTTEKPDSKAQNKGFIIVETNYRLYAYTNSLIQIAILSLFTKLQ 298
Query: 326 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWE 385
++ PNL+ G +TKES++ A + GIT+ QIIS+L AHP++ P +P V DQIRLWE
Sbjct: 299 HRFPNLVSGKLTKESVHRAVQAGITSAQIISYLTTYAHPQMQKSNPPLPPTVMDQIRLWE 358
Query: 386 SDLNRVEMTPAHYYDEFPS 404
+ RVE+T + EF S
Sbjct: 359 YEGERVEVTTGYLMREFGS 377
>gi|302684027|ref|XP_003031694.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
gi|300105387|gb|EFI96791.1| hypothetical protein SCHCODRAFT_15746 [Schizophyllum commune H4-8]
Length = 476
Score = 217 bits (553), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 139/431 (32%), Positives = 223/431 (51%), Gaps = 40/431 (9%)
Query: 7 IAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEW 66
+ + + + S L RLY+ P C +I R L PL ++ VM + +++ A+P M W
Sbjct: 9 VPHSLLPFLQSQPQNTLARLYQRPSSCLSIFRLLGPLERQIVMNLLWLESAVPVSTMGAW 68
Query: 67 VLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGG-----ALPRE 121
V+ + + A++ L +L + K ++ TF+S+LR+ L GG +P E
Sbjct: 69 VVREHQKIYEAALETLGKLHIVPTSPVK---LAVHPTFKSSLRQALTGGGTQGSFGIPAE 125
Query: 122 PMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRD 181
P + P + L+ YA+ +WE L ++SS + ++ + + + L
Sbjct: 126 P---DGKQQAPDVGKLDGYALERWETILHFMVSSGTGQESAQPAAGVTYLLKESGLMESR 182
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA-- 239
R+T +GFQFLL+ +AQLW ++ Y+ + I+ +++SFL +S G+
Sbjct: 183 GGTLRITSAGFQFLLLTPHAQLWELLIHYLRLQEANNIDLVEVLSFLFMVSTTEFGQGNL 242
Query: 240 ----------YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENW--FIPTKLATNLSMSL- 286
Y+ LS Q ++ D GL+ RK+N+ F PT+LAT L+ S
Sbjct: 243 EMLTGDLHQEYSTENLSTSQLVVMDDLRHYGLL---WQRKKNYPRFSPTRLATTLTSSAP 299
Query: 287 ------TDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKES 340
+S KEGFIV+ETN+R+YAY+ + L +L LF +Y+ PNL+VG+IT+ES
Sbjct: 300 PLPTSSGPTSGPKEGFIVLETNYRVYAYTDNALQTAVLNLFITPKYRFPNLVVGSITRES 359
Query: 341 LYNAFENGITAEQIISFLQQNAHPRVADRM-----PSVPENVCDQIRLWESDLNRVEMTP 395
+ A NGITA+QIIS+L +AHP++ + P +P V DQIRLWE + NR +
Sbjct: 360 VKRAMSNGITADQIISYLITHAHPQMHKNVCLVANPLLPVTVQDQIRLWELERNRTKSQE 419
Query: 396 AHYYDEFPSRV 406
Y EF S+
Sbjct: 420 GFLYTEFASQA 430
>gi|325181847|emb|CCA16302.1| general transcription factor IIH subunit putative [Albugo laibachii
Nc14]
Length = 473
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/431 (30%), Positives = 226/431 (52%), Gaps = 47/431 (10%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP--- 69
+ + +L L+ LY +P++C+A+ +SLP LA+++VM++ + + +++ EW++
Sbjct: 5 EFLETLPNATLESLYADPWVCQAVFQSLPSLAQQFVMRLLSCNTVVQEQVLREWLVTQVQ 64
Query: 70 ---------------DGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIY 114
D +H +I +L +LR+F++ +Y+L++ FQ L+ L +
Sbjct: 65 SDPGTSVGKEKAKSIDEIYQH--SIQKLRKLRVFTQSSTD--SYQLHAGFQKQLKYALCH 120
Query: 115 GGALP----REPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMK 170
G P R + I + E LE YA W+ L ++ ST P +++
Sbjct: 121 LGGCPWEVGRRKLGKDIEVSFASTE-LERYARSHWDAVLHYMVGSTSVPDPPQ---AVVD 176
Query: 171 VFQRGLL--SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERG-INQADLISF 227
+ R L + D+ A +T+ G++F+L D + Q+W + EYI G + Q D++ F
Sbjct: 177 ILLRTNLMQASVDQRALHITDKGYEFMLKDIHIQMWIFILEYIKTLDRTGALKQEDILRF 236
Query: 228 LLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT 287
L ++S+ Y + L++ Q+ ++ DF + GL+ ++ + F T LA NL T
Sbjct: 237 LFQISYCQVNAYYPVQDLTKTQQLLLTDFNNFGLLYRKRSNSDR-FYTTSLAINLIFGGT 295
Query: 288 DSSARKEG-------------FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG 334
+S E +VETNF++YAY+TS LH +L +F + +LPNL +G
Sbjct: 296 TTSNYSETQVARPRMSSMSDLLTIVETNFKVYAYTTSTLHIAMLSVFVDIVARLPNLAIG 355
Query: 335 AITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMT 394
IT+ESL +A +GI+A+QI FL ++AHPR+ P VPEN+ DQI LWE + NR++
Sbjct: 356 FITRESLRSALVHGISAQQIYDFLLKHAHPRMKKNNPIVPENIADQIYLWEKERNRIKFL 415
Query: 395 PAHYYDEFPSR 405
+D F S+
Sbjct: 416 EGILFDGFNSK 426
>gi|343428936|emb|CBQ72481.1| probable TFB2-TFIIH subunit (transcription/repair factor)
[Sporisorium reilianum SRZ2]
Length = 493
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/398 (32%), Positives = 213/398 (53%), Gaps = 18/398 (4%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
L RLYE P C AI R LP +A++ +M M ++D + A W+ + A+D+L
Sbjct: 54 LIRLYEKPASCLAIFRLLPMMARQLIMHMLFLDVPLAADDFMAWIKKEARKDFDAAVDKL 113
Query: 83 VQLRLFS-EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPT-LEDLEAY 140
+L + + + LN+ F +R+ L GG +P + + L+ Y
Sbjct: 114 SRLSIVQLKTTAAKQILLLNAVFTDGMRRALTGGGKHRSFGVPCDTEDKNAVDVAFLDEY 173
Query: 141 AIGQWECFLLQLISSTQAERPTN------FSSSMMKVFQRGLLSRRDKEAPRLTESGFQF 194
A +WE L ++ S + P S++M+ R + +T GFQF
Sbjct: 174 ARTKWETILHYMVGSDNSSTPREPVLYLLRRSNLMQ--PRSTSTSSSSGGLNITSRGFQF 231
Query: 195 LLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIK 254
LL D N QLW ++ +Y+ ++ER ++ ++++FL L G Y+ L E Q M++
Sbjct: 232 LLEDVNTQLWDLLLQYLDMAEERNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLE 291
Query: 255 DFADLGLVKLQQGRKENWFIPTKLATNLSMS-----LTDSSARKE--GFIVVETNFRMYA 307
DF D GLV Q+ F PT+LAT L+ S L+ + +E G+I++ETN+R+YA
Sbjct: 292 DFRDYGLV-YQRKASSRRFYPTRLATTLTSSAAVPLLSSNGTEQEERGYIILETNYRLYA 350
Query: 308 YSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 367
Y++++L +L LF ++ + PNL+VG+IT++S+ +A NGITAEQII++L +AH ++
Sbjct: 351 YTSNRLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQIITYLTHHAHVQMH 410
Query: 368 DRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
P +P V DQIRLWE + NRV+ + +F S+
Sbjct: 411 RNDPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQ 448
>gi|213402795|ref|XP_002172170.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces japonicus yFS275]
gi|212000217|gb|EEB05877.1| transcription factor TFIIH complex subunit Tfb2
[Schizosaccharomyces japonicus yFS275]
Length = 444
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 128/384 (33%), Positives = 205/384 (53%), Gaps = 13/384 (3%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLY+ P C AI R LP +A++YVM M + + + W + ++LV+
Sbjct: 21 RLYQKPATCLAIFRLLPIIARQYVMAMLFNPSPVALNDFDLWTKLSSKVYQTESFNKLVR 80
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPT-LEDLEAYAIG 143
+ +F + + LNS F+ L GG +P R +E L+ YA
Sbjct: 81 MHIFQVDGQNVI---LNSEFRQQFITALTGGGTHNSFGVPCIDEDRQRVDIEFLDKYATD 137
Query: 144 QWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQL 203
WE L ++ ++ + P ++ + +RG L K R+T SGFQFLL D N Q+
Sbjct: 138 TWETILHFMVGTSVKKIP---GEGVLSLLRRGGLMTGTKNQVRITHSGFQFLLQDINTQI 194
Query: 204 WYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVK 263
W ++ EY+ S++ ++ ++ FL L G Y+++ L++ Q+ M++D + GLV
Sbjct: 195 WTLLLEYLKLSEDTHMDPVQVLHFLFMLGSLELGRDYSVHFLTDTQQIMLEDLREYGLV- 253
Query: 264 LQQGRKENWFIPTKLATNLSMSLTD-----SSARKEGFIVVETNFRMYAYSTSKLHCEIL 318
Q+ F PT+LAT+L+ S +GFI+VETN+R+YAY+ S L IL
Sbjct: 254 YQKKSSSRRFYPTRLATSLTTEYHSPVKGAGSDAGKGFIIVETNYRLYAYTDSPLQIAIL 313
Query: 319 RLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVC 378
LF+ + + NLIVG IT++S+ A ++GITAEQII++L +AHP++ +P +P +
Sbjct: 314 GLFTNLRARFSNLIVGVITRDSIRRALQSGITAEQIITYLTTHAHPQMRKEIPLLPPTLV 373
Query: 379 DQIRLWESDLNRVEMTPAHYYDEF 402
DQI LWE + NR+ TP + +F
Sbjct: 374 DQIYLWELERNRLRATPGILFRDF 397
>gi|281208616|gb|EFA82792.1| general transcription factor IIH [Polysphondylium pallidum PN500]
Length = 458
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 136/416 (32%), Positives = 236/416 (56%), Gaps = 42/416 (10%)
Query: 8 AKNFMDM---VASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMME 64
A++ +D+ ++SL+ DL+ LY + F C+AILRSLPP K+Y+++M I+ +I +
Sbjct: 22 ARSTLDIFQYLSSLSRNDLEELYSDTFTCQAILRSLPPRCKQYILKMLLIE-SIQVNNVS 80
Query: 65 EWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMP 124
W H +++RL L++ +++ FQSN+++ + + +
Sbjct: 81 SWTAQSNKQTHLESLERLEDLKILIKQE-----------FQSNIKESITN----MKNVVV 125
Query: 125 SGITARLPTL----EDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR 180
S + P L + L++Y+ QWE L L S +E P + ++ + L++
Sbjct: 126 SAVERVEPNLQMSVDQLDSYSKSQWEKVLYFL--SDDSETPPELIADLLALSN---LTKL 180
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
+ + +T GF+FLL D Q+W ++ Y+++ + RG ++ D ++FL +LSF G AY
Sbjct: 181 NDGSMVITSEGFKFLLKDIYTQIWTLIIVYLNSLETRGKSRRDALAFLFKLSFLSLGSAY 240
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT-----------DS 289
+N L+E +KS++ D GLV ++ + E F PT+L +L+ T +
Sbjct: 241 YVNDLTEDEKSLLFDLRQFGLVYIRSEKSE-IFYPTRLIISLTTGKTVTVIKDLAKEMSN 299
Query: 290 SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
+ +++G+I++ETNFR+YAY+ S L +L LF K+ Y+LPNL VG +T+ES+ AF +GI
Sbjct: 300 TQKEQGYIILETNFRIYAYTASSLQISLLSLFVKMLYRLPNLSVGILTRESIRTAFLHGI 359
Query: 350 TAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
TA+QI+ F++QN HP + PE V +QIR+WE++ NR+ A +D FP++
Sbjct: 360 TADQIVDFIKQNGHPNMLK--VGAPEIVFEQIRIWENERNRILYKKAVLFDSFPTQ 413
>gi|325181222|emb|CCA15636.1| general transcription factor IIH subunit putative [Albugo laibachii
Nc14]
Length = 1033
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/431 (30%), Positives = 226/431 (52%), Gaps = 47/431 (10%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP--- 69
+ + +L L+ LY +P++C+A+ +SLP LA+++VM++ + + +++ EW++
Sbjct: 52 EFLETLPNATLESLYADPWVCQAVFQSLPSLAQQFVMRLLSCNTVVQEQVLREWLVTQVQ 111
Query: 70 ---------------DGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIY 114
D +H +I +L +LR+F++ +Y+L++ FQ L+ L +
Sbjct: 112 SDPGTSVGKEKAKSIDEIYQH--SIQKLRKLRVFTQSSTD--SYQLHAGFQKQLKYALCH 167
Query: 115 GGALP----REPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMK 170
G P R + I + E LE YA W+ L ++ ST P +++
Sbjct: 168 LGGCPWEVGRRKLGKDIEVSFASTE-LERYARSHWDAVLHYMVGSTSVPDPPQ---AVVD 223
Query: 171 VFQRGLL--SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERG-INQADLISF 227
+ R L + D+ A +T+ G++F+L D + Q+W + EYI G + Q D++ F
Sbjct: 224 ILLRTNLMQASVDQRALHITDKGYEFMLKDIHIQMWIFILEYIKTLDRTGALKQEDILRF 283
Query: 228 LLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT 287
L ++S+ Y + L++ Q+ ++ DF + GL+ ++ + F T LA NL T
Sbjct: 284 LFQISYCQVNAYYPVQDLTKTQQLLLTDFNNFGLLYRKRSNSDR-FYTTSLAINLIFGGT 342
Query: 288 DSSARKEG-------------FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG 334
+S E +VETNF++YAY+TS LH +L +F + +LPNL +G
Sbjct: 343 TTSNYSETQVARPRMSSMSDLLTIVETNFKVYAYTTSTLHIAMLSVFVDIVARLPNLAIG 402
Query: 335 AITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMT 394
IT+ESL +A +GI+A+QI FL ++AHPR+ P VPEN+ DQI LWE + NR++
Sbjct: 403 FITRESLRSALVHGISAQQIYDFLLKHAHPRMKKNNPIVPENIADQIYLWEKERNRIKFL 462
Query: 395 PAHYYDEFPSR 405
+D F S+
Sbjct: 463 EGILFDGFNSK 473
>gi|359497087|ref|XP_002267711.2| PREDICTED: general transcription factor IIH subunit 4-like [Vitis
vinifera]
Length = 238
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 110/191 (57%), Positives = 130/191 (68%), Gaps = 43/191 (22%)
Query: 215 QERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFI 274
QERG++ ADLISFLLELSFHV GEAYN+NTL+E Q++ IKD DLGLVKLQQGRKE+WFI
Sbjct: 47 QERGVDPADLISFLLELSFHVTGEAYNINTLTEFQRNTIKDLVDLGLVKLQQGRKESWFI 106
Query: 275 PTKLATNLSMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG 334
PTKLATNLSMSL+D+S+RK+GF+VVETNFR+YAYS+SKLHCEILRLFS+
Sbjct: 107 PTKLATNLSMSLSDTSSRKQGFVVVETNFRLYAYSSSKLHCEILRLFSRSS--------- 157
Query: 335 AITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMT 394
+S + H + IRLWE+DLNRVE
Sbjct: 158 --------------------VSTVATPCHGYL--------------IRLWETDLNRVETM 183
Query: 395 PAHYYDEFPSR 405
P+H YDEFPSR
Sbjct: 184 PSHLYDEFPSR 194
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 33/43 (76%), Positives = 33/43 (76%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPL 43
MP VKIIAKNFMDMVASL A LD LY N FICEAILR L L
Sbjct: 1 MPQVKIIAKNFMDMVASLPAMKLDALYHNTFICEAILRYLDLL 43
>gi|261187837|ref|XP_002620336.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis SLH14081]
gi|239593453|gb|EEQ76034.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis SLH14081]
gi|239608433|gb|EEQ85420.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Ajellomyces dermatitidis ER-3]
gi|327356043|gb|EGE84900.1| RNA pol II transcription initiation subunit [Ajellomyces
dermatitidis ATCC 18188]
Length = 485
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 143/432 (33%), Positives = 224/432 (51%), Gaps = 40/432 (9%)
Query: 12 MDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
MD + SL RLY+ P AI R LP LAK +VM + Y+ +PA +E WV P+
Sbjct: 9 MDYLESLPGTVFRRLYQQPSTALAIFRRMLPHLAKCFVMALLYLKDPLPAADLELWVKPE 68
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
+ A+ L +L + S + Y + F S+LR+ L G +PS
Sbjct: 69 SKRERDNALSILGRLHILSNTTTSDNVRAYTVTDPFASSLRQALTGGNKQQSFGVPSLTA 128
Query: 129 ARLP-TLEDLEAYAIGQWECFLLQLI--SSTQAERPTNFSSSMMKVFQR-GLLSRRDKEA 184
P T+ L+ YA QWE L ++ S+ ++ S + ++ Q L+ RD+
Sbjct: 129 DENPMTVALLDDYARSQWEGVLGYMVGTSALGVQQAVTLSKGVKQLLQACHLVEIRDRRV 188
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+T+ GF F+L D N Q+W+I+ Y+ N+++ G++ +++SFL LS G++Y
Sbjct: 189 -EITKDGFAFVLQDLNTQVWHILILYVENAEQIGMDSVEVLSFLFVLSSLELGQSYEKKH 247
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKE-NWFIPTKLATNL-----------SMSLTDSSAR 292
L+ Q + D D G+V +E F PT+LAT L S SLT S
Sbjct: 248 LTSTQLKTLADLTDFGIVYQHPPHQEATRFYPTRLATTLTSDSITLGSSISSSLTAPSGG 307
Query: 293 --------KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A
Sbjct: 308 PAAVSNEPGTGFIIIETNYRLYAYTSSPLQISLISLFTTLKYRFPNLITGKLTRQSIRRA 367
Query: 345 FENGITAEQIISFLQQNAHPRVAD-----------RMPSV-PENVCDQIRLWESDLNRVE 392
E GITA+QIIS+L +AHP++ +P+V P V DQIRLW+ + +R++
Sbjct: 368 VEMGITADQIISYLTTHAHPQMRKYHASKSGANPAGVPAVLPPTVVDQIRLWQLERDRIK 427
Query: 393 MTPAHYYDEFPS 404
TP + +F S
Sbjct: 428 ATPGFLFKDFVS 439
>gi|66800623|ref|XP_629237.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
discoideum AX4]
gi|74850716|sp|Q54C29.1|TF2H4_DICDI RecName: Full=General transcription factor IIH subunit 4; AltName:
Full=TFIIH basal transcription factor complex subunit 4
gi|60462610|gb|EAL60813.1| general transcription factor IIH, polypeptide 4 [Dictyostelium
discoideum AX4]
Length = 483
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 138/435 (31%), Positives = 238/435 (54%), Gaps = 52/435 (11%)
Query: 15 VASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTK 74
+ASL ++DL+ LY++P+ C+AILRSLPP +K+Y+++M +D P + ++W +
Sbjct: 10 LASLDSKDLEELYKDPWTCQAILRSLPPRSKQYILKMLLVD-TYPLSLAKDWSTQASIQQ 68
Query: 75 HRVAIDRLVQLRLFSEEK-----------------------------KKETTYRLNSTFQ 105
H+ ++ +L L++ +K + E T RLN FQ
Sbjct: 69 HKESLKKLFDLKIIFLDKINKPIQPQQQQSSQQSSSQQQQQQQQQQQQTEQTIRLNPLFQ 128
Query: 106 SNLRKHLIYGGALPREPMPS-GITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNF 164
N+++ L+ + S + P+++DL++Y+ QWE L L S +P+
Sbjct: 129 DNIKRSLVQVNQVIFSNNSSIKDNHKPPSIDDLDSYSKSQWEKVLYFL--SDDTVQPSKL 186
Query: 165 SSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQER----GIN 220
S ++ L++++ + +T GF+FLL D Q+W ++ Y+ + +++ +
Sbjct: 187 ISELL---LSSNLTKQEGDGLSITSEGFKFLLKDVYTQIWTLLIVYLDDLEKKKGKGSGS 243
Query: 221 QADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLAT 280
+ DL+SFL LSF G Y ++ LSE QK + GL+ ++ F PT+L
Sbjct: 244 RNDLLSFLFRLSFLNLGRGYLVSELSEQQKEYLFALKQFGLIYMRTD-SSILFYPTRLII 302
Query: 281 NLS----MSLTDS-------SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLP 329
+L+ +SL S + +++G+IV+ETN+R+YAY++S L +L LF K+ Y+LP
Sbjct: 303 SLTTGKTLSLIQSISSERTQTQKEQGYIVLETNYRLYAYTSSSLQISLLSLFVKMLYRLP 362
Query: 330 NLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLN 389
NL VG IT+ES+ A +GITA+QII F++ N+HP A+ +P+ V +QI LWE++ N
Sbjct: 363 NLAVGIITRESIRTALIHGITADQIIDFVRHNSHPNAANSGQPIPDVVAEQILLWEAERN 422
Query: 390 RVEMTPAHYYDEFPS 404
R+ T + Y+ FP+
Sbjct: 423 RITYTKSVLYNSFPT 437
>gi|295662078|ref|XP_002791593.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226279719|gb|EEH35285.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 484
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 145/431 (33%), Positives = 221/431 (51%), Gaps = 39/431 (9%)
Query: 12 MDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
MD + SL RLY+ P AI R LP LAK +VM + Y+ +PA +E WV P+
Sbjct: 9 MDYLESLPGTVFRRLYQQPSTALAIFRRMLPYLAKYFVMALLYLKDPLPAADLELWVKPE 68
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
+ A+ L +L + S + Y + F S+LR+ L G +PS +T
Sbjct: 69 SKRERDNALSILGRLHILSNATTTDHVRAYVVTEPFASSLRQALTGGDKQQSFGVPS-VT 127
Query: 129 --ARLPTLEDLEAYAIGQWECFLLQLI--SSTQAERPTNFSSSMMKVFQRGLLSRRDKEA 184
A L T+ L+ YA QWE L ++ S+ A++ S + ++ Q L
Sbjct: 128 PDANLMTVAQLDEYASRQWEGVLGYMVGTSALGAQQTVTLSKGVKQLLQACHLVEVHDRR 187
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+T+ GF F+L D N Q+W I+ Y+ N+++ G+ +++SFL LS G++Y
Sbjct: 188 VDITKEGFAFVLQDVNTQVWNILILYVENAEQIGLESVEVLSFLFVLSSLELGQSYEKKH 247
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKE-NWFIPTKLATNL-----------SMSLT----- 287
L+ Q + D D G+V +E F PT+L T L S SLT
Sbjct: 248 LTSTQLKTLTDLTDFGIVYQHFPPQEATRFYPTRLVTTLTSDSATLGSSISTSLTGPSGP 307
Query: 288 ---DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
+SS GFIV+ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A
Sbjct: 308 TAANSSEPSTGFIVIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRA 367
Query: 345 FENGITAEQIISFLQQNAHPRVADRMPS-----------VPENVCDQIRLWESDLNRVEM 393
E GITA+QIIS+L +AHP++ S +P V DQIRLW+ + +R++
Sbjct: 368 VEMGITADQIISYLTTHAHPQMRKYHASKSSHTVGAPAVLPPTVVDQIRLWQLERDRIKA 427
Query: 394 TPAHYYDEFPS 404
TP + +F S
Sbjct: 428 TPGFLFKDFVS 438
>gi|353240543|emb|CCA72408.1| probable TFB2-TFIIH subunit (transcription/repair factor)
[Piriformospora indica DSM 11827]
Length = 462
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 127/408 (31%), Positives = 215/408 (52%), Gaps = 19/408 (4%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
+D + S T L RLY P C AI+R L PL ++ VM + +++G + + + W+ +
Sbjct: 13 LIDFLHSQTPTTLQRLYTRPSACLAIIRLLSPLERQLVMSLLWLEGPVETQSITNWITLE 72
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPM---PSGI 127
G + + A+ L +L++ K++ L ++F+S LR L G + P
Sbjct: 73 GRSAYEHALKSLAKLQILPNSKEQ---ILLQASFKSGLRNGLTGSGQVASFGALVEPDND 129
Query: 128 TARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRL 187
LPT E L+ YA+ +W+ L +++S + S ++ + + L + +
Sbjct: 130 MGTLPT-EMLDNYAVERWDTILHFMVTSGTEQASARPSEGVLYLLEHSGLMSNEHGRRII 188
Query: 188 TESGFQFLLMDTNAQLWYIVREYISNSQER-GINQADLISFLLELSFHVAGEAYNLNTLS 246
T +GFQFLL +AQLW + Y+ ER ++ D++ F LS G+ Y+ +TLS
Sbjct: 189 TSAGFQFLLQSPHAQLWEFILSYLRMMAEREDMDMVDILGFFFMLSMTQPGQHYSTHTLS 248
Query: 247 EIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS----------MSLTDSSARKEGF 296
Q M+ D D GLV + F PT+ AT L+ +++ + + + F
Sbjct: 249 PTQLIMVSDLRDFGLVYFPSDTTTS-FQPTRFATTLTSYTSSFSDHDLTIENGADLSQEF 307
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
+V+ETN+ +YAY+ + L +L LF + + PN+I+G++T++S+ A NGITA+QI+S
Sbjct: 308 VVLETNYHVYAYTNNPLQIAVLNLFVSFKARFPNMIMGSLTRDSVKKALVNGITADQILS 367
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+L +AHP++ P +P V DQIRLWE + +RV+ + Y EF S
Sbjct: 368 YLVTHAHPQMRKNNPIIPVTVQDQIRLWELERHRVKGQDGYLYKEFAS 415
>gi|388857860|emb|CCF48525.1| probable TFB2-TFIIH subunit (transcription/repair factor) [Ustilago
hordei]
Length = 492
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 135/403 (33%), Positives = 212/403 (52%), Gaps = 29/403 (7%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
L RLYE P C AI R LP +A++ +M M ++D + A W+ + A+D+L
Sbjct: 54 LIRLYEKPASCLAIFRLLPMMARQLIMHMLFLDVPLAADDFIAWIKKEARKDFDAAVDKL 113
Query: 83 VQLRLFS-EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPT-LEDLEAY 140
+L + + + LN+ F +R+ L GG +P + + L+ Y
Sbjct: 114 SRLSIVQLKSTGGKQILLLNAVFTDGMRRALTGGGKHRSFGVPCDTEDKNAVDVAFLDQY 173
Query: 141 AIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPR-----------LTE 189
A +WE L ++ S + P V LL R + PR +T
Sbjct: 174 ARNKWETILHYMVGSENSSTPRE------PVLY--LLRRSNLMQPRPSSGSGSGGLNITS 225
Query: 190 SGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQ 249
GFQFLL D N QLW ++ +Y+ ++ER ++ ++++FL L G Y+ L E Q
Sbjct: 226 RGFQFLLEDVNTQLWDLLLQYLDMAEERNMDLVEVLAFLFMLGSLELGRDYSTEELPETQ 285
Query: 250 KSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS-----LTDSSARKE--GFIVVETN 302
M++DF D GLV Q+ F PT+LAT L+ S L+ + A +E G+I++ETN
Sbjct: 286 LHMLEDFRDYGLV-YQRKASSRRFYPTRLATTLTSSAAIPLLSSNGAEQEERGYIILETN 344
Query: 303 FRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 362
+R+YAY+++ L +L LF ++ + PNL+VG+IT++S+ +A NGITAEQII++L +A
Sbjct: 345 YRLYAYTSNPLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQIITYLTHHA 404
Query: 363 HPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
H ++ P +P V DQIRLWE + NRV + +F S+
Sbjct: 405 HVQMHRNDPLLPVTVSDQIRLWEREKNRVVQNLGSLFTDFTSQ 447
>gi|169784197|ref|XP_001826560.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus oryzae RIB40]
gi|238508739|ref|XP_002385554.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus flavus NRRL3357]
gi|83775305|dbj|BAE65427.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220688446|gb|EED44799.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus flavus NRRL3357]
gi|391868531|gb|EIT77745.1| RNA polymerase II transcription initiation protein [Aspergillus
oryzae 3.042]
Length = 480
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 142/432 (32%), Positives = 227/432 (52%), Gaps = 46/432 (10%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + SL +LY+ P AI R LP LAK +VM + Y+ +PA +E WV +
Sbjct: 12 DYLESLPGTTFYKLYQQPSTALAIFRRMLPDLAKCFVMALLYLKDPLPATDLEAWVKAES 71
Query: 72 FTKHRVAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGITA 129
+ A+ L +L + S + Y++ F S+LR+ L GA + G+
Sbjct: 72 LKERDNALSILGRLHILSNTVITDNVRAYKVTDPFASSLRQALT--GAEKTQSF--GVLW 127
Query: 130 RLP-----TLEDLEAYAIGQWECFLLQLISSTQ--AERPTNFSSSMMKVFQRG-LLSRRD 181
+ ++ +L+ YA QWE L ++ ++ +R N S + ++ Q G L+ RD
Sbjct: 128 HISDGAAVSIAELDEYARRQWEGVLGYMVGTSGLGIQRDANLSKGVKQLLQAGHLVEIRD 187
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYN 241
+ +T+ GF F+L D + Q+W+I+ Y+ ++ G++ +++SF+ LS G+AY+
Sbjct: 188 RRV-EITQDGFAFVLQDVSTQVWHILMLYVESADAIGMSSVEVLSFVFLLSSLELGKAYD 246
Query: 242 LNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM----------------S 285
L+ Q + D AD G+V Q + F PT+LAT L+ S
Sbjct: 247 KKHLTSNQLRTLTDLADFGIV-YQDSPDASHFYPTRLATTLTSDSSALSNPISGSLSGPS 305
Query: 286 LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAF 345
T S+ GFI+VETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A
Sbjct: 306 GTGSNKAGSGFIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAI 365
Query: 346 ENGITAEQIISFLQQNAHPRV------------ADRMPSV-PENVCDQIRLWESDLNRVE 392
E GITA+QIIS+L +AHP++ A PSV P V DQIRLW+ + +RV+
Sbjct: 366 EMGITADQIISYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVTDQIRLWQLERDRVK 425
Query: 393 MTPAHYYDEFPS 404
TP + +F S
Sbjct: 426 ATPGFLFKDFVS 437
>gi|71024427|ref|XP_762443.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
gi|46097692|gb|EAK82925.1| hypothetical protein UM06296.1 [Ustilago maydis 521]
Length = 491
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/396 (32%), Positives = 214/396 (54%), Gaps = 16/396 (4%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
L RLY+ P C AI R LP +A++ +M M ++D + A W+ + + A+D+L
Sbjct: 54 LIRLYQKPASCLAIFRLLPMMARQLIMHMLFLDVPLAADDFLAWIKKEVKNEFDAAVDKL 113
Query: 83 VQLRLFS-EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPT-LEDLEAY 140
+L + + + LN+ F +R+ L GG +P + + L+ Y
Sbjct: 114 SRLSIIQLKASAGKQMLLLNAGFTEGMRRALTGGGKHRSFGVPCDTEDKNAVDVAFLDQY 173
Query: 141 AIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPR----LTESGFQFLL 196
A +WE L ++ S + P +++ + L+ R + +T GFQFLL
Sbjct: 174 ARTKWETILHYMVGSDNSSTPREPVLYLLR--RSNLMQPRTASSSNGGLNITSRGFQFLL 231
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
D N QLW ++ +Y+ ++ER ++ ++++FL L G Y+ L E Q M++DF
Sbjct: 232 EDVNTQLWDLLLQYLDMAEERNMDLVEVLAFLFMLGSLELGRDYSTEELPETQLHMLEDF 291
Query: 257 ADLGLVKLQQGRKENWFIPTKLATNLSMS-----LTDSSARKE--GFIVVETNFRMYAYS 309
D GLV Q+ F PT+LAT L+ S L+ + +E G+I++ETN+R+YAY+
Sbjct: 292 RDYGLV-YQRKASSRRFYPTRLATTLTSSAAVPLLSSNGTEQEERGYIILETNYRLYAYT 350
Query: 310 TSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADR 369
++ L +L LF ++ + PNL+VG+IT++S+ +A NGITAEQII++L +AH ++
Sbjct: 351 SNPLRVAVLSLFVTIKARFPNLVVGSITRDSVKSALANGITAEQIITYLTHHAHLQMHRN 410
Query: 370 MPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
P +P V DQIRLWE + NRV+ + +F S+
Sbjct: 411 DPLLPVTVSDQIRLWEREKNRVQQNLGSLFTDFTSQ 446
>gi|225682227|gb|EEH20511.1| RNA polymerase II transcription factor B subunit 2
[Paracoccidioides brasiliensis Pb03]
gi|226289599|gb|EEH45083.1| TFIIH basal transcription factor complex p52 subunit
[Paracoccidioides brasiliensis Pb18]
Length = 484
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 143/431 (33%), Positives = 220/431 (51%), Gaps = 39/431 (9%)
Query: 12 MDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
MD + SL RLY+ P AI R LP LAK +VM + Y+ +PA +E WV +
Sbjct: 9 MDYLESLPGTVFRRLYQQPSTALAIFRRMLPYLAKCFVMALLYLKDPLPAADLELWVKSE 68
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
+ A+ L +L + S + Y + F S+LR+ L G +PS +T
Sbjct: 69 SKRERDNALSILGRLHILSNATTTDHVRAYVVTEPFASSLRQALTGGDKQQSFGVPS-VT 127
Query: 129 --ARLPTLEDLEAYAIGQWECFLLQLISST--QAERPTNFSSSMMKVFQRGLLSRRDKEA 184
A L T+ L+ YA QWE L ++ ++ A++ S + ++ Q L
Sbjct: 128 PDANLMTVAQLDEYASRQWEGVLGYMVGTSVLGAQQTVTLSKGVKQLLQACHLVEVHDRR 187
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+T+ GF F+L D N Q+W I+ Y+ N+++ G+ +++SFL LS G++Y
Sbjct: 188 VDITKEGFAFVLQDVNTQVWNILILYVENAEQIGLESVEVLSFLFVLSSLELGQSYEKKH 247
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKE-NWFIPTKLATNL-----------SMSLT----- 287
L+ Q + D D G+V +E F PT+L T L S SLT
Sbjct: 248 LTSTQLKTLTDLTDFGIVYQHFPPQEATRFYPTRLVTTLTSDSATLGSSISTSLTGPSGP 307
Query: 288 ---DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
+SS GFIV+ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A
Sbjct: 308 AAANSSEPSTGFIVIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRA 367
Query: 345 FENGITAEQIISFLQQNAHPRVADRMPS-----------VPENVCDQIRLWESDLNRVEM 393
E GITA+QIIS+L +AHP++ S +P V DQIRLW+ + +R++
Sbjct: 368 VEMGITADQIISYLTTHAHPQMRKYHASKSSHTVGAPAVLPPTVVDQIRLWQLERDRIKA 427
Query: 394 TPAHYYDEFPS 404
TP + +F S
Sbjct: 428 TPGFLFKDFVS 438
>gi|121714341|ref|XP_001274781.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus clavatus NRRL 1]
gi|119402935|gb|EAW13355.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus clavatus NRRL 1]
Length = 478
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 137/431 (31%), Positives = 223/431 (51%), Gaps = 44/431 (10%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + SL +LY+ P AI R LP LAK +VM + Y+ +PA +E WV PD
Sbjct: 10 DYLESLPGTVFIKLYQEPSTALAIFRRMLPDLAKCFVMALLYLKDPLPAADLETWVKPDS 69
Query: 72 FTKHRVAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGITA 129
+ A+ L +L + + + Y + F S+LR+ L + S ++
Sbjct: 70 LKERDNALSTLGRLHILTNTMTADNVRAYMVTKEFASSLRQALTGAEHTQSFGVVSNVSD 129
Query: 130 RLP-TLEDLEAYAIGQWECFLLQLISSTQ--AERPTNFSSSMMKVFQRG-LLSRRDKEAP 185
++ DL+ Y+ QWE L ++ + +R N S + ++ Q G L+ RD+
Sbjct: 130 EAAVSIADLDEYSRRQWEGVLGYMVGTGGLGMQRDVNLSKGVKQLLQAGHLVEIRDRRV- 188
Query: 186 RLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTL 245
+T+ GF F+L D Q+W+I+ Y+ +++ G++ +++ F+ LS G +Y +
Sbjct: 189 EITKDGFAFVLQDVGTQVWHILILYVESAEAIGMDSVEVLKFVFFLSSLELGRSYEKKQM 248
Query: 246 SEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA-------------- 291
+ Q + D AD G+V Q+ + F PT+LAT L+ +DSSA
Sbjct: 249 TSNQLRTLTDLADFGIV-YQESPEATHFYPTRLATTLT---SDSSALSNPIAGSLSGPTG 304
Query: 292 -----RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFE 346
GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E
Sbjct: 305 TASNKAGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSIRRAVE 364
Query: 347 NGITAEQIISFLQQNAHPRV----ADRMPS---------VPENVCDQIRLWESDLNRVEM 393
GITA+QIIS+L +AHP++ A R S +P V DQIRLW+ + +RV+
Sbjct: 365 MGITADQIISYLSTHAHPQMRKHNASRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKA 424
Query: 394 TPAHYYDEFPS 404
TP + +F S
Sbjct: 425 TPGFLFKDFVS 435
>gi|149238950|ref|XP_001525351.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
elongisporus NRRL YB-4239]
gi|146450844|gb|EDK45100.1| TFIIH basal transcription factor complex subunit 2 [Lodderomyces
elongisporus NRRL YB-4239]
Length = 503
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 140/442 (31%), Positives = 223/442 (50%), Gaps = 68/442 (15%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE P C AI R L +AK Y+M M + + + K + +W P A+ RL
Sbjct: 24 KLYEAPATCLAIYRLLSSMAKFYIMSMIFTEKPVSMKDLTKWCKPLAKKLEFDALKRLES 83
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDL---EAYA 141
L L E K T RLN+ F++N R L G+ S T P DL +++A
Sbjct: 84 LHLI-EFDSKGTHLRLNTIFRTNFRDCLT--GSQDPNAFGSVSTTADPNKVDLAFLDSFA 140
Query: 142 IGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDTN 200
+WE L ++ + + P+ S++K+ GL+ + +++ +T +GFQFLL D N
Sbjct: 141 SQKWESILHFMVGTEGTKPPSKSVLSLLKL--GGLMEQHEEDGTLNITNTGFQFLLQDVN 198
Query: 201 AQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLG 260
AQ+W ++ +Y++ +QE ++ D+++F+ L G +Y+++ LSE Q SM+ D D G
Sbjct: 199 AQIWTLLLQYLNLTQELNMDAVDVLNFIFILGSLELGNSYSVSNLSETQVSMLADLKDYG 258
Query: 261 LVKLQQGRKENWFIPTKLATNLS---------------------MSLTDSSARKEGFIVV 299
LV Q+ + F PT+LAT L+ SL S+ KE I++
Sbjct: 259 LV-YQKSDTSSRFYPTRLATTLTSDSAALKTPSMAVEQALDETETSLASSTQTKES-III 316
Query: 300 ETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQ 359
ETNF+MYAY+ S L IL LF ++ + N+I G IT+ES+ NA NGITA+QII FL+
Sbjct: 317 ETNFKMYAYTNSPLEIAILNLFVHMKTRFSNMICGQITRESIRNALYNGITADQIIKFLE 376
Query: 360 QNAHPRV------------------------------------ADRMPSVPENVCDQIRL 383
+AHP++ R+ +P NV DQI+L
Sbjct: 377 THAHPQMRALAKERLDKKVEFDASHNINTAGGAPQSKTDGAISQHRLEVLPPNVVDQIKL 436
Query: 384 WESDLNRVEMTPAHYYDEFPSR 405
W+ +L+R++ + + +F ++
Sbjct: 437 WQLELDRIQTFDGYLFKDFANQ 458
>gi|440636738|gb|ELR06657.1| hypothetical protein GMDG_00274 [Geomyces destructans 20631-21]
Length = 470
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 143/433 (33%), Positives = 227/433 (52%), Gaps = 37/433 (8%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIP 59
MP + + +++ + +T R RLYE P A+ R LP LAK +VM M Y+ +
Sbjct: 1 MPESSLASFEYLEKLPGVTFR---RLYEQPSTALAVFRRMLPHLAKCFVMGMLYMPKPLL 57
Query: 60 AKMMEEWVLPDGFTKHRVAIDRLVQLRLFSE---EKKKETTYRLNSTFQSNLRKHLIYGG 116
++ WV P G ++ A+ L +L + + + L F ++LR L GG
Sbjct: 58 LSELDNWVRPGGKSEKGKALAILSRLHIVTSTAASRDGPPALNLTHNFATSLRLALTGGG 117
Query: 117 ALPREPMPSGITARLP-TLEDLEAYAIGQWECFLLQLISSTQ------AERPTNFSSSMM 169
+PS T T+ L+ YA QWE L +++ST A+ P S+++
Sbjct: 118 DHGSFGIPSDSTQDSNITISFLDEYATAQWEGILHYVVNSTGDPSLLGAQYP---SANVR 174
Query: 170 KVFQRGLL--SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISF 227
++ Q G L S+ +T++GF FLL + NAQ+W ++ ++ +++E ++ DL+SF
Sbjct: 175 QLLQMGTLVDSKGKYSGVGITQAGFSFLLQEANAQVWTVLLLWLEHAEELAMDHVDLLSF 234
Query: 228 LLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT 287
L G AY+ TLS QKSM+ AD GLV PT+LAT L+ S +
Sbjct: 235 FFLLGSLELGLAYSTTTLSTTQKSMLPILADFGLVYSPPSSSSF--FPTRLATTLTSSAS 292
Query: 288 ----------------DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNL 331
SS GFIV+ETN+R+YAY++S L +L LF+K+ + PN+
Sbjct: 293 ALRSIASGFSAAAASPASSGGTTGFIVIETNYRLYAYTSSPLQIAVLALFAKLSTRYPNM 352
Query: 332 IVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRV 391
+ G +T++S+ A E+GITA+QIIS+L +AHP++ + P +P V DQIRLW+ + R+
Sbjct: 353 VSGRVTRDSVRGAIEHGITADQIISYLTTHAHPQLRRKSPVLPPTVVDQIRLWQIENERM 412
Query: 392 EMTPAHYYDEFPS 404
+ T + +F S
Sbjct: 413 KATAGFLFKDFAS 425
>gi|225555033|gb|EEH03326.1| RNA pol II transcription initiation subunit [Ajellomyces capsulatus
G186AR]
Length = 485
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 140/430 (32%), Positives = 223/430 (51%), Gaps = 40/430 (9%)
Query: 12 MDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
M+ + SL RLY+ P AI R LP LAK +VM + Y+ +PA +E WV +
Sbjct: 9 MEYLESLPGTVFRRLYQQPSTALAIFRRMLPYLAKCFVMALLYLKDPLPAADLELWVKTE 68
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
+ A+ L +L + S + Y + F S+LR+ L G +PS
Sbjct: 69 SKRERDNALSILSRLHILSNTTTSDNVRAYTVTDPFSSSLRQALTGGDKQQSFGVPSVTD 128
Query: 129 ARLP-TLEDLEAYAIGQWECFLLQLI--SSTQAERPTNFSSSMMKVFQR-GLLSRRDKEA 184
P T+ L+ YA QWE L ++ S+ ++ S + ++ Q L+ RD+
Sbjct: 129 DENPMTVAQLDHYARSQWEGVLGYMVGTSALGVQQAATLSKGVKQLLQACHLVEVRDRRV 188
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+T+ GF F+L D N Q+W+I+ Y+ N+++ G++ +++SFL LS G++Y
Sbjct: 189 -EITKDGFAFVLQDLNTQVWHILILYVENAEQIGMDSIEVLSFLFVLSSLELGQSYEKKH 247
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKE-NWFIPTKLATNL-----------SMSLT----- 287
L+ Q + D D G+V +E F PT+LAT L S SLT
Sbjct: 248 LTSTQLKTLADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLGSSISSSLTAPNGV 307
Query: 288 ---DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
S+ GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A
Sbjct: 308 PSASSNEPSTGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRA 367
Query: 345 FENGITAEQIISFLQQNAHPRVAD-----------RMPSV-PENVCDQIRLWESDLNRVE 392
E GITA+QIIS+L +AHP++ +P+V P V DQIRLW+ + +R++
Sbjct: 368 VEMGITADQIISYLTTHAHPQMRKYHATKAGANPVGVPTVLPPTVVDQIRLWQLERDRIK 427
Query: 393 MTPAHYYDEF 402
TP + +F
Sbjct: 428 ATPGFLFKDF 437
>gi|298705308|emb|CBJ48998.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 534
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 133/426 (31%), Positives = 220/426 (51%), Gaps = 45/426 (10%)
Query: 10 NFMDMVASLTARDLDRLY--------------ENPFICEAILRSLPPLAKKYVMQMFYID 55
N D + L LD+LY + C A+L+SLP LAK++VM++ +++
Sbjct: 22 NVFDYLERLPPDALDKLYGAAPPAADGQGGRSHGQWTCRALLQSLPQLAKQHVMRLLFVE 81
Query: 56 GAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYG 115
G + M++ WV + H ++ +LV LR+ + Y+LN F+ NL++ L
Sbjct: 82 GPVGKGMLKSWVKKEYQRVHAASVRKLVSLRVLLPSAGGQ-EYQLNPPFRENLQRALCAS 140
Query: 116 GALP-REPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR 174
P + +P PT+ +E QW+ L L+ S + R T + + + +
Sbjct: 141 DKTPWQGDVPLSADRSPPTVAKIERQMHSQWQDVLYFLVGSDEEGRSTPPARVVKFMEES 200
Query: 175 GLLSR-RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN-SQERGINQADLISFLLELS 232
GL+ R + A R+T+ G++F+L + + Q W +V I+ + + + +L++FL +LS
Sbjct: 201 GLMRPVRGRGALRITDKGYEFMLKEAHVQAWMVVHALINGYGRTQPGCRDELLAFLFQLS 260
Query: 233 FHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS--- 289
+ G+AY L L++ Q+ + +DF +LGL+ ++ K F PT +A NL + S
Sbjct: 261 YCKVGDAYPLGALTQTQRDLAQDFVELGLL-FKRKAKSTRFYPTSIAVNLIFGSSPSGDA 319
Query: 290 -------------------SARKEG----FIVVETNFRMYAYSTSKLHCEILRLFSKVEY 326
++E I+VETNF++ AY+ SKLH +L LF ++
Sbjct: 320 GGAGGGGTQRKPQPAGGLGGGKREDDTSIHIIVETNFQVIAYTRSKLHFAMLSLFLELRA 379
Query: 327 QLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWES 386
LPN IVGAIT+ES+ A GI Q++ FL+ +AHP V R P VPEN+ DQ+ LWE
Sbjct: 380 LLPNAIVGAITRESMRKALSTGIKGRQVLDFLKWHAHPVVRRRTPVVPENIADQVLLWER 439
Query: 387 DLNRVE 392
+ +R+E
Sbjct: 440 ERDRME 445
>gi|358057347|dbj|GAA96696.1| hypothetical protein E5Q_03367 [Mixia osmundae IAM 14324]
Length = 467
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 132/406 (32%), Positives = 218/406 (53%), Gaps = 26/406 (6%)
Query: 17 SLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHR 76
SL L RLY P A+ R LP A+ ++ + + IP + + + G + +
Sbjct: 24 SLPESTLSRLYALPSSSLAVFRLLPSTAQHLIVNALWQEHDIPQQDLHTYT-KRGEGRRQ 82
Query: 77 V-----AIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARL 131
+ A+ RL ++L + E T + L+ F+ +LRK L G + G AR
Sbjct: 83 LDTAIAALQRLHIVQLIAGEPTGNTVFALSHIFRKSLRKALTGAG----KDSSFGWFARS 138
Query: 132 PTLED------LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP 185
E+ L+ YA QW+ L L+ S ++++P+ ++ GLLS D++
Sbjct: 139 SRDENALSIPQLDEYATDQWDSLLHCLVGSERSQQPSKAVIDLL--VAAGLLSSGDRDTR 196
Query: 186 RLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTL 245
R+T GFQFLL D N+QLW ++ Y+ S++ G + ++I+ + ++ G Y+ TL
Sbjct: 197 RITSLGFQFLLEDVNSQLWSLLLHYLKLSEDAGTDLKEVIALVFQIGNQELGRVYSSETL 256
Query: 246 SEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS----LTDSSARKE-GFIVVE 300
+ +Q ++K F LGLV + K + PT+LA L+ L +A +E GF+++E
Sbjct: 257 NPLQLHILKTFGGLGLVYVY---KSGDYSPTRLAVTLTSGAPPLLKAGTAEEEQGFLILE 313
Query: 301 TNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 360
TN+R+YAY+ + L +L LF ++ + P L+VG IT+ES+ NGI ++QII+FL
Sbjct: 314 TNYRVYAYTQNPLQIAVLNLFVALKSRFPGLVVGMITRESIKAGLANGIKSDQIIAFLTA 373
Query: 361 NAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
+AHP++ + P +P V DQI+LWE + NRV+ P YD+F S+
Sbjct: 374 HAHPQMRKQEPLLPPTVVDQIKLWEREKNRVKTEPCFLYDDFRSQA 419
>gi|403163300|ref|XP_003323393.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375163998|gb|EFP78974.2| hypothetical protein PGTG_04930 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 887
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 130/440 (29%), Positives = 225/440 (51%), Gaps = 54/440 (12%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSL--------------------PPLAKKYVMQ 50
F++ + LT L RLY P C AI RSL P +A+ V+
Sbjct: 30 FLNALPKLT---LARLYNTPASCLAIFRSLTLSCLIDSSLIFCFSPSRLLPLIARHLVLN 86
Query: 51 MFYIDGAIPAKMMEEWVLPDGFTKHRVAID----RLVQLRLFSE-------------EKK 93
+ + D +I +E W + R+ ++ +L +L + +
Sbjct: 87 LLWSDHSIDKSQLELWADINHSESRRIHVEDSLTKLFRLNIVEQSSLDSSSSSFDDPRNH 146
Query: 94 KETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITAR-LPTLEDLEAYAIGQWECFLLQL 152
T LN +FQ N R+ L GG +P ++ + ++++L+ Y +WE L +
Sbjct: 147 HHLTLTLNRSFQQNFRRALTGGGNHCSFGVPCNTSSNAVVSIDELDRYGTEKWETILHYM 206
Query: 153 ISSTQAERPTNFSSSMMKVFQRGLLSRRDK---EAPRLTESGFQFLLMDTNAQLWYIVRE 209
+ S +P++ S++ Q GL++ D ++ ++T GF FLL D N QLW I+ +
Sbjct: 207 VGSRLPTKPSHNILSLLG--QSGLMTSSDPRSLQSLKITSKGFGFLLEDVNTQLWDILLQ 264
Query: 210 YISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRK 269
Y+ ++ G++ D+++ L L G+ Y+ + + Q +++D D GLV + +
Sbjct: 265 YLKMTEVNGLDVVDVLACLFMLGSLELGQEYSFSNWTPTQTQVLQDLVDYGLVLVSAPDR 324
Query: 270 ENWFIPTKLATNLSMSL-----TDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKV 324
F PT+LAT L+ + + + + GF+V+ETN+R+YAY+++ L +L LF +
Sbjct: 325 ---FYPTRLATTLTSTAPPLVSAERAQEEHGFLVLETNYRIYAYTSNPLQIAVLNLFLSL 381
Query: 325 EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLW 384
Y+ PNL+VGA+T+ES+ +A NGITA+Q+I +L +AHP++ P +P V DQIRLW
Sbjct: 382 RYRFPNLVVGAVTRESIKSALSNGITADQVIMYLHTHAHPQMRKLEPLLPPTVVDQIRLW 441
Query: 385 ESDLNRVEMTPAHYYDEFPS 404
E + NR+ + Y++F S
Sbjct: 442 ELEKNRIRAQEGYLYEDFKS 461
>gi|325089533|gb|EGC42843.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
capsulatus H88]
Length = 485
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 139/432 (32%), Positives = 225/432 (52%), Gaps = 40/432 (9%)
Query: 12 MDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
M+ + SL RLY+ P AI R LP LAK +VM + Y+ +PA +E WV +
Sbjct: 9 MEYLESLPGTVFRRLYQQPSTALAIFRRMLPYLAKCFVMALLYLKDPLPAADLELWVKAE 68
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
+ A+ L +L + S + Y + F S+LR+ L G +PS
Sbjct: 69 SKRERDNALSILGRLHILSNTTTSDNVRAYTVTDPFSSSLRQALTGGDKQQSFGVPSLTD 128
Query: 129 ARLP-TLEDLEAYAIGQWECFLLQLI--SSTQAERPTNFSSSMMKVFQR-GLLSRRDKEA 184
P T+ L+ YA QWE L ++ S+ ++ S + ++ Q L+ RD+
Sbjct: 129 DENPMTVAQLDHYARSQWEGVLGYMVGTSALGVQQAVTLSKGVKQLLQACHLVEVRDRRV 188
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+T+ GF F+L D N Q+W+I+ Y+ N+++ G++ +++SFL LS G++Y
Sbjct: 189 -EITKDGFAFVLQDLNTQVWHILILYVENAEQIGMDSIEVLSFLFVLSSLELGQSYEKKH 247
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKE-NWFIPTKLATNLSM----------------SLT 287
L+ Q + D D G+V +E F PT+LAT L+ +L
Sbjct: 248 LTSTQLKTLADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLSSSISSSLTAPNLV 307
Query: 288 DSSARKE---GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
S++ E GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A
Sbjct: 308 PSASSNEPSTGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRA 367
Query: 345 FENGITAEQIISFLQQNAHPRVAD-----------RMPSV-PENVCDQIRLWESDLNRVE 392
E GITA+QIIS+L +AHP++ +P+V P V DQIRLW+ + +R++
Sbjct: 368 VEMGITADQIISYLTTHAHPQMRKYHATKSGANPVGVPTVLPPTVVDQIRLWQLERDRIK 427
Query: 393 MTPAHYYDEFPS 404
TP + +F S
Sbjct: 428 ATPGFLFKDFVS 439
>gi|240279262|gb|EER42767.1| TFIIH basal transcription factor complex p52 subunit [Ajellomyces
capsulatus H143]
Length = 485
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 139/432 (32%), Positives = 225/432 (52%), Gaps = 40/432 (9%)
Query: 12 MDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
M+ + SL RLY+ P AI R LP LAK +VM + Y+ +PA +E WV +
Sbjct: 9 MEYLESLPGTVFRRLYQQPSTALAIFRRMLPYLAKCFVMALLYLKDPLPAADLELWVKAE 68
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
+ A+ L +L + S + Y + F S+LR+ L G +PS
Sbjct: 69 SKRERDNALSILGRLHILSNTTTSDNVRAYTVTDPFSSSLRQALTGGDKQQSFGVPSLTD 128
Query: 129 ARLP-TLEDLEAYAIGQWECFLLQLI--SSTQAERPTNFSSSMMKVFQR-GLLSRRDKEA 184
P T+ L+ YA QWE L ++ S+ ++ S + ++ Q L+ RD+
Sbjct: 129 DENPMTVAQLDHYARSQWEGVLGYMVGTSALGVQQAVTLSKGVKQLLQACHLVEVRDRRV 188
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+T+ GF F+L D N Q+W+I+ Y+ N+++ G++ +++SFL LS G++Y
Sbjct: 189 -EITKDGFAFVLQDLNTQVWHILILYVENAEQIGMDSIEVLSFLFVLSSLELGQSYEKKH 247
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKE-NWFIPTKLATNLSM----------------SLT 287
L+ Q + D D G+V +E F PT+LAT L+ +L
Sbjct: 248 LTSTQLKTLADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLSSSISSSLTAPNLV 307
Query: 288 DSSARKE---GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
S++ E GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A
Sbjct: 308 PSASSNEPSTGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRA 367
Query: 345 FENGITAEQIISFLQQNAHPRVAD-----------RMPSV-PENVCDQIRLWESDLNRVE 392
E GITA+QIIS+L +AHP++ +P+V P V DQIRLW+ + +R++
Sbjct: 368 VEMGITADQIISYLTTHAHPQMRKYHATKSGANPVGVPTVLPPTVVDQIRLWQLERDRIK 427
Query: 393 MTPAHYYDEFPS 404
TP + +F S
Sbjct: 428 ATPGFLFKDFVS 439
>gi|193633122|ref|XP_001945987.1| PREDICTED: general transcription factor IIH subunit 4-like
[Acyrthosiphon pisum]
Length = 471
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 120/385 (31%), Positives = 209/385 (54%), Gaps = 19/385 (4%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKH---RVAID 80
DRLY +P C + R LP + +++ +++ +I+ +P ++ WV P +++ + +
Sbjct: 42 DRLYSHPPTCLTVFRVLPDITQQFTLRILFIEQPVPQSVLSSWV-PANYSRELDESIEVA 100
Query: 81 RLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGG--ALPREPMPSGITARLPTLEDLE 138
+ + + + LNSTF+ L+ L+ GG +P M + AR ++ L+
Sbjct: 101 TNLHIWKLTSVSGGLKGWILNSTFKKKLKVALMGGGRSTVPNSDMTADPKAR--DIDFLD 158
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSSSMMKVF-QRGLLSRRDKEAPRLTESGFQFLLM 197
+YA +WEC L ++ S SS ++V GL+ R ++P +T +GFQFLL+
Sbjct: 159 SYAYERWECILHYMVGSKHE----GISSDAVRVLLNAGLMVRDTDDSPVITSTGFQFLLL 214
Query: 198 DTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFA 257
D Q+WY + Y+ + RG++ A ++FL ++ G Y + +SE ++ ++
Sbjct: 215 DMATQVWYFMLRYMETVESRGLDLAQCLTFLFQIHLGTLGWDYITDEMSENLQAFLQHLR 274
Query: 258 DLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR---KEGFIVVETNFRMYAYSTSKLH 314
+ GLV Q+ RK F PT+L + + +S R KE +IVVETNFR+YA + S L
Sbjct: 275 EFGLV-YQRKRKAGRFYPTRLVIEMGQGNSRTSERMKNKERYIVVETNFRIYAMTDSDLK 333
Query: 315 CEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVP 374
++ LF+ + Y+ PN+ G +T++S+ A +GITA QI+ FL + HP++ + +P
Sbjct: 334 VALVALFTHMLYRFPNMSAGILTRDSVRTALRSGITAAQIVRFLTVHTHPQMQE--CGMP 391
Query: 375 ENVCDQIRLWESDLNRVEMTPAHYY 399
+ V DQI LWE++ NR+ T Y
Sbjct: 392 QTVIDQIYLWENERNRLTYTDGVLY 416
>gi|159474228|ref|XP_001695231.1| predicted protein [Chlamydomonas reinhardtii]
gi|158276165|gb|EDP01939.1| predicted protein [Chlamydomonas reinhardtii]
Length = 435
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/416 (31%), Positives = 211/416 (50%), Gaps = 29/416 (6%)
Query: 10 NFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP 69
+F ++ L LY +P+ C A+ R+LPPLA+ YVM++ ++ PA ++ W
Sbjct: 2 DFAAYISGLPGPKRGALYRSPWTCLAVFRNLPPLAQVYVMRLLFVPSPFPADFVDSWAAK 61
Query: 70 DGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALP-------REP 122
+ H+ A+ L L + +EE+ + Q R+ G P R
Sbjct: 62 GAASGHKAALSALRGLDVLTEERGRSRA----RAGQGRGRRATGQGPKGPVELSQLMRGD 117
Query: 123 MPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR----GLLS 178
+P+ + P+LE+L +A+GQWE L L+ S+++ P +++V R G +
Sbjct: 118 VPAAAASCCPSLEELGEWAVGQWEALQLYLLGSSRS--PPQLPQ-LLRVRARTCEDGAAA 174
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQE-RGINQADLISFLLELSFHVAG 237
R A +T+ F +LL + QLW ++REYIS ++ G A +SFLL+L F G
Sbjct: 175 GRRSLA--VTQRSFHWLLQPGDRQLWAVLREYISGAEAASGDELAHTLSFLLQLGFRRVG 232
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARK---- 293
+ L ++ M A LG++ + QG LA + ++A
Sbjct: 233 QPCAWAELRPPEQRMAAHLAQLGVLAVFQGSAGAAGSGGALAARGGGAGGAAAAAAMDGG 292
Query: 294 ----EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
+ F++VE+N+R+Y Y+ S + +L LF K E LPNL VGAI ++S+ A GI
Sbjct: 293 GAGSDAFVIVESNYRVYVYTRSPVTIAVLELFVKREALLPNLFVGAIRRDSILAALARGI 352
Query: 350 TAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
TA++++++L HP +A R P VPE V DQIRLWE+ +NR++ PA Y+ SR
Sbjct: 353 TADELVAYLSARPHPSIAARCPVVPEVVSDQIRLWEASMNRLQAHPAVLYENMESR 408
>gi|358370616|dbj|GAA87227.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus kawachii IFO 4308]
Length = 478
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 142/429 (33%), Positives = 223/429 (51%), Gaps = 49/429 (11%)
Query: 17 SLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKH 75
SL +LY+ P AI R LP LAK +VM + Y+ +PA +E WV + +
Sbjct: 15 SLAGTVFYKLYQQPSTALAIFRRMLPDLAKCFVMALLYLKDPLPAADLEAWVKSESLKER 74
Query: 76 RVAIDRLVQLRLFSEEKKKET--TYRLNSTFQSNLRKHLIYG-------GALPREPMPSG 126
A+ L +L + S + Y++ F ++LR H + G G L P S
Sbjct: 75 DNALSILGRLHILSNTVTADNVRAYKVTDPFATSLR-HALTGADQTQSFGVLSHVPAGSE 133
Query: 127 ITARLPTLEDLEAYAIGQWECFLLQLISSTQAERP-TNFSSSMMKVFQRG-LLSRRDKEA 184
++ + DL+ YA QWE L ++ + P S + ++ Q G L+ RD+
Sbjct: 134 VS-----ISDLDEYARRQWEGVLGYMVGTGGMGAPQVTLSKGVKELLQAGHLVEIRDRRV 188
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+T+ GF F+L D + Q+W+I+ Y+ ++ ++ +++SF+ LS G++Y N
Sbjct: 189 -EITQDGFAFVLQDVSTQVWHILILYVESASAIKMDSVEVLSFVFLLSSLELGKSYEKNH 247
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL---SMSLTD------------- 288
LS Q M+ D AD G+V Q+ F PT+LAT L S +L++
Sbjct: 248 LSSKQLRMLTDLADFGIV-YQESPDAPRFYPTRLATTLTSDSSALSNPLLGSLSGPAGEA 306
Query: 289 SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENG 348
SS GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E G
Sbjct: 307 SSKAGTGFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMG 366
Query: 349 ITAEQIISFLQQNAHPRV------------ADRMPSV-PENVCDQIRLWESDLNRVEMTP 395
ITA+QIIS+L +AHP++ A PSV P V DQIRLW+ + +RV+ T
Sbjct: 367 ITADQIISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATS 426
Query: 396 AHYYDEFPS 404
+ +F S
Sbjct: 427 GFLFKDFVS 435
>gi|255721741|ref|XP_002545805.1| TFIIH basal transcription factor complex subunit 2 [Candida
tropicalis MYA-3404]
gi|240136294|gb|EER35847.1| TFIIH basal transcription factor complex subunit 2 [Candida
tropicalis MYA-3404]
Length = 494
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 137/434 (31%), Positives = 220/434 (50%), Gaps = 62/434 (14%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE P C +I R L P+AK Y+M M + + I + +++W P A+ RL
Sbjct: 24 KLYEAPATCLSIFRLLSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLES 83
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED--LEAYAI 142
L L E K + RL+STF+ N R L G P TA ++ L+ +A
Sbjct: 84 LHLI-EYDSKGSHIRLHSTFRKNFRDCLT-GSQNPNAFGSISTTADKYKVDIAFLDNFAS 141
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQ 202
+WE L ++ + P S S++ + + G L R+T SGFQFLL D NAQ
Sbjct: 142 SKWETILHFMVGTEATATP---SDSVLSLLKLGGLMEGPNNKLRITNSGFQFLLQDVNAQ 198
Query: 203 LWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLV 262
+W ++ +Y++ +QE ++ D+++F+ L G++Y ++TLSE Q SM+ D D GLV
Sbjct: 199 IWTLLLQYLNLTQELNMDPVDVLNFIFVLGSLELGKSYAVSTLSETQVSMLADLKDYGLV 258
Query: 263 KLQQGRKENWFIPTKLATNLSMSLTDSSARK---------------EGFIVVETNFRMYA 307
Q+ F PT+LAT L+ +DS+A K + I++ETNF++YA
Sbjct: 259 -YQRSDTSGRFYPTRLATTLT---SDSAALKTPAMAMDEEEQQVVTKESIIIETNFKLYA 314
Query: 308 YSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV- 366
Y+ S L IL LF +++ + N++ G IT+ES+ NA NGITA+QII FL+ +AH ++
Sbjct: 315 YTKSPLEIAILNLFVQLKTRFANMVCGQITRESIRNALYNGITADQIIKFLETHAHSQMR 374
Query: 367 -----------------------------------ADRMPSVPENVCDQIRLWESDLNRV 391
++ +P NV DQI+LW+ +L+R+
Sbjct: 375 ILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGSISQHKLEILPPNVVDQIKLWQLELDRI 434
Query: 392 EMTPAHYYDEFPSR 405
+ + + +F ++
Sbjct: 435 QTFDGYLFKDFANQ 448
>gi|294660113|ref|XP_002777731.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
gi|218511847|sp|Q6BGW8.2|TFB2_DEBHA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|199434471|emb|CAR66042.1| DEHA2G23254p [Debaryomyces hansenii CBS767]
Length = 515
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 138/459 (30%), Positives = 225/459 (49%), Gaps = 92/459 (20%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLYE P C +I R LPP+AK Y+M M + + + K +++W P T + D L +
Sbjct: 25 RLYEAPATCLSIYRLLPPIAKFYIMTMLFYETPVALKDLDKWCKP---TSRKFQFDSLKR 81
Query: 85 LRLFS--EEKKKETTYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLED 136
LR EE + RL+ TF+ N R L G L P ++ +
Sbjct: 82 LRALHLIEEDSSGSHIRLHPTFRQNFRDCLTGSQARNAFGNLSTTPDEQSVSIKF----- 136
Query: 137 LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRL--TESGFQF 194
L+++A+ +WE L ++ + + P+N S++K GL+ +K RL T SGFQF
Sbjct: 137 LDSFALQKWESILHFMVGTELSVTPSNSVLSLLK--SGGLMEGTNKGGNRLNITNSGFQF 194
Query: 195 LLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIK 254
LL D NAQ+W ++ +Y++ +Q+ ++ D+++F+ L G+ Y +++LSE Q SM+
Sbjct: 195 LLQDINAQIWTLLLQYLNLTQDLNMDPVDVLNFIFILGSLELGKGYTVSSLSETQISMLA 254
Query: 255 DFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARK--------------------- 293
D D GL+ Q+ F PT+LAT L+ +DS+A K
Sbjct: 255 DLRDYGLI-YQRSENSERFYPTRLATTLT---SDSAALKSPSMAMEQALESTTETEEQQN 310
Query: 294 ------EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFEN 347
+G I++ETNF++YAY+ S L IL LF ++ + N++ G IT+ES+ NA N
Sbjct: 311 LASNSNQGTIILETNFKIYAYTNSPLEIAILNLFVHLKTRFSNMVCGQITRESIRNALYN 370
Query: 348 GITAEQIISFLQQNAHPRVA---------------------------------------- 367
GITA+QII FL+ +AH ++
Sbjct: 371 GITADQIIKFLETHAHSQMKILAKERLDKKIEFDTSNNINTAGGAPQSQMMTNENGTTVA 430
Query: 368 -DRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
++ +P V DQI+LW+ +L+R++ + + +F S+
Sbjct: 431 QHKLEVLPPTVVDQIKLWQLELDRIQTFEGYLFKDFASQ 469
>gi|350639203|gb|EHA27557.1| transcription factor Tfb2 [Aspergillus niger ATCC 1015]
Length = 478
Score = 208 bits (529), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 141/429 (32%), Positives = 225/429 (52%), Gaps = 49/429 (11%)
Query: 17 SLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKH 75
SL +LY+ P AI R LP LAK +VM + Y+ +PA +E WV + +
Sbjct: 15 SLAGTVFYKLYQQPSTALAIFRRMLPDLAKCFVMALLYLKDPLPAADLEAWVKSESLKER 74
Query: 76 RVAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYG-------GALPREPMPSG 126
A+ L +L + S + Y++ F ++LR H + G G L + P+ S
Sbjct: 75 DNALSILGRLHILSNTVTADNVRAYKVTDPFATSLR-HALTGADQTQSFGVLSQVPVGSE 133
Query: 127 ITARLPTLEDLEAYAIGQWECFLLQLISSTQAERP-TNFSSSMMKVFQRG-LLSRRDKEA 184
++ + +L+ YA QWE L ++ + P S + ++ Q G L+ RD+
Sbjct: 134 VS-----IPELDEYARRQWEGVLGYMVGTGGMGGPQVTLSKGVKELLQAGHLVEIRDRRV 188
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+T+ GF F+L D + Q+W+I+ Y+ ++ ++ +++SF+ LS G++Y N
Sbjct: 189 -EITQDGFAFVLQDVSTQVWHILILYVESASAIKMDSVEVLSFVFLLSSLELGKSYEKNH 247
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL---SMSLTD------------- 288
LS Q M+ D AD G+V Q+ F PT+LAT L S +L++
Sbjct: 248 LSSKQLRMLTDLADFGIV-YQESPDAPRFYPTRLATTLTSDSSALSNPLLGSLSGPAGEA 306
Query: 289 SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENG 348
SS GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E G
Sbjct: 307 SSKAGTGFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMG 366
Query: 349 ITAEQIISFLQQNAHPRV------------ADRMPSV-PENVCDQIRLWESDLNRVEMTP 395
ITA+QIIS+L +AHP++ A PSV P V DQIRLW+ + +RV+ T
Sbjct: 367 ITADQIISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATS 426
Query: 396 AHYYDEFPS 404
+ +F S
Sbjct: 427 GFLFKDFVS 435
>gi|115388809|ref|XP_001211910.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195994|gb|EAU37694.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 460
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 141/430 (32%), Positives = 222/430 (51%), Gaps = 60/430 (13%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + L +LY+ P AI R LP LAK +VM + Y+ +PAK +E WV D
Sbjct: 10 DYLEGLPGTVFQKLYQQPSTALAIFRRMLPDLAKCFVMALLYLKDPLPAKDLELWVKSD- 68
Query: 72 FTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARL 131
L+ ++ + Y + + F ++LR+ L E G+ ++L
Sbjct: 69 ------------SLKTLADNVR---AYIVTNPFATSLRQALTG----TEETQSFGVLSQL 109
Query: 132 P-----TLEDLEAYAIGQWECFLLQLISSTQ--AERPTNFSSSMMKVFQRG-LLSRRDKE 183
P T+ DL+ YA QWE L ++ ++ +R S + ++ Q G L+ RD+
Sbjct: 110 PDHPPVTIADLDEYARRQWEGVLGYMVGTSGLGIQRDVTLSKGVKQLLQAGHLVEIRDRR 169
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
+T+ GF F+L D + Q+W+I+ Y+ +++ G++ +++SF+ LS G++Y
Sbjct: 170 V-EITQDGFAFVLQDVSTQVWHILILYVESAEAIGMDSVEVLSFVFLLSSLELGKSYEKQ 228
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL-----------SMSL-----T 287
L+ Q + D AD G+V Q F PT+LAT L S SL T
Sbjct: 229 HLTSNQLRTLTDLADFGIV-YQHSPDATHFYPTRLATTLTSDASSLSNPISGSLSGPDGT 287
Query: 288 DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFEN 347
S+ GFI+VETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E
Sbjct: 288 SSAKAGSGFIIVETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAIEM 347
Query: 348 GITAEQIISFLQQNAHPRV------------ADRMPSV-PENVCDQIRLWESDLNRVEMT 394
GITA+QIIS+L +AHP++ A PSV P V DQIRLW+ + +RV+ T
Sbjct: 348 GITADQIISYLATHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKAT 407
Query: 395 PAHYYDEFPS 404
+ +F S
Sbjct: 408 AGFLFKDFAS 417
>gi|241958748|ref|XP_002422093.1| TFIIH subunit, putative; tanscription initiation factor IIB,
putative [Candida dubliniensis CD36]
gi|223645438|emb|CAX40094.1| TFIIH subunit, putative [Candida dubliniensis CD36]
Length = 494
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 137/434 (31%), Positives = 220/434 (50%), Gaps = 62/434 (14%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE P C +I R L P+AK Y+M M + + I + +++W P A+ RL
Sbjct: 24 KLYEAPATCLSIFRLLSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLES 83
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPRE--PMPSGITARLPTLEDLEAYAI 142
L L E K + RL+S F+ N R L G P + + + + L+ +A
Sbjct: 84 LHLI-EYDSKGSHIRLHSIFRKNFRDCLT-GSQNPNAFGSISTTVDKHKVDIPFLDLFAS 141
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQ 202
+WE L ++ + P S S++ + + G L R+T SGFQFLL D NAQ
Sbjct: 142 QKWETILHFMVGTESTATP---SDSVLSLLKLGGLMEGPNNDLRITNSGFQFLLQDVNAQ 198
Query: 203 LWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLV 262
+W ++ +Y++ +QE ++ D+++F+ L G++Y++++LSE Q SM+ D D GLV
Sbjct: 199 IWTLLLQYLNLTQELNMDPVDVLNFIFILGSLELGKSYSVSSLSETQVSMLADLKDYGLV 258
Query: 263 KLQQGRKENWFIPTKLATNLSMSLTDSSARK---------------EGFIVVETNFRMYA 307
Q+ F PT+LAT L+ +DS+A K + I++ETNF++YA
Sbjct: 259 -YQRSDTSGRFYPTRLATTLT---SDSAALKTPSMAMDEEEQQVVSKESIIIETNFKIYA 314
Query: 308 YSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV- 366
Y+ S L IL LF + + N++ G IT+ES+ NA NGITA+QII FL+ +AHP++
Sbjct: 315 YTKSPLEIAILNLFVHFKTRFANMVCGQITRESIRNALYNGITADQIIKFLETHAHPQMR 374
Query: 367 -----------------------------ADRMPS------VPENVCDQIRLWESDLNRV 391
D M S +P NV DQI+LW+ +L+R+
Sbjct: 375 ILAKEKLDKKIEFDTSHNINTAGGAPQSKTDGMISQHKLEILPPNVVDQIKLWQLELDRI 434
Query: 392 EMTPAHYYDEFPSR 405
+ + + +F ++
Sbjct: 435 QTFDGYLFKDFSNQ 448
>gi|70982185|ref|XP_746621.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2) [Aspergillus fumigatus Af293]
gi|66844244|gb|EAL84583.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus fumigatus Af293]
gi|159122143|gb|EDP47265.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Aspergillus fumigatus A1163]
Length = 479
Score = 207 bits (528), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 135/430 (31%), Positives = 223/430 (51%), Gaps = 46/430 (10%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + SL ++LY+ P AI R LP LAK +VM + Y+ +PA +E WV P
Sbjct: 10 DYLESLPGTVFNKLYQQPSTALAIFRRMLPDLAKCFVMALLYLKDPLPAADLEAWVRPGS 69
Query: 72 FTKHRVAIDRLVQLRLFSEEKKKET--TYRLNSTFQSNLRKHLIYGGALPREPMPSGITA 129
+ A+ L +L + + + Y + F S+LR+ L GA + G+ +
Sbjct: 70 LKERDNALSILGRLHIMTSTTTSDNIKAYTVTDPFASSLRQALT--GA--EQTQSFGVLS 125
Query: 130 RLP-----TLEDLEAYAIGQWECFLLQLISSTQ--AERPTNFSSSMMKVFQRG-LLSRRD 181
++ ++ DL+ Y+ QWE L ++ ++ +R + S + ++ Q G L+ RD
Sbjct: 126 QISDKEAVSIADLDEYSRRQWEGVLGYMVGTSGLGMQRDVSLSKGVKELLQAGHLVEIRD 185
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYN 241
+ +T+ GF F L D Q+W+I+ Y+ ++ G++ +++SF+ LS G++Y
Sbjct: 186 RRV-EITQDGFAFALQDVGTQVWHILVLYVESAAAIGMDSVEVLSFVFFLSSLELGKSYE 244
Query: 242 LNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM----------------S 285
L+ Q + D AD G+V Q+ F PT+LAT L+ +
Sbjct: 245 KKQLTSNQLRTLTDLADFGIV-YQETPDATHFYPTRLATTLTSDSSTLSNPLAGSLSGPT 303
Query: 286 LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAF 345
T SS GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A
Sbjct: 304 GTSSSKAGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAV 363
Query: 346 ENGITAEQIISFLQQNAHPRVADR-----------MP--SVPENVCDQIRLWESDLNRVE 392
E GITA+QIIS+L +AHP++ MP +P V DQIRLW+ + +RV+
Sbjct: 364 EMGITADQIISYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVK 423
Query: 393 MTPAHYYDEF 402
T + +F
Sbjct: 424 ATHGFLFRDF 433
>gi|145248676|ref|XP_001400677.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Aspergillus niger CBS 513.88]
gi|134081344|emb|CAK41847.1| unnamed protein product [Aspergillus niger]
Length = 478
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 142/429 (33%), Positives = 225/429 (52%), Gaps = 49/429 (11%)
Query: 17 SLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKH 75
SL +LY+ P AI R LP LAK +VM + Y+ +PA +E WV + +
Sbjct: 15 SLAGTVFYKLYQQPSTALAIFRRMLPDLAKCFVMALLYLKDPLPAADLEAWVKSESLKER 74
Query: 76 RVAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYG-------GALPREPMPSG 126
A+ L +L + S + Y++ F ++LR H + G G L + +P+G
Sbjct: 75 DNALSILGRLHILSNTVTADNVRAYKVTDPFATSLR-HALTGADQTQSFGVLSQ--VPAG 131
Query: 127 ITARLPTLEDLEAYAIGQWECFLLQLISSTQAERP-TNFSSSMMKVFQRG-LLSRRDKEA 184
+P L++ YA QWE L ++ + P S + ++ Q G L+ RD+
Sbjct: 132 SEVSIPELDE---YARRQWEGVLGYMVGTGGMGGPQVTLSKGVKELLQAGHLVEIRDRRV 188
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+T+ GF F+L D + Q+W+I+ Y+ ++ ++ +++SF+ LS G++Y N
Sbjct: 189 -EITQDGFAFVLQDVSTQVWHILILYVESASAIKMDSVEVLSFVFLLSSLELGKSYEKNH 247
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL---SMSLTD------------- 288
LS Q M+ D AD G+V Q+ F PT+LAT L S +L++
Sbjct: 248 LSSKQLRMLTDLADFGIV-YQESPDAPRFYPTRLATTLTSDSSALSNPLLGSLSGPAGEA 306
Query: 289 SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENG 348
SS GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E G
Sbjct: 307 SSKAGTGFIIIETNYRVYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEMG 366
Query: 349 ITAEQIISFLQQNAHPRV------------ADRMPSV-PENVCDQIRLWESDLNRVEMTP 395
ITA+QIIS+L +AHP++ A PSV P V DQIRLW+ + +RV+ T
Sbjct: 367 ITADQIISYLSTHAHPQMRKHNVSRSTSNQAGIPPSVLPPTVVDQIRLWQLERDRVKATS 426
Query: 396 AHYYDEFPS 404
+ +F S
Sbjct: 427 GFLFKDFVS 435
>gi|67541703|ref|XP_664619.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
gi|40742471|gb|EAA61661.1| hypothetical protein AN7015.2 [Aspergillus nidulans FGSC A4]
gi|259483674|tpe|CBF79257.1| TPA: TFIIH and nucleotide excision repair factor 3 complexes
subunit (Tfb2), putative (AFU_orthologue; AFUA_4G04360)
[Aspergillus nidulans FGSC A4]
Length = 482
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 139/437 (31%), Positives = 227/437 (51%), Gaps = 51/437 (11%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + SL +LY+ P AI R LP LAK +VM + Y+ +PA +E WV +
Sbjct: 9 DYLESLPGTVFFKLYQQPSTALAIFRRMLPDLAKCFVMALLYLKDPLPAADLETWVRSES 68
Query: 72 ----FTKHR-VAIDRLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMP 124
F + R A+ L +L + + Y + F ++LR+ L GA +E
Sbjct: 69 LRLSFLRERDSALSILGRLHILTNTTTSGNVRAYMVTDPFAASLRQALT--GA--KETQS 124
Query: 125 SGITARLPT-----LEDLEAYAIGQWECFLLQLISSTQ--AERPTNFSSSMMKVFQRG-L 176
G+ +P + DL+ YA QWE L ++ ++ +R N S + ++ Q G L
Sbjct: 125 FGVLHHIPDDQTVPIHDLDEYARRQWEGVLGYMVGTSGLGIQRDVNLSKGVKQLLQAGHL 184
Query: 177 LSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVA 236
+ RD+ +T+ GF F+L D Q+W+I+ Y+ +++ G++ +++SF+ LS
Sbjct: 185 VEIRDRRV-EITQDGFAFVLQDVGTQVWHILILYVESAEAIGMDSVEVLSFIFLLSSLEL 243
Query: 237 GEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT--------- 287
G++Y L+ Q + D AD G+V Q + + F PT+LAT L+ +
Sbjct: 244 GKSYEKKHLTSNQLRTLTDLADFGIV-YQDSPEASHFYPTRLATTLTSDSSALSNPISGA 302
Query: 288 -------DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKES 340
DS+ GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G +T++S
Sbjct: 303 LSDPDGGDSNQPGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKVTRQS 362
Query: 341 LYNAFENGITAEQIISFLQQNAHPRV------------ADRMPSV-PENVCDQIRLWESD 387
+ A E GITA+QIIS+L +AHP++ A PSV P V DQIRLW+ +
Sbjct: 363 IRRAIEMGITADQIISYLATHAHPQMRKHNVARSTSNQAGMPPSVLPPTVVDQIRLWQLE 422
Query: 388 LNRVEMTPAHYYDEFPS 404
+RV+ T + +F S
Sbjct: 423 RDRVKATAGFLFKDFVS 439
>gi|448103836|ref|XP_004200137.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
gi|359381559|emb|CCE82018.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
Length = 512
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 141/451 (31%), Positives = 220/451 (48%), Gaps = 79/451 (17%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLYE P C +I R LPPLAK Y+M M + + I K M +W P A+ RL
Sbjct: 25 RLYEAPATCLSIYRLLPPLAKFYIMSMLFYEHPIALKDMSKWCKPTAKKIQFEALKRLRS 84
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLEDLE 138
L L EE K + +L+ TF+ N R L G L P + I + L+
Sbjct: 85 LHLI-EEHNKGSHLKLHQTFRKNFRDCLTGSQTSNAFGNLCTSPDKNHIDVKF-----LD 138
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP--RLTESGFQFLL 196
+A +WE L ++ + + P+ S++K GL+ A ++T GFQFLL
Sbjct: 139 MFASQKWESILHFMVGTELSVAPSKSVLSLLK--SGGLMEGPGTSASNLKITNKGFQFLL 196
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
D N Q+W ++ +Y+S +Q+ ++ D+++F+ L G++Y +++LSE Q SM+ D
Sbjct: 197 QDVNVQIWTLLLQYLSLTQDLNMDPVDVLNFIFVLGCLELGKSYTVSSLSETQISMLADL 256
Query: 257 ADLGLVKLQQGRKENWFIPTKLATNL----------SMSLT------DSSARKE------ 294
D GL+ KE F PT+LAT L SM+L +S A +E
Sbjct: 257 KDYGLIYQNHNDKER-FYPTRLATTLTSEAAALRTPSMALNQVVDTANSEATEEQMFSNT 315
Query: 295 -GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
G I++ETNF++YAY+ S L IL LF ++ + PN++ G IT+ES+ A NGITA+Q
Sbjct: 316 GGSIILETNFKIYAYTNSPLEIAILNLFVHLKTRFPNMVTGQITRESIRKALFNGITADQ 375
Query: 354 IISFLQQNAHPRVAD---------------------------------------RMPSVP 374
II FL+ +AH ++ ++ +P
Sbjct: 376 IIKFLESHAHSQMKQMAKENLDKKIEFDANNNINTATDNQQLSVGEFGSANSYHKLEVLP 435
Query: 375 ENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
V DQI+LW+ +L+R++ + + +F S+
Sbjct: 436 PTVVDQIKLWQLELDRIQTFEGYLFKDFSSQ 466
>gi|260945437|ref|XP_002617016.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
gi|238848870|gb|EEQ38334.1| hypothetical protein CLUG_02460 [Clavispora lusitaniae ATCC 42720]
Length = 511
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 139/452 (30%), Positives = 225/452 (49%), Gaps = 80/452 (17%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LY+ P C +I R LPPLAK Y++ M + D + K +E+W P + RL
Sbjct: 24 KLYDAPATCLSIFRLLPPLAKFYIVSMVFTDKPVAVKDLEKWCQPQARKLQYESFKRLRA 83
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGI----TARLPT-LEDLEA 139
L L EE + T RL+ TF++N R L A P GI + + P ++ L+
Sbjct: 84 LHLI-EEDQSGTHVRLHQTFRNNFRDCLTGSQA----PNAFGIIVPGSDKHPIDVKFLDQ 138
Query: 140 YAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP---RLTESGFQFLL 196
+A +WE L ++ + E PT S S++ + + G L + ++T SGFQFLL
Sbjct: 139 FASQKWETILHFMVGT---ELPTVPSKSVLSLLKSGGLMEGHGTSAYSLKITNSGFQFLL 195
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
D N Q+W ++ EY++ +QE ++ D+++F+ L G+AY +++LS+ Q SM+ D
Sbjct: 196 QDVNVQIWTLLLEYLNLTQELNMDPVDVLNFIFILGSLELGKAYAVSSLSDTQLSMLPDL 255
Query: 257 ADLGLVKLQQGRKENWFIPTKLATNLS--------------MSLTDSSARKE-------- 294
D GLV Q+ F PT+LAT L+ +L D A+ +
Sbjct: 256 RDYGLV-YQRSENSTRFFPTRLATTLTSESTGLKTPSMALNQTLEDPGAKDQEHVTSEAG 314
Query: 295 ----GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGIT 350
G I++ETNF++YAY+ S L IL LF ++ + N+++G IT+ES+ A NGIT
Sbjct: 315 ESQSGQIILETNFKLYAYTNSPLEIAILNLFVNLKVRFANMVIGQITRESIRKALYNGIT 374
Query: 351 AEQIISFLQQNAHPRV-------------------------------------ADRMPSV 373
A+QII FL+ +AH ++ R+ +
Sbjct: 375 ADQIIKFLETHAHSQMRTSAKEKLDKKIEFDASHNINTAGGAPQSKSSDSGVAQHRLEIL 434
Query: 374 PENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
P NV DQI+LW+ +L+R++ + + +F ++
Sbjct: 435 PPNVVDQIKLWQLELDRIQTFEGYLFKDFKNQ 466
>gi|119486893|ref|XP_001262366.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Neosartorya fischeri NRRL 181]
gi|119410523|gb|EAW20469.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Neosartorya fischeri NRRL 181]
Length = 468
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 135/428 (31%), Positives = 219/428 (51%), Gaps = 53/428 (12%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + SL ++LY+ P AI R LP LAK +VM + Y+ +PA +E WV PD
Sbjct: 10 DYLESLPGTVFNKLYQQPSTALAIFRRMLPDLAKCFVMALLYLKDPLPAADLEAWVRPDS 69
Query: 72 FTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARL 131
H + + Y + F S+LR+ L GA + G+ +++
Sbjct: 70 LKLH---------IMTSTTTSDNVKAYTVTDPFASSLRQALT--GA--EQTQSFGVLSQI 116
Query: 132 P-----TLEDLEAYAIGQWECFLLQLISSTQ--AERPTNFSSSMMKVFQRG-LLSRRDKE 183
++ DL+ YA QWE L ++ ++ +R + S + ++ Q G L+ RD+
Sbjct: 117 SDEEAVSIADLDEYARRQWEGVLGYMVGTSGLGMQRDVSLSKGVKELLQAGHLVEIRDRR 176
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
+T+ GF F+L D Q+W+I+ Y+ ++ G++ +++SF+ LS G++Y
Sbjct: 177 V-EITQDGFAFVLQDVGTQVWHILVLYVESAAAIGMDSVEVLSFVFFLSSLELGKSYEKK 235
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM----------------SLT 287
L+ Q + D AD G+V Q+ F PT+LAT L+ + T
Sbjct: 236 QLTSNQLRTLTDLADFGIV-YQETPDATHFYPTRLATTLTSDSSTLSNPLAGSLSGPTGT 294
Query: 288 DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFEN 347
SS GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G IT++S+ A E
Sbjct: 295 SSSKAGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLITGKITRQSVRRAVEM 354
Query: 348 GITAEQIISFLQQNAHPRVADR-----------MP--SVPENVCDQIRLWESDLNRVEMT 394
GITA+QIIS+L +AHP++ MP +P V DQIRLW+ + +RV+ T
Sbjct: 355 GITADQIISYLSTHAHPQMRKHNVSRSTSNQAGMPVSVLPPTVVDQIRLWQLERDRVKAT 414
Query: 395 PAHYYDEF 402
+ +F
Sbjct: 415 HGFLFRDF 422
>gi|68490740|ref|XP_710812.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
gi|68490763|ref|XP_710801.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
gi|46432046|gb|EAK91553.1| hypothetical protein CaO19.5846 [Candida albicans SC5314]
gi|46432058|gb|EAK91564.1| hypothetical protein CaO19.13268 [Candida albicans SC5314]
gi|238879997|gb|EEQ43635.1| TFIIH basal transcription factor complex subunit 2 [Candida
albicans WO-1]
Length = 494
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 138/432 (31%), Positives = 219/432 (50%), Gaps = 58/432 (13%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE P C +I R L P+AK Y+M M + + I + +++W P A+ RL
Sbjct: 24 KLYEAPATCLSIFRLLSPMAKFYIMSMIFNEKPIALRDLDKWCKPSARKLEFEALKRLES 83
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPRE--PMPSGITARLPTLEDLEAYAI 142
L L E K + RL+S F+ N R L G P + + + + L+ +A
Sbjct: 84 LHLI-EYDSKGSHVRLHSIFRKNFRDCLT-GSQNPNAFGSISTTVDKHKVDIPFLDLFAS 141
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQ 202
+WE L ++ + P S S++ + + G L R+T SGFQFLL D NAQ
Sbjct: 142 QKWETILHFMVGTESTATP---SDSVLSLLKLGGLMEGPNNDLRITNSGFQFLLQDVNAQ 198
Query: 203 LWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLV 262
+W ++ +Y++ +QE ++ D+++F+ L G++Y++ +LSE Q SM+ D D GLV
Sbjct: 199 IWTLLLQYLNLTQELNMDPVDVLNFIFVLGSLELGKSYSVLSLSETQVSMLADLKDYGLV 258
Query: 263 KLQQGRKENWFIPTKLATNL----------SMSLTDSS---ARKEGFIVVETNFRMYAYS 309
Q+ F PT+LAT L SM++ + A KE I++ETNF++YAY+
Sbjct: 259 -YQRSDTSGRFYPTRLATTLTSDSAALKTPSMAMDEEEQQVATKES-IIIETNFKIYAYT 316
Query: 310 TSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV--- 366
S L IL LF + + N++ G IT+ES+ NA NGITA+QII FL+ +AHP++
Sbjct: 317 KSPLEIAILNLFVHFKTRFANMVCGQITRESIRNALYNGITADQIIKFLETHAHPQMRIL 376
Query: 367 ---------------------------ADRMPS------VPENVCDQIRLWESDLNRVEM 393
D M S +P NV DQI+LW+ +L+R++
Sbjct: 377 AKEKLDKKIEFDTSHNINTAGGAPQSKTDGMVSQHKLEILPPNVVDQIKLWQLELDRIQT 436
Query: 394 TPAHYYDEFPSR 405
+ + +F ++
Sbjct: 437 FDGYLFKDFSNQ 448
>gi|347830304|emb|CCD46001.1| similar to RNA polymerase II transcription factor B subunit 2
[Botryotinia fuckeliana]
Length = 471
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 126/408 (30%), Positives = 219/408 (53%), Gaps = 29/408 (7%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
RLY+ P AI R LP LAK +VM + Y+ +P +++WV + A+ L
Sbjct: 22 RLYQQPSTSLAIFRRMLPHLAKTFVMALLYMPKPLPLIALDQWVQSKSKKQKDQALSLLS 81
Query: 84 QLRLF---SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSG-ITARLPTLEDLEA 139
+L + + ++ T L F ++LR L GG +PS A ++ L+
Sbjct: 82 RLHIVDITALSREDPQTVALTKNFGASLRLALTGGGNHQSFGVPSSDHIAPHVDIDFLDT 141
Query: 140 YAIGQWEC---FLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLL 196
+A QWE +++ ++S + ++S+ + G L R + +T++GF FLL
Sbjct: 142 HARMQWEGILHYMVNTVTSGSGKDGNGPANSVKALLDAGKLVTRGR-GIGITQAGFSFLL 200
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
+ NAQ+W ++ +I N++ G++ D++SFL L G AY+ TL+E QK M+ +
Sbjct: 201 QEANAQVWTLLLLWIENAESMGMDSVDVLSFLFMLGSLELGRAYSTKTLTEAQKGMLANL 260
Query: 257 ADLGLVKLQQGRKENWFIPTKLATNLSM-------------SLTDSSARKEGFIVVETNF 303
DLGL+ L +F PT+LAT L+ + + S+A ++GFI++ETN+
Sbjct: 261 IDLGLIYLPPSAPTQFF-PTRLATTLTSDASALRTVAAGFDAASKSAASQKGFIIIETNY 319
Query: 304 RMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 363
R+YAY+ S L +L LF+K+ + PN++ G ++++S+ A +GIT++QII++L +AH
Sbjct: 320 RLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAIAHGITSDQIITYLSTHAH 379
Query: 364 PRV------ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
P++ + P +P V DQIRLW+ + R++ P +F ++
Sbjct: 380 PQLVKASSASHGGPVLPPTVVDQIRLWQLENERMKAVPGFLMKDFETQ 427
>gi|448100135|ref|XP_004199281.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
gi|359380703|emb|CCE82944.1| Piso0_002710 [Millerozyma farinosa CBS 7064]
Length = 509
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 141/451 (31%), Positives = 220/451 (48%), Gaps = 79/451 (17%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLYE P C +I R LPPLAK Y+M M + + I K M +W P A+ RL
Sbjct: 25 RLYEAPATCLSIYRLLPPLAKFYIMSMLFYEHPIALKDMSKWCNPTAKKIQFEALKRLRS 84
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLEDLE 138
L L EE K + +L+ TF+ N R L G L P + I + L+
Sbjct: 85 LHLI-EEHNKGSHLKLHQTFRKNFRDCLTGSQTSNAFGNLCTSPDKNHIDVKF-----LD 138
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP--RLTESGFQFLL 196
+A +WE L ++ + + P+ S++K GL+ A ++T GFQFLL
Sbjct: 139 MFASQKWESILHFMVGTELSVAPSKSVLSLLK--SGGLMEGPGTSASNLKITNKGFQFLL 196
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
D N Q+W ++ +Y+S +Q+ ++ D+++F+ L G++Y +++LSE Q SM+ D
Sbjct: 197 QDVNVQIWTLLLQYLSLTQDLNMDPVDVLNFIFVLGCLELGKSYTVSSLSETQISMLADL 256
Query: 257 ADLGLVKLQQGRKENWFIPTKLATNL----------SMSLT------DSSARKE------ 294
D GL+ KE F PT+LAT L SM+L +S A +E
Sbjct: 257 KDYGLIYQNHNDKER-FYPTRLATTLTSEAAALRTPSMALNQVVDTANSEATEEQMFSNT 315
Query: 295 -GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
G I++ETNF++YAY+ S L IL LF ++ + PN++ G IT+ES+ A NGITA+Q
Sbjct: 316 GGSIILETNFKIYAYTNSPLEIAILNLFVHLKTRFPNMVTGQITRESIRKALFNGITADQ 375
Query: 354 IISFLQQNAHPRVAD---------------------------------------RMPSVP 374
II FL+ +AH ++ ++ +P
Sbjct: 376 IIKFLESHAHSQMKQMAKENLDKKIEFDANNNINTATDSQQLSVGEFGSASSYHKLEILP 435
Query: 375 ENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
V DQI+LW+ +L+R++ + + +F S+
Sbjct: 436 PTVVDQIKLWQLELDRIQTFEGYLFKDFSSQ 466
>gi|344303892|gb|EGW34141.1| RNA polymerase II transcription factor B subunit 2 [Spathaspora
passalidarum NRRL Y-27907]
Length = 519
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 134/454 (29%), Positives = 230/454 (50%), Gaps = 81/454 (17%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE P C +I R L P+AK ++M M + + + + +++W +P A+ +L
Sbjct: 25 KLYEAPATCLSIYRLLSPMAKFFIMSMIFNEKPVALRDLDKWCMPSARKLEYEALKQLES 84
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPRE--PMPSGITARLPTLEDLEAYAI 142
L L E K + RL+STF+ N R L G P + + + + L+++A
Sbjct: 85 LHLI-EYDSKASHIRLHSTFRQNFRDCLT-GSQDPTAFGSISTTVDKHQVDVAFLDSFAS 142
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP--RLTESGFQFLLMDTN 200
+WE L ++ + P+N S++++ GL+ + A ++T +GFQFLL D N
Sbjct: 143 SKWETILHFMVGTESTATPSNSVLSLLRL--GGLMEGPGRGASNLKITNTGFQFLLQDVN 200
Query: 201 AQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLG 260
AQ+W ++ +Y++ +QE ++ D+++F+ L G++Y +++LSE Q SM+ D D G
Sbjct: 201 AQIWTLLLQYLNLTQELNMDPVDVLNFIFVLGSLELGKSYAVSSLSETQVSMLADLKDYG 260
Query: 261 LVKLQQGRKENWFIPTKLATNLSMSLTDSSARK--------------------------- 293
LV ++ + F PT+LAT L+ +DSSA K
Sbjct: 261 LV-YRRTDTSSRFYPTRLATTLT---SDSSALKTPAMAVEQALDSVEGGTDNEVSQTAPP 316
Query: 294 ---EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGIT 350
+G +++ETNF++YAY+ S L IL LF + + N++ G IT+ES+ NA NGIT
Sbjct: 317 GNIQGTVIIETNFKLYAYTNSPLEIAILNLFVHLRTRFANMVCGQITRESIRNALYNGIT 376
Query: 351 AEQIISFLQQNAHPRV---------------------------------AD------RMP 371
A+QII FL+ +AHP++ AD ++
Sbjct: 377 ADQIIRFLETHAHPQMKVLAKEKLDKKIEFDTSHNINTAGGAPQSQALRADGSVAQHKLE 436
Query: 372 SVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+P NV DQI+LW+ +L+R++ + + +FP++
Sbjct: 437 ILPPNVVDQIKLWQLELDRIQTFDGYLFKDFPNQ 470
>gi|402866404|ref|XP_003897374.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial,
partial [Papio anubis]
Length = 1647
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 132/416 (31%), Positives = 212/416 (50%), Gaps = 43/416 (10%)
Query: 30 PFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFS 89
P L+ P L K +VM+M +++ +P + WV + + L LR++
Sbjct: 84 PCHLSGCLQGAPILGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWH 143
Query: 90 EE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMPSGITARLPTLEDLEAYAIGQWE 146
+ LN F+ NLR L+ GG A + G + L+ YA +WE
Sbjct: 144 TQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWE 203
Query: 147 CFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRRDKEAPRLTESGFQFLLMDTNAQLWY 205
L ++ S A + + + Q GL+ S E P +T +GFQFLL+DT AQLWY
Sbjct: 204 VVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWY 260
Query: 206 IVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQ 265
+ +Y+ +Q RG++ +++SFL +LSF G+ Y++ +S+ + ++ + GLV Q
Sbjct: 261 FMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLV-FQ 319
Query: 266 QGRKENWFIPTKLATNLSMSLTDSSA--RKEGFIVVETNFRMYAY--------------- 308
+ RK + PT+LA NLS ++ + + GFIVVETN+R+YAY
Sbjct: 320 RKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEVGQSAPEEKKEG 379
Query: 309 ------------------STSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGIT 350
S S+L ++ LFS++ Y+ PN++V +T+ES+ A +GIT
Sbjct: 380 LEGVWVWGWPPHPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGIT 439
Query: 351 AEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
A+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 440 AQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 495
>gi|392594381|gb|EIW83705.1| transcription factor Tfb2 [Coniophora puteana RWD-64-598 SS2]
Length = 393
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 126/379 (33%), Positives = 204/379 (53%), Gaps = 35/379 (9%)
Query: 7 IAKN----FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKM 62
IA+N + + S + L RLY+ P C +I R L PL+K+ VM + ++D + A
Sbjct: 13 IAQNSPHVLLPFLQSQSQNALTRLYQRPSSCLSIFRLLAPLSKQLVMNLLWLDSPVLAGT 72
Query: 63 MEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREP 122
M WV+ DG + A++ L++L + K LN TF++++R+ + G
Sbjct: 73 MAAWVVLDGKKLYDEALNTLMRLHILQSSGAK---LALNPTFKASMRQAITCSGTTGSFG 129
Query: 123 MPSGITAR--LPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSR 179
+P+ + P++E L+ YA+ +WE L ++SS Q + PT + ++ + QR GL++
Sbjct: 130 VPAQRDDKRDAPSVETLDGYAVQRWETILHYMVSSGQGQYPTKPTQGVLYLLQRSGLMAS 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
++T SGFQFLL QLW ++ +Y+ +ER ++ +++SF LS G
Sbjct: 190 YHGSTLQITSSGFQFLLYSPRDQLWDLLLQYLHMVEERQMDLVEVLSFFFMLSTMELGRE 249
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM--------------- 284
Y+ LSE QK+M++D D GLV Q+ F PT+LAT L+
Sbjct: 250 YSTEPLSETQKAMLEDLRDYGLV-WQRKASSKRFSPTRLATTLTSVSPSLPTAGGSRNPG 308
Query: 285 ------SLTDSSA---RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
S+ +S+ GFIV+ETN+R+YAY+ + L +L LF ++Y+ PNL+VG
Sbjct: 309 SAPGASSVANSTGINTNDHGFIVLETNYRVYAYTDNPLQIAVLNLFITLKYRFPNLVVGM 368
Query: 336 ITKESLYNAFENGITAEQI 354
+T+ES+ A NGI+AEQI
Sbjct: 369 LTRESVRRALGNGISAEQI 387
>gi|301786953|ref|XP_002928911.1| PREDICTED: LOW QUALITY PROTEIN: valyl-tRNA synthetase,
mitochondrial-like [Ailuropoda melanoleuca]
Length = 1637
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 133/415 (32%), Positives = 212/415 (51%), Gaps = 52/415 (12%)
Query: 40 LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETT 97
LP LAK +VM+M +++ +P + WV + + L LR++ +
Sbjct: 72 LPSLAKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQG 131
Query: 98 YRLNSTFQSNLRKHLIYGG-ALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISST 156
LN F+ NLR L+ GG A + G + L+ YA +WE L ++ S
Sbjct: 132 LILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSP 191
Query: 157 QAERPTNFSSSMMKVFQRGLL-SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQ 215
A + + + Q GL+ S E P +T +GFQFLL+DT AQLWY + +Y+ +Q
Sbjct: 192 SAAVSQDLAQLLS---QAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQ 248
Query: 216 ERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIP 275
RG++ +++SFL +LSF G+ Y++ +S+ + ++ + GLV Q+ RK + P
Sbjct: 249 SRGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYP 307
Query: 276 TKLATNLS--MSLTDSSARKEGFIVVETNFRMYAYST----------------------- 310
T+LA NLS +S +A + GFIVVETN+R+YAY+
Sbjct: 308 TRLAINLSSGVSGAGGTAHQPGFIVVETNYRLYAYTGEARQRAPNSRXGVGTFLVTLEGL 367
Query: 311 -------------------SKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITA 351
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA
Sbjct: 368 EGVWVWGXPSSSLPIPGPESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITA 427
Query: 352 EQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 428 QQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 482
>gi|50310215|ref|XP_455127.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|62901296|sp|Q6CLR2.1|TFB2_KLULA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49644263|emb|CAG97834.1| KLLA0F01056p [Kluyveromyces lactis]
Length = 496
Score = 204 bits (518), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 129/431 (29%), Positives = 216/431 (50%), Gaps = 56/431 (12%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLYE+P C AI R L P+AK ++M M + D + + +++WV PD + + +I +
Sbjct: 24 RLYESPATCLAIYRLLSPMAKFFIMSMLFQDHDVSLRDLDKWVKPDAKYQLQYSIKSMKS 83
Query: 85 LRLFSE-EKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP-TLEDLEAYAI 142
L L E E K+ RLN F+ + K+++ GG + P + L+ Y+
Sbjct: 84 LNLIIEGESKQPLLIRLNPIFKKSF-KNVLTGGEINNSFGDVADDDTNPVSTATLDQYSA 142
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRDKEAPRLTESGFQFLLMDTNA 201
+WE L ++ + P ++ + Q GL+ + ++T GFQFLL D NA
Sbjct: 143 EKWETILHYMVGTPNTNTP---GGKVLDLLQHSGLMEEAEYGELKITNQGFQFLLQDVNA 199
Query: 202 QLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGL 261
Q+W ++ +Y+ ++ ++ D+++F+ L G+AY + LS Q++M++D D GL
Sbjct: 200 QMWTLLLQYLKMAESLQMDPVDVLNFIFMLGALQLGKAYKCDQLSNTQRTMLQDMRDYGL 259
Query: 262 VKLQQGRKENWFIPTKLAT-----------------------NLSMSLTDSSARKE---- 294
+ Q F PT+LAT N + ++ S + E
Sbjct: 260 IYQNQSDYAK-FYPTRLATLLTSDTKAFRSASVALDSVLNKANETTAVEGDSGQDETTER 318
Query: 295 ---GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITA 351
G +++ETNF++Y+YS S L IL LF ++ + N++ G +T+ES+ NA NGITA
Sbjct: 319 TQDGALIIETNFKLYSYSNSPLQIAILSLFVHLKSRFANMVTGQLTRESVRNALLNGITA 378
Query: 352 EQIISFLQQNAHPR------------------VADRMPSVPENVCDQIRLWESDLNRVEM 393
EQII++L+ +AHPR V + + +P V DQIRLW+ +L+R+
Sbjct: 379 EQIIAYLETHAHPRMRRLAEENLSKKLELDPTVKETLQVLPPTVVDQIRLWQLELDRIIS 438
Query: 394 TPAHYYDEFPS 404
+ Y +F S
Sbjct: 439 YDGYLYTDFES 449
>gi|62088538|dbj|BAD92716.1| VARS2L protein variant [Homo sapiens]
Length = 1653
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 132/421 (31%), Positives = 212/421 (50%), Gaps = 48/421 (11%)
Query: 30 PFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFS 89
P L+ P L K +VM+M +++ +P + WV + + L LR++
Sbjct: 85 PCHMSGCLQGAPILGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWH 144
Query: 90 EE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMPSGITARLPTLEDLEAYAIGQWE 146
+ LN F+ NLR L+ GG A + G + L+ YA +WE
Sbjct: 145 TQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWE 204
Query: 147 CFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRRDKEAPRLTESGFQFLLMDTNAQLWY 205
L ++ S A + + + Q GL+ S E P +T +GFQFLL+DT AQLWY
Sbjct: 205 VVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWY 261
Query: 206 IVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQ 265
+ +Y+ +Q RG++ +++SFL +LSF G+ Y++ +S+ + ++ + GLV Q
Sbjct: 262 FMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLV-FQ 320
Query: 266 QGRKENWFIPTKLATNLSMSLTDSSA--RKEGFIVVETNFRMYAY--------------- 308
+ RK + PT+LA NLS ++ + + GFIVVETN+R+YAY
Sbjct: 321 RKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEAGQRAPGRGGWG 380
Query: 309 -----------------------STSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAF 345
S S+L ++ LFS++ Y+ PN++V +T+ES+ A
Sbjct: 381 HLKEGLEGVWVWGWPPLPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAI 440
Query: 346 ENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+
Sbjct: 441 ASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQ 500
Query: 406 V 406
V
Sbjct: 501 V 501
>gi|403308524|ref|XP_003944708.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Saimiri boliviensis boliviensis]
Length = 1752
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 132/421 (31%), Positives = 212/421 (50%), Gaps = 48/421 (11%)
Query: 30 PFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFS 89
P L+ P L K +VM+M +++ +P + WV + + L LR++
Sbjct: 183 PCHLSGCLQGAPILGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWH 242
Query: 90 EE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMPSGITARLPTLEDLEAYAIGQWE 146
+ LN F+ NLR L+ GG A + G + L+ YA +WE
Sbjct: 243 TQLLPGGLQGLILNPVFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWE 302
Query: 147 CFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRRDKEAPRLTESGFQFLLMDTNAQLWY 205
L ++ S A + + + Q GL+ S E P +T +GFQFLL+DT AQLWY
Sbjct: 303 VVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWY 359
Query: 206 IVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQ 265
+ +Y+ +Q RG++ +++SFL +LSF G+ Y++ +S+ + ++ + GLV Q
Sbjct: 360 FMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLV-FQ 418
Query: 266 QGRKENWFIPTKLATNLSMSLTDSSA--RKEGFIVVETNFRMYAY--------------- 308
+ RK + PT+LA NLS ++ + + GFIVVETN+R+YAY
Sbjct: 419 RKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEVGQRAPGRAGWG 478
Query: 309 -----------------------STSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAF 345
S S+L ++ LFS++ Y+ PN++V +T+ES+ A
Sbjct: 479 RGNARLEGVWVWGWPPHPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAI 538
Query: 346 ENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y++F S+
Sbjct: 539 ASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQ 598
Query: 406 V 406
V
Sbjct: 599 V 599
>gi|449296697|gb|EMC92716.1| hypothetical protein BAUCODRAFT_259506 [Baudoinia compniacensis
UAMH 10762]
Length = 489
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 133/414 (32%), Positives = 212/414 (51%), Gaps = 36/414 (8%)
Query: 12 MDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
+D + L +LY+ P AI R LP LAK VM M Y+ ++ W+ PD
Sbjct: 10 LDYLEQLPGTTFTKLYQQPSTALAIFRRMLPHLAKTLVMAMLYMPKPFAVADVDVWIKPD 69
Query: 71 --GFTKHRVAIDRLVQLRLF---SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
G A+ L +LR+ +EK Y+L+ F +LR L GG +PS
Sbjct: 70 AEGLQARDRALSILQRLRILFDTRDEKDGRPAYKLSDGFARSLRMALTGGGNHRSFGVPS 129
Query: 126 GITARLPTLED-LEAYAIGQWECFLLQLISSTQAERPTN---FSSSMMKVFQRG-LLSRR 180
+ P L+A+A QWE L ++ S A + S+ ++ Q+G +S R
Sbjct: 130 NAPDKQPVSGSYLDAFARQQWEAILYYVVGSAHAGLGGSTDAISAGTKQLLQKGEFVSVR 189
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAY 240
+EA +T+ GF FLL + NAQ+W ++ Y+ S ++ D++SFL L G +Y
Sbjct: 190 GREA-HITQRGFTFLLQEINAQIWTLLIVYLEVSGALHMDPVDVLSFLFTLGSLELGISY 248
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT------------- 287
+ L+ Q+ M+ D +D GLV ++ E + PT+LAT L+
Sbjct: 249 STTNLTPTQQQMLDDLSDFGLV-YRRSNDEARYYPTRLATTLTSDAPALPNTSLTSTTST 307
Query: 288 ----------DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAIT 337
+SA ++G+I++ETN+R+YAY++S L IL LF+ ++ + PNLI +T
Sbjct: 308 TTLASSSTDPSASANEKGYIILETNYRLYAYTSSPLPITILSLFATLKTRYPNLITAKLT 367
Query: 338 KESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRV 391
K S+ +A +GIT++QII++L +AHP + + P +P V DQIRLW+ + R+
Sbjct: 368 KSSIQSAIASGITSDQIITYLTTHAHPILRRQNPVLPPTVVDQIRLWQIEGERM 421
>gi|328863850|gb|EGG12949.1| hypothetical protein MELLADRAFT_46475 [Melampsora larici-populina
98AG31]
Length = 454
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 128/411 (31%), Positives = 218/411 (53%), Gaps = 28/411 (6%)
Query: 7 IAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYID-GAIPAKMMEE 65
I+ + + + L + L RLYE+P C +I R LP LA+ ++ + + ++
Sbjct: 9 ISSSLSEFLHDLPEQTLSRLYESPSSCLSIFRLLPSLARHLILNSLWSSIHSFNLTQLKL 68
Query: 66 WVLPDGFTKHRVA---IDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREP 122
W F ++ + + +L+ L L + L+ TF+ N + L GG
Sbjct: 69 WSTSIKFESNQSSQSTLAKLIHLHLI-----HSSNLTLHPTFKQNFQLALTGGGDHKSFG 123
Query: 123 MPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQ-RGLLSRRD 181
+P ++++ L L+ YA WE L ++ S + RP SS ++ + GL+S ++
Sbjct: 124 LPCKTSSKVDLLT-LDQYAKQNWETILHFMVGSNWSNRP---SSKIITLLTFSGLISSQN 179
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYN 241
++T GFQFLL D N QLW ++ +Y+ ++ ++ D++S L L G+ Y+
Sbjct: 180 S---KITSVGFQFLLDDINTQLWELLLQYLKMAEANDLDIVDVLSCLFMLGSLELGKEYS 236
Query: 242 LNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSL--------TDSSARK 293
+ S+ Q ++ D D GLV L Q + F PT+L T L+ + SS+
Sbjct: 237 MKNFSDTQVQVLNDLVDYGLVYLTQSKT---FYPTRLVTTLTSTAPPIVSNPSDQSSSDP 293
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
F+++ETN+R+YAY+++ L IL LF + + PNL++G+IT+ES+ AF+NGI A+Q
Sbjct: 294 NEFLILETNYRIYAYTSNPLQIAILNLFISFKSRFPNLVIGSITRESIKMAFKNGIRADQ 353
Query: 354 IISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
IIS+L+ ++H ++ P +P V DQIRLWE + NRV + Y++F S
Sbjct: 354 IISYLEFHSHSQMKLLKPILPNTVVDQIRLWELEKNRVRDQEGYLYEDFKS 404
>gi|390461382|ref|XP_002746360.2| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Callithrix jacchus]
Length = 1675
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 132/422 (31%), Positives = 212/422 (50%), Gaps = 49/422 (11%)
Query: 30 PFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFS 89
P L+ P L K +VM+M +++ +P + WV + + L LR++
Sbjct: 105 PCHLSGCLQGAPILGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWH 164
Query: 90 EE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMPSGITARLPTLEDLEAYAIGQWE 146
+ LN F+ NLR L+ GG A + G + L+ YA +WE
Sbjct: 165 TQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWE 224
Query: 147 CFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRRDKEAPRLTESGFQFLLMDTNAQLWY 205
L ++ S A + + + Q GL+ S E P +T +GFQFLL+DT AQLWY
Sbjct: 225 VVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSAEPGEPPCITSAGFQFLLLDTPAQLWY 281
Query: 206 IVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQ 265
+ +Y+ +Q RG++ +++SFL +LSF G+ Y++ +S+ + ++ + GLV Q
Sbjct: 282 FMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLV-FQ 340
Query: 266 QGRKENWFIPTKLATNLSMSLTDSSA--RKEGFIVVETNFRMYAY--------------- 308
+ RK + PT+LA NLS ++ + + GFIVVETN+R+YAY
Sbjct: 341 RKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEAGQRAPGRAGWG 400
Query: 309 ------------------------STSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
S S+L ++ LFS++ Y+ PN++V +T+ES+ A
Sbjct: 401 XHLKERFEGVWVWGWPPHPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTRESVQQA 460
Query: 345 FENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T Y++F S
Sbjct: 461 IASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLS 520
Query: 405 RV 406
+V
Sbjct: 521 QV 522
>gi|342319113|gb|EGU11064.1| RNA polymerase II transcription factor B subunit 2 [Rhodotorula
glutinis ATCC 204091]
Length = 496
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 138/424 (32%), Positives = 216/424 (50%), Gaps = 34/424 (8%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
A + ++ L+A RLY +P C +I R LP ++ V+ M + + + + + WV
Sbjct: 19 ASHLYGLLDRLSAAMFTRLYASPASCLSIFRLLPVTSRHIVLNMLWYEEVVRVRDVALWV 78
Query: 68 L-------PDGFTKH-RVAIDRLVQLRLF-------SEEKKKETTYRLNSTFQSNLRKHL 112
G +H ++ L +L + S+ K ET +N F+ + R L
Sbjct: 79 RERKSEGGDKGERRHLSSSLSALARLHIITPRSSRPSDTSKDETELEMNPGFRDSFRMAL 138
Query: 113 IYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISS-TQAERPTNFSSSMMKV 171
GG P+ T++ L+ +A QWE ++ S A++P S+++
Sbjct: 139 TGGGKQGSFGAPAAEQDEEVTVQFLDDHAEVQWETIQHFMVGSDAGAKKPGEKVLSLLE- 197
Query: 172 FQRGLL--SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLL 229
+ GL+ R R+T GFQFLL D N QLW ++ Y+ SQ + + I FL
Sbjct: 198 -RSGLMYSPTRSLRNMRITSKGFQFLLEDVNTQLWDLLLVYLEGSQ----DLVETIGFLF 252
Query: 230 ELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSL--- 286
L G AY + LS+IQ +++D AD GLV L + R F PT+LAT L+ S
Sbjct: 253 MLGSLELGRAYMTDNLSQIQHGVLRDLADYGLVYLPE-RNAPIFYPTRLATTLTSSAPPL 311
Query: 287 --TDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
+ S ++GFIV+ETN+++YAY+++ L +L LF+ ++ + N + G IT+ES+
Sbjct: 312 VSSRHSNEEKGFIVLETNYKLYAYTSNPLQIAVLGLFAHLKTRFANFVTGHITRESIRRG 371
Query: 345 FENGITAEQIISFLQQNAHPRVADRMPS----VPENVCDQIRLWESDLNRVEMTPAHYYD 400
NGITA QIIS+L AHP++ + S +P V DQIRLWE + R++ T + YD
Sbjct: 372 LANGITANQIISYLASRAHPQMRAQAGSDDKLLPITVVDQIRLWEHERRRIQTTEGYLYD 431
Query: 401 EFPS 404
EF S
Sbjct: 432 EFSS 435
>gi|448527133|ref|XP_003869438.1| TFIIH and nucleotide excision repair factor 3 complex subunit
[Candida orthopsilosis Co 90-125]
gi|380353791|emb|CCG23303.1| TFIIH and nucleotide excision repair factor 3 complex subunit
[Candida orthopsilosis]
Length = 500
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 138/441 (31%), Positives = 219/441 (49%), Gaps = 70/441 (15%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE C +I R L P+AK Y+M M + + I + + +W P A+ RL
Sbjct: 24 KLYEASATCLSIYRLLSPMAKFYIMSMIFNEKPIAMRDLTKWCKPSAKKMEFEALKRLES 83
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPRE--PMPSGITARLPTLEDLEAYAI 142
L L + K T RLNS F+SN R L G P + +G + L+ +A
Sbjct: 84 LHLIDYDGKG-THVRLNSIFRSNFRDCLT-GSQDPNAFGSISTGDDVDKVEISFLDKFAS 141
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQ 202
+WE +L + T+ PT SS + + GL+ + +T +GFQFLL D NAQ
Sbjct: 142 HEWET-ILHFMVGTEG-TPTPSSSVLSLLKLGGLM--EGESTLNITNTGFQFLLQDANAQ 197
Query: 203 LWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLV 262
+W ++ +Y++ + E +N D+++F+ L G+ Y+++ LSE Q SM+ D DLGLV
Sbjct: 198 IWTLLLQYLNLTSELNMNPVDVLNFIFILGCLELGKGYSVSNLSETQVSMLADLKDLGLV 257
Query: 263 KLQQGRKENWFIPTKLATNLSMSLTDSSARK----------------------EGFIVVE 300
Q+ N F PT+LAT L+ +DSSA K I++E
Sbjct: 258 -YQKSDTSNKFYPTRLATTLT---SDSSALKTPSMAVQQALEENEEQMMASNSRESIIIE 313
Query: 301 TNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 360
TNF++YAY+ S L IL LF +++ + N++ G IT+ES+ NA NGIT++QII FL+
Sbjct: 314 TNFKIYAYTNSPLEIAILNLFVQMKTRFSNMVCGQITRESIRNALYNGITSDQIIKFLET 373
Query: 361 NAHPRV------------------------------------ADRMPSVPENVCDQIRLW 384
+AHP++ ++ +P NV DQI+LW
Sbjct: 374 HAHPQMRALAKEKLDKKVEFDASHNINTAGGAPQSKTDGAISQHKLEVIPPNVVDQIKLW 433
Query: 385 ESDLNRVEMTPAHYYDEFPSR 405
+ +L+R++ + + +F ++
Sbjct: 434 QLELDRIQTVEGYLFKDFANQ 454
>gi|193654859|ref|XP_001944632.1| PREDICTED: general transcription factor IIH subunit 4-like
[Acyrthosiphon pisum]
Length = 470
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 120/380 (31%), Positives = 207/380 (54%), Gaps = 26/380 (6%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKH-RVAIDRL 82
DRLY +P C A+ R LP +A+ + +++ +I+ +P + WV P +++ +I+
Sbjct: 42 DRLYIHPPTCLAVFRVLPDIAQVFTLRILFIEQPVPLSALSSWV-PAKYSRELEESIEVA 100
Query: 83 VQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPT--LEDLE 138
+ L ++ + + LNSTF+ L+ ++ GG R +P+ L ++ L+
Sbjct: 101 INLHIWKLTSVSGGLKGWILNSTFKKKLKVAIMSGG---RTTVPNSDETALKARDIDILD 157
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSSSMMKVF-QRGLLSRRDKEAPRLTESGFQFLLM 197
+YA +WEC L ++ S + SS ++V GL+ R +P +T +G+QFLL+
Sbjct: 158 SYAYERWECILYYMVGS----KHKGISSDAVRVLLNAGLMVRDTDNSPVITSTGYQFLLL 213
Query: 198 DTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFA 257
D Q+WY + Y+ + RG++ A + FL ++ G Y + + E ++ ++
Sbjct: 214 DMATQVWYFMLHYMETVESRGLDLAQYLIFLFQIHLSTLGWDYITDGMPENLQTFLQHLQ 273
Query: 258 DLGLV---KLQQGRKENWFIPTKLATNLSMSLTDSSAR---KEGFIVVETNFRMYAYSTS 311
+ GLV KL+ GR F PT+L + + +S R KE +IVVETNFR+YA + S
Sbjct: 274 EFGLVYQRKLKAGR----FCPTRLVIEMGRENSHTSKRMKKKERYIVVETNFRIYAMTDS 329
Query: 312 KLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMP 371
L ++ LF+ + Y+ PN+ G +T++S+ A NGITA QI+ FL + HP++ +
Sbjct: 330 DLKVALVALFTHMLYRFPNMSAGILTRDSVQTALRNGITAAQIVRFLTVHTHPQMQE--C 387
Query: 372 SVPENVCDQIRLWESDLNRV 391
+P+ V DQI LWE + NR+
Sbjct: 388 GMPQTVIDQIYLWEYERNRL 407
>gi|146414792|ref|XP_001483366.1| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
6260]
Length = 498
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 136/439 (30%), Positives = 221/439 (50%), Gaps = 68/439 (15%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE P C +I R L P+AK ++M M + + + +++W P + A+ RL
Sbjct: 24 QLYEAPATCLSIFRLLTPIAKFFIMLMIFNERPQSVRDLDKWCRPQSKSLQYDALKRLKS 83
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLEDLE 138
L L EE RL+ TF+ N R L G L E +T L+
Sbjct: 84 LHLV-EEDAGGNFLRLHMTFRKNFRDCLTGNQEATAFGNLCTEDDGHKVTVAF-----LD 137
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR---RDKEAPRLTESGFQFL 195
++A+ +WE L ++ + E P+ S S++ + + G L D ++T SGFQFL
Sbjct: 138 SFALQKWETILHYMVGT---ELPSKPSPSVLSLLKLGGLMEGPGTDARNLKITNSGFQFL 194
Query: 196 LMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKD 255
L D NAQ+W ++ +Y++ +Q+ ++ D+++FL L G++Y L+ LS+ Q SM+ D
Sbjct: 195 LQDMNAQIWTLLLQYLNLTQDLNMDPVDVLNFLFVLGSLELGKSYYLSGLSDTQVSMLAD 254
Query: 256 FADLGLVKLQQGRKENWFIPTKLATNL----------SMSLTDS--------SARKEGFI 297
D GLV Q + + F PT+LAT L SM+L + S +G I
Sbjct: 255 LRDYGLV-YQYSDQSDRFYPTRLATTLTSESASLKTPSMALDQAVDTSNGLDSGPGQGSI 313
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
++ETNF++YAY+ S L IL LF + + N++ G IT+ES+ NA NGIT++QII+F
Sbjct: 314 ILETNFKLYAYTNSPLEIAILNLFVNLRTRFSNMVSGQITRESIRNALYNGITSDQIINF 373
Query: 358 LQQNAHPRV-------------------------------ADRMPSVPENVCDQIRLWES 386
L+ +AHP++ ++ +P V DQI+LW+
Sbjct: 374 LETHAHPQMRALAKERLDKKVEFDASNNINTAGGGQNSSAQHKLEILPPTVVDQIKLWQL 433
Query: 387 DLNRVEMTPAHYYDEFPSR 405
+L+R++ + + EF S+
Sbjct: 434 ELDRIQAFEGYLFKEFASQ 452
>gi|313245784|emb|CBY34779.1| unnamed protein product [Oikopleura dioica]
Length = 456
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 122/407 (29%), Positives = 231/407 (56%), Gaps = 21/407 (5%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP +K+IA + + AR +D LY +P C ++ R LP +AK ++++ +I+ I
Sbjct: 15 MPELKLIA--LHEFLVKQDARVIDELYRHPATCMSVFRELPQIAKHIIIRILFINQQIAK 72
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPR 120
++E WV + K VA++ + LR++ E + LN++F++ L++ L GG R
Sbjct: 73 PLIESWVADEHREKFEVAMEIITGLRIW-ENTNDGIAFNLNNSFRTYLQEALFGGGETWR 131
Query: 121 EPMPS-GITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
+ + G T+E L+ Y +W+ +L ++ Q + S ++ + + L
Sbjct: 132 PAVETLGADKNAKTVEQLDTYTKERWDQ-ILSFLTQEQGK----LSEEVISLLKYAGLCD 186
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
+ E FQFLL+D ++Q+WY++ +Y+ Q+ G++ ++++F+L+L + G
Sbjct: 187 AN------GEKRFQFLLLDRSSQVWYLLVQYLGYVQKLGLSLVNVLAFVLQLGYCSFGTD 240
Query: 240 YNL-NTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS---LTDSSARKEG 295
Y N+ +EI + +I+ F ++GL+ ++ KE F PT+LA ++S++ S +E
Sbjct: 241 YPCDNSNNEISR-VIQHFREMGLI-FKRKSKEQRFYPTRLAQSISIAGGKKASSEDVQEQ 298
Query: 296 FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQII 355
FI+VETN+R+YAY+ S+LH ++ LF++V+Y+ P +IV ++++S+ + + GI+AEQI+
Sbjct: 299 FILVETNYRIYAYTDSELHYALISLFAEVQYRFPYMIVAQMSRDSIQQSADYGISAEQIL 358
Query: 356 SFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
++L+ +AHP VP+ V D I LW + R++ Y +F
Sbjct: 359 NYLRSSAHPIARKNKHWVPQVVEDNIHLWCKERERLKFNDGLLYHQF 405
>gi|190347675|gb|EDK39997.2| hypothetical protein PGUG_04095 [Meyerozyma guilliermondii ATCC
6260]
Length = 498
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 136/439 (30%), Positives = 220/439 (50%), Gaps = 68/439 (15%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE P C +I R L P+AK ++M M + + + +++W P + A+ RL
Sbjct: 24 QLYEAPATCLSIFRLLTPIAKFFIMSMIFNERPQSVRDLDKWCRPQSKSLQYDALKRLKS 83
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLEDLE 138
L L EE RL+ TF+ N R L G L E +T L+
Sbjct: 84 LHLV-EEDAGGNFLRLHMTFRKNFRDCLTGNQEATAFGNLCTEDDGHKVTVAF-----LD 137
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR---RDKEAPRLTESGFQFL 195
++A +WE L ++ + E P+ S S++ + + G L D ++T SGFQFL
Sbjct: 138 SFASQKWETILHYMVGT---ELPSKPSPSVLSLLKSGGLMEGPGTDARNLKITNSGFQFL 194
Query: 196 LMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKD 255
L D NAQ+W ++ +Y++ +Q+ ++ D+++FL L G++Y L+ LS+ Q SM+ D
Sbjct: 195 LQDMNAQIWTLLLQYLNLTQDLNMDPVDVLNFLFVLGSLELGKSYYLSGLSDTQVSMLAD 254
Query: 256 FADLGLVKLQQGRKENWFIPTKLATNL----------SMSLTDS--------SARKEGFI 297
D GLV Q + + F PT+LAT L SM+L + S +G I
Sbjct: 255 LRDYGLV-YQYSDQSDRFYPTRLATTLTSESASLKTPSMALDQAVDTSNGLDSGPGQGSI 313
Query: 298 VVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISF 357
++ETNF++YAY+ S L IL LF + + N++ G IT+ES+ NA NGIT++QII+F
Sbjct: 314 ILETNFKLYAYTNSPLEIAILNLFVNLRTRFSNMVSGQITRESIRNALYNGITSDQIINF 373
Query: 358 LQQNAHPRV-------------------------------ADRMPSVPENVCDQIRLWES 386
L+ +AHP++ ++ +P V DQI+LW+
Sbjct: 374 LETHAHPQMRALAKERLDKKVEFDASNNINTAGGGQNSSAQHKLEILPPTVVDQIKLWQL 433
Query: 387 DLNRVEMTPAHYYDEFPSR 405
+L+R++ + + EF S+
Sbjct: 434 ELDRIQAFEGYLFKEFASQ 452
>gi|296422658|ref|XP_002840876.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295637102|emb|CAZ85067.1| unnamed protein product [Tuber melanosporum]
Length = 460
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 127/400 (31%), Positives = 210/400 (52%), Gaps = 33/400 (8%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
RLY+ P AI R LP LAK V + Y + + +E V P+ + + A +L
Sbjct: 21 RLYQQPATALAIFRRMLPNLAKTIVTTLLYNETPVTLGDLEALVRPESYREREDAFSKLR 80
Query: 84 QLRLFSEEKKKETTYRLNSTFQSNLR---KHLIYGGALPREPMPSGITARLPTLEDLEAY 140
+L + +E + L+ F+ N R H +G E + T++ L+ Y
Sbjct: 81 RLHIITENNLDVS---LDPVFKKNFRLAGDHHSFGVPCNAE------DKKKITVQFLDDY 131
Query: 141 AIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR--DKEAPRLTESGFQFLLMD 198
A Q+E L ++ + +P S ++ + +G L R + +T++GF FLL D
Sbjct: 132 ATRQFEAILHYMVGTHNEVKP---SKGVITLLTKGGLMERYGAGSSATITKNGFSFLLQD 188
Query: 199 TNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFAD 258
N Q+W ++ +Y+ S+E G+ Q D++ FLL L G+AY++NTL+E QK M+ D D
Sbjct: 189 GNPQIWALLIQYLEMSEELGMEQTDVLHFLLMLGSLELGQAYSVNTLTETQKLMLADLRD 248
Query: 259 LGLVKLQQGRKENWFIPTKLATNLS--------------MSLTDSSARKEGFIVVETNFR 304
G+V Q+ + F PT+LAT L+ ++ + +GFI++ETN+R
Sbjct: 249 YGIV-YQRKSSSDRFYPTRLATTLTSESGGLRSASASMSSAMAKDAEEGKGFIILETNYR 307
Query: 305 MYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 364
+YAY+ S L +L LF K+ + PNL+ G I++ S+ A + GITA+Q+I +L +AHP
Sbjct: 308 VYAYTDSPLQIAVLNLFVKLSTRYPNLVSGRISRRSIQEAIKMGITADQVIDYLSAHAHP 367
Query: 365 RVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
++ + ++P V DQIRLW+ + R+ T + +F S
Sbjct: 368 QMRKSLVTLPPTVVDQIRLWQIEGERMRTTTGFLFKDFHS 407
>gi|397471696|ref|XP_003807420.1| PREDICTED: LOW QUALITY PROTEIN: valine--tRNA ligase, mitochondrial
[Pan paniscus]
Length = 1673
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 132/428 (30%), Positives = 212/428 (49%), Gaps = 55/428 (12%)
Query: 30 PFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFS 89
P L+ P L K +VM+M +++ +P + WV + + L LR++
Sbjct: 98 PCHMSGCLQGAPILGKNWVMRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWH 157
Query: 90 EE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMPSGITARLPTLEDLEAYAIGQWE 146
+ LN F+ NLR L+ GG A + G + L+ YA +WE
Sbjct: 158 TQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWE 217
Query: 147 CFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SRRDKEAPRLTESGFQFLLMDTNAQLWY 205
L ++ S A + + + Q GL+ S E P +T +GFQFLL+DT AQLWY
Sbjct: 218 VVLHFMVGSPSAAVSQDLAQLLS---QAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWY 274
Query: 206 IVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQ 265
+ +Y+ +Q RG++ +++SFL +LSF G+ Y++ +S+ + ++ + GLV Q
Sbjct: 275 FMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLV-FQ 333
Query: 266 QGRKENWFIPTKLATNLSMSLTDSSA--RKEGFIVVETNFRMYAY--------------- 308
+ RK + PT+LA NLS ++ + + GFIVVETN+R+YAY
Sbjct: 334 RKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAYTGEAGQRAPGRGGWG 393
Query: 309 ------------------------------STSKLHCEILRLFSKVEYQLPNLIVGAITK 338
S S+L ++ LFS++ Y+ PN++V +T+
Sbjct: 394 ALXHFSXHLKEGLEGVWVWGCPPHPLSIPGSESELQIALIALFSEMLYRFPNMVVAQVTR 453
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHY 398
ES+ A +GITA+QII FL+ AHP + + P +P + DQIRLWE + +R+ T
Sbjct: 454 ESVQQAIASGITAQQIIHFLRTRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVL 513
Query: 399 YDEFPSRV 406
Y++F S+V
Sbjct: 514 YNQFLSQV 521
>gi|119179887|ref|XP_001241468.1| hypothetical protein CIMG_08631 [Coccidioides immitis RS]
gi|392866653|gb|EAS30172.2| transcription factor Tfb2 [Coccidioides immitis RS]
Length = 487
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 140/427 (32%), Positives = 221/427 (51%), Gaps = 54/427 (12%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P AI R LP LAK +VM + Y+ +PA +E WV + A+ L
Sbjct: 22 KLYQQPSTALAIFRRMLPHLAKCFVMALLYLKDPLPAAELELWVKAGSKRERDNALSILS 81
Query: 84 QLRLFSEEKKKE--TTYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLE 135
+L + S + Y + + F S+LR+ L G G + P P +T +
Sbjct: 82 RLHILSSTTTSDHVRAYMVTNPFSSSLRQALTGGDKQHSFGVMSSMPDPHPMT-----VS 136
Query: 136 DLEAYAIGQWECFLLQLISST----QAERPTNFSSSMMKVFQR-GLLSRRDKEAPRLTES 190
DL+ YA QWE L ++ + Q E T S + + Q L+ RD+ +T+
Sbjct: 137 DLDDYARRQWEGVLGYMVGTNSLGIQRENVT-LSKGVKSLLQACHLVEVRDRRV-EITKE 194
Query: 191 GFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQK 250
GF F+L D N Q+W+I+ Y+ N++ G++ +++SFL LS G++Y+ L+ Q
Sbjct: 195 GFAFVLQDVNTQVWHILILYVENAEAIGMDSVEVLSFLFLLSSLELGQSYDKKHLTSTQL 254
Query: 251 SMIKDFADLGLVKLQQGRKENW-FIPTKLATNL---SMSLTDS----------------S 290
+ D D G+V E+ F PT+LAT L SM+L+ S
Sbjct: 255 RTLADLTDFGIVYQHSPASESTRFYPTRLATTLTSDSMALSSSISGNLAPAGPNINAATG 314
Query: 291 ARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGIT 350
A GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G +T++S+ A E GIT
Sbjct: 315 APGTGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKLTRQSIRRAVEMGIT 374
Query: 351 AEQIISFLQQNAHPRV------------ADRMPSV-PENVCDQIRLWESDLNRVEMTPAH 397
A+QIIS+L +AHP++ A + SV P V DQIRLW+ + +R++ TP
Sbjct: 375 ADQIISYLTTHAHPQMRKVNASKSTSTTAGLLASVLPPTVVDQIRLWQLERDRLKATPGF 434
Query: 398 YYDEFPS 404
+ +F S
Sbjct: 435 LFKDFAS 441
>gi|150863860|ref|XP_001382481.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
II transcription factor B p52 subunit) (RNA polymerase
II transcription factor B 52 kDa subunit)
[Scheffersomyces stipitis CBS 6054]
gi|149385116|gb|ABN64452.2| RNA polymerase II transcription factor B subunit 2 (RNA polymerase
II transcription factor B p52 subunit) (RNA polymerase
II transcription factor B 52 kDa subunit)
[Scheffersomyces stipitis CBS 6054]
Length = 515
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 132/454 (29%), Positives = 217/454 (47%), Gaps = 85/454 (18%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE P C AI R L P+AK Y+M M + D + + ++ W P ++ RL
Sbjct: 25 KLYEAPATCLAIFRLLTPMAKFYIMTMLFNDRPVALRDLDRWCKPHAKKLQFDSLKRLTS 84
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED-------- 136
L L EE + RLN TF+ N R L S L T D
Sbjct: 85 LHLI-EEDRGGNHIRLNPTFRQNFRNCLT-------GSQQSNAFGNLCTTTDKHRVDISF 136
Query: 137 LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP---RLTESGFQ 193
L+ +A +WE L ++ + PT +S++ + + G L +P ++T +GFQ
Sbjct: 137 LDNFASSKWETILHFMVGTEATATPT---TSVLSLLKSGGLMEGPGNSPQNLKITNTGFQ 193
Query: 194 FLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMI 253
FLL D NAQ+W ++ +Y++ +QE ++ D+++F+ L G++Y +++LS Q SM+
Sbjct: 194 FLLQDVNAQIWTLLLQYLNLTQELHMDPVDVLNFIFVLGSLELGKSYLVSSLSATQVSML 253
Query: 254 KDFADLGLVKLQQGRKENWFIPTKLATNL----------SMSL--------------TDS 289
D D GLV Q+ F PT+LAT L SM++ D+
Sbjct: 254 ADLRDYGLV-YQRTDSSGRFYPTRLATTLTSDSAALKTPSMAIEQALGASDGGEEQQIDT 312
Query: 290 SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
+G +++ETNF++YAY+ S L IL LF ++ + N++ G IT+ES+ +A NGI
Sbjct: 313 PTGNQGTVIIETNFKLYAYTNSPLEIAILNLFVHLKSRFTNMVCGQITRESIRSALYNGI 372
Query: 350 TAEQIISFLQQNAHPRVA--------------------------------------DRMP 371
TA+QII FL+ +AHP++ ++
Sbjct: 373 TADQIIKFLETHAHPQMKLLAKEKLDKKIEFDTSHNINTAGGAPQSKVSGEGNVAQHKLE 432
Query: 372 SVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
+P V DQI+LW+ +L+R++ + + +F ++
Sbjct: 433 ILPPTVVDQIKLWQLELDRIQTFDGYLFKDFSNQ 466
>gi|366997767|ref|XP_003683620.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
gi|357521915|emb|CCE61186.1| hypothetical protein TPHA_0A01010 [Tetrapisispora phaffii CBS 4417]
Length = 504
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 131/439 (29%), Positives = 219/439 (49%), Gaps = 70/439 (15%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE+P C +I R LP +AK ++M M + D I + ++ WV +G + + AI +
Sbjct: 24 KLYESPAACLSIYRLLPHMAKFFIMSMVFNDNEISIRDLDRWVNSNGKLQFQDAIKSMKS 83
Query: 85 LRLFSEEKKKETTY-RLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED---LEAY 140
L + K++ + + LNSTF+++ R L GG + I ++ L D L Y
Sbjct: 84 LHILIPGKEQGSMFVNLNSTFRTSFRNALT-GGEVNNSF--GNIIDKIDDLVDSSVLLEY 140
Query: 141 AIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDT 199
A +WE L ++ + + P++ +++K L+ + ++T GFQFLL D
Sbjct: 141 AKDKWETILHFMVGTPLIKMPSDNVLNLLK--HSKLMEELESSGEFKITNDGFQFLLQDV 198
Query: 200 NAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADL 259
N+Q+W ++ +Y+ ++ +N D+++F+ L G+AY++ LS+ QK M+KD D
Sbjct: 199 NSQIWTLLLQYLKIAEALHMNPVDILNFIFMLGSLEFGKAYSVEGLSDTQKKMLKDMRDY 258
Query: 260 GLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKE------------------------- 294
GL+ Q+ F PT LAT L+ +DSS +
Sbjct: 259 GLI-FQKTSNATVFYPTTLATQLT---SDSSTVRSASGAIDSLLSQTKEANNNDSQNLGT 314
Query: 295 -------------GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESL 341
G +VVETNF++Y+YS S L IL LF + + N++ G IT+ES+
Sbjct: 315 DTDDLVGNQSVEVGALVVETNFKLYSYSNSPLQIAILSLFVHLRTRFSNMVTGQITRESI 374
Query: 342 YNAFENGITAEQIISFLQQNAHPRV------------------ADRMPSVPENVCDQIRL 383
NA NG+TA+QII++LQ +AHP++ D++ +P V DQI+L
Sbjct: 375 RNALANGVTADQIIAYLQSHAHPQMRRLAETTLKKKLELDPNTKDQLQILPPTVVDQIKL 434
Query: 384 WESDLNRVEMTPAHYYDEF 402
W+ +L+R+ Y +F
Sbjct: 435 WQLELDRILSYDGSLYSDF 453
>gi|154273196|ref|XP_001537450.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150415962|gb|EDN11306.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 455
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 134/430 (31%), Positives = 214/430 (49%), Gaps = 61/430 (14%)
Query: 12 MDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
M+ + SL RLY+ P AI R LP LAK +VM + Y+ +PA +E WV
Sbjct: 9 MEYLESLPGTVFRRLYQQPSTALAIFRRMLPYLAKCFVMALLYLKDPLPAADLELWV--- 65
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITAR 130
+ + + Y + F S+LR+ L G +PS
Sbjct: 66 --------------------KAESKRAYTVTDPFSSSLRQALTGGDKQQSFGVPSLTDDE 105
Query: 131 LP-TLEDLEAYAIGQWECFLLQLI--SSTQAERPTNFSSSMMKVFQR-GLLSRRDKEAPR 186
T+ L+ YA QWE L ++ S+ ++ S + ++ Q L+ RD+
Sbjct: 106 NSMTVAQLDYYARSQWEGVLGYMVGTSALGVQQAVTLSKGVKQLLQACHLVEVRDRRV-E 164
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
+T+ GF F+L D N Q+W+I+ Y+ N+++ G++ +++SFL LS G++Y L+
Sbjct: 165 ITKDGFAFVLQDLNTQVWHILILYVENAEQIGMDSIEVLSFLFVLSSLELGQSYEKKHLT 224
Query: 247 EIQKSMIKDFADLGLVKLQQGRKE-NWFIPTKLATNL-----------SMSLT------- 287
Q + D D G+V +E F PT+LAT L S SLT
Sbjct: 225 STQLKTLADLTDFGIVYQHSPHQEATRFYPTRLATTLTSDSITLGSSISSSLTAPNGVPS 284
Query: 288 -DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFE 346
S+ GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNLI G +T++S+ A E
Sbjct: 285 ATSNEPSTGFIIIETNYRLYAYTSSPLQISLIALFTSLKYRFPNLITGKLTRQSVRRAVE 344
Query: 347 NGITAEQIISFLQQNAHPRVADRMPS------------VPENVCDQIRLWESDLNRVEMT 394
GITA+QIIS+L +AHP++ + +P V DQIRLW+ + +R++ T
Sbjct: 345 MGITADQIISYLTTHAHPQMRKYHATKPGANPVGVHTVLPPTVVDQIRLWQLERDRIKAT 404
Query: 395 PAHYYDEFPS 404
P + +F S
Sbjct: 405 PGFLFKDFVS 414
>gi|403217068|emb|CCK71563.1| hypothetical protein KNAG_0H01490 [Kazachstania naganishii CBS
8797]
Length = 519
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 138/455 (30%), Positives = 221/455 (48%), Gaps = 85/455 (18%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLYE+P C AI R LP LAK ++M M + D + + ++ WV +G + + AI +
Sbjct: 26 RLYESPATCLAIYRLLPQLAKFFIMSMVFNDNDVSLRDLDRWVKANGKIQFQEAIKSMKS 85
Query: 85 LRLFSEEKKK-ETTYRLNSTFQSNLRKHLIYGGALPR---------EPMPSGI-TARLPT 133
L L K + LN TF+ + R L GG + E + +G+ RL
Sbjct: 86 LHLIIPTKGAGQLMIGLNPTFRDSFRNALT-GGEVSNSFGTVVNDDEDLRTGVLNMRL-- 142
Query: 134 LEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKE--APRLTESG 191
L+ Y+ +WE L ++ + +E P SS+++ + + L + A ++T G
Sbjct: 143 ---LDEYSANKWETILHFMVGTPMSEVP---SSNVLNLLKHSKLMEESEATGAFKITNEG 196
Query: 192 FQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKS 251
FQFLL + N+Q+W ++ +Y+ S+ ++ D+++F+ L G+AY+ LSE Q+
Sbjct: 197 FQFLLQEINSQIWTLLLQYLKMSESLQMDAVDVLNFIFMLGALEVGKAYSTEGLSETQRI 256
Query: 252 MIKDFADLGLVKLQQGRKENWFIPTKLA-------------------TNLSMSLTDSSAR 292
MIKD D G+V Q+ N F PT LA T LS + D+S+
Sbjct: 257 MIKDMRDYGIV-FQKNSTSNVFYPTSLAIMLTSDSKTVLRTASGAMNTVLSQNKEDTSST 315
Query: 293 -------------------------KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQ 327
++G ++VETNF++Y+YS S L IL LF ++ +
Sbjct: 316 NKSATDKANNDTAELDQVGTQSQEVQDGVLIVETNFKVYSYSNSPLQIAILSLFVHLKSR 375
Query: 328 LPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADR 369
N++ G ITK S+ NA +NGITAEQII++L+ +AHP++ D
Sbjct: 376 FINMVTGQITKRSIINALDNGITAEQIIAYLETHAHPQMRRLAEAQLEKKRELDQNCKDP 435
Query: 370 MPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+ +P V DQI+LW+ + +RV Y +F S
Sbjct: 436 LQILPPTVVDQIKLWQLEKDRVMAFEGSLYSDFES 470
>gi|242794214|ref|XP_002482325.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces stipitatus ATCC 10500]
gi|218718913|gb|EED18333.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces stipitatus ATCC 10500]
Length = 481
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 132/421 (31%), Positives = 214/421 (50%), Gaps = 46/421 (10%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P AI R LP LAK VM + Y+ +PA +E W+ + + AI L
Sbjct: 22 KLYQQPSTALAIFRRMLPDLAKCLVMALLYMRDPLPAADLELWIKGESKKERDHAISILG 81
Query: 84 QLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLE 135
+L + + + Y + + F S+LR+ L G P P+ ++ +
Sbjct: 82 RLHIMTNTMTSNQVRAYMVTNPFASSLRQALTGAENSQSFGVPCSTPDPNPVS-----IA 136
Query: 136 DLEAYAIGQWECFLLQLISSTQ--AERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQ 193
DL+ YA QWE L ++ ++ + S + ++ Q G L +T+ GF
Sbjct: 137 DLDEYARRQWEGVLGYMVGTSGPGVRETVSLSRGVKQLLQAGHLVEIHHGRVDITQDGFA 196
Query: 194 FLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMI 253
F+L D N Q+W I+ Y+ +++ +N +++SF+ LS G++Y L+ Q +
Sbjct: 197 FVLQDVNTQVWQIIILYVESAKALQVNDVEVLSFIFLLSSLELGQSYEKKHLTPDQLRAL 256
Query: 254 KDFADLGLV-KLQQGRKENWFIPTKLATNLS-----MSLTDSSARK-----------EGF 296
D D G+V + + + F PT+LAT L+ +S T SSA GF
Sbjct: 257 TDLTDFGIVYQYPSAAEADRFYPTRLATTLTSDSSALSNTMSSALSAQTNAATGEPGSGF 316
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
I++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GITA+QIIS
Sbjct: 317 IIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQIIS 376
Query: 357 FLQQNAHP--RVADRMPS-----------VPENVCDQIRLWESDLNRVEMTPAHYYDEFP 403
+L +AHP R D + S +P V DQIRLW+ + +R++ TP + EF
Sbjct: 377 YLSTHAHPQMRKEDAVKSTSNTAGLPRSVLPPTVVDQIRLWQLERDRIKATPGFLFKEFA 436
Query: 404 S 404
S
Sbjct: 437 S 437
>gi|212535854|ref|XP_002148083.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
gi|210070482|gb|EEA24572.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
Length = 481
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 132/435 (30%), Positives = 219/435 (50%), Gaps = 49/435 (11%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLP 69
+++ + LT +LY+ P AI R LP LAK VM + Y+ +PA +E WV
Sbjct: 11 YLEQLPGLT---FQKLYQQPSTALAIFRRMLPDLAKCLVMAVLYMRDPLPAADLELWVKA 67
Query: 70 DGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYG------GALPRE 121
+ + AI L +L + + + Y + + F S+LR+ L G
Sbjct: 68 ESKKERDHAISILGRLHIMTNTMTSNQVRAYMVTNPFASSLRQALTGAENSQSFGVPCST 127
Query: 122 PMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQ--AERPTNFSSSMMKVFQRGLLSR 179
P P+ ++ + DL+ YA QWE L ++ ++ + S + ++ Q G L
Sbjct: 128 PDPNAVS-----IADLDEYARRQWEGVLGYMVGTSGPGVRETVSLSRGVKQLLQAGHLVE 182
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
+T+ GF F+L D N Q+W I+ Y+ +++ +N +++SF+ LS G++
Sbjct: 183 IHHGRVDITQDGFAFVLQDVNTQVWQIIILYVESAKALQVNDVEVLSFIFLLSSLELGQS 242
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKE-NWFIPTKLATNL-------------SMS 285
Y L+ Q + D D G++ E + F PT+LAT L S+S
Sbjct: 243 YEKRHLTPDQLRALADLTDFGIIYQHAPASEADRFYPTRLATTLTSDSSALSNTMSSSLS 302
Query: 286 LTDSSARKE---GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLY 342
++A E GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+
Sbjct: 303 AQTNAATGEPGSGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIR 362
Query: 343 NAFENGITAEQIISFLQQNAHPRVADR-----------MPS--VPENVCDQIRLWESDLN 389
A E GITA+QIIS+L +AHP++ +P +P V DQIRLW+ + +
Sbjct: 363 RAVEMGITADQIISYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERD 422
Query: 390 RVEMTPAHYYDEFPS 404
R++ TP + EF S
Sbjct: 423 RIKATPGFLFKEFAS 437
>gi|320167307|gb|EFW44206.1| transcription factor tfb2 [Capsaspora owczarzaki ATCC 30864]
Length = 406
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 119/344 (34%), Positives = 193/344 (56%), Gaps = 19/344 (5%)
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGG-----ALPREPMPS 125
G H A++RL LR+ +T + L+ +F+ LR L GG A P EP
Sbjct: 3 GGGAHLEALERLKALRI---AVANDTHWLLSPSFRVGLRTALSGGGKSWAGADPLEP--- 56
Query: 126 GITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEA- 184
A + +LE Y+ +WE +LQ +++ E T +++ + LS + A
Sbjct: 57 --DAHAKDISELEQYSRARWEA-VLQFMAAANTEGVTQEVVNVLIDAELISLSTTGEGAG 113
Query: 185 -PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
P +T GFQFLL D Q+WY + +Y+++ Q+RG + + ++ L LSF G Y +
Sbjct: 114 RPVITNKGFQFLLQDVATQVWYFLTQYLNSLQKRGADPVEALALLFRLSFSTVGMDYPVE 173
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS--SARKEGFIVVET 301
L+E Q +++ ++GLV ++ R F PT LA NL+ + +A +G+IVVET
Sbjct: 174 GLTEGQLDLLQHLREIGLV-FRRKRTSRRFYPTPLAINLASGSAKNLDAADVKGYIVVET 232
Query: 302 NFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 361
NFR+YAY+ S L +L LF + Y+ PN++ G ++++S+ A G+TAEQ+I FL+ +
Sbjct: 233 NFRIYAYTDSPLQLALLSLFVDLRYRFPNMVCGLLSRDSVRKALVKGLTAEQMIRFLRTH 292
Query: 362 AHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
AHP++ R P +PE + DQ+RLWE + NR+ + PA Y+ F ++
Sbjct: 293 AHPQMRSRTPVLPETISDQLRLWELERNRLRVLPAVLYERFSNQ 336
>gi|303321033|ref|XP_003070511.1| Transcription factor tfb2 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240110207|gb|EER28366.1| Transcription factor tfb2 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320036051|gb|EFW17991.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Coccidioides posadasii str. Silveira]
Length = 487
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 139/440 (31%), Positives = 226/440 (51%), Gaps = 54/440 (12%)
Query: 12 MDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
++ + SL +LY+ P AI R LP LAK +VM + Y+ +PA +E WV
Sbjct: 9 LEYLESLPGTVFLKLYQQPSTALAIFRRMLPHLAKCFVMALLYLKDPLPAAELELWVKAG 68
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKE--TTYRLNSTFQSNLRKHLIYG------GALPREP 122
+ A+ L +L + S + Y + + F S+LR+ L G G + P
Sbjct: 69 SKRERDNALSILSRLHILSSTTTSDHVRAYMVTNPFSSSLRQALTGGDKQHSFGVMSSMP 128
Query: 123 MPSGITARLPTLEDLEAYAIGQWECFLLQLISST----QAERPTNFSSSMMKVFQR-GLL 177
P +T + DL+ YA QWE L ++ + Q E T S + + Q L+
Sbjct: 129 DPHPMT-----VSDLDDYARRQWEGVLGYMVGTNSLGIQRENVT-LSKGVKSLLQACHLV 182
Query: 178 SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
RD+ +T+ GF F+L D N Q+W+I+ Y+ N++ G++ +++SFL LS G
Sbjct: 183 EVRDRRV-EITKEGFAFVLQDVNTQVWHILILYVENAEAIGMDSVEVLSFLFLLSSLELG 241
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENW-FIPTKLATNL---SMSLTD----- 288
++Y+ L+ Q + D D G+V E+ F PT+LAT L SM+L+
Sbjct: 242 QSYDKKHLTSTQLRTLADLTDFGIVYQHSPASESTRFYPTRLATTLTSDSMALSSPISGN 301
Query: 289 -----------SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAIT 337
+ A GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G +T
Sbjct: 302 LAPAGPNINAATGAPGTGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKLT 361
Query: 338 KESLYNAFENGITAEQIISFLQQNAHPRV-----------ADRMPS--VPENVCDQIRLW 384
++S+ A E GITA+QIIS+L +AHP++ +P+ +P V DQIRLW
Sbjct: 362 RQSIRRAVEMGITADQIISYLTTHAHPQMRKVNASKSTSTTAGLPASVLPPTVVDQIRLW 421
Query: 385 ESDLNRVEMTPAHYYDEFPS 404
+ + +R++ TP + +F S
Sbjct: 422 QLERDRLKATPGFLFKDFAS 441
>gi|389642093|ref|XP_003718679.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae 70-15]
gi|351641232|gb|EHA49095.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae 70-15]
gi|440469172|gb|ELQ38293.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae Y34]
gi|440481925|gb|ELQ62459.1| RNA polymerase II transcription factor B subunit 2 [Magnaporthe
oryzae P131]
Length = 490
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 132/423 (31%), Positives = 224/423 (52%), Gaps = 44/423 (10%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P AI R LPPLAK +VM + Y+ +P ++ WV P+ A+ L
Sbjct: 23 KLYQQPATAFAIFRRMLPPLAKTFVMSLLYMPQPLPLTALDSWVKPEAKKNKDQALSILR 82
Query: 84 QLRLFS----EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED--- 136
+ + + ++K L F+ +LR L GG+ +PS + +P D
Sbjct: 83 SMHITTITPVTKEKPVQEMSLTPNFKKSLRLALEGGGSHNSFGVPSSLP--IPPQVDVAF 140
Query: 137 LEAYAIGQWECFLLQLISS---TQAERPT---NFSSSMMK------VFQRGLLSRRDKEA 184
L+ +A +W+ L +++S T + P+ NF S ++ + Q GL+ RR E
Sbjct: 141 LDKWARSRWDAILHYVVNSVEETDSMEPSKKYNFGGSKLQDTVKTLLVQGGLVQRRSSER 200
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQA-DLISFLLELSFHVAGEAYNLN 243
+T++GF FLL + NAQ+W ++ +++ + E N+A D++SFL L G+AY+
Sbjct: 201 ISITKTGFTFLLQEANAQVWTLLLQWLHSVNEDNTNRAVDMLSFLFMLGTLELGQAYDTG 260
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL--------SMSLTDSSARKE- 294
LSE +++M+ D D GLV + + +F PT+LAT L S+S ++A E
Sbjct: 261 ALSEERRNMLPDLNDFGLVYIPPSNPDQYF-PTRLATTLTSGSSALRSVSSGVAAATAEA 319
Query: 295 ------GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENG 348
G I++ETNFR+YAY+++ L IL LF+ ++ + ++ G +++ S+ A +G
Sbjct: 320 GENNTKGAIILETNFRIYAYTSTPLQIAILALFANLKMRFAGMVTGQLSRHSIKRAISHG 379
Query: 349 ITAEQIISFLQQNAHP---RVAD--RMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFP 403
ITA+QII +L +AH R+A P +P V DQIRLW+ + R+++ + + +F
Sbjct: 380 ITADQIIEYLASHAHEQMHRIAAIRNKPVLPPTVVDQIRLWQLETERMQVQRGYLFKDFE 439
Query: 404 SRV 406
S+
Sbjct: 440 SQA 442
>gi|296818081|ref|XP_002849377.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
otae CBS 113480]
gi|238839830|gb|EEQ29492.1| RNA polymerase II transcription factor B subunit 2 [Arthroderma
otae CBS 113480]
Length = 484
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 131/419 (31%), Positives = 212/419 (50%), Gaps = 46/419 (10%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P A+ R LP LAK +VM + Y+ +PA +E WV D + ++ L
Sbjct: 22 KLYQQPSTALAVFRRMLPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILG 81
Query: 84 QLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP-----TLED 136
+L + S Y + F ++LR+ L+ G R G+ + P ++ D
Sbjct: 82 RLHILSSTTTSNHVRAYMVTEPFSTSLRQALM--GEDQRHSF--GVISMTPDKHHVSVAD 137
Query: 137 LEAYAIGQWECFLLQLI--SSTQAERPT-NFSSSMMKVFQR-GLLSRRDKEAPRLTESGF 192
L+ YA QWE L ++ SS R T S + ++ Q L+ RD+ +T+ GF
Sbjct: 138 LDEYARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEIRDRRV-EITKEGF 196
Query: 193 QFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSM 252
F+L D N Q+W+I+ Y+ N++ + +++SF+ LS G++Y N L+ Q
Sbjct: 197 AFVLQDVNTQVWHILILYVENAEAIHMESVEVLSFIFLLSSLELGQSYEKNDLTPTQLRT 256
Query: 253 IKDFADLGLV-KLQQGRKENWFIPTKLATNL--------------------SMSLTDSSA 291
+ D D G+V + F PT+LAT L S T ++
Sbjct: 257 LADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMPGPVSGEPVVPAPSTGPTSATD 316
Query: 292 RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITA 351
GFI+VETN+R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GITA
Sbjct: 317 GGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITA 376
Query: 352 EQIISFLQQNAHPRVADRMPS--------VPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
+QI+S+L +AHP++ S +P V DQIRLW+ + +R++ T + EF
Sbjct: 377 DQIVSYLTTHAHPQMRKTKKSTSNVAASILPPTVVDQIRLWQLERDRIKATSGFLFKEF 435
>gi|345569654|gb|EGX52519.1| hypothetical protein AOL_s00043g13 [Arthrobotrys oligospora ATCC
24927]
Length = 452
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 125/397 (31%), Positives = 212/397 (53%), Gaps = 30/397 (7%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
RLY+ P A+ R LP LAK +VM M + +P +E +V PD A +L
Sbjct: 22 RLYQQPATALAVFRRMLPNLAKTFVMAMLFNPDPVPVSSLEAFVRPDQNRLKEEAFGKLR 81
Query: 84 QLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITAR-LPTLEDLEAYAI 142
QL + +E ++ T+ L++TF+ LRK L GG+ +P + + L+ YA
Sbjct: 82 QLYIVTE---RQGTFALDATFKVELRKALTGGGSPGSFGLPCDTEDKNRVDISFLDRYAA 138
Query: 143 GQWECFLLQLIS--STQAERPTNFSSSMMKVFQRGLLSRRDKEAPR-LTESGFQFLLMDT 199
QWE L ++ + ++RP S ++++ G L +++ + +T+ GF FLL +
Sbjct: 139 EQWESILHFMVGNETQSSQRP---SEGVVRLLLHGKLMEQNRASGTVITQLGFSFLLQEV 195
Query: 200 NAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADL 259
N+Q+W ++ Y+ +++ ++ D++ FL L G+ Y++ +L+ QK M++D D
Sbjct: 196 NSQVWTLLLLYLDMAEQLRMDPVDILHFLFLLGSLELGQDYDMKSLTPTQKGMLEDLRDY 255
Query: 260 GLVKLQQGRKENWFIPTKLATNLS---MSLTDSSAR---------KEGFIVVETNFRMYA 307
G+V Q+ F PT+LAT L+ SL +A + GF+++ETN+
Sbjct: 256 GIV-YQRKSSSRRFYPTRLATGLTSEIRSLRSPAATLANATPGNDENGFLIIETNYH--- 311
Query: 308 YSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA 367
S L +L LF+++ + PNL+ IT+ES+ A GITAEQII ++ NAHP++
Sbjct: 312 ---SPLQIAVLNLFARLISRFPNLVTARITRESIRQAIGKGITAEQIIDYMTANAHPQLR 368
Query: 368 DRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
P +P V DQIRLW+ + R+++T + + EF +
Sbjct: 369 KNSPVLPPTVVDQIRLWQIEGERMKVTSGYLFKEFAT 405
>gi|444320663|ref|XP_004180988.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
gi|387514031|emb|CCH61469.1| hypothetical protein TBLA_0E04140 [Tetrapisispora blattae CBS 6284]
Length = 518
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 124/436 (28%), Positives = 216/436 (49%), Gaps = 61/436 (13%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+RLYE+P C AI R LP LAK ++M M + + + + ++ WV +G + + AI +
Sbjct: 23 NRLYESPATCLAIYRLLPQLAKFFIMSMVFNENEVMLRDLDRWVKSNGKLQFQEAIKSMK 82
Query: 84 QLRLFSEEKKKETTY-RLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAI 142
L L K LNSTF+S+L+ L G + + LE Y+
Sbjct: 83 SLHLLVPGKNSGAVMINLNSTFRSSLKNALTGGEINNSFGVIVDNDRDAVKVSMLEEYST 142
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP--RLTESGFQFLLMDTN 200
+WE L ++ + A P S +++ + + L ++E ++T GFQFLL ++N
Sbjct: 143 KKWETILHFMVGTPLATTP---SENVLNLLKHSKLMEENEETNEFKITNEGFQFLLQESN 199
Query: 201 AQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLG 260
+Q+W ++ +Y+ ++ ++ D+++F+ L G+AY++++LSE QK M+KD D G
Sbjct: 200 SQIWTLLLQYLKLTETLHMDPVDVLNFIFMLGALEFGKAYSISSLSETQKIMLKDMRDYG 259
Query: 261 LVKLQQGRKENWFIPTKLATNLSMSL---------------------------------- 286
L+ Q+ N F PT +AT L+
Sbjct: 260 LI-FQKNSNANVFYPTSMATMLTSDARNVRTASGAIDHILQRSQETNKNSNNDDDEDVDQ 318
Query: 287 --TDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
+++ A +G +++ETNF++Y+YS S L +L LF ++ + N++ G IT++S+ A
Sbjct: 319 VGSNTQATADGALIIETNFKLYSYSNSPLQIAVLSLFVHLKTRFSNMVTGQITRDSIRRA 378
Query: 345 FENGITAEQIISFLQQNAHPRV------------------ADRMPSVPENVCDQIRLWES 386
NGITAEQII++L ++HP++ D + +P V DQI+LW+
Sbjct: 379 LRNGITAEQIIAYLLTHSHPQMRRLAEGNLEKKLELDPNAKDSLQILPPTVVDQIKLWQL 438
Query: 387 DLNRVEMTPAHYYDEF 402
+L+R+ Y +F
Sbjct: 439 ELDRIISHEGSLYSDF 454
>gi|327298217|ref|XP_003233802.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton rubrum CBS 118892]
gi|326463980|gb|EGD89433.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton rubrum CBS 118892]
Length = 484
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 132/418 (31%), Positives = 211/418 (50%), Gaps = 44/418 (10%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P AI R LP LAK +VM + Y+ +PA +E WV D + ++ L
Sbjct: 22 KLYQQPSTALAIFRRMLPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILG 81
Query: 84 QLRLFSEEKKKE--TTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP-TLEDLEAY 140
+L + S Y + F ++LR+ L+ + S + P ++ DL+ Y
Sbjct: 82 RLHILSSTTTSNHVRAYMVTEPFSTSLRQALMGEDQQGSFGVISVTPDKYPVSVADLDEY 141
Query: 141 AIGQWECFLLQLI--SSTQAERPT-NFSSSMMKVFQR-GLLSRRDKEAPRLTESGFQFLL 196
A QWE L ++ SS R T S + ++ Q L+ RD+ +T+ GF F+L
Sbjct: 142 ARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEIRDRRV-EITKEGFAFVL 200
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
D N Q+W+I+ Y+ N++ G+ +++SF+ LS G++Y L+ Q + D
Sbjct: 201 QDVNTQVWHILILYVENAEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTLADL 260
Query: 257 ADLGLV-KLQQGRKENWFIPTKLATNLSMSLTDSSARK---------------------- 293
D G+V + F PT+LAT L+ +DS A
Sbjct: 261 TDFGIVYQHTPASGSTRFYPTRLATTLT---SDSLAMSGPISGEPTVPTTSAGTTTSGDA 317
Query: 294 -EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAE 352
GFI+VETN+R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GITA+
Sbjct: 318 GTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITAD 377
Query: 353 QIISFLQQNAHPRVADRMPS--------VPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
QIIS+L +AHP++ S +P V DQIRLW+ + +R++ T + EF
Sbjct: 378 QIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEF 435
>gi|401623322|gb|EJS41426.1| tfb2p [Saccharomyces arboricola H-6]
Length = 513
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 131/447 (29%), Positives = 220/447 (49%), Gaps = 71/447 (15%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+RLY +P C AI R LPPLAK ++M M + + +P +++WV +G + + AI +
Sbjct: 23 NRLYTSPATCLAIYRILPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQDAIKSMK 82
Query: 84 QLRLFSEEKKKET-TYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAI 142
L L K T LN TF+ +LR L GG + R + + +L+ L+ Y+
Sbjct: 83 SLHLLIPNKSSGTLMINLNPTFKISLRNALT-GGEV-RNSFGIVVEDNVVSLKMLDEYSA 140
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDTNA 201
+WE L ++ + A+ P+ +++K L+ + ++T GFQFLL + N+
Sbjct: 141 NKWETILHFMVGTPLAKIPSEKVLNLLK--HSKLMEEINSTGEFKITNEGFQFLLQEINS 198
Query: 202 QLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGL 261
QLW ++ +Y+ + ++ D++ F+ L G+AY ++ LS+ Q+ M++D D GL
Sbjct: 199 QLWTLLLQYLKVIETSKMDLVDVLHFIFMLGALEVGKAYKIDALSDTQRIMLQDMRDYGL 258
Query: 262 VKLQQGRKENWFIPTKLATNLS---------MSLTDSSARK------------------- 293
V Q+ + F PTKLA L+ + DS R+
Sbjct: 259 V-FQKHSNDTIFYPTKLALMLTSDTKTIRSASNAMDSVLRQNREEPSANEEGANGKATTD 317
Query: 294 ------------------EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
+G ++VETNF++Y+YS S L +L LF ++ + N+++G
Sbjct: 318 TTTSDDLNKADLKNQDIPDGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQ 377
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRMPSVPENV 377
IT+ES+ A NGITAEQII++L+ +AHP++ + + +P V
Sbjct: 378 ITRESIRRALTNGITAEQIIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTV 437
Query: 378 CDQIRLWESDLNRVEMTPAHYYDEFPS 404
DQIRLW+ +L+RV Y +F +
Sbjct: 438 VDQIRLWQLELDRVMTYEGSLYSDFET 464
>gi|410082501|ref|XP_003958829.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
gi|372465418|emb|CCF59694.1| hypothetical protein KAFR_0H02850 [Kazachstania africana CBS 2517]
Length = 514
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 130/448 (29%), Positives = 220/448 (49%), Gaps = 78/448 (17%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLYE+P C AI R LP L+K +VM M + + + + ++ WV +G + + AI +
Sbjct: 24 RLYESPATCLAIYRLLPQLSKFFVMAMVFNENDVSLRDLDRWVKTNGKLQFQEAIKSMKS 83
Query: 85 LRLFSEEKKKET-TYRLNSTFQSNLRKHLI-------YGGALPREPMPSGITARLPTLED 136
L L K T LN TF+ + R L +G + E S IT L
Sbjct: 84 LHLIIPSKGNGTLMISLNPTFKESFRNALTGGEINNSFGIVVDNEAEQSRIT-----LNF 138
Query: 137 LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFL 195
L+ Y+ +WE L ++ + A P++ +++K L+ DK ++T GFQFL
Sbjct: 139 LDEYSANKWETILHFMVGTPLANIPSHNVLNLLK--NSRLMEEADKTGTLKITNEGFQFL 196
Query: 196 LMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKD 255
L + N+Q+W ++ +Y+ S+ ++ ++++F+ L G+AY+++ L+E QK+M++D
Sbjct: 197 LQEINSQIWTLLLQYLKMSESLQMDPVEVLNFIFMLGALEIGKAYSIDGLTETQKTMLRD 256
Query: 256 FADLGLVKLQQGRKENWFIPTKLATNLSM------------------------------- 284
F D G++ Q+ F PT LA L+
Sbjct: 257 FRDYGII-FQKNLTSKSFYPTNLAIMLTSDTRSILRTASGAMDSVLRQNKEDSKKQTSGD 315
Query: 285 SLTDSSAR------------KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLI 332
++T+ +A ++G +++ETNF++Y+YS S L IL LF ++ + N++
Sbjct: 316 NVTEDTANDIEQIGLENQEGRDGSLIIETNFKLYSYSNSPLQIAILSLFVHLKSRFVNMV 375
Query: 333 VGAITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRMPSVP 374
G IT+ES+ A NGITA+QII++L+ +AHPR+ D + +P
Sbjct: 376 TGQITRESIRRALVNGITADQIIAYLETHAHPRMRRLAEEKLEKKLELDPNSKDTLQILP 435
Query: 375 ENVCDQIRLWESDLNRVEMTPAHYYDEF 402
V DQI+LW+ +L+RV Y +F
Sbjct: 436 PTVVDQIKLWQLELDRVITYDGSLYSDF 463
>gi|452839401|gb|EME41340.1| hypothetical protein DOTSEDRAFT_156149 [Dothistroma septosporum
NZE10]
Length = 497
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 134/430 (31%), Positives = 212/430 (49%), Gaps = 45/430 (10%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEW 66
A+ +D + L RLY+ P AI R LP LAK VM M Y+ A ++ W
Sbjct: 5 AQRALDYLEQLPGTTFTRLYQQPSTALAIFRRMLPHLAKTLVMAMLYMPTPFAAADLDTW 64
Query: 67 VLPDGFTKHR--VAIDRLVQLR-LFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPM 123
+ PD ++ A+ L +LR LF ++ + Y+L+ + +LR L GG +
Sbjct: 65 IRPDMPSQQARDKALSTLQRLRILFDDQHNGKPAYKLSPAYGKSLRLALTGGGDHRSFGV 124
Query: 124 PSGITARLP-TLEDLEAYAIGQWECFLLQLISSTQAERP----TNFSSSMMKVFQRG--- 175
P + + E L+ +A QWE L ++ S A R + S + Q+G
Sbjct: 125 PCSTPDKTSISTEYLDTFARRQWEAILYYVVGS--ANRALSGEVDISPGTKSLLQKGEFV 182
Query: 176 -LLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFH 234
++S + ++ +GF FLL D NAQ+W ++ Y+ S ++ +++SFL L
Sbjct: 183 NVMSGGKQRM--ISTTGFTFLLQDVNAQVWSLLIVYLEVSSSLRMDPVEVLSFLFTLGSL 240
Query: 235 VAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL------------ 282
G +Y+ L+ Q+ M+ D AD GL+ + ++ PT+LAT L
Sbjct: 241 ELGISYSTANLTPTQQQMLDDLADFGLIYRRTPDSHRYY-PTRLATTLTSDAPALTNNSL 299
Query: 283 ----------SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLI 332
S L S++ ++GFI++ETN+R+YAY++S L IL LF+ + + PNLI
Sbjct: 300 MKTTVMTGDPSTDLAASASDEKGFIILETNYRLYAYTSSPLLISILSLFASLNTRYPNLI 359
Query: 333 VGAITKESLYNAFENGITAEQIISFLQQNAHP-----RVADRMPSVPENVCDQIRLWESD 387
ITK S NA GIT++QII +L +AHP A+ P +P V DQI+LW+ +
Sbjct: 360 TAKITKASTQNAIAAGITSDQIIDYLTTHAHPVLRRQAAANDAPILPPTVVDQIKLWQME 419
Query: 388 LNRVEMTPAH 397
R+E T +
Sbjct: 420 GERMEATKGY 429
>gi|344229873|gb|EGV61758.1| transcription factor Tfb2 [Candida tenuis ATCC 10573]
Length = 498
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 136/462 (29%), Positives = 230/462 (49%), Gaps = 70/462 (15%)
Query: 3 LVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKM 62
L K+ N ++ SL RLY P C +I R LPP+AK ++M M + + + K
Sbjct: 6 LFKVTVINHLE---SLDDNTQSRLYAAPATCLSIYRLLPPMAKFHIMSMLFYEKNVAIKD 62
Query: 63 MEEWVLPDGFTKHRVAIDRLVQLRL--FSEEKKKETTYRLNSTFQSNLRKHLIYGGALPR 120
+E+W P+ +++R+ L L SE K RL+ F+ N L G P
Sbjct: 63 LEKWSKPNTRHLQIESLERMRSLHLVQLSESGKY---IRLHPIFKKNFADCLT-GNQAPD 118
Query: 121 E--PMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS 178
+ + + L+ L+ +A +WE L ++ + + P S S++++ + G L
Sbjct: 119 AFGNLDTTVEKHTVDLKFLDTFASTKWETILHYMVGTELSSTP---SRSVLRLLKSGGLM 175
Query: 179 RRDKEAP---RLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHV 235
+ + ++T GFQFLL D NAQ+W ++ +Y+ + E ++ D+++F+ L
Sbjct: 176 EGNSDYAQELKITNKGFQFLLQDINAQIWTLLLQYLDLTLELQMDAVDVLNFIFLLGSLE 235
Query: 236 AGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL----------SMS 285
G++Y+++ LS+ Q M+ D D GLV Q+ N F PT+LAT L SM+
Sbjct: 236 LGKSYSVSALSDTQIEMLSDLRDFGLV-YQRSATSNKFYPTRLATTLTSDSNSLQTPSMA 294
Query: 286 L---------TDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAI 336
+ TDS R+E I++ETNF++Y+Y+ S L IL LF + + N++ G I
Sbjct: 295 IDKANSGIDSTDSKQRQES-IIIETNFKVYSYTQSPLEIAILNLFVHLRTRFANMVTGQI 353
Query: 337 TKESLYNAFENGITAEQIISFLQQNAHPRVA----------------------------- 367
T+ES+ NA NGITA Q+I FL+ +AHP++
Sbjct: 354 TRESIRNALYNGITAGQVIKFLETHAHPQMKMLAQEKLDKKIEFDASNNINTATGKSTDA 413
Query: 368 ---DRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
++ +P NV DQI+LW+ +L+R++ + + EF +++
Sbjct: 414 PMQHKLEILPPNVVDQIKLWQLELDRIQTFDGYLFREFNNQI 455
>gi|323302721|gb|EGA56527.1| Tfb2p [Saccharomyces cerevisiae FostersB]
Length = 505
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 130/447 (29%), Positives = 221/447 (49%), Gaps = 71/447 (15%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+RLY +P C AI R LPPLAK ++M M + + +P +++WV +G + + AI +
Sbjct: 23 NRLYTSPATCLAIYRILPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMK 82
Query: 84 QLRLFSEEKKKET-TYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAI 142
L L K T LN TF+ +LR L GG + + + + +L+ L+ Y+
Sbjct: 83 SLHLLIPNKSSGTLMINLNPTFKISLRNALT-GGEV-QNSFGVVVEENVVSLDLLDEYSA 140
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDTNA 201
+WE L ++ + A+ P+ +++K L+ + ++T GFQFLL + N+
Sbjct: 141 NKWETILHFMVGTPLAKIPSEKVLNLLK--HSKLMEEVNSTGEFKITNEGFQFLLQEINS 198
Query: 202 QLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGL 261
QLW ++ +Y+ + ++ D++ F+ L G+AY ++ LSE Q+ M++D D GL
Sbjct: 199 QLWTLLLQYLKMIETSKMDLVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGL 258
Query: 262 VKLQQGRKENWFIPTKLATNLS---------MSLTDSSARK------------------- 293
V Q+ ++ F PTKLA L+ + DS R+
Sbjct: 259 V-FQKHSNDSIFYPTKLALMLTSDTKTIRSASNAMDSVLRQNREEPSVNEDGANGKSTTD 317
Query: 294 ------------------EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
+G ++VETNF++Y+YS S L +L LF ++ + N+++G
Sbjct: 318 ITTSDDLNKAGLKNQDIPDGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQ 377
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRMPSVPENV 377
IT+ES+ A NGITA+QII++L+ +AHP++ + + +P V
Sbjct: 378 ITRESIRRALTNGITADQIIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTV 437
Query: 378 CDQIRLWESDLNRVEMTPAHYYDEFPS 404
DQIRLW+ +L+RV Y +F +
Sbjct: 438 VDQIRLWQLELDRVITYEGSLYSDFET 464
>gi|299750117|ref|XP_001836551.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
cinerea okayama7#130]
gi|298408751|gb|EAU85259.2| transcription factor TFIIH complex subunit Tfb2 [Coprinopsis
cinerea okayama7#130]
Length = 403
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 104/282 (36%), Positives = 171/282 (60%), Gaps = 8/282 (2%)
Query: 132 PTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESG 191
P++ L+AYA+ +WE L ++SS +RPT S + + + L +A ++ +G
Sbjct: 76 PSIAALDAYALERWETILHYMVSSGSGQRPTEPSPGVRFLLKTSGLMNEIHDALHISSAG 135
Query: 192 FQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKS 251
FQFLL + QLW+++ +Y+ +++R ++ D++SF LS G Y++ L++ Q +
Sbjct: 136 FQFLLHSPHEQLWHLLLQYLQLAEQRRMDLVDVLSFFFMLSTMELGREYSVQNLTKTQSA 195
Query: 252 MIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTD-------SSARKEGFIVVETNFR 304
M++D D GL+ Q+ + F PT+L+T L+ S SS +EGFI++ETN+R
Sbjct: 196 MLEDLRDYGLI-WQRKQTSKRFSPTRLSTTLTSSSPPLPSTSGASSGPQEGFIILETNYR 254
Query: 305 MYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 364
+YAY+ + L +L LF+ ++Y+ PNL+VG+IT+ES+ A NGI+A+QIIS+L +AHP
Sbjct: 255 IYAYTDNPLQTAVLSLFASLKYRFPNLVVGSITRESVKKALLNGISADQIISYLITHAHP 314
Query: 365 RVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
+ P +P V DQIRLWE + NR++ Y F S+
Sbjct: 315 NMRKNNPLLPVTVQDQIRLWELEKNRLKSREGFLYTAFASQA 356
>gi|6325135|ref|NP_015203.1| Tfb2p [Saccharomyces cerevisiae S288c]
gi|62901134|sp|Q02939.1|TFB2_YEAST RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=General transcription and DNA repair
factor IIH subunit TFB2; Short=TFIIH subunit TFB2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|1163092|gb|AAB68240.1| Tfb2p: TFIIH subunit [Saccharomyces cerevisiae]
gi|1778059|gb|AAB40628.1| transcription/repair factor TFIIH subunit Tfb2, partial
[Saccharomyces cerevisiae]
gi|151942676|gb|EDN61022.1| transcription initiation factor TFIIH subunit [Saccharomyces
cerevisiae YJM789]
gi|190407835|gb|EDV11100.1| RNA polymerase II transcription factor B subunit 2 [Saccharomyces
cerevisiae RM11-1a]
gi|256274256|gb|EEU09164.1| Tfb2p [Saccharomyces cerevisiae JAY291]
gi|259150035|emb|CAY86838.1| Tfb2p [Saccharomyces cerevisiae EC1118]
gi|285815419|tpg|DAA11311.1| TPA: Tfb2p [Saccharomyces cerevisiae S288c]
gi|323331200|gb|EGA72618.1| Tfb2p [Saccharomyces cerevisiae AWRI796]
gi|323335024|gb|EGA76314.1| Tfb2p [Saccharomyces cerevisiae Vin13]
gi|323346175|gb|EGA80465.1| Tfb2p [Saccharomyces cerevisiae Lalvin QA23]
gi|323352000|gb|EGA84539.1| Tfb2p [Saccharomyces cerevisiae VL3]
gi|365762780|gb|EHN04313.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|392295887|gb|EIW06990.1| Tfb2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 513
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 130/447 (29%), Positives = 221/447 (49%), Gaps = 71/447 (15%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+RLY +P C AI R LPPLAK ++M M + + +P +++WV +G + + AI +
Sbjct: 23 NRLYTSPATCLAIYRILPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMK 82
Query: 84 QLRLFSEEKKKET-TYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAI 142
L L K T LN TF+ +LR L GG + + + + +L+ L+ Y+
Sbjct: 83 SLHLLIPNKSSGTLMINLNPTFKISLRNALT-GGEV-QNSFGVVVEENVVSLDLLDEYSA 140
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDTNA 201
+WE L ++ + A+ P+ +++K L+ + ++T GFQFLL + N+
Sbjct: 141 NKWETILHFMVGTPLAKIPSEKVLNLLK--HSKLMEEVNSTGEFKITNEGFQFLLQEINS 198
Query: 202 QLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGL 261
QLW ++ +Y+ + ++ D++ F+ L G+AY ++ LSE Q+ M++D D GL
Sbjct: 199 QLWTLLLQYLKMIETSKMDLVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGL 258
Query: 262 VKLQQGRKENWFIPTKLATNLS---------MSLTDSSARK------------------- 293
V Q+ ++ F PTKLA L+ + DS R+
Sbjct: 259 V-FQKHSNDSIFYPTKLALMLTSDTKTIRSASNAMDSVLRQNREEPSVNEDGANGKSTTD 317
Query: 294 ------------------EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
+G ++VETNF++Y+YS S L +L LF ++ + N+++G
Sbjct: 318 ITTSDDLNKAGLKNQDIPDGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQ 377
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRMPSVPENV 377
IT+ES+ A NGITA+QII++L+ +AHP++ + + +P V
Sbjct: 378 ITRESIRRALTNGITADQIIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTV 437
Query: 378 CDQIRLWESDLNRVEMTPAHYYDEFPS 404
DQIRLW+ +L+RV Y +F +
Sbjct: 438 VDQIRLWQLELDRVITYEGSLYSDFET 464
>gi|45187599|ref|NP_983822.1| ADL274Wp [Ashbya gossypii ATCC 10895]
gi|62901343|sp|Q75B51.1|TFB2_ASHGO RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|44982337|gb|AAS51646.1| ADL274Wp [Ashbya gossypii ATCC 10895]
gi|374107034|gb|AEY95942.1| FADL274Wp [Ashbya gossypii FDAG1]
Length = 514
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 129/441 (29%), Positives = 222/441 (50%), Gaps = 72/441 (16%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLY++P C AI R L P+AK ++M M + D + + ++ WV PD + AI +
Sbjct: 33 RLYQSPATCLAIYRLLSPMAKFFIMSMIFQDEEVSLRDLDRWVKPDAKFQLHDAIKSMKL 92
Query: 85 LRLFSEEKKKE-TTYRLNSTFQSNLRKHLI-------YGGALPREPMPSGITARLPTLED 136
L L +E + + +LNS F+ + + L +G + E P T+
Sbjct: 93 LHLITEGRSGQPLMVQLNSIFKESFKNALTGGEVKNSFGNVVEEENDPV-------TMAM 145
Query: 137 LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLL 196
L++YA +WE L ++ + + P S+++ + + KE ++T GFQFLL
Sbjct: 146 LDSYAADKWETILHFMVGTPLTKSPGKNVLSLLRHSKLMEVDESSKEL-KITNEGFQFLL 204
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
D NAQ+W ++ +Y++ ++ ++ D+++ + + G+AY++ LSE QK+M++D
Sbjct: 205 QDANAQIWTLLLQYLTMAETFQMDPVDVLNLIFMIGALELGKAYSVVGLSETQKTMLQDL 264
Query: 257 ADLGLVKLQQGRKENWFIPTKLATNLS---MSLTDSSA--------RKEGF--------- 296
D GLV Q+ + F PT+LAT L+ +S+ +S R EG
Sbjct: 265 RDYGLV-FQKQSNLSKFYPTRLATMLTSDVVSIRSASGAVNSVLRQRAEGVDGKVLNGTA 323
Query: 297 -----------------IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKE 339
++VETNF++Y+YS S L IL LF ++ + N++ G IT+E
Sbjct: 324 LGDDDLQAGGEGALDGALIVETNFKLYSYSNSPLQIAILSLFIHLKTRFQNMVTGQITRE 383
Query: 340 SLYNAFENGITAEQIISFLQQNAHP---RVA---------------DRMPSVPENVCDQI 381
S+ A NGITA+QII++++ +AHP R+A D + +P V DQI
Sbjct: 384 SIRRALHNGITADQIIAYMETHAHPQMRRLAGDNLEKKLELDPNCRDTLQVLPPTVVDQI 443
Query: 382 RLWESDLNRVEMTPAHYYDEF 402
+LW+ +L+R+ + + +F
Sbjct: 444 KLWQLELDRIISYDGYLFRDF 464
>gi|315041383|ref|XP_003170068.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
gypseum CBS 118893]
gi|311345102|gb|EFR04305.1| TFIIH basal transcription factor complex p52 subunit [Arthroderma
gypseum CBS 118893]
Length = 484
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 128/415 (30%), Positives = 208/415 (50%), Gaps = 38/415 (9%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P A+ R LP LAK +VM + Y+ +PA +E WV D + ++ L
Sbjct: 22 KLYQQPSTALAVFRRMLPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILG 81
Query: 84 QLRLFSEEKKKE--TTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP-TLEDLEAY 140
+L + S Y + F ++LR+ L+ + S + P ++ DL+ Y
Sbjct: 82 RLHILSSTTSSSHVRAYMVTEPFSTSLRQALMGEDQKGSFGVISVTPDKYPVSVADLDEY 141
Query: 141 AIGQWECFLLQLI--SSTQAERPT-NFSSSMMKVFQR-GLLSRRDKEAPRLTESGFQFLL 196
A QWE L ++ SS R T S + ++ Q L+ RD+ +T+ GF F+L
Sbjct: 142 ARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEIRDRRV-EITKEGFAFVL 200
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
D N Q+W+I+ Y+ N++ G+ +++SF+ LS G++Y L+ Q + D
Sbjct: 201 QDVNTQVWHILILYVENAEAIGMESVEVLSFIFLLSSLELGQSYEKKNLTPTQLRTLADL 260
Query: 257 ADLGLV-KLQQGRKENWFIPTKLATNLSMSLTDSSARKEG-------------------- 295
D G+V + F PT+LAT L+ S G
Sbjct: 261 TDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISGEPTVPAAAAGATTAGDAGTG 320
Query: 296 FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQII 355
FI+VETN+R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GITA+QI+
Sbjct: 321 FIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITADQIV 380
Query: 356 SFLQQNAHPRVADRMPS--------VPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
S+L +AHP++ S +P V DQIRLW+ + +R++ T + EF
Sbjct: 381 SYLTTHAHPQMRKTKKSTTNVSSTVLPPTVVDQIRLWQLERDRIKATSGFLFKEF 435
>gi|326483225|gb|EGE07235.1| transcription factor Tfb2 [Trichophyton equinum CBS 127.97]
Length = 484
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 132/418 (31%), Positives = 210/418 (50%), Gaps = 44/418 (10%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P AI R LP LAK +VM + Y+ +PA +E WV D + ++ L
Sbjct: 22 KLYQQPSTALAIFRRMLPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILG 81
Query: 84 QLRLFSEEKKKE--TTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP-TLEDLEAY 140
+L + S Y + F ++LR+ L+ + S + P ++ DL+ Y
Sbjct: 82 RLHILSSTTTSNHVRAYMVTEPFSTSLRQALMGEDQQGSFGVISVTPDKYPVSVADLDEY 141
Query: 141 AIGQWECFLLQLI--SSTQAERPT-NFSSSMMKVFQR-GLLSRRDKEAPRLTESGFQFLL 196
A QWE L ++ SS R T S + ++ Q L+ RD+ +T+ GF F+L
Sbjct: 142 ARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEIRDRRV-EITKEGFAFVL 200
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
D N Q+W+I+ Y+ N++ G+ +++SF+ LS G++Y L+ Q + D
Sbjct: 201 QDVNTQVWHILILYVENAEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTLADL 260
Query: 257 ADLGLV-KLQQGRKENWFIPTKLATNLSMSLTDSSARK---------------------- 293
D G+V + F PT+LAT L+ +DS A
Sbjct: 261 TDFGIVYQHTPASGSARFYPTRLATTLT---SDSLAMSGPISGEPAVPTTTSGTTTSGDA 317
Query: 294 -EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAE 352
GFI+VETN R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GITA+
Sbjct: 318 GTGFIIVETNHRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITAD 377
Query: 353 QIISFLQQNAHPRVADRMPS--------VPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
QIIS+L +AHP++ S +P V DQIRLW+ + +R++ T + EF
Sbjct: 378 QIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEF 435
>gi|339237219|ref|XP_003380164.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
gi|316977049|gb|EFV60220.1| general transcription factor IIH subunit 4 [Trichinella spiralis]
Length = 674
Score = 190 bits (483), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 123/400 (30%), Positives = 206/400 (51%), Gaps = 6/400 (1%)
Query: 7 IAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEW 66
+ + F++ ++ L + L++LYE+ C I R LP +A++ VM++ + +E W
Sbjct: 7 VPQTFLNYLSQLPSGILNKLYESAVACTGIYRYLPSVAQQIVMRLSLVSSGTTIADIEGW 66
Query: 67 VLPDGFTKHRVAIDRLVQLRLFSEEKKKET-TYRLNSTFQSNLRKHLIYGGALPREPMPS 125
++ + ++ L QL + E + LN F NLR L+ A+ + +
Sbjct: 67 MVDEKKDILHESLKYLRQLHILQECNLSSIESVVLNRVFAKNLRLALLCKDAICFKTVTV 126
Query: 126 GITARLPTLEDLEAYAIGQWECFLLQL-ISSTQAERPTNFSSSMMKVFQRGLLSRRDKEA 184
+ + DL++YA +WE L L + S Q+E+ + + +V Q L +
Sbjct: 127 DPKHQ-KSFADLDSYASERWESVLKYLALPSAQSEKSVSVETK--RVLQDSGLIQLCDSK 183
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNT 244
+LT GFQF+L D QLW + Y++ +++G D I +L+ AY+
Sbjct: 184 MQLTSDGFQFILYDRRQQLWTYLLHYLAQLEKKGSPVHDCIMLILQACLGSHRAAYSTEN 243
Query: 245 LSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFR 304
L+E + I+ ++GLV Q+ R WF T L + L+ + SS+ KEGF++VETNFR
Sbjct: 244 LTEAALNFIQHLREIGLVH-QRKRSAGWFYYTPLISVLTGLKSSSSSSKEGFLIVETNFR 302
Query: 305 MYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP 364
+Y Y+ S L I+ F + Y+ PNL+ + +ES+ AF+ I+AEQII +L NAH
Sbjct: 303 VYCYTDSVLDLAIVSTFCEPLYRFPNLVACILNRESVRRAFQVNISAEQIIQYLFSNAHK 362
Query: 365 RVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+ + P++P V DQI+LWE + +R + P Y F S
Sbjct: 363 NMQKQTPTIPSTVTDQIKLWEMERDRFKFDPGVMYSNFFS 402
>gi|349581695|dbj|GAA26852.1| K7_Tfb2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 513
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 129/447 (28%), Positives = 221/447 (49%), Gaps = 71/447 (15%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+RLY +P C AI R LPPLAK ++M M + + +P +++WV +G + + AI +
Sbjct: 23 NRLYTSPATCLAIYRILPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMK 82
Query: 84 QLRLFSEEKKKET-TYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAI 142
L L K T LN TF+ +LR L GG + + + + +L+ L+ Y+
Sbjct: 83 SLHLLIPNKSSGTLMINLNPTFKISLRNALT-GGEV-QNSFGVVVEENVVSLDLLDEYSA 140
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDTNA 201
+WE L ++ + A+ P+ +++K L+ + ++T GFQFLL + N+
Sbjct: 141 NKWETILHFMVGTPLAKIPSEKVLNLLK--HSKLMEEVNSTGEFKITNEGFQFLLQEINS 198
Query: 202 QLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGL 261
QLW ++ +Y+ + ++ D++ F+ L G+AY ++ LS+ Q+ M++D D GL
Sbjct: 199 QLWTLLLQYLKMIETSKMDLVDVLHFIFMLGALEVGKAYKIDALSQTQRIMLQDMRDYGL 258
Query: 262 VKLQQGRKENWFIPTKLATNLS---------MSLTDSSARK------------------- 293
V Q+ ++ F PTKLA L+ + DS R+
Sbjct: 259 V-FQKHSNDSIFYPTKLALMLTSDTKTIRSASNAMDSVLRQNREEPSVNEDGANGKSTTD 317
Query: 294 ------------------EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
+G ++VETNF++Y+YS S L +L LF ++ + N+++G
Sbjct: 318 ITTSDDLNKAGLKNQDIPDGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQ 377
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRMPSVPENV 377
IT+ES+ A NGITA+QII++L+ +AHP++ + + +P V
Sbjct: 378 ITRESIRRALTNGITADQIIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTV 437
Query: 378 CDQIRLWESDLNRVEMTPAHYYDEFPS 404
DQIRLW+ +L+RV Y +F +
Sbjct: 438 VDQIRLWQLELDRVITYEGSLYSDFET 464
>gi|326474968|gb|EGD98977.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
Tfb2 [Trichophyton tonsurans CBS 112818]
Length = 489
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 132/418 (31%), Positives = 209/418 (50%), Gaps = 44/418 (10%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P AI R LP LAK +VM + Y+ +PA +E WV D + ++ L
Sbjct: 22 KLYQQPSTALAIFRRMLPHLAKCFVMALLYLKDPLPATELELWVRSDSRRERDNSLSILG 81
Query: 84 QLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP-TLEDLEAY 140
+L + S Y + F ++LR+ L+ + S + P ++ DL Y
Sbjct: 82 RLHILSSTTTSNHVRAYMVTEPFSTSLRQALMGEDQQGSFGVISVTPDKYPVSVADLNEY 141
Query: 141 AIGQWECFLLQLI--SSTQAERPT-NFSSSMMKVFQRG-LLSRRDKEAPRLTESGFQFLL 196
A QWE L ++ SS R T S + ++ Q L+ RD+ +T+ GF F+L
Sbjct: 142 ARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEIRDRRV-EITKEGFAFVL 200
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
D N Q+W+I+ Y+ N++ G+ +++SF+ LS G++Y L+ Q + D
Sbjct: 201 QDVNTQVWHILILYVENAEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPTQLRTLADL 260
Query: 257 ADLGLV-KLQQGRKENWFIPTKLATNLSMSLTDSSARK---------------------- 293
D G+V + F PT+LAT L+ +DS A
Sbjct: 261 TDFGIVYQHTPASGSARFYPTRLATTLT---SDSLAMSGPISGEPAVPTTTSGTTTSGDA 317
Query: 294 -EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAE 352
GFI+VETN R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E GITA+
Sbjct: 318 GTGFIIVETNHRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEMGITAD 377
Query: 353 QIISFLQQNAHPRVADRMPS--------VPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
QIIS+L +AHP++ S +P V DQIRLW+ + +R++ T + EF
Sbjct: 378 QIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLFKEF 435
>gi|156057729|ref|XP_001594788.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980]
gi|154702381|gb|EDO02120.1| hypothetical protein SS1G_04596 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 420
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 201/371 (54%), Gaps = 28/371 (7%)
Query: 58 IPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF---SEEKKKETTYRLNSTFQSNLRKHLIY 114
+P +++WV P + A+ L +L + + ++ T L +F ++LR L
Sbjct: 5 LPLTALDQWVQPKSKRQKDQALSLLSRLHIVDITALSREDPQTVALTKSFGTSLRLALTG 64
Query: 115 GGALPREPMPSG-ITARLPTLEDLEAYAIGQWECFLLQLISSTQA---ERPTNFSSSMMK 170
GG +PS A ++ L+ +A QWE L ++++ A + + + S+ K
Sbjct: 65 GGNHQSFGVPSSDHIAPHVDIDFLDVHARMQWEGILHYMVNNVAAGSGQEGSGPAGSVKK 124
Query: 171 VFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLE 230
+ G L R + +T++GF FLL + NAQ+W ++ +I N++ G++ D++SFL
Sbjct: 125 LLDAGKLVTRGRSV-GITQAGFSFLLQEANAQVWTLLLLWIENAESMGMDSVDVLSFLFM 183
Query: 231 LSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM------ 284
L G AY+ TL++ Q +M+ + D GL+ L +F PT+LAT L+
Sbjct: 184 LGSLELGRAYSTTTLTDAQHNMLGNLIDFGLIYLPPSAPTQFF-PTRLATTLTSDASALR 242
Query: 285 -------SLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAIT 337
+ + S+A ++GFI++ETN+R+YAY+ S L +L LF+K+ + PN++ G ++
Sbjct: 243 TVSAGFDAASKSAANQKGFIIIETNYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVS 302
Query: 338 KESLYNAFENGITAEQIISFLQQNAHPRV------ADRMPSVPENVCDQIRLWESDLNRV 391
++S+ A +GIT++QII++L +AHP++ A P +P V DQIRLW+ + R+
Sbjct: 303 RDSIRTAISHGITSDQIITYLSTHAHPQLVKASTAAHGGPVLPPTVVDQIRLWQLENERM 362
Query: 392 EMTPAHYYDEF 402
+ P + EF
Sbjct: 363 KAVPGFLFKEF 373
>gi|320581720|gb|EFW95939.1| TFIIH basal transcription factor complex subunit 2 [Ogataea
parapolymorpha DL-1]
Length = 544
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 132/465 (28%), Positives = 221/465 (47%), Gaps = 86/465 (18%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRV------- 77
+LY +P C A+ R LP LAK Y+M M + D AIP+ + W+ F R
Sbjct: 32 QLYRSPETCLAVFRLLPSLAKFYIMTMLFQDAAIPSTDLNRWIKSSSFNGSRNLSKIYQN 91
Query: 78 -AIDRLVQL-------RLFSEEKKKETT----YRLNSTFQSNLRKHLIYGGALPREPMPS 125
++ RL L R+F+ + +TT LN TF+ + R L G E +
Sbjct: 92 DSLKRLKALNLLKETRRIFTHPQTGQTTQLPFVSLNPTFRQSFRNALT-GSRDANEVVED 150
Query: 126 GITARLPTL--EDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVF----------Q 173
+ T+ E L+ Y + +WE L ++ S E P+ ++++ Q
Sbjct: 151 NNLESIETISVEFLDTYCLHRWESILHFMVGSETKELPSVGVLTLLRYSGLMELPSDREQ 210
Query: 174 RGLLSRRDKEAPRL--------------TESGFQFLLMDTNAQLWYIVREYISNSQERGI 219
R L + P+L T++GFQFLL D N+Q+W ++ +Y+ S++ +
Sbjct: 211 REHLGDDYEFQPKLQQSSSIQTLKNLLITQNGFQFLLQDINSQIWTLLLQYLKMSEKLMM 270
Query: 220 NQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLA 279
N ++++F+ L G+ Y ++ LS+ QK M+ D D GL+ + F PT+LA
Sbjct: 271 NPVEVLNFIFMLGSLELGQGYPVDPLSDTQKIMLDDLIDYGLIYTPSKTNKKIFYPTRLA 330
Query: 280 TNLSMSLTD---------------SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKV 324
T L+ T+ ++ +G IVVETNF++Y Y++S L IL LF +
Sbjct: 331 TTLTSEHTNFKTSAAVIDQEITNSKNSNNQGTIVVETNFKIYCYTSSPLQIAILNLFVHL 390
Query: 325 EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHP----RVADRMPS-------- 372
+ + N++ G IT+ES+ A NGITA+Q+I++L +AHP + D++
Sbjct: 391 KARFANMVTGVITRESVRRALINGITADQMINYLGSHAHPWMVKQAEDKLARKLELESSI 450
Query: 373 -------------VPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+P V DQI+LW+ +L+R++ + Y +F +
Sbjct: 451 GNAAGQRQIDLEILPPTVVDQIKLWQLELDRIQSFKGYLYKDFST 495
>gi|366991461|ref|XP_003675496.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
gi|342301361|emb|CCC69129.1| hypothetical protein NCAS_0C01390 [Naumovozyma castellii CBS 4309]
Length = 516
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 130/453 (28%), Positives = 218/453 (48%), Gaps = 81/453 (17%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLYE+P C AI R LP LAK +VM M + D + + ++ WV +G + + AI +
Sbjct: 24 RLYESPATCLAIYRLLPQLAKFFVMSMVFNDKEVSLRDLDRWVKSNGKMQFQDAIKSMKS 83
Query: 85 LRLF--SEEKKKETTYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLED 136
L L S+ LN TF+ + + L G G + + +G+ + L+
Sbjct: 84 LHLIIPSKSNGGPLMINLNPTFRESFKNALTGGQVDNSFGIVADDDDDNGVVS----LQL 139
Query: 137 LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRG-LLSRRDKEAP-RLTESGFQF 194
L+ Y+ +WE L ++ + + P S+S++ + + L+ D + ++T GFQF
Sbjct: 140 LDEYSANKWETILHFMVGTPMSSIP---SASVLNLLKHTRLMEEVDHTSEFKITNEGFQF 196
Query: 195 LLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIK 254
LL + N+Q+W ++ +Y+ S+ ++ D+++F+ L G+ Y+++ LSE QK M+K
Sbjct: 197 LLQELNSQIWTLLLQYLKMSETLKMDSVDVLNFIFMLGALEVGKGYSIDGLSETQKIMLK 256
Query: 255 DFADLGLVKLQQGRKENWFIPTKLATNLSMSLT-------------------DSSARK-- 293
D D GLV Q+ N F PT LA L+ DSS +
Sbjct: 257 DMRDYGLV-FQKYTNSNLFYPTNLALMLTSDTKSIVRTASGALESVLQNKREDSSKQANG 315
Query: 294 ------------------------EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLP 329
+G +++ETNF++Y+YS S L L LF ++ +
Sbjct: 316 EINKDLITNGDELDQVGYNPQDIPDGSLIIETNFKLYSYSNSPLQIATLSLFVHLKSRFA 375
Query: 330 NLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRMP 371
N++ G IT+ES+ A NGITA+QII++L+ +AHP++ D +
Sbjct: 376 NMVTGQITRESIRRALINGITADQIIAYLETHAHPQMRRLAEERLEKKLELDPNSKDPLQ 435
Query: 372 SVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+P V DQI+LW+ +L+RV Y +F +
Sbjct: 436 VLPPTVVDQIKLWQLELDRVITYEGSLYSDFET 468
>gi|156839389|ref|XP_001643386.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
70294]
gi|156113994|gb|EDO15528.1| hypothetical protein Kpol_479p16 [Vanderwaltozyma polyspora DSM
70294]
Length = 504
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 131/437 (29%), Positives = 213/437 (48%), Gaps = 59/437 (13%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
+LYE+P C AI R LP LAK ++M M + D I + ++ WV + + +I +
Sbjct: 24 KLYESPATCLAIYRLLPQLAKFFIMSMVFNDNEISLRDLDRWVKASAKVQFQESIKCMKS 83
Query: 85 LRLFSEEKKKETT-YRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAIG 143
L L K+ + LN TF+ +LR L G + L T L++Y+
Sbjct: 84 LNLLIPGKEPGSIMVNLNPTFRKSLRNALSGGEINNSFGIVVDSDNDLVTRAMLDSYSAD 143
Query: 144 QWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDTNAQ 202
+WE L ++ + P++ +++K L+ + ++T GFQFLL + N Q
Sbjct: 144 KWETILHFMVGTPLVSIPSDNVLNLLK--HSKLMEEVESTGEFKITNEGFQFLLQEVNLQ 201
Query: 203 LWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLV 262
+W ++ +Y+ ++ +N D++ F+ L G+AY+ LS QK+M+KD D GL+
Sbjct: 202 IWTLLLQYLKMAETFRMNPVDVLHFIFMLGALEFGKAYSTEELSATQKTMMKDMRDYGLI 261
Query: 263 KLQQGRKENWFIPTKLATNLS------------------------MSLTDSSAR------ 292
Q+ N F PT+LAT L+ T SA
Sbjct: 262 -FQKNSNTNVFYPTRLATMLTSDSRTVRNASSAMDSVLTQSKDEPSGATSGSADVDEQVG 320
Query: 293 ------KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFE 346
++G +VVETNF++Y+YS S L IL LF ++ + N++ G IT+ES+ NA
Sbjct: 321 TQGQNIQDGALVVETNFKLYSYSNSPLQIAILSLFVHLKTRFSNMVTGQITRESIRNALS 380
Query: 347 NGITAEQIISFLQQNAHP---RVADR---------------MPSVPENVCDQIRLWESDL 388
NGITAEQII++L+ +AHP R+A+ + +P V DQI+LW+ +L
Sbjct: 381 NGITAEQIIAYLETHAHPQMRRLAEERLKKKLELDQNSKETLQILPPTVVDQIKLWQLEL 440
Query: 389 NRVEMTPAHYYDEFPSR 405
+R+ Y +F ++
Sbjct: 441 DRIISYEGSLYSDFDNK 457
>gi|16198089|gb|AAL13841.1| LD30622p [Drosophila melanogaster]
Length = 350
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 103/290 (35%), Positives = 168/290 (57%), Gaps = 27/290 (9%)
Query: 137 LEAYAIGQWECFLLQLISS-----TQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESG 191
L+ YA+ +W C L ++ + T AE + ++ + L+ R +++ +T G
Sbjct: 17 LDTYAMSRWRCVLHYMVGTGNRNGTDAEAIS--PDAVRILLHANLMKRDERDGITITRQG 74
Query: 192 FQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKS 251
FQFLL+DT AQ+W+ + +Y+ +ERGI+ + +S L +LSF G Y+ ++ +
Sbjct: 75 FQFLLLDTRAQVWHFMLQYLDTCEERGISLPECLSMLFQLSFSTLGRDYSSEGMNSQMLT 134
Query: 252 MIKDFADLGLVKLQQGRKENWFIPTKLATNL---------SMSLTDSSARKEGFIVVETN 302
++ + GLV Q+ RKE F PT+LA N+ S+++ + + + G+IVVETN
Sbjct: 135 FLQHLREFGLV-FQRKRKEGRFYPTRLALNVTSKEAAATASVAMDEEATQDCGYIVVETN 193
Query: 303 FRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 362
+R+YAY+ S L +L LF+++ Y+ PNL+VG +T++S+ A GITAEQI+S+L+Q A
Sbjct: 194 YRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTRDSVRQALRGGITAEQIVSYLEQYA 253
Query: 363 HPRVADRM--------PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
HP + RM +P V DQI+LWE + NR T Y++F S
Sbjct: 254 HPNM--RMVESAIHSKSCLPPTVVDQIKLWELERNRFTYTEGVLYNQFLS 301
>gi|440791255|gb|ELR12502.1| TFIIH basal transcription factor complex subunit 2, putative
[Acanthamoeba castellanii str. Neff]
Length = 304
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 159/243 (65%), Gaps = 6/243 (2%)
Query: 165 SSSMMKVFQRG--LLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQA 222
S ++ + RG ++ + ++ R+ + GF FLL D Q+W ++ Y+ + QE N
Sbjct: 10 SDRVVSLLTRGEFMVVHEEDQSIRIADKGFPFLLKDLRTQVWILLLLYLRSLQEEKANVH 69
Query: 223 DLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL 282
D++SFL LSF GE Y ++ L+ + +++D DLG++ ++ R W PT+LA L
Sbjct: 70 DVLSFLFRLSFLTVGEGYQMDDLAFSESGLLQDLQDLGII-YRKHRDSKWLYPTQLAIGL 128
Query: 283 SMSLTDSSAR-KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESL 341
S T+++ R +EG+I+V T++R+YAY++S + +L LF+++EYQLPN+++G + +E++
Sbjct: 129 SS--TEAAKRDQEGWIIVGTDYRIYAYTSSPVKLLLLSLFTQIEYQLPNMVMGILLRENI 186
Query: 342 YNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDE 401
A + GI+A QI+ FL+ NAHP++ P +PE++ DQ+RLWE++ R+ ++P ++YD+
Sbjct: 187 RQAVQVGISANQILQFLETNAHPQMKQNTPIIPESIADQLRLWEAEDRRLSLSPGYFYDD 246
Query: 402 FPS 404
F S
Sbjct: 247 FAS 249
>gi|46127843|ref|XP_388475.1| hypothetical protein FG08299.1 [Gibberella zeae PH-1]
gi|408390907|gb|EKJ70292.1| hypothetical protein FPSE_09509 [Fusarium pseudograminearum CS3096]
Length = 494
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 132/429 (30%), Positives = 223/429 (51%), Gaps = 56/429 (13%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAID--- 80
+LY+ P AI R LP LAK +VM++ + I +++WV P + A+
Sbjct: 23 KLYQQPSTALAIFRRMLPQLAKVFVMRILFNPKPILLSDLDDWVKPSHKRQKDQALSVMR 82
Query: 81 --RLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP---TLE 135
+VQ+ S+E+ +E +L + F+ +LR L GG+ +PS T ++P ++
Sbjct: 83 VLHIVQISTPSKERPQE--MQLTTNFKVSLRLALSGGGSHNSFGVPS--TLQIPPEIDID 138
Query: 136 DLEAYAIGQWECFLLQLISST---QAERPTNFSSSMMKVFQRG-LLSRRDKEAPRLTESG 191
L+ YA +WE L ++SS A + + S+ ++ G L+ RR A +T++G
Sbjct: 139 FLDRYARKKWEDILHFVVSSVGYKSAGESSGPNKSVKELLVAGRLVDRRPSGAIGITQAG 198
Query: 192 FQFLLMDTNAQLWYIVR---EYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEI 248
F FLL + NAQ+W ++ E + ++ G+ D++SFL L+ G AY+ N L+E
Sbjct: 199 FTFLLQEANAQVWTLLLLWLEAMDVNKMAGLEATDMLSFLFVLASMELGRAYDTNALTEQ 258
Query: 249 QKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM------------------------ 284
+K+M+ D GL+ + K + F PT+LAT L+
Sbjct: 259 RKNMLPSLVDFGLIYIPN-HKRSMFFPTRLATTLTSSSNSLRSISDGVAAATAAALQPGQ 317
Query: 285 ------SLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITK 338
S T SS + G +++ETN+R+YAY+ S L +L LF+K++ + P+++ G I++
Sbjct: 318 SRAPGSSATGSSNEQRGSVIIETNYRIYAYTQSTLQIAVLALFTKLQMRFPDMVAGRISR 377
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRVA-----DRMPSVPENVCDQIRLWESDLNRVEM 393
+S+ A GITAEQIIS+L +AH ++ + P +P V DQIRLW+ + R++
Sbjct: 378 QSIRQAINFGITAEQIISYLSAHAHDQMRRTASLNNKPVLPPTVVDQIRLWQLENERMKT 437
Query: 394 TPAHYYDEF 402
T + +F
Sbjct: 438 TGGFLFKDF 446
>gi|365986883|ref|XP_003670273.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
gi|343769043|emb|CCD25030.1| hypothetical protein NDAI_0E02130 [Naumovozyma dairenensis CBS 421]
Length = 522
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 133/455 (29%), Positives = 211/455 (46%), Gaps = 86/455 (18%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLYE+P C AI R LP L+K +VM M + + + + ++ WV PD + + AI +
Sbjct: 24 RLYESPATCLAIYRLLPQLSKFFVMSMVFNNNEVSLRDLDRWVKPDAKLQFQDAIKSMKS 83
Query: 85 LRLF--SEEKKKETTYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLED 136
L L S+ LN TF+ + + L G G + E +T+ L
Sbjct: 84 LHLIIPSKGNGGPLMIILNPTFRESFKNALTGGQINNSFGIISDEDENGVVTSTL----- 138
Query: 137 LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP--RLTESGFQF 194
L Y+ +WE L ++ + + P S S++ + + L + ++T GFQF
Sbjct: 139 LNEYSANKWETILHFMVGTPMSSIP---SGSVLNLLKHTKLMEEVENTGEFKITNEGFQF 195
Query: 195 LLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIK 254
LL + N+QLW ++ +Y+ S+ ++ D+++F+ L G+ Y+++ LSE QK M+K
Sbjct: 196 LLQELNSQLWTLLLQYLKLSETLNMDPVDVLNFIFMLGALEVGKGYSIDGLSETQKIMLK 255
Query: 255 DFADLGLVKLQQGRKENWFIPTKLATNLS------------------------------- 283
D D GLV Q+ F PT LA L+
Sbjct: 256 DMRDYGLV-FQKVSNSKTFYPTNLALMLTSDTKSIVRTASGAIESVLNENRSGSNANENG 314
Query: 284 -------------MSLTDSSARK-----EGFIVVETNFRMYAYSTSKLHCEILRLFSKVE 325
M D K +G ++VETNF++Y+YS S L IL LF ++
Sbjct: 315 YEPGTKKKNENTIMGTIDQVGMKNQDVPDGSLIVETNFKLYSYSNSPLQIAILSLFVHLK 374
Query: 326 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV------------------A 367
++ N++ G IT+ES+ A NGITAEQII++L+ +AHP++
Sbjct: 375 FRFVNMVTGQITRESIRRALINGITAEQIIAYLETHAHPQMRRLAEEKLEKKLELDANCK 434
Query: 368 DRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
D + +P V DQIRLW+ +L+RV Y +F
Sbjct: 435 DSLQILPPTVVDQIRLWQLELDRVIAYEGSLYSDF 469
>gi|452977455|gb|EME77221.1| hypothetical protein MYCFIDRAFT_146475 [Pseudocercospora fijiensis
CIRAD86]
Length = 476
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 132/432 (30%), Positives = 209/432 (48%), Gaps = 51/432 (11%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEW 66
A+ ++ + L +LY+ P AI R LP +AK VM M Y+ ++ W
Sbjct: 5 AQRALEYLEQLPGTTFTKLYQQPSTALAIFRRMLPHMAKTLVMAMLYMPTPFSVADLDTW 64
Query: 67 VLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSG 126
V P+ H A + K++ RLN F +LR+ L GG +P
Sbjct: 65 VKPE----HDSA-----------QAKERRPALRLNPAFSKSLRQALTGGGNHRSFGVPCT 109
Query: 127 ITARLP-TLEDLEAYAIGQWECFLLQLISSTQA--ERPTNFSSSMMKVFQRGLLSRRDKE 183
+ T+E L+ +A QWE L ++ S + + +S K+ + G +
Sbjct: 110 TPDKSSVTIEFLDTFARKQWETILYYVVGSANSALSGGMDVHNSTKKLLENGEFVQMQNG 169
Query: 184 APR--LTESGFQFLLMDTNAQLWYIVREYISNSQ-ERGINQADLISFLLELSFHVAGEAY 240
+ +T GF FLL D NAQ+W ++ Y+ S+ E ++ D++SFL L G +Y
Sbjct: 170 GRQRFITTDGFTFLLQDVNAQIWSLLIVYLQTSETECFMDPVDVLSFLFTLGSLELGTSY 229
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTD------------ 288
+ + L++ Q M++D +DLGL+ + + + PT+LAT L+
Sbjct: 230 STSNLTKTQLQMLEDLSDLGLIF--HPEQSDRYYPTRLATTLTSDAPALLNSSHTSTTTT 287
Query: 289 ----------SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITK 338
+SA ++GFI++ETN+R+YAY+ S L IL LF+ + + PNL+ ITK
Sbjct: 288 VSASSNDDLAASANEKGFIILETNYRLYAYTNSPLLISILSLFASLHTRYPNLVTAKITK 347
Query: 339 ESLYNAFENGITAEQIISFLQQNAHP---RVAD--RMPSVPENVCDQIRLWESDLNRVEM 393
S+ A +GIT+ QIIS+LQ +AHP R A P +P V DQIRLW+ + R+
Sbjct: 348 TSVQAAISSGITSNQIISYLQTHAHPILRRTASMHNAPILPPTVVDQIRLWQIEGERMTS 407
Query: 394 TPAHYYDEFPSR 405
T + E S+
Sbjct: 408 TKGYLIREVGSK 419
>gi|342884444|gb|EGU84659.1| hypothetical protein FOXB_04847 [Fusarium oxysporum Fo5176]
Length = 496
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 129/427 (30%), Positives = 218/427 (51%), Gaps = 52/427 (12%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P AI R LP LAK +VM++ Y I +++WV P + A+ L
Sbjct: 25 KLYQQPSTAFAIFRRMLPHLAKTFVMRILYSPKPILLSDLDDWVKPSHKRQKDQALSILR 84
Query: 84 QLRLFS---EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED---L 137
L + S K++ +L + F+ +LR L GG +PS T ++P D L
Sbjct: 85 VLHIVSITAPSKERPQEMQLTTNFKLSLRLALSGGGTHNSFGVPS--TLQIPPEIDIAFL 142
Query: 138 EAYAIGQWECFLLQLISST---QAERPTNFSSSMMKVFQRG-LLSRRDKEAPRLTESGFQ 193
+ YA +WE L ++SS A + + S+ ++ G L+ R+ + +T++GF
Sbjct: 143 DRYARKKWEDILHFVVSSVGYKSAGESSGPNKSVKELLVAGRLVDRKASGSVGITQAGFT 202
Query: 194 FLLMDTNAQLWYIVR---EYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQK 250
FLL + NAQ+W ++ E + ++ G+ D++SFL L+ G AY+ N L+E +K
Sbjct: 203 FLLQEANAQVWTLLLLWLEAMEVNKMAGLEATDMLSFLFVLASMELGRAYDTNALTEQRK 262
Query: 251 SMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM-------------------------- 284
+M+ D GL+ + K + F PT+LAT L+
Sbjct: 263 NMLPSLVDFGLIYIPN-HKRSMFFPTRLATTLTSSSNSLRSISDGVAAATAAALQPGQSG 321
Query: 285 ----SLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKES 340
S+T ++ + G +++ETN+R+YAY+ S L +L LF+K+ + P+++ G I+++S
Sbjct: 322 APGGSVTAANTEQRGSVIIETNYRIYAYTQSTLQIAVLALFTKLAMRFPDMVAGRISRQS 381
Query: 341 LYNAFENGITAEQIISFLQQNAHPRVA-----DRMPSVPENVCDQIRLWESDLNRVEMTP 395
+ A + GITAEQIIS+L +AH ++ + P +P V DQIRLW+ + R++ T
Sbjct: 382 IRQAIQFGITAEQIISYLSAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMKTTS 441
Query: 396 AHYYDEF 402
+ F
Sbjct: 442 GFLFKSF 448
>gi|444727354|gb|ELW67853.1| Valyl-tRNA synthetase, mitochondrial [Tupaia chinensis]
Length = 1382
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 125/407 (30%), Positives = 203/407 (49%), Gaps = 49/407 (12%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L++ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSSGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGGALPRE 121
WV + + L LR++ + LN F+ NLR L+
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLE------- 125
Query: 122 PMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRD 181
I A+ L+ Q F+LQ + + Q+ R + + +FQ
Sbjct: 126 -----IVAQFLLLD-----TPAQLWYFMLQYLQTAQS-RGMDLVEILSFLFQLSF----S 170
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYN 241
R E G +MD +++SFL +LSF G+ Y+
Sbjct: 171 TLGKRRGERGRVHTVMDL----------------------VEILSFLFQLSFSTLGKDYS 208
Query: 242 LNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--MSLTDSSARKEGFIVV 299
+ +S+ + ++ + GLV Q+ RK + PT+LA NLS +S +A + GFIVV
Sbjct: 209 VEGMSDSLLNFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTAHQPGFIVV 267
Query: 300 ETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQ 359
ETN+R+YAY+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+
Sbjct: 268 ETNYRLYAYTESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLR 327
Query: 360 QNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
AHP + + P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 328 TRAHPVMLKQTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 374
>gi|365758016|gb|EHM99881.1| Tfb2p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 419
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 119/391 (30%), Positives = 203/391 (51%), Gaps = 53/391 (13%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+RLY +P C AI R LPPLAK ++M M + + +P +++WV +G + + AI +
Sbjct: 23 NRLYTSPATCLAIYRILPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQDAIKSMK 82
Query: 84 QLRLFSEEKKKET-TYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAI 142
L L +K T LN TF+ +LR L GG + + + + +L+ L+ Y+
Sbjct: 83 SLHLLIPDKSSGTLMINLNPTFKISLRNALT-GGEV-QNSFGVVVEDNVISLDLLDEYSA 140
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDTNA 201
+WE L ++ + A+ P+ +++K L+ + ++T GFQFLL + N+
Sbjct: 141 NKWETILHFMVGTPLAKIPSEKVLNLLK--HSKLMEEVNSTGEFKITNEGFQFLLQEINS 198
Query: 202 QLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGL 261
QLW ++ +Y+ + ++ D++ F+ L G+AY ++ LSE Q+ M++D D GL
Sbjct: 199 QLWTLLLQYLKMIETSKMDLVDVLHFIFMLGALEVGKAYKIDALSETQRIMLQDMRDYGL 258
Query: 262 VKLQQGRKENWFIPTKLATNLS---------------------------------MSLTD 288
V Q+ ++ F PTKLA L+ S TD
Sbjct: 259 V-FQKHLNDSIFYPTKLALMLTSDTKTIISASNAMDSVLRQNREEPSVNEDGANGKSTTD 317
Query: 289 SSARK-------------EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
++A +G ++VETNF++Y+YS S L +L LF ++ + N+++G
Sbjct: 318 TTASDDLNKAGSKNQDIPDGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQ 377
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHPRV 366
IT+ES+ NA NGITA+QII++L+ +AHP++
Sbjct: 378 ITRESIRNALTNGITADQIIAYLETHAHPQM 408
>gi|254584224|ref|XP_002497680.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
gi|238940573|emb|CAR28747.1| ZYRO0F11088p [Zygosaccharomyces rouxii]
Length = 509
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 129/442 (29%), Positives = 214/442 (48%), Gaps = 71/442 (16%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLYE+P C AI R LP LAK ++M M + + AI + ++ WV G + + AI +
Sbjct: 25 RLYESPATCLAIYRLLPQLAKFFIMSMVFNESAISLRDLDRWVKSSGKLQFQDAIKSMKS 84
Query: 85 LRLFSEEKKKE-TTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP----TLEDLEA 139
L L + LN TF+ + K+ + GG + GI L T + L+
Sbjct: 85 LHLLIPVRSNGPMLIDLNHTFRESF-KNALTGGEVSNS---FGIVMELDNDTVTTDMLDK 140
Query: 140 YAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMD 198
Y+ +WE L ++ ++ A P+ +++K L+ + ++T GFQFLL +
Sbjct: 141 YSADKWETILHFMVGTSLAHIPSENVLNLLK--HSKLMEESNTTGEFKITNEGFQFLLQE 198
Query: 199 TNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFAD 258
N+QLW ++ +Y+ ++ ++ D+++F+ L + Y+ N LSE QK M+KD D
Sbjct: 199 VNSQLWALLLQYLKMTEMLQMDPVDILNFIFMLGALEFSKPYSTNGLSETQKVMLKDMRD 258
Query: 259 LGLVKLQQGRKENWFIPTKLATNLS---MSLTDSSAR----------------------- 292
GLV Q N F PT+LA+ L+ S+ +S
Sbjct: 259 YGLV-FQMNSNANVFYPTRLASMLTSDPKSIRGASGAMDSVLKQNKEDVLNKSGGAGANA 317
Query: 293 --------------KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITK 338
++G ++VETNF++Y+YS S L IL LF ++ + N++ G +T+
Sbjct: 318 DDDADEIGSEGQNMQDGALIVETNFKLYSYSNSPLQIAILSLFVHMKSRFSNMVTGQLTR 377
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRMPSVPENVCDQ 380
ES+ A NGITA+QII++L+ +AHP++ D + +P V DQ
Sbjct: 378 ESIRQALLNGITADQIIAYLETHAHPQMRRMAEEQLEKRLELDPNSKDPLQILPPTVVDQ 437
Query: 381 IRLWESDLNRVEMTPAHYYDEF 402
I+LW+ +L+R+ Y +F
Sbjct: 438 IKLWQLELDRIISFEGSLYSDF 459
>gi|453082965|gb|EMF11011.1| RNA polymerase II transcription factor B subunit 2 [Mycosphaerella
populorum SO2202]
Length = 507
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 132/428 (30%), Positives = 216/428 (50%), Gaps = 40/428 (9%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEW 66
A+ ++ + L +LY+ P AI R LP LAK VM + Y+ A ++ W
Sbjct: 5 AQRALEYLEQLPGTTFTKLYQQPSTALAIFRRMLPHLAKTMVMALLYLPQPFAAADLDAW 64
Query: 67 VLPDGFT---KHRVAIDRLVQLRL-FSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREP 122
V PD + K R A+ L +L++ + E + YRL+ F +LR L G
Sbjct: 65 VKPDHDSTQAKER-ALSILQRLKIVYDEPYQGRPAYRLSPAFGKSLRSALTGSGTHNSFG 123
Query: 123 MPSGITARLP-TLEDLEAYAIGQWECFLLQLISSTQAERP--TNFSSSMMKVFQRG--LL 177
+P + P T++ L+ +A QWE L ++ S A + SS + + G ++
Sbjct: 124 VPCTEPDKHPVTVDYLDTFARTQWEAILYYVVGSASAGLGGGVDISSGTRTLLETGKFVV 183
Query: 178 SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQER-GINQADLISFLLELSFHVA 236
+ +T GF FLL D NAQ+W ++ Y+ S + ++ D++SFL L
Sbjct: 184 VKGGGRHRAITTDGFTFLLQDVNAQVWSLLIVYLEVSGDLLQMDSVDVLSFLFTLGSLEL 243
Query: 237 GEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM---SLTDSS--- 290
G +Y+ + L+ Q M+ D + GLV + + ++ PT+LAT L+ +L ++S
Sbjct: 244 GVSYSTSNLTPTQLQMLDDLTNFGLVYRRSPSSDRYY-PTRLATTLTSDAPALPNNSFTT 302
Query: 291 ----------------ARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG 334
A ++G+I++ETN+R+YAY++S L IL LF+ + + PNLI
Sbjct: 303 TTTTNQNADPNDPAATASEKGYIILETNYRLYAYTSSPLPISILSLFASLNTRYPNLITA 362
Query: 335 AITKESLYNAFENGITAEQIISFLQQNAHP-----RVADRMPSVPENVCDQIRLWESDLN 389
ITK S++ A +GIT++QIIS+LQ +AHP + P +P V DQIRLW+ +
Sbjct: 363 KITKTSIHTAIASGITSDQIISYLQTHAHPILRRQAALNSAPILPPTVVDQIRLWQIEGE 422
Query: 390 RVEMTPAH 397
R++ T +
Sbjct: 423 RMKSTKGY 430
>gi|354546081|emb|CCE42810.1| hypothetical protein CPAR2_204530 [Candida parapsilosis]
Length = 451
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 125/415 (30%), Positives = 206/415 (49%), Gaps = 70/415 (16%)
Query: 51 MFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRK 110
M + + +P + + +W P A+ RL L L E K T RLN+ F+SN R
Sbjct: 1 MIFNEKPMPMRDLTKWCKPSAKRMEFDALKRLESLHLI-EYDSKGTNVRLNTIFRSNFRD 59
Query: 111 HLIYGGALPREPMPSGITARLPTLED--LEAYAIGQWECFLLQLISSTQAERPTNFSSSM 168
L G P +E L+ +A +WE L ++ + P S S+
Sbjct: 60 CLT-GSQDPNAFGSISTDDDADKIETSFLDKFASHEWENILHFMVGTEGTPTP---SRSV 115
Query: 169 MKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFL 228
+ + + G L + E +T +GFQFLL D NAQ+W ++ +Y++ + E +N D+++F+
Sbjct: 116 LSLLKLGGLMEGESELT-ITNTGFQFLLQDVNAQIWTLLLQYLNLTSESNMNPVDVLNFI 174
Query: 229 LELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTD 288
L G++Y++ +LSE Q SM+ D DLGL+ Q+ + + F PT+LAT L+ +D
Sbjct: 175 FILGCLELGKSYSVLSLSETQVSMLADLKDLGLI-YQKSKTSSRFYPTRLATTLT---SD 230
Query: 289 SSARK-------------EG---------FIVVETNFRMYAYSTSKLHCEILRLFSKVEY 326
SSA K EG I++ETNF++YAY+ S L IL LF +++
Sbjct: 231 SSALKTPSMAVEQALEESEGQMMTSTSRESIIIETNFKIYAYTNSPLEIAILNLFVQMKT 290
Query: 327 QLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA------------------- 367
+ N++ G IT+ES+ NA NGIT++QII FL+ +AHP++
Sbjct: 291 RFSNMVCGQITRESIRNALYNGITSDQIIKFLETHAHPQMKALAKEKLDKKVEFDASHNI 350
Query: 368 -----------------DRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
++ +P NV DQI+LW+ +L+R++ + + +F ++
Sbjct: 351 NTAGGAPQSKTDGTISQHKLEVIPPNVVDQIKLWQLELDRIQTVEGYLFKDFANQ 405
>gi|313224499|emb|CBY20289.1| unnamed protein product [Oikopleura dioica]
Length = 411
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 110/373 (29%), Positives = 213/373 (57%), Gaps = 19/373 (5%)
Query: 35 AILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKK 94
++ R LP +AK ++++ +I+ I ++E WV + K VA++ + LR++ E
Sbjct: 2 SVFRELPQIAKHIIIRILFINQQIAKPLIESWVADEHREKFEVAMEIITGLRIW-ENTND 60
Query: 95 ETTYRLNSTFQSNLRKHLIYGGALPREPMPS-GITARLPTLEDLEAYAIGQWECFLLQLI 153
+ LN++F++ L++ L GG R + + G T+E L+ Y +W+ +L +
Sbjct: 61 GIAFNLNNSFRTYLQEALFGGGETWRPAVETLGADKNAKTVEQLDTYTKERWDQ-ILSFL 119
Query: 154 SSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN 213
+ Q + S ++ + + L + E FQFLL+D ++Q+WY++ +Y+
Sbjct: 120 TQEQGK----LSEEVISLLKYAGLCDANGEKR------FQFLLLDRSSQVWYLLVQYLGY 169
Query: 214 SQERGINQADLISFLLELSFHVAGEAYNL-NTLSEIQKSMIKDFADLGLVKLQQGRKENW 272
Q+ G++ ++++F+L+L + G Y N+ +EI + +I+ F ++GL+ ++ KE
Sbjct: 170 VQKLGLSLVNVLAFVLQLGYCSFGTDYPCDNSNNEISR-VIQHFREMGLI-FKRKSKEQR 227
Query: 273 FIPTKLATNLSMS---LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLP 329
F PT+LA ++S++ S +E FI+VETN+R+YAY+ S+LH ++ LF++V+Y+ P
Sbjct: 228 FYPTRLAQSISIAGGKKASSEDVQEQFILVETNYRIYAYTDSELHYALISLFAEVQYRFP 287
Query: 330 NLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLN 389
+IV ++++S+ + + GI+AEQI+++L+ +AHP VP+ V D I LW +
Sbjct: 288 YMIVAQMSRDSIQQSADYGISAEQILNYLRSSAHPIARKNKHWVPQVVEDNIHLWCKERE 347
Query: 390 RVEMTPAHYYDEF 402
R++ Y +F
Sbjct: 348 RLKFNDGLLYHQF 360
>gi|367009216|ref|XP_003679109.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
gi|359746766|emb|CCE89898.1| hypothetical protein TDEL_0A05660 [Torulaspora delbrueckii]
Length = 513
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 126/447 (28%), Positives = 211/447 (47%), Gaps = 75/447 (16%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLY++P C AI R LP +AK ++M M + + I + ++ WV G + + AI +
Sbjct: 25 RLYQSPATCLAIYRLLPQMAKFFIMSMVFNENEISLRDLDRWVKASGKNQFQDAIKSMKS 84
Query: 85 LRLFSEEKKKE-TTYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLEDL 137
L L + LN TF+ + R L G G + EP + E L
Sbjct: 85 LHLLIPVRTNGPMMINLNQTFRESFRNALTGGEVNNSFGIVVDEP------NDVVNTEML 138
Query: 138 EAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLL 196
+AY+ +WE L ++ ++ A P+ +++K L+ ++T GFQFLL
Sbjct: 139 DAYSADKWETILHFMVGTSLASTPSGNVLNLLK--HSKLMEEVSTTGEFKITNEGFQFLL 196
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
+ N+Q+W ++ +Y+ ++ ++ ++++F+ L G+ Y+++ LS+ QK M KD
Sbjct: 197 QEVNSQIWALLLQYLKMTETLQMDPVEVLNFIFMLGALEFGKPYSMDGLSDTQKLMSKDM 256
Query: 257 ADLGLVKLQQGRKENWFIPTKLATNLSMSL-----------------TDSSARK------ 293
D GLV Q+ F PT+LAT L+ D ++ K
Sbjct: 257 RDYGLV-FQKNSNSKVFYPTRLATMLTSDAKSIRGASGAMDSVLKQNKDEASNKAASSNA 315
Query: 294 -----------------EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAI 336
+G ++VETNF++Y+YS S L IL LF ++ + N++ G I
Sbjct: 316 DADSDDDEVGINGQPIQDGALIVETNFKLYSYSNSPLQIAILSLFVHLKSRFTNMVTGQI 375
Query: 337 TKESLYNAFENGITAEQIISFLQQNAHPRV------------------ADRMPSVPENVC 378
T+ES+ A NGITA+QII++LQ +AHP++ D + +P V
Sbjct: 376 TRESIRRALINGITADQIIAYLQTHAHPQMRRLAEEKLEKKLELDANCKDTLQILPPTVV 435
Query: 379 DQIRLWESDLNRVEMTPAHYYDEFPSR 405
DQI+LW+ +L+R+ Y + S
Sbjct: 436 DQIKLWQLELDRIISYDGSLYSDIDSH 462
>gi|154308354|ref|XP_001553513.1| hypothetical protein BC1G_08237 [Botryotinia fuckeliana B05.10]
Length = 410
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 107/350 (30%), Positives = 191/350 (54%), Gaps = 25/350 (7%)
Query: 79 IDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSG-ITARLPTLEDL 137
+ RL + + + ++ T L F ++LR L GG +PS A ++ L
Sbjct: 19 LSRLHIVDITALSREDPQTVALTKNFGASLRLALTGGGNHQSFGVPSSDHIAPHVDIDFL 78
Query: 138 EAYAIGQWEC---FLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQF 194
+ +A QWE +++ ++S + ++S+ + G L R + +T++GF F
Sbjct: 79 DTHARMQWEGILHYMVNTVTSGSGKDGNGPANSVKALLDAGKLVTRGR-GIGITQAGFSF 137
Query: 195 LLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIK 254
LL + NAQ+W ++ +I N++ G++ D++SFL L G AY+ TL+E QK M+
Sbjct: 138 LLQEANAQVWTLLLLWIENAESMGMDSVDVLSFLFMLGSLELGRAYSTKTLTEAQKGMLA 197
Query: 255 DFADLGLVKLQQGRKENWFIPTKLATNLSM-------------SLTDSSARKEGFIVVET 301
+ DLGL+ L +F PT+LAT L+ + + S+A ++GFI++ET
Sbjct: 198 NLIDLGLIYLPPSAPTQFF-PTRLATTLTSDASALRTVAAGFDAASKSAASQKGFIIIET 256
Query: 302 NFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQN 361
N+R+YAY+ S L +L LF+K+ + PN++ G ++++S+ A +GIT++QII++L +
Sbjct: 257 NYRLYAYTNSPLQIAVLSLFTKLNTRYPNMVSGRVSRDSIRTAIAHGITSDQIITYLSTH 316
Query: 362 AHPRV------ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
AHP++ + P +P V DQIRLW+ + R++ P +F ++
Sbjct: 317 AHPQLVKASSASHGGPVLPPTVVDQIRLWQLENERMKAVPGFLMKDFETQ 366
>gi|50291141|ref|XP_448003.1| hypothetical protein [Candida glabrata CBS 138]
gi|62901305|sp|Q6FP41.1|TFB2_CANGA RecName: Full=RNA polymerase II transcription factor B subunit 2;
AltName: Full=RNA polymerase II transcription factor B
52 kDa subunit; AltName: Full=RNA polymerase II
transcription factor B p52 subunit
gi|49527314|emb|CAG60954.1| unnamed protein product [Candida glabrata]
Length = 504
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 127/439 (28%), Positives = 215/439 (48%), Gaps = 67/439 (15%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLY++P C AI R LP LAK ++M + + D I + ++ WV +G + + AI +
Sbjct: 24 RLYKSPATCLAIYRLLPTLAKFFIMTVIFNDKDISLRDLDRWVKSNGKLQFQEAIKSMKS 83
Query: 85 LRLFSEEK-KKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITA---RLPTLEDL--- 137
L L + + LN TF+ + R L GG + GI RL T+ +L
Sbjct: 84 LHLLIPTRVNGQLLINLNPTFRESFRNALT-GGEVNNS---FGIVVDEDRLDTVVNLAVL 139
Query: 138 EAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR--DKEAPRLTESGFQFL 195
+ YA +WE L ++ + A+ P S +++ + + L D +T GFQFL
Sbjct: 140 DEYAATKWETILHFMVGTPMAKMP---SENVLNLLKHSKLMEEVPDSSEFMITNEGFQFL 196
Query: 196 LMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKD 255
L + N+Q+W ++ +Y+ ++ ++ +++F+ L G+AY+ LSE Q M+ D
Sbjct: 197 LQEVNSQIWSLLLQYLKLAESLHMDPVHVLNFIFMLGALETGKAYSTENLSETQLKMLLD 256
Query: 256 FADLGLVKLQQGRKENWFIPTKLATNLS----------------MSLTDSSAR------- 292
D GLV Q+ N F PT+LA L+ ++ D +A+
Sbjct: 257 MRDYGLV-FQKTSNPNIFYPTRLAQMLTSDTKSMRTASGAMESVLNKPDDAAKSTDDKYD 315
Query: 293 ---------KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYN 343
++G +++ETNF++Y+Y S L IL LF ++ + N++ G IT+ES+
Sbjct: 316 SLEGKAEDIQDGALIIETNFKLYSYCNSPLQIAILSLFVHLKSRFANMVAGQITRESIRR 375
Query: 344 AFENGITAEQIISFLQQNAHPRV------------------ADRMPSVPENVCDQIRLWE 385
A NGITA+Q+I++L+ +AHP++ D + +P V DQI+LW+
Sbjct: 376 ALINGITADQVIAYLESHAHPQMRRLAEEKLQKKLELDPNCKDPLQVLPPTVVDQIKLWQ 435
Query: 386 SDLNRVEMTPAHYYDEFPS 404
+L+RV Y +F +
Sbjct: 436 LELDRVLTYEGSLYIDFDT 454
>gi|302917488|ref|XP_003052448.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256733388|gb|EEU46735.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 496
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 131/430 (30%), Positives = 221/430 (51%), Gaps = 58/430 (13%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL- 82
+LY+ P AI R LP LAK +VM++ Y I +++WV P + A+ L
Sbjct: 25 KLYQQPSTAFAIFRRMLPHLAKTFVMRILYSPKPILLTDLDDWVKPSAKRQKDQALSILR 84
Query: 83 ----VQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED-- 136
VQ+ S+E+ +E +L + F+++LR L GG +PS T ++P D
Sbjct: 85 VLYIVQITAPSKERPQEM--QLTANFKTSLRLALSGGGTHNSFGVPS--TLQVPPEIDII 140
Query: 137 -LEAYAIGQWECFLLQLISSTQAERPTNFSS----SMMKVFQRG-LLSRRDKEAPRLTES 190
L+ YA +WE +L + S+ + T SS S+ ++ G L+ R+ +T++
Sbjct: 141 FLDRYARRKWED-ILHFVVSSVGYKSTGDSSGPNKSVKELLIAGRLVDRKPNGMVGITQA 199
Query: 191 GFQFLLMDTNAQLWYIVR---EYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSE 247
GF FLL + NAQ+W ++ E + ++ G+ D++SFL L+ G AY+ N L+E
Sbjct: 200 GFTFLLQEANAQVWTLLLLWLEAMDVNKGAGLEATDMLSFLFVLASMELGRAYDTNALTE 259
Query: 248 IQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS------MSLTDS------------ 289
+K+M+ D GL+ + K + F PT+LAT L+ S++D
Sbjct: 260 QRKNMLPSLVDFGLIYIPN-HKRSMFFPTRLATTLTSSGNSLRSISDGVAAATAAALQPG 318
Query: 290 ------------SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAIT 337
+ G +++ETN+R+YAY+ S L +L LFSK+ + P+++ G ++
Sbjct: 319 QSGGSGGSTTGSGQEQRGSVIIETNYRIYAYTQSTLQIAVLALFSKLSMRFPDMVAGRLS 378
Query: 338 KESLYNAFENGITAEQIISFLQQNAHPRVA-----DRMPSVPENVCDQIRLWESDLNRVE 392
++S+ A GITA+QIIS+L +AH ++ + P +P V DQIRLW+ + R++
Sbjct: 379 RQSIRQAINFGITADQIISYLAAHAHDQMHRTAALNNKPVLPPTVVDQIRLWQLENERMK 438
Query: 393 MTPAHYYDEF 402
T + +F
Sbjct: 439 TTSGFLFKDF 448
>gi|406604717|emb|CCH43852.1| RNA polymerase II transcription factor B subunit 2 [Wickerhamomyces
ciferrii]
Length = 488
Score = 178 bits (452), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 120/423 (28%), Positives = 209/423 (49%), Gaps = 50/423 (11%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLY++P C AI R LP LAK ++M M + + + + WV P+G A+ +
Sbjct: 24 RLYQSPATCLAIYRLLPGLAKFFIMSMVFNEKPQAVRDFDSWVKPNGKMDLVEALRSIKS 83
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPRE--PMPSGITARLPTLEDLEAYAI 142
L L E +N F+ + R +++ G + + ++E L+ YA
Sbjct: 84 LHLLKEANGIAI---MNPVFRKSFR-NVLTGNEIGNSFGVLCDTDDTHKVSVEFLDQYAA 139
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQ 202
+WE L ++ + + P+ S++ GL+ + + +T GFQFLL D NAQ
Sbjct: 140 NKWETILHFMVGTELEQSPSPGVLSLLT--HSGLMEGKSVKDMLITNEGFQFLLQDVNAQ 197
Query: 203 LWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLV 262
LW ++ +Y+ ++ ++ D+++F+ L G+ Y+L+ LSE Q +M+ D D GL+
Sbjct: 198 LWTLLLQYLRMAESLQMDPVDVLNFIFMLGSLELGKDYSLSALSETQVNMLGDLRDYGLI 257
Query: 263 KLQQGRKENWFIPTKLATNLSM----------------------SLTDSSARKEGFIVVE 300
Q+ F PT+LAT L+ + T ++ G I++E
Sbjct: 258 -YQRKSTSRRFYPTRLATTLTSDTTSLRSASSAMNKVIENAKDSAYTPTNIENSGTIIIE 316
Query: 301 TNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQ 360
TNF++YAY+ S L IL LF ++ + NL+ G IT+ES+ A +GIT+EQIIS+L+
Sbjct: 317 TNFKVYAYTNSPLQIAILNLFVHLKARFSNLVTGQITRESIRKALVSGITSEQIISYLES 376
Query: 361 NAHPRV-------------------ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDE 401
+AHP++ +++ + + DQI+LW+ +L+R+ + + +
Sbjct: 377 HAHPQLRRAAEEELNKKNGFESNNHGEKIQILQPTIADQIKLWQLELDRIMSFDGYLFTD 436
Query: 402 FPS 404
F S
Sbjct: 437 FSS 439
>gi|255936123|ref|XP_002559088.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583708|emb|CAP91723.1| Pc13g06540 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 483
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 129/423 (30%), Positives = 220/423 (52%), Gaps = 51/423 (12%)
Query: 25 RLYENPFICEAILRSLPP----LAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAID 80
+LY+ P AI R + P +AK +VM + Y+ +P +E WV + + A+
Sbjct: 22 KLYQQPSTALAIFRRMLPDLVNVAKCFVMALLYLKDPLPTADLEVWVNSESKKERDNALS 81
Query: 81 RLVQLRLFSEEKKKETT--YRLNSTFQSNLRKHLIYGGALPREPMPSGITARLP-----T 133
L +L + S + Y + + F ++LR+ L GA + G+ ++ P +
Sbjct: 82 ILGRLHILSTTLTGDNVRAYMITNPFAASLRQALT--GA--EDSHSFGVFSQAPEHTLTS 137
Query: 134 LEDLEAYAIGQWECFLLQLI--SSTQAERP-TNFSSSMMKVFQRGLLSRRDKEAPRLTES 190
+ DL+ YA QWE + ++ S+ +R N S + ++ Q G L +T+
Sbjct: 138 ITDLDEYARRQWEGVMGYMVGTSALSGQRDMVNLSKGVKQLLQAGHLVEIRGNRVDITKD 197
Query: 191 GFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQK 250
GF F+L D N Q+W+I+ Y+ +++ G++ +++SFL LS G +Y+ + ++ Q
Sbjct: 198 GFGFVLQDVNTQVWHILILYVESAEAIGMDSVEVLSFLFLLSSLELGSSYDKSHMTPNQL 257
Query: 251 SMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA------------------R 292
+ D AD G+V Q+ F PT+LAT L+ +DSSA
Sbjct: 258 RTLMDLADFGIV-YQEHADATRFYPTRLATTLT---SDSSALSNPVTGSLTGPAGPSGGS 313
Query: 293 KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAE 352
GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GITA+
Sbjct: 314 GSGFIIIETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGITAD 373
Query: 353 QIISFLQQNAHPRV---------ADRMPS--VPENVCDQIRLWESDLNRVEMTPAHYYDE 401
QIIS+L +AHP++ +P+ +P V DQIRLW+ + +R+ T + +
Sbjct: 374 QIISYLLSHAHPQLRKHSAAQPNGKGVPASVLPPTVTDQIRLWQLERDRLRATAGFLFKD 433
Query: 402 FPS 404
F S
Sbjct: 434 FTS 436
>gi|358385986|gb|EHK23582.1| hypothetical protein TRIVIDRAFT_36912 [Trichoderma virens Gv29-8]
Length = 492
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 130/434 (29%), Positives = 217/434 (50%), Gaps = 54/434 (12%)
Query: 18 LTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHR 76
L +LY+ P AI R LP LAK +VM+M ++ + ++ WV P+ K
Sbjct: 16 LPGTTFKKLYQQPSTTFAIFRRMLPHLAKTFVMRMLFLPHPMTLTDLDAWVKPEAKRKKD 75
Query: 77 VAID-----RLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARL 131
++ +VQ+ S+EK +E +L S F+ +LR L GG +PS + +
Sbjct: 76 QSLSILRSLHIVQISAPSKEKPQEI--QLMSNFKKSLRLALTGGGNHNSFGVPSSLL--V 131
Query: 132 PTLEDL---EAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS-----RRDKE 183
P DL + YA +WE L ++SS + S K + L++ RR
Sbjct: 132 PPEIDLPFLDRYARKKWEDVLHFVVSSVGYKSSVGDGSGPNKGVKELLIAGRLVDRRPNG 191
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYI---SNSQERGINQADLISFLLELSFHVAGEAY 240
+ +T++GF FLL + NAQ+W ++ ++ N++ G++ D++SFL L+ G AY
Sbjct: 192 SLGITQAGFTFLLQEANAQVWTLLLLWLEVLGNNKGSGLDPVDMLSFLFMLASLELGRAY 251
Query: 241 NLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS--------------- 285
+ N L+E +++M+ D GL+ + Q K + F PT+LAT L+
Sbjct: 252 DTNALTEERRNMLPSLVDFGLIYIPQ-HKRSMFFPTRLATTLTSGGNSLRTISEGVTAAT 310
Query: 286 ------------LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIV 333
L S + G +V+ETN+R+YAY+ S L +L LF+K+ + P+++
Sbjct: 311 QSAQTSQQALGPLGGSGGEQAGSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVA 370
Query: 334 GAITKESLYNAFENGITAEQIISFLQQNAHPRVA-----DRMPSVPENVCDQIRLWESDL 388
G +++ S+ A GITA+QIIS+L +AH ++ P +P V DQIRLW+ +
Sbjct: 371 GRLSRASIRQAINFGITADQIISYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLEN 430
Query: 389 NRVEMTPAHYYDEF 402
R++ T + +F
Sbjct: 431 ERMKTTSGFLFRDF 444
>gi|340518979|gb|EGR49219.1| trancription factor tfb2-like protein [Trichoderma reesei QM6a]
Length = 480
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 128/424 (30%), Positives = 217/424 (51%), Gaps = 46/424 (10%)
Query: 18 LTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHR 76
L +LY+ P AI R LP LAK +VM+M ++ + ++ WV P+ K
Sbjct: 16 LPGTTFKKLYQQPSTTFAIFRRMLPHLAKTFVMRMLFLPHPMTLADLDAWVKPEAKRKKD 75
Query: 77 VAID-----RLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARL 131
++ +VQ+ S+EK +E +L S F+ +LR L GG +PS + +
Sbjct: 76 QSLSILRSLHIVQISAPSKEKPQEI--QLMSNFKRSLRLALTGGGNHNSFGVPSSLM--V 131
Query: 132 PTLEDL---EAYAIGQWECFLLQLISST-------QAERPTNFSSSMMKVFQRGLLSRRD 181
P DL + YA +WE L ++SS + P ++ Q L+ RR
Sbjct: 132 PPEIDLPFLDRYARKKWEDVLHFVVSSVGYKSSLGEGSGPNKGVKELLIAGQ--LVDRRP 189
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYI---SNSQERGINQADLISFLLELSFHVAGE 238
+ +T++GF FLL + NAQ+W ++ ++ N++ G++ D++SFL L+ G
Sbjct: 190 NGSLGITQAGFTFLLQEANAQVWTLLLLWLEVLGNNKSSGLDPVDMLSFLFMLASLELGR 249
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS--------------- 283
AY+ N L+E +++M+ D GL+ + Q K + F PT+LAT L+
Sbjct: 250 AYDTNALTEERRNMLPSLVDFGLIYIPQ-HKRSMFFPTRLATTLTSGGNSLRTISEGSLG 308
Query: 284 MSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYN 343
+ + + G +V+ETN+R+YAY+ S L +L LF+K+ + P+++ G +++ S+
Sbjct: 309 LLGGGGTGEQAGSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAGRLSRASIRQ 368
Query: 344 AFENGITAEQIISFLQQNAHPRVA-----DRMPSVPENVCDQIRLWESDLNRVEMTPAHY 398
A GITA+QIIS+L +AH ++ P +P V DQIRLW+ + R++ T
Sbjct: 369 AINFGITADQIISYLSAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENERMKTTSGFL 428
Query: 399 YDEF 402
+ +F
Sbjct: 429 FRDF 432
>gi|346972761|gb|EGY16213.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
dahliae VdLs.17]
Length = 476
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 123/424 (29%), Positives = 209/424 (49%), Gaps = 41/424 (9%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + L +LY NP I R L PLAK +V + Y+ G IP +E V P+
Sbjct: 13 DYLEKLPGTTFRKLYLNPSTALCISRRMLSPLAKTFVTMLLYLPGPIPIADLEARVKPEY 72
Query: 72 FTKHRVAIDRLVQLRLFS---EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
A+ +L L + + +L + F + R+ L GA P G+
Sbjct: 73 KRAKDHALAQLRSLHMLQMSVPTQGAPQMIQLTANFSKSYREALEGNGA----PGSFGVI 128
Query: 129 ARLP-----TLEDLEAYAIGQWECFLLQLISST-----QAERPTNFSSSMMKVFQRGLLS 178
+ P T+E +E +A +W+ L +++S + P N S+ ++ G L
Sbjct: 129 SPHPRSPEITIESIEKHARQKWDAILHYIVNSVTPGFVDSGGPKN---SVKELLLAGQLV 185
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+R + +T++GF FLL + NAQ+W ++ ++ + ++ D++SFL +LS E
Sbjct: 186 KRQGGSTTITQAGFTFLLQEENAQVWTLLLLWLDATDQQTAKHTDMLSFLFQLSILELDE 245
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIP--TKLATNLSM---SLTDS---- 289
Y+ LS+ ++ M+ D GLV + + +F+ + TN+S +LTD
Sbjct: 246 VYDTEALSKERQDMLPSLMDFGLVYIPSHKPFQYFVTPQARALTNVSTATRTLTDGMNAL 305
Query: 290 ------SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYN 343
+ I+VETN+R+YAY+TS+L +L LF + + P ++ G +T++S+
Sbjct: 306 SQPQGGPGDQTNGIIVETNYRVYAYTTSQLQIAVLSLFCHLSVKFPGMVSGRLTRQSVRQ 365
Query: 344 AFENGITAEQIISFLQQNAHPRV-----ADRMPSVPENVCDQIRLWESDLNRVEMTPAHY 398
A + GITA+QIIS+L +AH ++ AD P +P V DQIRLW+ + R++ T
Sbjct: 366 AIDFGITADQIISYLAAHAHEQMHRQAAADGKPVLPPTVVDQIRLWQLENERMKTTSGFL 425
Query: 399 YDEF 402
+ +F
Sbjct: 426 FRDF 429
>gi|402083674|gb|EJT78692.1| RNA polymerase II transcription factor B subunit 2 [Gaeumannomyces
graminis var. tritici R3-111a-1]
Length = 496
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 124/436 (28%), Positives = 218/436 (50%), Gaps = 60/436 (13%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTK--HRVAIDR 81
+LY+ P AI R LP LAK +VM + Y+ + ++ W PD + H +AI R
Sbjct: 22 KLYQQPSTAFAIFRRMLPSLAKTFVMALLYMPKPLSLADLDTWAKPDARKQKDHALAILR 81
Query: 82 LVQLRLF---SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED-- 136
++ + L S+EK +E + L F+ +LR L GG G+ + LP +
Sbjct: 82 VLHIVLITAPSKEKSQEVS--LTPNFKKSLRLALEGGGT---SHNTFGVPSSLPVPPNVD 136
Query: 137 ---LEAYAIGQWECFLLQLISSTQ----AERPTN--------FSSSMMKVFQRGLLSRRD 181
L+ +A +W+ L +++S + A+RP+ ++ ++ G L R
Sbjct: 137 IPFLDKFARTRWDAILHYVVNSVEGDEFAQRPSKRLHSGGSKLQDTVKELLVAGGLVERR 196
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQA----DLISFLLELSFHVAG 237
+ ++++GF FLL ++NAQ+W ++ +++ G + + D++SFL L G
Sbjct: 197 GGSISISKTGFTFLLQESNAQVWTLLLQWLEAVNAAGGDHSAMAVDMLSFLFMLGTLELG 256
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM------------- 284
+AY+ LSE +++M+ D GLV + G +F PT+LAT L+
Sbjct: 257 QAYDTEELSEQRRNMLPSLVDFGLVYIPPGNTSQYF-PTRLATTLTSGSSALRSASSALA 315
Query: 285 ---------SLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
S S +G I++ETN+R+YAY+++ L IL LF+++ ++ ++ G
Sbjct: 316 AATAEGPNASGGGQSEAAKGSIIIETNYRLYAYTSTPLQIAILGLFAELRFRFAGMVTGR 375
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHP---RVAD--RMPSVPENVCDQIRLWESDLNR 390
+ +ES+ A GITA+Q+I +L +AH R A + P +P V DQIRLW+ + R
Sbjct: 376 LDRESIKRAISYGITADQVIEYLAAHAHEQMHRTATLRKKPVLPPTVVDQIRLWQLETER 435
Query: 391 VEMTPAHYYDEFPSRV 406
+++ + +F S+
Sbjct: 436 MKIMRGFLFRDFDSQA 451
>gi|302405781|ref|XP_003000727.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
albo-atrum VaMs.102]
gi|261360684|gb|EEY23112.1| RNA polymerase II transcription factor B subunit 2 [Verticillium
albo-atrum VaMs.102]
Length = 476
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 123/424 (29%), Positives = 209/424 (49%), Gaps = 41/424 (9%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + L +LY NP I R L PLAK +V + Y+ G IP +E V P+
Sbjct: 13 DYLEKLPGTTFRKLYLNPSTALCISRRMLSPLAKTFVTMLLYLPGPIPIADLEARVKPEY 72
Query: 72 FTKHRVAIDRLVQLRLFS---EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
A+ +L L + + +L + F + R+ L GA P G+
Sbjct: 73 KRAKDHALAQLRSLHMLQMSVPTQGAPQMIQLTANFSKSYREALEGNGA----PGSFGVI 128
Query: 129 ARLP-----TLEDLEAYAIGQWECFLLQLISST-----QAERPTNFSSSMMKVFQRGLLS 178
+ P T+E +E +A +W+ L +++S + P N S+ ++ G L
Sbjct: 129 SPHPRSPEITIESIEKHARQKWDAILHYIVNSVTPGFVDSGGPKN---SVKELLLAGQLV 185
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE 238
+R + +T++GF FLL + NAQ+W ++ ++ + ++ D++SFL +LS E
Sbjct: 186 KRQGGSTTITQAGFTFLLQEENAQVWTLLLLWLDATDQQTAKHTDMLSFLFQLSILELDE 245
Query: 239 AYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIP--TKLATNLSM---SLTDS---- 289
Y+ LS+ ++ M+ D GLV + + +F+ + TN+S +LTD
Sbjct: 246 VYDTEALSKERQDMLPSLMDFGLVYIPSHKPFQYFVTPQARALTNVSTATRTLTDGMNAL 305
Query: 290 ------SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYN 343
+ I+VETN+R+YAY+TS+L +L LF + + P ++ G +T++S+
Sbjct: 306 SQSQGGPGDQTNGIIVETNYRVYAYTTSQLQIAVLSLFCHLSVKFPGMVSGRLTRQSVRQ 365
Query: 344 AFENGITAEQIISFLQQNAHPRV-----ADRMPSVPENVCDQIRLWESDLNRVEMTPAHY 398
A + GITA+QIIS+L +AH ++ AD P +P V DQIRLW+ + R++ T
Sbjct: 366 AIDFGITADQIISYLAAHAHEQMHRQAAADGKPVLPPTVVDQIRLWQLENERMKTTSGFL 425
Query: 399 YDEF 402
+ +F
Sbjct: 426 FRDF 429
>gi|255713706|ref|XP_002553135.1| KLTH0D09790p [Lachancea thermotolerans]
gi|238934515|emb|CAR22697.1| KLTH0D09790p [Lachancea thermotolerans CBS 6340]
Length = 506
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 126/446 (28%), Positives = 216/446 (48%), Gaps = 67/446 (15%)
Query: 18 LTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRV 77
L A+ RLY +P C AI R LP LAK ++M + + + + ++ WV G +
Sbjct: 17 LPAQVHSRLYSSPATCLAIYRLLPKLAKFFIMSLVFNETDASLRDLDRWVKSGGKYQFNE 76
Query: 78 AIDRLVQLRLFSEEKKKET-TYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED 136
AI + L L E + LN F+S+ R L GG + + + T+
Sbjct: 77 AIKSMKSLHLLIEGNSGQPFMINLNPIFRSSFRNALT-GGEV-NNSFGNVVEDENETVHT 134
Query: 137 --LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRG-LLSRRDKEAPRLTESGFQ 193
L+ YA +WE L ++ + + P S +++ + Q L+ ++T GFQ
Sbjct: 135 SILDQYAANKWETILHFMVGTPLTQTP---SRNVLSLLQHSKLMEESSSGELQITNEGFQ 191
Query: 194 FLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMI 253
FLL D NAQ+W ++ +Y+ ++ ++ D+++F+ L G+AY+ LS+ QK M+
Sbjct: 192 FLLQDANAQIWALLLQYLRLAETLQMDPVDVLNFIFMLGALELGKAYSDTNLSDTQKIML 251
Query: 254 KDFADLGLVKLQQGRKENWFIPTKL----------------ATNLSMSLTDSSARKE--- 294
+D D GLV Q+ + F PT+L A N +S SS+++E
Sbjct: 252 QDMRDYGLV-FQKASNTHKFYPTRLTAMLTSDTSSIRSASGAMNSVLSQGTSSSKEEAAA 310
Query: 295 --------------------GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG 334
G +++ETNF++Y+YS S L +L LF ++ + N++ G
Sbjct: 311 VAETDEETAQVGGTTQNIPDGAVILETNFKLYSYSNSPLQIAVLSLFVHLKSRFSNMVTG 370
Query: 335 AITKESLYNAFENGITAEQIISFLQQNAHP---RVADR---------------MPSVPEN 376
IT+ES+ A NGITA+QII++++ +AHP R+A++ + +P
Sbjct: 371 QITRESIRRALHNGITADQIIAYMETHAHPQMRRLAEQTLDKKMELDPNCNEGLQILPPT 430
Query: 377 VCDQIRLWESDLNRVEMTPAHYYDEF 402
V DQI+LW+ +L+R+ + + +F
Sbjct: 431 VVDQIKLWQLELDRIISYDGYLFTDF 456
>gi|403180193|ref|XP_003338499.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166015|gb|EFP94080.2| hypothetical protein PGTG_19983 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 679
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 90/253 (35%), Positives = 152/253 (60%), Gaps = 12/253 (4%)
Query: 161 PTNFSSSMMKVF-QRGLLSRRDK---EAPRLTESGFQFLLMDTNAQLWYIVREYISNSQE 216
PT S +++ + Q GL++ D ++ ++T GF FLL D N QLW I+ +Y+ ++
Sbjct: 7 PTKPSHNILSLLGQSGLMTSSDPRSLQSLKITSKGFGFLLEDVNTQLWDILLQYLKMTEA 66
Query: 217 RGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPT 276
G++ D+++ L L G+ Y+ + + Q +++D D GLV + + F PT
Sbjct: 67 NGLDVVDVLACLFMLGSLELGQEYSFSNWTPTQTQVLQDLVDYGLVLVSAPDR---FYPT 123
Query: 277 KLATNLSMSL-----TDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNL 331
+LAT L+ + + + + GF+V+ETN+R+YAY+++ L +L LF + Y+ PNL
Sbjct: 124 RLATTLTSTAPPLVSAERAQEEHGFLVLETNYRIYAYTSNPLQIAVLNLFLSLRYRFPNL 183
Query: 332 IVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRV 391
+VGA+T+ES+ +A NGITA+Q+I +L +AHP++ P +P V DQIRLWE + NR+
Sbjct: 184 VVGAVTRESIKSALSNGITADQVIMYLHTHAHPQMRKLEPLLPPTVVDQIRLWELEKNRI 243
Query: 392 EMTPAHYYDEFPS 404
+ Y++F S
Sbjct: 244 RAQEGYLYEDFKS 256
>gi|256086358|ref|XP_002579367.1| tfiih polypeptide [Schistosoma mansoni]
Length = 486
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 123/422 (29%), Positives = 201/422 (47%), Gaps = 47/422 (11%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
LD LY +P C + R LP LAK +VM++ +I+ IP ++ WV + L
Sbjct: 21 LDELYTHPPTCLVVFRELPELAKHFVMRLLFIEQPIPKSIVSGWVEKGSIALLNDSCKAL 80
Query: 83 VQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT--ARLPTLEDLE 138
LR++ ++ ++ LN +Q ++R L +GG P +T +++ L+
Sbjct: 81 TDLRIWHSTDSNVSRGSWSLNKRYQESIRISL-FGGGKPLLGDLGVVTNDKYSKSVDFLK 139
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLM 197
+YA +W+ L ++ S AE + ++ L+ + P +T+ GF FLLM
Sbjct: 140 SYAAERWDAILHFMVGSESAEVGSVVKDVLL---LSNLMKCEGNDCPIGITKHGFHFLLM 196
Query: 198 DTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFA 257
Q+ + Y +E N + F+ +LSF ++Y + LS Q+ +++
Sbjct: 197 SRQFQVLVFILHYFDYLKENSKNLVGALQFVFQLSFLCPTKSYPVEALSTAQQEVLQHMR 256
Query: 258 DLGLVKLQQGRKENWFIPTKLA--------------------------TNLSMS------ 285
+LGL Q+ R F T LA T +S S
Sbjct: 257 ELGLA-YQRKRTAPRFYVTPLALDFAGGHTTFLESKSGGWGPQSGVIPTGVSKSDPTDSN 315
Query: 286 -----LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKES 340
++ S++ G+I++ETNFR+YAY+ S L +L LFSK+ + PNL+V IT++S
Sbjct: 316 KMFSQISVSNSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDS 375
Query: 341 LYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYD 400
+ A GITA QI+SFL NAHP + + P +P + DQIRLWE + +R Y+
Sbjct: 376 VREALIRGITANQILSFLTANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYE 435
Query: 401 EF 402
+F
Sbjct: 436 QF 437
>gi|378729002|gb|EHY55461.1| transcription initiation factor TFIIH subunit H4 [Exophiala
dermatitidis NIH/UT8656]
Length = 508
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 132/444 (29%), Positives = 213/444 (47%), Gaps = 67/444 (15%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
RLY +P AI R L LAK +VM + Y+ +P K +E +V + AID L
Sbjct: 22 RLYRSPASALAIFRKRLTSLAKSFVMMLVYMPAPVPVKQLELFVKDTSRGEREYAIDLLH 81
Query: 84 QLRLFSE-EKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSG-ITARLP-----TLED 136
+ +F + Y L F +LR+ L G + G + +P T+
Sbjct: 82 RYHIFKDVTYNGAKAYALTPDFAKSLRRALTGAG----DSRSFGQVATNVPEDQKVTIAQ 137
Query: 137 LEAYAIGQWECFLLQLISS---------TQAERPTNFSSSMMKVFQRGLL-----SRRDK 182
L+ YA +WE L ++ S T+ + S ++++ + G L +
Sbjct: 138 LDEYARQRWEGILGYMVGSSSILLESGTTENQASIQPSPGVIELLKAGHLIELTGTYSRG 197
Query: 183 EAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNL 242
+A ++T+ GF F+L D N Q+W ++ Y+ N++ +++ D++SFL +S G AY+
Sbjct: 198 QAAKITKEGFAFVLQDINTQIWALLFLYVDNAEVFEMDKVDVLSFLFFVSSLELGLAYST 257
Query: 243 NTLSEIQKSMIKDFADLGLVK---LQQGRKENWFIPTKLA---------------TNLSM 284
L E Q + D G+V L+ G ++F PT+LA T L+
Sbjct: 258 APLDETQSRCLSDLVSFGIVYQPLLEDGTPVDYFYPTRLATTLTSDSSTTLSATNTTLAS 317
Query: 285 SLTDSSARKEG---------FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
SL+ SA FI+VETN+R+YAY++S L +L LF + + PNL+ G
Sbjct: 318 SLSTKSAHSSSSSSAGAGKGFIIVETNYRLYAYTSSPLQIALLSLFVNLRSRHPNLVTGK 377
Query: 336 ITKESLYNAFENGITAEQIISFLQQNAHP-----------RVADR---MPSVPENVCDQI 381
++K S+ A + GITA+QIIS+L +AHP R AD +P +P + DQI
Sbjct: 378 MSKSSVQRAIQAGITADQIISYLTSHAHPQMRRHAQAEQARNADPNRVVPILPATILDQI 437
Query: 382 RLWESDLNRVEMTPAHYYDEFPSR 405
LW+ + +R+ TP +FP++
Sbjct: 438 HLWQLERDRMTTTPGFLLKDFPNQ 461
>gi|353231086|emb|CCD77504.1| putative tfiih, polypeptide [Schistosoma mansoni]
Length = 486
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 123/422 (29%), Positives = 201/422 (47%), Gaps = 47/422 (11%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
LD LY +P C + R LP LAK +VM++ +I+ IP ++ WV + L
Sbjct: 21 LDELYTHPPTCLVVFRELPELAKHFVMRLLFIEQPIPKSIVSGWVEKGSSALLNDSCKAL 80
Query: 83 VQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT--ARLPTLEDLE 138
LR++ ++ ++ LN +Q ++R L +GG P +T +++ L+
Sbjct: 81 TDLRIWHSTDSNVSRGSWSLNKRYQESIRISL-FGGGKPLLGDLGVVTNDKYSKSVDFLK 139
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLM 197
+YA +W+ L ++ S AE + ++ L+ + P +T+ GF FLLM
Sbjct: 140 SYAAERWDAILHFMVGSESAEVGSVVKDVLL---LSNLMKCEGNDCPIGITKHGFHFLLM 196
Query: 198 DTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFA 257
Q+ + Y +E N + F+ +LSF ++Y + LS Q+ +++
Sbjct: 197 SRQFQVLVFILHYFDYLKENSKNLVGALQFVFQLSFLCPTKSYPVEALSTAQQEVLQHMR 256
Query: 258 DLGLVKLQQGRKENWFIPTKLA--------------------------TNLSMS------ 285
+LGL Q+ R F T LA T +S S
Sbjct: 257 ELGLA-YQRKRTAPRFYVTPLALDFAGGHTTFLESKSGGWGPQSGVIPTGVSKSDPTDSN 315
Query: 286 -----LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKES 340
++ S++ G+I++ETNFR+YAY+ S L +L LFSK+ + PNL+V IT++S
Sbjct: 316 KMFSQISVSNSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDS 375
Query: 341 LYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYD 400
+ A GITA QI+SFL NAHP + + P +P + DQIRLWE + +R Y+
Sbjct: 376 VREALIRGITANQILSFLTANAHPDMLLQNPILPPTLTDQIRLWELERDRFLFQEGCLYE 435
Query: 401 EF 402
+F
Sbjct: 436 QF 437
>gi|346323394|gb|EGX92992.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Cordyceps militaris CM01]
Length = 511
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 131/459 (28%), Positives = 222/459 (48%), Gaps = 67/459 (14%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEW 66
A D + L +LY+ P AI R LP LAK VM+M Y+ G + ++ W
Sbjct: 8 AMQLADYLEKLPGTTFRKLYQQPSTAFAIFRRMLPHLAKTIVMRMLYMSGPMSLADLDAW 67
Query: 67 VLPDGFTKHRVAIDRLVQLRLF---SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPM 123
V P+ + + L L + + K K +L + F++++R L GG +
Sbjct: 68 VKPEAKRQKDQMLSTLRSLHIVQISAPAKDKTQEVQLTTNFRTSMRLALTGGGNHNSFGV 127
Query: 124 PSGITARLPTLED--LEAYAIGQWECFLLQLISSTQAERPTNF------------SSSMM 169
PS +T P ++ L+ YA +W+ L ++SS + + S+
Sbjct: 128 PSSLTVP-PEIDAPFLDRYARRRWDEILHFVVSSVGFKSTATTAAGGEGGGGGGPNKSVR 186
Query: 170 KVFQRG-LLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQA-DLISF 227
+ G L+ RR + +T++GF FLL + NAQ+W ++ ++ ++Q+ N A D +SF
Sbjct: 187 DILIAGRLVDRRPSGSVGITQAGFTFLLQEANAQVWTLLLLWL-DAQDADKNAAVDTLSF 245
Query: 228 LLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS---- 283
L L+ G AY+ + LSE ++ M+ AD GLV + +K + F PT+LAT L+
Sbjct: 246 LFMLASLEPGRAYDTHALSEQRRDMLPALADFGLVYIPP-QKRSMFFPTRLATTLTSGAS 304
Query: 284 ---MSLTDS--------------------------------SARKEGFIVVETNFRMYAY 308
S++D S + G +V+ETN+R+YAY
Sbjct: 305 SGLRSISDGVAAATSAARSSPSSSAHQPASSSAGSGPLGGGSGDQPGAVVIETNYRLYAY 364
Query: 309 STSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVA- 367
+ S L +L LF+++ + P+++ G +T+ S+ A + GITA+QII++L +AH ++
Sbjct: 365 TQSTLQIAVLALFARLTMRFPDMVAGRLTRTSIRQAIQFGITADQIIAYLAAHAHEQMHR 424
Query: 368 ----DRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
P +P V DQIRLW+ + R++ TP + +F
Sbjct: 425 TAALTNKPVLPPTVVDQIRLWQLENERMKTTPGFLFRDF 463
>gi|254573074|ref|XP_002493646.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Komagataella pastoris GS115]
gi|238033445|emb|CAY71467.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Komagataella pastoris GS115]
gi|328354526|emb|CCA40923.1| RNA polymerase II transcription factor B subunit 2 [Komagataella
pastoris CBS 7435]
Length = 547
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 129/485 (26%), Positives = 219/485 (45%), Gaps = 115/485 (23%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRV------ 77
++LY++ C AI R LP LAK YVM + + + ++ + W+ PD T H V
Sbjct: 25 NKLYQSSITCLAIFRLLPSLAKFYVMSLLFNEHSVSYIDLTRWIRPDNMT-HLVSSSTKS 83
Query: 78 ----------------------AIDRLVQLRLFSEEKKKET-----------TYRLNSTF 104
+ RL QL L + +K+ T +LN F
Sbjct: 84 SSSSAVTNFPGAIGKNKMYQNECLKRLKQLNLIKDVRKQWTNPNTGQLTNVVVLQLNPMF 143
Query: 105 QSNLRKHLIYGGALPREPMPSGITARLP-------TLEDLEAYAIGQWECFLLQLISSTQ 157
+ N R AL M + + + + E L+ Y++ +WE L ++ S
Sbjct: 144 RKNFR------NALTGSQMSNDLNNDIEMTDVEEVSEEFLDRYSLMKWENILHFMVGSEV 197
Query: 158 AERPTNFSSSMMKVFQRGLLS-----------------RRDKEAPRLTESGFQFLLMDTN 200
E P+ S+++ GL+ D ++ ++T+ GFQFLL D N
Sbjct: 198 PELPSIGVLSLLR--HSGLMDIGEENEFIATPKRFKGDNMDLKSMKITQDGFQFLLQDIN 255
Query: 201 AQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLG 260
AQ+W ++ +Y+ S++ +N D+++F+ L G+ Y + LS+ Q M++D D G
Sbjct: 256 AQIWQLLLQYLRMSEKLMMNPVDVLNFIFMLGSLELGKQYPVTLLSDTQVIMLEDLVDYG 315
Query: 261 LVKLQQGRKENWFIPTKLATNL----------SMSLTD-----SSARKEGFIVVETNFRM 305
L+ + K +F PT++AT L SMS+ + G IV+ETNF++
Sbjct: 316 LIYYPE--KNRYFYPTRMATTLTSEKTTFKTASMSMNQVLEGGPEVKDHGSIVLETNFKL 373
Query: 306 YAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 365
Y Y+TS L IL LF + + N++ G IT+ES+ A NGITA+QII +L+ +AH +
Sbjct: 374 YCYTTSPLQIAILNLFVHLRTRFANMVTGMITRESVREALRNGITADQIIKYLETHAHSQ 433
Query: 366 VAD--------------------------RMPSVPENVCDQIRLWESDLNRVEMTPAHYY 399
+ ++ +P V DQI+LW+ +++R++ +
Sbjct: 434 MKKLAEEKLLKKLEFDSNATETQQIGSQVKLEVIPPTVVDQIKLWQLEMDRLQTFAGFLF 493
Query: 400 DEFPS 404
+F +
Sbjct: 494 KDFAN 498
>gi|406862938|gb|EKD15987.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
Length = 421
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 104/329 (31%), Positives = 184/329 (55%), Gaps = 24/329 (7%)
Query: 100 LNSTFQSNLRKHLIYGGALPREPMPS-GITARLPTLEDLEAYAIGQWE---CFLLQLISS 155
L F+++LR L GG +PS TA E L+ YA +WE F++ + +
Sbjct: 46 LTKNFRASLRLALTGGGEQQSFGIPSHDSTASGVDAEFLDNYARERWEDILHFVVNSVGN 105
Query: 156 TQAERPTNFSSSMMKVFQRG-LLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNS 214
+ + T +S+ + + G L++ + +T++GF FLL + NAQ+W ++ +I N+
Sbjct: 106 SMRQDGTGPPTSVRLLLEAGKLVTTGRRSGGGITQAGFSFLLQEVNAQVWALLLLWIENA 165
Query: 215 QERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFI 274
++ + +++SFL L G++Y+ TLS Q++M+K D GLV + +F
Sbjct: 166 EKMELESVEVLSFLFMLGSLELGKSYSTATLSASQRAMLKYLVDFGLVYSPSSTPQQFF- 224
Query: 275 PTKLATNLSMSLTD-------------SSARKEGFIVVETNFRMYAYSTSKLHCEILRLF 321
PT+L+T L+ + S + +GFI++ETN+R+YAY+ S L +L LF
Sbjct: 225 PTRLSTTLTSDASGLRSVSAGFDDALKSESGTKGFIIIETNYRLYAYTNSPLQIAVLALF 284
Query: 322 SKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV-----ADRMPSVPEN 376
+K+ + PN++ G +T++S+ NA +GIT++QIIS+L+ +AHP++ + P +P
Sbjct: 285 TKMGVRYPNMVTGRVTRKSVANAISHGITSDQIISYLRAHAHPQLVKAAAVNGNPVLPPT 344
Query: 377 VCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
V DQIRLW+ + R++ T + EF ++
Sbjct: 345 VVDQIRLWQIENERMKATAGFLFKEFETQ 373
>gi|400597688|gb|EJP65418.1| RNA polymerase II transcription factor B subunit 2 [Beauveria
bassiana ARSEF 2860]
Length = 507
Score = 171 bits (432), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 132/458 (28%), Positives = 219/458 (47%), Gaps = 69/458 (15%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEW 66
A D + L +LY+ P AI R LP LAK +VM+M Y+ + ++ W
Sbjct: 8 ALQLADYLEKLPGTTFRKLYQQPSTAFAIFRRMLPQLAKTFVMRMLYMPNPMSLNDLDSW 67
Query: 67 VLPDGFTKHRVAIDRL-----VQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPRE 121
V P+ + + L VQ+ S++K +E +L + F++++R L GG
Sbjct: 68 VKPEAKRQKDQMLSTLRSLHIVQISAPSKDKTQEV--QLTANFRTSMRLALTGGGNHGSF 125
Query: 122 PMPSGITARLPTLEDL---EAYAIGQWECFLLQLISSTQAERPTNF-------SSSMMKV 171
+PS T +P DL + YA +W+ L ++SS + + S+ +
Sbjct: 126 GVPS--TLAVPPEIDLPFLDRYARRRWDEILHFVVSSVGFKNAGGGGAAADGPNKSVRDI 183
Query: 172 FQRG-LLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLE 230
G L+ RR + +T++GF FLL + NAQ+W ++ ++ D++SFL
Sbjct: 184 LVAGRLVDRRPNGSIGITQAGFTFLLQEANAQVWTLLLLWLDALDADKSAAVDMLSFLFV 243
Query: 231 LSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS- 289
L+ G AY+ N L+E ++ M+ AD GLV + +K + F PT+LAT L+ + S
Sbjct: 244 LASLELGRAYDTNALTEHRRDMLPALADFGLVYIPP-QKRSMFFPTRLATTLTSGASSSL 302
Query: 290 ---------------------------------------SARKEGFIVVETNFRMYAYST 310
+ + G IV+ETN+R+YAY+
Sbjct: 303 RSISEGVVAATSAALSASSSSSTHQDQNHQSRAGPLGAGAGDQIGSIVIETNYRLYAYTQ 362
Query: 311 SKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR----- 365
S L +L LFS++ + P+++ G IT+ S+ A + GITA+QII++L +AH +
Sbjct: 363 STLQIAVLALFSRLTMRFPDMVAGRITRTSIRQAIQFGITADQIIAYLAAHAHEQMHRSA 422
Query: 366 -VADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
+A R P +P V DQIRLW+ + R++ T + +F
Sbjct: 423 ALAAR-PVLPPTVVDQIRLWQLENERMKTTGGFLFRDF 459
>gi|358394641|gb|EHK44034.1| hypothetical protein TRIATDRAFT_222970 [Trichoderma atroviride IMI
206040]
Length = 491
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 128/433 (29%), Positives = 213/433 (49%), Gaps = 53/433 (12%)
Query: 18 LTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHR 76
L +LY+ P AI R LP LAK +VM+M ++ + ++ WV P+ K
Sbjct: 16 LPGTTFKKLYQQPSTAFAIFRRMLPHLAKTFVMRMLFMPHPMTLNDLDVWVKPEAKRKKD 75
Query: 77 VAID-----RLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARL 131
++ +VQ+ S+EK +E +L S F+ +LR L GG +PS + +
Sbjct: 76 QSLSILRGLHIVQISAPSKEKPQEI--QLMSNFKRSLRLALTGGGNHNSFGVPSSLL--I 131
Query: 132 PTLEDL---EAYAIGQWECFLLQLISSTQAERPTNFSSSMMKV----FQRGLLSRRDKEA 184
P DL + YA +W+ L ++SS + + S V L+ RR
Sbjct: 132 PPEIDLPFLDRYARKKWDDVLHFVVSSVGYKSVGDGSGPNKGVKELLIAGRLVDRRPNGG 191
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYI---SNSQERGINQADLISFLLELSFHVAGEAYN 241
+T++GF FLL + NAQ+W ++ ++ N++ G++ D++SFL L+ G AY+
Sbjct: 192 LGITQAGFTFLLQEANAQVWTLLLLWLDVLGNNKGSGLDPVDMLSFLFMLASLELGRAYD 251
Query: 242 LNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGF----- 296
N L+E +++M+ D GL+ + Q K + F PT+LAT L+ EG
Sbjct: 252 TNALTEERRNMLPSLVDFGLIYIPQ-HKRSMFFPTRLATTLTSGGNSLRTISEGVTAATQ 310
Query: 297 ----------------------IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG 334
+V+ETN+R+YAY+ S L +L LF+K+ + P+++ G
Sbjct: 311 SAQTSQQSLGPLGGGGEQQSGSVVIETNYRLYAYTQSALQIAVLALFAKLNMRFPDMVAG 370
Query: 335 AITKESLYNAFENGITAEQIISFLQQNAHPRVA-----DRMPSVPENVCDQIRLWESDLN 389
+++ S+ A GITA+QIIS+L +AH ++ P +P V DQIRLW+ +
Sbjct: 371 RLSRASIRQAINFGITADQIISYLAAHAHEQMHRTAALTNKPVLPPTVVDQIRLWQLENE 430
Query: 390 RVEMTPAHYYDEF 402
R++ T + +F
Sbjct: 431 RMKTTSGFLFRDF 443
>gi|398406290|ref|XP_003854611.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
gi|339474494|gb|EGP89587.1| hypothetical protein MYCGRDRAFT_69317 [Zymoseptoria tritici IPO323]
Length = 493
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 131/427 (30%), Positives = 208/427 (48%), Gaps = 39/427 (9%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEW 66
A +D + SL +LY P A+ R LP LAK VM M Y+ G + A +E W
Sbjct: 6 ATTALDYLESLPGLTFAKLYAQPSTALAVFRRMLPHLAKTVVMSMLYLPGPMRAADLEAW 65
Query: 67 VLPD--GFTKHRVAIDRLVQLRLFSEEKKKETT---YRLNSTFQSNLRKHLIYGGALPRE 121
D + ++ LV+L++ EE+ YRL+ F LR L GA+ +
Sbjct: 66 FKSDVQSSQERERSVSVLVRLKVLVEEQAGGGAGLQYRLSEAFAKGLRCALT--GAVGKG 123
Query: 122 PMPSG---ITARLPTLED-----LEAYAIGQWECFLLQLISSTQAERPTNF--SSSMMKV 171
G + ED L+ +A QWE L ++ S + + S+ ++
Sbjct: 124 DAGKGSFGVACSAEKDEDVNVKFLDDFARRQWEAILYYVVGSANSGLSGDLEISAGTKRL 183
Query: 172 FQRG-LLSRRDKEAPRL-TESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLL 229
Q+G + + RL T++GF FLL + NAQ+W + Y+ ++ ++ D++SFL
Sbjct: 184 LQQGNFVQIKSGGKQRLITQAGFTFLLEEVNAQVWSLAVVYLRVCEDLQMDPVDVLSFLF 243
Query: 230 ELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS------ 283
L G Y+ + L+ Q+ M+ D D GL+ + + PT+LAT L+
Sbjct: 244 TLGSLELGIPYSTSNLTLTQQHMLDDLQDFGLI-YRSSPTATAYYPTRLATTLTSDAPAL 302
Query: 284 -------MSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAI 336
+ + + ++G+I++ETN+R+YAY++S L IL LFS + + PNLI I
Sbjct: 303 PNNTSSTTTTAATESNEKGYIILETNYRLYAYTSSPLQISILSLFSTLHTRYPNLITAKI 362
Query: 337 TKESLYNAFENGITAEQIISFLQQNAHP---RVADRM--PSVPENVCDQIRLWESDLNRV 391
TK S+ +A IT++QII++L +AHP R A M P +P V DQIRLW+ + R+
Sbjct: 363 TKSSIASAISQSITSDQIITYLSTHAHPVLRRQAAIMSAPILPPTVVDQIRLWQIEGERM 422
Query: 392 EMTPAHY 398
Y
Sbjct: 423 TTWKGFY 429
>gi|388581217|gb|EIM21527.1| transcription factor Tfb2 [Wallemia sebi CBS 633.66]
Length = 449
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 111/317 (35%), Positives = 180/317 (56%), Gaps = 22/317 (6%)
Query: 100 LNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAE 159
LN F+ +L++ LI G + E + + IT T+E L+ +A QWE L ++ S +
Sbjct: 84 LNEQFKHSLKQILI-GREI--ESLSNSITT---TIEQLDTFATQQWELILHYMVGSYNST 137
Query: 160 RPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGI 219
S ++ + ++ L + R++ GFQFLL D QLW ++ EY+S S+ R +
Sbjct: 138 P----SKGVLFLLEKSNLIYTANKQHRISSKGFQFLLEDVQVQLWQLLLEYLSLSEIRQM 193
Query: 220 NQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLA 279
+ +++SFL L G+ Y++ +L+ Q++M+ D D GLV Q+ F PT+LA
Sbjct: 194 DLVEVLSFLFVLGTLELGKDYSIESLTNTQQAMLSDLRDYGLV-WQRKTSSKRFYPTRLA 252
Query: 280 TNLSMS-----------LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQL 328
T L+ S +S+ + FI++ETN+R+YAY+++ L IL LF ++ +
Sbjct: 253 TTLTSSAPPLLPTTESSSFTTSSDNKRFIILETNYRLYAYTSNPLQISILNLFVTLKARY 312
Query: 329 PNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDL 388
PNL++G IT++S+ +A NGITAEQII +L +AH ++ P +P V DQIRLWE +
Sbjct: 313 PNLVIGVITRDSIRSALSNGITAEQIIGYLTSHAHTQMHRNNPLLPVTVSDQIRLWELEK 372
Query: 389 NRVEMTPAHYYDEFPSR 405
NR++ Y EF S+
Sbjct: 373 NRLKADDGVLYAEFRSQ 389
>gi|171681598|ref|XP_001905742.1| hypothetical protein [Podospora anserina S mat+]
gi|170940758|emb|CAP66407.1| unnamed protein product [Podospora anserina S mat+]
Length = 540
Score = 167 bits (424), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 129/444 (29%), Positives = 213/444 (47%), Gaps = 76/444 (17%)
Query: 30 PFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF- 88
P + + P AK YV + Y + +E W P+G AI RL L +
Sbjct: 58 PSLRTTVTNGHAPTAKVYVQALLYSPTPLTTNDIELWTRPEGKGTSNRAIARLRSLHIAQ 117
Query: 89 ---SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPT-----LEDLEAY 140
S KK +L + F+ +LR L GG+ +PS LPT ++ L+ +
Sbjct: 118 LSQSTRAKKAQDIQLTANFKKSLRLALEGGGSHNSFGVPS----TLPTDPKIHIQYLDQW 173
Query: 141 AIGQWECFLLQLISST-------------QAERPTNFSSSMMKVFQRGLLSRRDKEAPRL 187
A W+ L +++S P ++K+ GL+ ++
Sbjct: 174 AGRIWQDILYYVVNSVPMKANESGGRHGSGGGGPKRAVRELLKMG--GLVREGAGGLVQI 231
Query: 188 TESGFQFLLMDTNAQLWYIVREYISNS-------QERG-------INQADLISFLLELSF 233
+E GF FLL + NAQ+W ++ ++ + +E+G I+ +++SFL L+
Sbjct: 232 SEHGFNFLLQEANAQVWTLLLLWLEAADRNKALAKEQGTDITGTAIDNVEMLSFLFMLAS 291
Query: 234 HVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL-------SMSL 286
G AY+ + L+E +K+M+ AD GL+ + + R + ++ PT+LAT L S+S
Sbjct: 292 LELGRAYDTSALTETRKNMLPALADFGLIYIDRDRPQQYY-PTRLATTLTSLSTMRSVSA 350
Query: 287 TDSSARKE---------------------GFIVVETNFRMYAYSTSKLHCEILRLFSKVE 325
+ +A K+ G IVVETN+R+YAY++S L IL+LF ++
Sbjct: 351 SIDAATKKTPGDAGSLGADSTPTAPADENGGIVVETNYRIYAYTSSPLQIAILKLFCRLH 410
Query: 326 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV----ADRMPSV-PENVCDQ 380
+ PN++ +T+ES+ A + GITA QII +L +AHP++ A R +V P V DQ
Sbjct: 411 MRFPNMVTARLTRESVQEAIKEGITANQIIDYLVAHAHPQMRRAAAARGTTVIPPTVMDQ 470
Query: 381 IRLWESDLNRVEMTPAHYYDEFPS 404
IRLW+ + R++ TP + +F S
Sbjct: 471 IRLWQLESQRMQKTPGFQFKDFES 494
>gi|212535856|ref|XP_002148084.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
gi|210070483|gb|EEA24573.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Talaromyces marneffei ATCC 18224]
Length = 422
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 98/304 (32%), Positives = 163/304 (53%), Gaps = 32/304 (10%)
Query: 133 TLEDLEAYAIGQWECFLLQLISSTQ--AERPTNFSSSMMKVFQRGLLSRRDKEAPRLTES 190
++ DL+ YA QWE L ++ ++ + S + ++ Q G L +T+
Sbjct: 75 SIADLDEYARRQWEGVLGYMVGTSGPGVRETVSLSRGVKQLLQAGHLVEIHHGRVDITQD 134
Query: 191 GFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQK 250
GF F+L D N Q+W I+ Y+ +++ +N +++SF+ LS G++Y L+ Q
Sbjct: 135 GFAFVLQDVNTQVWQIIILYVESAKALQVNDVEVLSFIFLLSSLELGQSYEKRHLTPDQL 194
Query: 251 SMIKDFADLGLVKLQQGRKE-NWFIPTKLATNL-------------SMSLTDSSARKE-- 294
+ D D G++ E + F PT+LAT L S+S ++A E
Sbjct: 195 RALADLTDFGIIYQHAPASEADRFYPTRLATTLTSDSSALSNTMSSSLSAQTNAATGEPG 254
Query: 295 -GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GITA+Q
Sbjct: 255 SGFIIIETNYRLYAYTSSPLQISLIALFTTLKYRFPNLVTGKITRQSIRRAVEMGITADQ 314
Query: 354 IISFLQQNAHPRVADR-----------MPS--VPENVCDQIRLWESDLNRVEMTPAHYYD 400
IIS+L +AHP++ +P +P V DQIRLW+ + +R++ TP +
Sbjct: 315 IISYLSTHAHPQMRKEGAVKSTSNTAGIPRSVLPPTVVDQIRLWQLERDRIKATPGFLFK 374
Query: 401 EFPS 404
EF S
Sbjct: 375 EFAS 378
>gi|326435722|gb|EGD81292.1| hypothetical protein PTSG_11329 [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 167 bits (423), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 200/388 (51%), Gaps = 19/388 (4%)
Query: 25 RLYENPFICEAILR-SLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
R+Y++ C + + L L+K+ V+++ Y+ A+ ++ W++P+ ++++L
Sbjct: 26 RMYKHAHACLGLFQLCLDGLSKQMVLRLLYLPSALSKAIVSTWIVPEQARLWTESLEQLQ 85
Query: 84 QLRLFSEEKKKETTYRLNSTFQSNLRKHLI-YGGALPREPMPSGITARLPTLEDLEAYAI 142
+L++ + + + F++ L L G + P P A + + L+ YA
Sbjct: 86 KLQILAVSG--DGAVAVAQPFKNGLIAALQGTGKERCQAPDPGKHAA---SKDFLQQYAD 140
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLS-RRDKEAPRLTESGFQFLLMDTNA 201
+WE L ++ S E T ++Q L++ R D + ++ GFQFLL D
Sbjct: 141 EKWEDVLDYMVGSDSLEEDT----VKQALYQSHLMTWREDVQDFDISSLGFQFLLQDRRT 196
Query: 202 QLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGL 261
QLW+ V +++ Q GI+ D + F+ +++ G Y L++ + +++ F+ +GL
Sbjct: 197 QLWFYVIKFLQTCQNEGIDIEDALGFVFRVAYTTFGNYYPTEVLNKTEHLVLRHFSKIGL 256
Query: 262 VKLQQGRKENWFIPTKLATNL--SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILR 319
+ +K + PT+LA L S +L A + GF+V+ETNFR+YAY+ S L +L+
Sbjct: 257 C-YHRKKKSKQYYPTRLALALRTSSALVAKDAMRCGFLVIETNFRIYAYTKSDLKISLLK 315
Query: 320 LFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPS----VPE 375
LF+K Y+LP ++V +IT+ S+ A GITA QI+ FL+ + R+ + S VP
Sbjct: 316 LFAKPLYKLPTMLVASITRSSVRQAMAQGITARQILHFLKVHCSSRMRHQPHSVFSPVPP 375
Query: 376 NVCDQIRLWESDLNRVEMTPAHYYDEFP 403
V DQI LWE + +R+ T + P
Sbjct: 376 TVSDQILLWEQETSRLSKTEGVLFRHLP 403
>gi|405952750|gb|EKC20525.1| General transcription factor IIH subunit 4 [Crassostrea gigas]
Length = 380
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 115/410 (28%), Positives = 188/410 (45%), Gaps = 93/410 (22%)
Query: 5 KIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMME 64
K+ K+ + +L + LD+LY +P C A+ R
Sbjct: 7 KLECKDLHGYLKTLPSTILDKLYNHPATCLAVFR-------------------------- 40
Query: 65 EWVLPDGFTKHRVAIDRLVQLRLFSEEKKKET--TYRLNSTFQSNLRKHLIYGG---ALP 119
+H+ A L +LR++ E++ + + LN TF+SN++ L+ GG
Sbjct: 41 ---------EHKFAAKVLSELRVWHEQQMQGGLLGWVLNGTFRSNMKVALLGGGMDSQFT 91
Query: 120 REPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSR 179
P+P A+ + L+ YA+ +WEC L ++ ST+ S ++ V L
Sbjct: 92 GGPLPPDKHAK--DVPFLDKYALERWECVLHFMVGSTEGTE--GVSKDIIDVLLNAGLMT 147
Query: 180 RD--KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAG 237
D P +T +GFQFLLMD +Q+WY + +Y+ + RG++ D +SFL +LSF G
Sbjct: 148 MDGVDPMPSITPAGFQFLLMDIGSQVWYFMLQYLDTVEARGMDLIDCLSFLFQLSFSTLG 207
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTD--SSARKEG 295
+ RK + PT+LA NL+ +D S + G
Sbjct: 208 K-----------------------------RKIQRYYPTRLAINLAAGQSDFTSVGKNTG 238
Query: 296 FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQII 355
+++VETN+R+YAY+ S L ++ LF ++ Y+ P VG +T+ S+ +A GITA+Q
Sbjct: 239 YLMVETNYRVYAYTDSPLQVALVALFCEMLYRFPTFSVGNLTRVSVRDALIRGITADQT- 297
Query: 356 SFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
P +P V DQ+RLWE + +R + YD+F S+
Sbjct: 298 ---------------PVIPSTVTDQVRLWELERDRFKFMEGVLYDQFLSQ 332
>gi|392901990|ref|NP_001255863.1| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
gi|26985915|emb|CAB70230.2| Protein Y73F8A.24, isoform a [Caenorhabditis elegans]
Length = 481
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 126/418 (30%), Positives = 213/418 (50%), Gaps = 44/418 (10%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA-------KMM 63
F+D + ++ + RL + P I R LPP+A++ +Q+ + G+ P K +
Sbjct: 23 FLDFLITIPPKTRQRLLQKPSCAFFIYRMLPPIAQQTTIQLIW-KGSFPTASDVEETKEI 81
Query: 64 EEWV--LPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGAL-PR 120
E+ V L D HR D + + +T Y+ + + + L I L P
Sbjct: 82 EDQVKLLKDLGLVHRQTTDGKLNI---------DTDYKRSYMYAAMLGAAQISSLVLEPN 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR 180
E G +D+E A+ +W+C +L+ ++ E S + +F++ +
Sbjct: 133 EGNRRG--------KDVEKKAVERWDC-ILRYLALPSEENTQAVSETTRNLFKKANFTSG 183
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLEL------SFH 234
+T GFQFLL+ Q+W V EY+ +G + ++I L+++ F
Sbjct: 184 GDTQIEITTFGFQFLLLSPVKQMWTYVIEYLKLEMSQGQDIVEVIEPLIQIVLLANRGFK 243
Query: 235 VAGEAYNLNT-LSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT--DSSA 291
E Y ++ +E Q ++ +LG++ +++ RK+ F T+L T+L+ + T D SA
Sbjct: 244 AERECYQIDANWTEPQNELLNHLRELGVIFIRK-RKDGVFFLTQLLTHLATNETIDDVSA 302
Query: 292 RK--EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
K G ++VETNFR+YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GI
Sbjct: 303 EKVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGI 362
Query: 350 TAEQIISFLQQNAHPR-VADRMP--SVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
TA QIISFL+ NAHP+ +A P +P V DQIRLWE + R+ + A+ Y F S
Sbjct: 363 TAAQIISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHFES 420
>gi|116201905|ref|XP_001226764.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
gi|88177355|gb|EAQ84823.1| hypothetical protein CHGG_08837 [Chaetomium globosum CBS 148.51]
Length = 503
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 124/435 (28%), Positives = 213/435 (48%), Gaps = 56/435 (12%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P AI R LPPLAK +V + Y+ + ++ WV +G + A+ L
Sbjct: 24 KLYQQPSTAFAIFRRMLPPLAKVFVQALLYMPSPLLLAELDVWVRAEGKMQRDRALSILR 83
Query: 84 QLRLFS---EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGI-TARLPTLEDLEA 139
L + K K +L + F+++LR L GG+ +PS + A + L+
Sbjct: 84 SLHIIQITPPGKDKPQEMQLTTNFKNSLRLALEGGGSHNSFGVPSTLPVAPRIDIAHLDG 143
Query: 140 YAIGQWECFLLQLISS--TQAERPTNFSSSMMKVFQRGLL-------SRRDKEAP-RLTE 189
YA +WE L +++S Q+E S K + LL R D +A +T+
Sbjct: 144 YARKKWEDILHYVVNSVPAQSEGGGGVSGGGPKTSVKELLLIGRLVERRHDTQAGVGITQ 203
Query: 190 SGFQFLLMDTNAQLWYIVREYI-----SNSQERG---INQADLISFLLELSFHVAGEAYN 241
+GF FLL + NAQ+W ++ ++ + + +G + D++SFL L+ G AYN
Sbjct: 204 AGFTFLLQEANAQVWTLLLLWLEAADRAKGEAKGAANTDSIDMLSFLFMLASLELGRAYN 263
Query: 242 LNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSM----------------- 284
L+E +++M+ D GL+ + + +F PT+LAT L+
Sbjct: 264 TGALTETRRNMLPHLVDFGLIFIPREDTRQYF-PTRLATTLTSSASSLRSVSSGFNAATA 322
Query: 285 ----------SLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG 334
+ ++SA +G +++ETN+R+YAY++S L +L LF+++ + + G
Sbjct: 323 NNPGDTSSLGTTPETSAASKGSVIIETNYRLYAYTSSPLQIAVLGLFTQLNMRFAGMATG 382
Query: 335 AITKESLYNAFENGITAEQIISFLQQNAHPRV-----ADRMPSVPENVCDQIRLWESDLN 389
+T++S+ A GITA+QIIS+L +AH ++ A P +P V DQIRLW+ +
Sbjct: 383 RLTRDSIRRAIGFGITADQIISYLASHAHDQMVRAAAATGRPVLPPTVVDQIRLWQLENE 442
Query: 390 RVEMTPAHYYDEFPS 404
R+ + + +F S
Sbjct: 443 RMRTSAGFLFKDFDS 457
>gi|302495702|ref|XP_003009866.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
gi|291173385|gb|EFE29221.1| hypothetical protein ARB_03925 [Arthroderma benhamiae CBS 112371]
Length = 386
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 163/303 (53%), Gaps = 34/303 (11%)
Query: 133 TLEDLEAYAIGQWECFLLQLI--SSTQAERPT-NFSSSMMKVFQR-GLLSRRDKEAPRLT 188
++ DL+ YA QWE L ++ SS R T S + ++ Q L+ RD+ +T
Sbjct: 36 SVADLDEYARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEVRDRRV-EIT 94
Query: 189 ESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEI 248
+ GF F+L D N Q+W+I+ Y+ N++ G+ +++SF+ LS G++Y L+
Sbjct: 95 KEGFAFVLQDVNTQVWHILILYVENAEAIGMESVEVLSFIFLLSSLELGQSYEKKDLTPT 154
Query: 249 QKSMIKDFADLGLV-KLQQGRKENWFIPTKLATNL---SMSLTD---------------- 288
Q + D D G+V + F PT+LAT L S++++
Sbjct: 155 QLRTLADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISGEPAVPTTTAGTT 214
Query: 289 -SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFEN 347
S GFI+VETN+R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E
Sbjct: 215 TSGDAGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEM 274
Query: 348 GITAEQIISFLQQNAHPRVADRMPS--------VPENVCDQIRLWESDLNRVEMTPAHYY 399
GITA+QIIS+L +AHP++ S +P V DQIRLW+ + +R++ T +
Sbjct: 275 GITADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLF 334
Query: 400 DEF 402
EF
Sbjct: 335 KEF 337
>gi|453232316|ref|NP_001263824.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
gi|412976517|emb|CCO25636.1| Protein Y73F8A.24, isoform c [Caenorhabditis elegans]
Length = 485
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 128/422 (30%), Positives = 215/422 (50%), Gaps = 48/422 (11%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA-------KMM 63
F+D + ++ + RL + P I R LPP+A++ +Q+ + G+ P K +
Sbjct: 23 FLDFLITIPPKTRQRLLQKPSCAFFIYRMLPPIAQQTTIQLIW-KGSFPTASDVEETKEI 81
Query: 64 EEWV--LPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGAL-PR 120
E+ V L D HR D + + +T Y+ + + + L I L P
Sbjct: 82 EDQVKLLKDLGLVHRQTTDGKLNI---------DTDYKRSYMYAAMLGAAQISSLVLEPN 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR 180
E G +D+E A+ +W+C +L+ ++ E S + +F++ +
Sbjct: 133 EGNRRG--------KDVEKKAVERWDC-ILRYLALPSEENTQAVSETTRNLFKKANFTSG 183
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSF-----HV 235
+T GFQFLL+ Q+W V EY+ +G + ++I L+++ V
Sbjct: 184 GDTQIEITTFGFQFLLLSPVKQMWTYVIEYLKLEMSQGQDIVEVIEPLIQIVLLANRVQV 243
Query: 236 AG-----EAYNLNT-LSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT-- 287
AG E Y ++ +E Q ++ +LG++ +++ RK+ F T+L T+L+ + T
Sbjct: 244 AGFKAERECYQIDANWTEPQNELLNHLRELGVIFIRK-RKDGVFFLTQLLTHLATNETID 302
Query: 288 DSSARK--EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAF 345
D SA K G ++VETNFR+YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A
Sbjct: 303 DVSAEKVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGAL 362
Query: 346 ENGITAEQIISFLQQNAHPR-VADRMP--SVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
++GITA QIISFL+ NAHP+ +A P +P V DQIRLWE + R+ + A+ Y F
Sbjct: 363 QHGITAAQIISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMNLKDAYIYSHF 422
Query: 403 PS 404
S
Sbjct: 423 ES 424
>gi|302667863|ref|XP_003025510.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
gi|291189624|gb|EFE44899.1| hypothetical protein TRV_00272 [Trichophyton verrucosum HKI 0517]
Length = 386
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 101/303 (33%), Positives = 163/303 (53%), Gaps = 34/303 (11%)
Query: 133 TLEDLEAYAIGQWECFLLQLI--SSTQAERPT-NFSSSMMKVFQR-GLLSRRDKEAPRLT 188
++ DL+ YA QWE L ++ SS R T S + ++ Q L+ RD+ +T
Sbjct: 36 SVADLDEYARQQWEAVLGYMVGMSSLSGGRETITLSKGVKQLLQACHLVEVRDRRV-EIT 94
Query: 189 ESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEI 248
+ GF F+L D N Q+W+I+ Y+ N++ G+ +++SF+ LS G++Y L+
Sbjct: 95 KEGFAFVLQDVNTQVWHILILYVENAEAIGMESVEVLSFVFLLSSLELGQSYEKKDLTPT 154
Query: 249 QKSMIKDFADLGLV-KLQQGRKENWFIPTKLATNL---SMSLTD---------------- 288
Q + D D G+V + F PT+LAT L S++++
Sbjct: 155 QLRTLADLTDFGIVYQHTPASGSTRFYPTRLATTLTSDSLAMSGPISGEPAVPTTTAGTT 214
Query: 289 -SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFEN 347
S GFI+VETN+R+YAY++S L ++ LF+ ++++ PNLI G IT++S+ A E
Sbjct: 215 TSGDAGTGFIIVETNYRLYAYTSSPLQISLISLFTTLKFRFPNLITGKITRQSVRRAIEM 274
Query: 348 GITAEQIISFLQQNAHPRVADRMPS--------VPENVCDQIRLWESDLNRVEMTPAHYY 399
GITA+QIIS+L +AHP++ S +P V DQIRLW+ + +R++ T +
Sbjct: 275 GITADQIISYLTTHAHPQMRKTKKSTSNVSSSVLPPTVVDQIRLWQLERDRIKATSGFLF 334
Query: 400 DEF 402
EF
Sbjct: 335 KEF 337
>gi|308460924|ref|XP_003092760.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
gi|308252519|gb|EFO96471.1| hypothetical protein CRE_20979 [Caenorhabditis remanei]
Length = 470
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 120/418 (28%), Positives = 218/418 (52%), Gaps = 46/418 (11%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
F+D + S+ ++ R+ + P C I R LPP+A++ +Q+ + G+ P D
Sbjct: 24 FLDFLISVPPKERQRMLQKPSCCFFIYRMLPPIAQQVTIQLIW-KGSFPK--------AD 74
Query: 71 GFTKHRVAIDRLVQLRLFSE----EKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSG 126
+ D Q++L + K+ Y +++ + ++ +Y L + S
Sbjct: 75 DLEVTKSIED---QVKLLEDLGIVSKQANGKYLIDNDY----KRSYMYAAMLGAASISSL 127
Query: 127 ITARLPTLED------LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR 180
+ L T ++ +E A+ +W+C +L+ ++ E S + +F++ +
Sbjct: 128 V---LETNDEKRRGKEVEKKAVERWDC-ILRYLALPSEENTQAVSETTRNLFKKANFTSG 183
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLEL------SFH 234
D +T SGFQFLL+ Q+W V EY+ +G + ++I L+++ F
Sbjct: 184 DSRI-EITTSGFQFLLLSPVKQMWTYVIEYLKLEMSQGQDIVEVIEPLIQIVLLANRGFK 242
Query: 235 VAGEAYNLN-TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT--DSSA 291
E Y ++ + S Q+ ++ +LG++ +++ RK+ F T L T+L+ + T D+SA
Sbjct: 243 AEKECYQIDPSWSIPQQELLNHLRELGVIFIRK-RKDGVFFLTHLLTHLATNETIDDTSA 301
Query: 292 RK--EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
+ G ++VETNFR+YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GI
Sbjct: 302 ERVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGI 361
Query: 350 TAEQIISFLQQNAHPRV---ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
TA QIISFL+ NAHP+ + + +P V DQIRLWE + R+++ A+ Y F S
Sbjct: 362 TAAQIISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERRRMDLKDAYIYSHFES 419
>gi|412992641|emb|CCO18621.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 165 bits (417), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 109/327 (33%), Positives = 162/327 (49%), Gaps = 63/327 (19%)
Query: 137 LEAYAIGQWECFLLQLISSTQAER----------------PTNFSSSMMKVFQRGLLSRR 180
L+ YA +WE LL+L S E+ + + K+F L +
Sbjct: 175 LDHYAKSRWETVLLELTSEASVEKKRKKKKMMKKKGDGGAGAQHALHLRKLFYAARLITK 234
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYI------------SNSQERGI--------- 219
D R+T GF FLL T+ Q+W ++ Y + ++E G
Sbjct: 235 DG---RITSEGFSFLLGATSEQIWILLARYARGGDFEKKMKNKTTNEENGEGDKMETNGE 291
Query: 220 ------------NQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLV--KLQ 265
+ A ++FL+ LSF G Y+ LSE ++ + + LG++
Sbjct: 292 DEKQQHKLKSDESSAAAMAFLVRLSFQHPGRKYSKANLSEAERRVASHLSALGVLYENED 351
Query: 266 QGRKENWFIPTKLATNLSMSLTDSSA-----RKEGFIVVETNFRMYAYSTSKLHCEILRL 320
NW++PT L+ LS T SSA R +G I+VETNFR+YAY+ S+L E+LRL
Sbjct: 352 DENDNNWYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLRL 411
Query: 321 FSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR--VADR--MPSVPEN 376
F++ +Y+LPN VG IT++S+ +A GI+ +QI+++LQ AHPR + R P+VP
Sbjct: 412 FTRPDYKLPNAYVGMITRDSILDAMRAGISPDQIVNYLQTRAHPRCTIGKRPNHPAVPPT 471
Query: 377 VCDQIRLWESDLNRVEMTPAHYYDEFP 403
VCDQIRLW DL RV+ Y +FP
Sbjct: 472 VCDQIRLWARDLYRVKADDCVMYTDFP 498
Score = 39.3 bits (90), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 42/80 (52%), Gaps = 16/80 (20%)
Query: 11 FMDMVASLTARDLDRLYE-NPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP 69
F++ A L+ ++ LY +P+ A+LR+L PLAK+Y+M+ + LP
Sbjct: 22 FVEFFAHLSEEKINHLYTISPWATLAVLRALTPLAKQYIMRCVF--------------LP 67
Query: 70 DGFTKH-RVAIDRLVQLRLF 88
+G + R + LVQLR F
Sbjct: 68 EGAKQRTRWRLKALVQLRAF 87
>gi|268534706|ref|XP_002632484.1| Hypothetical protein CBG13719 [Caenorhabditis briggsae]
Length = 482
Score = 164 bits (416), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 122/413 (29%), Positives = 213/413 (51%), Gaps = 36/413 (8%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
F+D + S+ +D R+ + P I R LP +A++ +Q+ + G+ P E
Sbjct: 28 FLDFLMSIKPKDRQRMLQKPSCSFFIYRMLPAIAQQVTIQLIW-KGSFPKADDVE----- 81
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITAR 130
TK +I+ V+L L K T +L T ++ ++ +Y L + S +
Sbjct: 82 -LTK---SIEEQVELLLKLGIAHKATDGKL--TIDNDYKRSYMYAAMLGAASISSLV--- 132
Query: 131 LPTLE------DLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEA 184
L T + D+E A+ +W+C +L+ ++ E S + +F++ + +
Sbjct: 133 LETNDEKRRGKDVEKKAVERWDC-ILRYLALPSEENTEAVSDTTKDLFRKANFTSGESRI 191
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLEL------SFHVAGE 238
+T GFQFLL+ Q+W V EY+ +G + ++I L+++ F E
Sbjct: 192 -EITTFGFQFLLLSPVKQMWTYVIEYLKLEISKGNDIVEVIEPLIQIVLLANRGFKAEKE 250
Query: 239 AYNLN-TLSEIQKSMIKDFADLGLVKLQQGRKENWFIP---TKLATNLSMSLTDSSARKE 294
Y ++ ++ Q ++ +LG++ +++ + +F+ T LATN ++ T S
Sbjct: 251 CYQIDEQWTQPQHDLLNHLRELGVIFIRKRKDGVFFLTHLLTHLATNETIDDTSSEKASN 310
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
G ++VETNFR+YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA QI
Sbjct: 311 GKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFTDMSVGMITRESVRGALQHGITAAQI 370
Query: 355 ISFLQQNAHPR-VADRMP--SVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
ISFL+ NAHP+ VA P +P V DQIRLWE + R+++ A+ Y F S
Sbjct: 371 ISFLRANAHPQCVATSGPVNCLPITVADQIRLWEDERRRMDLKDAYIYSHFES 423
>gi|320587960|gb|EFX00435.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Grosmannia clavigera kw1407]
Length = 518
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 125/449 (27%), Positives = 213/449 (47%), Gaps = 72/449 (16%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAID--- 80
RLY+ P AI R LP LAK +V + Y+DG + ++ V+P+ + A+
Sbjct: 25 RLYQQPSTTLAIFRRMLPSLAKSFVRMLLYMDGPLLLDTLDTMVVPEAKRERDQALSILR 84
Query: 81 --RLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPS--GITARLPTLED 136
+VQ+ ++ +E L + F+++ R L GG+ +PS ++A T
Sbjct: 85 ALHIVQITTAMRDRPQEIM--LTANFKNSFRLALEGGGSQNSFGVPSEKPVSAETDT-AF 141
Query: 137 LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-------SRRDKEAP-RLT 188
L+ +A +WE L +++S S K + LL SR + +T
Sbjct: 142 LDRFARRKWEDILHYVVNSVGLPSGPGHDSQGPKNTVKELLLAGHLVESRPGRPGGVGIT 201
Query: 189 ESGFQFLLMDTNAQLWYIVREYI-SNSQER----GINQADLISFLLELSFHVAGEAYNLN 243
++GF FLL NAQ+W ++ +++ + Q+R ++ D++SFL LS G AY+ N
Sbjct: 202 QTGFTFLLQAANAQVWTLLLQWLEAVDQQRQAGGAVDSVDMLSFLFMLSTLELGRAYDTN 261
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS-------------- 289
L+E +++M+ D GL+ + + + + PT+LAT L+ S +
Sbjct: 262 ALTEARRNMLPSLMDFGLIYIPPHDRASQYFPTRLATTLTSSGAAALRSLSTGFAAASAN 321
Query: 290 -----------------------------SARKEGFIVVETNFRMYAYSTSKLHCEILRL 320
+ G +++ETNFR+YAY+ S L +L L
Sbjct: 322 STAAAGLPGTGTGSNNNHVGAASTANSAATTEGRGSVILETNFRLYAYTASPLQIAVLAL 381
Query: 321 FSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV-----ADRMPSVPE 375
FSK+ + P ++ G +T++S+ A GIT++QIIS+L NAH ++ A+ P +P
Sbjct: 382 FSKLSQRFPGMVAGKLTRDSVRRAISYGITSDQIISYLAANAHEQMHKYAAANHRPVLPP 441
Query: 376 NVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
V DQIRLWE + R++ + +F S
Sbjct: 442 TVVDQIRLWELENERMKSHKGFLFKDFDS 470
>gi|452824378|gb|EME31381.1| transcription initiation factor TFIIH subunit H4 [Galdieria
sulphuraria]
Length = 740
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 126/421 (29%), Positives = 208/421 (49%), Gaps = 43/421 (10%)
Query: 20 ARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAI 79
A +++ LY++P+ I L L K + ++ +I I A + +W ++ + +
Sbjct: 19 ADEVETLYKSPYATSCIFEILSSLQKAILTRLLFIQ-EIDASTLLKWFQKQSQSQIQTIV 77
Query: 80 DRLVQLR-LFSEEKKKETTYRLNSTFQSNLRKHLIYGGALP---REPMPSG---ITARLP 132
+ LV LR L+ ++ +++N FQ +L+ ++ P +E G T++
Sbjct: 78 NHLVDLRILYRKQGSNSVQFQMNPKFQKSLQDLILRNVVSPFVRKEETDIGDHNTTSQQQ 137
Query: 133 TLED---LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTE 189
D L Y+ W L L+ E S +++ L + ++ R+T
Sbjct: 138 LSYDVTFLSQYSRHCWNKILQFLVGLEPLESEP--SVRIIEALVESGLIEQVEDGFRITT 195
Query: 190 SGFQFLLMDTNAQLWYIVREYI-SNS---------QERGINQADLISFLLELSFHVAGEA 239
GFQFLL DT Q+W+I+ + SN Q+R +N+ +++ FL ELSF +G A
Sbjct: 196 VGFQFLLKDTREQVWFILENILFSNETNLKIGTKLQKRVMNE-EMVEFLFELSFCASGVA 254
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQG---RKENWFIPTKLATNLSMS----------- 285
Y+ TLS+ Q+ I ADLGL+ L K+++F T L L+ S
Sbjct: 255 YHWKTLSKTQQIYIPYLADLGLIYLHISSSLEKDSYFYVTPLGVTLTRSICWLGDKNNYL 314
Query: 286 -LTD-SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYN 343
L D S+ + ++V+TNFR+Y Y+ +L LF + Y+LPN+ VG IT++S+
Sbjct: 315 LLEDWSNVDNDCRMIVQTNFRVYVYTNCTFQISLLSLFIQFLYRLPNMAVGVITRDSIRT 374
Query: 344 AFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFP 403
A NGITA+Q+IS+LQ + HP + ++P + DQIRLWE+ RV A D F
Sbjct: 375 ALNNGITAQQMISYLQNHMHPNMKGKLPIT---IIDQIRLWEAQRFRVTTKHALLLDHFD 431
Query: 404 S 404
+
Sbjct: 432 N 432
>gi|367030175|ref|XP_003664371.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
42464]
gi|347011641|gb|AEO59126.1| hypothetical protein MYCTH_2307126 [Myceliophthora thermophila ATCC
42464]
Length = 512
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 124/457 (27%), Positives = 217/457 (47%), Gaps = 67/457 (14%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + L +LY+ P AI R LPPLAK +V + Y+ + ++ WV P+G
Sbjct: 12 DYLEKLPGTTFRKLYQQPSTAFAIFRRMLPPLAKVFVQALLYMPSPMLLSELDVWVRPEG 71
Query: 72 FTKHRVAIDRLVQLRLFS---EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
+ A+ L L + K K +L + F+++LR L GG +PS +
Sbjct: 72 KMQRDRALSILRSLHIVQITPPGKDKPQEIQLTTNFRNSLRLALEGGGTHNSFGVPSTLP 131
Query: 129 ARLPTLED--LEAYAIGQWECFLLQLISSTQAERPTNF-------SSSMMKVFQRG-LLS 178
P ++ L++YA +WE L +++S A+ +S+ + G L+
Sbjct: 132 VD-PRIDIAYLDSYARRKWEDILHYVVNSVPAQGEGGGGGSVGGPKASVKDLLLIGRLVE 190
Query: 179 RRD--KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQE-----------------RGI 219
RR+ K +T++GF FLL + NAQ+W ++ ++ +
Sbjct: 191 RRNDTKAGVGITQAGFTFLLQEANAQVWTLLLLWLEAADRAKGAAEQGKGGGAAAAAAKA 250
Query: 220 NQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLA 279
+ D++SFL L+ G AY+ + LSE +++M+ D GL+ + + +F PT+LA
Sbjct: 251 DSIDMLSFLFMLASLELGRAYDTDALSETRRNMLPALVDFGLIYIPREDTRQYF-PTRLA 309
Query: 280 TNLSM---------------------------SLTDSSARKEGFIVVETNFRMYAYSTSK 312
T L+ + ++SA +G +++ETN+R+YAY++S
Sbjct: 310 TTLTSSASSLRSVSSGFSAAAANNPGDASSLGTTPETSAASKGSVIIETNYRLYAYTSSP 369
Query: 313 LHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV-----A 367
L +L LF+++ + + G +T++S+ A GITA+QIIS+L +AH ++ A
Sbjct: 370 LQIAVLALFTQLNMRFAGMATGRLTRDSIRRAIGFGITADQIISYLASHAHEQMVRAAAA 429
Query: 368 DRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
P +P V DQIRLW+ + R+ + + +F S
Sbjct: 430 TGRPVLPPTVVDQIRLWQLENERMRTSAGFLFKDFDS 466
>gi|324512881|gb|ADY45318.1| General transcription factor IIH subunit 4 [Ascaris suum]
Length = 481
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 130/434 (29%), Positives = 218/434 (50%), Gaps = 51/434 (11%)
Query: 14 MVASLTARD---LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
++ L RD LD LYE+P AI R+LP ++++ ++++ ++ + + + WV +
Sbjct: 8 LLGYLVKRDREQLDTLYESPGAVFAIFRALPDVSQQCILKVLWLQEGVQPSLWQSWVKHE 67
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT-- 128
+ D L +L + E K++ T LN F+ + + + G L R +T
Sbjct: 68 HSSLVENHFDLLRRLGII--ESKEQIT--LNPIFRKSYIRAVQMG--LYRASQLKAMTDL 121
Query: 129 ---ARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRDKEA 184
+R +DL A +WEC L L +Q + S + ++F+ GL S + E
Sbjct: 122 DEKSRKSASKDLGKKATERWECILHYLALPSQ-KSEQGVSGATKQLFRAAGLTSGGESEG 180
Query: 185 P-RLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSF-----HVAGE 238
+T +GFQFLL++ Q+W + Y+ + G N + FLL+L+ H G
Sbjct: 181 DMEITSAGFQFLLLNQTEQIWMYILHYLRLEESMGKNVMAELDFLLKLTLCVDHTHTNGR 240
Query: 239 ---------------AYNLN-TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL 282
A+ ++ + SE + + +LGLV +++ RK+ +F T L +L
Sbjct: 241 LRVVDGDREKKATWRAFFIDESWSETITNFLMHLRELGLVFIRK-RKDGYFFITPLFAHL 299
Query: 283 SMSLTDSSARKE-----GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAIT 337
++S + +A E G+I+VETN+R+YAY+ S L IL F+++ Y+ ++ VG +T
Sbjct: 300 TVSSSSETAVAEKRTHSGYIIVETNYRVYAYTDSSLQLAILSTFTEMLYRFNDMSVGMLT 359
Query: 338 KESLYNAFENGITAEQIISFLQQNAHPRV---ADRMPS----VPENVCDQIRLWESDLNR 390
++S+ A + GITA QIISFL+ NAHP A+ P VP V DQIRLWE + +R
Sbjct: 360 RDSVRRALQVGITASQIISFLRANAHPETVAAANASPGVIHCVPVTVADQIRLWEDERHR 419
Query: 391 VEMTPAHYYDEFPS 404
+ + Y F S
Sbjct: 420 LVFFDSALYSTFES 433
>gi|384484453|gb|EIE76633.1| hypothetical protein RO3G_01337 [Rhizopus delemar RA 99-880]
Length = 284
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 89/239 (37%), Positives = 139/239 (58%), Gaps = 38/239 (15%)
Query: 186 RLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTL 245
++T GFQFLL D N Q+W + +Y+ + A+L S LEL GE Y+++TL
Sbjct: 12 QITNKGFQFLLQDVNTQVWAFLLQYL--------DMAELGS--LEL-----GENYSVDTL 56
Query: 246 SEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS-------------------- 285
++ Q M++D D G+V Q+ + + PT+LAT L+
Sbjct: 57 TQTQLQMLEDLRDYGIV-YQRKKHSKRYYPTRLATTLTSGKSALATVAGKYNHMMQETNI 115
Query: 286 --LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYN 343
TD+ + +GFI++ETN+++YAY+ S L +L LF +++ + N++ G IT++S+ N
Sbjct: 116 DDTTDTESVDQGFIILETNYKLYAYTDSPLQIAVLNLFVQLQSRFRNMVTGVITRDSIRN 175
Query: 344 AFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
A GITAEQII +LQ +AHP++ P +P V DQIRLWE + NR++ TP++ Y EF
Sbjct: 176 ALMKGITAEQIIYYLQSHAHPQMRKETPVLPLTVVDQIRLWEMERNRLKPTPSYLYHEF 234
>gi|340939354|gb|EGS19976.1| RNA polymerase II transcription factor B subunit 2-like protein
[Chaetomium thermophilum var. thermophilum DSM 1495]
Length = 514
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 126/456 (27%), Positives = 210/456 (46%), Gaps = 69/456 (15%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + L +LY+ P AI R LPPLAK +V + Y+ + ++ WV P+
Sbjct: 14 DYLEKLPGTTFRKLYQQPSSAFAIFRRMLPPLAKVFVQALLYMPQPMLLSDLDVWVRPEA 73
Query: 72 FTKHRVAIDRLVQLRLFS---EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT 128
A+ L L + K + +L + F+++LR L G A +PS +
Sbjct: 74 KMHRDRALSILRSLHIVQITPPGKDRPQEVQLTTNFRNSLRLALEGGAAHNSFGVPSSLP 133
Query: 129 ARLPTLED--LEAYAIGQWECFLLQLISSTQAERPTNF--------SSSMMKVFQRGLLS 178
P ++ L+ YA +WE L ++SS +S+ + G L
Sbjct: 134 VD-PRIDIAFLDNYARKKWEDILHYVVSSVPVHGDAGPGGMGGGGPKASVKDLLLAGRLV 192
Query: 179 RR---DKEAPRLTESGFQFLLMDTNAQLWYIVREYI------------------SNSQER 217
R K +T++GF FLL + NAQ+W ++ ++ N+
Sbjct: 193 ERRPDTKTGIGITQAGFTFLLQEANAQVWTLLLLWLEAADQAKAAAAAASGVDPKNAPPT 252
Query: 218 GINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTK 277
+ +++SFL L+ G AY+ + LSE +++M+ D GL+ + + +F PT+
Sbjct: 253 KPDSIEMLSFLFMLASLELGRAYDTDALSETRRNMLPALVDFGLIYIPREDTRQYF-PTR 311
Query: 278 LATNLSMSLT--------------------------DSSARKEGFIVVETNFRMYAYSTS 311
LAT L+ S + D SA K G I++ETN+R+YAY++S
Sbjct: 312 LATTLTSSASALRSVSSGFTAATNNTANDASSLGGADPSAHK-GSIIIETNYRLYAYTSS 370
Query: 312 KLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV----- 366
L +L LF+ + + ++ G +T+ES+ A GITA+QIIS+L +AH ++
Sbjct: 371 PLQIAVLALFTHLNMRFAGMVTGRLTRESIRRAISFGITADQIISYLASHAHEQMVRAAA 430
Query: 367 ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
A P +P V DQIRLW+ + R+ +P + +F
Sbjct: 431 AAGRPVLPPTVVDQIRLWQLENERMRTSPGFLFKDF 466
>gi|310799661|gb|EFQ34554.1| transcription factor tfb2 [Glomerella graminicola M1.001]
Length = 485
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 131/431 (30%), Positives = 216/431 (50%), Gaps = 44/431 (10%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRSL-PPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + L +LY P AI R + PPLAK VM + Y+ + + ++ WV P+
Sbjct: 13 DYLEKLPGTTFRKLYLQPSTAFAIFRRMIPPLAKTIVMAILYMPKPMLLEDLDVWVKPES 72
Query: 72 FTKHRVAIDRL-----VQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSG 126
+ AI L +Q+ + S+E+ +E +L + F+++LR L GG +PS
Sbjct: 73 RRQKDQAISTLRSLHILQITVPSKERPQEM--QLTTNFKNSLRLALEGGGTHNSFGVPSS 130
Query: 127 ITARLP---TLEDLEAYAIGQWECFLLQLISSTQ--AERPTNFSSSMMKVFQRGLLSRRD 181
+ A P T+ L+ YA +WE L ++++ SS+ + G L RR
Sbjct: 131 LPA--PPEITVPFLDRYARRKWEDILHYIVNTVNPGGADLGGPKSSVKNLLVAGQLVRRQ 188
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQER----GINQADLISFLLELSFHVAG 237
A +T++GF FLL + NAQ+W ++ ++ + G+ D++SFL L+ G
Sbjct: 189 GSAVGITQAGFTFLLQEANAQVWTLLLLWLEATDHAEDAAGMESTDMLSFLFLLASLELG 248
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL--------------S 283
Y+ N L+E +++M+ D GL+ + + + +F PT+LAT L S
Sbjct: 249 RPYDTNALTEARRNMLPSLLDFGLIYIPSHKPQQYF-PTRLATTLTSSSSALSYAHCRDS 307
Query: 284 MSLTDSSA-----RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITK 338
+ S A R F + ETN+R+YAY + L +L LF K++ + +++ G +T+
Sbjct: 308 ETFFQSYASHLMIRVRPFRLSETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTR 367
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRV-----ADRMPSVPENVCDQIRLWESDLNRVEM 393
S+ NA E GITA+QIIS+L +AH ++ A P +P V DQIRLW+ + R+
Sbjct: 368 NSIRNAVERGITADQIISYLAAHAHEQMHRMAAARSRPILPPTVIDQIRLWQLETERMTT 427
Query: 394 TPAHYYDEFPS 404
T + +F S
Sbjct: 428 TSGFLFRDFDS 438
>gi|323306981|gb|EGA60265.1| Tfb2p [Saccharomyces cerevisiae FostersO]
Length = 448
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/420 (27%), Positives = 202/420 (48%), Gaps = 71/420 (16%)
Query: 51 MFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKET-TYRLNSTFQSNLR 109
M + + +P +++WV +G + + AI + L L K T LN TF+ +LR
Sbjct: 1 MVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMKSLHLLIPNKSSGTLMINLNPTFKISLR 60
Query: 110 KHLIYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMM 169
L GG + + + + +L+ L+ Y+ +WE L ++ + A+ P+ +++
Sbjct: 61 NALT-GGEV-QNSFGVVVEENVVSLDLLDEYSANKWETILHFMVGTPLAKIPSEKVLNLL 118
Query: 170 KVFQRGLLSRRDKEAP-RLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFL 228
K L+ + ++T GFQFLL + N+QLW ++ +Y+ + ++ D++ F+
Sbjct: 119 K--HSKLMEEVNSTGEFKITNEGFQFLLQEINSQLWTLLLQYLKMIETSKMDLVDVLHFI 176
Query: 229 LELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS----- 283
L G+AY ++ LSE Q+ M++D D GLV Q+ ++ F PTKLA L+
Sbjct: 177 FMLGALEVGKAYKIDALSETQRIMLQDMRDYGLV-FQKHSNDSIFYPTKLALMLTSDTKT 235
Query: 284 ----MSLTDSSARK-------------------------------------EGFIVVETN 302
+ DS R+ +G ++VETN
Sbjct: 236 IRSASNAMDSVLRQNREEPSVNEDGANGKSTTDITTSDDLNKAGLKNQDIPDGSLIVETN 295
Query: 303 FRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 362
F++Y+YS S L +L LF ++ + N+++G IT+ES+ A NGITA+QII++L+ +A
Sbjct: 296 FKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQIIAYLETHA 355
Query: 363 HPRV------------------ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
HP++ + + +P V DQIRLW+ +L+RV Y +F +
Sbjct: 356 HPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYEGSLYSDFET 415
>gi|312069992|ref|XP_003137940.1| transcription factor Tfb2 family protein [Loa loa]
Length = 470
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 121/420 (28%), Positives = 205/420 (48%), Gaps = 34/420 (8%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
++ ++S LD+LY +P + AI R LP L+++++++ ++ G W PD
Sbjct: 9 LLEYLSSRAIDQLDKLYSSPAVSFAIFRLLPELSQQFILKTVWLTGTNRKCEWHAWSAPD 68
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT-- 128
T I L +L++ +++ LN ++ + K G L + S +T
Sbjct: 69 METAVMNQISGLRKLKIIVGDEE----VMLNPVYRRSYLKATRIG--LCKASELSAVTDC 122
Query: 129 ---ARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL--SRRDKE 183
R +DL A+ +WEC L L +Q + S + ++F+ L S D
Sbjct: 123 DEKTRKSANKDLGKKAVERWECILHYLALPSQ-KSEQGVSGTTKRLFRSAGLTSSENDDG 181
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFH------VAG 237
+T +GFQFLL++ + Q+W + Y + GI+ + FL ++
Sbjct: 182 DIEITSAGFQFLLLNRSEQIWTYLLYYFHMQEVAGIDIIKELDFLFRVTNGSDKERPAGS 241
Query: 238 EAYNLN-TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL-SMSLTDSSA---- 291
A+ ++ E K + +LGLV +++ RK+ +F T L +L S+S +
Sbjct: 242 RAFLIDECWPEPTKDFLMHLRELGLVFIRK-RKDGFFFLTPLLNHLTSISYASETGIENR 300
Query: 292 RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITA 351
+ GF++VETN+R+YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GITA
Sbjct: 301 NQNGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITA 360
Query: 352 EQIISFLQQNAHPRV-------ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
QII+FL+ NAHP + SVP V DQIRLWE + R+ A Y F S
Sbjct: 361 AQIIAFLRANAHPTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYSSFES 420
>gi|367040709|ref|XP_003650735.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
gi|346997996|gb|AEO64399.1| hypothetical protein THITE_2110518 [Thielavia terrestris NRRL 8126]
Length = 513
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 124/459 (27%), Positives = 215/459 (46%), Gaps = 70/459 (15%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + L +LY+ P AI R LPPLAK +V + Y+ + ++ WV P+
Sbjct: 12 DYLEKLPGTTFRKLYQQPSTAFAIFRRMLPPLAKIFVQALLYMPSPLLLSELDVWVRPEA 71
Query: 72 FTKHRVAIDRLVQLRLFS---EEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSG-- 126
A+ L L + K K +L + F+++LR L GG +PS
Sbjct: 72 KMHRDRALSILRSLHIVQISPPGKDKPQEIQLTTNFKNSLRLALEGGGTHNSFGVPSTLP 131
Query: 127 ITARLPTLEDLEAYAIGQWECFLLQLISST--QAERPTNF------SSSMMKVFQRGLLS 178
+ R+ + L+ YA +WE L +++S AE +S+ + G L
Sbjct: 132 VDPRI-DIAHLDQYARKKWEDILHYVVNSVPVHAEGGGGGGSGGGPKASVKDLLLAGRLV 190
Query: 179 RR---DKEAPRLTESGFQFLLMDTNAQLWYIVREYI-------------------SNSQE 216
R + +T++GF FLL + NAQ+W ++ ++ + SQ
Sbjct: 191 ERRPDTRGGVAITQAGFTFLLQEANAQVWTLLLLWLEAADHSKAAAAAAGDHAKGAASQG 250
Query: 217 RGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPT 276
+ + D++SFL L+ G AY+ ++L+E +++M+ D GL+ + + +F PT
Sbjct: 251 KP-DSIDMLSFLFMLASLELGRAYDTDSLTETRRNMLPALVDFGLIYIPREDTRQYF-PT 308
Query: 277 KLATNL--------------------------SMSLTDSSARKEGFIVVETNFRMYAYST 310
+LAT L S+ +T + +G +++ETN+R+YAY++
Sbjct: 309 RLATTLTSSASALRSVSSGFSAAAANNPGDAASLGMTPDTTPTKGSLIIETNYRLYAYTS 368
Query: 311 SKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV---- 366
S L +L LF+++ + ++ G +T++S+ A GITA+QIIS+L +AH ++
Sbjct: 369 SPLQIAVLALFTQLNMRFAGMVTGRLTRDSIRRAISFGITADQIISYLASHAHEQMTRAA 428
Query: 367 -ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
A P +P V DQIRLW+ + R+ T + +F S
Sbjct: 429 AAAGRPVLPPTVVDQIRLWQLENERMRTTAGFLFKDFDS 467
>gi|393910938|gb|EFO26137.2| transcription factor Tfb2 family protein [Loa loa]
Length = 486
Score = 158 bits (400), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 125/433 (28%), Positives = 207/433 (47%), Gaps = 47/433 (10%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
++ ++S LD+LY +P + AI R LP L+++++++ ++ G W PD
Sbjct: 12 LLEYLSSRAIDQLDKLYSSPAVSFAIFRLLPELSQQFILKTVWLTGTNRKCEWHAWSAPD 71
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT-- 128
T I L +L++ +++ LN ++ + K G L + S +T
Sbjct: 72 METAVMNQISGLRKLKIIVGDEE----VMLNPVYRRSYLKATRIG--LCKASELSAVTDC 125
Query: 129 ---ARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL--SRRDKE 183
R +DL A+ +WEC L L +Q + S + ++F+ L S D
Sbjct: 126 DEKTRKSANKDLGKKAVERWECILHYLALPSQ-KSEQGVSGTTKRLFRSAGLTSSENDDG 184
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFH---VAGEAY 240
+T +GFQFLL++ + Q+W + Y + GI+ + FL L+ + +G
Sbjct: 185 DIEITSAGFQFLLLNRSEQIWTYLLYYFHMQEVAGIDIIKELDFLFRLTLYSGDTSGGTR 244
Query: 241 NLNT-------------LSEIQKSMIKDF----ADLGLVKLQQGRKENWFIPTKLATNL- 282
N + E KDF +LGLV +++ RK+ +F T L +L
Sbjct: 245 VTNGSDKERPAGSRAFLIDECWPEPTKDFLMHLRELGLVFIRK-RKDGFFFLTPLLNHLT 303
Query: 283 SMSLTDSSA----RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITK 338
S+S + + GF++VETN+R+YAY+ S L IL F+++ Y+ ++ VG +++
Sbjct: 304 SISYASETGIENRNQNGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSR 363
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRV-------ADRMPSVPENVCDQIRLWESDLNRV 391
E++ AF+ GITA QII+FL+ NAHP + SVP V DQIRLWE + R+
Sbjct: 364 EAVRRAFQVGITAAQIIAFLRANAHPTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRL 423
Query: 392 EMTPAHYYDEFPS 404
A Y F S
Sbjct: 424 MFCNAAVYSSFES 436
>gi|308452336|ref|XP_003089005.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
gi|308243592|gb|EFO87544.1| hypothetical protein CRE_21072 [Caenorhabditis remanei]
Length = 468
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 119/418 (28%), Positives = 217/418 (51%), Gaps = 48/418 (11%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
F+D + S+ ++ R+ + P I R LPP+A++ +Q+ + G+ P D
Sbjct: 23 FLDFLISVPPKERQRMLQKPSFF--IYRMLPPIAQQVTIQLIW-KGSFPK--------AD 71
Query: 71 GFTKHRVAIDRLVQLRLFSE----EKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSG 126
+ D Q++L + K+ Y +++ + ++ +Y L + S
Sbjct: 72 DLEVTKSIED---QVKLLEDLGIVSKQANGKYLIDNDY----KRSYMYAAMLGAASISSL 124
Query: 127 ITARLPTLED------LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR 180
+ L T ++ +E A+ +W+C +L+ ++ E S + +F++ +
Sbjct: 125 V---LETNDEKRRGKEVEKKAVERWDC-ILRYLALPSEENTQAVSETTRNLFKKANFTSG 180
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLEL------SFH 234
D +T SGFQFLL+ Q+W V EY+ +G + ++I L+++ F
Sbjct: 181 DSRI-EITTSGFQFLLLSPVKQMWTYVIEYLKLEMSQGQDIVEVIEPLIQIVLLANRGFK 239
Query: 235 VAGEAYNLN-TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT--DSSA 291
E Y ++ + S Q+ ++ +LG++ +++ RK+ F T L T+L+ + T D+SA
Sbjct: 240 AEKECYQIDPSWSIPQQELLNHLRELGVIFIRK-RKDGVFFLTHLLTHLATNETIDDTSA 298
Query: 292 RK--EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
+ G ++VETNFR+YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GI
Sbjct: 299 ERVSNGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGI 358
Query: 350 TAEQIISFLQQNAHPR---VADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
TA QIISFL+ NAHP+ + + +P V DQIRLWE + R+++ A+ Y F S
Sbjct: 359 TAAQIISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERRRMDLKDAYIYSHFES 416
>gi|341886092|gb|EGT42027.1| hypothetical protein CAEBREN_00374 [Caenorhabditis brenneri]
Length = 460
Score = 157 bits (398), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 119/414 (28%), Positives = 210/414 (50%), Gaps = 38/414 (9%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
F+D + ++ R+ + P I R LPP+A++ +Q+ + G+ P D
Sbjct: 12 FLDFLITIPPNQRQRMLQKPSCAFFIYRMLPPIAQQVTIQLIW-KGSFPQ--------AD 62
Query: 71 GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITAR 130
R D Q++L E + ++ +Y L + S +
Sbjct: 63 DVEVTRQIED---QVKLLEELGVVHRQSDGKLAIDHDYKRSYMYAAMLGAASISSLV--- 116
Query: 131 LPTLED------LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEA 184
L T E+ +E A+ +W+C +L+ ++ E S + +F+ + D
Sbjct: 117 LETNEEKRRGKEVEKKAVERWDC-ILRYLALPSEENTQAVSETTRNLFKMANFTSGDSRI 175
Query: 185 PRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLEL------SFHVAGE 238
+T SGFQFLL+ Q+W V EY+ +G + ++I L+++ + E
Sbjct: 176 -EITTSGFQFLLLSPVKQMWTYVIEYLKLEMSQGHDIVEVIEPLIQIVLLANRGYKAEKE 234
Query: 239 AYNLN-TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT--DSSARK-- 293
Y ++ + + Q+ ++ +LG++ +++ RK+ F T+L T+L+ + T D+SA K
Sbjct: 235 CYQIDPSWTGPQQELLNHLRELGVIFIRK-RKDGVFFLTQLLTHLATNETIDDTSAEKVS 293
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
G ++VETNFR+YAY++S L I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA Q
Sbjct: 294 NGKVIVETNFRVYAYTSSLLQLAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQ 353
Query: 354 IISFLQQNAHPR-VADRMP--SVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
IISFL+ NAHP+ +A P +P V DQIRLWE + R+++ ++ Y F S
Sbjct: 354 IISFLRANAHPQCIATSGPVNCLPITVADQIRLWEDERRRMDLKDSYIYSNFES 407
>gi|425777809|gb|EKV15965.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Penicillium digitatum PHI26]
gi|425782577|gb|EKV20476.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Penicillium digitatum Pd1]
Length = 382
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/306 (33%), Positives = 168/306 (54%), Gaps = 36/306 (11%)
Query: 131 LPTLEDLEAYAIGQWECFLLQLI--SSTQAER-PTNFSSSMMKVFQRGLLSRRDKEAPRL 187
L ++ DL+ YA QWE + ++ S+ +R N S + ++ Q G L +
Sbjct: 34 LTSIADLDEYARRQWEGVMGYMVGTSALSGQRDAVNLSKGVKQLLQAGHLVEIRGNRVDI 93
Query: 188 TESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSE 247
T+ GF F+L D N Q+W+I+ Y+ +++ G++ +++SFL LS G +Y+ + ++
Sbjct: 94 TKDGFGFVLQDVNTQVWHILILYVESAEAIGMDSVEVLSFLFLLSSLELGSSYDKSHMTS 153
Query: 248 IQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA---------------- 291
Q + D AD G+V Q+ F PT+LAT L+ +DSSA
Sbjct: 154 NQLRTLADLADFGIV-YQEDADATHFYPTRLATTLT---SDSSALSNPVTGSLTGPVGPS 209
Query: 292 --RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGI 349
GFI++ETN+R+YAY++S L ++ LF+ ++Y+ PNL+ G IT++S+ A E GI
Sbjct: 210 GGSGSGFIIIETNYRLYAYTSSPLQISLISLFTNLKYRFPNLVTGKITRQSVRRAIEMGI 269
Query: 350 TAEQIISFLQQNAHPRV---------ADRMPS--VPENVCDQIRLWESDLNRVEMTPAHY 398
TA+QIIS+L +AHP++ +P+ +P V DQIRLW+ + +R+ T
Sbjct: 270 TADQIISYLLSHAHPQLRKHSAAQSNGKGIPASVLPPTVTDQIRLWQLERDRLRATAGFL 329
Query: 399 YDEFPS 404
+ +F S
Sbjct: 330 FKDFTS 335
>gi|392901992|ref|NP_001255864.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
gi|290457493|emb|CBK19514.1| Protein Y73F8A.24, isoform b [Caenorhabditis elegans]
Length = 431
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/389 (30%), Positives = 200/389 (51%), Gaps = 44/389 (11%)
Query: 40 LPPLAKKYVMQMFYIDGAIPA-------KMMEEWV--LPDGFTKHRVAIDRLVQLRLFSE 90
LPP+A++ +Q+ + G+ P K +E+ V L D HR D + +
Sbjct: 2 LPPIAQQTTIQLIW-KGSFPTASDVEETKEIEDQVKLLKDLGLVHRQTTDGKLNI----- 55
Query: 91 EKKKETTYRLNSTFQSNLRKHLIYGGAL-PREPMPSGITARLPTLEDLEAYAIGQWECFL 149
+T Y+ + + + L I L P E G +D+E A+ +W+C +
Sbjct: 56 ----DTDYKRSYMYAAMLGAAQISSLVLEPNEGNRRG--------KDVEKKAVERWDC-I 102
Query: 150 LQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVRE 209
L+ ++ E S + +F++ + +T GFQFLL+ Q+W V E
Sbjct: 103 LRYLALPSEENTQAVSETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVIE 162
Query: 210 YISNSQERGINQADLISFLLEL------SFHVAGEAYNLNT-LSEIQKSMIKDFADLGLV 262
Y+ +G + ++I L+++ F E Y ++ +E Q ++ +LG++
Sbjct: 163 YLKLEMSQGQDIVEVIEPLIQIVLLANRGFKAERECYQIDANWTEPQNELLNHLRELGVI 222
Query: 263 KLQQGRKENWFIPTKLATNLSMSLT--DSSARK--EGFIVVETNFRMYAYSTSKLHCEIL 318
+++ RK+ F T+L T+L+ + T D SA K G ++VETNFR+YAY++S L I+
Sbjct: 223 FIRK-RKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVYAYTSSLLQLAII 281
Query: 319 RLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR-VADRMP--SVPE 375
LF+++ Y+ ++ VG IT+ES+ A ++GITA QIISFL+ NAHP+ +A P +P
Sbjct: 282 ALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVNCLPI 341
Query: 376 NVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
V DQIRLWE + R+ + A+ Y F S
Sbjct: 342 TVADQIRLWEDERRRMNLKDAYIYSHFES 370
>gi|453232318|ref|NP_001263825.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
gi|412976516|emb|CCO25635.1| Protein Y73F8A.24, isoform d [Caenorhabditis elegans]
Length = 435
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 121/393 (30%), Positives = 202/393 (51%), Gaps = 48/393 (12%)
Query: 40 LPPLAKKYVMQMFYIDGAIPA-------KMMEEWV--LPDGFTKHRVAIDRLVQLRLFSE 90
LPP+A++ +Q+ + G+ P K +E+ V L D HR D + +
Sbjct: 2 LPPIAQQTTIQLIW-KGSFPTASDVEETKEIEDQVKLLKDLGLVHRQTTDGKLNI----- 55
Query: 91 EKKKETTYRLNSTFQSNLRKHLIYGGAL-PREPMPSGITARLPTLEDLEAYAIGQWECFL 149
+T Y+ + + + L I L P E G +D+E A+ +W+C +
Sbjct: 56 ----DTDYKRSYMYAAMLGAAQISSLVLEPNEGNRRG--------KDVEKKAVERWDC-I 102
Query: 150 LQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVRE 209
L+ ++ E S + +F++ + +T GFQFLL+ Q+W V E
Sbjct: 103 LRYLALPSEENTQAVSETTRNLFKKANFTSGGDTQIEITTFGFQFLLLSPVKQMWTYVIE 162
Query: 210 YISNSQERGINQADLISFLLELSF-----HVAG-----EAYNLNT-LSEIQKSMIKDFAD 258
Y+ +G + ++I L+++ VAG E Y ++ +E Q ++ +
Sbjct: 163 YLKLEMSQGQDIVEVIEPLIQIVLLANRVQVAGFKAERECYQIDANWTEPQNELLNHLRE 222
Query: 259 LGLVKLQQGRKENWFIPTKLATNLSMSLT--DSSARK--EGFIVVETNFRMYAYSTSKLH 314
LG++ +++ RK+ F T+L T+L+ + T D SA K G ++VETNFR+YAY++S L
Sbjct: 223 LGVIFIRK-RKDGVFFLTQLLTHLATNETIDDVSAEKVSNGKVIVETNFRVYAYTSSLLQ 281
Query: 315 CEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR-VADRMP-- 371
I+ LF+++ Y+ ++ VG IT+ES+ A ++GITA QIISFL+ NAHP+ +A P
Sbjct: 282 LAIIALFTEMTYRFQDMSVGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGPVN 341
Query: 372 SVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+P V DQIRLWE + R+ + A+ Y F S
Sbjct: 342 CLPITVADQIRLWEDERRRMNLKDAYIYSHFES 374
>gi|358333989|dbj|GAA34589.2| transcription initiation factor TFIIH subunit 4 [Clonorchis
sinensis]
Length = 456
Score = 154 bits (388), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 119/414 (28%), Positives = 188/414 (45%), Gaps = 61/414 (14%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
LD L+ +P C I R LP LAK +V ++ +++ IP ++ WV + A L
Sbjct: 21 LDELFSHPPTCLVIFRELPELAKHFVARLLFVEQPIPKSIVSGWVEKGSQALLQDACRAL 80
Query: 83 VQLRLFS--EEKKKETTYRLNSTFQSNLRKHLIYGGA-LPREPMPSGITARLPTLEDLEA 139
LR++ + + L+ +Q +LR L GG L + + + ++ LE
Sbjct: 81 SDLRIWHSVDGNVARGAWCLSKKYQESLRISLFGGGKPLLGDLGSTAVDKYSKDVDFLET 140
Query: 140 YAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMD 198
YA +W+ L ++ S +E + ++ L+ + P +T+ GF FLLM
Sbjct: 141 YAAERWDALLHFMVGSESSEVGSVVQDVLL---LSNLMKGGTPDTPVGITKHGFHFLLMS 197
Query: 199 TNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFAD 258
Q+ + Y +L ++Y ++ L+ Q+ +++ +
Sbjct: 198 RPDQVRLFLLHYFD--------------YL---------KSYPVDALTAPQQEVLQQMRE 234
Query: 259 LGLVKLQQGRKENWFIPTKLATNLSMSL------------------------------TD 288
LGL Q+ R F T LAT LS S T
Sbjct: 235 LGLA-YQRKRTAPRFYVTPLATVLSGSRCHQPAMSSGSVLSAIPTGVSHLEGTDSVQPTA 293
Query: 289 SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENG 348
SS G+I++ETNFR+YAY+ S L +L LFSK+ + PNL+V IT++S+ A G
Sbjct: 294 SSTSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 353
Query: 349 ITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
ITA+QIISFL NAHP + P +P + DQIRLWE + +R Y++F
Sbjct: 354 ITADQIISFLTTNAHPDMLREPPILPPTLVDQIRLWELERDRFVFQEGCLYEQF 407
>gi|401887601|gb|EJT51582.1| hypothetical protein A1Q1_07170 [Trichosporon asahii var. asahii
CBS 2479]
gi|406699646|gb|EKD02845.1| hypothetical protein A1Q2_02789 [Trichosporon asahii var. asahii
CBS 8904]
Length = 451
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/342 (32%), Positives = 173/342 (50%), Gaps = 29/342 (8%)
Query: 76 RVAIDR-LVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITAR--LP 132
R A DR + Q +F +K + + LN TF+ +R L G +P AR P
Sbjct: 47 RAARDRQIFQPLVFKHQKFQ---WSLNDTFKRGMRNALTGLGTSNSFGVPYKRQARDPAP 103
Query: 133 TLEDLEAYAIGQWECFLLQLISSTQAERPTNF--SSSMMKVFQRGLLSRRDKEAPR---- 186
T E+L Y WE L ++SS R T +S + + GL+ + + R
Sbjct: 104 TAEELIDYGENTWESILKYMVSSGLGARVTGRPRNSVLSLLHASGLMVDQFDDTGRNPNY 163
Query: 187 ----LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYN- 241
+T GFQFLL D QLW I+ Y++ Q RG AD++S L G+ Y+
Sbjct: 164 DRLTITSQGFQFLLEDRQTQLWQILMFYLARKQRRGEAAADILSLFFSLGCMQLGQDYSA 223
Query: 242 ---LNTLSEIQKSMIKDFADLGLVKLQQ---GRKENWFIPTKLATNLSMSLTD----SSA 291
+ Q+ + D G + +Q GRK + F PT LAT+L T ++A
Sbjct: 224 SKSFQNFPQGQQCL-DDLEQYGFIYRRQEVDGRKTDQFFPTHLATSLCSGDTSVNRSATA 282
Query: 292 RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITA 351
+ F+++ETN+R+YAY++++L IL LF + + PNLIVG + + + A + GI+A
Sbjct: 283 DDKRFLILETNYRVYAYTSNELEIAILNLFVNITIRYPNLIVGHLDRRHVKAAMDKGISA 342
Query: 352 EQIISFLQQNAHPRVADRMPSVPE-NVCDQIRLWESDLNRVE 392
QIIS+L +AHP++ + P + +V DQ+ LW+ + NRV+
Sbjct: 343 YQIISYLTTHAHPQMYNHPPPLLHPSVTDQLHLWDRERNRVQ 384
>gi|322697444|gb|EFY89224.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Metarhizium acridum CQMa 102]
Length = 541
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 113/385 (29%), Positives = 193/385 (50%), Gaps = 70/385 (18%)
Query: 82 LVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED---LE 138
++Q+ + S+EK +E +L + F+++LR L GG +PS T +P D L+
Sbjct: 115 IIQISMPSKEKPQEM--QLTTNFKNSLRLALTGGGEHNSFGVPS--TLLVPPEIDTAFLD 170
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSS---SMMKVFQRG-LLSRRDKEAPRLTESGFQF 194
YA +WE L ++SS + + S S+ ++ G L+ RR + +T++GF F
Sbjct: 171 RYARKRWEDILHFVVSSVGYKSTGDVSGPNKSVKELLIAGRLVDRRPNGSIGITQAGFTF 230
Query: 195 LLMDTNAQLWYIVREYISNSQER---GINQADLISFLLELSFHVAGEAYNLNTLSEIQKS 251
LL + NAQ+W ++ ++ S+ G+ D++SFL L+ G AY+ N L+E +++
Sbjct: 231 LLQEPNAQVWTLLLLWLEASETNKAAGLETVDMLSFLFVLASLELGRAYDTNALTEQRRN 290
Query: 252 MIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS-------------------------- 285
M+ D GL+ + Q K + F PT+LAT L+
Sbjct: 291 MLPSLLDFGLIYIPQ-HKRSMFFPTRLATALTSGGGSLRTISEGVAAATAAAGPSSHEGV 349
Query: 286 ----------------------LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSK 323
SA ++G ++VETN+R+YAY+ S L +L LF K
Sbjct: 350 AAATAAAGPSSQSGSGSGGAGGPLGPSADQKGSVIVETNYRIYAYTQSTLQIAVLALFCK 409
Query: 324 VEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR------VADRMPSVPENV 377
+ + P+++ G +T+ S++ A GITA+QIIS+L +AH + +A++ P +P V
Sbjct: 410 LNMRFPDMVAGRVTRTSIHQAINFGITADQIISYLAAHAHEQMHRTAALANK-PILPPTV 468
Query: 378 CDQIRLWESDLNRVEMTPAHYYDEF 402
DQIRLW+ + R++ T + +F
Sbjct: 469 VDQIRLWQLENERMKTTGGFLFRDF 493
>gi|402592023|gb|EJW85952.1| transcription factor Tfb2 family protein [Wuchereria bancrofti]
Length = 470
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 123/420 (29%), Positives = 198/420 (47%), Gaps = 34/420 (8%)
Query: 11 FMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPD 70
++ ++S L++LY P + AI R LP L+++ +++M ++ G+ W P+
Sbjct: 9 LLEYLSSRATDQLNKLYSYPAVSFAIFRLLPELSQQLILKMLWLSGSNRNCGWRAWSAPN 68
Query: 71 ---GFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGI 127
H A+ +L ++ + EE YR ++ LR I G +
Sbjct: 69 FEAAVINHISALKKL-KIIIGDEEIMINPVYR-----RAYLRATRIGLGKASELSVVKDC 122
Query: 128 --TARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL--SRRDKE 183
R +DL A+ +WEC L L +Q + S + ++F+ L S D
Sbjct: 123 DEKTRKSANKDLGKKAVERWECILHYLALPSQ-KSEQGVSGTTKRLFRSAGLTSSGNDDG 181
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGE---AY 240
+T +GFQFLL++ Q+W + Y + GI+ + FL ++ E
Sbjct: 182 DIEITSAGFQFLLLNRPEQIWTYLLHYFHMQEVAGIDIIKELDFLFKVVNDSDKERPAGS 241
Query: 241 NLNTLSEIQKSMIKDF----ADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKE-- 294
+ E IKDF +LGLV +++ RK+ F T L +L+ S A E
Sbjct: 242 RAFVIDENWPETIKDFLIHLRELGLVFIRK-RKDGVFFLTPLLNHLTGISNTSEASMENR 300
Query: 295 ---GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITA 351
GF++VETN+R+YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+ GITA
Sbjct: 301 NQHGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQVGITA 360
Query: 352 EQIISFLQQNAHP-------RVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
QII+FL NAHP + SVP V DQI+LWE + R+ A Y F S
Sbjct: 361 AQIIAFLSTNAHPITYAVTTEYGRMIQSVPITVADQIQLWEDERRRLTFCNATVYSSFES 420
>gi|322711210|gb|EFZ02784.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
(Tfb2), putative [Metarhizium anisopliae ARSEF 23]
Length = 540
Score = 150 bits (380), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 110/360 (30%), Positives = 185/360 (51%), Gaps = 62/360 (17%)
Query: 82 LVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLED---LE 138
++Q+ + S+EK +E +L + F+++LR L GG +PS T +P D L+
Sbjct: 118 IIQISMPSKEKPQEM--QLTTNFKNSLRLALTGGGEHNSFGVPS--TLLVPPEIDTPFLD 173
Query: 139 AYAIGQWECFLLQLISSTQAERPTNFSS---SMMKVFQRG-LLSRRDKEAPRLTESGFQF 194
YA +WE L ++SS + + S S+ ++ G L+ RR +T++GF F
Sbjct: 174 RYARKRWEDILHFVVSSVGYKSTGDVSGPNKSVKELLIAGRLVDRRPNGTIGITQAGFTF 233
Query: 195 LLMDTNAQLWYIVREYISNSQER---GINQADLISFLLELSFHVAGEAYNLNTLSEIQKS 251
LL + NAQ+W ++ ++ S+ G+ D++SFL L+ G AY+ N L+E +K+
Sbjct: 234 LLQEPNAQVWTLLLLWLEASETNKAAGLETVDMLSFLFVLASLELGRAYDTNALTEQRKN 293
Query: 252 MIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTD----------------------- 288
M+ D GL+ + Q K + F PT+LAT L+ +
Sbjct: 294 MLPSLLDFGLIYIPQ-HKRSMFFPTRLATTLTSGGSSLRTISEGVAAATAAATTTAPAGG 352
Query: 289 -----------------SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNL 331
SA ++G ++VETN+R+YAY+ S L +L LF K+ + P++
Sbjct: 353 PSSQSGSGSGGTGGPLGPSADQKGSVIVETNYRIYAYTQSTLQIAVLALFCKLNMRFPDM 412
Query: 332 IVGAITKESLYNAFENGITAEQIISFLQQNAHPR------VADRMPSVPENVCDQIRLWE 385
+ G +T+ S++ A GI+A+QIIS+L +AH + +A++ P +P V DQIRLW+
Sbjct: 413 VAGRVTRTSIHQAINFGISADQIISYLAAHAHEQMHRSAALANK-PILPPTVVDQIRLWQ 471
>gi|336270172|ref|XP_003349845.1| hypothetical protein SMAC_00733 [Sordaria macrospora k-hell]
gi|380095234|emb|CCC06707.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 496
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 124/456 (27%), Positives = 214/456 (46%), Gaps = 69/456 (15%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEW 66
+ ++ + L RLY++P AI R L AK V + Y+ P ++ W
Sbjct: 5 GQQLLEYLEKLPGTAFRRLYQSPATALAIFRRMLSHHAKVIVRGLLYMPDPFPLTSLDIW 64
Query: 67 VLPDGFTK-HR----VAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPRE 121
P+ K HR A+ L L++ K + ++ ++F+ L+ GG
Sbjct: 65 FPPESGGKIHRDRTIAALRSLHVLQVSPPGKDRTQEIQITTSFKKGLQLAFEGGGEDGSF 124
Query: 122 PMPSGITARLPTLED--LEAYAIGQWECFLLQLISS----TQAERPTNFSSSMMKVFQRG 175
+PS + PT++ L+ YA +W+ L +++S + +E + +S+ ++ G
Sbjct: 125 GVPSSLPVD-PTIDIGYLDKYARKKWDDILHYVVNSLGVQSSSEGVSGPKASVKELLLAG 183
Query: 176 LLSRR---DKEAPRLTESGFQFLLMDTNAQLWYIVR---EYISNSQERGINQADLISFLL 229
L R + +T++GF FLL ++NAQ+W ++ E ++ + G++ D++SFL
Sbjct: 184 RLVERRPDTRTGIGITQAGFTFLLQESNAQVWTLLLLWLEAADHAADSGMDSIDMLSFLF 243
Query: 230 ELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL------- 282
LS G AY+ + L+E +++M+ D GL+ R F PT+LAT L
Sbjct: 244 MLSSLELGTAYDTSALTETRRNMLPSLVDFGLIYFP--RDTRQFFPTRLATTLTSTASAL 301
Query: 283 ---SMSLTDSSARKEG-------------------------FIVVETNFRMYAYSTSKLH 314
S + T ++A G I++ETN+R+YAY+TS L
Sbjct: 302 RTVSSAFTAATANPSGEPGTTTSTSGGPAGATPADKTGTSKGIIIETNYRIYAYTTSPLQ 361
Query: 315 CEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSV- 373
+L LF ++ + PN++ A+ A GITA+QIIS+L +AH ++ R +V
Sbjct: 362 IAVLALFCQLNMRFPNMVSAAL------RAIGFGITADQIISYLHAHAHQQMV-REATVT 414
Query: 374 -----PENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
P V DQIRLW+ + R++ T + +F S
Sbjct: 415 GKTVLPPTVVDQIRLWQLENERMQTTAGFLFKDFES 450
>gi|2245059|emb|CAB10482.1| transcription factor like protein [Arabidopsis thaliana]
gi|7268453|emb|CAB80973.1| transcription factor like protein [Arabidopsis thaliana]
Length = 101
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 72/90 (80%), Positives = 78/90 (86%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP VKIIAKNFMDMVASL A LD+LY N FICEAILRSLPPLAKKYV+QM YID +PA
Sbjct: 1 MPQVKIIAKNFMDMVASLPAIKLDKLYNNVFICEAILRSLPPLAKKYVLQMLYIDVPVPA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE 90
MMEEWVL DG +KHRVAIDRL+QLR+FSE
Sbjct: 61 TMMEEWVLADGTSKHRVAIDRLIQLRIFSE 90
>gi|164425579|ref|XP_960485.2| hypothetical protein NCU05523 [Neurospora crassa OR74A]
gi|157070982|gb|EAA31249.2| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 475
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 117/451 (25%), Positives = 209/451 (46%), Gaps = 80/451 (17%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
+ ++ + L RLY++P AI R + + Y +G K+ +
Sbjct: 5 GQQLLEYLEKLPGTAFRRLYQSPATALAIFRRM----------LSYHEGG--GKIHRDRT 52
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGI 127
+ A+ L L++ K + ++ ++F+ L+ GG E G+
Sbjct: 53 I--------AALRSLHILQVSPPSKDRTQEIQITASFKKGLQLAFEGGG----EDGSFGV 100
Query: 128 TARLPT-----LEDLEAYAIGQWECFLLQLISS----TQAERPTNFSSSMMKVFQRGLLS 178
+ LP + L+ YA +W+ L +++S +E + +S+ ++ G L
Sbjct: 101 PSSLPVDPSIDIAYLDRYARKKWDDILHYVVNSLGVHGSSEGVSGPKASVKELLLAGRLV 160
Query: 179 RR---DKEAPRLTESGFQFLLMDTNAQLWYIVR---EYISNSQERGINQADLISFLLELS 232
R + +T++GF FLL ++NAQ+W ++ E ++ + G++ D++SFL LS
Sbjct: 161 ERRPDTRTGIGITQAGFTFLLQESNAQVWTLLLLWLEAADHTADSGMDSIDMLSFLFMLS 220
Query: 233 FHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL---------- 282
G AY+ + L+E +++M+ D GL+ R F PT+LAT L
Sbjct: 221 SLELGTAYDTSALTETRRNMLPSLVDFGLIYFP--RDTRQFFPTRLATTLTSTASALRTV 278
Query: 283 SMSLTDSSARKEG------------------------FIVVETNFRMYAYSTSKLHCEIL 318
S + T ++A G I++ETN+R+YAY+TS L +L
Sbjct: 279 SSAFTAATANPSGEPGTTTSTSGGPAATPDKTQTAAKGIIIETNYRIYAYTTSPLQIAVL 338
Query: 319 RLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRM-----PSV 373
LF ++ + PN++ G +T++S+ A GITA+QIIS+L +AH ++ P +
Sbjct: 339 ALFCQLNMRFPNMVSGRLTRDSIRQAIGFGITADQIISYLHAHAHQQMVREATVTGKPVL 398
Query: 374 PENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
P V DQIRLW+ + R++ T + +F S
Sbjct: 399 PPTVVDQIRLWQLENERMQTTAGFLFKDFES 429
>gi|90398991|emb|CAJ86263.1| H0801D08.21 [Oryza sativa Indica Group]
Length = 119
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 67/117 (57%), Positives = 90/117 (76%), Gaps = 2/117 (1%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
MP V ++A+NFMDMVA+L A LD LY++ FICEA+LRSLPPLAKKY +QM Y+ + A
Sbjct: 1 MPQVMVVARNFMDMVAALPAAKLDMLYDSAFICEAVLRSLPPLAKKYALQMLYVSAPVAA 60
Query: 61 KMMEEWVLPDGFTKHRVAIDRLVQLRLFSE--EKKKETTYRLNSTFQSNLRKHLIYG 115
MEEWVL + KHRVAIDRL+QLR+F E +++KE +Y++N FQ N++K+L+ G
Sbjct: 61 AAMEEWVLDEYAAKHRVAIDRLLQLRVFVEVRDRRKEVSYKMNQKFQGNMQKYLVDG 117
>gi|223993063|ref|XP_002286215.1| transcription factor of TFIIH family [Thalassiosira pseudonana
CCMP1335]
gi|220977530|gb|EED95856.1| transcription factor of TFIIH family [Thalassiosira pseudonana
CCMP1335]
Length = 461
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 99/296 (33%), Positives = 159/296 (53%), Gaps = 39/296 (13%)
Query: 136 DLEAYAIGQWECFLLQLISSTQAERPTNFS-----SSMMKVFQRGLLSRRDKE------- 183
DLE Y W+ L L+ S + +R + + +M++ R L + D +
Sbjct: 138 DLETYTQSSWDSVLHFLVGSDETDRGSGYGIEEPPEAMVQFLMRIGLMQVDPDFKGNDKS 197
Query: 184 -AP-RLTESGFQFLLMDTNAQLWYIVREYI---SNSQERGINQADLISFLLELSFHVAGE 238
AP +T G++F+L DTNAQ+W V +Y+ ++ +++ + + +SFL+ L GE
Sbjct: 198 RAPLVITSKGYEFMLRDTNAQVWQFVLQYLNSMAHHEQKDFIRMEALSFLICLGSCRVGE 257
Query: 239 AYNLNTL-SEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSS------- 290
Y + L S+ +++++DF+ GL+ + + +N F PT+ A NL S +
Sbjct: 258 GYQSSVLGSKSARALMRDFSRFGLLFVCRVAGKNSFYPTRAAVNLVASSEKAGRQASDLM 317
Query: 291 ---------ARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEY--QLPNLIVGAITKE 339
+R +VV+TNF++ AY+ SKLH L LF V +LPN+I IT++
Sbjct: 318 GQTLAAPVPSRSHLAVVVQTNFQVVAYTKSKLHISTLGLFCDVSSFRRLPNVIFFHITRD 377
Query: 340 SLYNAFENGITAEQIISFLQQNAHP--RVADRMPSVPENVCDQIRLWESDLNRVEM 393
S+ +AF G+TA+QI+ FL +AHP R DR P VP NV DQI LW+ + +RV M
Sbjct: 378 SIRSAFRLGVTADQILRFLHVHAHPMLRSGDR-PMVPSNVVDQILLWDRERHRVVM 432
>gi|336466164|gb|EGO54329.1| hypothetical protein NEUTE1DRAFT_87562 [Neurospora tetrasperma FGSC
2508]
gi|350286986|gb|EGZ68233.1| transcription factor Tfb2 [Neurospora tetrasperma FGSC 2509]
Length = 475
Score = 148 bits (373), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 117/451 (25%), Positives = 209/451 (46%), Gaps = 80/451 (17%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
+ ++ + L RLY++P AI R + + Y +G K+ +
Sbjct: 5 GQQLLEYLEKLPGTAFRRLYQSPATALAIFRRM----------LSYHEGG--GKIHCDRT 52
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGI 127
+ A+ L L++ K + ++ ++F+ L+ GG E G+
Sbjct: 53 I--------AALRSLHILQVSPPSKDRTQEIQITASFKKGLQLAFEGGG----EDGSFGV 100
Query: 128 TARLPT-----LEDLEAYAIGQWECFLLQLISS----TQAERPTNFSSSMMKVFQRGLLS 178
+ LP + L+ YA +W+ L +++S +E + +S+ ++ G L
Sbjct: 101 PSSLPVDPSIDIAYLDKYARKKWDDILHYVVNSLGVHGSSEGVSGPKASVKELLLAGRLV 160
Query: 179 RR---DKEAPRLTESGFQFLLMDTNAQLWYIVR---EYISNSQERGINQADLISFLLELS 232
R + +T++GF FLL ++NAQ+W ++ E ++ + G++ D++SFL LS
Sbjct: 161 ERRPDTRTGIGITQAGFTFLLQESNAQVWTLLLLWLEAADHTADSGMDSIDMLSFLFMLS 220
Query: 233 FHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL---------- 282
G AY+ + L+E +++M+ D GL+ R F PT+LAT L
Sbjct: 221 SLELGTAYDTSALTETRRNMLPSLVDFGLIYFP--RDTRQFFPTRLATTLTSTASALRTV 278
Query: 283 SMSLTDSSARKEG------------------------FIVVETNFRMYAYSTSKLHCEIL 318
S + T ++A G I++ETN+R+YAY+TS L +L
Sbjct: 279 SSAFTAATANPSGEPGTTTSTSGGPAATPDKTQTAAKGIIIETNYRIYAYTTSPLQIAVL 338
Query: 319 RLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRM-----PSV 373
LF ++ + PN++ G +T++S+ A GITA+QIIS+L +AH ++ P +
Sbjct: 339 ALFCQLNMRFPNMVSGRLTRDSIRQAIGFGITADQIISYLHAHAHQQMVREATVTGKPVL 398
Query: 374 PENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
P V DQIRLW+ + R++ T + +F S
Sbjct: 399 PPTVVDQIRLWQLENERMQTTAGFLFKDFES 429
>gi|430812024|emb|CCJ30551.1| unnamed protein product [Pneumocystis jirovecii]
Length = 319
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 147/268 (54%), Gaps = 22/268 (8%)
Query: 137 LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLL 196
L+ YA +WE L ++ + + P + ++++ GL+ + ++T SGFQFLL
Sbjct: 26 LDNYAKNKWEMILHFMVGTGTEKLPGDHVLNLLR--YSGLMFGSRCDEMKITNSGFQFLL 83
Query: 197 MDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDF 256
++ N+Q+W ++ Y++ ++ ++ D++ FL L G++Y +++L+ IQ ++++
Sbjct: 84 LNINSQIWTLLLHYLNMIEDLEVDPVDILQFLFMLGNLELGQSYLISSLTPIQIQVLENL 143
Query: 257 ADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCE 316
D G+V ++ + F PT+LAT L+ + E I S L
Sbjct: 144 KDYGIVYRRKSSRR--FYPTRLATILT-------SNSEILI-----------RHSPLQIA 183
Query: 317 ILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPEN 376
+L LF + + NL+VG IT+ S+ AF NGITAEQIIS+L AHP++ P +P
Sbjct: 184 VLNLFVHLYLRFSNLVVGVITRNSVRQAFMNGITAEQIISYLTSYAHPQMKKNTPIIPPT 243
Query: 377 VCDQIRLWESDLNRVEMTPAHYYDEFPS 404
V DQIRLWE + NR++ T + + +F S
Sbjct: 244 VNDQIRLWEMERNRLKATEGYLFRDFNS 271
>gi|164660182|ref|XP_001731214.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
gi|159105114|gb|EDP44000.1| hypothetical protein MGL_1397 [Malassezia globosa CBS 7966]
Length = 231
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 135/223 (60%), Gaps = 12/223 (5%)
Query: 152 LISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP---RLTESGFQFLLMDTNAQLWYIVR 208
++ S QA P N +++ + QR L +RD + +T GFQFLL D N+QLW ++
Sbjct: 1 MVGSEQAASPRN---TVLFLLQRAGLMQRDHQTNDKLNITSLGFQFLLQDVNSQLWALLL 57
Query: 209 EYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGR 268
Y+S + ER ++ ++++F + G AY S+ Q +++ +D GLV + +
Sbjct: 58 HYLSMADERNMDLVEVLAFFFTVGGLELGRAYESRGFSQTQLQTLEELSDYGLV-YRPSK 116
Query: 269 KENWFIPTKLATNLSMSLTDSSAR-----KEGFIVVETNFRMYAYSTSKLHCEILRLFSK 323
+F PT+LA+ L+ + + +R ++G++++ETN+R+YAY+ + L IL LF
Sbjct: 117 SAKYFFPTRLASTLTSTASPLLSRLNDQEEQGYLILETNYRVYAYTANLLRIAILNLFVT 176
Query: 324 VEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 366
++ +LPNL++G +T+ S+ +A GITA+QII++L +AHP++
Sbjct: 177 LKSRLPNLVIGQLTRHSVKSALNKGITADQIITYLTHHAHPQM 219
>gi|392574410|gb|EIW67546.1| hypothetical protein TREMEDRAFT_33390 [Tremella mesenterica DSM
1558]
Length = 504
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 107/416 (25%), Positives = 192/416 (46%), Gaps = 37/416 (8%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTK-HRVA--ID 80
+ +Y++ C ILR LP + + ++ + + + ++ + D F+ +VA I
Sbjct: 37 EEIYKSEAACLCILRQLPEICRYIILHFIWSHQPVKSSDLKALLRMDQFSPLSQVANDIQ 96
Query: 81 RLVQLRLFS----EEKKKETTYRLNSTFQSNLRKHLIYGGALPR--EPMPSGITARLPTL 134
++ ++F ++ K+ +N F+ LR L G P P T P+
Sbjct: 97 PAIKRKIFQPLELQQGKRGHHTPMNDVFKRGLRNALTGSGTSNSFGLPYPLSPTDEYPSE 156
Query: 135 EDLEAYAIGQWECFLLQLISST-----QAERPTN------FSSSMMKVFQRGLLSRRDKE 183
L + WE L ++SS RP SS +M ++ +
Sbjct: 157 SQLIQFGDRTWESILKYMVSSGLDRGFPVARPETSVLRLLHSSGLMADLSESNGRQQSLD 216
Query: 184 APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
+T GFQFLL D QLW I+ Y+++ + I+ A ++S L ++Y+L+
Sbjct: 217 RMTITSQGFQFLLEDRQTQLWQILVFYLTSIEVSCISSAPVLSLFFSLGCMQITQSYSLS 276
Query: 244 TLSE-IQKSMIKDFADLGLV---KLQQGRKENWFIPTKLATNL------------SMSLT 287
T S Q + D GL+ G K ++F PT L+T+L + +
Sbjct: 277 TFSNPTQLQALDDLQSYGLIYRPATPSGAKADYFFPTHLSTSLCSGNSALSAALNTTEID 336
Query: 288 DSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFEN 347
S + F+++ETN+++YAY+ ++L IL LF ++ Q NL+VG + + + A E
Sbjct: 337 TSHEDDKKFLILETNYKIYAYTNNELEIAILNLFMDIKVQYRNLVVGKLDRAHVKAAMEK 396
Query: 348 GITAEQIISFLQQNAHPRVADRMPSV-PENVCDQIRLWESDLNRVEMTPAHYYDEF 402
GI+A QII++LQ +AHP++ P + + DQ+ LW+ + NR+ + ++ F
Sbjct: 397 GISAYQIIAYLQSHAHPQMYSSPPPILHTTIVDQLHLWDKERNRLRTEESEMFEFF 452
>gi|321265552|ref|XP_003197492.1| subunit of TFIIH and nucleotide excision repair factor 3 complexes
[Cryptococcus gattii WM276]
gi|317463972|gb|ADV25705.1| Subunit of TFIIH and nucleotide excision repair factor 3 complexes,
putative [Cryptococcus gattii WM276]
Length = 481
Score = 140 bits (354), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 121/420 (28%), Positives = 203/420 (48%), Gaps = 41/420 (9%)
Query: 16 ASLT-ARD--LDRLYENPFICEAILRSLPPLAKKYVMQMFY-------IDGAIPAKMMEE 65
ASLT A D LD +++ F E + R LPP+ ++ ++ + D + +M
Sbjct: 17 ASLTSALDIFLDNQHQSYF--EELYRLLPPVCRQIILHALWSHHPLRVTDVKLLLQMDVH 74
Query: 66 WVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMP- 124
L + R AI+R V + KK++ + LN +F+ LR L GA +P
Sbjct: 75 APLEECDEVMRPAINRKVLHPM--HYKKQKMQWSLNDSFKRGLRNALTGLGASNSFGVPF 132
Query: 125 ---SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSS--MMKVFQ------ 173
LP+ ++L AY +E L ++SS T FS S +V Q
Sbjct: 133 ERHQATDLDLPSKDELVAYGEETFESILKYMVSSGLG---TEFSGSRPQPEVLQLLHASG 189
Query: 174 ----RGLLSRRDKEAPRLT--ESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISF 227
G +SRR+ RLT GFQFLL QLW I+ Y+S + ++++S
Sbjct: 190 LMTDPGDMSRRNPNIYRLTITSKGFQFLLEKRQTQLWEILMYYLSAKEANSERSSEVLSM 249
Query: 228 LLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL----S 283
L G+ Y+ + + + D A G + + + F PT LAT+L +
Sbjct: 250 FFSLGCMQLGQDYSASNSFPHAQEALNDLAQYGFI-YKPSPDSDQFWPTHLATSLCSGDA 308
Query: 284 MSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYN 343
++ SA + F+++ETN+++YAY++++L IL LF + + PNL+VG + ++ +
Sbjct: 309 SAIQSQSADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKA 368
Query: 344 AFENGITAEQIISFLQQNAHPRVADRMPSVPE-NVCDQIRLWESDLNRVEMTPAHYYDEF 402
A E GI+A QII++L +AHP++ + P + + DQ+ LW+ + NR++ Y+ F
Sbjct: 369 AMEKGISARQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLHLWDRERNRLQTEETVMYEFF 428
>gi|58262306|ref|XP_568563.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
neoformans JEC21]
gi|57230737|gb|AAW47046.1| hypothetical protein CNN01190 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 481
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 117/433 (27%), Positives = 202/433 (46%), Gaps = 40/433 (9%)
Query: 2 PLVKIIAKNFMDMVASLTARD--LDRLYENPFICEAILRSLPPLAKKYVMQMFY------ 53
P V K+ + A D LD +++ F E + R LPP+ ++ ++ +
Sbjct: 4 PFVSAPKKDATKQASPTLALDIFLDNQHQSYF--EELYRLLPPVCRQIILHALWSHHPLR 61
Query: 54 -IDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHL 112
+D + +M L + R AIDR V + KK+ + LN +F+ LR L
Sbjct: 62 TVDVKLLLQMDVHAPLEECDEVMRPAIDRKVLHPM--HYKKQRMQWSLNDSFKKGLRNAL 119
Query: 113 IYGGALPREPMP----SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSS- 167
G +P LP+ ++L AY +E L ++SS T FS S
Sbjct: 120 TGLGTSNSFGVPFERHQATDLDLPSEDELVAYGEDTFESILKYMVSSGLG---TEFSGSR 176
Query: 168 -----MMKVFQRGLLSR------RDKEAPRLT--ESGFQFLLMDTNAQLWYIVREYISNS 214
+ + GL++ R+ RLT GFQFLL QLW I+ Y+S
Sbjct: 177 PQPEVLQLLHTSGLMTDSVDMNGRNPNINRLTITSKGFQFLLEKRQTQLWEILMYYLSAK 236
Query: 215 QERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFI 274
+ ++++S L G+ Y+ + + + D A G + + + F
Sbjct: 237 EANSERSSEVLSMFFSLGCMQLGQDYSASNSFPGAQEALDDLAQYGFI-YKSSPDSDQFW 295
Query: 275 PTKLATNL----SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPN 330
PT LAT+L + ++ SA + F+++ETN+++YAY++++L IL LF + + PN
Sbjct: 296 PTHLATSLCSGDASAIQSQSADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPN 355
Query: 331 LIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPE-NVCDQIRLWESDLN 389
L+VG + ++ + A E GI+A QII++L +AHP++ + P + + DQ+ LW+ + N
Sbjct: 356 LVVGKLDRQHVKAAMEKGISARQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLYLWDRERN 415
Query: 390 RVEMTPAHYYDEF 402
R++ Y+ F
Sbjct: 416 RLQTEETVMYEFF 428
>gi|134118870|ref|XP_771938.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50254542|gb|EAL17291.1| hypothetical protein CNBN1180 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 481
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 117/433 (27%), Positives = 202/433 (46%), Gaps = 40/433 (9%)
Query: 2 PLVKIIAKNFMDMVASLTARD--LDRLYENPFICEAILRSLPPLAKKYVMQMFY------ 53
P V K+ + A D LD +++ F E + R LPP+ ++ ++ +
Sbjct: 4 PFVSAPKKDATKQASPTLALDIFLDNQHQSYF--EELYRLLPPVCRQIILHALWSHHPLR 61
Query: 54 -IDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHL 112
+D + +M L + R AIDR V + KK+ + LN +F+ LR L
Sbjct: 62 TVDVKLLLQMDVHAPLEECDEVMRPAIDRKVLHPM--HYKKQRMQWSLNDSFKKGLRNAL 119
Query: 113 IYGGALPREPMP----SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSS- 167
G +P LP+ ++L AY +E L ++SS T FS S
Sbjct: 120 TGLGTSNSFGVPFERHQATDLDLPSEDELVAYGEDTFESILKYMVSSGLG---TEFSGSR 176
Query: 168 -----MMKVFQRGLLSR------RDKEAPRLT--ESGFQFLLMDTNAQLWYIVREYISNS 214
+ + GL++ R+ RLT GFQFLL QLW I+ Y+S
Sbjct: 177 PQPEVLQLLHTSGLMTDSVDMNGRNPNINRLTITSKGFQFLLEKRQTQLWEILMYYLSAK 236
Query: 215 QERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFI 274
+ ++++S L G+ Y+ + + + D A G + + + F
Sbjct: 237 EANSERSSEVLSMFFSLGCMQLGQDYSASNSFPGAQEALDDLAQYGFI-YKSSPDSDQFW 295
Query: 275 PTKLATNL----SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPN 330
PT LAT+L + ++ SA + F+++ETN+++YAY++++L IL LF + + PN
Sbjct: 296 PTHLATSLCSGDASAIQSQSADDKRFLILETNYKIYAYTSNELEIAILNLFVDIRIRYPN 355
Query: 331 LIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPE-NVCDQIRLWESDLN 389
L+VG + ++ + A E GI+A QII++L +AHP++ + P + + DQ+ LW+ + N
Sbjct: 356 LVVGKLDRQHVKAAMEKGISARQIIAYLSSHAHPQMYNSPPPLLHPTIVDQLYLWDRERN 415
Query: 390 RVEMTPAHYYDEF 402
R++ Y+ F
Sbjct: 416 RLQTEETVMYEFF 428
>gi|170589001|ref|XP_001899262.1| Transcription factor Tfb2 family protein [Brugia malayi]
gi|158593475|gb|EDP32070.1| Transcription factor Tfb2 family protein [Brugia malayi]
Length = 465
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 98/304 (32%), Positives = 151/304 (49%), Gaps = 36/304 (11%)
Query: 135 EDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL--SRRDKEAPRLTESGF 192
+DL A+ +WEC L L +Q + S + ++F+ L S D +T +GF
Sbjct: 114 KDLGKKAVERWECILHYLALPSQ-KSEQGVSGTTKRLFRSAGLTSSGNDDGDIEITSAGF 172
Query: 193 QFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKS- 251
QFLL++ Q+W + Y + GI+ + FL +L+ + ++ K
Sbjct: 173 QFLLLNRPEQIWTYLLHYFHMQEVAGIDIVKELDFLFKLTLYSGSTCGGTRVVNGSDKER 232
Query: 252 ---------------MIKDF----ADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR 292
IKDF +LGLV +++ RK+ +F T L +L+ S A
Sbjct: 233 PTGSRAFIIDENWPETIKDFLMHLRELGLVFIRK-RKDGFFFLTPLLNHLTGISNTSEAG 291
Query: 293 KE-----GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFEN 347
E GF++VETN+R+YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+
Sbjct: 292 MENRNQNGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFSDMSVGVLSREAVRRAFQV 351
Query: 348 GITAEQIISFLQQNAHPRVA------DRM-PSVPENVCDQIRLWESDLNRVEMTPAHYYD 400
GITA QII+FL +NAHP RM SVP V DQI+LWE + R+ A Y
Sbjct: 352 GITAAQIIAFLSRNAHPITCAVTTEYGRMIQSVPITVVDQIQLWEDERRRLTFCSAAVYS 411
Query: 401 EFPS 404
F S
Sbjct: 412 SFES 415
>gi|428181353|gb|EKX50217.1| TFB2 transcription factor B2 nucleotide excision repair [Guillardia
theta CCMP2712]
Length = 463
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 117/424 (27%), Positives = 196/424 (46%), Gaps = 45/424 (10%)
Query: 1 MPLVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA 60
M +V+ +A V L +LYEN + C +++R +PPL ++ +M++ ++ G
Sbjct: 1 MRVVEDMAWEVCRTVGELPESACSKLYENQWACVSMMRLMPPLGRQLLMRLVHL-GQERV 59
Query: 61 KMMEEWVLPDGFTKHRV--AIDRLVQLR-LFSEEKKKET---TYRLNSTFQSNLRKHLIY 114
+++ WV + R+ + ++ +++ L+ +E KE+ + R+N TF + + L+
Sbjct: 60 ELLR-WVDEGEVQRQRLEEVVRKMEEVKILYLDEDVKESDGVSVRINPTFSERILQFLLQ 118
Query: 115 GGALPREPMPSGITARLP--TLEDLEAYAIGQWECFLLQLISS--TQAERPTNFSSSMMK 170
P I P T E+L+ +A +WE L ++S +RP
Sbjct: 119 RDTSPWSK--GRIMGDSPDTTAEELQQWAQNRWESILHYMVSPGLDPMKRPEPGWKPTKW 176
Query: 171 VFQRGLLSRRDKE-----------APRLTESGFQFLLMDTNAQLWYIVREYISNSQERGI 219
V RG+L DK AP++T GFQFLL Q+ ++ ++
Sbjct: 177 V--RGILVGEDKNGNSFMMSLQNLAPKITAFGFQFLLRPLRNQILQMLMHVLTKKGSENK 234
Query: 220 NQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLA 279
+ D +S + +LSF G Y L+ LS + +I DLG V + ++ +F PT L
Sbjct: 235 DLQDSLSCIFQLSFCTLGAGYRLDLLSRSNQELITSLHDLGAVYFKPNDRQ-YFFPTPLI 293
Query: 280 TNLSM-SLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITK 338
NL S S + + E + A ILRLF++ Y+ P++I+ IT+
Sbjct: 294 VNLCTESEVPSGPSTDAGVADEFSAEPAA--------GILRLFTRPIYKFPHMIIAVITR 345
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHY 398
ES+ NA II +L+ +AHP+ + P VPE V DQI WE + R+ PA
Sbjct: 346 ESIRNAL--------IIEYLRMHAHPQCLENWPIVPEVVTDQICFWEQERCRIRAEPAVA 397
Query: 399 YDEF 402
Y F
Sbjct: 398 YHNF 401
>gi|405123868|gb|AFR98631.1| transcription factor TFIIH complex subunit Tfb2 [Cryptococcus
neoformans var. grubii H99]
Length = 481
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/401 (28%), Positives = 189/401 (47%), Gaps = 43/401 (10%)
Query: 30 PFICEAI----LRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQL 85
P +C I L S PL V + +D P + +E + P AIDR V
Sbjct: 43 PSVCRQIILHALWSHHPLRTVDVKLLLQMDVHAPLEECDEVMRP--------AIDRKVLH 94
Query: 86 RLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMP----SGITARLPTLEDLEAYA 141
+ KK++ + +N +F+ LR L G +P LP+ ++L AY
Sbjct: 95 PM--HYKKQKMQWSMNDSFKKGLRNALTGLGTSNSFGVPFERHQATDLDLPSEDELVAYG 152
Query: 142 IGQWECFLLQLISSTQAERPTNFSSS--MMKVFQ----------RGLLSRRDKEAPRLT- 188
+E L ++SS T FS S +V Q G ++ R+ RLT
Sbjct: 153 EDTFESILKYMVSSGLG---TEFSGSRPQPEVLQLLHASGLMTDPGDMTGRNPNINRLTI 209
Query: 189 -ESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSE 247
GFQFLL QLW I+ Y+S + ++++S L G+ Y+ +
Sbjct: 210 TSKGFQFLLEKRQTQLWEILMYYLSAKEANSERSSEVLSMFFSLGCMQLGQDYSASKSFP 269
Query: 248 IQKSMIKDFADLGLV-KLQQGRKENWFIPTKLATNL----SMSLTDSSARKEGFIVVETN 302
+ + D A G + K G + W PT LAT+L + ++ SA + F+++ETN
Sbjct: 270 HAQEALDDLAQYGFIYKSSPGSDQFW--PTHLATSLCSGDASAIQSQSADDKRFLILETN 327
Query: 303 FRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 362
+++YAY++++L IL LF + + PNL+VG + ++ + A E GI+A QII++L +A
Sbjct: 328 YKIYAYTSNELEIAILNLFVDIRIRYPNLVVGKLDRQHVKAAMEKGISAGQIIAYLSSHA 387
Query: 363 HPRVADRMPSVPE-NVCDQIRLWESDLNRVEMTPAHYYDEF 402
HP++ + P + + DQ+ LW+ + NR++ Y+ F
Sbjct: 388 HPQMYNSPPPLLHPTIVDQLHLWDRERNRLQTEETVMYEFF 428
>gi|393910939|gb|EJD76097.1| transcription factor Tfb2 family protein, variant [Loa loa]
Length = 389
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 97/304 (31%), Positives = 152/304 (50%), Gaps = 36/304 (11%)
Query: 135 EDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL--SRRDKEAPRLTESGF 192
+DL A+ +WEC L L +Q + S + ++F+ L S D +T +GF
Sbjct: 38 KDLGKKAVERWECILHYLALPSQ-KSEQGVSGTTKRLFRSAGLTSSENDDGDIEITSAGF 96
Query: 193 QFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFH---VAGEAYNLNT----- 244
QFLL++ + Q+W + Y + GI+ + FL L+ + +G N
Sbjct: 97 QFLLLNRSEQIWTYLLYYFHMQEVAGIDIIKELDFLFRLTLYSGDTSGGTRVTNGSDKER 156
Query: 245 --------LSEIQKSMIKDF----ADLGLVKLQQGRKENWFIPTKLATNL-SMSLTDSSA 291
+ E KDF +LGLV +++ RK+ +F T L +L S+S +
Sbjct: 157 PAGSRAFLIDECWPEPTKDFLMHLRELGLVFIRK-RKDGFFFLTPLLNHLTSISYASETG 215
Query: 292 ----RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFEN 347
+ GF++VETN+R+YAY+ S L IL F+++ Y+ ++ VG +++E++ AF+
Sbjct: 216 IENRNQNGFVIVETNYRVYAYTDSNLQLAILSTFTEMLYRFNDMSVGVLSREAVRRAFQV 275
Query: 348 GITAEQIISFLQQNAHPRV-------ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYD 400
GITA QII+FL+ NAHP + SVP V DQIRLWE + R+ A Y
Sbjct: 276 GITAAQIIAFLRANAHPTTYAVTTEYGGMIQSVPITVADQIRLWEDERRRLMFCNAAVYS 335
Query: 401 EFPS 404
F S
Sbjct: 336 SFES 339
>gi|149031822|gb|EDL86757.1| rCG41794 [Rattus norvegicus]
Length = 209
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 104/158 (65%), Gaps = 3/158 (1%)
Query: 251 SMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDS--SARKEGFIVVETNFRMYAY 308
+ ++ + GLV Q+ RK + PT+LA NLS ++ + + + GFIVVETN+R+YAY
Sbjct: 7 NFLQHLREFGLV-FQRKRKSRRYYPTRLAINLSSGVSGAGGTVHQPGFIVVETNYRLYAY 65
Query: 309 STSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVAD 368
+ S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP +
Sbjct: 66 TESELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLK 125
Query: 369 RMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
+ P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 126 QTPVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 163
>gi|390370355|ref|XP_783140.3| PREDICTED: general transcription factor IIH subunit 4-like, partial
[Strongylocentrotus purpuratus]
Length = 180
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 68/139 (48%), Positives = 93/139 (66%), Gaps = 2/139 (1%)
Query: 268 RKENWFIPTKLATNLSMSLTDSSA--RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVE 325
RK + PT+LA NL+ ++ + K+GFIVVETNFR+YAY+ S L EIL LF +
Sbjct: 1 RKSMRYYPTRLAINLASGVSSMAKDDHKDGFIVVETNFRIYAYTESDLQVEILGLFCSMM 60
Query: 326 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWE 385
Y+ PNL V A+T+ES+ A NGITAEQI+SFL+ +AHP + + P VP + DQ+RLWE
Sbjct: 61 YRFPNLSVAALTRESVQLAISNGITAEQILSFLRTHAHPNMRLKTPIVPPTISDQVRLWE 120
Query: 386 SDLNRVEMTPAHYYDEFPS 404
+ +R+ T Y+EF S
Sbjct: 121 LERDRLSFTQGIIYNEFLS 139
>gi|195477962|ref|XP_002086438.1| GE22864 [Drosophila yakuba]
gi|194186228|gb|EDW99839.1| GE22864 [Drosophila yakuba]
Length = 241
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 72/192 (37%), Positives = 112/192 (58%), Gaps = 16/192 (8%)
Query: 228 LLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNL----- 282
L +LSF G Y+ ++ + ++ + GLV Q+ RKE F PT+LA N+
Sbjct: 2 LFQLSFSTLGRDYSSEGMNSQMLTFLQHLREFGLV-FQRKRKEGRFYPTRLALNVTSKEA 60
Query: 283 ----SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITK 338
S+++ + + + G+IVVETN+R+YAY+ S L +L LF+++ Y+ PNL+VG +T+
Sbjct: 61 AATASVAMDEEATQDCGYIVVETNYRVYAYTDSPLQVAVLGLFTELLYRFPNLVVGVLTR 120
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRV------ADRMPSVPENVCDQIRLWESDLNRVE 392
+S+ A GITAEQI+S+L+Q AHP + +P V DQI+LWE + NR
Sbjct: 121 DSVRQALRGGITAEQIVSYLEQYAHPNMRLVESAIQSKSCLPPTVVDQIKLWELERNRFT 180
Query: 393 MTPAHYYDEFPS 404
T Y++F S
Sbjct: 181 YTEGVVYNQFLS 192
>gi|323455511|gb|EGB11379.1| hypothetical protein AURANDRAFT_52511 [Aureococcus anophagefferens]
Length = 291
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 124/221 (56%), Gaps = 19/221 (8%)
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
+T G FLL + + Q+W +V EYI + G + ++++ +L L++ GE Y ++ LS
Sbjct: 12 VTGRGVDFLLKERHEQVWALVDEYIRAA---GDAKGEVVALVLTLAYATPGEGYAIHELS 68
Query: 247 EIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSS--ARKEGF------IV 298
E QK+ + LGLV ++ + F PT L +++ S+ AR G ++
Sbjct: 69 EAQKAALDVLFALGLV-YRRNASSSRFYPTTLGVDVAFGARRSAGGARAGGDFRRPVDVI 127
Query: 299 VETNFRMYAYS-----TSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
V+TNF++ AY+ TS L L LF+++ +LPNL+VG I+++++ + GI Q
Sbjct: 128 VQTNFQVLAYTDAGVNTSTLVLATLNLFAELTTRLPNLVVGTISRDAIKRCVDRGIRVPQ 187
Query: 354 IISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMT 394
I+ FL+ +AHP A + VP+NV DQ+ LW + NRV T
Sbjct: 188 IVKFLRAHAHP--AMKASGVPQNVTDQMALWAGEGNRVAFT 226
>gi|432098035|gb|ELK27922.1| General transcription factor IIH subunit 4 [Myotis davidii]
Length = 377
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/310 (29%), Positives = 144/310 (46%), Gaps = 79/310 (25%)
Query: 100 LNSTFQSNLRKHLIYGG-ALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQA 158
LN F NLR L+ GG A + G + L+ YA +WE L + S A
Sbjct: 98 LNPVFHQNLRTALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMAGSPSA 157
Query: 159 ERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERG 218
FLL+DT AQLWY + +++ +Q RG
Sbjct: 158 A----------------------------------FLLLDTPAQLWYFMLQHLQTAQSRG 183
Query: 219 INQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKL 278
++ +++SFL +LSF G+ Y++ +S+ ++ + GLV Q+ RK + PT+L
Sbjct: 184 MDLVEILSFLFQLSFSSLGKDYSVEGMSDSRR-------EFGLV-FQRKRKSRRYYPTRL 235
Query: 279 ATNLS--MSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAI 336
A +LS +S +A + GFIVVETN+++YAY+ S+L ++ LFS++ Y+ PN++V
Sbjct: 236 AISLSSGVSGAGGTAHQPGFIVVETNYQLYAYTESELQIALIALFSEMLYRFPNMVVA-- 293
Query: 337 TKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPA 396
+ P +P + DQIRLWE + +R+ T
Sbjct: 294 --------------------------------QTPVLPPTITDQIRLWELERDRLLFTEG 321
Query: 397 HYYDEFPSRV 406
Y++F S+V
Sbjct: 322 VLYNQFLSQV 331
>gi|355693901|gb|AER99490.1| proteinral transcription factor IIH, polypeptide 4, 52kDa [Mustela
putorius furo]
Length = 265
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 78/248 (31%), Positives = 127/248 (51%), Gaps = 7/248 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 13 VHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 72
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 73 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 132
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 133 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKST 189
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 190 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 249
Query: 240 YNLNTLSE 247
Y++ +S+
Sbjct: 250 YSVEGMSD 257
>gi|90076516|dbj|BAE87938.1| unnamed protein product [Macaca fascicularis]
Length = 358
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 78/248 (31%), Positives = 127/248 (51%), Gaps = 7/248 (2%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMM 63
V + +N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P +
Sbjct: 48 VHLQCRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAV 107
Query: 64 EEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPR 120
WV + + L LR++ + LN F+ NLR L+ GG A
Sbjct: 108 ALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQNLRIALLGGGKAWSD 167
Query: 121 EPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLL-SR 179
+ G + L+ YA +WE L ++ S A + + + Q GL+ S
Sbjct: 168 DTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLAQLLS---QAGLMKST 224
Query: 180 RDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEA 239
E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ +++SFL +LSF G+
Sbjct: 225 EPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEILSFLFQLSFSTLGKD 284
Query: 240 YNLNTLSE 247
Y++ +S+
Sbjct: 285 YSVEGMSD 292
>gi|412992644|emb|CCO18624.1| predicted protein [Bathycoccus prasinos]
Length = 464
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 84/275 (30%), Positives = 131/275 (47%), Gaps = 59/275 (21%)
Query: 137 LEAYAIGQWECFLLQLISSTQAER----------------PTNFSSSMMKVFQRGLLSRR 180
L+ YA +WE LL+L S E+ + + K+F L +
Sbjct: 190 LDHYAKSRWETVLLELTSEASVEKKRKKKKMMKKKGDGGAGAQHALHLRKLFYAARLITK 249
Query: 181 DKEAPRLTESGFQFLLMDTNAQLWYIVREYI------------SNSQERGI--------- 219
D R+T GF FLL T+ Q+W ++ Y + ++E G
Sbjct: 250 D---GRITSEGFSFLLGATSEQIWILLARYARGGDFEKKMKNKTTNEENGEGDKMETNGE 306
Query: 220 ------------NQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLV--KLQ 265
+ A ++FL+ LSF G Y+ LSE ++ + + LG++
Sbjct: 307 DEKQQHKLKSDESSAAAMAFLVRLSFQHPGRKYSKANLSEAERRVASHLSALGVLYENED 366
Query: 266 QGRKENWFIPTKLATNLSMSLTDSSA-----RKEGFIVVETNFRMYAYSTSKLHCEILRL 320
NW++PT L+ LS T SSA R +G I+VETNFR+YAY+ S+L E+LRL
Sbjct: 367 DENDNNWYVPTVLSAGLSSVSTTSSAKSALARIDGHIIVETNFRVYAYTHSELETEVLRL 426
Query: 321 FSKVEYQLPNLIVGAITKESLYNAFENGITAEQII 355
F++ +Y+LPN VG IT++S+ +A GI+ +QI+
Sbjct: 427 FTRPDYKLPNAYVGMITRDSILDAMRAGISPDQIV 461
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 3/81 (3%)
Query: 11 FMDMVASLTARDLDRLYE-NPFICEAILRSLPPLAKKYVMQMFYI--DGAIPAKMMEEWV 67
F++ A L+ ++ LY +P+ A+LR+L PLAK+Y+M+ ++ I +M+ W
Sbjct: 22 FVEFFAHLSEEKINHLYTISPWATLAVLRALTPLAKQYIMRCVFLPLHSVIAHEMLASWT 81
Query: 68 LPDGFTKHRVAIDRLVQLRLF 88
T H +AI+RLVQLR F
Sbjct: 82 KRGCETAHEMAIERLVQLRAF 102
>gi|403348380|gb|EJY73625.1| TFIIH and nucleotide excision repair factor 3 complexes subunit
[Oxytricha trifallax]
Length = 532
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/295 (30%), Positives = 148/295 (50%), Gaps = 39/295 (13%)
Query: 91 EKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAIGQWEC--- 147
E + +YRLN F +N+ L L R I + L+ + +WE
Sbjct: 113 EAQNRKSYRLNKEFLNNVIDILTTKFELTRLKEKKKIKNQ----AQLDTASQTKWENISK 168
Query: 148 FLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP--RLTESGFQFLLMDTNAQLWY 205
F+L +I Q N S++ + Q+ D A +LT +GFQ++LMD +Q+
Sbjct: 169 FMLNIIPPQQ-----NTPKSLINLLQQARYIEFDVNAQTYQLTVNGFQYILMDIASQVQN 223
Query: 206 IVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQ 265
I+ YI ++ G N +++ F+ L+ N N S+++DF D+GLV ++
Sbjct: 224 ILLNYIQTAETSGRNVVEVLGFIFNLTLTEPNYDGNENV-----NSILRDFDDMGLVGME 278
Query: 266 QGRKENWFIPTKLATNLSMSLTDS-----------------SARKEGFIVVETNFRMYAY 308
+ +K F+ T L + S T S ++ F++VETNF++YAY
Sbjct: 279 RRQK---FVITSLLQSFLQSQTGQLQKSSDSSSSNVIDAQRSQSQDKFLIVETNFKVYAY 335
Query: 309 STSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAH 363
++S L+ +LR F +VE PNL+VG +T++SL AF+ GI++ QI+SFL+ + H
Sbjct: 336 TSSDLYRALLRQFIRVECIFPNLVVGTLTRKSLQKAFQRGISSGQILSFLESHIH 390
>gi|303388087|ref|XP_003072278.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
intestinalis ATCC 50506]
gi|303301417|gb|ADM10918.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
intestinalis ATCC 50506]
Length = 414
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 121/232 (52%), Gaps = 27/232 (11%)
Query: 175 GLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVR---EYISNSQERGINQADLISFLLEL 231
GLL R + +T GF+FLL QLW +V +Y S E ++ + I EL
Sbjct: 159 GLLGRDEN----ITNRGFEFLLKTKKEQLWCLVLLSLKYFLKSVEEEVSTLEAI---FEL 211
Query: 232 SFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSA 291
S G Y + S + K ++K LG++KL EN I + L ++S
Sbjct: 212 STKNVGTVYRHS--SSMDKRLLKYLEVLGILKLYG---ENLAIRRSF-----VQLFEASE 261
Query: 292 R-KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGIT 350
R + FI+VETN ++YAY+ S+ ++ LF V + LPNL G+IT+ES+ AF+ GIT
Sbjct: 262 RNRREFIIVETNNKIYAYTNSEYEKSVIHLFCNVSFNLPNLTKGSITEESVNAAFDKGIT 321
Query: 351 AEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
QII FL+ ++ P S+P + +QI +WES NR+ M P + Y F
Sbjct: 322 GRQIIHFLEASSKP------GSLPPAIINQIIIWESKRNRIFMAPGYLYSNF 367
>gi|401825125|ref|XP_003886658.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
ATCC 50504]
gi|395459803|gb|AFM97677.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon hellem
ATCC 50504]
Length = 414
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/220 (35%), Positives = 115/220 (52%), Gaps = 23/220 (10%)
Query: 187 LTESGFQFLLMDTNAQLWYIVR---EYISNSQERGINQADLISFLLELSFHVAGEAYNLN 243
+T GF+FLL QLW +V +Y S + I+ + L ELS G AY+
Sbjct: 167 ITNRGFEFLLKTKKEQLWCLVLLSLKYFLGSVDEEIHALEA---LFELSTKAVGIAYHRT 223
Query: 244 TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR-KEGFIVVETN 302
+ ++K LG++KL KE+ I + L ++S R + FI+VETN
Sbjct: 224 --DRMDARLLKYLEALGILKLY---KESLAIGKSF-----VQLFEASERNRREFIIVETN 273
Query: 303 FRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA 362
++YAY+ S+ ++ LF V LPNLI G+IT+ES+ AF+ GIT +QII FL+ +
Sbjct: 274 NKIYAYTNSEYEKSVIHLFCSVSVNLPNLIKGSITEESVNVAFDKGITGKQIIHFLEASV 333
Query: 363 HPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
P S+P + QI +WES NR+ M P + Y F
Sbjct: 334 KP------GSLPPAISSQIMIWESKRNRIFMAPGYIYSNF 367
>gi|397564488|gb|EJK44232.1| hypothetical protein THAOC_37249 [Thalassiosira oceanica]
Length = 971
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 145/287 (50%), Gaps = 30/287 (10%)
Query: 132 PTLEDLEAYAIGQWECFLLQLISSTQAERPT-----NFSSSMMKVFQRGLLSRRDKE--- 183
P+ +LEA+ QW+ L L+ S ++R + +M+ R L + D +
Sbjct: 220 PSRRELEAHTQSQWDSVLHFLVGSDDSDRKGVNGIEEPNVAMVNFLTRIGLMQEDPDFTG 279
Query: 184 -----AP-RLTESGFQFLLMDTNAQLWYIVREYISNSQERGIN---QADLISFLLELSFH 234
AP +T G++F+L DTNAQ+W V +Y+++ + + + +SFL+ L
Sbjct: 280 KDRSRAPLVITSKGYEFMLRDTNAQVWQFVLQYLNSMAHHDLKDTIRKEALSFLICLGSC 339
Query: 235 VAGEAYNLNTL-SEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARK 293
GE Y + L S+ + ++KDFA GL+ + + + F PT++A NL S +R+
Sbjct: 340 RIGEGYFSSVLGSKSARVLMKDFARFGLLFVCRVAGKTAFYPTRVAVNLVASNEKGGSRQ 399
Query: 294 EGFIVVETNFR------MYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFEN 347
++ M A ++ H ++ +LPN++ +T++S+ +AF
Sbjct: 400 SDALLPSVAATRSLEEAMNAPDPTRSHLAVISY-----KRLPNVVFFHLTRDSIKSAFRL 454
Query: 348 GITAEQIISFLQQNAHPRV-ADRMPSVPENVCDQIRLWESDLNRVEM 393
G+TA+QI+ FLQ +AHP + + P P NV DQI LW+ + RV M
Sbjct: 455 GVTADQILRFLQVHAHPMMRSGNQPMPPANVRDQILLWDRERRRVVM 501
>gi|85691005|ref|XP_965902.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi GB-M1]
gi|19068469|emb|CAD24937.1| hypothetical protein [Encephalitozoon cuniculi GB-M1]
gi|449329766|gb|AGE96035.1| hypothetical protein ECU01_0670 [Encephalitozoon cuniculi]
Length = 414
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 82/238 (34%), Positives = 122/238 (51%), Gaps = 28/238 (11%)
Query: 170 KVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVR---EYISNSQERGINQADLIS 226
+V G L RD+ +T GF+FLL QLW +V +Y S E I ++
Sbjct: 153 EVLMFGGLLGRDRN---ITNRGFEFLLKTKKEQLWCLVLLSLKYFLGSVEEEI---AVLE 206
Query: 227 FLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLS-MS 285
L ELS G Y + ++ + + K LG+++L + LA S +
Sbjct: 207 ALFELSARAVGTVYR--QVDQMDRRLFKYLEALGILRLYE---------KGLAIGRSFVQ 255
Query: 286 LTDSSAR-KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
L ++S R + FI+VETN ++YAY+ S+ ++ LF V LPNLI G IT+ES+ A
Sbjct: 256 LFEASERNRREFIIVETNNKIYAYTNSEYEKSVIHLFCNVTVNLPNLIKGIITEESVNVA 315
Query: 345 FENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
F+ GIT +QII FL+ + R S+P + +QI +WES NR+ M P + Y F
Sbjct: 316 FDKGITGKQIIHFLEASV------RQGSLPPAIRNQIVIWESKRNRIFMVPGYLYSNF 367
>gi|67483069|ref|XP_656826.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56474050|gb|EAL51440.1| hypothetical protein EHI_192400 [Entamoeba histolytica HM-1:IMSS]
gi|449706224|gb|EMD46115.1| tfiih basal transcription factor complex p52 subunit, putative
[Entamoeba histolytica KU27]
Length = 484
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 102/414 (24%), Positives = 199/414 (48%), Gaps = 43/414 (10%)
Query: 6 IIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA-KMME 64
II K+ + L + Y++ + C I R LP + + ++++ ++ I +M
Sbjct: 5 IIEKDIIKYFQVLPEIAMKSAYKSKWTCSEIFRLLPGMCQSLLLRIIFLTERISINEMYN 64
Query: 65 EWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTY-RLNSTFQSNLRKHLIYGGALPREPM 123
++ +P+ I+ + + + +E++ Y +LN+ FQSN + +L G P +
Sbjct: 65 QFKIPNETMDE--VINTIHSIHIIDKEEEDGILYIKLNNDFQSNFKMNLT-GSMEPAYKI 121
Query: 124 P--SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRD 181
+ +++ + L+ I +E F+ +L+ Q N S++ +++F+ L +
Sbjct: 122 KEVNEKVSKIIQEKCLKENCIKTFEYFMNKLL---QFNSQPNISNNPLRIFKDLELVK-- 176
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYN 241
+E ++T G+QFL +T QLW I+ I Q R + I+ + E+++ YN
Sbjct: 177 EETRQITRKGYQFLFQETKTQLWVIMLSIIGLIQRRI---SPFINDVFEMTYLKEHIIYN 233
Query: 242 LNTLSEIQK-SMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR------KE 294
+ ++ ++ F DLG++ +++N T L MSL S+A K+
Sbjct: 234 CDHFKKVYGPDPLQLFNDLGIIVYY--KEQNVMAITPL-----MSLLRSNANIPSDLVKK 286
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
+ E N+ +Y Y+ S+ ++ RLF + +QL NL VG + +S+ AF GIT+E +
Sbjct: 287 PKTITEINYSVYIYTESQFQVDLYRLFIRKNFQLANLWVGKLNHQSVTEAFAKGITSEML 346
Query: 355 ISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVE---MTPAHYYDEFPSR 405
I+FLQ P++P + QI LW+ ++NR + + +YD+ R
Sbjct: 347 INFLQ-----------PNLPRTIQKQIDLWKKEINRFKEEHVVRYRFYDDAIGR 389
>gi|167395228|ref|XP_001741280.1| TFIIH basal transcription factor complex p52 subunit [Entamoeba
dispar SAW760]
gi|165894177|gb|EDR22239.1| TFIIH basal transcription factor complex p52 subunit, putative
[Entamoeba dispar SAW760]
Length = 489
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 101/414 (24%), Positives = 200/414 (48%), Gaps = 43/414 (10%)
Query: 6 IIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPA-KMME 64
II K+ + L + Y++ + C I R LP + + ++++ ++ I ++
Sbjct: 5 IIEKDIIKYFQVLPEIAMKSAYKSKWTCSEIFRLLPGMCQSLLLRIIFLKERITIHELYN 64
Query: 65 EWVLPDGFTKHRVAIDRLVQLRLFSEEKKKETTY-RLNSTFQSNLRKHLIYGGALPREPM 123
++ +P+ I+ + + + +E++ Y +LN+ FQ+N + +LI G P +
Sbjct: 65 QFKIPNETMDE--VINTIHSIHIIDKEEENGILYIKLNNDFQNNFKMNLI-GSMEPAYKI 121
Query: 124 P--SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRD 181
+ +++ + L+ I +E F+ +L+ Q N S++ +++F+ L +
Sbjct: 122 KEVNEKVSQIIKEKCLKENCIKTFEYFMNKLL---QFNSQPNISNNPLRIFKDLELVK-- 176
Query: 182 KEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYN 241
+E ++T G+QFL +T QLW I+ I Q R + I+ + E+++ YN
Sbjct: 177 EETRQITRKGYQFLFQETKTQLWVIMLSIIGLIQRRI---SPFINDVFEMTYLKENIIYN 233
Query: 242 LNTLSEIQK-SMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR------KE 294
+ ++ ++ F DLG++ +++N T L MSL S+A K+
Sbjct: 234 CDRFKKVYGPDPLQLFNDLGIIVYY--KEQNVMAITPL-----MSLLRSNANIPSDLVKK 286
Query: 295 GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQI 354
+ E N+ +Y Y+ S+ ++ RLF + +QL NL VG + +S+ AF GIT+E +
Sbjct: 287 PKTITEINYSVYIYTESQFQVDLYRLFIRKNFQLANLWVGKLNHQSVTEAFAKGITSEML 346
Query: 355 ISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVE---MTPAHYYDEFPSR 405
I+FLQ P++P + QI LW+ ++NR + + +YD+ R
Sbjct: 347 INFLQ-----------PNLPRTIQKQIDLWKKEINRFKEEHVVRYKFYDDAIGR 389
>gi|396080769|gb|AFN82390.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH subunit TFB2 [Encephalitozoon
romaleae SJ-2008]
Length = 414
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/317 (31%), Positives = 157/317 (49%), Gaps = 39/317 (12%)
Query: 91 EKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLL 150
EKK + Y L+++F R+ L+ G + + M + LE E + G++E L
Sbjct: 85 EKKGDNIY-LDASF----RRSLMEGFCILK--MEKHYKSTGKGLEIDENESNGKFEAILR 137
Query: 151 QLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVR-- 208
+++ + R F + +F G L R+D+ +T GF+FLL QLW +V
Sbjct: 138 SIVN--REYRIQAFGIGEVLMF--GGLLRKDRN---ITNRGFEFLLKTKKEQLWCLVLLS 190
Query: 209 -EYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQG 267
+Y S E I + L ELS A++ + + ++K LG++++
Sbjct: 191 LKYFLRSVEEEIYALEA---LFELSTKTIETAHHQD--GRMDGRLLKYLEALGILRVCGE 245
Query: 268 RKENWFIPTKLATNLS-MSLTDSSAR-KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVE 325
R LA S + L ++S R + FI+VETN ++YAY+ S+ ++ LF V
Sbjct: 246 R---------LAIGRSFVQLFEASERNRREFIIVETNNKIYAYTNSEYEKSVIHLFCNVS 296
Query: 326 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWE 385
LPNLI G+IT+ES+ AF+ GIT +QII FL+ + P S+P + QI +WE
Sbjct: 297 VSLPNLIKGSITEESVNVAFDKGITGKQIIHFLEASVKP------GSLPPAITSQIMIWE 350
Query: 386 SDLNRVEMTPAHYYDEF 402
S NR+ M P + Y F
Sbjct: 351 SKRNRIFMVPGYIYSNF 367
>gi|123488965|ref|XP_001325283.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121908180|gb|EAY13060.1| hypothetical protein TVAG_212420 [Trichomonas vaginalis G3]
Length = 460
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/421 (23%), Positives = 175/421 (41%), Gaps = 62/421 (14%)
Query: 3 LVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKM 62
L K I K D V +L + D D C +R LPPL+++ + ++ YI+ P
Sbjct: 4 LFKYIRKQPEDFVINLIS-DQDA-------CRTYIRLLPPLSQQILFRLLYIERGFPRSE 55
Query: 63 MEEWVLPDGFTKHRVAIDRLVQLRL--FSEEKKKETTYRLNSTFQSNLRKHLIYGGAL-- 118
+E+W AI +L++ L + E +Y NS+ N R + G
Sbjct: 56 VEKWSPKSKSNDLINAIKKLIKAHLVYIVTDTISENSY--NSSIIINKRVREVLLGTYVE 113
Query: 119 -----------------PRE---PMPSGITARLPT-----LEDLEAYAIGQWECFLLQLI 153
P E P ++ PT L+D Y + F+L+L
Sbjct: 114 DKKAFVRQKNPFAPDDQPDENFVPFDELDQSKFPTIDEKILDDFSNYQLESILSFMLKLR 173
Query: 154 SSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN 213
+ E S + + F L + G +FLL+ Q+W IV+ Y+
Sbjct: 174 DNIDNEAKKILSDAKLMEFGGNLCPK-----------GHRFLLLSPKEQIWRIVKCYLKF 222
Query: 214 SQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWF 273
+++ + + FLL++ + Y + +L+ QK ++ F +GLV + ++F
Sbjct: 223 TKDLHSS----LRFLLKIGSMELSKGYPITSLTPTQKELLSPFKTIGLVYIDG----DYF 274
Query: 274 IPTKLATNLSMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIV 333
PTK N + + EG+++++TNF++ A+ S LH +L+ F+ V Y+ P
Sbjct: 275 YPTKSILNFFGK--SNIFQTEGWMLIDTNFKITAFPKSPLHTALLKKFANVTYEFPGFAS 332
Query: 334 GAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEM 393
I+ S A G T + II FL+ N ++ +P V Q +W R+ +
Sbjct: 333 AFISPNSFREALNQGTTLDDIIGFLKSNLSHKIGSG--QIPSAVMKQFYVWRDQRERLTV 390
Query: 394 T 394
T
Sbjct: 391 T 391
>gi|76154655|gb|AAX26095.2| SJCHGC05044 protein [Schistosoma japonicum]
Length = 214
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 52/114 (45%), Positives = 72/114 (63%)
Query: 289 SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENG 348
SS+ G+I++ETNFR+YAY+ S L +L LFSK+ + PNL+V IT++S+ A G
Sbjct: 52 SSSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 111
Query: 349 ITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
ITA QI+SFL NAHP + P +P + DQIRLWE + +R Y+ F
Sbjct: 112 ITANQILSFLTSNAHPDMLLSNPILPPTLTDQIRLWELERDRFVFQEGCLYEHF 165
>gi|47212202|emb|CAF90416.1| unnamed protein product [Tetraodon nigroviridis]
Length = 471
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 114/247 (46%), Gaps = 22/247 (8%)
Query: 4 VKIIAKNFMDMVASLTARDLDRLYENPFICEAILR--------------SLPPLAKKYVM 49
V++ KN + L+ LDRLY +P C A+ R LP L+K +VM
Sbjct: 21 VQLQCKNLHQYLKELSPEVLDRLYNHPATCLAVYRWPGRGPRRPRVLAPELPSLSKNFVM 80
Query: 50 QMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEEKKKET--TYRLNSTFQSN 107
+M +++ +P + WV D H + L LRL+ + + Y LN F+ N
Sbjct: 81 RMLFLEQPLPQAAVALWVGKDSQRDHDECVCVLAGLRLWHSQHLQGGLQGYVLNPVFKEN 140
Query: 108 LRKHLIYGG-ALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSS 166
L+ L+ GG A E G + L+ YA +WE L ++ S A + +
Sbjct: 141 LKTALLGGGRAWAEEGGALGPDRHARDIGSLDRYATERWEVILHFMVGSPCAAVSQDLAQ 200
Query: 167 SMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGIN--QADL 224
++ GL+ E P +T +GFQFLL+DT +QLWY +Y+ +Q+ + L
Sbjct: 201 LLVHA---GLMKSEAGEPPYITSAGFQFLLLDTASQLWYFTLQYLKTAQDYSVEGMSDSL 257
Query: 225 ISFLLEL 231
++FL L
Sbjct: 258 LTFLQHL 264
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 46/71 (64%), Gaps = 1/71 (1%)
Query: 326 YQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWE 385
Y+ PN++V +T+ES+ A NGITA+Q ++F ++ P + P +P + DQIRLWE
Sbjct: 282 YRFPNVVVAQVTRESVQQAIANGITAQQRLTFDLWSS-PTPLSQTPVLPPTITDQIRLWE 340
Query: 386 SDLNRVEMTPA 396
+ +R++ T A
Sbjct: 341 LERDRLQFTEA 351
>gi|85857462|gb|ABC86267.1| RE43632p [Drosophila melanogaster]
Length = 280
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 119/227 (52%), Gaps = 19/227 (8%)
Query: 19 TARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVA 78
T L++LY P IC A+ R LP +A+++++++ ++D +P ++ W + A
Sbjct: 50 TPESLEKLYNYPPICLAVFRELPEIARQFIIRILFVDQPVPQAVVTSWGAQRCAKEQAEA 109
Query: 79 IDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGIT----ARLP 132
L L ++ + + + L+ TF+ ++R+ L+ GG +P P T ++
Sbjct: 110 TSCLTALNVWRVTAIPGGLSAWELSPTFKKSVRQVLLGGG----KPWPMTNTLDKDSKPR 165
Query: 133 TLEDLEAYAIGQWECFLLQLISS-----TQAERPTNFSSSMMKVFQRGLLSRRD-KEAPR 186
+ L+ YA+ +W C L ++ + T AE S +++ L +RD ++
Sbjct: 166 DIAFLDTYAMSRWRCVLHYMVGTGNRNGTDAE---AISPDAVRILLHANLMKRDERDGIT 222
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSF 233
+T GFQFLL+DT AQ+W+ + +Y+ +ERGI+ + +S L +LSF
Sbjct: 223 ITRQGFQFLLLDTRAQVWHFMLQYLDTCEERGISLPECLSMLFQLSF 269
>gi|422294120|gb|EKU21420.1| transcription initiation factor TFIIH subunit 4, partial
[Nannochloropsis gaditana CCMP526]
Length = 213
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 96/181 (53%), Gaps = 26/181 (14%)
Query: 237 GEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSAR---- 292
G Y + LS Q+++++ F GL+ Q R F PT +A NL T R
Sbjct: 1 GRPYPVRALSPTQQALLEKFISFGLI-YQDDRHSRHFYPTAVAVNLIFGGTVQEERLRRG 59
Query: 293 -----KEG---------------FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLI 332
EG ++VETN+++ AY++S LH E+LR+F+ V +LPN++
Sbjct: 60 HGHVGPEGRENTKELRLVDPSQLAVIVETNYQLVAYTSSSLHVEMLRIFTDVRCRLPNVV 119
Query: 333 VGAITKESLYNAFENGITAEQIISFLQQNAHPRV-ADRMPSVPENVCDQIRLWESDLNRV 391
+G IT+ S+ A +GITA I+SFL+ + H V A + +PENV QI LW + +RV
Sbjct: 120 IGFITRASVRRAMASGITAATILSFLKTHTHVAVRAGKGRLLPENVEAQIELWHQERSRV 179
Query: 392 E 392
+
Sbjct: 180 K 180
>gi|387595518|gb|EIJ93142.1| hypothetical protein NEPG_02098 [Nematocida parisii ERTm1]
Length = 398
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 106/217 (48%), Gaps = 22/217 (10%)
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
LT GF FLL QLW ++ +I E N+ I L EL Y +
Sbjct: 155 LTHKGFNFLLTGKKRQLWTLLLAHIQEDPETAENE---ILVLCELLVKDPKRTY---AVD 208
Query: 247 EIQKS-MIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRM 305
QKS ++ F LGL+ ++G K + S+ L D E F+V+E+NFR+
Sbjct: 209 RSQKSKLLGLFESLGLIFFEKG-------LVKFSPTFSL-LFDDEEGAEKFLVLESNFRL 260
Query: 306 YAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 365
Y YS L I+ LFS + PN+IV I ++S+ A +GITA QI +L QN+
Sbjct: 261 YIYSNRPLDTFIISLFSIKSREFPNMIVAMINEDSIRQALMHGITAGQIRVYLNQNS--- 317
Query: 366 VADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
M + ENV +QIRLWE +NR+ ++ + F
Sbjct: 318 ----MYEINENVIEQIRLWEKRMNRIHSWESYIFSNF 350
>gi|443919931|gb|ELU39967.1| Tfb2 domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 302
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 75/268 (27%), Positives = 132/268 (49%), Gaps = 20/268 (7%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRL 82
L+RLY P C A+ R L L ++ V+ + + D +P + + W P+ +
Sbjct: 33 LNRLYSKPSACLAVFRLLRQLDRQLVLNLLWCDSGVPLQGLMGWAKPEAISSQHSP---- 88
Query: 83 VQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPRE---PMPSGITARLPTLEDLEA 139
L + K++ R++ S++ + + G R P+PS ++++ ++E L+
Sbjct: 89 ---SLGIQPAKRDAHDRVSRRATSSVTRRVCDSGEH-RSFGLPVPSRGSSQM-SIEQLDR 143
Query: 140 YAIGQWECFLLQLI-SSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMD 198
A+ WE L ++ +++ A+RP + ++ V L R +T +GFQFLL
Sbjct: 144 KALEHWETILHYMVDTNSSAQRPGGGALHLL-VAGGWLEEGRGGGGHEITSTGFQFLLQS 202
Query: 199 TNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFAD 258
AQLW I+ +Y+ S R ++ A+++SFL LS G+ Y+ + LS Q +M+ D D
Sbjct: 203 PRAQLWDILLQYLHMSDARRMDIAEVLSFLFMLSLMKLGQEYSCDNLSPTQNAMMTDLKD 262
Query: 259 LGLVKLQQGRKENWFIPTKLATNLSMSL 286
G+ F PT+LAT L+ SL
Sbjct: 263 YGI------NANKSFYPTRLATTLTSSL 284
>gi|440295536|gb|ELP88449.1| TFIIH basal transcription factor complex p52 subunit, putative
[Entamoeba invadens IP1]
Length = 531
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 93/377 (24%), Positives = 182/377 (48%), Gaps = 36/377 (9%)
Query: 26 LYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQL 85
+Y+ + I R LP L + ++++ ++ I +E+ FT + I ++ L
Sbjct: 24 IYQTKWSSSGIFRLLPSLCQSLLLRLVFLSSTIDLPTVEK-----QFTITKQEIIHVLSL 78
Query: 86 RLFSEEKKK-----ETTYRLNSTFQSNLRKHLIYGGALPREPMPS---GITARLPTLEDL 137
+ KK +T +++N+ FQ++L++ ++ G +P + + L L
Sbjct: 79 LESVKIIKKVMINGKTQFQMNADFQTHLKESIL-GPMVPNYVVSDCNFSVKDALIKKASL 137
Query: 138 EAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLM 197
A+ +E FL++L++ ++ N S ++F+ L + ++T G+QFL
Sbjct: 138 NN-AVKMFEDFLMKLLTFNESH---NLPKSAFELFKD--LELVNDAGSQITRKGYQFLFQ 191
Query: 198 DTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSM-IKDF 256
+T Q+W ++ +Q R LI+ EL++ YN + ++I + + F
Sbjct: 192 ETKTQIWILMLSIFGPAQ-RKRRMGKLINEFFELTYLAPETCYNADPFTKIYSAAPLILF 250
Query: 257 ADLGLVKLQQGRKENWFIPTKLATNL-SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHC 315
+G++ +++N + T L + L S +L S + + ETN+ +Y Y+ S
Sbjct: 251 DMMGIIAY--SKEKNALVVTPLMSLLRSNALVPSDLVTKPRTITETNYTVYIYTESFFQV 308
Query: 316 EILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPE 375
++ LF + QL NL VG IT +++ AF GIT E +++FLQ P++P+
Sbjct: 309 KLYSLFIRQNLQLTNLCVGRITYDTVTEAFLKGITNEMLVNFLQ-----------PNLPK 357
Query: 376 NVCDQIRLWESDLNRVE 392
N+ QI LW+ +LNR++
Sbjct: 358 NIQAQIDLWKRELNRLK 374
>gi|300707677|ref|XP_002996037.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
gi|239605297|gb|EEQ82366.1| hypothetical protein NCER_100934 [Nosema ceranae BRL01]
Length = 403
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 109/216 (50%), Gaps = 18/216 (8%)
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
+T GF+FLL Q W+++ + I Q ++ELS + Y L
Sbjct: 157 ITNKGFEFLLKPRYDQYWFLIIAALKFYCVDEILQISNFMSIMELSNMLPIYKYKLK--K 214
Query: 247 EIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRMY 306
++ K + LGL+KL+ + T S +K FI++ETNF++Y
Sbjct: 215 DVNKKFYDFLSYLGLIKLEND----------ILTIYHNLFVKSDTKKLRFILLETNFKLY 264
Query: 307 AYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 366
AY++S I++LFS + ++PNLI +IT+ESL NAF G+T++QII+FL+ +
Sbjct: 265 AYTSSVYEMSIIQLFSNIYLKMPNLIKASITEESLSNAFSKGVTSQQIINFLKSYS---- 320
Query: 367 ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
+P + QI +WE+ R+++ P + Y F
Sbjct: 321 --LFEDIPVAIISQIIIWETKRKRIKIFPGYLYSNF 354
>gi|258577869|ref|XP_002543116.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237903382|gb|EEP77783.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 311
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 84/274 (30%), Positives = 132/274 (48%), Gaps = 22/274 (8%)
Query: 25 RLYENPFICEAILRS-LPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+LY+ P AI R LP LAK +VM + Y+ +PA +E WV + + A+ L
Sbjct: 22 KLYQQPSTALAIFRRMLPHLAKCFVMALLYLKDPLPAAELELWVKTESKRERDNALSILS 81
Query: 84 QLRLFSEEKKKET--TYRLNSTFQSNLRKHLIYG------GALPREPMPSGITARLPTLE 135
+L + S + + + F S+LR+ L G G + P P +T +
Sbjct: 82 RLHILSSTTTANHVRAFMVTNPFSSSLRQALTGGDKQQSFGVISTTPDPQPMT-----IA 136
Query: 136 DLEAYAIGQWECFLLQLISST----QAERPTNFSSSMMKVFQR-GLLSRRDKEAPRLTES 190
DL+ YA QWE L ++ + Q E T S + + Q L+ RD+ +T+
Sbjct: 137 DLDDYARRQWEGVLGYMVGTNTLGIQRESVT-LSKGVKSLLQACHLVEVRDRRV-EITKE 194
Query: 191 GFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQK 250
GF F+L D N Q+W+I+ Y+ N++ G++ +++SFL LS G++Y LS Q
Sbjct: 195 GFAFVLQDVNTQVWHILVLYVENAEAIGMDSVEVLSFLFLLSSLELGQSYEKQHLSSTQL 254
Query: 251 SMIKDFADLGLVKLQQGRKE-NWFIPTKLATNLS 283
+ D D G+V E F PT+LAT L+
Sbjct: 255 RTLADLTDFGIVYQHSPASEATRFYPTRLATTLT 288
>gi|380492949|emb|CCF34234.1| RNA polymerase II transcription factor B subunit 2 [Colletotrichum
higginsianum]
Length = 343
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 144/291 (49%), Gaps = 26/291 (8%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRSL-PPLAKKYVMQMFYIDGAIPAKMMEEWVLPDG 71
D + L +LY P AI R + PPLAK VM + Y+ + + ++ WV P+
Sbjct: 13 DYLEKLPGTTFRKLYLQPSTAFAIFRRMIPPLAKTIVMAILYMPKPMLLEDLDVWVKPES 72
Query: 72 FTKHRVAIDRL-----VQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGG-----ALPRE 121
+ AI L +Q+ + S+E+ +E +L + F+S+LR L GG +P
Sbjct: 73 RRQKDQAISTLRSLHILQITVPSKERPQE--MQLTTNFKSSLRLALEGGGTHNSFGVPSS 130
Query: 122 -PMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERP--TNFSSSMMKVFQRGLLS 178
P+P IT L+ YA +WE L ++++ SS+ + G L
Sbjct: 131 LPVPPEITVPF-----LDRYARRKWEDILHYIVNTVNPGGADLGGPKSSVKNLLVAGQLV 185
Query: 179 RRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQER----GINQADLISFLLELSFH 234
RR A +T++GF FLL + NAQ+W ++ ++ + G+ D++SFL L+
Sbjct: 186 RRQGSAVGITQAGFTFLLQEANAQVWTLLLLWLEATDHAEDAAGMESTDMLSFLFLLASL 245
Query: 235 VAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS 285
G AY+ N L+E +++M+ D GL+ + + + +F PT+LAT L+ S
Sbjct: 246 ELGRAYDTNALTEARRNMLPSLLDFGLIYIPSHKPQQYF-PTRLATTLTSS 295
>gi|387592902|gb|EIJ87926.1| hypothetical protein NEQG_01998 [Nematocida parisii ERTm3]
Length = 398
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 106/217 (48%), Gaps = 22/217 (10%)
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
LT GF FLL QLW ++ +I E N+ I L EL Y +
Sbjct: 155 LTHKGFNFLLTGKKRQLWTLLLAHIQEDPETAENE---ILVLCELLVKDPKRTY---AVD 208
Query: 247 EIQKS-MIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRM 305
QKS ++ F LGL+ ++G K + S+ L D E F+V+E+NFR+
Sbjct: 209 RSQKSKLLGLFESLGLIFFEKG-------LVKFSPTFSL-LFDDEEGAEKFLVLESNFRL 260
Query: 306 YAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPR 365
Y YS L I+ LFS + PN++V I ++S+ A +GITA QI +L QN+
Sbjct: 261 YIYSNRPLDTFIISLFSIKSREFPNMMVAMINEDSIRQALMHGITAGQIRVYLNQNS--- 317
Query: 366 VADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
M + ENV +QIRLWE +NR+ ++ + F
Sbjct: 318 ----MYEINENVIEQIRLWEKRMNRIHSWESYIFSNF 350
>gi|402465832|gb|EJW01474.1| hypothetical protein EDEG_03937 [Edhazardia aedis USNM 41457]
Length = 465
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 132/260 (50%), Gaps = 28/260 (10%)
Query: 149 LLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVR 208
+L LI Q E P N + F R L + +T +GF FLL QLW ++
Sbjct: 179 ILHLIVD-QQETPKNEEILKLLTFSRIL-----NKNLEITNTGFDFLLKTRKEQLWILLI 232
Query: 209 ---EYISNSQERGINQADLISFLLELSFHVAGEAYNL-NTLSEIQKSMIKDFADLGLVKL 264
+Y+ ++ + I + D I + EL ++Y + N S IQ + I + LGL L
Sbjct: 233 TTIKYLKSTMK--IEEKDFILLIFELCEKRPYKSYKIINCPSSIQ-TCINYMSKLGL--L 287
Query: 265 QQGRKENWFI--PTKLATNLSMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFS 322
+ K N I + L NL ++ E F+ +ETNF++YAY+TSK IL LFS
Sbjct: 288 ENSIKNNTVILSISPLFINL---FEENLVLSESFMYIETNFKLYAYTTSKYDFSILSLFS 344
Query: 323 KVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIR 382
K+ +LPNLI I ++S+ AF+ I+A+QI +L+ + +VP+NV +Q+
Sbjct: 345 KISCKLPNLISAIINEDSVNTAFDKKISAKQISYYLKS--------KGKNVPKNVVEQVY 396
Query: 383 LWESDLNRVEMTPAHYYDEF 402
+WES NR++ + + F
Sbjct: 397 IWESKRNRIKTSECTLFKGF 416
>gi|361130709|gb|EHL02459.1| putative RNA polymerase II transcription factor B subunit 2 [Glarea
lozoyensis 74030]
Length = 210
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 96/175 (54%), Gaps = 32/175 (18%)
Query: 252 MIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKE----------------G 295
M+++ D GLV + K+ +F PT+LAT L+ +D+SA + G
Sbjct: 1 MLENLVDFGLVYIPPSVKDQFF-PTRLATTLT---SDASALRSITAGFESALSTGSGTAG 56
Query: 296 FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQII 355
FI++ETN+ S L +L LF+K+ + PN++ G +T+ES+Y A +GI+++QII
Sbjct: 57 FIIIETNYH------SPLQIAVLALFTKLTTRYPNMVTGRVTRESVYRAVSSGISSDQII 110
Query: 356 SFLQQNAHPRV------ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
++L +AHP + P +P V DQIRLW+ + R++ T + +F S
Sbjct: 111 TYLSTHAHPELLKASAAKGGGPVLPPTVVDQIRLWQIENERMKTTYGFLFKDFES 165
>gi|71033737|ref|XP_766510.1| hypothetical protein [Theileria parva strain Muguga]
gi|68353467|gb|EAN34227.1| hypothetical protein, conserved [Theileria parva]
Length = 359
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/191 (29%), Positives = 108/191 (56%), Gaps = 19/191 (9%)
Query: 225 ISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRK----ENWFIPTKLAT 280
+ LL LS G+ ++ L++ Q +++ +LG+V + +K + +I K T
Sbjct: 83 VELLLSLSQASCGDCFSTEGLTKTQLRLLRLLNELGIVYYKNPKKFYLYDLSYIVGKTNT 142
Query: 281 NLSMSLTDS-----SARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGA 335
N + + A + I+V++NF++Y Y+ S L +L +++ + PNL+VG
Sbjct: 143 NTGLPTSKDLDLSIKAGNDSRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVGV 202
Query: 336 ITKESLYNAFENGITAEQIISFLQQNA-HPRVADR---------MPSVPENVCDQIRLWE 385
+T+ES+ +AF++GIT+++II FL N + + ++ S+PENVC Q+++WE
Sbjct: 203 LTRESVQSAFKSGITSKEIIRFLSPNGMNSSIGNQENTLLNSTFTYSIPENVCRQLKMWE 262
Query: 386 SDLNRVEMTPA 396
S+ +R+E+ P+
Sbjct: 263 SERDRIELCPS 273
>gi|378754852|gb|EHY64880.1| hypothetical protein NERG_01936 [Nematocida sp. 1 ERTm2]
Length = 396
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 67/216 (31%), Positives = 104/216 (48%), Gaps = 20/216 (9%)
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
LT GF FLL Q+W +V +I QE + + + + E Y ++
Sbjct: 153 LTHKGFNFLLTGRKNQMWTLVLAHI---QEDLATRQEEVLVMCEFLVKDPKRMYAIDV-- 207
Query: 247 EIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRMY 306
+ M+ F LGL+ ++G + ++ S+ L D E F+ +E+NFR+Y
Sbjct: 208 SYRSKMLDLFESLGLITFERG-------LVRFSSTFSL-LFDDEEGGEKFLTLESNFRLY 259
Query: 307 AYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 366
YS L I+ LFS + PN+IV I ++S+ A GITA QI +L QN+
Sbjct: 260 IYSNRPLDVFIISLFSIKIREFPNMIVAMINEDSIRQALTYGITAGQIRVYLNQNS---- 315
Query: 367 ADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
M + ENV +QIRLWE +NR+ ++ + F
Sbjct: 316 ---MHKINENVLEQIRLWEKRMNRIHAWESYIFSNF 348
>gi|84998132|ref|XP_953787.1| TFIIH subunit, Tf2b [Theileria annulata]
gi|65304784|emb|CAI73109.1| TFIIH subunit, Tf2b homologue, putative [Theileria annulata]
Length = 645
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 109/194 (56%), Gaps = 25/194 (12%)
Query: 225 ISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRK----ENWFIPTKLAT 280
+ LL LS G+ ++ L++ Q +++ +LG+V + +K + +I K T
Sbjct: 370 VELLLSLSQASCGDCFSTEGLTKTQIRLLRLLNELGIVYYKNPKKFYLYDLSYIVGKTNT 429
Query: 281 N--------LSMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLI 332
N L +S+ A + I+V++NF++Y Y+ S L +L +++ + PNL+
Sbjct: 430 NSVLPNSKDLDISI---KAGNDSRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLV 486
Query: 333 VGAITKESLYNAFENGITAEQIISFLQQNA-HPRVADR---------MPSVPENVCDQIR 382
VG +T+ES+ +AF++GIT+++II FL N + + + S+PENVC Q++
Sbjct: 487 VGVLTRESVQSAFKSGITSKEIIRFLSPNGMNSSIGSQENTLLNSSFTYSIPENVCRQLK 546
Query: 383 LWESDLNRVEMTPA 396
+WES+ +R+E+ P+
Sbjct: 547 MWESERDRIELCPS 560
>gi|403221352|dbj|BAM39485.1| transcription factor IIH subunit [Theileria orientalis strain
Shintoku]
Length = 634
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 57/192 (29%), Positives = 106/192 (55%), Gaps = 23/192 (11%)
Query: 225 ISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLA----- 279
+ LL LS G+ ++ L++ Q +++ +LG+V + N F L+
Sbjct: 342 VELLLALSQSKCGDCFSTENLTKTQMRLVRLLNELGIVYYKNA---NRFYIYDLSYIIGK 398
Query: 280 TNLSMSL-----TDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG 334
+N+++ L + +A E I+V++NF++Y Y+ S L +L +++ + PNL+VG
Sbjct: 399 SNINVDLFKEFDVNKTAGNESRIIVQSNFKVYVYTASPLQISVLSHLCELQARTPNLVVG 458
Query: 335 AITKESLYNAFENGITAEQIISFLQ--------QNAHPRVADRMPS--VPENVCDQIRLW 384
+T+ES+ +AF++GIT+ +II FL D + +PENVC Q+++W
Sbjct: 459 VLTRESVQSAFKSGITSNEIIRFLSPMKLSLSYAGTCNSFMDSFANYKIPENVCRQLKMW 518
Query: 385 ESDLNRVEMTPA 396
ES+ +R+E++PA
Sbjct: 519 ESERDRIELSPA 530
>gi|167535774|ref|XP_001749560.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771952|gb|EDQ85611.1| predicted protein [Monosiga brevicollis MX1]
Length = 417
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 77/294 (26%), Positives = 137/294 (46%), Gaps = 22/294 (7%)
Query: 101 NSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAER 160
S F +LR L P + MP A T LE ++ WE L + ++ + +
Sbjct: 93 GSVFLDSLRTALAGRTDAPDQAMPPPKKASEVTPAFLEEHSRVSWEAILSYMTTNAELK- 151
Query: 161 PTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN-SQER-- 217
+S+M+ + Q L D+ +T GFQFLL D Q+W ++ ++ S+E
Sbjct: 152 ----NSTMILILQEAQLIVNDQPPISVTPEGFQFLLQDRANQVWLLIVTFLKLLSREEPI 207
Query: 218 ---GINQADLISFLLELSFHVAGEA----YNLNTLSEIQKSMIKDFADLGLVKLQQGRKE 270
G+ + + L + E ++ L+ +Q + ++ +G+V + RK
Sbjct: 208 PGEGLPMHEALVLLFRIGLGRLNEVSVLEFSAENLTTMQITFLRQLHYMGVV-YHRNRKS 266
Query: 271 NWFIPTKLATNLSMSLTDSSARKE-GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLP 329
F T A L + + ++ E GF++ ETNF +YAY+ S + +L F+ + Y+LP
Sbjct: 267 KRFFLTPYAVMLYHNAELALSQTETGFLLAETNFHVYAYTDSAVKVALLSKFATLTYRLP 326
Query: 330 NLIVGAITKESLYNAFENGITAEQIISFLQQNA-----HPRVADRMPSVPENVC 378
+ IT+ S+ A GITA+QI+ F+Q+++ H A P+ P++
Sbjct: 327 IMTTAIITRTSVRRALSQGITADQILRFMQRSSLLLRHHDETAPTRPTSPQHAT 380
>gi|66358470|ref|XP_626413.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
II]
gi|46227864|gb|EAK88784.1| possible transcription factor TFIIH [Cryptosporidium parvum Iowa
II]
Length = 709
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 86/143 (60%), Gaps = 9/143 (6%)
Query: 269 KENWFIPTKLATNLSMS----LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKV 324
+E W T +N+ ++ L+ +S R E I+V++NFR+Y Y+ S L +ILR +V
Sbjct: 496 EELWHKSTTETSNVKINEPNCLSMNSTRLEAGIIVQSNFRIYCYTASPLQAKILRHLCQV 555
Query: 325 EYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRM-----PSVPENVCD 379
+ + PN+I G +T++ L +A+ G++AEQI+ F NAHP + R +P NV
Sbjct: 556 KVRGPNIICGILTRKGLLSAYSMGVSAEQILRFFSSNAHPIILRRFMLEGTSIIPVNVET 615
Query: 380 QIRLWESDLNRVEMTPAHYYDEF 402
Q++LWE D NR++++ A + ++
Sbjct: 616 QLKLWEKDKNRLKISHASTFSDW 638
>gi|221482097|gb|EEE20458.1| tfiih, polypeptide, putative [Toxoplasma gondii GT1]
Length = 861
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
E ++V++NF++Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GITA+Q
Sbjct: 672 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 731
Query: 354 IISFLQQNAHPRVADR-----MPSVPENVCDQIRLWESDLNRVEMTPA 396
II FL+ +AHP V +R P +PENV Q+R+WE++ R+ + PA
Sbjct: 732 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 779
>gi|401410644|ref|XP_003884770.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
gi|325119188|emb|CBZ54742.1| hypothetical protein NCLIV_051680 [Neospora caninum Liverpool]
Length = 868
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
E ++V++NF++Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GITA+Q
Sbjct: 679 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 738
Query: 354 IISFLQQNAHPRVADR-----MPSVPENVCDQIRLWESDLNRVEMTPA 396
II FL+ +AHP V +R P +PENV Q+R+WE++ R+ + PA
Sbjct: 739 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 786
>gi|237842971|ref|XP_002370783.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
gi|211968447|gb|EEB03643.1| hypothetical protein TGME49_014520 [Toxoplasma gondii ME49]
Length = 836
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
E ++V++NF++Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GITA+Q
Sbjct: 647 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 706
Query: 354 IISFLQQNAHPRVADR-----MPSVPENVCDQIRLWESDLNRVEMTPA 396
II FL+ +AHP V +R P +PENV Q+R+WE++ R+ + PA
Sbjct: 707 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 754
>gi|221502519|gb|EEE28246.1| tfiih, polypeptide, putative [Toxoplasma gondii VEG]
Length = 836
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 44/108 (40%), Positives = 74/108 (68%), Gaps = 5/108 (4%)
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
E ++V++NF++Y Y+ S L +L +++ ++PNLIVG +T+ S+ A+++GITA+Q
Sbjct: 647 EAGMLVQSNFKVYVYTASALQLSVLSHLCELQARMPNLIVGILTRASVLAAYKSGITADQ 706
Query: 354 IISFLQQNAHPRVADR-----MPSVPENVCDQIRLWESDLNRVEMTPA 396
II FL+ +AHP V +R P +PENV Q+R+WE++ R+ + PA
Sbjct: 707 IIRFLEAHAHPVVLERKLRTNAPLLPENVTIQLRMWEAERMRLSLYPA 754
>gi|429962184|gb|ELA41728.1| hypothetical protein VICG_01232 [Vittaforma corneae ATCC 50505]
Length = 449
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 113/244 (46%), Gaps = 41/244 (16%)
Query: 186 RLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTL 245
++T GF+FLL+ QLW+++ I + ++++ L E+ Y ++
Sbjct: 172 QITNKGFEFLLLARKDQLWFLIVNAIKYHSRDSLEESEMFISLAEILMKRRCGPYICSSF 231
Query: 246 SEIQKSMIKDFAD-LG-LVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEG-------- 295
S F D +G L + Q +++ + + + L DS+ G
Sbjct: 232 SSWHS-----FLDSIGVLFVISQDNNLVFYVNNSVLYD-KVPLADSAMTHHGTAGIDDIY 285
Query: 296 -----------------FIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITK 338
FIV+ETNF++YAY+++ +L LFSK Y PNLI +
Sbjct: 286 DIDYTKPNIPINKASNKFIVLETNFKIYAYTSTAYDKSVLSLFSKTVYVFPNLIKACFDE 345
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHY 398
ESL +AF GITA+QII +LQ+++ VP+N+ +QI +WE +R+ +
Sbjct: 346 ESLLSAFNKGITAKQIIKYLQEHS--------EEVPKNIVNQISIWEHRQHRIRARNGYL 397
Query: 399 YDEF 402
Y +F
Sbjct: 398 YHDF 401
>gi|207340606|gb|EDZ68904.1| YPL122Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 226
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 75/127 (59%), Gaps = 18/127 (14%)
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
+G ++VETNF++Y+YS S L +L LF ++ + N+++G IT+ES+ A NGITA+Q
Sbjct: 49 DGSLIVETNFKIYSYSNSPLQIAVLSLFVHLKARFVNMVLGQITRESIRRALTNGITADQ 108
Query: 354 IISFLQQNAHPRV------------------ADRMPSVPENVCDQIRLWESDLNRVEMTP 395
II++L+ +AHP++ + + +P V DQIRLW+ +L+RV
Sbjct: 109 IIAYLETHAHPQMRRLAEEKLEKKLELDPNCKEPLQVLPPTVVDQIRLWQLELDRVITYE 168
Query: 396 AHYYDEF 402
Y +F
Sbjct: 169 GSLYSDF 175
>gi|241654873|ref|XP_002411342.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
gi|215503972|gb|EEC13466.1| transcription factor IIH (TFIIH), putative [Ixodes scapularis]
Length = 218
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 107/209 (51%), Gaps = 26/209 (12%)
Query: 17 SLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHR 76
+L+ LD+LY +P C A+ R LP +++ Y+M++ ++D +P ++ W +H
Sbjct: 21 TLSPATLDQLYTHPATCLAVFRELPIISRHYIMRLMFVDQPVPQAVVSSWNEQKYVKEHL 80
Query: 77 VAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTL 134
+++ L L ++++ + L+ F+ N++ L+ GG +P A TL
Sbjct: 81 ESLEALTALHIWADSSLPGGLPGWSLSGVFRKNIQIALLGGG----QPW-----AVYSTL 131
Query: 135 ED---------LEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKV-FQRGLLSRRDKE- 183
E L+ YA+ +WEC L ++ E S+ +++ GL+ ++E
Sbjct: 132 EKDKHGRDAQFLDRYAMERWECVLHFMVGCHTKE---GISADAVRILLHAGLMKSEEEEG 188
Query: 184 -APRLTESGFQFLLMDTNAQLWYIVREYI 211
AP +T GFQFLLMDT +Q+W+ V +Y+
Sbjct: 189 SAPLITMEGFQFLLMDTASQVWHFVLQYL 217
>gi|207340600|gb|EDZ68899.1| YPL122Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 226
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/207 (28%), Positives = 108/207 (52%), Gaps = 6/207 (2%)
Query: 24 DRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLV 83
+RLY +P C AI R LPPLAK ++M M + + +P +++WV +G + + AI +
Sbjct: 23 NRLYTSPATCLAIYRILPPLAKFFIMAMVFNENEVPLLDLDKWVNSNGKLQFQNAIKSMK 82
Query: 84 QLRLFSEEKKKET-TYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAI 142
L L K T LN TF+ +LR L GG + + + + +L+ L+ Y+
Sbjct: 83 SLHLLIPNKSSGTLMINLNPTFKISLRNALT-GGEV-QNSFGVVVEENVVSLDLLDEYSA 140
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDTNA 201
+WE L ++ + A+ P+ +++K L+ + ++T GFQFLL + N+
Sbjct: 141 NKWETILHFMVGTPLAKIPSEKVLNLLK--HSKLMEEVNSTGEFKITNEGFQFLLQEINS 198
Query: 202 QLWYIVREYISNSQERGINQADLISFL 228
QLW ++ +Y+ + ++ D++ F+
Sbjct: 199 QLWTLLLQYLKMIETSKMDLVDVLHFI 225
>gi|194390490|dbj|BAG62007.1| unnamed protein product [Homo sapiens]
Length = 220
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 63/218 (28%), Positives = 108/218 (49%), Gaps = 7/218 (3%)
Query: 49 MQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQS 106
M+M +++ +P + WV + + L LR++ + LN F+
Sbjct: 1 MRMLFLEQPLPQAAVALWVKKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPIFRQ 60
Query: 107 NLRKHLIYGG-ALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFS 165
NLR L+ GG A + G + L+ YA +WE L ++ S A + +
Sbjct: 61 NLRIALLGGGKAWSDDTSQLGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSAAVSQDLA 120
Query: 166 SSMMKVFQRGLL-SRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADL 224
+ Q GL+ S E P +T +GFQFLL+DT AQLWY + +Y+ +Q RG++ ++
Sbjct: 121 QLLS---QAGLMKSTEPGEPPCITSAGFQFLLLDTPAQLWYFMLQYLQTAQSRGMDLVEI 177
Query: 225 ISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLV 262
+SFL +LSF G+ Y++ +S+ + ++ + GLV
Sbjct: 178 LSFLFQLSFSTLGKDYSVEGMSDSLLNFLQHLREFGLV 215
>gi|428673252|gb|EKX74165.1| conserved hypothetical protein [Babesia equi]
Length = 668
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 98/174 (56%), Gaps = 23/174 (13%)
Query: 237 GEAYNLNTLSEIQKSMIKDFADLGLVKLQQ--------------GRKENWFIPTKLATNL 282
G++++ +L++ Q +++ +LG+V + G K +P+ ++
Sbjct: 422 GDSFSTKSLTKSQFRILRLLYELGIVYYKSIKGPFYVLDLSFIVGPKN--LVPSNSPLSV 479
Query: 283 SMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLY 342
S++ +S I+V++NF++Y Y+ + L +IL + +V+ + PN++VG +T+ES
Sbjct: 480 HTSISFTSEYLPSKIIVQSNFKVYVYTVNNLQFDILNILCEVQARTPNMVVGVLTRESAQ 539
Query: 343 NAFENGITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPA 396
AF++GIT+ +II F + + PENV Q+R+WE++ NRVE++PA
Sbjct: 540 RAFKSGITSHEIIRFFS-------STNTSTFPENVIRQLRMWEAERNRVELSPA 586
>gi|355693906|gb|AER99491.1| proteinral transcription factor IIH, polypeptide 4, 52kDa [Mustela
putorius furo]
Length = 143
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 40/96 (41%), Positives = 64/96 (66%)
Query: 311 SKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRM 370
S+L ++ LFS++ Y+ PN++V +T+ES+ A +GITA+QII FL+ AHP + +
Sbjct: 2 SELQIALIALFSEMLYRFPNMVVAQVTRESVQQAIASGITAQQIIHFLRTRAHPVMLKQT 61
Query: 371 PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSRV 406
P +P + DQIRLWE + +R+ T Y++F S+V
Sbjct: 62 PVLPPTITDQIRLWELERDRLRFTEGVLYNQFLSQV 97
>gi|429965574|gb|ELA47571.1| hypothetical protein VCUG_00894 [Vavraia culicis 'floridensis']
Length = 424
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 128/274 (46%), Gaps = 55/274 (20%)
Query: 159 ERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIV--------REY 210
+R + ++ V Q L KE +T SGF+FLL + Q+W+++ + Y
Sbjct: 132 DRNAWVNEVVLDVLQYAKLINVKKE---ITHSGFEFLLKNRKEQMWFLILNGILMLGKSY 188
Query: 211 IS---------NSQER----GINQADLISFLLELSFHVA----GEAYNLNTLSEIQKSMI 253
+ ER +N+A + + E S + GE +I +
Sbjct: 189 VDWLIMIVFEMGMYERYRWLKVNEAKIFCDVCEHSNNTGSVGGGEE------GDILFRFL 242
Query: 254 KDFADLGLVKLQQGR---KENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRMYAYST 310
K + LG+++ E +F+ L S++ ++ F+VVETN+++YAY+T
Sbjct: 243 KYLSYLGIIQHSNDMVKPTEEYFL-----------LFSSASTQDSFLVVETNYKLYAYTT 291
Query: 311 SKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRM 370
S I++LF ++ +LPNL+V IT+ES+ AF GIT +QI+ +L + +
Sbjct: 292 SSHELSIIKLFCQIIRELPNLVVAHITEESVNAAFVKGITGQQIVDYLNEKSRSE----- 346
Query: 371 PSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPS 404
+P V +QI +WE +R++ A Y F +
Sbjct: 347 --LPVVVLEQILIWERKRDRMKCMDAVIYSHFMT 378
>gi|440493551|gb|ELQ76006.1| RNA polymerase II transcription initiation/nucleotide excision
repair factor TFIIH, subunit TFB2 [Trachipleistophora
hominis]
Length = 426
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 122/265 (46%), Gaps = 39/265 (14%)
Query: 159 ERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISNSQERG 218
+R + ++ V Q L KE +T SGF+FLL Q+W+++ I +
Sbjct: 132 DRNAKVNEVVLDVLQYAKLVNIIKE---ITHSGFEFLLKSRKEQMWFLILNGILMLGKNY 188
Query: 219 INQADLISFLL------------ELSFHVAGEAYNLNT---------LSEIQKSMIKDFA 257
++ +I F + E +N NT + ++K +
Sbjct: 189 VDWLTMIVFEMGMYERHCWLKINESKIFGTARKHNKNTNIDPVSDKEAHSVLVRLLKYLS 248
Query: 258 DLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRMYAYSTSKLHCEI 317
LG+++ + PT L S+A ++ F++VETN+++YAY+TS I
Sbjct: 249 YLGIIQYSNDMVK----PTAEY----FLLFSSAAIQDSFLIVETNYKLYAYTTSSHELSI 300
Query: 318 LRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVPENV 377
++LFS++ +LPNL+ IT+ES+ AF GIT +QI+ +L + + +P V
Sbjct: 301 IKLFSQIVRELPNLVTAHITEESVNAAFLKGITGQQIVDYLTEKSKSE-------LPPVV 353
Query: 378 CDQIRLWESDLNRVEMTPAHYYDEF 402
+QI +WE +R++ A Y F
Sbjct: 354 LEQILIWERQRDRMKCIDAVIYSHF 378
>gi|399949686|gb|AFP65344.1| tfIIB-like protein [Chroomonas mesostigmatica CCMP1168]
Length = 464
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 41/100 (41%), Positives = 64/100 (64%), Gaps = 4/100 (4%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
I++E+NFR+Y Y + L ++L +FS++ YQLPNL VG IT++S+ AF +G T+ II
Sbjct: 329 IIIESNFRIYVYKNAHLGNDLLLVFSEILYQLPNLFVGEITEKSILKAFNSGSTSRNIIG 388
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPA 396
FL+ N HP P +P V +QI+ WE +++T +
Sbjct: 389 FLKNNLHPIC----PWIPSAVTNQIKAWEFQKTEIKITES 424
>gi|60603004|gb|AAX27959.1| unknown [Schistosoma japonicum]
Length = 100
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/76 (52%), Positives = 54/76 (71%)
Query: 289 SSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENG 348
SS+ G+I++ETNFR+YAY+ S L +L LFSK+ + PNL+V IT++S+ A G
Sbjct: 8 SSSSDVGYILLETNFRLYAYTDSPLRTALLSLFSKIRARFPNLVVADITRDSVREALIRG 67
Query: 349 ITAEQIISFLQQNAHP 364
ITA QI+SFL NAHP
Sbjct: 68 ITANQILSFLTSNAHP 83
>gi|389586531|dbj|GAB69260.1| hypothetical protein PCYB_146880 [Plasmodium cynomolgi strain B]
Length = 913
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 67/100 (67%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
++V++NF++Y Y+ S L IL +++ + PN++VG +T+ S+ NA+ + ITA QII
Sbjct: 733 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 792
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPA 396
FL+ AHP ++ S+P NV Q++LWES+ +R+ + A
Sbjct: 793 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDA 832
Score = 45.4 bits (106), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 39/159 (24%), Positives = 66/159 (41%), Gaps = 18/159 (11%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGF 72
D + L + L+E+ EAILRSL L + + ++ +I + + M W+ P+
Sbjct: 15 DYMKELNEKIWSYLFEDSLAHEAILRSLNELEQIIISRLLFIQQVVSERAMRLWINPNSL 74
Query: 73 TKHRVAIDRLVQLRLFSEEKKKETTYR------------LNSTFQSNLRKHLIYGGALPR 120
K I LV+ ++ E + K+ Y LN +++ K I+ +
Sbjct: 75 KKLSECIKNLVEAKILVESETKKDNYNQYKINEKFRHTMLNKIYKNREDKIFIFNNN-AK 133
Query: 121 EPMPSGITAR----LPTLEDLEAYAIGQWECFLLQLISS 155
E M PT E++ YA +W LL I+S
Sbjct: 134 EQMEKEKNLHEKNLYPTKEEIANYAQMRWNN-LLHFIAS 171
>gi|209880858|ref|XP_002141868.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209557474|gb|EEA07519.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 794
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 41/114 (35%), Positives = 68/114 (59%), Gaps = 5/114 (4%)
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQ 353
E I+V++NFR+Y Y+ S L +ILR +V+ + PN+I G +T+ L +A+ G+ A Q
Sbjct: 603 EAGIIVQSNFRVYCYTASPLQAKILRHLCQVKVRGPNIICGVLTRRGLLSAYSMGVKAYQ 662
Query: 354 IISFLQQNAHP-----RVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
I+ F NAHP + D +P +V Q++LWE+D NR+E+ + ++
Sbjct: 663 ILRFFISNAHPIILKKHITDGTSIIPISVETQLKLWENDHNRLEINKVSLFSDW 716
>gi|221061889|ref|XP_002262514.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
gi|193811664|emb|CAQ42392.1| hypothetical protein, conserved in Apicomplexan species [Plasmodium
knowlesi strain H]
Length = 943
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 39/100 (39%), Positives = 67/100 (67%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
++V++NF++Y Y+ S L IL +++ + PN++VG +T+ S+ NA+ + ITA QII
Sbjct: 763 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 822
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPA 396
FL+ AHP ++ S+P NV Q++LWES+ +R+ + A
Sbjct: 823 FLESYAHPGRSNFKSSIPVNVITQLKLWESERHRLTLDDA 862
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 25/96 (26%), Positives = 47/96 (48%), Gaps = 3/96 (3%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGF 72
D + L+ + L+E+ EAIL+SL L + + ++ +I + + M W+ P+
Sbjct: 15 DYMKELSEKIWTYLFEDSLAHEAILQSLNELEQIIISRLLFIQQVVSERAMRLWINPNSL 74
Query: 73 TKHRVAIDRLVQLRLFSEEKKKETT---YRLNSTFQ 105
K I LV+ ++ E + K+ YR+N F+
Sbjct: 75 KKLSECIKNLVEAKILVESETKKDNYNQYRINEKFR 110
>gi|156086630|ref|XP_001610724.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797977|gb|EDO07156.1| conserved hypothetical protein [Babesia bovis]
Length = 586
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/185 (28%), Positives = 101/185 (54%), Gaps = 17/185 (9%)
Query: 237 GEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKE------NWFIPTKLAT-----NLSMS 285
G+ + L+ + Q+ +++ +LGLV + ++ + K A +SMS
Sbjct: 328 GDTFRLDNPTLSQQRILRFLIELGLVYADSDGSKIYVTNLSFLVDVKHANLVEEHQISMS 387
Query: 286 LTDSSARKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAF 345
+ + K +VV++NF++YAY S L +L +++ + PNL++G +T+ SL AF
Sbjct: 388 VCGNKGSK---MVVQSNFKIYAYIPSALQMNVLNHICELQAKTPNLVIGVLTRSSLQTAF 444
Query: 346 ENGITAEQIISFLQQNAH---PRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
++GITA+Q+I F + ++ + +VPENV Q+++WE++ NR+E+ A + +
Sbjct: 445 KSGITADQLICFFESKGQYDDIQINRNVMNVPENVRRQLKMWEAERNRLELLNAVVFKRW 504
Query: 403 PSRVI 407
+ I
Sbjct: 505 DNDFI 509
>gi|449017614|dbj|BAM81016.1| similar to TFIIH subunit TFB2 [Cyanidioschyzon merolae strain 10D]
Length = 693
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 42/106 (39%), Positives = 66/106 (62%), Gaps = 5/106 (4%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
I+VETNFR+YAY+ S +L LF+++ Y++P + +G IT++S+ A + GITA+Q++
Sbjct: 538 IIVETNFRLYAYAASSFQVALLSLFTRILYRMPGVAIGVITRDSVRRALKCGITAKQLLH 597
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
FL ++ SVP NV DQI LWE + R++ P + F
Sbjct: 598 FLGIHSM-----EGKSVPFNVHDQILLWELERKRIQAYPGVLLEGF 638
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 84/363 (23%), Positives = 135/363 (37%), Gaps = 104/363 (28%)
Query: 10 NFMDMVASLTARDLDRLYENPFICE----AILRSLPPLAKKYVMQMFYIDG---AIPAKM 62
+ +D V L R +D L+ C+ ILR+LPPLAK + + + IPA +
Sbjct: 27 SLVDYVCRLDERSIDMLFT---ACDLTVFGILRALPPLAKIILFRFLALSDYVQEIPAHI 83
Query: 63 MEEWV--------------------------LPDGFTKHRVAIDRLVQL--------RLF 88
+ W+ PD + + + R++ + R
Sbjct: 84 VTAWIADPRTEERNDPVWGHTSRTSAKSTSAFPDSERERNLIVSRILHVLTRLRFLRRTV 143
Query: 89 SEEKKKET-----TYRLNSTFQSNLRKHL---------IYGGALPREPMPSGITA----- 129
E+ T TY L + +L ++L + G PR S + A
Sbjct: 144 QEQPSNGTSQELVTYALPPAIREHLLRYLWATTNSQENLEGACTPRRSSASSVDASTRAS 203
Query: 130 -----------------RLPT---LEDLEAYAIGQWECFLLQLISSTQAE----RPTNFS 165
R+PT L Y WE L +I AE +PT
Sbjct: 204 WIRLLEPFQRIANASDGRIPTRLTAPQLREYTRHAWEHVLHFVIGIPVAEPEIRQPTGPE 263
Query: 166 SSMMKVFQR------------GLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN 213
+ V + G+ + + R+TE GF +LL+D +AQLW ++ Y+ N
Sbjct: 264 HTPEHVMEPPSDEVIEALLSLGVSEQVGSDGMRITERGFSYLLLDIHAQLWALLEAYMEN 323
Query: 214 SQERGINQADLISFLLELSFHVAGEAYNLN--TLSEIQKSMIKDFADLGLVKL---QQGR 268
E ++ +L L L F V G Y+ +LS Q M+ A++GLV L +Q R
Sbjct: 324 WDEARGDRLELFDLLFRLGFSVPGHIYDGGDPSLSAAQCRMLGFLAEIGLVYLMETEQSR 383
Query: 269 KEN 271
++
Sbjct: 384 QDG 386
>gi|156095867|ref|XP_001613968.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802842|gb|EDL44241.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 960
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 40/106 (37%), Positives = 68/106 (64%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
++V++NF++Y Y+ S L IL +++ + PN++VG +T+ S+ NA+ + ITA QII
Sbjct: 780 LIVQSNFKVYLYTNSTLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 839
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
FL+ AHP + S+P NV Q++LWES+ +R+ + A + F
Sbjct: 840 FLESYAHPGRSTFKSSIPVNVITQLKLWESERHRLTLDDAIVFKSF 885
Score = 41.6 bits (96), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 37/159 (23%), Positives = 67/159 (42%), Gaps = 18/159 (11%)
Query: 13 DMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGF 72
D + ++ + L+E+ EAIL SL L + + ++ +I + + M W+ P+
Sbjct: 15 DYMKEMSEKIWTYLFEDSLAHEAILHSLNELEQIIISRLLFIQQVVSERAMRLWINPNSL 74
Query: 73 TKHRVAIDRLVQLRLFSEEKKKETTYR------------LNSTFQSNLRKHLIYGGALPR 120
K I LV+ ++ E + K+ Y LN +++ K I+ + +
Sbjct: 75 KKLSECIKNLVEAKILVESETKKDNYNQYKINEKFRHTMLNKIYKNREDKIFIFNNNV-K 133
Query: 121 EPMPSGITAR----LPTLEDLEAYAIGQWECFLLQLISS 155
E M PT E++ YA +W LL I+S
Sbjct: 134 EQMEKEKNLHENNLYPTKEEVANYAHMRWNN-LLHFIAS 171
>gi|158302621|ref|XP_561433.2| Anopheles gambiae str. PEST AGAP012880-PA [Anopheles gambiae str.
PEST]
gi|157021114|gb|EAL42401.2| AGAP012880-PA [Anopheles gambiae str. PEST]
Length = 244
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 55/217 (25%), Positives = 108/217 (49%), Gaps = 20/217 (9%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
K+ + + S L++LY P IC A+ R LP +A+++V+++ +++ IP ++ W
Sbjct: 34 CKDLEEYLKSRPPEVLEKLYNYPAICLAVYRELPEIARQFVIRILFVEQPIPQAVVSSWA 93
Query: 68 LPDGFTKHRVAIDR-LVQLRLFSEEKKKE--TTYRLNSTFQSNLRKHLIYGGALPREPMP 124
+ K ++ + L +L ++ + L TF+ NL+ L+ GG P
Sbjct: 94 T-QVYAKENTSVSQVLTELGVWRSAAYPGGLAAWELCPTFKKNLKIALLGGG------RP 146
Query: 125 SGITARLP------TLEDLEAYAIGQWECFLLQLI--SSTQAERPTNFSSSMMKVFQRGL 176
++ L ++ L+ YA+ +W C L ++ S++ S +++
Sbjct: 147 WSMSNALDPDQKSRDIDFLDTYAMSRWRCVLHYMVGAGSSKGMEGEGISPDAVRILLHAN 206
Query: 177 LSRRDKE--APRLTESGFQFLLMDTNAQLWYIVREYI 211
L +RD+ +P +T GFQFLL+DT AQ+W+ + +Y+
Sbjct: 207 LMKRDESDGSPVITRQGFQFLLLDTQAQVWHFMLQYL 243
>gi|124806778|ref|XP_001350829.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
gi|23496958|gb|AAN36509.1| transcription factor Tfb2, putative [Plasmodium falciparum 3D7]
Length = 945
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 39/106 (36%), Positives = 69/106 (65%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
++V++NF++Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + ITA QII
Sbjct: 766 LIVQSNFKVYLYTSSLLKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITANQIIK 825
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
FL+ +HP + S+P NV Q++LWES+ +R+ + A + F
Sbjct: 826 FLESYSHPGKNNFKSSIPMNVITQLKLWESERHRLTLEDAIVFKSF 871
>gi|429854513|gb|ELA29524.1| tfiih and nucleotide excision repair factor 3 complexes subunit
[Colletotrichum gloeosporioides Nara gc5]
Length = 509
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/101 (40%), Positives = 63/101 (62%), Gaps = 5/101 (4%)
Query: 293 KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAE 352
+ G I+VETN+R+YAY + L +L LF K++ + +++ G +T+ S+ NA E GITA+
Sbjct: 369 RSGSIIVETNYRVYAYGQTPLQIAVLSLFCKLKLRFADMVSGRLTRNSIRNAVERGITAD 428
Query: 353 QIISFLQQNAHPRV-----ADRMPSVPENVCDQIRLWESDL 388
QIIS+L +AH ++ P +P V DQIRL D+
Sbjct: 429 QIISYLASHAHEQMHRFAAVRNKPVLPPTVIDQIRLEYEDI 469
>gi|399219100|emb|CCF75987.1| unnamed protein product [Babesia microti strain RI]
Length = 612
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/240 (25%), Positives = 109/240 (45%), Gaps = 59/240 (24%)
Query: 213 NSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGL----------- 261
N +E N D+I F+L LS G+ + L++ QK ++ DLG+
Sbjct: 293 NLEEAEKNVTDIIYFILTLSQLTIGQPISSKGLTKAQKRFLRFAVDLGILWQPKLDCDES 352
Query: 262 -VKLQQG-----RKENWF-----------IPTKLATNLSMSLTDSSARK----------- 293
+ L G R + +F + L+ +++ D S+RK
Sbjct: 353 SIDLDSGIHNIRRDKIYFAAPHALLFRSHMSEPLSLVSAITTADFSSRKLEIPNHYFSPY 412
Query: 294 ----------------EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAIT 337
E I+V++NF++Y Y+ S L +L +++ + PNL++G +T
Sbjct: 413 LLNETETAVPRMRHNLESGIIVQSNFKVYVYTASPLQINVLAHLCELQSRTPNLVIGVLT 472
Query: 338 KESLYNAFENGITAEQIISFLQQNAHP----RVADRMPSVPENVCDQIRLWESDLNRVEM 393
+ S + AF GITA+QI FL+ ++HP V + +P NV Q+ +WE++ NR+ +
Sbjct: 473 RASAHAAFRAGITAKQICQFLETHSHPILLQNVREGGSYLPNNVVTQLNMWEAERNRISL 532
>gi|145509120|ref|XP_001440504.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407721|emb|CAK73107.1| unnamed protein product [Paramecium tetraurelia]
Length = 400
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 97/395 (24%), Positives = 175/395 (44%), Gaps = 67/395 (16%)
Query: 33 CEAILRSLPPLAKKYVMQM-FYIDGAIPAKMMEEWVLPDGFTKH-RVAIDRLVQLRLFSE 90
C + RSLPP + + +M F+ + +P +++ + + +D +QL+L ++
Sbjct: 27 CIPVFRSLPPNLRTIIQRMVFFEERKVPESIIQ----AQNYQQSPNTILDLAIQLKLITK 82
Query: 91 EKKKE-----TTYRLNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAIGQW 145
+++++ + Y L+S FQ++LR I G + + + + I E L+ +W
Sbjct: 83 QEERQQQAVNSYYYLDSNFQNSLRDS-IEGKS---KSILTKIGDCTENKELLKRECSRKW 138
Query: 146 ECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLMDTNAQLWY 205
+ L LIS ++ S +V Q + S K R GF F+L T+ Q+
Sbjct: 139 K-DLFDLISRRDR---SDLSCYRQQVRQTLIES---KLLERNLGVGFSFILSSTHKQINQ 191
Query: 206 IVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQ 265
I++ Y++ ++++ F+L LS + Y + E QK++IKD + GL Q
Sbjct: 192 ILKYYVTKQS------SNVVRFILCLSVLDPMKIYQMPN-EEWQKNVIKDLQEFGLTHYQ 244
Query: 266 QGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRMYAYSTS------KLHCEILR 319
+ F L N S + I+VE NFR+YAY S ++ C +L
Sbjct: 245 DQQMRITF----LFWNFLYEPPSVSIGIQCNIIVEANFRIYAYLNSGDQQEEEILCNLLN 300
Query: 320 LFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNA-----------HPRVAD 368
LFS+++ + LI+ +++ S+ A + A+QII FL+ N+ + D
Sbjct: 301 LFSEIKKRFKILIIADLSESSIRKAVRENLQAKQIIQFLEMNSKQLKQQAATEKQHKSND 360
Query: 369 RMPS-----------------VPENVCDQIRLWES 386
+ +P NV QI+ WES
Sbjct: 361 ELKKRLDFLRVFQEGVPEKAIIPHNVVQQIQYWES 395
>gi|322785901|gb|EFZ12520.1| hypothetical protein SINV_14207 [Solenopsis invicta]
Length = 170
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/160 (30%), Positives = 83/160 (51%), Gaps = 14/160 (8%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
KN + + S T L++LY P IC A+ R LP +AK YVM++ +++ +P ++ W
Sbjct: 16 CKNLHEYLKSRTPETLNKLYHKPPICLAVFRELPVIAKNYVMRLLFVEQPVPQAVIASWC 75
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEEK--KKETTYRLNSTFQSNLRKHLIYGGALPREPMPS 125
F +H+ + L +L ++ E + LN+TF+ NL+ L+ GG P
Sbjct: 76 SKLYFEEHQKVVQVLNELYVWKEASIPGGLPGWILNNTFKKNLKIVLLGGGK------PW 129
Query: 126 GITARLPT------LEDLEAYAIGQWECFLLQLISSTQAE 159
++ +L T + L++YA+ +WEC L ++ S Q E
Sbjct: 130 TMSNQLETDSKPRDVAFLDSYALERWECVLHYMVGSQQQE 169
>gi|324515605|gb|ADY46257.1| General transcription factor IIH subunit 4 [Ascaris suum]
Length = 245
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 66/227 (29%), Positives = 113/227 (49%), Gaps = 30/227 (13%)
Query: 129 ARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQR-GLLSRRDKEAP-R 186
+R +DL A +WEC L L +Q + S + ++F+ GL S + E
Sbjct: 19 SRKSASKDLGKKATERWECILHYLALPSQ-KSEQGVSGATKQLFRAAGLTSGGESEGDME 77
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSF-----HVAGE--- 238
+T +GFQFLL++ Q+W + Y+ + G N + FLL+L+ H G
Sbjct: 78 ITSAGFQFLLLNQTEQIWMYILHYLRLEESMGKNVMAELDFLLKLTLCVDHTHTNGRLRV 137
Query: 239 ------------AYNLN-TLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMS 285
A+ ++ + SE + + +LGLV +++ RK+ +F T L +L++S
Sbjct: 138 VDGDREKKATWRAFFIDESWSETITNFLMQLRELGLVFIRK-RKDGYFFITPLFAHLTVS 196
Query: 286 LTDSSARKE-----GFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQ 327
+ +A E G+I+VETN+R+YAY+ S L IL F+++ Y+
Sbjct: 197 SSSETAVAEKRTHSGYIIVETNYRVYAYTDSSLQLAILSTFTEMLYR 243
>gi|82704497|ref|XP_726580.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23482045|gb|EAA18145.1| Drosophila melanogaster LD30622p-related [Plasmodium yoelii yoelii]
Length = 926
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 68/106 (64%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
+++++NF++Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + ITA+QII
Sbjct: 746 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 805
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
FL+ +HP +P NV Q++LWE++ +R+ + + + F
Sbjct: 806 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 851
>gi|70950974|ref|XP_744765.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56524852|emb|CAH80923.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 922
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 35/106 (33%), Positives = 68/106 (64%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
+++++NF++Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + ITA+QII
Sbjct: 742 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 801
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEF 402
FL+ +HP +P NV Q++LWE++ +R+ + + + F
Sbjct: 802 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRLTLEDSIVFKNF 847
>gi|68070707|ref|XP_677265.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497310|emb|CAH94094.1| conserved hypothetical protein [Plasmodium berghei]
Length = 929
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 64/95 (67%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
+++++NF++Y Y++S L IL +++ + PN++VG +T+ S+ NA+ + ITA+QII
Sbjct: 749 LIIQSNFKVYLYTSSILKINILSHLCELQARTPNMVVGILTRRSVLNAYNSDITADQIIK 808
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRV 391
FL+ +HP +P NV Q++LWE++ +R+
Sbjct: 809 FLESYSHPGKTKLKSIIPINVITQLKLWEAERHRL 843
>gi|340508252|gb|EGR33999.1| hypothetical protein IMG5_027880 [Ichthyophthirius multifiliis]
Length = 454
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 69/259 (26%), Positives = 116/259 (44%), Gaps = 65/259 (25%)
Query: 193 QFLLMDTNAQLWYIVREYISN-SQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKS 251
+F+L DTNAQ+ I+ Y +Q I + D+I+F++ LS + Y N SE K
Sbjct: 189 EFILQDTNAQIHQILFYYCQVFAQMENIEEEDIINFIITLSNLDVNKTYFFNYRSEYAKK 248
Query: 252 ------------MIKDFADLGLVK-LQQGRKENWFIPTKLATNL---SMSLTDSSARKEG 295
++ D A LG++K G+++ F T L S+ + +A+
Sbjct: 249 QNIDQKFTFELKILSDLAKLGMIKEFDLGQQKKVFGITPLIWQFCYRSIDIKTINAK--- 305
Query: 296 FIVVETNFRMYAY----------STSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAF 345
I+VETNF +YAY S SK ++L+ FSK+ Y P+LIV +T+ + AF
Sbjct: 306 -IIVETNFNLYAYLDYNPQNKTFSESKYIRDLLKKFSKIHYTFPHLIVAQLTEAKMKQAF 364
Query: 346 ENGITAEQIISFLQQNAHPR----VADRMPS----------------------------- 372
GIT++ +I F + + + + D+ S
Sbjct: 365 NQGITSKLLIEFFHKTSDAKLKKYLKDKQMSAIKLTQSLNLNKKKLDFLQLFQKKEEDFS 424
Query: 373 -VPENVCDQIRLWESDLNR 390
VP+N+ +I+ WE + ++
Sbjct: 425 LVPDNIIQEIQTWEKEKDK 443
>gi|269861689|ref|XP_002650538.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|220065990|gb|EED43520.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 291
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/222 (26%), Positives = 103/222 (46%), Gaps = 41/222 (18%)
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
+T G +FLLM+ N Q+WY++ E I S L S+ L+ H + LN L
Sbjct: 72 ITTKGLEFLLMNKNEQIWYLIVEDIRLSSHL---HKKLYSYAQVLNGHKIIDKKILNFLV 128
Query: 247 EIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRMY 306
EI ++I + D L K +I K T + + FI +ETNF++Y
Sbjct: 129 EI--NVIDNIVDQNLYK---------YILYKNNTEI-----------QKFIYLETNFKLY 166
Query: 307 AYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 366
+Y+ + IL LFS + +++P +I + ++ + N + GI +QI+ +++++
Sbjct: 167 SYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTLDRGIKIQQIVEYIRRHT---- 222
Query: 367 ADRMPSVPENVCDQ----IRLWESDLNRVEMTPAHYYDEFPS 404
N CDQ I +W+ NR+ + Y +F +
Sbjct: 223 --------INNCDQILHMIEIWDKQRNRINKQIGYLYSDFTN 256
>gi|269860316|ref|XP_002649880.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
gi|220066720|gb|EED44193.1| transcription/repair factor TFIIH subunit [Enterocytozoon bieneusi
H348]
Length = 369
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 103/222 (46%), Gaps = 41/222 (18%)
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNSQERGINQADLISFLLELSFHVAGEAYNLNTLS 246
+T G +FLLM+ N Q+WY++ E I S L S+ L+ H + LN L
Sbjct: 150 ITTKGLEFLLMNKNEQIWYLIVEDIRLSSHL---HKKLYSYAQVLNGHKIIDKKILNFLV 206
Query: 247 EIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGFIVVETNFRMY 306
EI ++I + D L K +I K ++ + FI +ETNF++Y
Sbjct: 207 EI--NVIDNIVDQNLYK---------YILYK-----------NNTEIQKFIYLETNFKLY 244
Query: 307 AYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRV 366
+Y+ + IL LFS + +++P +I + ++ + N + GI +QI+ +++++
Sbjct: 245 SYTNNLCDRAILDLFSNIVFEVPGMIKSILDEKKIMNTLDRGIKIQQIVEYIRRHT---- 300
Query: 367 ADRMPSVPENVCDQ----IRLWESDLNRVEMTPAHYYDEFPS 404
N CDQ I +W+ NR+ + Y +F +
Sbjct: 301 --------INNCDQILHMIEIWDKQRNRINKQIGYLYSDFTN 334
>gi|294950275|ref|XP_002786548.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
gi|239900840|gb|EER18344.1| tfiih, polypeptide, putative [Perkinsus marinus ATCC 50983]
Length = 580
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 102/483 (21%), Positives = 193/483 (39%), Gaps = 113/483 (23%)
Query: 23 LDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV--LPDG-FTKHRVAI 79
+ LY++P + + L + K ++ ++ + I ++ W+ L D +K+R A+
Sbjct: 50 FEELYKDPAVVAYVYSRLNGIGKVFINRLILVHRPIQQGVIRHWISSLHDSDHSKYRAAL 109
Query: 80 DRLVQLRLFSEEKKKETTYRLNSTFQSNLRKHLIYGGALPREPMP--------------- 124
+L +L + + K + R++S F+ L + + + + P
Sbjct: 110 SQLTKLHILLMDPKGQH-LRMSSNFRQRLHEVFLKNTSDDHQSPPWEILEEAKVDVSKVD 168
Query: 125 ----SGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRR 180
+G R + L+ YA +WE L ++ A +S+ R LS
Sbjct: 169 GVLTNGGAPRDAGMNRLDEYAALRWEYVLEAMVDGMTAAVG---DASLRPEGARQGLSGI 225
Query: 181 DKEAPRLTESG--FQFLLMDTNAQLWYIVREYISNSQERGINQADL--ISFLLELSFHVA 236
+ T +G FQF+L +Q W +VR + + Q+ L I ++ L+ +
Sbjct: 226 VGMSSTSTSTGQLFQFVLASRTSQYWELVRRAFDMIERQHGIQSLLAPIRMVVALTRSIK 285
Query: 237 GEAYNLNTLSEIQ---------------KSMIKDFADLGLV---------------KLQQ 266
E L + I+ ++++ ADLG+V + Q
Sbjct: 286 HEPERLLPGTRIRVNPQCMATPGAKGAVTTLLETLADLGIVVPATYDDELRKVAALQRQY 345
Query: 267 GRKENWFIPTKLA-----------------TNLSMSLTDSSA--RKEGFIVVETNFRMYA 307
G + +P + + + MS+ D++A K + V++NF + A
Sbjct: 346 GESVKYELPLVSSDSRMVVSSLVYVMQASRSKMMMSVDDTTAITNKSCRLFVDSNFAVTA 405
Query: 308 YSTSKLHCEILRLFSKVEYQL-------PN---LIVGAITKESLYNAFENGITAEQIISF 357
Y+TS L +L F +++ QL PN ++G +T+ S+ +A + G+T+E IIS+
Sbjct: 406 YTTSSLDLRLLGTFVQLQRQLGDGREYDPNDFGCVLGTLTQSSVQSAAQRGVTSEYIISY 465
Query: 358 LQQNAHPRVADR----------------------MPSVPENVCDQIRLWESDL--NRVEM 393
L+ + PR A + +P NV QI LWE + NR+ +
Sbjct: 466 LKSHVDPRAAHMGSQGGRSSAATANTGAARGEKFIDGIPANVVTQITLWEREAIHNRLRI 525
Query: 394 TPA 396
P
Sbjct: 526 DPG 528
>gi|160331881|ref|XP_001712647.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
gi|159766096|gb|ABW98322.1| hypothetical protein HAN_3g523 [Hemiselmis andersenii]
Length = 432
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 32/97 (32%), Positives = 60/97 (61%), Gaps = 4/97 (4%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIIS 356
I++E+N+R+Y Y + + ++ +FS++ Y LPN VG IT+ S+ A ++GIT + I+
Sbjct: 300 IIIESNYRIYVYQKNSNNNQLFLIFSEILYILPNFFVGEITETSISRALKSGITIQNILG 359
Query: 357 FLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEM 393
F+++N H S+P + +Q RLWE ++++
Sbjct: 360 FIRENLHCVCR----SIPSTILNQFRLWEFQKKKIKI 392
>gi|238570605|ref|XP_002386883.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
gi|215440101|gb|EEB87813.1| hypothetical protein MPER_14683 [Moniliophthora perniciosa FA553]
Length = 69
Score = 71.2 bits (173), Expect = 8e-10, Method: Composition-based stats.
Identities = 34/69 (49%), Positives = 49/69 (71%), Gaps = 5/69 (7%)
Query: 290 SARKEGFIVVETNFRMYAYSTS-----KLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
S ++GFIV+ETN+R+YAY+ S L IL LF ++Y+ PNL+VG+IT+ES+ A
Sbjct: 1 SGSQQGFIVLETNYRVYAYTGSFSLHNPLQTAILHLFVSLKYRFPNLVVGSITRESVKKA 60
Query: 345 FENGITAEQ 353
NGI+A+Q
Sbjct: 61 LSNGISADQ 69
>gi|308460920|ref|XP_003092758.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
gi|308252517|gb|EFO96469.1| hypothetical protein CRE_20976 [Caenorhabditis remanei]
Length = 128
Score = 71.2 bits (173), Expect = 8e-10, Method: Composition-based stats.
Identities = 35/75 (46%), Positives = 49/75 (65%), Gaps = 3/75 (4%)
Query: 333 VGAITKESLYNAFENGITAEQIISFLQQNAHPR---VADRMPSVPENVCDQIRLWESDLN 389
VG IT+ES+ A ++GITA QIISFL+ NAHP+ + + +P V DQIRLWE +
Sbjct: 3 VGMITRESVRGALQHGITAAQIISFLRANAHPQCIATSGAINCLPITVADQIRLWEDERR 62
Query: 390 RVEMTPAHYYDEFPS 404
R+++ A+ Y F S
Sbjct: 63 RMDLKDAYIYSHFES 77
>gi|330038479|ref|XP_003239608.1| TATA binding protein of transcription factor IIB-like protein
[Cryptomonas paramecium]
gi|327206532|gb|AEA38710.1| TATA binding protein of transcription factor IIB-like protein
[Cryptomonas paramecium]
Length = 467
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 60/105 (57%), Gaps = 13/105 (12%)
Query: 286 LTDSSARKEGF-IVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVGAITKESLYNA 344
L+ KE F +++E+N+R+YAY+ + +IL F + Y LP L VG I K S++ A
Sbjct: 307 LSKKIKTKECFQMIIESNYRVYAYNNNTFLNKILIQFCDLIYNLPGLFVGEINKTSIHKA 366
Query: 345 FENGITAEQIISFLQQNAHPRVADRMPSVPENVC----DQIRLWE 385
GI A+ IISF+ +N+H + +N C +QIR+WE
Sbjct: 367 IHKGINAKNIISFILKNSH--------YIHQNSCNPIINQIRIWE 403
>gi|146161819|ref|XP_001033510.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila]
gi|146142827|gb|EAR85847.2| hypothetical protein TTHERM_00313290 [Tetrahymena thermophila
SB210]
Length = 484
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 134/327 (40%), Gaps = 85/327 (25%)
Query: 137 LEAYAIGQWECFLLQL------ISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTES 190
LE A QW+ QL I++T ++ + +S +M VF R +LS P S
Sbjct: 150 LEGKAKNQWKSICNQLMLMAQEINNTSSQSQSKYSRLVMDVF-RQILSVE----PGNLGS 204
Query: 191 GFQ----FLLMDTNAQLWYIVREYISN-SQERGINQADLISFLLELS----FHVAGEAYN 241
G+Q F+L DTN Q+ I+ Y S+ I+ +SFLL LS + YN
Sbjct: 205 GYQKYMEFILEDTNTQIDKILTLYCYQYSRNHKISVETAVSFLLTLSNLDITKIYQNTYN 264
Query: 242 LNTLSEIQKSM--IKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARK------ 293
T + + + + D LG+V +KE + ++A ++ L RK
Sbjct: 265 NETATFQPRELEFLNDLHSLGMV----FKKE--ILKDRIAFYITPVLWQFCYRKIDLKTL 318
Query: 294 EGFIVVETNFRMYAY-----------STSKL-----HCE-ILRLFSKVEYQLPNLIVGAI 336
I +ETNF +YAY S L H E +L F + Y+ P+LIVG +
Sbjct: 319 NAKITIETNFNVYAYIDQEDPKEIKDSKDTLKDQTEHIEKLLASFCDLHYKFPHLIVGQL 378
Query: 337 TKESLYNAFENGITAEQIISFLQQNAHP-------------------------------- 364
+++ + F+NG+TA +I F + + P
Sbjct: 379 SEQKTKDQFKNGLTARLLIQFFNKTSDPQMKKYLKDKQMNAILLTQVQNLQKKKLEFLNL 438
Query: 365 --RVADRMPSVPENVCDQIRLWESDLN 389
+ D +P+N+ +I WE + N
Sbjct: 439 FKKTEDNFSLIPDNIQQEIETWEREKN 465
>gi|149031821|gb|EDL86756.1| rCG41767 [Rattus norvegicus]
Length = 187
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/154 (28%), Positives = 70/154 (45%), Gaps = 3/154 (1%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
+N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P + WV
Sbjct: 17 CRNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVALWV 76
Query: 68 LPDGFTKHRVAIDRLVQLRLFSEE--KKKETTYRLNSTFQSNLRKHLIYGG-ALPREPMP 124
+ + L LR++ + LN F+ NLR L+ GG A +
Sbjct: 77 KKEFSKAQEESTGLLSGLRIWHTQLLPGGLQGLILNPVFRQNLRIALLGGGKAWSDDTSQ 136
Query: 125 SGITARLPTLEDLEAYAIGQWECFLLQLISSTQA 158
G + L+ YA +WE L ++ S A
Sbjct: 137 LGPDKHARDVPSLDKYAEERWEVVLHFMVGSPSA 170
>gi|302831321|ref|XP_002947226.1| hypothetical protein VOLCADRAFT_87373 [Volvox carteri f.
nagariensis]
gi|300267633|gb|EFJ51816.1| hypothetical protein VOLCADRAFT_87373 [Volvox carteri f.
nagariensis]
Length = 191
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/181 (26%), Positives = 82/181 (45%), Gaps = 32/181 (17%)
Query: 10 NFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLP 69
+F +A+L+A LY +P+ A+ R+LPPLA+ YVM++ ++ PA ++ W L
Sbjct: 2 DFTAYIANLSADKRTLLYRSPWTSLAVFRNLPPLAQVYVMRLLFVPTPFPADYLDSWALR 61
Query: 70 DGFTKHRVAIDRLVQLRLFSEEKKKETT----------------------------YR-- 99
+ H+ A+ L L + E++ K T +R
Sbjct: 62 SATSTHQAALAALRGLDVLLEQRIKPVTTVGPHAAGFGAVAGGRAASAAAAPQQQVHRAV 121
Query: 100 --LNSTFQSNLRKHLIYGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQ 157
L+ F++ L++ G L R +P A P+LE L +A GQWE L L+ + +
Sbjct: 122 CVLHPDFRTQLQRVACCGSQLMRGDVPPAAAASAPSLEQLGEWATGQWEALQLYLLGAGR 181
Query: 158 A 158
+
Sbjct: 182 S 182
>gi|302831319|ref|XP_002947225.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
nagariensis]
gi|300267632|gb|EFJ51815.1| hypothetical protein VOLCADRAFT_35893 [Volvox carteri f.
nagariensis]
Length = 90
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 36/58 (62%)
Query: 348 GITAEQIISFLQQNAHPRVADRMPSVPENVCDQIRLWESDLNRVEMTPAHYYDEFPSR 405
GI A++++S+L HP +A R P VPE V DQIRLWE+ +NR+ Y+ SR
Sbjct: 1 GIAADELVSYLAARPHPSIASRTPVVPEVVSDQIRLWEASMNRLRADSVVLYENLASR 58
>gi|342184190|emb|CCC93671.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 499
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 65/263 (24%), Positives = 110/263 (41%), Gaps = 54/263 (20%)
Query: 183 EAPRLTESGFQFLLMDTNAQLWYIVR----EYISNSQERGINQADLISFLLEL-SFHVAG 237
EA R+T G F + Q W +V ++ + +G+ +A L L L + +
Sbjct: 178 EAARITCEGLSFCMQSLQQQWWTLVSVALDRVLTLTSGKGVTRATLWQLLAVLFALDASD 237
Query: 238 EAYNLNTLSEIQKS--MIKDFADLGLVK--LQQGRKENWFIP--------TKLATNLSMS 285
Y + E ++ ++ +++GLV L GRK P + + + + +
Sbjct: 238 YVYPFPSKEEDLEAFHLLARLSEVGLVYPLLCNGRKCFVLSPHFHHAICWSSVPSLCAAA 297
Query: 286 LTDSSAR--------KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG-AI 336
L D+ + E I+ ETNFR+YAY+ + IL F+ E + +IV +
Sbjct: 298 LLDNPNQTPCPLRREDEDTIITETNFRLYAYTRNPDLLGILDQFAVKEIDIDGIIVCYRV 357
Query: 337 TKESLYNAFENGITAEQIISFLQQNAHPR-----------VADRM----------PS--- 372
T+ S A GI A+ I+ FL AHP V+D + PS
Sbjct: 358 TRSSFALALRKGIDAKHILQFLTLKAHPSMVRKDGGASRDVSDNLALNSVAGFGKPSEVH 417
Query: 373 ----VPENVCDQIRLWESDLNRV 391
+P++ CDQ+ WE + R+
Sbjct: 418 QSTVIPQSFCDQLMTWERECRRL 440
>gi|380476902|emb|CCF44454.1| RNA polymerase II transcription factor B subunit 2 [Colletotrichum
higginsianum]
Length = 126
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 48/80 (60%), Gaps = 7/80 (8%)
Query: 331 LIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRM------PSVPENVCDQIRLW 384
++ G +T+ S+ NA E GITA+QIIS+L +AH ++ RM P +P V DQIRLW
Sbjct: 1 MVSGRLTRNSIRNAVERGITADQIISYLAAHAHEQM-HRMAAVRSRPVLPPTVVDQIRLW 59
Query: 385 ESDLNRVEMTPAHYYDEFPS 404
+ + R+ T + +F S
Sbjct: 60 QLETERMTTTSGFLFRDFDS 79
>gi|300122496|emb|CBK23066.2| unnamed protein product [Blastocystis hominis]
Length = 339
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/185 (28%), Positives = 85/185 (45%), Gaps = 25/185 (13%)
Query: 155 STQAERPTNFSSSMMKVFQRGLLSRRDKEAP-RLTESGFQFLLMDTNAQLWYIVREYISN 213
S A+ P+ S + F R L R D A +T +G+QFL+ Q W+++ +I++
Sbjct: 152 SPHAQPPS--VSEVRDCFLRCKLLRADSAAKFAITATGYQFLMKSYRDQAWWLLHPHITS 209
Query: 214 SQERGINQADLISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWF 273
+ N +SFL L + AY++ LSE Q+ +++ G+V + F
Sbjct: 210 A-----NALPALSFLFSLRHMIPCRAYDVAALSEPQRQLLEIARIAGVVYTPTTAGKRVF 264
Query: 274 IPTKLATNLSMSLTDS---SARK---EGF--------IVVETNFRMYAY---STSKLHCE 316
T L L S RK EGF +V+E NF +YA+ + ++LH +
Sbjct: 265 FATPLGAALFTQEGASRGEGGRKSGGEGFMKAEGSMRVVIEANFHVYAFGLLAENRLHVQ 324
Query: 317 ILRLF 321
+LR F
Sbjct: 325 LLRQF 329
>gi|401419872|ref|XP_003874425.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322490661|emb|CBZ25923.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 559
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/144 (29%), Positives = 61/144 (42%), Gaps = 49/144 (34%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQL-PNLIVGAITKESLYNAFENGITAEQII 355
I+ ETNFR+YAY+ IL F+++E + NL +T++S A GITA Q++
Sbjct: 359 IITETNFRLYAYTDDADLLNILNQFAELEEIVNGNLHCYRVTRDSFAAAMRKGITAAQVL 418
Query: 356 SFLQQNAHPRVADR-----------MPS-------------------------------- 372
FL AHP + R PS
Sbjct: 419 RFLSLRAHPSMLRRHGEREESGDTAAPSSAGGVSSSLTFSAPHRRGAVPGTCGGTTAIDA 478
Query: 373 -----VPENVCDQIRLWESDLNRV 391
VP++ CDQ+R+WES+ +RV
Sbjct: 479 VSTIVVPQSFCDQMRMWESECHRV 502
>gi|162605818|ref|XP_001713424.1| hypothetical protein GTHECHR3089 [Guillardia theta]
gi|13794356|gb|AAK39733.1|AF083031_90 hypothetical protein [Guillardia theta]
Length = 180
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 66/136 (48%), Gaps = 7/136 (5%)
Query: 256 FADLGLVKLQQGRKENWFIPTKLATNLSMSLTDSSARKEGF-IVVETNFRMYAYSTSKLH 314
F ++GL+ L G+K I K+ + + E F I++E NFR+Y +
Sbjct: 5 FYNMGLINLT-GKKTIQ-ISNKIINLEKNGIIIQKKKTERFKIIIEVNFRIYIFKYFDFS 62
Query: 315 CEILRLFSKVEYQLPNLIVGAITKESLYNAFENGITAEQIISFLQQNAHPRVADRMPSVP 374
+ SK Y LP VG I + F+ G+T++ I+ F+++N H +++PS
Sbjct: 63 NSFFKFISKKIYSLPYFYVGEINENYSRRLFQLGVTSDNILKFIKKNLH-YTCNKIPSTF 121
Query: 375 ENVCDQIRLWESDLNR 390
E D++R+WE +
Sbjct: 122 E---DRLRIWEISFKK 134
>gi|146103275|ref|XP_001469523.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|134073893|emb|CAM72632.1| conserved hypothetical protein [Leishmania infantum JPCM5]
Length = 557
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 58/144 (40%), Gaps = 49/144 (34%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQL-PNLIVGAITKESLYNAFENGITAEQII 355
I+ ETNFR+YAY+ IL F+K+E + NL +T++S +A GITA Q++
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416
Query: 356 SFLQQNAHPRVADRMPS------------------------------------------- 372
FL AHP + R
Sbjct: 417 RFLSLRAHPSMLRRHGEREENGDTTSPSSAGGASSSFTFSASPRRRAVAGTCGSTTSIDA 476
Query: 373 -----VPENVCDQIRLWESDLNRV 391
VP++ CDQ+ +WES+ RV
Sbjct: 477 VSTLVVPQSFCDQMHMWESECRRV 500
>gi|398024082|ref|XP_003865202.1| hypothetical protein, conserved [Leishmania donovani]
gi|322503439|emb|CBZ38524.1| hypothetical protein, conserved [Leishmania donovani]
Length = 557
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 58/144 (40%), Gaps = 49/144 (34%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQL-PNLIVGAITKESLYNAFENGITAEQII 355
I+ ETNFR+YAY+ IL F+K+E + NL +T++S +A GITA Q++
Sbjct: 357 IITETNFRLYAYTDDADLLNILNQFAKLEEIVNGNLHCYRVTRDSFASAMRKGITAAQVL 416
Query: 356 SFLQQNAHPRVADRMPS------------------------------------------- 372
FL AHP + R
Sbjct: 417 RFLSLRAHPSMLRRHGEREENGDTTSPSSAGGASSSFTFSASPRRRAVAGTCGSTTSIDA 476
Query: 373 -----VPENVCDQIRLWESDLNRV 391
VP++ CDQ+ +WES+ RV
Sbjct: 477 VSTLVVPQSFCDQMHMWESECRRV 500
>gi|154345556|ref|XP_001568715.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134066057|emb|CAM43844.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 561
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 58/143 (40%), Gaps = 48/143 (33%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQL-PNLIVGAITKESLYNAFENGITAEQII 355
I+ ETNFR+YAY+ IL F+++E + NL +T++S A GITA Q++
Sbjct: 362 IITETNFRLYAYTEDSDLLNILHQFAELEEVVNGNLHCYRVTRDSFAAAMRKGITATQVL 421
Query: 356 SFLQQNAHPRVADR------------------MPS------------------------- 372
FL AHP + R PS
Sbjct: 422 RFLSLRAHPSMLRRYGEKEENGDTKSPSPAGGAPSSTFSAPPRRAAVAGTRGSTTFIDAV 481
Query: 373 ----VPENVCDQIRLWESDLNRV 391
VP + CDQ+R+WE + RV
Sbjct: 482 TTLVVPHSFCDQLRMWERECRRV 504
>gi|340057132|emb|CCC51474.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 502
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/128 (28%), Positives = 58/128 (45%), Gaps = 30/128 (23%)
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG-AITKESLYNAFENGITAE 352
E I+ ETNFR+YAY+ +K IL F+ E ++ +IV +T+ S A GI A+
Sbjct: 315 EDTIITETNFRLYAYTRNKHLLAILEQFAIKEAEVDGMIVCFRVTRASFAAALRKGIGAQ 374
Query: 353 QIISFLQQNAH-----------PRVADRMPS------------------VPENVCDQIRL 383
I+ FL+ AH PR + + + +P + CDQ+
Sbjct: 375 HIVQFLRVKAHSSMLKHQLTCDPRDSAGLAATSSRVTPTDTPWTHADKIIPRSFCDQLFT 434
Query: 384 WESDLNRV 391
WE + R+
Sbjct: 435 WERECRRL 442
>gi|157876614|ref|XP_001686652.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68129727|emb|CAJ09033.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 626
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 39/144 (27%), Positives = 57/144 (39%), Gaps = 49/144 (34%)
Query: 297 IVVETNFRMYAYSTSKLHCEILRLFSKVEYQL-PNLIVGAITKESLYNAFENGITAEQII 355
I+ ETNFR++AY+ IL F+++E + NL +T+ S A GITA Q++
Sbjct: 426 IITETNFRLFAYTNDADLLNILNQFAELEEIVNGNLHCYRVTRASFAAAMRKGITAAQVL 485
Query: 356 SFLQQNAHPRVADRMPS------------------------------------------- 372
FL AHP + R
Sbjct: 486 RFLSLRAHPSMLRRHGEREENGDTASPSSAGGASSSFTFSASPRRGAVAGTCGSTTSIDA 545
Query: 373 -----VPENVCDQIRLWESDLNRV 391
VP++ CDQ+R+WES+ RV
Sbjct: 546 VSTLVVPQSFCDQMRMWESECRRV 569
>gi|76156573|gb|AAX27758.2| SJCHGC02916 protein [Schistosoma japonicum]
Length = 141
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 54/111 (48%), Gaps = 2/111 (1%)
Query: 8 AKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
+ + + + ++ A LD LY +P C + R LP LAK VM++ +I+ IP ++ WV
Sbjct: 6 SSSIFEYLKNVGAHMLDELYTHPPTCLVVFRELPELAKHIVMRLLFIEQPIPKSIVSGWV 65
Query: 68 LPDGFTKHRVAIDRLVQLRLF--SEEKKKETTYRLNSTFQSNLRKHLIYGG 116
+ L LR++ ++ ++ LN +Q ++R L GG
Sbjct: 66 EKGSSALLNDSCSALTVLRIWHSTDTNVSGGSWSLNKKYQESIRISLFGGG 116
>gi|71747422|ref|XP_822766.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70832434|gb|EAN77938.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 500
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 101/261 (38%), Gaps = 58/261 (22%)
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNS----QERGINQADL-----ISFLLELSFHVAG 237
+T G F + Q W +V + + +G+++A L + F L S +V
Sbjct: 183 VTSEGLAFCMQPLQQQWWTLVSVVLDRALAIMSGKGVSRATLWQLLAVLFALNTSDYVYL 242
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFI--------------PTKLATNLS 283
+ L Q ++ +++GLV E F+ P+ L
Sbjct: 243 FPSKDDDLEAFQ--LLARLSEVGLVYPLICNGEKCFVLSPHFHHAICWSSTPSLCTAALL 300
Query: 284 MSLTDSSAR----KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG-AITK 338
S+R E I+ ETNFR+YAY+ + IL F+ E + ++V +T+
Sbjct: 301 DDTRGPSSRLRREDEDTIITETNFRLYAYTKNPDMLRILDQFAVKEVDVVGMVVCYRVTR 360
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRVADR------------MPS-------------- 372
S +A GI A I+ FL AHP + + +P+
Sbjct: 361 ASFASALAKGIGANHILQFLTVKAHPSMIKQSNSEAGDPSCPVLPAASVGFGNTSEYRQG 420
Query: 373 --VPENVCDQIRLWESDLNRV 391
+P++ CDQ+ WE + R+
Sbjct: 421 NIIPQSFCDQLFTWERECRRL 441
>gi|261332554|emb|CBH15549.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 500
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 59/261 (22%), Positives = 101/261 (38%), Gaps = 58/261 (22%)
Query: 187 LTESGFQFLLMDTNAQLWYIVREYISNS----QERGINQADL-----ISFLLELSFHVAG 237
+T G F + Q W +V + + +G+++A L + F L S +V
Sbjct: 183 VTSEGLAFCMQPLQQQWWTLVSVVLDRALAIMSGKGVSRATLWQLLAVLFALNTSDYVYL 242
Query: 238 EAYNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFI--------------PTKLATNLS 283
+ L Q ++ +++GLV E F+ P+ L
Sbjct: 243 FPSKDDDLEAFQ--LLARLSEVGLVYPLICNGEKCFVLSPHFHHAICWSSTPSLCTAALL 300
Query: 284 MSLTDSSAR----KEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG-AITK 338
S+R E I+ ETNFR+YAY+ + IL F+ E + ++V +T+
Sbjct: 301 DDTRGPSSRLRREDEDTIITETNFRLYAYTKNPDMLRILDQFAVKEVDVVGMVVCYRVTR 360
Query: 339 ESLYNAFENGITAEQIISFLQQNAHPRVADR------------MPS-------------- 372
S +A GI A I+ FL AHP + + +P+
Sbjct: 361 ASFASALAKGIGANHILQFLTVKAHPSMIKQSNSEAGDPSCPVLPAASAGFGNTSEYRQG 420
Query: 373 --VPENVCDQIRLWESDLNRV 391
+P++ CDQ+ WE + R+
Sbjct: 421 NIIPQSFCDQLFTWERECRRL 441
>gi|71425201|ref|XP_813043.1| DNA repair and transcription factor protein [Trypanosoma cruzi
strain CL Brener]
gi|70877891|gb|EAN91192.1| DNA repair and transcription factor protein, putative [Trypanosoma
cruzi]
Length = 501
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 74/335 (22%), Positives = 129/335 (38%), Gaps = 65/335 (19%)
Query: 114 YGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQ 173
YG A+PR MP + TL+ + C L I +R +S M++ ++
Sbjct: 116 YGKAMPR--MPLAQLSFQNTLDASRSILHMSTACALAGSIDPFGEKRGQPLAS-MLQYYR 172
Query: 174 RGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN----SQERGINQADL----- 224
L E +T G F + Q W +V + ++ G+ +A+L
Sbjct: 173 ---LVPTSSETREITSEGLSFCMQPLQQQWWTLVSVALDRILVLTKNSGVTRAELWQLLA 229
Query: 225 ISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVK--LQQGRKENWFIP------- 275
+ F L+ S V L+ Q ++ +++GLV + GR+ P
Sbjct: 230 VLFALDTSQFVYPFPEKEKDLAAFQ--ILARLSEVGLVYPLISGGRRCFALSPHFHHAVC 287
Query: 276 -TKLATNLSMSLTDSSARKEG--------FIVVETNFRMYAYSTSKLHCEILRLFSKVEY 326
+ A + +L DS G I+ E NFR+YAY+ + IL F++ +
Sbjct: 288 WSSTAPLCTAALLDSGGNSMGRVRREDEDTIITEANFRLYAYTRNPDLLNILNQFAERDA 347
Query: 327 QLPNLIVG-AITKESLYNAFENGITAEQIISFLQQNAHP------RVADRMPS------- 372
++ +I +T+ + A + GI + I+ FL AHP R D +
Sbjct: 348 EVDQMIACYRVTRRTFAAALKRGIGSSHILQFLAVKAHPSMLRHHREGDSKKASGLSVLG 407
Query: 373 ----------------VPENVCDQIRLWESDLNRV 391
+P++ CDQ+ WE + R+
Sbjct: 408 AGTRFKNASDIRIDEIIPQSFCDQLMTWEKECRRL 442
>gi|71650598|ref|XP_813994.1| DNA repair and transcription factor protein [Trypanosoma cruzi
strain CL Brener]
gi|70878928|gb|EAN92143.1| DNA repair and transcription factor protein, putative [Trypanosoma
cruzi]
Length = 501
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 75/341 (21%), Positives = 129/341 (37%), Gaps = 77/341 (22%)
Query: 114 YGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQ 173
YG A+PR MP + TL+ + C L I +R +S M++ ++
Sbjct: 116 YGKAMPR--MPLAQLSFQNTLDASRSVLHMTTACALAGSIDPFGEKRGQPLAS-MLQYYR 172
Query: 174 RGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN----SQERGINQADLISFLL 229
L E +T G F + Q W +V + ++ G+ +A+L L
Sbjct: 173 ---LVPTSSETREITSEGLSFCMQPLQQQWWTLVSVALDRILVLTKNSGVTRAELWQLLA 229
Query: 230 ELSFHVAGEAYNLNT------LSEIQKSM-----IKDFADLGLVK--LQQGRKENWFIP- 275
L + LNT E +K + + +++GLV + GR+ P
Sbjct: 230 VL--------FALNTSQFVYPFPEKEKDLAAFQILARLSEVGLVYPLISGGRRCFALSPH 281
Query: 276 -------TKLATNLSMSLTDSSARKEG--------FIVVETNFRMYAYSTSKLHCEILRL 320
+ A + +L DS G I+ E NFR+YAY+ + IL
Sbjct: 282 FHHAVCWSSTAPLCTAALLDSGGNSMGRVRREDEDTIITEANFRLYAYTKNPDLLNILNQ 341
Query: 321 FSKVEYQLPNLIVG-AITKESLYNAFENGITAEQIISFLQQNAHP------RVADRMPS- 372
F++ + ++ +I +T+ + A + GI + I+ FL AHP R D +
Sbjct: 342 FAERDAEVDQMIACYRVTRRTFAAALKRGIGSSHILQFLAVKAHPSMLMHHREGDSKKAS 401
Query: 373 ----------------------VPENVCDQIRLWESDLNRV 391
+P++ CDQ+ WE + R+
Sbjct: 402 GLSVLGAGTRFKNASDIRIDEIIPQSFCDQLMTWEKECRRL 442
>gi|148691278|gb|EDL23225.1| general transcription factor II H, polypeptide 4 [Mus musculus]
Length = 116
Score = 52.4 bits (124), Expect = 4e-04, Method: Composition-based stats.
Identities = 22/59 (37%), Positives = 35/59 (59%)
Query: 9 KNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWV 67
+N + + L+ LDRLY +P C A+ R LP LAK +VM+M +++ +P + WV
Sbjct: 18 RNLQEFLGGLSPGVLDRLYGHPATCLAVFRELPSLAKNWVMRMLFLEQPLPQAAVALWV 76
>gi|407835017|gb|EKF99108.1| DNA repair and transcription factor protein, putative [Trypanosoma
cruzi]
Length = 501
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 74/335 (22%), Positives = 129/335 (38%), Gaps = 65/335 (19%)
Query: 114 YGGALPREPMPSGITARLPTLEDLEAYAIGQWECFLLQLISSTQAERPTNFSSSMMKVFQ 173
YG A+PR MP + TL+ + C L I +R +S M++ ++
Sbjct: 116 YGKAMPR--MPLAQLSLQNTLDASRSIIHMTTACALAGSIDPFGEKRGQPLAS-MLQYYR 172
Query: 174 RGLLSRRDKEAPRLTESGFQFLLMDTNAQLWYIVREYISN----SQERGINQADL----- 224
L E +T G F + Q W +V + ++ G+ +A+L
Sbjct: 173 ---LVPTSSETREITSEGLSFCMQPLQQQWWTLVSVALDRILVLTKNSGVTRAELWQLLA 229
Query: 225 ISFLLELSFHVAGEAYNLNTLSEIQKSMIKDFADLGLVK--LQQGRKENWFIP------- 275
+ F L+ S V L+ Q ++ +++GLV + GR+ P
Sbjct: 230 VLFALDTSQFVYPFPEKEKDLAAFQ--ILARLSEVGLVYPLISGGRRCFALSPHFHHAVC 287
Query: 276 -TKLATNLSMSLTDSSA--------RKEGFIVVETNFRMYAYSTSKLHCEILRLFSKVEY 326
+ A + +L DS E I+ E NFR+YAY+ + IL F++ +
Sbjct: 288 WSSTAPLCTAALLDSGGDSMGRVRREDEDTIITEANFRLYAYTRNPDLLNILNQFAERDA 347
Query: 327 QLPNLIVG-AITKESLYNAFENGITAEQIISFLQQNAHP------RVADRMPS------- 372
++ +I +T+ + A + GI + I+ FL AHP R D +
Sbjct: 348 EVDQMIACYRVTRRTFAAALKRGIGSLHILQFLAVKAHPSMLRHHREGDSKKASGLSVLG 407
Query: 373 ----------------VPENVCDQIRLWESDLNRV 391
+P++ CDQ+ WE + R+
Sbjct: 408 AGTRFKKASDIRIDEIIPQSFCDQLMTWEKECRRL 442
>gi|407397449|gb|EKF27748.1| DNA repair and transcription factor protein [Trypanosoma cruzi
marinkellei]
Length = 501
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/128 (25%), Positives = 56/128 (43%), Gaps = 30/128 (23%)
Query: 294 EGFIVVETNFRMYAYSTSKLHCEILRLFSKVEYQLPNLIVG-AITKESLYNAFENGITAE 352
E I+ E NFR+YAY+ + IL F++ + ++ +I +T+++ A + GI +
Sbjct: 315 EDTIITEANFRLYAYTKNPDLLNILNQFAERDAEIDQMIACYRVTRKTFAAALKRGIGSS 374
Query: 353 QIISFLQQNAHP------RVADRMPS-----------------------VPENVCDQIRL 383
I+ FL AHP R D + +P++ CDQ+
Sbjct: 375 HILQFLAVKAHPSMLRNHRDGDSKKTSGLSVLGAGTRFKNTSDIRIDGIIPQSFCDQLMT 434
Query: 384 WESDLNRV 391
WE + R+
Sbjct: 435 WEKECRRL 442
>gi|218184994|gb|EEC67421.1| hypothetical protein OsI_34611 [Oryza sativa Indica Group]
Length = 199
Score = 47.0 bits (110), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 42 PLAKKYVMQMFYIDG--AIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF 88
PLAKKY +++ + ++ A M+E VL + TKHR++IDR +QL++F
Sbjct: 144 PLAKKYTLELLFASSPPSVTAAAMKECVLDEYSTKHRISIDRFLQLKIF 192
>gi|337743331|gb|AEI73162.1| GTF2H4 [Kryptolebias marmoratus]
Length = 82
Score = 46.6 bits (109), Expect = 0.020, Method: Composition-based stats.
Identities = 28/81 (34%), Positives = 43/81 (53%), Gaps = 12/81 (14%)
Query: 240 YNLNTLSEIQKSMIKDFADLGLVKLQQGRKENWFIPTKLATNLSMSLT-----------D 288
Y++ +SE + ++ + GLV Q+ RK + PT+LA L+ +T
Sbjct: 3 YSVEGMSESLLTFLQHLREFGLV-FQRKRKSRRYYPTRLAITLAAGVTSNSSSSSNLSST 61
Query: 289 SSARKEGFIVVETNFRMYAYS 309
A GFIVVETN+R+YAY+
Sbjct: 62 QGAGDAGFIVVETNYRVYAYT 82
>gi|222613247|gb|EEE51379.1| hypothetical protein OsJ_32426 [Oryza sativa Japonica Group]
Length = 199
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 42 PLAKKYVMQMFYIDG--AIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF 88
PLAKKY +++ + ++ A M+E VL + TKHR++IDR +QL++F
Sbjct: 144 PLAKKYTLELLFASSPPSVTAAAMKECVLDEYSTKHRISIDRFLQLKIF 192
>gi|14165325|gb|AAK55457.1|AC069300_12 hypothetical protein [Oryza sativa Japonica Group]
gi|31433471|gb|AAP54976.1| expressed protein [Oryza sativa Japonica Group]
Length = 280
Score = 45.1 bits (105), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 21/49 (42%), Positives = 34/49 (69%), Gaps = 2/49 (4%)
Query: 42 PLAKKYVMQMFYIDG--AIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF 88
PLAKKY +++ + ++ A M+E VL + TKHR++IDR +QL++F
Sbjct: 167 PLAKKYTLELLFASSPPSVTAAAMKECVLDEYSTKHRISIDRFLQLKIF 215
>gi|430811942|emb|CCJ30591.1| unnamed protein product [Pneumocystis jirovecii]
Length = 110
Score = 45.1 bits (105), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 46/97 (47%), Gaps = 11/97 (11%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLY+NP C ++ R LP LAK+++ + Y P + V + K A+DRL +
Sbjct: 22 RLYKNPSTCLSVFRLLPSLAKQFIFSLLYYPDPFPLDDFDLLV-KENTRKQAQALDRLCR 80
Query: 85 LRLFSEEKKKETTYRLNSTFQSNLRKHLIY---GGAL 118
+ +F EKK + Y F N + + GG L
Sbjct: 81 IHIF--EKKNDHIY-----FTENFKNEFLVALMGGLL 110
>gi|430812023|emb|CCJ30550.1| unnamed protein product [Pneumocystis jirovecii]
Length = 88
Score = 44.3 bits (103), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 21/68 (30%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 25 RLYENPFICEAILRSLPPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQ 84
RLY+NP C ++ R LP LAK+++ + Y P + ++ + K A+DRL +
Sbjct: 22 RLYKNPSTCLSVFRLLPSLAKQFIFSLLYYPDPFPLDDF-DLLVKENTRKQAQALDRLCR 80
Query: 85 LRLFSEEK 92
+ +F ++K
Sbjct: 81 IHIFEKKK 88
>gi|55296646|dbj|BAD69366.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 110
Score = 41.6 bits (96), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 21/48 (43%), Positives = 29/48 (60%)
Query: 41 PPLAKKYVMQMFYIDGAIPAKMMEEWVLPDGFTKHRVAIDRLVQLRLF 88
PPLAK + + A M+E VL + TKHR++IDR +QLR+F
Sbjct: 48 PPLAKYTLELCVSTPPPVAAAAMKECVLDEYATKHRISIDRFLQLRIF 95
>gi|298249219|ref|ZP_06973023.1| ATP-dependent transcriptional regulator, MalT-like, LuxR family
[Ktedonobacter racemifer DSM 44963]
gi|297547223|gb|EFH81090.1| ATP-dependent transcriptional regulator, MalT-like, LuxR family
[Ktedonobacter racemifer DSM 44963]
Length = 1032
Score = 40.8 bits (94), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 32/114 (28%), Positives = 53/114 (46%), Gaps = 11/114 (9%)
Query: 3 LVKIIAKNFMDMVASLTARDLDRLYENPFICEAILRSLPPLAKKYVMQMFYID---GAIP 59
L+ ++ + D L A + + ++CE ILR LPP +++++Q ++ G +
Sbjct: 348 LLMLVLRGHADGAEVLRATGVTPRFLLDYVCEEILRKLPPDMQRFLLQTSVLERLTGPLC 407
Query: 60 AKMMEEWVLPDGFTKHRVAIDRLVQLRLF-SEEKKKETTYRLNSTFQSNLRKHL 112
+ EE PDG K ++ L+Q LF S T YR + F LR L
Sbjct: 408 ESVTEE---PDGQLK----LESLLQANLFVSALDATNTWYRYHPLFAETLRTQL 454
>gi|359075905|ref|XP_002695282.2| PREDICTED: sialic acid binding Ig-like lectin 5-like [Bos taurus]
Length = 520
Score = 38.5 bits (88), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 263 KLQQGRKENWFIPTK-LATNLSMSLTDSSARKEG--FIVVETNFRMYAYSTSKLHCEILR 319
KL++G + + +P + A N S+S+TD +A G F VET+FR YAY L ++
Sbjct: 76 KLREGTQGRFSVPGEPQAGNCSLSITDVNAGDSGTYFFQVETHFRKYAYLNKMLFLNVIA 135
Query: 320 LFSKVEYQLPNLI 332
L + + P +
Sbjct: 136 LTHQPHVRSPGAL 148
>gi|358416930|ref|XP_875692.5| PREDICTED: LOW QUALITY PROTEIN: sialic acid binding Ig-like lectin
5-like [Bos taurus]
Length = 479
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 263 KLQQGRKENWFIPTKL-ATNLSMSLTDSSARKEG--FIVVETNFRMYAYSTSKLHCEILR 319
KL++G + + +P + A N S+S+TD +A G F VET+FR YAY L ++
Sbjct: 76 KLREGTQGRFSVPGEPQAGNCSLSITDVNAGDSGTYFFQVETHFRKYAYLNKMLFLNVIA 135
Query: 320 LFSKVEYQLPNLI 332
L + + P +
Sbjct: 136 LTHQPHVRSPGAL 148
>gi|296477381|tpg|DAA19496.1| TPA: sialic acid binding Ig-like lectin 5-like [Bos taurus]
Length = 546
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 263 KLQQGRKENWFIPTKL-ATNLSMSLTDSSARKEG--FIVVETNFRMYAYSTSKLHCEILR 319
KL++G + + +P + A N S+S+TD +A G F VET+FR YAY L ++
Sbjct: 82 KLREGTQGRFSVPGEPQAGNCSLSITDVNAGDSGTYFFQVETHFRKYAYLNKMLFLNVIA 141
Query: 320 LFSKVEYQLPNLI 332
L + + P +
Sbjct: 142 LTHQPHVRSPGAL 154
>gi|170590870|ref|XP_001900194.1| Cadherin domain containing protein [Brugia malayi]
gi|158592344|gb|EDP30944.1| Cadherin domain containing protein [Brugia malayi]
Length = 2088
Score = 38.1 bits (87), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 28/111 (25%), Positives = 56/111 (50%), Gaps = 7/111 (6%)
Query: 143 GQWECFLLQLISSTQAERPTNFSSSMMKVFQRGLLSRRDKEAPRLTESGFQFLLM-DTNA 201
G ++Q++++ Q + TNF +++ KVF + R+D EAP + + ++FLL + ++
Sbjct: 1610 GTTGTIMVQIVATGQFLQKTNFEANVCKVF----IERKDTEAPPILQPHYKFLLSENADS 1665
Query: 202 QLWYIVREYISNSQERGINQADLISFLLELSFHVA--GEAYNLNTLSEIQK 250
+++ E + I + I + E F V G+ Y T++ QK
Sbjct: 1666 SRSFVIFENLPEEYYLDIVEEGRIGYATEQPFCVGDPGKLYLCGTMNYEQK 1716
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.135 0.392
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,957,048,263
Number of Sequences: 23463169
Number of extensions: 234521130
Number of successful extensions: 534639
Number of sequences better than 100.0: 418
Number of HSP's better than 100.0 without gapping: 391
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 532956
Number of HSP's gapped (non-prelim): 554
length of query: 407
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 262
effective length of database: 8,957,035,862
effective search space: 2346743395844
effective search space used: 2346743395844
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 78 (34.7 bits)