BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 005234
(707 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225455842|ref|XP_002273382.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
isoform 2 [Vitis vinifera]
Length = 1345
Score = 1027 bits (2656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 494/617 (80%), Positives = 540/617 (87%), Gaps = 3/617 (0%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG FRKILQ I+ RAQ + +RTLSTKEFRHFANKVGNLERPFLKEFFPRWV
Sbjct: 449 MLEKQMGPESFRKILQTIVFRAQDTTRSLRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 508
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
G+CGCPVLR G SYNKRKN+VELAVLR CT PD+ T VL+ N DSENR+ DIGWPGMMS
Sbjct: 509 GSCGCPVLRAGLSYNKRKNLVELAVLRGCTAAPDTNTMVLNGNIDSENREVDIGWPGMMS 568
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IRVHELDGMYDHPILPMAG+ WQLLEIQCHSKLAARR KPKKGSKPDG DDNGD V +
Sbjct: 569 IRVHELDGMYDHPILPMAGETWQLLEIQCHSKLAARRFQKPKKGSKPDGSDDNGD-VPAV 627
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
DMRS+ ESPL W+R DPE+EYLAEIHFNQP QMWINQLE+D DVVAQAQAIA LEALP L
Sbjct: 628 DMRSNTESPLLWLRVDPELEYLAEIHFNQPAQMWINQLERDKDVVAQAQAIATLEALPQL 687
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF+VVN LNNFLSDSKAFWRVRIEAA+ALANTASEETDWAGLLHLVKFYKSRRFD NIGL
Sbjct: 688 SFSVVNALNNFLSDSKAFWRVRIEAAFALANTASEETDWAGLLHLVKFYKSRRFDANIGL 747
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PNDF DF EYFVLEAIPHA+AMVRAAD KSPREAVEFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 748 PKPNDFHDFPEYFVLEAIPHAIAMVRAADKKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 807
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQI LKLS
Sbjct: 808 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIGLKLS 867
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
GFI LD+V++L+KPFRDF IWQVR+EASRALL LEFH GID+ALSLFIK VEEEPS+R
Sbjct: 868 GFIPLDRVIELVKPFRDFQAIWQVRIEASRALLGLEFHFKGIDAALSLFIKYVEEEPSIR 927
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQVKLG+HAMR+CQIKGGS+S++++ + TLVALL LLESRIAFNNVFLRHHLF IL+ILA
Sbjct: 928 GQVKLGVHAMRLCQIKGGSESDNDIKSSTLVALLRLLESRIAFNNVFLRHHLFCILRILA 987
Query: 540 GRAPTLYGVPRDKLLLLGDGE-TSEQKNVFASFVTEMRRAEPPMDVPNLSQDNLAVRDAS 598
GR PTLYGVPRD++ + E SEQKN F + V E + EPP+D PN+S D LA+ +AS
Sbjct: 988 GRLPTLYGVPRDQIPQMDPAEICSEQKNGFITIVKETKSLEPPVDTPNVSHDGLALPEAS 1047
Query: 599 KEVDCVANGHAENILAV 615
+E D V+N H + V
Sbjct: 1048 READTVSNSHERKMPVV 1064
>gi|225455844|ref|XP_002273351.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
isoform 1 [Vitis vinifera]
Length = 1325
Score = 984 bits (2544), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/617 (77%), Positives = 522/617 (84%), Gaps = 23/617 (3%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG FRKILQ I+ RAQ + +RTLSTKEFRHFANKVGNLERPFLKEFFPRWV
Sbjct: 449 MLEKQMGPESFRKILQTIVFRAQDTTRSLRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 508
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
G+CGCPVLR G SYNKRKN+VELAVLR CT PD+ T VL+ N DSENR+ DIGWPGMMS
Sbjct: 509 GSCGCPVLRAGLSYNKRKNLVELAVLRGCTAAPDTNTMVLNGNIDSENREVDIGWPGMMS 568
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IRVHELDGMYDHPILPMAG+ WQLLEIQCHSKLAARR KPKKGSKPDG DDNGD V +
Sbjct: 569 IRVHELDGMYDHPILPMAGETWQLLEIQCHSKLAARRFQKPKKGSKPDGSDDNGD-VPAV 627
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
DMRS+ ESPL W+R DPE+EYLAEIHFNQP QMWINQLE+D DVVAQAQAIA LEALP L
Sbjct: 628 DMRSNTESPLLWLRVDPELEYLAEIHFNQPAQMWINQLERDKDVVAQAQAIATLEALPQL 687
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF+VVN LNNFLSDSKAFWRVRIEAA+ALANTASEETDWAGLLHLVKFYKSRRFD NIGL
Sbjct: 688 SFSVVNALNNFLSDSKAFWRVRIEAAFALANTASEETDWAGLLHLVKFYKSRRFDANIGL 747
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PNDF DF EYFVLEAIPHA+AMVRAAD KSPREAVEFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 748 PKPNDFHDFPEYFVLEAIPHAIAMVRAADKKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 807
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQI LKLS
Sbjct: 808 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIGLKLS 867
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
GFI LD+V++L+KPFRDF IWQVR+EASRALL LEFH +
Sbjct: 868 GFIPLDRVIELVKPFRDFQAIWQVRIEASRALLGLEFH--------------------FK 907
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQVKLG+HAMR+CQIKGGS+S++++ + TLVALL LLESRIAFNNVFLRHHLF IL+ILA
Sbjct: 908 GQVKLGVHAMRLCQIKGGSESDNDIKSSTLVALLRLLESRIAFNNVFLRHHLFCILRILA 967
Query: 540 GRAPTLYGVPRDKLLLLGDGE-TSEQKNVFASFVTEMRRAEPPMDVPNLSQDNLAVRDAS 598
GR PTLYGVPRD++ + E SEQKN F + V E + EPP+D PN+S D LA+ +AS
Sbjct: 968 GRLPTLYGVPRDQIPQMDPAEICSEQKNGFITIVKETKSLEPPVDTPNVSHDGLALPEAS 1027
Query: 599 KEVDCVANGHAENILAV 615
+E D V+N H + V
Sbjct: 1028 READTVSNSHERKMPVV 1044
>gi|297734162|emb|CBI15409.3| unnamed protein product [Vitis vinifera]
Length = 1245
Score = 973 bits (2516), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/554 (84%), Positives = 503/554 (90%), Gaps = 2/554 (0%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG FRKILQ I+ RAQ + +RTLSTKEFRHFANKVGNLERPFLKEFFPRWV
Sbjct: 670 MLEKQMGPESFRKILQTIVFRAQDTTRSLRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 729
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
G+CGCPVLR G SYNKRKN+VELAVLR CT PD+ T VL+ N DSENR+ DIGWPGMMS
Sbjct: 730 GSCGCPVLRAGLSYNKRKNLVELAVLRGCTAAPDTNTMVLNGNIDSENREVDIGWPGMMS 789
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IRVHELDGMYDHPILPMAG+ WQLLEIQCHSKLAARR KPKKGSKPDG DDNGD V +
Sbjct: 790 IRVHELDGMYDHPILPMAGETWQLLEIQCHSKLAARRFQKPKKGSKPDGSDDNGD-VPAV 848
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
DMRS+ ESPL W+R DPE+EYLAEIHFNQP QMWINQLE+D DVVAQAQAIA LEALP L
Sbjct: 849 DMRSNTESPLLWLRVDPELEYLAEIHFNQPAQMWINQLERDKDVVAQAQAIATLEALPQL 908
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF+VVN LNNFLSDSKAFWRVRIEAA+ALANTASEETDWAGLLHLVKFYKSRRFD NIGL
Sbjct: 909 SFSVVNALNNFLSDSKAFWRVRIEAAFALANTASEETDWAGLLHLVKFYKSRRFDANIGL 968
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PNDF DF EYFVLEAIPHA+AMVRAAD KSPREAVEFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 969 PKPNDFHDFPEYFVLEAIPHAIAMVRAADKKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 1028
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQI LKLS
Sbjct: 1029 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIGLKLS 1088
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
GFI LD+V++L+KPFRDF IWQVR+EASRALL LEFH GID+ALSLFIK VEEEPS+R
Sbjct: 1089 GFIPLDRVIELVKPFRDFQAIWQVRIEASRALLGLEFHFKGIDAALSLFIKYVEEEPSIR 1148
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQVKLG+HAMR+CQIKGGS+S++++ + TLVALL LLESRIAFNNVFLRHHLF IL+ILA
Sbjct: 1149 GQVKLGVHAMRLCQIKGGSESDNDIKSSTLVALLRLLESRIAFNNVFLRHHLFCILRILA 1208
Query: 540 GRAPTLYGVPRDKL 553
GR PTLYGVPRD++
Sbjct: 1209 GRLPTLYGVPRDQI 1222
>gi|224132980|ref|XP_002321457.1| predicted protein [Populus trichocarpa]
gi|222868453|gb|EEF05584.1| predicted protein [Populus trichocarpa]
Length = 1359
Score = 962 bits (2488), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 472/628 (75%), Positives = 529/628 (84%), Gaps = 5/628 (0%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
MLEKQMG FFRKILQ +ISRA+ PVR+LSTKEFRHFA KVGNLERPF+KEFF RWV
Sbjct: 447 MLEKQMGPEFFRKILQKVISRARDTIPVRSLSTKEFRHFATKVGNLERPFVKEFFLRWVC 506
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
+CGCPVLRMGFSYNKRKN+VELAVLR+ T PD+ L N DSENR+GDIGWPGMMSI
Sbjct: 507 SCGCPVLRMGFSYNKRKNMVELAVLREFTAAPDANASFL--NLDSENREGDIGWPGMMSI 564
Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
RV+ELDGMYDHP+LP+AG+ WQLLEIQCHSKLAARR KPKK SKPDG D+NGD A D
Sbjct: 565 RVYELDGMYDHPVLPLAGEMWQLLEIQCHSKLAARRFQKPKKSSKPDGFDENGDVPAS-D 623
Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
MRSS+ESPLSWIRADPEMEYLAEIHFNQP+QMWINQLE+D DVVAQAQAIAAL+ LP LS
Sbjct: 624 MRSSLESPLSWIRADPEMEYLAEIHFNQPIQMWINQLERDEDVVAQAQAIAALKTLPQLS 683
Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
F+V N +NNFL+D+KAFWRVRIE A+ALANTASEE DWAGLLHLVKFYKSRRFD IGLP
Sbjct: 684 FSVTNAMNNFLNDTKAFWRVRIETAFALANTASEENDWAGLLHLVKFYKSRRFDAAIGLP 743
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
+PNDF DF EYFVLEAIPHAVA VRAAD KSPREAVEF+LQLLKYNDN GNPYSDVFWLA
Sbjct: 744 KPNDFHDFPEYFVLEAIPHAVAKVRAADKKSPREAVEFILQLLKYNDNTGNPYSDVFWLA 803
Query: 361 ALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSG 420
ALVQSVGELEFGQQ++LFLSSLLKRID LLQFDRLM SYNGILTISCIRTLTQIALKLSG
Sbjct: 804 ALVQSVGELEFGQQTVLFLSSLLKRIDCLLQFDRLMLSYNGILTISCIRTLTQIALKLSG 863
Query: 421 FISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRG 480
I D V +LIKPFRDF TIWQ+R+EASRALLDLEFHC G+D+ALSLFI +EEEPSLRG
Sbjct: 864 SIHHDHVFELIKPFRDFKTIWQIRIEASRALLDLEFHCKGMDAALSLFITYLEEEPSLRG 923
Query: 481 QVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILAG 540
Q KLG HAMR+CQI+ SDS + TL+AL+ LLE I FNN LRHHLF ILQILAG
Sbjct: 924 QAKLGAHAMRLCQIQDESDSEDAIKCTTLLALIRLLEGHIGFNNTILRHHLFCILQILAG 983
Query: 541 RAPTLYGVPRDKLLLLGDGET-SEQKNVFASFVTEMRRAEPPMDVPNLSQDNLAVRDASK 599
RA TLYG+PRD+ L +GD ET S+ +N+FA VTE + EPPM++P L+QDN A +A K
Sbjct: 984 RAATLYGIPRDRTLCIGDSETCSDPRNIFAGLVTETKPLEPPMEIPKLAQDNFAFPEAIK 1043
Query: 600 EVDCVANGHAENI-LAVPEAQRMQMSFP 626
E D ++N + +A+PE +++ P
Sbjct: 1044 EADIISNKDQHKMDMAIPEEASEEVAVP 1071
>gi|356563105|ref|XP_003549806.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
isoform 1 [Glycine max]
Length = 1388
Score = 908 bits (2346), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/593 (76%), Positives = 512/593 (86%), Gaps = 5/593 (0%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG FR+ILQ I+SRAQ + ++TLSTKEFRHFANKVGNLERPFLK+FFPRWV
Sbjct: 450 MLEKQMGPESFRRILQTIVSRAQDKTRSIKTLSTKEFRHFANKVGNLERPFLKDFFPRWV 509
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
G+CGCPVLRMGFSYNKRKN+VELAVLR CT S T +L N D+E RDGD GWPGMMS
Sbjct: 510 GSCGCPVLRMGFSYNKRKNMVELAVLRGCTALQTSNTSILDINPDTETRDGDTGWPGMMS 569
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IRV+ELDGMYDHPILPMAGDAWQLLEIQCHSKLAARR KPKKG K DG DDNGD V +
Sbjct: 570 IRVYELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRFQKPKKGLKLDGSDDNGD-VPSM 628
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
DMRS+ ESPL WIRADP+MEYLAE+HFNQPVQMWINQLEKD DV+AQAQAIAALEA P L
Sbjct: 629 DMRSNTESPLLWIRADPDMEYLAEVHFNQPVQMWINQLEKDKDVIAQAQAIAALEASPQL 688
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++VN LNNFLSDSKAFWRVRIEAA+ALAN+ASEETD++GLLHL+KFYKSRRFD +IGL
Sbjct: 689 SFSIVNALNNFLSDSKAFWRVRIEAAFALANSASEETDFSGLLHLMKFYKSRRFDTDIGL 748
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PNDF DF+EYFVLEAIPHAVAMVRAAD KSPREA+EFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 749 PKPNDFHDFAEYFVLEAIPHAVAMVRAADKKSPREAIEFVLQLLKYNDNNGNPYSDVFWL 808
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
AALVQSVGELEFGQQSIL LSSLLKRIDRLLQFD LMPSYNGILTISCIRTLTQIALKLS
Sbjct: 809 AALVQSVGELEFGQQSILLLSSLLKRIDRLLQFDSLMPSYNGILTISCIRTLTQIALKLS 868
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
GFI LD+V L+KPFRD +WQVR+EASRALLDLEFHC G+DSAL LFIK +EEE SLR
Sbjct: 869 GFIPLDRVYGLVKPFRDIKALWQVRIEASRALLDLEFHCKGMDSALLLFIKYIEEEHSLR 928
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQ+KL H MR+CQ++ G +SN E+ + TLV++LNLLE RIAFNN FLRH+LF ILQILA
Sbjct: 929 GQLKLATHVMRLCQMRDGLNSNDEITSQTLVSMLNLLEGRIAFNNAFLRHYLFCILQILA 988
Query: 540 GRAPTLYGVPRD-KLLLLGDGETSE-QKNVFASFVTEMRRAEPPMDVPNLSQD 590
R PTL+G+PR+ ++L + E S QKN+ A +E + + P + +L+Q+
Sbjct: 989 RRHPTLHGIPRENRMLHMSLTEASNYQKNMLA-LDSESKPLDLPSSIDDLTQN 1040
>gi|356513902|ref|XP_003525647.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Glycine max]
Length = 1388
Score = 905 bits (2338), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 461/618 (74%), Positives = 522/618 (84%), Gaps = 15/618 (2%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG FR+ILQ I+SRAQ + ++TLSTKEFRHFANKVGNLERPFLK+FFPRWV
Sbjct: 450 MLEKQMGPESFRRILQTIVSRAQDKTRSIKTLSTKEFRHFANKVGNLERPFLKDFFPRWV 509
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
+CGCPVLRMGFSYNKRKN+VELAVLR CT S T +L N D+E RDGDIGWPGMMS
Sbjct: 510 SSCGCPVLRMGFSYNKRKNMVELAVLRGCTTLQTSSTSILDINPDTETRDGDIGWPGMMS 569
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IRV+ELDGMYDHPILPMAG+AWQLLEIQCHSKLAARR KPKKG K DG DDNGD V +
Sbjct: 570 IRVYELDGMYDHPILPMAGEAWQLLEIQCHSKLAARRFQKPKKGLKLDGSDDNGD-VPSM 628
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
DMR + ESPL WIRADP+MEYLAE+HFNQPVQMWINQLEKD DV+AQAQAIAALEA P L
Sbjct: 629 DMRLNTESPLLWIRADPDMEYLAEVHFNQPVQMWINQLEKDKDVIAQAQAIAALEASPQL 688
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++VN LNNFLSDSKAFWRVRIEAA+ALAN+ASEETD++GLLHLVKFYKSRRFD +IGL
Sbjct: 689 SFSIVNALNNFLSDSKAFWRVRIEAAFALANSASEETDFSGLLHLVKFYKSRRFDPDIGL 748
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PNDF+DF+EYFVLEAIPHAVAMVRAAD KSPREA+EFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 749 PKPNDFQDFAEYFVLEAIPHAVAMVRAADKKSPREAIEFVLQLLKYNDNNGNPYSDVFWL 808
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
AALVQSVGELEFGQQSIL LSSLLKRIDRLLQFD LMPSYNGILTISCIRTLTQIALKLS
Sbjct: 809 AALVQSVGELEFGQQSILLLSSLLKRIDRLLQFDSLMPSYNGILTISCIRTLTQIALKLS 868
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
GFI LD+V +L+KPFRD +WQV++EAS+ALLDLEFHC G+DSAL LFIK +EEE SLR
Sbjct: 869 GFIPLDRVYELVKPFRDLKALWQVQIEASKALLDLEFHCKGMDSALLLFIKYIEEEHSLR 928
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQ+KL H MR+CQ++ G +SN E+ + TLV++LNLLE RIAFNNV LRH+LF ILQILA
Sbjct: 929 GQLKLATHVMRLCQMRDGLNSNDEITSQTLVSMLNLLEGRIAFNNVSLRHYLFCILQILA 988
Query: 540 GRAPTLYGVPR-DKLLLLGDGETSE-QKNVFASFVTEMRRAEPPMDVPNLSQDNL----- 592
R PTL+G+PR +++L + E QKN+FA +E + + P NL+Q NL
Sbjct: 989 RRPPTLHGIPRGNRMLHMSLAEACNYQKNIFA-LDSESKPLDLPSSTKNLTQ-NLGPTME 1046
Query: 593 ----AVRDASKEVDCVAN 606
AV +A K+ C A+
Sbjct: 1047 GLRDAVDEAPKDQPCEAS 1064
>gi|449439379|ref|XP_004137463.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Cucumis sativus]
Length = 1362
Score = 903 bits (2333), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 449/646 (69%), Positives = 517/646 (80%), Gaps = 33/646 (5%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQG-ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG FRKILQNI+S A+ S + LSTKEFR ANK+GNLERPFLKEFFPRWV
Sbjct: 448 MLEKQMGPESFRKILQNIVSHAKDTGSTSQLLSTKEFRQLANKIGNLERPFLKEFFPRWV 507
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
+CGCP+LRMGFSYNKRKN+VE+AV R+CT P T+ ENRD D GWPGMMS
Sbjct: 508 ESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP---------ATNVENRDSDAGWPGMMS 558
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IR++ELDG++DHP+LPM G++WQLLEIQCHSKLAARR K KKGSKPDG DDN D + L
Sbjct: 559 IRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTKKGSKPDGSDDNAD-IPAL 617
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
D+RSS+ESPL W+RADPEMEYLAEIHF+QPVQMWINQLEKD DV+AQAQAIA LE LP
Sbjct: 618 DIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKDVIAQAQAIATLEMLPQP 677
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++VN LNNFL D KAFWRVRIEAA A+A TASE+TDWAGLL+L+KF+KS+RFD + GL
Sbjct: 678 SFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLLNLIKFFKSQRFDADTGL 737
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PN+FRDF EYFVLEAIPHAVAMVR D KSPREAVEFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 738 PKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 797
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
AALVQSVGELEFGQQSILFL+SLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS
Sbjct: 798 AALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 857
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
G +SLD++++LI+PFRDFN++WQVR+EA+R+LLDLE+HCNGID+ L LFIK +EEE SLR
Sbjct: 858 GLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGIDATLLLFIKYLEEENSLR 917
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQVKL +H MR+CQI S SN V+ TLVALL LLE +AFNNV+LRH+LF ILQ+L+
Sbjct: 918 GQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAFNNVYLRHYLFSILQVLS 977
Query: 540 GRAPTLYGVPRD-KLLLLGD-GETSEQKNVFASFVTEMRRAEP----------------- 580
GR+PTLYGVPR+ K L +GD G SEQK + S + E EP
Sbjct: 978 GRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSIIPEFNPPEPSSVSAVAPMPCIPATLS 1037
Query: 581 --PMDVPNLSQDNLAVRDASKEVDCVANGHAENILAVPEAQRMQMS 624
P+ VP DNLAV + SKE +A + +A+ EA R S
Sbjct: 1038 SEPLHVPTPRPDNLAVPELSKEEGAIAED-PKQAMAIVEAPREAAS 1082
>gi|449516818|ref|XP_004165443.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Cucumis sativus]
Length = 1362
Score = 901 bits (2329), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 449/646 (69%), Positives = 517/646 (80%), Gaps = 33/646 (5%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQG-ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG FRKILQNI+S A+ S + LSTKEFR ANK+GNLERPFLKEFFPRWV
Sbjct: 448 MLEKQMGPESFRKILQNIVSHAKDTGSTSQLLSTKEFRQLANKIGNLERPFLKEFFPRWV 507
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
+CGCP+LRMGFSYNKRKN+VE+AV R+CT P T+ ENRD D GWPGMMS
Sbjct: 508 ESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP---------ATNVENRDSDAGWPGMMS 558
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IR++ELDG++DHP+LPM G++WQLLEIQCHSKLAARR K KKGSKPDG DDN D + L
Sbjct: 559 IRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTKKGSKPDGSDDNAD-IPAL 617
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
D+RSS+ESPL W+RADPEMEYLAEIHF+QPVQMWINQLEKD DV+AQAQAIA LE LP
Sbjct: 618 DIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKDVIAQAQAIATLEMLPQP 677
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++VN LNNFL D KAFWRVRIEAA A+A TASE+TDWAGLL+L+KF+KS+RFD + GL
Sbjct: 678 SFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLLNLIKFFKSQRFDADTGL 737
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PN+FRDF EYFVLEAIPHAVAMVR D KSPREAVEFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 738 PKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 797
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
AALVQSVGELEFGQQSILFL+SLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS
Sbjct: 798 AALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 857
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
G +SLD++++LI+PFRDFN++WQVR+EA+R+LLDLE+HCNGID+ L LFIK +EEE SLR
Sbjct: 858 GLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGIDATLLLFIKYLEEENSLR 917
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQVKL +H MR+CQI S SN V+ TLVALL LLE +AFNNV+LRH+LF ILQ+L+
Sbjct: 918 GQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAFNNVYLRHYLFCILQVLS 977
Query: 540 GRAPTLYGVPRD-KLLLLGD-GETSEQKNVFASFVTEMRRAEP----------------- 580
GR+PTLYGVPR+ K L +GD G SEQK + S + E EP
Sbjct: 978 GRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEPSSVSAVAPMPCIPATLS 1037
Query: 581 --PMDVPNLSQDNLAVRDASKEVDCVANGHAENILAVPEAQRMQMS 624
P+ VP DNLAV + SKE +A + +A+ EA R S
Sbjct: 1038 SEPLHVPTPRPDNLAVPELSKEEGAIAED-PKQAMAIVEAPREAAS 1082
>gi|356563107|ref|XP_003549807.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
isoform 2 [Glycine max]
Length = 1368
Score = 871 bits (2250), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 440/593 (74%), Positives = 496/593 (83%), Gaps = 25/593 (4%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG FR+ILQ I+SRAQ + ++TLSTKEFRHFANKVGNLERPFLK+FFPRWV
Sbjct: 450 MLEKQMGPESFRRILQTIVSRAQDKTRSIKTLSTKEFRHFANKVGNLERPFLKDFFPRWV 509
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
G+CGCPVLRMGFSYNKRKN+VELAVLR CT S T +L N D+E RDGD GWPGMMS
Sbjct: 510 GSCGCPVLRMGFSYNKRKNMVELAVLRGCTALQTSNTSILDINPDTETRDGDTGWPGMMS 569
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IRV+ELDGMYDHPILPMAGDAWQLLEIQCHSKLAARR KPKKG K DG DDNGD V +
Sbjct: 570 IRVYELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRFQKPKKGLKLDGSDDNGD-VPSM 628
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
DMRS+ ESPL WIRADP+MEYLAE+HFNQPVQMWINQLEKD DV+AQAQAIAALEA P L
Sbjct: 629 DMRSNTESPLLWIRADPDMEYLAEVHFNQPVQMWINQLEKDKDVIAQAQAIAALEASPQL 688
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++VN LNNFLSDSKAFWRVRIEAA+ALAN+ASEETD++GLLHL+KFYKSRRFD +IGL
Sbjct: 689 SFSIVNALNNFLSDSKAFWRVRIEAAFALANSASEETDFSGLLHLMKFYKSRRFDTDIGL 748
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PNDF DF+EYFVLEAIPHAVAMVRAAD KSPREA+EFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 749 PKPNDFHDFAEYFVLEAIPHAVAMVRAADKKSPREAIEFVLQLLKYNDNNGNPYSDVFWL 808
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
AALVQSVGELEFGQQSIL LSSLLKRIDRLLQFD LMPSYNGILTISCIRTLTQIALKLS
Sbjct: 809 AALVQSVGELEFGQQSILLLSSLLKRIDRLLQFDSLMPSYNGILTISCIRTLTQIALKLS 868
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
GFI LD+V L+KPFRD +WQVR+EASRALLDLEFHC +
Sbjct: 869 GFIPLDRVYGLVKPFRDIKALWQVRIEASRALLDLEFHC--------------------K 908
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQ+KL H MR+CQ++ G +SN E+ + TLV++LNLLE RIAFNN FLRH+LF ILQILA
Sbjct: 909 GQLKLATHVMRLCQMRDGLNSNDEITSQTLVSMLNLLEGRIAFNNAFLRHYLFCILQILA 968
Query: 540 GRAPTLYGVPRD-KLLLLGDGETSE-QKNVFASFVTEMRRAEPPMDVPNLSQD 590
R PTL+G+PR+ ++L + E S QKN+ A +E + + P + +L+Q+
Sbjct: 969 RRHPTLHGIPRENRMLHMSLTEASNYQKNMLA-LDSESKPLDLPSSIDDLTQN 1020
>gi|297842145|ref|XP_002888954.1| membrane alanyl aminopeptidase [Arabidopsis lyrata subsp. lyrata]
gi|297334795|gb|EFH65213.1| membrane alanyl aminopeptidase [Arabidopsis lyrata subsp. lyrata]
Length = 1390
Score = 843 bits (2178), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/596 (68%), Positives = 483/596 (81%), Gaps = 4/596 (0%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMGS+ FRKILQ IISRA+ S +R+LSTKEFR FANK+GNLERPFLKEFF RWV
Sbjct: 450 MLEKQMGSDSFRKILQKIISRAKDPSNSIRSLSTKEFRQFANKIGNLERPFLKEFFQRWV 509
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
+CGCPVLR+G SYNKRKN VE+A LR+CT D+R V+ + +DSE+RD D GWPG+MS
Sbjct: 510 SSCGCPVLRIGLSYNKRKNNVEMAALRECTAALDARLSVIGATSDSESRDVDAGWPGIMS 569
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IRV+ELDGM DHP LPMAGD WQLLE+ CHSKLAA+R KPKKG KPDG +DN DA+A L
Sbjct: 570 IRVYELDGMSDHPKLPMAGDRWQLLELPCHSKLAAKRYQKPKKGGKPDGAEDNVDAIAPL 629
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
+ ++S+ESPL+WI+ADPEMEY+AEIH +QP+QMW+NQLEKDGDVVAQAQAIA+LEAL
Sbjct: 630 ENKTSIESPLAWIKADPEMEYIAEIHLHQPLQMWVNQLEKDGDVVAQAQAIASLEALKQH 689
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++VN L N LSDSK FWR+RI AA+ALA TASEETDWAGL HL+KFYKSRRFD IGL
Sbjct: 690 SFSIVNALKNVLSDSKVFWRIRIGAAFALAKTASEETDWAGLQHLIKFYKSRRFDAEIGL 749
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PN+F DF EYFVLEAIPHA+A VR A+ KSPREAVEF+LQLLKYNDN+GN YSDVFWL
Sbjct: 750 PKPNNFHDFPEYFVLEAIPHAIARVRGAEGKSPREAVEFILQLLKYNDNSGNRYSDVFWL 809
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
A LVQSVG+LEF QQS+ FL+ LLKRIDRLLQFDRLMPSYNGILTISCIRTL Q ALKLS
Sbjct: 810 AVLVQSVGDLEFCQQSLTFLAPLLKRIDRLLQFDRLMPSYNGILTISCIRTLAQTALKLS 869
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
IS D + KLI+PFR+ +TI Q+R+EASRALLD+E+ GI S L LF+K V EE SLR
Sbjct: 870 DSISFDHICKLIEPFRNSDTILQIRIEASRALLDIEYQSKGISSTLLLFMKYVVEESSLR 929
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQVKL +H MR+CQI G DS+ VDTV+L+ LL+L +S + FNN LR++LF I QILA
Sbjct: 930 GQVKLCVHTMRLCQIAVGCDSDDCVDTVSLLDLLHLFKSHVVFNNELLRYYLFCIFQILA 989
Query: 540 GRAPTLYGVPRDKLLLLGD-GETSEQKNVFASFVTEMRRAEPPMDVPNLSQDNLAV 594
GR PTL+GVP++K L L D E KNVF+ V EP + + + + +L V
Sbjct: 990 GRPPTLFGVPKEKPLQLVDVAACIEPKNVFS--VPGAEAGEPSLALGDANGQSLDV 1043
>gi|22330618|ref|NP_177536.2| TBP-associated factor 2 [Arabidopsis thaliana]
gi|20856938|gb|AAM26691.1| At1g73960/F2P9_17 [Arabidopsis thaliana]
gi|332197409|gb|AEE35530.1| TBP-associated factor 2 [Arabidopsis thaliana]
Length = 1390
Score = 828 bits (2139), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/570 (71%), Positives = 474/570 (83%), Gaps = 2/570 (0%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMGS+ FRKILQ IISRA+ S +R+LSTKEFR FANK+GNLERPFLKEFF RWV
Sbjct: 450 MLEKQMGSDSFRKILQKIISRAKDPSNSIRSLSTKEFRQFANKIGNLERPFLKEFFQRWV 509
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
+ GCPVLR+G SYNKRKN VE+A LR+CT D+R V+ + +DSE+RD D GWPG+MS
Sbjct: 510 ASYGCPVLRIGLSYNKRKNNVEMAALRECTAALDARLSVIGATSDSESRDVDAGWPGIMS 569
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IRV+ELDGM DHP LPMAGD WQLLE+ CHSKLAA+R KPKKG KPDG +DN DA+A L
Sbjct: 570 IRVYELDGMSDHPKLPMAGDRWQLLELPCHSKLAAKRYQKPKKGGKPDGAEDNVDAIAPL 629
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
+ ++S+ESPL+WI+ADPEMEY+AEIH +QP+QMW+NQLEKDGDVVAQAQAIA+LEAL
Sbjct: 630 ENKTSIESPLAWIKADPEMEYIAEIHLHQPLQMWVNQLEKDGDVVAQAQAIASLEALKQH 689
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++VN L N L+DSK FWR+RI AA+ALA TASEE+DWAGL HL+KFYKSRRFD IGL
Sbjct: 690 SFSIVNALKNVLTDSKVFWRIRIAAAFALAKTASEESDWAGLQHLIKFYKSRRFDAEIGL 749
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PNDFRDF EYFVLEAIPHA+A+VR A+ KSPREAVEF+LQLLKYNDN+GN YSDVFWL
Sbjct: 750 PKPNDFRDFPEYFVLEAIPHAIAIVRGAEGKSPREAVEFILQLLKYNDNSGNSYSDVFWL 809
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
A LVQSVG+LEF QQS+ FL+ LLKRIDRLLQFDRLMPSYNGILTISCIRTL Q ALKLS
Sbjct: 810 AVLVQSVGDLEFCQQSLTFLAPLLKRIDRLLQFDRLMPSYNGILTISCIRTLAQTALKLS 869
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
IS D + KLI+PFR+ +TI Q+R+E SRALLD+E+ GI SAL LF+K + EE SLR
Sbjct: 870 DSISFDHICKLIEPFRNSDTILQIRIEGSRALLDIEYQSKGISSALLLFMKYLVEESSLR 929
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQVKL +H MR+CQI G DS+ VDTVTL+ LL+L +S + FNN LR++LF I QILA
Sbjct: 930 GQVKLCVHTMRLCQIAVGCDSDDCVDTVTLLDLLHLFKSHVVFNNELLRYYLFCIFQILA 989
Query: 540 GRAPTLYGVPRDKLLLLGDGETS-EQKNVF 568
GR PTL+GVP++K L L D E E KNVF
Sbjct: 990 GRPPTLFGVPKEKPLQLVDVEACIEPKNVF 1019
>gi|79321265|ref|NP_001031279.1| TBP-associated factor 2 [Arabidopsis thaliana]
gi|332197410|gb|AEE35531.1| TBP-associated factor 2 [Arabidopsis thaliana]
Length = 1370
Score = 796 bits (2055), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/570 (68%), Positives = 460/570 (80%), Gaps = 22/570 (3%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMGS+ FRKILQ IISRA+ S +R+LSTKEFR FANK+GNLERPFLKEFF RWV
Sbjct: 450 MLEKQMGSDSFRKILQKIISRAKDPSNSIRSLSTKEFRQFANKIGNLERPFLKEFFQRWV 509
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
+ GCPVLR+G SYNKRKN VE+A LR+CT D+R V+ + +DSE+RD D GWPG+MS
Sbjct: 510 ASYGCPVLRIGLSYNKRKNNVEMAALRECTAALDARLSVIGATSDSESRDVDAGWPGIMS 569
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IRV+ELDGM DHP LPMAGD WQLLE+ CHSKLAA+R KPKKG KPDG +DN DA+A L
Sbjct: 570 IRVYELDGMSDHPKLPMAGDRWQLLELPCHSKLAAKRYQKPKKGGKPDGAEDNVDAIAPL 629
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
+ ++S+ESPL+WI+ADPEMEY+AEIH +QP+QMW+NQLEKDGDVVAQAQAIA+LEAL
Sbjct: 630 ENKTSIESPLAWIKADPEMEYIAEIHLHQPLQMWVNQLEKDGDVVAQAQAIASLEALKQH 689
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++VN L N L+DSK FWR+RI AA+ALA TASEE+DWAGL HL+KFYKSRRFD IGL
Sbjct: 690 SFSIVNALKNVLTDSKVFWRIRIAAAFALAKTASEESDWAGLQHLIKFYKSRRFDAEIGL 749
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PNDFRDF EYFVLEAIPHA+A+VR A+ KSPREAVEF+LQLLKYNDN+GN YSDVFWL
Sbjct: 750 PKPNDFRDFPEYFVLEAIPHAIAIVRGAEGKSPREAVEFILQLLKYNDNSGNSYSDVFWL 809
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
A LVQSVG+LEF QQS+ FL+ LLKRIDRLLQFDRLMPSYNGILTISCIRTL Q ALKLS
Sbjct: 810 AVLVQSVGDLEFCQQSLTFLAPLLKRIDRLLQFDRLMPSYNGILTISCIRTLAQTALKLS 869
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
IS D + KLI+PFR+ +TI Q+R+E SRALLD+E+ +
Sbjct: 870 DSISFDHICKLIEPFRNSDTILQIRIEGSRALLDIEYQS--------------------K 909
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQVKL +H MR+CQI G DS+ VDTVTL+ LL+L +S + FNN LR++LF I QILA
Sbjct: 910 GQVKLCVHTMRLCQIAVGCDSDDCVDTVTLLDLLHLFKSHVVFNNELLRYYLFCIFQILA 969
Query: 540 GRAPTLYGVPRDKLLLLGDGETS-EQKNVF 568
GR PTL+GVP++K L L D E E KNVF
Sbjct: 970 GRPPTLFGVPKEKPLQLVDVEACIEPKNVF 999
>gi|38490055|gb|AAR21621.1| TFIID component TAF2 [Arabidopsis thaliana]
Length = 1370
Score = 795 bits (2054), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/570 (68%), Positives = 460/570 (80%), Gaps = 22/570 (3%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMGS+ FRKILQ IISRA+ S +R+LSTKEFR FANK+GNLERPFLKEFF RWV
Sbjct: 450 MLEKQMGSDSFRKILQKIISRAKDPSNSIRSLSTKEFRQFANKIGNLERPFLKEFFQRWV 509
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
+ GCPVLR+G SYNKRKN VE+A LR+CT D+R V+ + +DSE+RD D GWPG+MS
Sbjct: 510 ASYGCPVLRIGLSYNKRKNNVEMAALRECTAALDARLSVIGATSDSESRDVDAGWPGIMS 569
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IRV+ELDGM DHP LPMAGD WQLLE+ CHSKLAA+R KPKKG KPDG +DN DA+A L
Sbjct: 570 IRVYELDGMSDHPKLPMAGDRWQLLELPCHSKLAAKRYQKPKKGGKPDGAEDNVDAIAPL 629
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
+ ++S+ESPL+WI+ADPEMEY+AEIH +QP+QMW+NQLEKDGDVVAQAQAIA+LEAL
Sbjct: 630 ENKTSIESPLAWIKADPEMEYIAEIHLHQPLQMWVNQLEKDGDVVAQAQAIASLEALKQH 689
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++VN L N L+DSK FWR+RI AA+ALA TASEE+DWAGL HL+KFYKSRRFD IGL
Sbjct: 690 SFSIVNALKNVLTDSKVFWRIRIAAAFALAKTASEESDWAGLQHLIKFYKSRRFDAEIGL 749
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+PNDFRDF EYFVLEAIPHA+A+VR A+ KSPREAVEF+LQLLKYNDN+GN YSDVFWL
Sbjct: 750 PKPNDFRDFPEYFVLEAIPHAIAIVRGAEGKSPREAVEFILQLLKYNDNSGNSYSDVFWL 809
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
A LVQSVG+LEF QQS+ FL+ LLKRIDRLLQFDRLMPSYNGILTISCIRTL Q ALKLS
Sbjct: 810 AVLVQSVGDLEFCQQSLTFLAPLLKRIDRLLQFDRLMPSYNGILTISCIRTLAQTALKLS 869
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
IS D + KLI+PFR+ +TI Q+R+E SRALLD+E+ +
Sbjct: 870 DSISFDHICKLIEPFRNSDTILQIRIEGSRALLDIEYQS--------------------K 909
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
GQVKL +H MR+CQI G DS+ VDTVTL+ LL+L +S + FNN LR++LF I QILA
Sbjct: 910 GQVKLCVHTMRLCQIAVGCDSDDCVDTVTLLDLLHLFKSHVVFNNELLRYYLFCIFQILA 969
Query: 540 GRAPTLYGVPRDKLLLLGDGETS-EQKNVF 568
GR PTL+GVP++K L L D E E KNVF
Sbjct: 970 GRPPTLFGVPKEKPLQLVDVEACIEPKNVF 999
>gi|222641555|gb|EEE69687.1| hypothetical protein OsJ_29324 [Oryza sativa Japonica Group]
Length = 1503
Score = 779 bits (2012), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/640 (60%), Positives = 480/640 (75%), Gaps = 21/640 (3%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
MLEKQMG + FRKILQ I++ + + RTLSTKEFRH ANKVGNLERPFLKEFFPRWV
Sbjct: 597 MLEKQMGPDSFRKILQMIVAPTRAS---RTLSTKEFRHLANKVGNLERPFLKEFFPRWVE 653
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
+ GCPV+R+G SY+KR+N+VELAV R CT K D P + +N DS R+GD GWPGMMS+
Sbjct: 654 SSGCPVMRLGISYSKRRNLVELAVSRGCTTKVDP-GPDIRTNGDS--REGDTGWPGMMSV 710
Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
RVHE DG+YDHPI+PMAG+A Q++EIQCHSK+AA+R K KKGSKPDG D+N DA + D
Sbjct: 711 RVHETDGVYDHPIVPMAGEALQVVEIQCHSKVAAKRFQKTKKGSKPDGSDENIDA-SNQD 769
Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
R+SM++PL WIR DPEMEYLAEIHF+QPVQMWINQLEKD DV++Q+QAI+ LE P L+
Sbjct: 770 NRASMDAPLLWIRVDPEMEYLAEIHFHQPVQMWINQLEKDKDVISQSQAISVLEKSPQLT 829
Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
F V N LNNFL+D+KAFWRVR+EAAYALA TASE T+ GLLHLVKFYKSRRFD +IGLP
Sbjct: 830 FAVTNALNNFLNDTKAFWRVRVEAAYALAVTASEGTELTGLLHLVKFYKSRRFDADIGLP 889
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
RPNDF D EYFVLEAIPHAVA+VR+AD SP+EA+EF+LQLLKYNDNNGN YSDV+WL+
Sbjct: 890 RPNDFHDIPEYFVLEAIPHAVALVRSADKSSPKEAIEFILQLLKYNDNNGNVYSDVYWLS 949
Query: 361 ALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSG 420
A+VQ++GELEFGQQ + LSSLLKRIDRLLQFD MP YNG+LT+SCIRTL +IA ++S
Sbjct: 950 AMVQAIGELEFGQQGVGLLSSLLKRIDRLLQFDNFMPGYNGVLTVSCIRTLARIAQRVSS 1009
Query: 421 FISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRG 480
I LD+V +LI P+R+ + W+VR+EA R L+DLEFH G+D+AL LF+K EE SLRG
Sbjct: 1010 SICLDRVCELIVPYRNMDKPWKVRMEAGRVLIDLEFHHKGLDAALLLFLKYANEERSLRG 1069
Query: 481 QVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILAG 540
KL +H +R+CQ S N+++ TLV LL LL + A+NNV+LRH++F ILQI AG
Sbjct: 1070 GTKLAVHVLRLCQANIESHDNNQIQLPTLVGLLCLLAGKKAYNNVYLRHNVFCILQIAAG 1129
Query: 541 RAPTLYGVPR--DKLLLLGDGETSEQKNVFASFVTEMRRAEPPMDVPNLSQ--------- 589
R+PTL+GVP+ ++ + + + +S + R EP P++ +
Sbjct: 1130 RSPTLHGVPKVVTPPQVVLEISSDQHTKADSSVPQQSRPQEPSTSTPSVREVLPTSGPLK 1189
Query: 590 --DNLAVRDASKEVDCVANGHAENILAVPEAQRMQMSFPT 627
DN++ + V + A+NI E +R +S PT
Sbjct: 1190 DADNISNCSERRNVIFIPTKDADNISNCSE-RRNVISIPT 1228
>gi|218202136|gb|EEC84563.1| hypothetical protein OsI_31331 [Oryza sativa Indica Group]
Length = 1511
Score = 779 bits (2012), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 384/640 (60%), Positives = 479/640 (74%), Gaps = 21/640 (3%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
MLEKQMG + FRKILQ I++ + + RTLSTKEFRH ANKVGNLERPFLKEFFPRWV
Sbjct: 604 MLEKQMGPDSFRKILQMIVAPTRAS---RTLSTKEFRHLANKVGNLERPFLKEFFPRWVE 660
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
+ GCPV+R+G SY+KR+N+VELAV R CT K D P + +N DS R+GD GWPGMMS+
Sbjct: 661 SSGCPVMRLGISYSKRRNLVELAVSRGCTTKVDP-GPDIRTNGDS--REGDTGWPGMMSV 717
Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
RVHE DG+YDHPI+PMAG+A Q++EIQCHSK+AA+R K KKGSKPDG D+N DA + D
Sbjct: 718 RVHETDGVYDHPIVPMAGEALQVVEIQCHSKVAAKRFQKTKKGSKPDGSDENIDA-SNQD 776
Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
R+SM++PL WIR DPEMEYLAEIHF+QPVQMWINQLEKD DV++Q+QAI+ LE P L+
Sbjct: 777 NRASMDAPLLWIRVDPEMEYLAEIHFHQPVQMWINQLEKDKDVISQSQAISVLEKSPQLT 836
Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
F V N LNNFL+D+KAFWRVR+EAAYALA TASE T+ GLLHLVKFYKSRRFD +IGLP
Sbjct: 837 FAVTNALNNFLNDTKAFWRVRVEAAYALAVTASEGTELTGLLHLVKFYKSRRFDADIGLP 896
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
RPNDF D EYFVLEAIPHAVA+VR+AD SP+EA+EF+LQLLKYNDNNGN YSDV+WL+
Sbjct: 897 RPNDFHDIPEYFVLEAIPHAVALVRSADKSSPKEAIEFILQLLKYNDNNGNVYSDVYWLS 956
Query: 361 ALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSG 420
A+VQ++GELEFGQQ + LSSLLKRIDRLLQFD MP YNG+LT+SCIRTL +IA ++S
Sbjct: 957 AMVQAIGELEFGQQGVGLLSSLLKRIDRLLQFDNFMPGYNGVLTVSCIRTLARIAQRVSS 1016
Query: 421 FISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRG 480
I LD+V +LI P+R+ + W+VR+EA R L+DLEFH G+D+AL LF+K EE SLRG
Sbjct: 1017 SICLDRVCELIVPYRNMDKPWKVRMEAGRVLIDLEFHHKGLDAALLLFLKYANEERSLRG 1076
Query: 481 QVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILAG 540
KL +H +R+CQ S N+++ TLV LL LL + A+NNV+LRH++F ILQI AG
Sbjct: 1077 GTKLAVHVLRLCQANIESHDNNQIQLPTLVGLLCLLAGKKAYNNVYLRHNVFCILQIAAG 1136
Query: 541 RAPTLYGVPR--DKLLLLGDGETSEQKNVFASFVTEMRRAEPPMDVPNLSQ--------- 589
R+PTL+GVP+ ++ + + +S + R EP P++ +
Sbjct: 1137 RSPTLHGVPKVVTPPQVVLEISNDQHTKADSSVPQQSRPQEPSTSTPSVREVLPTSGPLK 1196
Query: 590 --DNLAVRDASKEVDCVANGHAENILAVPEAQRMQMSFPT 627
DN++ + V + A+NI E +R +S PT
Sbjct: 1197 DADNISNCSERRNVIFIPTKDADNISNCSE-RRNVISIPT 1235
>gi|357153634|ref|XP_003576516.1| PREDICTED: uncharacterized protein LOC100821849 [Brachypodium
distachyon]
Length = 1407
Score = 772 bits (1994), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 377/608 (62%), Positives = 469/608 (77%), Gaps = 12/608 (1%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGT 61
LEKQMG + FRKILQ I++ + + RTLSTKEFRH ANKVGNLERPFLKEFFPRWV +
Sbjct: 541 LEKQMGPDSFRKILQMIVASTRAS---RTLSTKEFRHLANKVGNLERPFLKEFFPRWVES 597
Query: 62 CGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIR 121
CGCPV+R+G SY+KR+N+VELAV R CT K D P S+ + + +D G+PGMMS+R
Sbjct: 598 CGCPVMRLGISYSKRRNVVELAVSRGCTAKAD---PGSDSHVNDDIQDSANGFPGMMSVR 654
Query: 122 VHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDM 181
VHE DG+YDHPILPMAG+A Q++E+QCHSKLAA+R K KKGSKPDG D+N +A + +
Sbjct: 655 VHETDGVYDHPILPMAGEALQVVELQCHSKLAAKRFQKTKKGSKPDGSDENVEA-STQEN 713
Query: 182 RSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSF 241
R+SM+SPL WIR DPEMEYLAEIHF+QP+QMWINQLEKD DV++Q+QAIA LE LP +SF
Sbjct: 714 RTSMDSPLLWIRVDPEMEYLAEIHFHQPIQMWINQLEKDKDVISQSQAIAVLEKLPQISF 773
Query: 242 NVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPR 301
V+N LNNFL+D+KAFWRVR+EAAYALA TASE+TD GLLHLVKFYKSRRFD +IGLPR
Sbjct: 774 AVINALNNFLNDTKAFWRVRVEAAYALAVTASEDTDLTGLLHLVKFYKSRRFDADIGLPR 833
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
PNDF D EYFVLEAIPHAVA+VR++D SPR+A+EFVLQLLKYNDNNGN YSDV+WLAA
Sbjct: 834 PNDFNDIPEYFVLEAIPHAVALVRSSDKSSPRQAIEFVLQLLKYNDNNGNVYSDVYWLAA 893
Query: 362 LVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGF 421
+VQ++GELEFGQQ I LSSLLKRIDRLLQFD MP YNG+LT+SCIR L +IA ++S
Sbjct: 894 MVQAIGELEFGQQGIGLLSSLLKRIDRLLQFDNFMPGYNGVLTVSCIRALARIAERVSSS 953
Query: 422 ISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQ 481
I +D++ +LI PFR+ + W+V++EASR L+DLE H G+D+AL LF+K EE SL+G
Sbjct: 954 ICIDRICELIAPFRNIDKPWKVQIEASRVLIDLEHHHKGLDAALLLFLKYTYEEKSLQGG 1013
Query: 482 VKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILAGR 541
KL +H +R+CQ D ++ T+V LL+LL SR A+NNVFLRH++F ILQI AGR
Sbjct: 1014 TKLAVHVLRLCQANVEPDLTDQIKLPTVVGLLHLLGSRKAYNNVFLRHNVFCILQIAAGR 1073
Query: 542 APTLYGVPRDKLLL---LGDGETSEQKNVFASFVTEMRRAEPPMDVPNLSQDNLAVRDAS 598
+PTL+GV KL+ L +S+Q S V ++ R + P ++ L S
Sbjct: 1074 SPTLHGV--SKLVAPSPLVQEISSDQHAKADSSVPQLSRPQEPSSSTPSVREVLLTTGPS 1131
Query: 599 KEVDCVAN 606
K+ D ++N
Sbjct: 1132 KDADNISN 1139
>gi|12325155|gb|AAG52531.1|AC016662_25 unknown protein; 58745-68005 [Arabidopsis thaliana]
Length = 1273
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 327/477 (68%), Positives = 382/477 (80%), Gaps = 32/477 (6%)
Query: 11 FRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRM 69
++ ILQ IISRA+ S +R+LSTKEFR FANK+GNLERPFLKEFF RWV + GCPVLR+
Sbjct: 398 WKSILQKIISRAKDPSNSIRSLSTKEFRQFANKIGNLERPFLKEFFQRWVASYGCPVLRI 457
Query: 70 GFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMY 129
G SYNKRKN VE+A LR+CT D+R V+ + +DSE+RD D GWPG+MSIRV+ELDGM
Sbjct: 458 GLSYNKRKNNVEMAALRECTAALDARLSVIGATSDSESRDVDAGWPGIMSIRVYELDGMS 517
Query: 130 DHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPL 189
DHP LPMAGD WQLLE+ CHSKLAA+R KPKKG KPDG +DN DA+A L+ ++S+ESPL
Sbjct: 518 DHPKLPMAGDRWQLLELPCHSKLAAKRYQKPKKGGKPDGAEDNVDAIAPLENKTSIESPL 577
Query: 190 SWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNN 249
+WI+ADPEMEY+AEIH +QP+QMW+NQLEKDGDVVAQAQAIA+LEAL SF++VN L N
Sbjct: 578 AWIKADPEMEYIAEIHLHQPLQMWVNQLEKDGDVVAQAQAIASLEALKQHSFSIVNALKN 637
Query: 250 FLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFS 309
L+DSK FWR+RI AA+ALA TASEE+DWAGL HL+KFYKSRRFD IGLP+PNDFRDF
Sbjct: 638 VLTDSKVFWRIRIAAAFALAKTASEESDWAGLQHLIKFYKSRRFDAEIGLPKPNDFRDFP 697
Query: 310 EYFVLE-------------------------------AIPHAVAMVRAADNKSPREAVEF 338
EYFVLE AIPHA+A+VR A+ KSPREAVEF
Sbjct: 698 EYFVLEVGNELDIVSRIDLRSCFPLGVSIVIPLPETMAIPHAIAIVRGAEGKSPREAVEF 757
Query: 339 VLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPS 398
+LQLLKYNDN+GN YSDVFWLA LVQSVG+LEF QQS+ FL+ LLKRIDRLLQFDRLMPS
Sbjct: 758 ILQLLKYNDNSGNSYSDVFWLAVLVQSVGDLEFCQQSLTFLAPLLKRIDRLLQFDRLMPS 817
Query: 399 YNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
YNGILTISCIRTL Q ALKLS IS D + KLI+PFR+ +TI Q+R+E SRALLD+E
Sbjct: 818 YNGILTISCIRTLAQTALKLSDSISFDHICKLIEPFRNSDTILQIRIEGSRALLDIE 874
>gi|168045101|ref|XP_001775017.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673604|gb|EDQ60124.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1068
Score = 621 bits (1601), Expect = e-175, Method: Compositional matrix adjust.
Identities = 319/548 (58%), Positives = 402/548 (73%), Gaps = 9/548 (1%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG FRK+LQ II RAQ + RTLST EFRHFANK+GNLERPFLK+FFPRWV
Sbjct: 437 MLEKQMGPEPFRKVLQRIIMRAQDPNRKNRTLSTTEFRHFANKLGNLERPFLKDFFPRWV 496
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT-DSENRDGDIGWPGMM 118
+ GCP LRMGF Y+KR+N+VELAV R+ T P + +S+ R D+GWPGMM
Sbjct: 497 ESSGCPRLRMGFVYSKRRNMVELAVHRESTAMPKHVGVIGKVEAPESKLRLSDMGWPGMM 556
Query: 119 SIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAG 178
SIR+HELDGMYDHP LPMAGD +QLLE+QCHSKLA RR +PKKG+K + +DN DA
Sbjct: 557 SIRLHELDGMYDHPSLPMAGDCYQLLELQCHSKLAGRRIQRPKKGAKGEAVEDNVDATPA 616
Query: 179 LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPH 238
S+ESPL WIR DPE+EYLAE++ QP MWINQLEKD DV+ Q QA+AAL A P
Sbjct: 617 ----QSLESPLLWIRGDPELEYLAELNLRQPEPMWINQLEKDRDVIGQIQAVAALRAYPR 672
Query: 239 LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG 298
SF VVN LNN L D K F RVRIEAA ALA+TASE T + GL +L+K+YKSRRFD +IG
Sbjct: 673 TSFAVVNALNNCLIDPKVFCRVRIEAAAALASTASEGTSYMGLQNLLKYYKSRRFDPDIG 732
Query: 299 LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW 358
LP+PNDF DFSEYFVLEAIP AVA VR +D KSP EA +F+L +LK+NDN+GN YSDVFW
Sbjct: 733 LPKPNDFHDFSEYFVLEAIPGAVAEVRGSDGKSPLEATDFILHILKHNDNSGNAYSDVFW 792
Query: 359 LAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
LA++++S+G LEFG+Q++ L+ +LK+IDR LQ+DRLMPSYNG++TISCI TL ++A++
Sbjct: 793 LASVIESIGSLEFGKQNLRTLARMLKQIDRFLQYDRLMPSYNGVVTISCIHTLAKLAIRF 852
Query: 419 SGFISLDQVVKLIKPFRDFNTI-WQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPS 477
S + ++V +L+ F+D + WQVRV A +ALLD+E G+++A++L ++ V + S
Sbjct: 853 SHALPTERVKQLLNQFKDRQKVQWQVRVAALKALLDIELQSKGLEAAVNLALQQVHGDLS 912
Query: 478 LRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQI 537
G+ K+ H +R+ I + VT+ LL LLES AF+NV LRH +F LQ+
Sbjct: 913 FGGESKIMKHVVRMASIT--EKGKCRIGGVTIARLLALLESSKAFHNVMLRHSIFSFLQV 970
Query: 538 LAGRAPTL 545
LAGR L
Sbjct: 971 LAGRYDML 978
>gi|302767542|ref|XP_002967191.1| hypothetical protein SELMODRAFT_408008 [Selaginella moellendorffii]
gi|300165182|gb|EFJ31790.1| hypothetical protein SELMODRAFT_408008 [Selaginella moellendorffii]
Length = 1670
Score = 570 bits (1468), Expect = e-159, Method: Compositional matrix adjust.
Identities = 305/551 (55%), Positives = 380/551 (68%), Gaps = 30/551 (5%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT-LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG RK+LQ I+SRA + + +FRHFANK+GNLERPFLKEFFPRWV
Sbjct: 409 MLEKQMGPESLRKVLQRIVSRAGEPCFLSSYFQGLQFRHFANKIGNLERPFLKEFFPRWV 468
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
+ GCP+LRMGF YNKR+NIVE AV RDCT P+ V+ S D+GWPGMM+
Sbjct: 469 ESQGCPLLRMGFEYNKRRNIVEFAVHRDCTA-PE----VMMLGNKS---IADVGWPGMMN 520
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
I++HELDGMYDHP L ++GD QLLEI CHSK+A RR +PKK K + DD D
Sbjct: 521 IQIHELDGMYDHPNLSLSGDVHQLLEIPCHSKMANRRPQRPKKPGKVEIIDDLADD---- 576
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
+ S+ESPL W+RADP M+YL +I QP QMWINQLE+D D VAQ QAIAAL+ LP +
Sbjct: 577 --KPSVESPLLWLRADPGMDYLTKIQLWQPEQMWINQLERDKDAVAQLQAIAALQKLP-V 633
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++V F RVR+EAA+ALANTA E T W GL HL+K+YKSR FD IGL
Sbjct: 634 SFSIV------------FCRVRVEAAFALANTAGEGTSWLGLGHLMKYYKSRCFDMEIGL 681
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
PRPN F DFSEYFVL+AIP AV VR KSP EAVEF+L +LK+NDN+GN YSD FWL
Sbjct: 682 PRPNSFHDFSEYFVLQAIPPAVGTVRGNQGKSPPEAVEFILNILKHNDNSGNAYSDSFWL 741
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
+++++SVGELE QQSI + +LK+IDR Q+DRLMPSYN ILT SCI+TLT++ALK +
Sbjct: 742 SSVIESVGELEISQQSIPKFARILKQIDRYFQYDRLMPSYNCILTRSCIQTLTRLALKFN 801
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
ISL + L+ PF+ T WQVRV A +A LD++F ++S+L + + V E+P R
Sbjct: 802 TVISLAPIETLLSPFKVLTTPWQVRVAALQAQLDMKFQSESLNSSLRVALDFVREDPCAR 861
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
Q K+ +A+RIC K S S+ VD+ T+ +LL ES N LRH++F ILQ+LA
Sbjct: 862 VQCKVMSYALRICLRKHASTSS--VDSSTVSSLLEFFESEQCAVNHKLRHYIFSILQVLA 919
Query: 540 GRAPTLYGVPR 550
G+ TL V R
Sbjct: 920 GKPATLLRVSR 930
>gi|302754206|ref|XP_002960527.1| hypothetical protein SELMODRAFT_402861 [Selaginella moellendorffii]
gi|300171466|gb|EFJ38066.1| hypothetical protein SELMODRAFT_402861 [Selaginella moellendorffii]
Length = 1689
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 302/546 (55%), Positives = 377/546 (69%), Gaps = 30/546 (5%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT-LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLEKQMG RK+LQ I+SRA + + +FRHFANK+GNLERPFLKEFFPRWV
Sbjct: 446 MLEKQMGPESLRKVLQRIVSRAGEPCFLSSYFQGLQFRHFANKIGNLERPFLKEFFPRWV 505
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
+ GCP+LRMGF YNKR+NIVE AV RDCT P+ V+ S D+GWPGMM+
Sbjct: 506 ESQGCPLLRMGFEYNKRRNIVEFAVHRDCTA-PE----VMMLGNKS---IADVGWPGMMN 557
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
I++HELDGMYDHP L ++GD QLLEI CHSK+A RR +PKK K + DD D
Sbjct: 558 IQIHELDGMYDHPNLSLSGDVHQLLEIPCHSKMANRRPQRPKKPGKVEIIDDLADD---- 613
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
+ S+ESPL W+RADP M+YL +I QP QMWINQLE+D D VAQ QAIAAL+ LP +
Sbjct: 614 --KPSVESPLLWLRADPGMDYLTKIQLWQPEQMWINQLERDKDAVAQLQAIAALQKLP-V 670
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
SF++V F RVR+EAA+ALANTA E T W GL HL+K+YKSR FD IGL
Sbjct: 671 SFSIV------------FCRVRVEAAFALANTAGEGTSWLGLGHLMKYYKSRCFDMEIGL 718
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
PRPN F DFSEYFVL+AIP AV VR KSP EAVEF+L +LK+NDN+GN Y D FWL
Sbjct: 719 PRPNSFHDFSEYFVLQAIPPAVGTVRGNQGKSPPEAVEFILNILKHNDNSGNAYLDSFWL 778
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
+++++SVGELE QQSI + +LK+IDR Q+DRLMPSYN ILT SCI+TLT++ALK +
Sbjct: 779 SSVIESVGELEISQQSIPKFARILKQIDRYFQYDRLMPSYNCILTRSCIQTLTRLALKFN 838
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
ISL + L+ PF+ T WQVRV A +A LD++F ++S+L + + V E+P R
Sbjct: 839 TVISLAPIETLLSPFKVLTTPWQVRVAALQAQLDMKFQSESLNSSLRVALDFVREDPCAR 898
Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
Q K+ +A+RIC K S S+ VD+ T+ +LL ES N LRH++F ILQ+LA
Sbjct: 899 VQCKVMSYALRICLRKHASTSS--VDSSTVSSLLEFFESEQCAVNHKLRHYIFSILQVLA 956
Query: 540 GRAPTL 545
G+ TL
Sbjct: 957 GKPATL 962
>gi|168042563|ref|XP_001773757.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674872|gb|EDQ61374.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1412
Score = 500 bits (1288), Expect = e-139, Method: Compositional matrix adjust.
Identities = 286/573 (49%), Positives = 369/573 (64%), Gaps = 81/573 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGA-SPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
MLE+QMG FRK+LQ II RAQ S +TLST EFRHFANK+GNLERPFLK+FFPRWV
Sbjct: 482 MLERQMGPEPFRKVLQRIIMRAQDPDSKNQTLSTTEFRHFANKLGNLERPFLKDFFPRWV 541
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT-DSENRDGDIGWPGMM 118
+ GCP L +KR+N+VELAV R+ T P V D + R+ D+GWPGMM
Sbjct: 542 ESSGCPRL------SKRRNMVELAVRRESTSIPKRVGAVGKLEIPDLKLRNSDMGWPGMM 595
Query: 119 SIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAG 178
SIR+HELDGMYDHP LPMAGD +QLLE+QCHSKLA RR +PKKG+K + +D DA
Sbjct: 596 SIRLHELDGMYDHPSLPMAGDCYQLLELQCHSKLAGRRMQRPKKGTKGEAVEDRVDATPA 655
Query: 179 LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPH 238
S+ESPL W+R DPE+EYLAE++ QP QMWINQLEKD DV+ Q QA+AAL A P
Sbjct: 656 ----QSLESPLLWVRGDPELEYLAELNLIQPEQMWINQLEKDRDVIGQVQAVAALRAHPR 711
Query: 239 LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG 298
SF VVN LNN L D K F RVRIEAA ALA+TAS
Sbjct: 712 TSFAVVNALNNCLIDPKVFCRVRIEAAAALASTAS------------------------- 746
Query: 299 LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW 358
EAIP A+A VR D KSP EA EF+L +LK+NDNNGN +SDV+W
Sbjct: 747 ----------------EAIPGAIAEVRGNDGKSPLEASEFILHILKHNDNNGNVHSDVYW 790
Query: 359 LAALVQSVGELEFGQQSIL-----------------------FLSSLLKRIDRLLQFDRL 395
LA+++++VG LEFG+Q +L L+ +LK+IDR LQ+DRL
Sbjct: 791 LASVIEAVGSLEFGKQFVLSRSNSGSIVILLGNWGLPIQDLRTLARILKQIDRFLQYDRL 850
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTI-WQVRVEASRALLDL 454
MPSYNG++ SCIRTL IA++ S +S+++V L+K F+D + I W+VRV A +ALLD+
Sbjct: 851 MPSYNGVVATSCIRTLANIAIRFSYALSIERVKLLLKQFKDRDKINWRVRVAALKALLDI 910
Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQ-IKGGSDSNHEVDTVTLVALL 513
E++ G+++A+ L ++ + E S + K+ H RI I+ GS ++ +V++ L+
Sbjct: 911 EYYSKGLEAAVILALQLIHSEFSFPVKSKIMKHVGRIASLIEKGS---CKIGSVSVARLM 967
Query: 514 NLLESRIAFNNVFLRHHLFGILQILAGRAPTLY 546
LL+S AF+NV LRH +F LQ+LAGR+ +LY
Sbjct: 968 TLLKSSTAFHNVMLRHSVFSFLQVLAGRSASLY 1000
>gi|242044652|ref|XP_002460197.1| hypothetical protein SORBIDRAFT_02g024430 [Sorghum bicolor]
gi|241923574|gb|EER96718.1| hypothetical protein SORBIDRAFT_02g024430 [Sorghum bicolor]
Length = 310
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 218/317 (68%), Positives = 261/317 (82%), Gaps = 8/317 (2%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
MLEKQMG + FRKILQ I++ + + RTLSTKEFRH ANKVGNLERPFLKEFFPRWV
Sbjct: 1 MLEKQMGPDSFRKILQVIVAPNRAS---RTLSTKEFRHLANKVGNLERPFLKEFFPRWVE 57
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
+CGCP++R+G SY+K++N++ELAV R CT K TP +T+ + R+GD GWPGMMS+
Sbjct: 58 SCGCPIMRLGISYSKKRNMIELAVSRGCTAKA---TPDPDIHTNGDTREGDAGWPGMMSV 114
Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
RVHE DG YDHP+LPMAG+A Q++EIQCHS+LAA+R K KK +K DG DDN DA + +
Sbjct: 115 RVHETDGAYDHPVLPMAGEALQVVEIQCHSRLAAKRVWK-KKNTKLDGSDDNIDA-STQE 172
Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
R+S++SPL WIR DPEMEYLAEIHF+QPVQMWINQLEKD DV++Q+QAI+ LE LP LS
Sbjct: 173 NRTSVDSPLLWIRVDPEMEYLAEIHFHQPVQMWINQLEKDKDVISQSQAISVLEKLPQLS 232
Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
F V+N LNNFL+D+KAFWRVRIEAAYALA T+SE TD AGLLHLVKFYKSRRFD +IG+P
Sbjct: 233 FAVINALNNFLNDTKAFWRVRIEAAYALAVTSSEATDLAGLLHLVKFYKSRRFDADIGMP 292
Query: 301 RPNDFRDFSEYFVLEAI 317
RPNDF D EYFVLE +
Sbjct: 293 RPNDFHDVPEYFVLEVM 309
>gi|147776761|emb|CAN61428.1| hypothetical protein VITISV_006736 [Vitis vinifera]
Length = 299
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 211/259 (81%), Positives = 226/259 (87%), Gaps = 2/259 (0%)
Query: 63 GCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRV 122
G LR G SYNKRKN+VELAVLR CT PD+ T VL+ N DSENR+ DIGWPGMMSIRV
Sbjct: 6 GLKFLRAGLSYNKRKNLVELAVLRGCTAAPDTNTXVLNGNIDSENREVDIGWPGMMSIRV 65
Query: 123 HELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMR 182
HELDGMYDHPILPMAG+ WQLLEIQCHSKLAARR KPKKGSKPDG DDNGD V +DMR
Sbjct: 66 HELDGMYDHPILPMAGETWQLLEIQCHSKLAARRFQKPKKGSKPDGSDDNGD-VPAVDMR 124
Query: 183 SSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFN 242
S+ ESPL W+R DPE+EYLAEIHFNQP QMWINQLE+D DVVAQAQAIA LEALP LSF+
Sbjct: 125 SNTESPLLWLRVDPELEYLAEIHFNQPAQMWINQLERDKDVVAQAQAIATLEALPQLSFS 184
Query: 243 VVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRP 302
VVN LNNFLSDSKAFWRVRIEAA+ALANTASEETDWAGLLHLVKFYKSRRFD NIGLP+P
Sbjct: 185 VVNALNNFLSDSKAFWRVRIEAAFALANTASEETDWAGLLHLVKFYKSRRFDANIGLPKP 244
Query: 303 NDFRDFSEYFVLEAIPHAV 321
NDF DF EYFVLE + H+V
Sbjct: 245 NDFHDFPEYFVLE-VAHSV 262
>gi|326491201|dbj|BAK05700.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 391
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 190/310 (61%), Positives = 230/310 (74%), Gaps = 26/310 (8%)
Query: 6 MGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCP 65
M + FRKILQ I++ + + RTLSTKEFRH NKVGNLERPFLKEFFPRW+ +CGCP
Sbjct: 1 MWPDSFRKILQMIVASTRAS---RTLSTKEFRHLVNKVGNLERPFLKEFFPRWIESCGCP 57
Query: 66 VLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHEL 125
++R+G SY+KR+N+VELAV R CT K D P ++ + + ++G GWPGMMS+RVHE
Sbjct: 58 IMRLGISYSKRRNLVELAVSRGCTAKAD---PGSDNHVNGDIQEGATGWPGMMSVRVHET 114
Query: 126 DGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSM 185
G YDH ILPMAG+A ++E+QCHSK+AA+R K KKGSKPDG DDN DA + + R+SM
Sbjct: 115 GGAYDHLILPMAGEALHVVELQCHSKVAAKRFQKTKKGSKPDGSDDNVDA-STQENRTSM 173
Query: 186 ESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVN 245
+SPL WIR D EMEYLAEIHF+QP+QMWINQLEKD DV++Q+Q I LE LP LSF V+N
Sbjct: 174 DSPLLWIRVDQEMEYLAEIHFHQPIQMWINQLEKDKDVISQSQEITVLEKLPQLSFAVIN 233
Query: 246 TLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDF 305
LNNFL+D+K +TD AGLLHLVKFYKS RFD +IGLPRPNDF
Sbjct: 234 ALNNFLNDTK-------------------DTDLAGLLHLVKFYKSHRFDTDIGLPRPNDF 274
Query: 306 RDFSEYFVLE 315
D EYFVLE
Sbjct: 275 HDIPEYFVLE 284
>gi|384253674|gb|EIE27148.1| hypothetical protein COCSUDRAFT_55172 [Coccomyxa subellipsoidea
C-169]
Length = 1510
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 186/577 (32%), Positives = 301/577 (52%), Gaps = 49/577 (8%)
Query: 1 MLEKQMGSNFFRKILQNII------SRAQGASPVRTLSTKEFRHFANKVGNLERPF-LKE 53
MLE++ G + F+++L I+ + AQGA+P R L T EF + K G E +
Sbjct: 410 MLEQRAGEDNFKRLLSRIVRASCQPATAQGANP-RLLGTHEFLNQIVKAGGKESKHEISA 468
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F RW+ GCP+L ++++++ +ELA+ + + +++ S
Sbjct: 469 FHQRWIIGRGCPLLTAAVVFHRKRSTLELAIRQTGPQAARAAAEKVAAKEGSSA------ 522
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK----PDGC 169
G + + V E + +HP+ + A +L+E++ SK A + KP + K +
Sbjct: 523 --GFIKVLVRETEAPVEHPVN-LGTAALRLVELKVTSKPAKPKPGKPGRKKKLQVVHEDL 579
Query: 170 DDNGDA-------VAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGD 222
DD+GDA VAG R P+ W+R DP E LA + QP M QLE+ D
Sbjct: 580 DDDGDATMADAPDVAGEASRL----PVEWVRLDPHGESLAAVRLAQPEVMASAQLERSKD 635
Query: 223 VVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLL 282
V AQ+ AIA L AL S+ VVN L + + + R+R+EAA AL TA T W+GL
Sbjct: 636 VAAQSAAIAQLAALRPTSYGVVNALRACMQHAATYCRIRVEAALALGATAGPSTAWSGLE 695
Query: 283 HLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQL 342
HL+K+Y++R FD +IGLPRP F D S+++V +A+PHA+A VR A S EA E +L+
Sbjct: 696 HLLKYYRTRIFDSDIGLPRPWQFTDLSQHYVDQALPHAIAAVRDAAGHSFPEAAEALLEH 755
Query: 343 LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
L + DN+GN YSD LAAL+ ++G L QS ++ +L R++R LQ + +MPS++ I
Sbjct: 756 LDHADNSGNAYSDDNALAALLDALGSLHL--QSPDAMAGILDRLERYLQREAVMPSFHSI 813
Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKL-IKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
+ + + +LT +A+ + ++L ++ + + +R+ A++ + + +G
Sbjct: 814 VGCTALTSLTSLAVSMRSPPKQKARLRLFLQRYSRPDAPHHLRLTAAQCSVRIAACTDGA 873
Query: 462 DSALS--LFIKSVEEEPSLRGQV----------KLGIHAMRICQIKGGSDSNHEVDTVTL 509
D A+ L + E+ P+LRG + A K G+ + ++ ++ +
Sbjct: 874 DGAVQAALAALAAEKSPALRGAILESLLGLLHRPAAAAARSGGPAKSGTAAKTKIVSMAV 933
Query: 510 VALLNLLESRIAFNNVFLRHHLFGILQILAGRAPTLY 546
+A L+ L S A +V LRH F L L G APTL+
Sbjct: 934 LAQLHALLS--AHGDVGLRHLAFMALMRLGGLAPTLF 968
>gi|328874793|gb|EGG23158.1| hypothetical protein DFA_05290 [Dictyostelium fasciculatum]
Length = 1643
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 146/457 (31%), Positives = 249/457 (54%), Gaps = 43/457 (9%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT-LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
M+EK++ + RK+L +++ + PV T + TK+F + L LK+F +W+
Sbjct: 532 MIEKRITEDGLRKVLNSLLV-SPTNEPVETNVPTKKFIKL---IKTLTGNDLKQFSEKWI 587
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
G P L GF +NK+K+ +E A+ + T D + SS ++
Sbjct: 588 YGKGSPNLVCGFYFNKKKHQIEFAIKQVFTRSEDK---IASS----------------LT 628
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
IR++EL+G Y+H I D Q ++ C+S++ R + + D+G+ + +
Sbjct: 629 IRIYELEGTYEHII--NFDDDMQEFDLTCYSRIRKSRKKRIRA--------DDGEEME-I 677
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
D+ ESPL IR DPEME++ +I F QP MWI QLE D DVVAQ++A+ L P
Sbjct: 678 DLSQKRESPLLHIRIDPEMEWIHKITFKQPEYMWIYQLELDKDVVAQSEALRGLCTFP-- 735
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
+ N + + F ++ K ++RVR+EAA LA +ET++ GL L+ ++K +D + L
Sbjct: 736 TTNSLRAVFKFFANPKHYYRVRMEAAIVLAKLTVKETNYEGLDLLINYFKQNFYDPHSRL 795
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
N+F DFS YFV + IP AV+M++ + ++P E ++F+L LLK NDN+ N YSD +++
Sbjct: 796 LAENNFSDFSAYFVKKIIPQAVSMIKDSQGRTPIEGIDFLLDLLKDNDNSLNAYSDNYYV 855
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
A+++ S+ + + +S + ++ R L+FD+L P Y+ +T +C+ ++ Q L+ S
Sbjct: 856 ASVLSSLANVHTTNST--HISEIHGQLTRYLRFDQLYPCYHNTITSACLTSICQ--LQAS 911
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
I LD +K K ++ ++R A + L +
Sbjct: 912 KQIGLD--LKFFKDHFNYTYYEEIRKTAFQCYYRLAY 946
>gi|268637510|ref|XP_002649090.1| transcription initiation factor TFIID subunit [Dictyostelium
discoideum AX4]
gi|256012812|gb|EEU04038.1| transcription initiation factor TFIID subunit [Dictyostelium
discoideum AX4]
Length = 2044
Score = 225 bits (574), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 137/431 (31%), Positives = 235/431 (54%), Gaps = 39/431 (9%)
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
LK F +W+ G P L GF++N++K + E A L+ +P+ +
Sbjct: 602 LKHFSEKWIYGRGSPNLICGFNFNRKKFMTEFA-LKQVLNRPEDKIS------------- 647
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
G + IR+ ELDG ++ ++ D ++ + CHS+ R K KK DG +
Sbjct: 648 -----GTLMIRIVELDGSFEQ-VINFEDDMFEY-DFPCHSRC---RKTKKKKLLSEDGEE 697
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
+D+ E+PL W R DPE+E++ I F QP MWI+QLE D DV+AQ +AI
Sbjct: 698 ------MEIDLSQKRETPLLWFRIDPELEWIHRITFRQPEYMWIHQLELDRDVIAQMEAI 751
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS 290
L+ ++SFN V + F+ +S+ F+RVR +AA L+ +++ET++ GL LV ++K
Sbjct: 752 KGLKD--YISFNSVRAVFKFIQNSRYFYRVRCDAAMLLSKLSTKETNFEGLNVLVSYFKG 809
Query: 291 RRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNG 350
FD + +PN F+DF+ YF+ ++IP ++++++ + ++P E +EF+++LLK NDN+
Sbjct: 810 LFFDRDSNQLKPNSFKDFNSYFMQKSIPLSISLIKDKEGRTPIECIEFLIELLKDNDNSL 869
Query: 351 NPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
N YSD ++L AL+ + + + L L + K++ R L++D+L PSY +TI C++
Sbjct: 870 NTYSDCYYLEALLSAFSNIHTTNE--LLLKKIKKQVARYLKYDQLYPSYRNCVTIGCLKA 927
Query: 411 LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL--EFHCN-GIDSALSL 467
Q+ + LD L K + ++ +VR A +++ L +F C+ G S
Sbjct: 928 YCQLYSIKNSKDDLD--FTLFKEYSTYSHFEEVRKVALQSIYKLAYQFQCSPGTGSINDY 985
Query: 468 FIKSVEEEPSL 478
F+ + E +L
Sbjct: 986 FLDLFDNEQTL 996
>gi|281210428|gb|EFA84594.1| hypothetical protein PPL_01584 [Polysphondylium pallidum PN500]
Length = 1764
Score = 219 bits (558), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 126/412 (30%), Positives = 230/412 (55%), Gaps = 38/412 (9%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT-LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
M+E+ + + RK+L +++ G P+ T + TK+F + L LK+F RW+
Sbjct: 401 MIERLISEDGLRKVLNSMLVSQTGNEPIETIMPTKKF---LKMIKTLTGQDLKQFSERWI 457
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
G P L GF +N++++ E A ++ +P+ + I P ++
Sbjct: 458 FGKGTPHLLCGFYFNRKRHQTEFA-MKQLAARPEDK----------------ITSP--IT 498
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
+R+ ELD Y+H I D W + CHS+ + K + D+G+ + +
Sbjct: 499 LRITELDRTYEHII--NFDDDWNQYDFPCHSRNRKSKKKKIRH--------DDGEEME-I 547
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
D+ E+PL W R DPE+E++ +I F QP MW++QLE D DV+AQ++A+ L + P L
Sbjct: 548 DLSQKRETPLLWFRIDPEIEWIHKITFKQPEYMWVHQLELDRDVIAQSEALCGLASFPTL 607
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
+ ++ F + K +++VR+EAA +A + +ET++ GL ++ ++KS+ +D+ GL
Sbjct: 608 Y--SLRSVFKFFCNQKQYYKVRMEAALVIAKISCKETNYEGLELMISYFKSQFYDQESGL 665
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
N+F DF Y++ + IP +V++++ D ++P EA+EF++ LLK N+N+ NP+SD ++L
Sbjct: 666 LDENNFTDFINYYIRKIIPVSVSLMKDKDGRTPAEAIEFLVDLLKNNENSSNPFSDNYYL 725
Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTL 411
A+++ S+ + + + S+ +I R L+FD+L P Y +TI+ ++ L
Sbjct: 726 ASIMSSLANVNTNNDQL--IKSMHNQIVRYLKFDQLYPCYRNTITIAALQAL 775
>gi|440798321|gb|ELR19389.1| PBS lyase HEATlike repeat domain containing protein [Acanthamoeba
castellanii str. Neff]
Length = 981
Score = 214 bits (544), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 150/461 (32%), Positives = 249/461 (54%), Gaps = 49/461 (10%)
Query: 1 MLEKQMGSNFFRKILQNIIS--RAQGASPVRTLSTKEFRHFANKVGNLERPFLKE--FFP 56
+LEK +G + RK++Q +I+ AQ + TLS F K+ + LK+ F
Sbjct: 351 LLEKLLGLDAMRKVVQQLIAPPEAQAGTLPPTLSLHRFLKLIAKLTGHD---LKKDGFDK 407
Query: 57 RWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPG 116
RW+ GC + GF +N++++++E AV R G
Sbjct: 408 RWIHGSGCVEVECGFWFNRKRHLIEFAV-----------------------RQGGKAESN 444
Query: 117 MMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAV 176
+++RVHE+DG +DH +L + D + ++ C+S+L R K + G P+ +D V
Sbjct: 445 KLAVRVHEVDGAHDH-LLDLEEDLYGT-DLPCYSRLRKNRRKKGQPGEGPEEKEDEKPDV 502
Query: 177 AGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
+ ++P+ WIR DPE E L ++ F QP +MWINQ+E D DVVAQ + + +
Sbjct: 503 P----QPRSDTPILWIRIDPEFEMLRKVQFRQPEKMWINQIELDRDVVAQYEGVQGVGQC 558
Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF--- 293
S V L + + + + F+R+R AA AL + E + A L L+++++S F
Sbjct: 559 N--SAAAVAALASLMENPRMFYRIRALAAEALGSIRGEASA-AALEALLRYFRSAYFYDG 615
Query: 294 DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPY 353
DEN + RPNDF DF+ Y V +AIP A++ VR + +P E +EF+L LLK NDN GNPY
Sbjct: 616 DENSQM-RPNDFTDFAAYHVQKAIPVALSQVRDTEGFTPDEVLEFLLHLLKQNDNTGNPY 674
Query: 354 SDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQ 413
SD +LA +++++G + +S S+LK++ R + +RL+PSY+ +T++C++ + +
Sbjct: 675 SDYSYLATVIEALGNTQ--PKSNKGWDSILKQVQRFMHLERLLPSYHNRVTVACLKVMKK 732
Query: 414 IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
++ SG LD + L + + +VR+ A +AL +L
Sbjct: 733 --MQASGRARLD--LDLFTSYTSYGYYRKVRIVALKALGEL 769
>gi|330801236|ref|XP_003288635.1| hypothetical protein DICPUDRAFT_152902 [Dictyostelium purpureum]
gi|325081308|gb|EGC34828.1| hypothetical protein DICPUDRAFT_152902 [Dictyostelium purpureum]
Length = 1853
Score = 207 bits (528), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 121/385 (31%), Positives = 216/385 (56%), Gaps = 37/385 (9%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
T+ TK+F + + LK+F +W+ G P L GF++N++K E A L+
Sbjct: 544 TIQTKKFLKLIKSMTGQD---LKQFAEKWIYGRGSPNLICGFNFNRKKFQTEFA-LKQVL 599
Query: 90 VKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCH 149
+P+ + +SS+ + IR+ EL+G ++ ++ D ++ + CH
Sbjct: 600 NRPEDK---ISSS---------------LMIRITELEGSFEQ-VINFEDDMFE-YDFPCH 639
Query: 150 SKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQP 209
S+ R K KK DG + +D+ E+PL W R DP++E++ +I F QP
Sbjct: 640 SRC---RKTKKKKLLSEDGEE------MEIDLSQKRETPLLWFRIDPDLEWIHKITFRQP 690
Query: 210 VQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA 269
MWI+QLE D DV+AQ +AI L+ +S N V + F+ +++ F+RVR +A L+
Sbjct: 691 EYMWIHQLELDKDVIAQIEAIKGLKDF--VSHNSVRAVFKFIQNTRFFYRVRCDATLLLS 748
Query: 270 NTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADN 329
+++ET++ GL LV ++K+ +D + R N+F+DF+ YF+ +AIP ++++++ +
Sbjct: 749 RLSTKETNFEGLNVLVSYFKNLFYDPDSNQLRSNNFKDFNSYFIQKAIPLSISLIKDKEG 808
Query: 330 KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRL 389
++P E +EF+L+LLK NDN+ N YSD ++L L+ S+ + + + + + K++ R
Sbjct: 809 RTPLECIEFLLELLKDNDNSLNTYSDCYYLETLLISLSNIHTTNEQL--IKKIRKQVSRY 866
Query: 390 LQFDRLMPSYNGILTISCIRTLTQI 414
L++D+L PSY +T C+R Q+
Sbjct: 867 LKYDQLYPSYRNCVTAGCLRAFCQL 891
>gi|406604797|emb|CCH43782.1| Transcription initiation factor TFIID subunit 2 [Wickerhamomyces
ciferrii]
Length = 1347
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 148/518 (28%), Positives = 252/518 (48%), Gaps = 65/518 (12%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST F+H KV N R L+ FF +WV G P+ R+ +NK++ +E+ + +
Sbjct: 561 LSTYHFQHVCEKV-NHNR--LESFFRQWVYGSGVPIFRVTQRFNKKRMFIEMGIRQVQHQ 617
Query: 91 KPDSRTP-----VLSSNTDSENRDGDIG---WPGMMSIRVHELDGM-YDHPILPMAGDAW 141
+ +++ P + + E D + + G M+IR+HE DG Y+H + G +
Sbjct: 618 ETENQHPSPDTFISEARRQQETTDAQVPATLFTGPMTIRIHEADGTPYEHIVDLKEG--F 675
Query: 142 QLLEIQCHSKLAA-RRALKPKKGSKPDGCDDNGDAVA----GLDMRSSME---------- 186
L+IQ ++K +R+ K +K K D + V G ++S E
Sbjct: 676 TKLDIQYNTKYKRLKRSQKQQKSDKDKPEPDEDETVLLHCLGDILQSDQEIKEWDLTEWT 735
Query: 187 ---------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE-AL 236
WIR D + E++ +IH NQP M+ +QL++D DV AQ ++I A
Sbjct: 736 KEEEEKMNNEAFEWIRVDADFEWICKIHINQPDYMYASQLQQDRDVEAQLESINYFSIAN 795
Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
P ++ + L L DS+ ++ VR+EAA +A A E+++W GL HL+ +K EN
Sbjct: 796 PSTLYSSI--LLRTLLDSRYYYGVRLEAAKGMAKYAKEKSNWIGLKHLLTAFKRLYCFEN 853
Query: 297 IGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDV 356
+P PN+F DF YF+ +AIP A++ ++ +N P E F+L +LK+NDN NPYSD
Sbjct: 854 SSIPFPNEFSDFPSYFLKKAIPEALSTIQDKNNNCPLEVRRFLLDILKFNDNLNNPYSDA 913
Query: 357 FWLAALVQSV-----------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTI 405
F+++ L+ + + FL++ + + R + D +PSYN ++ +
Sbjct: 914 FYVSHLITCLVRSLTSNVLIDNNSKLNSTDREFLNNAVAELSRHQKMDEWIPSYNSLIGV 973
Query: 406 SCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSAL 465
S ++A K G + LD L + ++ N VR+ A + L F G+ ++
Sbjct: 974 SAFNEKLKLARK--GLLELDPDY-LFEHTKE-NYFPDVRITAFQGL----FQSGGLKNSR 1025
Query: 466 SL----FIKSVEEEPSLRGQ-VKLGIHAMRICQIKGGS 498
L + S E+ P +R + ++ + + I+GGS
Sbjct: 1026 ILHYFFLLTSFEKSPYMRHKLIEAFAQSFGVVVIEGGS 1063
>gi|405952443|gb|EKC20254.1| Transcription initiation factor TFIID subunit 2 [Crassostrea gigas]
Length = 1252
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 134/482 (27%), Positives = 229/482 (47%), Gaps = 68/482 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKV-GNLERPF 50
MLE+++G +++ +++ A + + LST F + V G PF
Sbjct: 439 MLEQRIGEQLLIQVVNKLVALATTGASQKVGANTWGNLLLSTSSFLKIISTVTGKDISPF 498
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
L+ WV GC F +N+++N+VEL + +D K G
Sbjct: 499 LE----LWVSQSGCARFFGNFVFNRKRNVVELELKQDLGAK------------------G 536
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ + G +++ + ELDG ++H + +I CHSK R K KK G +
Sbjct: 537 ALKYVGPLTVTIQELDGSFNHNF--KIEENKTKFDITCHSKC---RRNKKKKIPLMTGEE 591
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
+ D ++ +D +SP+ W+R DP+M L+ + + QP MW QL + D+VAQ +AI
Sbjct: 592 VDMD-LSAMDA----DSPVLWLRVDPDMNLLSHVTWEQPDYMWQYQLRYERDIVAQREAI 646
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYK 289
ALE P + L + L D F+R+R+EAA+ LA A+ WAG ++ ++
Sbjct: 647 RALENYP--TAGTRRALTDILEDDHCFYRIRMEAAHCLAKVANSMVGVWAGPPAMMTIFR 704
Query: 290 SRRFDENI-GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDN 348
+ + R N+F +F YF+L+ IP+A+A++R P E +F+++L KYNDN
Sbjct: 705 KMFGSHSCPAIIRQNNFSNFQNYFLLKTIPNAMALLRNIHTICPPEVFKFIMELFKYNDN 764
Query: 349 NGNPYSDVFWLAALVQSVGELEFGQQSILFLSS----------------LLKRIDRLLQF 392
+ N +SD ++ AALV++V + +I +S+ +L+ I R L
Sbjct: 765 SRNKFSDNYYRAALVEAVSKT--VTPAITLMSTPGQGPTADNLSAETKQILEEITRYLNL 822
Query: 393 DRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALL 452
++L+P Y +T+SC+ + L F L V+ K + +R+ A A++
Sbjct: 823 EKLLPCYRHTVTVSCLSAIR----TLQKFGHLPSEVEFFKSYAQHGVFRDIRLAAIDAIV 878
Query: 453 DL 454
D
Sbjct: 879 DF 880
>gi|330924409|ref|XP_003300628.1| hypothetical protein PTT_11925 [Pyrenophora teres f. teres 0-1]
gi|311325144|gb|EFQ91276.1| hypothetical protein PTT_11925 [Pyrenophora teres f. teres 0-1]
Length = 1526
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 166/608 (27%), Positives = 269/608 (44%), Gaps = 97/608 (15%)
Query: 2 LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L KQ G N + L ++ A QG P ++T +F KVG+ + L+ FF +W+
Sbjct: 506 LVKQSGKNGVDRCLYRMLFNARQGKYPNGAITTDDFLDICEKVGHQK---LESFFQQWIV 562
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSEN---------RD-- 109
GCP ++NK+K +V+L + + ++ DS L + +++ RD
Sbjct: 563 GSGCPTFHCLPTFNKKKQVVQLTIRQ---LQADSSITTLENGLSADDFMREAKEKSRDLR 619
Query: 110 GDIGWP---GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
G+P G M+IR+HE DG Y+H + + + + K R L ++ +
Sbjct: 620 SATGYPTFVGPMTIRIHEADGTPYEHIVDINSNNVKVEIPYNTKYKRLKRNRLNKERMAA 679
Query: 166 PDGCDDNGDAVAGL--------------------------DMRSSMESPLSWIRADPEME 199
G D +G+ + D WIR D + E
Sbjct: 680 AAGLDASGETQEDVTFYMLGDLFTTPQEHNEWRLDDWPAEDEAKQEGEAYEWIRIDADFE 739
Query: 200 YLAEIHF-NQPVQMWINQLEKDGDVVAQAQAIAAL---EALPHLSFNVVNTLNNFLSDSK 255
++ + + P M+++QL++D DVVAQA++I L E +S +V T L D++
Sbjct: 740 WICRTNIEDMPSYMYVSQLQQDKDVVAQAESIQFLANKEGHKLISTFLVKT----LMDNR 795
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
F +R AA LA +A + TDW GL HL K ++ N G+ R NDF D S Y +
Sbjct: 796 YFHGIRTMAASVLAQSALKRTDWVGLFHLKKAFQELFCLPNSGMTRSNDFSDRSMYLIQC 855
Query: 316 AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL------ 369
AIP A+A ++ D +SP EA F+L L++YNDN GN YSD ++A L++ + E
Sbjct: 856 AIPQAIARIKGEDGRSPLEAKRFLLDLVRYNDNRGNDYSDDHYIATLMRCIAETITKSNT 915
Query: 370 -------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
E+ Q+ F S + + R + D +P+Y + T + + ALKL
Sbjct: 916 TSELSHEEWAQEQD-FTSKAIGELTRHQRLDEWIPTYQNVYTTTAL----DCALKL---- 966
Query: 423 SLDQVVKLIKPFR--DFNTIWQ------VRVEASRALLDLEFHCNGIDSALSLFIKSVEE 474
V+ + PF+ +F Q VR++A L+ L D+ L + +
Sbjct: 967 ----VMSRVTPFKPSEFLQYAQLGNADNVRLKAWECLIRLGMFRK--DTVLKFLVHEIRS 1020
Query: 475 EPS--LRGQVKLGIHAMRICQIKGGS--DSNHEVDTVTLVALLNLLESRIAFNNVFLRHH 530
+PS R Q+ L + + I Q+ G VDT ++ S + R +
Sbjct: 1021 DPSPYFRKQL-LRLFGIAIGQVATGDVWAPEKAVDTTADTLVVENEASNAERASALARQN 1079
Query: 531 LFGILQIL 538
L G L+ L
Sbjct: 1080 LNGALRGL 1087
>gi|156359773|ref|XP_001624939.1| predicted protein [Nematostella vectensis]
gi|156211747|gb|EDO32839.1| predicted protein [Nematostella vectensis]
Length = 997
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 140/499 (28%), Positives = 235/499 (47%), Gaps = 62/499 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
M+E ++G ++ ++S A + + LST F + V + +
Sbjct: 444 MIEIRIGQEPLLQVFNKLLSLANTTAQPKVDSSLWRNLLLSTSGFLKSISTVSGKD---I 500
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +W+ G F +N+++NIVEL + +D + G
Sbjct: 501 NVFIDQWICHSGIAKFHGSFVFNRKRNIVELDIKQDLS-------------------KGT 541
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ + G +++ V ELDG ++H + D E+ CHSK + K + + D
Sbjct: 542 VKYVGPLTVCVQELDGSFNHTV--QIEDITSRHELPCHSKSRRNKKKKIPLMTGEE-VDM 598
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
N DA+ +SP+ W+R DPE+ +L ++ F QP MW QL + D++AQ Q++
Sbjct: 599 NLDAMDS-------DSPVLWLRIDPEVTWLRQVSFEQPDYMWQYQLRHERDIMAQVQSLV 651
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYKS 290
ALE P + + L + ++ + F+RVR EA LA AS D W G ++ +K+
Sbjct: 652 ALERFP--TAPTRDALMDIITQKECFYRVRQEACMCLAKVASASADSWTGQTAMLNIFKN 709
Query: 291 RRFDENI-GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
++ + R NDF +F YFVL+ +P A+A VR ++ PRE V F++ LLKYNDN
Sbjct: 710 MFGSKSAPQIIRYNDFSNFISYFVLKTLPIAIASVRNIHSQCPREVVLFLMDLLKYNDNR 769
Query: 350 GNPYSDVFWLAALVQSV------GELEFGQQSILFLS------SLLKRIDRLLQFDRLMP 397
N YSD + LAAL++++ G Q +L + +L + R L ++L P
Sbjct: 770 LNKYSDSYHLAALIEALSATITPGVAMVSTQRLLLVKLKDDTQLVLAEVIRRLNLEKLQP 829
Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFH 457
SY +T+SC++ L L+++G + D V + + F VR A L D
Sbjct: 830 SYRYCVTVSCLKALRN--LQVNGQVPADSVA--FEFYAGFGHFEDVRCSAIECLADYT-K 884
Query: 458 CNGIDSALSLFIKSVEEEP 476
+ L+ ++ VE++P
Sbjct: 885 VEVSEKTLNFLMEIVEKDP 903
>gi|68487713|ref|XP_712267.1| hypothetical protein CaO19.9682 [Candida albicans SC5314]
gi|46433641|gb|EAK93074.1| hypothetical protein CaO19.9682 [Candida albicans SC5314]
Length = 917
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 142/470 (30%), Positives = 226/470 (48%), Gaps = 57/470 (12%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G+ P TL+T+ FR+ KV R L+ FF +WV G PV R+ +NK+K ++E+
Sbjct: 103 GSLPNGTLNTEHFRYVCEKVN---RNKLENFFKQWVFGAGAPVFRISQRFNKKKGMIEMN 159
Query: 84 VLRDCTVKPDSRTPVLSSNTDSENRDGDIG--------------WPGMMSIRVHELDGM- 128
+ + V+ RTP DS + GD + G M+IRVHE DG
Sbjct: 160 IRQ---VQHHERTP---KKLDSSSFIGDAVAYLDNEPSPPQQHVFTGPMTIRVHESDGSP 213
Query: 129 YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPD-GCDDNGDAVAG--------- 178
Y+H + +A L+IQ +SK R+ K G P + GD +
Sbjct: 214 YEHIV--ELKEANTKLDIQYNSKF--RKIKKKDDGVDPGIAFNQFGDVLLSHAEMQEWNL 269
Query: 179 -----LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
D S P WIR D + E++A I QP M+ +QL D D+ AQ +A+
Sbjct: 270 RDWDKRDEESLSSEPFEWIRGDVDFEWIARIDIKQPDFMFGSQLVYDRDIEAQLEAVRYF 329
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVK-FYKSRR 292
L + L L D + F+ VRI AA ALA+ ++E ++ GL +L+K F K
Sbjct: 330 GELEKTNTTYCTALTRTLMDDRYFYGVRIAAAQALADCSNEANNFIGLEYLLKAFGKLYC 389
Query: 293 FDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNP 352
FD++ +P+ NDF DF + + A P + V+ D K P+ +L L+K+NDN+ N
Sbjct: 390 FDDSF-IPKSNDFSDFRSFMLQSAFPRILCGVKDNDGKVPKRIQNLLLNLVKFNDNSMNA 448
Query: 353 YSDVFWLAALVQS-VGELEFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILT 404
+ D +++ LV S V G ++ + F + + RL + DR +PSY I+
Sbjct: 449 FVDSLYVSELVVSLVTSTVSGWRNDIEEYNDTEFAGKVSSELFRLQKLDRWIPSYQNIIE 508
Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
++CI+ ++AL + +S ++++ + + RVEA R +L+L
Sbjct: 509 VTCIQQNIRMALVGARKLSFEELLYMTLD----KYPMEARVEAFRGILEL 554
>gi|68487423|ref|XP_712410.1| hypothetical protein CaO19.2135 [Candida albicans SC5314]
gi|46433795|gb|EAK93224.1| hypothetical protein CaO19.2135 [Candida albicans SC5314]
Length = 917
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 142/470 (30%), Positives = 226/470 (48%), Gaps = 57/470 (12%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G+ P TL+T+ FR+ KV R L+ FF +WV G PV R+ +NK+K ++E+
Sbjct: 103 GSLPNGTLNTEHFRYVCEKVN---RNKLENFFKQWVFGAGAPVFRISQRFNKKKGMIEMN 159
Query: 84 VLRDCTVKPDSRTPVLSSNTDSENRDGDIG--------------WPGMMSIRVHELDGM- 128
+ + V+ RTP DS + GD + G M+IRVHE DG
Sbjct: 160 IRQ---VQHHERTP---KKLDSSSFIGDAVAYLDNEPSPPQQHVFTGPMTIRVHESDGSP 213
Query: 129 YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPD-GCDDNGDAVAG--------- 178
Y+H + +A L+IQ +SK R+ K G P + GD +
Sbjct: 214 YEHIV--ELKEANTKLDIQYNSKF--RKIKKKDDGVDPGIAFNQFGDVLLSHAEMQEWNL 269
Query: 179 -----LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
D S P WIR D + E++A I QP M+ +QL D D+ AQ +A+
Sbjct: 270 RDWDKRDEESLSSEPFEWIRGDVDFEWIARIDIKQPDFMFGSQLVYDRDIEAQLEAVRYF 329
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVK-FYKSRR 292
L + L L D + F+ VRI AA ALA+ ++E ++ GL +L+K F K
Sbjct: 330 GELEKTNTTYCTALTRTLMDDRYFYGVRIAAAQALADCSNEANNFIGLEYLLKAFGKLYC 389
Query: 293 FDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNP 352
FD++ +P+ NDF DF + + A P + V+ D K P+ +L L+K+NDN+ N
Sbjct: 390 FDDSF-IPKSNDFSDFRSFMLQSAFPRILCGVKDNDGKVPKRIQNLLLNLVKFNDNSMNA 448
Query: 353 YSDVFWLAALVQS-VGELEFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILT 404
+ D +++ LV S V G ++ + F + + RL + DR +PSY I+
Sbjct: 449 FVDSLYVSELVVSLVTSTVSGWRNDIEEYNDTEFAGKVSSELFRLQKLDRWIPSYQNIIE 508
Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
++CI+ ++AL + +S ++++ + + RVEA R +L+L
Sbjct: 509 VTCIQQNIRMALVGARKLSFEELLYMTLD----KYPMEARVEAFRGILEL 554
>gi|241956976|ref|XP_002421208.1| TATA binding protein-associated factor, putative; transcription
initiation factor TFIID subunit, putative [Candida
dubliniensis CD36]
gi|223644551|emb|CAX41369.1| TATA binding protein-associated factor, putative [Candida
dubliniensis CD36]
Length = 1294
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 134/459 (29%), Positives = 218/459 (47%), Gaps = 42/459 (9%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G+ P TLST+ FR+ KV R L FF +WV G P+ R+ +NK+K ++E+
Sbjct: 563 GSLPNGTLSTEHFRYVCEKVN---RNKLDGFFKQWVFGAGAPIFRISQRFNKKKGMIEMN 619
Query: 84 V--LRDCTVKP---------DSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDH 131
+ ++ KP D L T + + G M+IRVHE DG Y+H
Sbjct: 620 IRQVQHYERKPRKLDSCGFIDDAIAYLDEETVPPQQHV---FTGPMTIRVHESDGSPYEH 676
Query: 132 PILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPD-GCDDNGDAVAGL----------- 179
+ +A L+IQ +SK R+ K +G P + GD +
Sbjct: 677 IV--ELKEANTKLDIQYNSKF--RKIKKKDEGVDPGIAFNQFGDVLVSAAEMQEWNLRDW 732
Query: 180 ---DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
D S P WIR D + E++A I QP M+ +QL D D+ AQ +A+ L
Sbjct: 733 DKRDEESLSSEPFEWIRGDVDFEWIARIDIKQPDFMFGSQLVYDRDIEAQLEAVRYFGEL 792
Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
+ L L D + F+ VRI AA ALA+ SE ++ GL +L+K + +
Sbjct: 793 EKTNVTYCTALTRTLMDERYFYGVRIAAAQALADCCSEANNFIGLEYLLKAFGELYCFDG 852
Query: 297 IGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDV 356
+P+ NDF DF + + A P + ++ D + P+ E +L L+KYNDN+GN + D
Sbjct: 853 SFIPKSNDFSDFRGFMLQRAFPKILCGIKDNDGRVPKRVQELLLNLVKYNDNSGNVFVDS 912
Query: 357 FWLAALVQS-VGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIA 415
+++ LV++ V G + F+ ++ + RL + DR + SY ++ ++CI+ ++A
Sbjct: 913 LYVSELVEALVSSTVSGGRDDGFVDKVVGELFRLQKMDRWVVSYQNVIEVTCIQQSIRMA 972
Query: 416 LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
L +SL++++ + + RVEA +L L
Sbjct: 973 LAGVRKVSLEELLYMTLD----KYPMEARVEAFSGILGL 1007
>gi|156848157|ref|XP_001646961.1| hypothetical protein Kpol_2000p71 [Vanderwaltozyma polyspora DSM
70294]
gi|156117643|gb|EDO19103.1| hypothetical protein Kpol_2000p71 [Vanderwaltozyma polyspora DSM
70294]
Length = 1495
Score = 176 bits (445), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 134/486 (27%), Positives = 229/486 (47%), Gaps = 76/486 (15%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +LS+ F+H +V + L+ FF +W+ G P+ R+
Sbjct: 687 RVLPKIFLQAMSGDLPNNSLSSAHFQHICERVN---KNKLENFFQQWIYGSGVPIFRVTQ 743
Query: 72 SYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSEN---------RDGDIGWPGMMSIRV 122
+NK++ +VE+ + R C + V S+ S + R+ + G M+IR+
Sbjct: 744 RFNKKRMVVEMGI-RQCQAQELGSGKVTGSSGFSSSALNFLEHPTREMAPHFSGSMTIRI 802
Query: 123 HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKP-----KKGSKPDGCDDNGDAV 176
HE DG Y+H + D + L+IQ ++K RA KP KK + P+ +D +
Sbjct: 803 HEPDGTPYEHIV--EVKDVFTKLDIQYNTKYRRSRARKPLGARSKKENTPEVSNDINEKY 860
Query: 177 AGLDMRSSME-------SP--------------------------LSWIRADPEMEYLAE 203
D ++++ SP WIR D + E++++
Sbjct: 861 HERDNNTNIQKLGTVLTSPQDCSEWNLTDISVSSEGHELQRQGEAFEWIRIDSDFEWISK 920
Query: 204 IHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLS----DSKAFWR 259
I+ NQP M+ +QL++DGDV AQ ++I E + S N ++ L+ DS+ F+
Sbjct: 921 IYINQPDYMFTSQLQQDGDVEAQIESIRYFEDVVMSSQNNSKLYSSILTRTIVDSRYFYG 980
Query: 260 VRIEAAYALANTASEETDW----AGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
VR+EA AL+ + + G HL+K Y+ EN +P N F DF Y+V +
Sbjct: 981 VRLEACRALSKFVVRDNESLSFRGGAQHLIKIYQKLFCYENSNIPLNNCFSDFKRYYVQQ 1040
Query: 316 AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQS 375
+IP ++ V+ +++P F++ LL YN+N GNPY D F++ L+++ + S
Sbjct: 1041 SIPQNLSNVKDERDETPDFIKSFIIDLLTYNENTGNPYDDTFFVCTLIKAAVKCAINSPS 1100
Query: 376 -ILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPF 434
F+ LL+++ R D+ P+Y ++T T+ LKL+ K + F
Sbjct: 1101 DTNFIGKLLEQLYRYESLDKWTPTYQLLIT----NTILSEKLKLNS--------KKLDRF 1148
Query: 435 RDFNTI 440
D N+I
Sbjct: 1149 EDLNSI 1154
>gi|451849770|gb|EMD63073.1| hypothetical protein COCSADRAFT_44824, partial [Cochliobolus sativus
ND90Pr]
Length = 1157
Score = 176 bits (445), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 156/565 (27%), Positives = 256/565 (45%), Gaps = 95/565 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L KQ G N + L ++ A QG P ++T +F KVG+ + L+ FF +W+
Sbjct: 498 LVKQSGKNGVDRCLYRMLFNARQGKYPNGAVTTDDFLDICEKVGHQK---LESFFQQWIV 554
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSEN---------RD-- 109
GCP S+NK+K +V+L + + ++ DS + + +++ RD
Sbjct: 555 GAGCPTFHCVPSFNKKKQVVQLTIKQ---LQADSSITTIENGLAADDFMREAKQKSRDLR 611
Query: 110 GDIGWP---GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
G+P G M+IR+HE DG Y+H + + + + K R L ++ +
Sbjct: 612 SATGYPTFVGPMTIRIHEADGTPYEHIVDINSNNVKVEIPYNTKYKRLKRNRLNKERMAA 671
Query: 166 PDGCDDNGDAVAGL------DMRSSME--------------------SPLSWIRADPEME 199
G D +G+ + D+ +S + WIR D + E
Sbjct: 672 AAGLDASGETQEDVTFYMLGDLFTSPQEHEEWRLDDWPQKDEAKQEGEAYEWIRIDADFE 731
Query: 200 YLAEIHF-NQPVQMWINQLEKDGDVVAQAQAIAAL---EALPHLSFNVVNTLNNFLSDSK 255
++ + + P M+++QL++D DVVAQA++I L E P +S +V T L D++
Sbjct: 732 WICRTNIEDMPSYMYVSQLQQDKDVVAQAESIQFLANKEGHPLISTFLVKT----LMDNR 787
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
F +R AA LA +A + T W GL HL K ++ N + R NDF D S Y +
Sbjct: 788 YFHGIRTMAAMVLAQSALKRTQWVGLFHLKKAFRELFCLPNSPMTRSNDFSDRSMYLIQC 847
Query: 316 AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL------ 369
AIP A+A ++ D +SP EA F+L L++YNDN GN YSD ++A L++ + E
Sbjct: 848 AIPRAIAKIKGEDGRSPLEAKRFLLDLVRYNDNRGNDYSDDHYIATLMRCLAETLTKTNI 907
Query: 370 -------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
E+ Q+ F + + + R + D +P+Y + T + + A KL
Sbjct: 908 GSELSHEEWAQEQD-FTNKAIGELTRHQRLDEWIPTYQNVYTTTAL----DCAFKL---- 958
Query: 423 SLDQVVKLIKPFR--DFNTIWQ------VRVEASRALLDLEFHCNGIDSALSLFIKSVEE 474
V + PF+ +F Q VR++A L+ L DS + + +
Sbjct: 959 ----VTSRVTPFKPTEFLQYAQLGNADNVRLKAWECLIRLGMFRK--DSVMRFLVHEIRS 1012
Query: 475 EPS--LRGQVKLGIHAMRICQIKGG 497
+PS R Q+ L I + I Q+ G
Sbjct: 1013 DPSPYFRKQM-LRILGLAIGQVATG 1036
>gi|344300178|gb|EGW30518.1| hypothetical protein SPAPADRAFT_157654 [Spathaspora passalidarum NRRL
Y-27907]
Length = 1507
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 129/438 (29%), Positives = 204/438 (46%), Gaps = 65/438 (14%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G P T++T F++ KV R L FF +WV + G P LR+ +NK++ ++E+
Sbjct: 599 GDLPNGTITTTHFQYLCEKVN---RNKLDGFFKQWVYSTGTPHLRIQQKFNKKRGMIEMQ 655
Query: 84 V-----LRDCTVKPDSRTPVLSSNTDSENRDGDIGWP---GMMSIRVHELDGM-YDHPIL 134
+ L + R+ + + + E+ G P G ++IRVHE DG Y+H +
Sbjct: 656 IRQVQHLEKRQARLTPRSFITDATSYIEDEPGHPLLPVFTGPITIRVHEADGTPYEHIVE 715
Query: 135 PMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMES------- 187
G+ ++IQ +SK+ KPKK D AGLDM S S
Sbjct: 716 LKEGNV--KIDIQYNSKMR-----KPKKK----------DIEAGLDMMSPFTSRLGDVLT 758
Query: 188 ------------------------PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDV 223
P WIR D + E++A I QP M+ QL D D+
Sbjct: 759 SPADMSSWGLSEFEKRDEEMMFNDPFEWIRVDVDYEWIAVIDIKQPDYMFAAQLCHDRDI 818
Query: 224 VAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLH 283
AQ +++ L S L L D + F+ VRI AA ALA ++ ++ GL +
Sbjct: 819 EAQLESVRYFRGLDKPSVVHCTVLVRTLMDDRYFYGVRIAAAEALAKFSNSSNNFIGLGY 878
Query: 284 LVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
L++ YK +P+ NDF DF +YF+ AIP ++ VR D P E +L L+
Sbjct: 879 LLRAYKELFCFRGSTIPKSNDFADFRQYFIQVAIPKILSKVRDQDGIVPPVIQELLLNLV 938
Query: 344 KYNDNNGNPYSDVFWLAALVQSVGELEFG-----QQSILFLSSLLKRIDRLLQFDRLMPS 398
K+NDN N ++D +++ L++++ + F + IDRLL+ D+ MPS
Sbjct: 939 KFNDNTNNSFTDGIYVSKLIEALTTTAVTGRKDEPHTKQFAKHVNTEIDRLLKLDQYMPS 998
Query: 399 YNGILTISCIRTLTQIAL 416
Y+ I+ I+CI+ ++A+
Sbjct: 999 YHNIIEITCIKQKIRLAV 1016
>gi|452001588|gb|EMD94047.1| hypothetical protein COCHEDRAFT_1192202 [Cochliobolus heterostrophus
C5]
Length = 1562
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 156/565 (27%), Positives = 256/565 (45%), Gaps = 95/565 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L KQ G N + L ++ A QG P ++T +F KVG+ + L+ FF +W+
Sbjct: 507 LVKQSGKNGVDRCLYRMLFNARQGKYPNGAVTTDDFLDICEKVGHQK---LESFFQQWII 563
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSEN---------RD-- 109
GCP S+NK+K +V+L + + ++ DS + + +++ RD
Sbjct: 564 GAGCPTFHCVPSFNKKKQVVQLTIKQ---LQADSSITTIENGLAADDFMREAKQKSRDLR 620
Query: 110 GDIGWP---GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
G+P G M+IR+HE DG Y+H + + + + K R L ++ +
Sbjct: 621 SATGYPTFVGPMTIRIHEADGTPYEHIVDINSNNVKVEIPYNTKYKRLKRNRLNKERMAA 680
Query: 166 PDGCDDNGDAVAGL------DMRSSME--------------------SPLSWIRADPEME 199
G D +G+ + D+ +S + WIR D + E
Sbjct: 681 AAGLDASGETQEDVTFYMLGDLFTSPQEHEEWRLDDWPPEDEAKQEGEAYEWIRIDADFE 740
Query: 200 YLAEIHF-NQPVQMWINQLEKDGDVVAQAQAIAAL---EALPHLSFNVVNTLNNFLSDSK 255
++ + + P M+++QL++D DVVAQA++I L E P +S +V T L D++
Sbjct: 741 WICRTNIEDMPSYMYVSQLQQDKDVVAQAESIQFLANKEGHPLISTFLVKT----LMDNR 796
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
F +R AA LA +A + T W GL HL K ++ N + R NDF D S Y +
Sbjct: 797 YFHGIRTMAAMVLAQSALKRTQWVGLFHLKKAFRELFCLPNSPMTRSNDFSDRSMYLIQC 856
Query: 316 AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL------ 369
AIP A+A ++ D +SP EA F+L L++YNDN GN YSD ++A L++ + E
Sbjct: 857 AIPRAIAKIKGEDGRSPLEAKRFLLDLVRYNDNRGNDYSDDHYIATLMRCLAETLTKTNI 916
Query: 370 -------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
E+ Q+ F + + + R + D +P+Y + T + + A KL
Sbjct: 917 SSELSHEEWAQEQD-FTNKAIGELTRHQRLDEWIPTYQNVYTTTAL----DCAFKL---- 967
Query: 423 SLDQVVKLIKPFR--DFNTIWQ------VRVEASRALLDLEFHCNGIDSALSLFIKSVEE 474
V + PF+ +F Q VR++A L+ L DS + + +
Sbjct: 968 ----VTSRVTPFKPTEFLQYAQLGNADNVRLKAWECLIRLGMFRK--DSVMRFLVHEIRS 1021
Query: 475 EPS--LRGQVKLGIHAMRICQIKGG 497
+PS R Q+ L I + I Q+ G
Sbjct: 1022 DPSPYFRKQM-LRILGLAIGQVATG 1045
>gi|238882937|gb|EEQ46575.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1423
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 141/470 (30%), Positives = 226/470 (48%), Gaps = 57/470 (12%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G+ P TL+T+ FR+ KV R L+ FF +WV G PV R+ +NK+K ++E+
Sbjct: 609 GSLPNGTLNTEHFRYVCEKVN---RNKLENFFKQWVFGAGAPVFRISQRFNKKKGMIEMN 665
Query: 84 VLRDCTVKPDSRTPVLSSNTDSENRDGDIG--------------WPGMMSIRVHELDGM- 128
+ + V+ RTP DS + GD + G M+IRVHE DG
Sbjct: 666 IRQ---VQHHERTP---KKLDSSSFIGDAVAYLDNEPSPPQQHVFTGPMTIRVHESDGSP 719
Query: 129 YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPD-GCDDNGDAVAG--------- 178
Y+H + +A L+IQ +SK R+ K G P + GD +
Sbjct: 720 YEHIV--ELKEANTKLDIQYNSKF--RKIKKKDDGVDPGIAFNQFGDVLLSHAEMQEWNL 775
Query: 179 -----LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
D S P WIR D + E++A I QP M+ +QL D D+ AQ +A+
Sbjct: 776 RDWDKRDEESLSSEPFEWIRGDVDFEWIARIDIKQPDFMFGSQLVYDRDIEAQLEAVRYF 835
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVK-FYKSRR 292
L + L L D + F+ VRI AA ALA+ ++E ++ GL +L+K F +
Sbjct: 836 GELEKTNTTYCTALTRTLMDDRYFYGVRIAAAQALADCSNEANNFIGLEYLLKAFGELYC 895
Query: 293 FDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNP 352
FD++ +P+ NDF DF + + A P + V+ D K P+ +L L+K+NDN+ N
Sbjct: 896 FDDSF-IPKSNDFSDFRGFMLQSAFPRILCGVKDNDGKVPKRIQNLLLNLVKFNDNSMNA 954
Query: 353 YSDVFWLAALVQS-VGELEFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILT 404
+ D +++ LV S V G ++ + F + + RL + DR +PSY I+
Sbjct: 955 FVDSLYVSELVVSLVTSTVSGWRNDIEEYNDTEFAGKVSSELFRLQKLDRWIPSYQNIIE 1014
Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
++CI+ ++AL + +S ++++ + + RVEA R +L+L
Sbjct: 1015 VTCIQQNIRMALVGARKLSFEELLYMTLD----KYPMEARVEAFRGILEL 1060
>gi|388853437|emb|CCF52836.1| related to TAF2-component of TFIID complex [Ustilago hordei]
Length = 1846
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 129/462 (27%), Positives = 221/462 (47%), Gaps = 87/462 (18%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST++F KV + L+ F W+ GCP ++N++K ++E+ + ++
Sbjct: 543 LSTQQFLRTCRKVSGAD---LRLFTEHWIRGSGCPRFICSANFNRKKLLIEMHIRQE--- 596
Query: 91 KPDSRTPVLSSNTDSENRDGDIG-------WPGMMSIRVHELDGM-YDHPILPMAGDAWQ 142
+ + + R GD + G M++R+HE DG Y+H +L + G A +
Sbjct: 597 --------VPAAQFAAARPGDALAANPVPLFEGQMTVRIHEADGTPYEH-VLEIKGPA-K 646
Query: 143 LLEIQCHSKLA-ARRALK---------------PKKGSKPDGCDDNGDAVAGLDMRSSME 186
E+ ++K RR K ++ + G D G + + + E
Sbjct: 647 RYEVPFNTKYKRVRRNTKRFQARQAAAAAAAQGDQEAQEAIGLIDLGFGLGMWEDEKARE 706
Query: 187 -----------------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
+P WIR D + E+LA IHF QP MW++QL++D DVVAQ A
Sbjct: 707 EWKVADWTEEDEEKMSSAPYEWIRMDADFEWLASIHFEQPDYMWVSQLQRDRDVVAQVSA 766
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYK 289
+ AL +P S + L + +K F+RVR EA +AL + A + D GL HL+K ++
Sbjct: 767 VHALAQMP--SLVTCSMLTRTVLVNKYFYRVRTEAVHALVHCAIPQLDNLGLFHLLKLFR 824
Query: 290 SRRF-----DENI-------GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVE 337
S+ D +I +PR NDF D ++YF+ A+ HA++ VR D ++P +
Sbjct: 825 SQFCHDPPDDASIEDPLDVPCIPRANDFSDAADYFLQRALIHAISRVRDPDGRTPPQIKR 884
Query: 338 FVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------------ELEFGQQSILFLSSLL 383
F++ LL+YNDN+ N + D F+LA + +V + + + L+ +
Sbjct: 885 FLINLLRYNDNSTNHFVDDFYLAGAINAVASAFIPVDSAMAGSQDPDAANEDSFLLTHAI 944
Query: 384 KRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLD 425
++RL + DRL+PSY+ ++T++ + Q+A+ L+ D
Sbjct: 945 AEVERLQELDRLVPSYHNVITLASLD--FQVAMMLANLKPRD 984
>gi|194215065|ref|XP_001916176.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
TFIID subunit 2 [Equus caballus]
Length = 1205
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 137/477 (28%), Positives = 227/477 (47%), Gaps = 57/477 (11%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + R AL K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSK-SRRXALIYNDKLKKIPLM 613
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
NG+ V S +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++I
Sbjct: 614 -NGEEVDXXVFSFSADSPLLWIRIDPDMAVLRKVEFEQADFMWQYQLRYERDVVAQQESI 672
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFYK 289
ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 673 LALEKFPTPASRLA--LTDILEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF- 729
Query: 290 SRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYND
Sbjct: 730 TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYND 789
Query: 348 NNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLM 396
N N +SD ++ A AL SV + L +L L+ I R L ++L+
Sbjct: 790 NRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLL 849
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 850 PSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 902
>gi|255716052|ref|XP_002554307.1| KLTH0F02222p [Lachancea thermotolerans]
gi|238935690|emb|CAR23870.1| KLTH0F02222p [Lachancea thermotolerans CBS 6340]
Length = 1443
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 141/512 (27%), Positives = 226/512 (44%), Gaps = 85/512 (16%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P TLST F+H +V + L+ FF WV G P+ R+
Sbjct: 635 RVLPKIFLQAMSGDLPNSTLSTAHFQHVCERVNKNK---LENFFKEWVYGSGVPIFRITQ 691
Query: 72 SYNKRKNIVELAVLRDC---------TVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRV 122
+NK++ +VE+ + R C TV + + ++ S N++ + G M+IR+
Sbjct: 692 RFNKKRMVVEMGI-RQCQVQELGLGKTVGDEGFSSSAMNHLCSPNKNLTPVFTGSMTIRI 750
Query: 123 HELDGM-YDHPILPMAGDAWQLLEIQCHS------------KLAARRALKPKKGSKPD-- 167
HE DG Y+H + D + L+IQ ++ K A + P PD
Sbjct: 751 HEADGTPYEHIV--ELKDTFTKLDIQYNTKYKRLKRRRKINKTAKQEGKDPTPAENPDVL 808
Query: 168 -GCDDNGDAV---------------------------AGLDMRSSMESPLSWIRADPEME 199
D+N D V G + + E+ WIR D + E
Sbjct: 809 VAEDENEDIVLVNCLGDVLLSAKDCNQWNLTDPMVTSEGDEFQQQNEA-FEWIRIDADFE 867
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNT----------LNN 249
++ +IH NQP M+ +QL++D DV AQ +I E + VVN+ L
Sbjct: 868 WVCKIHINQPDYMFASQLQQDRDVEAQIDSIRFFEDV------VVNSNVNSMVYSSILTR 921
Query: 250 FLSDSKAFWRVRIEAAYALANTASEETDW----AGLLHLVKFYKSRRFDENIGLPRPNDF 305
+ D + F+ VRIEA AL+ ++ + G HL+K ++ EN +P NDF
Sbjct: 922 TVMDERYFYGVRIEACRALSRFIMQQPNTDNFSGGARHLIKIFQYYFCYENSNIPFDNDF 981
Query: 306 RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
D+ YF+ +AIP +A VR ++ PR F+L +L+YN+N N Y D ++ L+ S
Sbjct: 982 SDYQRYFLQKAIPQLLAEVRNQNDVCPRYVKRFLLDILRYNENGDNSYYDTHYVVTLITS 1041
Query: 366 VGELEFGQ-QSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISL 424
V + F LL + R D+ +PSY +L CI + + L + GF
Sbjct: 1042 VVTSALQDPEDASFYQDLLHELRRFKNLDQWIPSYQ-LLVTKCI-MIQHLRLAIDGFYVF 1099
Query: 425 DQVVKLIKPF--RDFNTIWQVRVEASRALLDL 454
D + +++ N+ + A+ L DL
Sbjct: 1100 DDIANILEYTVKEKLNSTSETSFHATEGLQDL 1131
>gi|384496605|gb|EIE87096.1| hypothetical protein RO3G_11807 [Rhizopus delemar RA 99-880]
Length = 1667
Score = 171 bits (432), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 143/507 (28%), Positives = 231/507 (45%), Gaps = 95/507 (18%)
Query: 1 MLEKQM----GSNFFRKILQNI-ISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
ML+K+M G+ ++L I +S G +ST F KV + K F
Sbjct: 248 MLDKRMCKGGGTLGLSRVLPKILVSAMSGELAQNAISTHYFLRLCRKVSGFD---TKVFA 304
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT----VKPDSRTPVLSSNTDSENRDGD 111
+WV GCP F +N++K +VE+ + ++ T + PD+ +S + +
Sbjct: 305 EQWVYRSGCPKFSFSFHFNRKKMVVEIFMRQENTNALVIGPDN----MSYGVEYQELMAP 360
Query: 112 IGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA-ARRALK---------- 159
+ + G +++R+HE DG Y+H IL + D Q E+Q ++K RR K
Sbjct: 361 L-FTGNLTVRIHEADGTPYEH-ILDIHSDK-QKFEVQFNTKYKRIRRNTKRFQAKQAAAA 417
Query: 160 PKKGSKPDGCDDNGDAVAGLDMRSSM-------ESP------------------------ 188
+ ++ GD L + S+ E P
Sbjct: 418 AAVAEEEQENEEGGDQTTVLGIIPSLGLGMPIFEDPNRKKEWRIVEWGQDEEDTSGAASA 477
Query: 189 -LSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
WIR D E E++ + F QP MW QL KD DVVAQ +AI AL+ +P S+ +L
Sbjct: 478 MFDWIRLDAEFEWVCVVEFKQPDYMWAAQLTKDRDVVAQHEAIDALKYMP--SYATSTSL 535
Query: 248 NNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR-------RFDENIG-- 298
+ D K F+++R+EAAY LA+ +W GL L K + R + N G
Sbjct: 536 LRAIMDPKCFYKIRMEAAYGLASCGIPSLNWVGLHQLSKMFHQRFCLPISSQSSGNDGKT 595
Query: 299 --------LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNG 350
+P+PN+F D SEYF+ +A A A +R +P + +F+L LL+YNDN G
Sbjct: 596 DELPLIRSIPKPNNFSDLSEYFLQKASVVAFAQIRDERGLTPVKIRQFLLDLLRYNDNIG 655
Query: 351 NPYSDVFWLAALVQSVGELEFGQQSILFL----------SSLLKRIDRLLQFDRLMPSYN 400
N +SD +++A L+ ++G+ F ++ I+R D ++PSY+
Sbjct: 656 NEFSDCYYVATLISALGDALIPTADEGFQVDDPEGEQVNAAAKAEIERFRTLDYVIPSYH 715
Query: 401 GILTISCI---RTLTQIALKLSGFISL 424
I+T++C+ R + ++L+ F SL
Sbjct: 716 NIITVTCLKYTRYGNYLEVRLAAFDSL 742
>gi|190345979|gb|EDK37961.2| hypothetical protein PGUG_02059 [Meyerozyma guilliermondii ATCC 6260]
Length = 1601
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 136/474 (28%), Positives = 229/474 (48%), Gaps = 75/474 (15%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
TLST F++ KV R L+ FF +WV G P+ + +NK+++I+E+ + + +
Sbjct: 616 TLSTLHFQYVCEKVN---RNKLESFFKQWVFNAGVPLFSVAQRFNKKRSIIEMQIRQIQS 672
Query: 90 V-----KPDSRTPVLSSNTDSENRDGDIGWP---GMMSIRVHELDGM-YDHPILPMAGDA 140
KP+S + S ++ + P G M+IR+HE +G Y+H I+ + +
Sbjct: 673 QIIRKSKPNSNNFMEDSIAYLDDEPSFLVQPVFTGPMTIRIHEANGTPYEH-IIDLKHET 731
Query: 141 WQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAG-------LDMRSSME------- 186
+ L+IQ ++K R +K K G D+ + V+ L ++ ME
Sbjct: 732 SK-LDIQYNTKF---RRMKKKTG------DEVHETVSSFKQLGDILQTKAEMEEWNLVEW 781
Query: 187 ----------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
W+R D + E+++++ QP M+ +QL+ D DV AQ +A+
Sbjct: 782 IKDEEDPSFNDAFEWLRVDADFEWISKMSVKQPDYMFASQLQYDRDVEAQYEAVRYFATR 841
Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
+ N L + DS+ ++ VRI AA ALA + + GL +L+K +K+
Sbjct: 842 DKPNRNYCTVLTRTVMDSRYYYGVRIAAARALAGFSKTNNLFIGLEYLMKIFKTLYCFPE 901
Query: 297 IGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDV 356
+P+ NDF D + YF+ + + A +R D SP E F+L LLKYNDN+ N Y D
Sbjct: 902 SNVPKTNDFNDLASYFLKKEMISIFATIRDDDGNSPTEVKRFLLSLLKYNDNSNNDYQDG 961
Query: 357 FWLAALVQSV-------GELE-----FGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT 404
F++AAL++++ G+L F F+ L I+RL + D +PSY+ +++
Sbjct: 962 FYVAALIKALTSCIIITGDLRDIPDVFDNSKQKFVDDALVEINRLHKLDEWVPSYHNMIS 1021
Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDF--NTIWQ----VRVEASRALL 452
+ C+++ Q+A+ G+ L PF D T+ + VRVEA R L
Sbjct: 1022 VVCLQSKVQLAVH--GYYEL--------PFEDLLPYTLGKFPLDVRVEAFRGLF 1065
>gi|392597793|gb|EIW87115.1| TATA-binding protein associated factor Taf2 [Coniophora puteana
RWD-64-598 SS2]
Length = 1830
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 141/529 (26%), Positives = 243/529 (45%), Gaps = 84/529 (15%)
Query: 1 MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++G + R + + +S G P LST F KV ++ L+ F
Sbjct: 481 ILDRRLGKSGTSLGLSRVLPKIFLSAMTGEMPNNALSTHSFLRTCRKVSGVD---LRTFA 537
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW- 114
+W+ GCP ++N++K VE+ + ++C P R L N S+ + + +
Sbjct: 538 EQWIYGSGCPAFGFTATFNRKKMAVEITMRQEC---PAYR--ALEQNEVSKLLNKPVPFF 592
Query: 115 PGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA-ARRALK----------PKK 162
G M++R+HE DG Y+H +L + A++ E+ ++K RR K
Sbjct: 593 EGQMTVRIHEADGTPYEH-VLDIRS-AFKRYEVPFNTKYKRIRRNTKRYLARQAAAQAAA 650
Query: 163 GSKPDGCDDNG--DAVAGLDMRSSME--------------------SPLSWIRADPEMEY 200
P+ + G D GLD+ + + WIR D + E+
Sbjct: 651 EGDPEAVEAMGLIDMGFGLDIWEKEKERENWKVADWTEEEEQVMSGATYEWIRMDADFEW 710
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRV 260
+A + F QP MW++QL++D DVVAQ +A+ AL P + V +T + S ++R+
Sbjct: 711 IAAVAFEQPDFMWVSQLQRDRDVVAQMEAVHALARQP--TAIVSSTFTKTVLVSNYYYRI 768
Query: 261 RIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPRPNDFRDFSEY 311
R EAA L N A + D+ GL H+ K + +D + +P+PNDF D +EY
Sbjct: 769 RCEAAACLVNCAIRKLDFLGLFHMFKLFLRYCYDPEDPNQDLFSHNYVPKPNDFSDLAEY 828
Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV----- 366
FV +++ +A VR + K+P +F++ L+YNDN NPYSD F++ L+ +
Sbjct: 829 FVRKSLVDGIAQVRFENGKTPSVVRQFLVDQLRYNDNTANPYSDAFYICTLMAASACANV 888
Query: 367 -------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLT 412
GEL E + L+ L+ ++R DRL+PS + ++T + +
Sbjct: 889 PTAPPERGELLKEEVRSEQTAEDASLLNQALEEVNRYRSMDRLIPSPHNVVTTAVLEFY- 947
Query: 413 QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
+ L ++ + D +K+ P QVR+ A L +++ I
Sbjct: 948 -LLLSVANLVPND--MKVFFPLTREGNYTQVRIAAFDGLFMTKWYSPPI 993
>gi|255733076|ref|XP_002551461.1| hypothetical protein CTRG_05759 [Candida tropicalis MYA-3404]
gi|240131202|gb|EER30763.1| hypothetical protein CTRG_05759 [Candida tropicalis MYA-3404]
Length = 1463
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 142/481 (29%), Positives = 231/481 (48%), Gaps = 49/481 (10%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G P LST+ FR+ KV R L FF +WV G PV R+ +NK+K ++E+
Sbjct: 667 GDLPNGALSTEHFRYVCEKVN---RNKLDNFFKQWVFGSGAPVFRISQRFNKKKGMIEMN 723
Query: 84 VLRDCTVKPDSRTPVLSSN--TDSENRDGDIGWP-------GMMSIRVHELDGM-YDHPI 133
+ R +++ + N D+ + D P G M+IRVHE DG Y+H I
Sbjct: 724 I-RQIQHHENTKRKLDKDNFVVDAISYLDDESAPPVQQVFTGPMTIRVHESDGSPYEH-I 781
Query: 134 LPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDN--GDAVAG------------- 178
+ + +A L+IQ +SK R+ K +G P G N GD +
Sbjct: 782 VDLK-EANTKLDIQYNSKF--RKIKKKDEGVDP-GIPFNQFGDVLTTPFEMQQWNLKDWN 837
Query: 179 -LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP 237
D S P WIR D ++E++A I QP M+ +QL D D+ AQ +A+ L
Sbjct: 838 KRDEESLSSEPFEWIRGDVDLEWIARIDIKQPDFMFGSQLIYDRDIEAQLEAVRYFGELE 897
Query: 238 HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENI 297
+ TL L D + F+ VRI AA ALA ++E ++ G +L++ ++S
Sbjct: 898 KTNIVYCTTLVRTLMDERYFYGVRIAAAQALAGCSNEGNNFIGTDYLLQAFRSLYCFPGS 957
Query: 298 GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVF 357
+PR NDF DF + + ++ P ++ V+ D K P E +L L+K+NDN+ N ++D
Sbjct: 958 IIPRSNDFSDFRSFMLQKSFPKILSSVKDNDGKVPDRIRELLLNLVKFNDNSMNGFNDSH 1017
Query: 358 WLAALVQSVGELEFG------QQS--ILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
++ ++ ++ E QQS F+ ++ +DRL + D MPSY + I+C++
Sbjct: 1018 YVCTILSALTESAVSGWRKDLQQSNDARFVEKVVTELDRLQKLDHWMPSYQSSVEIACVK 1077
Query: 410 TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFI 469
++AL + +S + ++ + + RVEA + LL++ N S L F+
Sbjct: 1078 QKIRMALVGAITVSFEDLLHMTLD----KYPMEARVEAFKGLLEIGGLKN--KSVLDYFL 1131
Query: 470 K 470
K
Sbjct: 1132 K 1132
>gi|343429422|emb|CBQ72995.1| related to TAF2-component of TFIID complex [Sporisorium reilianum
SRZ2]
Length = 1838
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 126/455 (27%), Positives = 221/455 (48%), Gaps = 73/455 (16%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST+ F KV + L+ F W+ GCP ++N++K ++E+ + ++
Sbjct: 536 LSTQHFLRTCRKVSGAD---LRLFTEHWIRGSGCPRFICSANFNRKKLLIEMHIRQEV-- 590
Query: 91 KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCH 149
P ++ + D+ + + + G M++R+HE DG Y+H +L + G A + E+ +
Sbjct: 591 -PAAQF-AAARPGDALSANAVPLFEGQMTVRIHEADGTPYEH-VLEIKGPA-KRYEVPFN 646
Query: 150 SKLA-ARRALK---------------PKKGSKPDGCDDNGDAVAGLDMRSSME------- 186
+K RR K ++ + G D G + + + E
Sbjct: 647 TKYKRVRRNTKRFQARQAAAAAAAQGDQEAQEAIGLIDLGFGLGMWEDEKAREEWKVADW 706
Query: 187 ----------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
+P WIR D + E+LA IHF QP MW++QL++D DVVAQ A+ AL +
Sbjct: 707 TEEDEEKMSSAPYEWIRMDADFEWLASIHFEQPDYMWVSQLQRDRDVVAQVSAVHALAQM 766
Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
P S + L + +K F+RVR EA +AL + A + D GL HL+K ++++ ++
Sbjct: 767 P--SLVTCSMLTRTVLVNKYFYRVRTEAVHALVHCAIPQLDNLGLFHLLKLFRTQFCHDS 824
Query: 297 IG------------LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLK 344
+PR NDF D ++YF+ A+ HA++ VR D ++P + F++ LL+
Sbjct: 825 PDEAAIENPLDVPCIPRANDFSDAADYFLQRALIHAISRVRDPDGRTPPQVKRFLINLLR 884
Query: 345 YNDNNGNPYSDVFWLAALVQSVGELEFGQQSIL--------------FLSSLLKRIDRLL 390
YNDN+ N + D F+LA + ++ +S + LS + ++RL
Sbjct: 885 YNDNSTNHFVDDFYLAGAINALASAFIPVESAIASNQDADAVSEESFLLSHAIAEVERLQ 944
Query: 391 QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLD 425
+ DRL+PSY+ ++T++ + Q+A+ L+ D
Sbjct: 945 ELDRLVPSYHNVITLASLD--FQVAMMLANLKPRD 977
>gi|395334726|gb|EJF67102.1| hypothetical protein DICSQDRAFT_158655 [Dichomitus squalens
LYAD-421 SS1]
Length = 1880
Score = 169 bits (429), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 137/484 (28%), Positives = 226/484 (46%), Gaps = 87/484 (17%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
+ST F KV ++ + F +W+ GCP S+N++K VE+ + +D
Sbjct: 527 ISTHSFLRMCRKVSGIDP---RSFAEQWIYGSGCPTFGFSASFNRKKMAVEITMRQDAPA 583
Query: 91 KPDSRTPVLSSNTDSENRDGDIG-WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC 148
L +N ++ + + G M+IR+HE DG Y+H +L + ++ E+
Sbjct: 584 YKH-----LENNEITKMMLKPVPFFEGQMTIRIHEADGTPYEH-VLDIRS-PFKRYEVPF 636
Query: 149 HSKLA-ARRALK------------PKKGSKPDGCDDNGDAVAGLDMRSSMESPLSW---- 191
++K RR K + + D D GLD+ + +W
Sbjct: 637 NTKYKRVRRNTKRYLARQAAAQAAAEGDADAAAAMDMVDMGFGLDIWEKEQERENWKVAD 696
Query: 192 ----------------IRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEA 235
IR D + E++A I F+Q MW++QL++D DVVAQ +AI L
Sbjct: 697 WTEEDEQNMAGQTYEWIRIDADFEWIAAIAFDQKDYMWVSQLQRDRDVVAQYEAILVLSK 756
Query: 236 LPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD- 294
P+ + +T + S ++R+R EAA AL A++ +W GL HL K + ++
Sbjct: 757 TPNPI--ISSTFTKTVLVSNYYFRIRCEAAAALVTCANQRLEWLGLFHLFKLFLRYCYEP 814
Query: 295 --------ENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+ +PRPNDF DF+EYFV ++I A++ VR + KSP +F++ L+YN
Sbjct: 815 EDPKADLFSHTYVPRPNDFSDFAEYFVRKSILSAISRVRFENGKSPPVIRQFLVDQLRYN 874
Query: 347 DNNGNPYSDVFWLAALVQSVG-------ELEFGQ------------QSILFLSSLLKRID 387
DN NPY+D F++ ++ ++G E G+ Q L + L +D
Sbjct: 875 DNTTNPYADAFYICNIISALGCATVSSHPPERGEFISNDAAPAQDPQDAALLQAALSEVD 934
Query: 388 RLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPF----RDFNTIWQV 443
R DRL+P+Y+ ++T++ I Q+ L L+G + D +PF R+ N I QV
Sbjct: 935 RYRSMDRLIPTYHNVVTVAVIE--FQMLLSLAGLVPHDP-----RPFFPLTREGNYI-QV 986
Query: 444 RVEA 447
R+ A
Sbjct: 987 RIAA 990
>gi|443894493|dbj|GAC71841.1| TATA binding protein associated factor [Pseudozyma antarctica T-34]
Length = 1845
Score = 169 bits (427), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 122/437 (27%), Positives = 217/437 (49%), Gaps = 70/437 (16%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST++F KV + L+ F W+ GCP ++N++K ++E+ + ++
Sbjct: 543 LSTQQFLRTCRKVSGAD---LRLFTEHWIRGSGCPRFICSANFNRKKLLIEMHIRQEV-- 597
Query: 91 KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCH 149
P ++ + D+ + + G M++R+HE DG Y+H +L + G A + E+ +
Sbjct: 598 -PAAQF-AAARPGDALAANAVPLFEGQMTVRIHEADGTPYEH-VLEIKGPA-KRYEVPFN 653
Query: 150 SKLA-ARRALK---------------PKKGSKPDGCDDNGDAVAGLDMRSSME------- 186
+K RR K ++ + G D G + + + E
Sbjct: 654 TKYKRVRRNTKRFQARQAAAAAAAQGDQEAQEAIGLIDLGFGLGMWEDEQAREEWKVADW 713
Query: 187 ----------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
+P WIR D + E+LA IHF QP MW++QL++D DVVAQ A+ AL +
Sbjct: 714 TEEDEEKMSSAPYEWIRMDADFEWLASIHFEQPDHMWVSQLQRDRDVVAQVSAVHALAQM 773
Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR----- 291
P S + L + +K F+RVR EA +AL + A + D GL HL+K ++S+
Sbjct: 774 P--SLVTCSMLTRTVLVNKYFYRVRTEAVHALVHCAIPQLDNLGLFHLLKLFRSQFCHDP 831
Query: 292 ----RFDENIG---LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLK 344
F++ + +PR NDF D ++YF+ A+ HA++ VR D ++P + F++ LL+
Sbjct: 832 PNEASFEDPLDVPCIPRANDFSDAADYFLQRALIHAISRVRNPDGRTPAQVKRFLINLLR 891
Query: 345 YNDNNGNPYSDVFWLAALVQSVGE----LEFG---------QQSILFLSSLLKRIDRLLQ 391
YNDN+ N + D F+LA + ++ ++ G + L+ + ++RL +
Sbjct: 892 YNDNSTNHFVDDFYLAGAINALASAFIPIDSGLAQDSSANADEENFLLAHAIAEVERLQE 951
Query: 392 FDRLMPSYNGILTISCI 408
DRL+PS++ ++T++ +
Sbjct: 952 LDRLVPSFHNVITLASL 968
>gi|146420870|ref|XP_001486388.1| hypothetical protein PGUG_02059 [Meyerozyma guilliermondii ATCC 6260]
Length = 1601
Score = 168 bits (426), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 138/474 (29%), Positives = 229/474 (48%), Gaps = 75/474 (15%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
TLST F++ KV R L+ FF +WV G P+ + +NK+++I+E+ + +
Sbjct: 616 TLSTLHFQYVCEKVN---RNKLESFFKQWVFNAGVPLFLVAQRFNKKRSIIEMQIRQIQL 672
Query: 90 V-----KPDSRTPVLSSNTDSENRDGDIGWP---GMMSIRVHELDGM-YDHPILPMAGDA 140
KP+S + S ++ + P G M+IR+HE +G Y+H I+ + +
Sbjct: 673 QIIRKSKPNSNNFMEDSIAYLDDEPSFLVQPVFTGPMTIRIHEANGTPYEH-IIDLKHET 731
Query: 141 WQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAG-------LDMRSSM-------- 185
+ L+IQ ++K R +K K G D+ + V+ L ++ M
Sbjct: 732 SK-LDIQYNTKF---RRMKKKTG------DEVHETVSSFKQLGDILQTKAEMEEWNLVEW 781
Query: 186 ----ESPL-----SWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
E PL W+R D + E+++++ QP M+ +QL+ D DV AQ +A+
Sbjct: 782 IKDEEDPLFNDAFEWLRVDADFEWISKMSVKQPDYMFASQLQYDRDVEAQYEAVRYFATR 841
Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
+ N L + DS+ ++ VRI AA ALA + + GL +L+K +K+
Sbjct: 842 DKPNRNYCTVLTRTVMDSRYYYGVRIAAARALAGFSKTNNLFIGLEYLMKIFKTLYCFPE 901
Query: 297 IGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDV 356
+P+ NDF D + YF+ + + A +R D SP E F+L LLKYNDN+ N Y D
Sbjct: 902 SNVPKTNDFNDLASYFLKKEMISIFATIRDDDGNSPTEVKRFLLSLLKYNDNSNNDYQDG 961
Query: 357 FWLAALVQSV-------GELE-----FGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT 404
F++AAL++++ G+L F F+ L I+RL + D +PSY+ +++
Sbjct: 962 FYVAALIKALTSCIIITGDLRDIPDVFDNSKQKFVDDALVEINRLHKLDEWVPSYHNMIS 1021
Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDF--NTIWQ----VRVEASRALL 452
+ C++ Q+A+ G+ L PF D T+ + VRVEA R L
Sbjct: 1022 VVCLQLKVQLAVH--GYYEL--------PFEDLLPYTLGKFPLDVRVEAFRGLF 1065
>gi|50548681|ref|XP_501810.1| YALI0C13904p [Yarrowia lipolytica]
gi|49647677|emb|CAG82120.1| YALI0C13904p [Yarrowia lipolytica CLIB122]
Length = 1264
Score = 168 bits (426), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 134/484 (27%), Positives = 231/484 (47%), Gaps = 74/484 (15%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
L T F +V + + L EFF +WV + G PVL++ +NK++ +E+ + + T
Sbjct: 567 LGTNHFYKLCERVSHCK---LDEFFDQWVYSSGVPVLQVTQRFNKKRMFIEMGIRQTQTD 623
Query: 91 KPDSRTPV--------LSSNTDSENRDGDIGWP------GMMSIRVHELDGM-YDHPILP 135
K + PV + S ++ D + G P G ++IR+HE DG Y+H +
Sbjct: 624 KSGEKKPVEDTFIGDVIRSEEGVDHGDEEEGGPKTPLFTGPITIRIHEADGTPYEHVV-- 681
Query: 136 MAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGC-----------------DDNGDAVAG 178
+ L+IQ ++K +R + KK + D+ G ++
Sbjct: 682 DIKHIFTKLDIQYNTKY--KRLKRNKKEVEAAATVTATAAPPAASASNNEEDEVGVVISC 739
Query: 179 L----------------------DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQ 216
L +MR S E+ W+R D + E++ +IH NQP M+ +Q
Sbjct: 740 LGDVLQTDAELEEWELSEWGKDEEMRMSNEA-FEWLRIDADFEWICKIHVNQPDYMYASQ 798
Query: 217 LEKDGDVVAQAQAIAAL-EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
L++D DVVAQ +++ E+ P ++ + L L D + ++ VR+ AA L A E
Sbjct: 799 LQQDRDVVAQYESVQFFAESRPSKVYSTI--LLRTLIDRRYYYGVRVAAALGLVKMAVSE 856
Query: 276 TDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA 335
T + G HL+K +++ N +PR NDF DF YFV +AI A + +R D + P E
Sbjct: 857 TSYIGQTHLLKAFQNLFCFPNSSIPRSNDFSDFPTYFVQKAITRAFSQIRNGDGECPAEI 916
Query: 336 VEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL-----EFGQQSILFLSSLLKRIDRLL 390
+L +L YN+N+ N SD F+L+ L+ + ++ + + F + + IDR
Sbjct: 917 RTLILDILAYNENSTNSMSDSFYLSDLITFLADVLCSSDSNDESTREFAARAVNEIDRCG 976
Query: 391 QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRA 450
+ D +PSY ++T++ + ++ LK G S ++ LI+ R N+ +VR+ A A
Sbjct: 977 RLDTWIPSYQNVVTVAALEAKKRLFLK--GLFSA-KLQDLIQFTRSVNSS-EVRLAAFEA 1032
Query: 451 LLDL 454
+L +
Sbjct: 1033 MLHI 1036
>gi|71015037|ref|XP_758766.1| hypothetical protein UM02619.1 [Ustilago maydis 521]
gi|46098556|gb|EAK83789.1| hypothetical protein UM02619.1 [Ustilago maydis 521]
Length = 1846
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 130/482 (26%), Positives = 229/482 (47%), Gaps = 75/482 (15%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST+ F KV + L+ F W+ GCP ++N++K ++E+ + ++
Sbjct: 542 LSTQLFLRTCRKVSGAD---LRLFTEHWIRGSGCPRFICSANFNRKKLLIEMNIRQEV-- 596
Query: 91 KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCH 149
P ++ + D+ + + G M++R+HE DG Y+H +L + G A + E+ +
Sbjct: 597 -PAAQF-AAARPGDALAANAVPLFEGQMTVRIHEADGTPYEH-VLEIKGPA-KRYEVPFN 652
Query: 150 SKLA-ARRALK---------------PKKGSKPDGCDDNGDAVAGLDMRSSME------- 186
+K RR K ++ + G D G + + + E
Sbjct: 653 TKYKRVRRNTKRFQARQAAAAAAAQGDQEAQEAIGLIDLGFGLGMWEDEKAREEWKVADW 712
Query: 187 ----------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
+P WIR D + E+LA IHF QP MW++QL++D DVVAQ A+ AL +
Sbjct: 713 TEEDEEKMSSAPYEWIRMDADFEWLASIHFEQPDYMWVSQLQRDRDVVAQVSAVHALAQM 772
Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
P S + L + +K F+RVR EA +AL + A + D GL HL+K ++++ +
Sbjct: 773 P--SLVTCSMLTRTVLVNKYFYRVRTEAVHALVHCAIPQLDNLGLFHLLKLFRTQFCHDP 830
Query: 297 IG------------LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLK 344
+PR NDF D ++YF+ A+ HA++ VR D ++P + F++ LL+
Sbjct: 831 PDEASIENPLDVPCIPRANDFSDAADYFLQRALIHAISRVRNPDGRTPPQVKRFLINLLR 890
Query: 345 YNDNNGNPYSDVFWLAALVQSVG--------------ELEFGQQSILFLSSLLKRIDRLL 390
YNDN+ N + D F+LA + ++ + + + L+ + ++RL
Sbjct: 891 YNDNSTNHFVDDFYLAGAINALASAFIPIESAVAATQDADMASEESFLLAHAVAEVERLQ 950
Query: 391 QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRA 450
+ DRL+PSY+ ++T++ + Q+A+ L+ D ++L + VR+ A
Sbjct: 951 ELDRLVPSYHNVITLASLD--FQVAMMLANLKPKD--LQLFFTYTRQGNFTPVRIAALNC 1006
Query: 451 LL 452
LL
Sbjct: 1007 LL 1008
>gi|440640095|gb|ELR10014.1| hypothetical protein GMDG_00772 [Geomyces destructans 20631-21]
Length = 1406
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 158/580 (27%), Positives = 253/580 (43%), Gaps = 91/580 (15%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K GS +++ ++ A G R ++T+ FR K+G+ + L FF +W+
Sbjct: 554 LTKLSGSTGIVRVISRLLYNASIGEVSDRAVTTEGFRRLCEKIGHYK---LDAFFNQWIM 610
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD-------GDIG 113
GCP ++ +NK+K VE+ + + +P R + E ++ GD+
Sbjct: 611 GAGCPRFQVTQKFNKKKLAVEMTISQKQDTQPTQRK-LEKHQFLREFKEEAYGVYAGDVQ 669
Query: 114 --WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-----AARRALKPKKGSK 165
+ G M+IR+HE DG Y+H + G Q +EI H+K + R + G
Sbjct: 670 PVFTGPMTIRIHESDGTPYEHIVEIREG--IQKIEIPYHTKYKRLKRSRREKERATAGPG 727
Query: 166 PDGCDDNGDAVA-----------------GL-DMRSSMESPL-----SWIRADPEMEYLA 202
D +N D V GL D + ME + WIR D + E++
Sbjct: 728 VDITTENHDDVLLYCLGDVLQSKEDVQEWGLVDWDTEMEVKMDQESYEWIRMDADFEWIC 787
Query: 203 EIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRI 262
E++ N P M+++QL++D DVVAQ ++ L P V L L D + FW +R
Sbjct: 788 ELNLNMPSYMYLSQLQQDRDVVAQQDSMLFLAKSPPHPL-VSTILIRTLMDRRYFWGIRE 846
Query: 263 EAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVA 322
A LA E W G +HL K ++ +P+ NDF D ++Y + AIP A+A
Sbjct: 847 MACRYLATHGVEAHSWIGRVHLEKAFQRMFCYPGSSMPKSNDFSDKTDYRMKLAIPKAMA 906
Query: 323 MVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV------------GELE 370
VR + K P EA +F+L LL++NDN N YSD +++ L++++ G +
Sbjct: 907 RVRNIEGKCPLEARQFLLDLLRFNDNGNNEYSDYYYICTLMKALADSLIPIKNETPGGMS 966
Query: 371 FG------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLTQIALKLSGF 421
F ++ F L+ I+R + D + SY I T + C R L + + S
Sbjct: 967 FSFADEDEEELQRFQQVALEEIERYRRMDEWILSYQNIYTKTALECKRRLMKSKITPSD- 1025
Query: 422 ISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR-- 479
+ + + T+ VR++A AL DL + N L + S + P +R
Sbjct: 1026 ------PRELFAYTHDGTMDLVRIQAFDALTDLGYLSNHHLLRYFLNVMSTDASPFVRDR 1079
Query: 480 ---------GQVKLGIHAMRICQIKGGSDSNHEVDTVTLV 510
G V LG H+ K + E D TL+
Sbjct: 1080 LCQILFSGLGAVALGEHS------KAAVEPKQEEDMDTLI 1113
>gi|393213296|gb|EJC98793.1| hypothetical protein FOMMEDRAFT_161628 [Fomitiporia mediterranea
MF3/22]
Length = 1807
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 135/502 (26%), Positives = 233/502 (46%), Gaps = 91/502 (18%)
Query: 1 MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++G + R + + +S G LST F KV ++ L+ F
Sbjct: 471 ILDRRLGKSGTSLGLSRVLPKIFLSATAGDMAQNCLSTHFFLRTCRKVSGVD---LRSFV 527
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+W+ GCP ++N++K VE+ + ++C P + + + + +
Sbjct: 528 DQWIHGSGCPQFSFLATFNRKKMAVEIQMRQEC--------PAYAIHQNDPISNSLLKPV 579
Query: 114 --WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA-ARRALK------PKKG 163
+ G M++R+HE DG Y+H + ++ + E+ ++K RR K
Sbjct: 580 PFFEGQMTVRIHEADGTPYEHVLDILS--PIKRFEVPFNTKYKRVRRNTKRYLARQAAAQ 637
Query: 164 SKPDGCDDNGDAVAGLDMRSSME--------------------------SPLSWIRADPE 197
+ +G + +A+ +DM +E + WIR D +
Sbjct: 638 AAAEGDTEAAEAMGMVDMSFGLEIWENEEEREKWKVTDWTEEEEQAMAGATYEWIRMDAD 697
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
E++A I F QP MW++QL++D DVVAQ +A+ AL P + V +TL + + F
Sbjct: 698 FEWIARIQFEQPDYMWVSQLQRDRDVVAQVEAVHALARQP--TPIVSSTLTKTVLVTNYF 755
Query: 258 WRVRIEAAYALANTASEETDWAGLLHLVKFY---------KSRRFDENIGLPRPNDFRDF 308
+RVR EAA AL N AS D+ GL HL K + S+ + +PRPNDF D
Sbjct: 756 FRVRCEAALALVNCASRRLDFVGLFHLFKLFLRYCYEPEDSSQDLFSHKFVPRPNDFSDV 815
Query: 309 SEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV-- 366
+EYF A+ +A++ VR + K+P E +F + L+YNDN N YSD +++A+L+ ++
Sbjct: 816 AEYF---ALVNAISQVRFENGKTPPEIRQFFIDQLRYNDNTTNAYSDAYYIASLITALAC 872
Query: 367 ----------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
GEL + + L + + +DR DRL+PS + ++TI+ +
Sbjct: 873 ATISTAPPERGELTMSVPKNDITGEDHQLLQAAITEVDRYRSMDRLIPSVHNVVTIASLE 932
Query: 410 TLTQIALKLSGFISLDQVVKLI 431
I L ++ I D + L+
Sbjct: 933 --FHIMLAMANLIPSDPKIFLL 952
>gi|150865037|ref|XP_001384087.2| hypothetical protein PICST_58795 [Scheffersomyces stipitis CBS 6054]
gi|149386293|gb|ABN66058.2| predicted protein, partial [Scheffersomyces stipitis CBS 6054]
Length = 1408
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 136/470 (28%), Positives = 223/470 (47%), Gaps = 57/470 (12%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR-DC 88
TLST+ F++ KV R L+ FF +WV G P+ + +NK++ ++E+++ +
Sbjct: 589 TLSTQHFQYVCEKVN---RNKLENFFKQWVYGVGAPIFNITQRFNKKRGVIEMSIRQIQH 645
Query: 89 TVKPDSRTPVLSSNTDS-ENRDGDIGWP------GMMSIRVHELDGM-YDHPILPMAGDA 140
V S T S DS + + +P G M+IRVHE DG Y+H + G+
Sbjct: 646 QVTRKSGTNAESFINDSIAYLEDEPTFPVQSIFTGPMTIRVHEADGTPYEHIVDLKEGNT 705
Query: 141 WQLLEIQCHSKLAARRALKPKKGSKPDGCDDN--GDAVAG---------LDMRSSMESPL 189
L++Q +SK RR K + + + + GD + D + P+
Sbjct: 706 --KLDVQYNSKF--RRMKKNRDETSENAVTFSRLGDVLESEKEMEEWNLADWAKVDDDPM 761
Query: 190 S-----WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVV 244
+ WIR D + E++A QP M+ +QL+ D DV AQ A+ L + S
Sbjct: 762 NIEAFEWIRVDVDFEWIARFDVKQPDYMFGSQLQHDRDVEAQFDAVRYLGNIEKPSTIHC 821
Query: 245 NTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPND 304
L + D + ++ VRI AA ALAN ++ T++ G+ +LVK Y+ +P ND
Sbjct: 822 TALTRTVVDERYYYGVRIAAAEALANFSNSVTNFIGVPYLVKIYRELYCFPGSSIPLSND 881
Query: 305 FRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQ 364
F DF +F+ + IP + +R +D++ P +L L+K+NDN N + D F+++ LVQ
Sbjct: 882 FNDFGRFFLQKEIPKQLCKIRDSDDEVPVVIRNLILNLIKFNDNTNNNFQDSFYISELVQ 941
Query: 365 SVGELEF-------------------GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTI 405
S+ + F+++++ I+RL + D +PSY+ ++++
Sbjct: 942 SLTTCAVNSSFPNSPKDIFPKSHPHGSSEKKKFVANVITEINRLQKLDEWIPSYHNVVSV 1001
Query: 406 SCIRTLTQ-IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+C LTQ I L L G + L L F Q+RVEA R L L
Sbjct: 1002 TC---LTQKIRLALHGHLDLSFEDLLYFTVEKFPI--QIRVEAFRGLFVL 1046
>gi|354543021|emb|CCE39739.1| hypothetical protein CPAR2_601590 [Candida parapsilosis]
Length = 1364
Score = 166 bits (419), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 132/435 (30%), Positives = 204/435 (46%), Gaps = 45/435 (10%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G P TLST F++ KV R L FF +WV G P ++ +NK+K ++EL
Sbjct: 621 GDLPNGTLSTDHFQYVCEKVN---RNKLDSFFKQWVFGSGAPSFKISQRFNKKKGMIELT 677
Query: 84 VLR---------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPI 133
+ + VK + N + ++ ++ + G M+IRVHE DG Y+H I
Sbjct: 678 IRQVQHHENKNVQLNVKSFIDDAICFLNKEPLSQRQNV-FTGPMNIRVHESDGTPYEHII 736
Query: 134 LPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDN--GDAVA-------------- 177
+ ++I K+ R+ K +G P G + N GD +
Sbjct: 737 --HIKEMKTTIDISLSKKV--RKIRKKDEGVDP-GLNFNQFGDILTTEEEKRTWYLRDWE 791
Query: 178 GLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP 237
D S P WIR D + E++A+I QP M+ +QL D DV AQ A+ L
Sbjct: 792 SRDEDSLSTEPFEWIRGDIDFEWIAKIEVLQPDFMYGSQLVYDRDVEAQFDALRYFGNLE 851
Query: 238 HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENI 297
S L L D + F+R+RI AA ALA+ ++E ++ G+ +L+K YK +
Sbjct: 852 KPSIIYCTALVRTLLDERYFYRIRIAAAEALASCSNESNNYIGMKYLIKAYKYMYCFPDS 911
Query: 298 GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVF 357
+P+ NDF DF + + +AIP + + D + P E + LLKYNDN+ N Y D
Sbjct: 912 SIPKSNDFGDFRSFLIQKAIPGILCKITDNDGRVPAEVQGLLFNLLKYNDNSMNDYQDTL 971
Query: 358 WLAALVQ--SVGELEFGQQSIL------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
+L+ L++ +V + ++ + F + I R+ + D PSY IL I+C +
Sbjct: 972 YLSDLIKALTVSAISGWHKNTVEGNNLEFALKVDVEITRIQKLDEWRPSYQRILQITCFK 1031
Query: 410 TLTQIALKLSGFISL 424
+ QIA L G ISL
Sbjct: 1032 SRVQIA--LCGAISL 1044
>gi|361131910|gb|EHL03525.1| putative Transcription initiation factor TFIID subunit 2 [Glarea
lozoyensis 74030]
Length = 1328
Score = 165 bits (418), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 152/541 (28%), Positives = 239/541 (44%), Gaps = 82/541 (15%)
Query: 8 SNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVL 67
S R I + IIS G ++TK+FR KV + F+ +W+ GCP L
Sbjct: 559 SGLPRIISKLIISGNTGLPEESVMTTKDFRRLVEKVTKYRA--TESFWNQWIYGAGCPRL 616
Query: 68 RMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD-------GDIG--WPGMM 118
+ +NK++ VE+ + + + P +R P+ N E ++ G+I + G +
Sbjct: 617 NITQRFNKKRLCVEMTINQSQSTLPTTR-PLKKENFPRELKEELHGIYAGEIQPLFTGPL 675
Query: 119 SIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAA-----RRALKPKKGSKPDGCDDN 172
+IR+HE DG Y+H + + Q +EI ++K R+ + G+ DG +
Sbjct: 676 TIRIHEADGTPYEHIVEIREQN--QKIEIPYNTKYKRLKRNRRQKERLNPGANVDGETGD 733
Query: 173 ------GDAVAG---------------LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQ 211
GD + G + R ES W+R D + E++ N P
Sbjct: 734 TLYYCLGDVLQGEQDVAEWGLKDWDKETEARMDSES-YEWLRIDADFEWICGTTLNVPAY 792
Query: 212 MWINQLEKDGDVVAQAQAIAALE-ALPHLSFNVVNTLN-NFLSDSKAFWRVRIEAAYALA 269
MWI+QL++D DVVAQ Q++ L A PH +V T L D++ + +R AA L
Sbjct: 793 MWISQLQQDRDVVAQQQSMQFLALATPH---GLVATFCLRTLMDTRYYHEIRTMAANILK 849
Query: 270 NTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADN 329
A E W+GL HL + Y+ E +PR NDF D Y+V +AIP A++ R
Sbjct: 850 KHACEAAHWSGLRHLTRAYQEFFCYEGTKMPRSNDFSDKRSYWVEQAIPTALSQSRTKAG 909
Query: 330 KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---------------------- 367
K P+ A EF+L L++NDN NP+SD +A L+ ++
Sbjct: 910 KCPKAAREFILDTLQFNDNRANPFSDNHKVANLLTALAESLMSSDPGTYKEPANYTGNGP 969
Query: 368 ---ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI-- 422
E+E + F +L +DR + D + SY I T + + + L +G I
Sbjct: 970 SEPEVEDPDEIAQFRDKVLVELDRYRRMDEWINSYQNIYTTTVLEAYRK--LMKAGVIPK 1027
Query: 423 -SLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQ 481
LD V L D VR++A +L+DL + + L L + S + P R Q
Sbjct: 1028 EPLDFVQYLHDGTSDL-----VRIKAFESLIDLGYITHEGICGLLLNVLSTDISPYTRNQ 1082
Query: 482 V 482
+
Sbjct: 1083 L 1083
>gi|396470788|ref|XP_003838714.1| hypothetical protein LEMA_P023870.1 [Leptosphaeria maculans JN3]
gi|312215283|emb|CBX95235.1| hypothetical protein LEMA_P023870.1 [Leptosphaeria maculans JN3]
Length = 1522
Score = 165 bits (417), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 147/560 (26%), Positives = 242/560 (43%), Gaps = 84/560 (15%)
Query: 2 LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L KQ G N + L ++ A QG ++T +F KVG+ + L+ FF +WV
Sbjct: 508 LVKQSGKNGVDRCLYRMLFNARQGKYSNGAIATDDFLDICEKVGHQK---LESFFAQWVY 564
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG------- 113
GCP+ + S+NK+K +V+L + ++ +R + D + G
Sbjct: 565 GSGCPIFQCFPSFNKKKQVVQLQITQEQAQPSITRIESGLTAEDFMREAKEKGHEVQSGT 624
Query: 114 -WP---GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDG 168
+P G M+IR+HE DG Y+H + + + K R L ++ + G
Sbjct: 625 EYPTFVGPMTIRIHEADGTPYEHIVDINSASVKVEIPYNTKYKRLKRNRLNKERMAAAAG 684
Query: 169 CDDNGDAVAGL--------------------------DMRSSMESPLSWIRADPEMEYLA 202
D +G+ + D + WIR D + E++
Sbjct: 685 LDASGETQEDVTFYMLGDLFTTPEEHAGWRLDDWPAEDEAKQEQEAYEWIRIDADFEWIC 744
Query: 203 EIHFNQ-PVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKAFW 258
+ N P M+++QL++D DVVAQA++I A E +S ++ T L DS+ F
Sbjct: 745 KTVINDMPSYMYVSQLQQDKDVVAQAESIQFLAQKEGHKLISTFLIKT----LMDSRYFH 800
Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
+R AA +A +A + T+W GL HL K ++ + R NDF D S Y + AIP
Sbjct: 801 GIRTMAAEVMAQSALKRTEWIGLFHLKKAFRELFCIPGSPMTRSNDFTDRSSYLIQCAIP 860
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQS--- 375
A+A +R D ++P E F+L +L+YNDN GN YSD +++ L++ + + ++
Sbjct: 861 RAIAKIRGEDGRAPMEVKRFLLDVLQYNDNRGNDYSDDHYISTLMRCLADCLISSKNNND 920
Query: 376 ----------ILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIAL------KLS 419
F + + R + D +P+Y I T + + T++ L K S
Sbjct: 921 DLSPAEWAAEQRFQEKAIGELTRYQRLDEWIPTYQNIYTTTALECATKLTLNRVTPFKPS 980
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPS-- 477
F+ Q+ VR++A L+ L DS L I + +PS
Sbjct: 981 EFLQYAQL----------GNADNVRLKAWECLVRLGLFRK--DSVLRFMIHELRSDPSPY 1028
Query: 478 LRGQVKLGIHAMRICQIKGG 497
R Q+ L I + + QI G
Sbjct: 1029 FRKQL-LRIFGVAVGQIATG 1047
>gi|403214688|emb|CCK69188.1| hypothetical protein KNAG_0C00750 [Kazachstania naganishii CBS 8797]
Length = 1419
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 132/482 (27%), Positives = 221/482 (45%), Gaps = 73/482 (15%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G P +LS+ F+H +V N R L+ FF +WV G PV R+ YNK++ ++E+
Sbjct: 632 GELPNNSLSSAHFQHVCERV-NKNR--LESFFQQWVYGAGVPVFRITQRYNKKRMVIEMG 688
Query: 84 VLRDCT---VKPDSRTPVLSSN--------TDSENRDGDIGWPGMMSIRVHELDGM-YDH 131
+ R C + SR V S ++ NR+ + G M+IR+HE DG Y+H
Sbjct: 689 I-RQCQNQDSESHSRKVVGSDGFFKSAMNYIETPNRNTPSFFTGSMTIRIHESDGTPYEH 747
Query: 132 PILPMAGDAWQLLEIQCHSKLAARRALK----PKK-----------GSKPDGCDDNGDAV 176
+ D + ++IQ ++K R K PK+ SK D++
Sbjct: 748 IV--ELKDTYTKIDIQYNTKYRRMRGRKVVATPKEESLERSTSNLAASKVVAVDEDDIPK 805
Query: 177 AGLDMRSS----------------------MESPLSWIRADPEMEYLAEIHFNQPVQMWI 214
G + S+ M WIR D + E+L ++H NQP M+
Sbjct: 806 LGTVLTSAEDCYKWKLTDIAKTTEGNEALLMNEAFEWIRIDSDFEWLCKVHINQPDYMFA 865
Query: 215 NQLEKDGDVVAQAQAIAALEAL----PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN 270
+QL +DGDV AQ +I + + S + L D K F+ VR+EA AL+
Sbjct: 866 SQLRQDGDVEAQIDSIRYYQDVIMDSNQQSLVYSSILTRTAMDEKYFYGVRLEACKALSQ 925
Query: 271 -------TASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAM 323
+ +E G HL+K ++ ++ +P+ NDF +YF+ + IP ++
Sbjct: 926 YVYKQPESPNESEFVGGSKHLIKIFQELFCFDDSNVPKSNDFSIIQKYFLQKWIPKYLSQ 985
Query: 324 VRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV-GELEFGQQSILFLSSL 382
V++ N+ P+ F+L +L YN+N+ NP+ D F+LA L+ V + FL +
Sbjct: 986 VKSETNECPKFVKNFLLDILTYNENSENPFDDSFYLANLISCVIDSILHDHDDTEFLHRI 1045
Query: 383 LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS--GFISLDQVVKLIKPFRDFNTI 440
+ ++ R D+ MPSY ++T RT+ LKL+ G + + + +++ + I
Sbjct: 1046 MGQLHRYENLDKWMPSYQVLVT----RTILNQKLKLNIEGLYTFENLKDVLEYSLNIEHI 1101
Query: 441 WQ 442
W+
Sbjct: 1102 WK 1103
>gi|170044706|ref|XP_001849978.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167867753|gb|EDS31136.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1222
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 138/511 (27%), Positives = 231/511 (45%), Gaps = 66/511 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ ++ A A+ + +S+ + F + + +
Sbjct: 450 MLEHRIGQELLLQVFNKQLALASNAASTK-ISSGLWHQLHISTNIFTKAIFTVTGKDMAV 508
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G M +N+++N +EL + +D N+ G
Sbjct: 509 FIDQWVRTGGHARFTMTSVFNRKRNTIELEIRQDAV-----------------NQKGVRK 551
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + I++ ELDG + H + Q+ I S + + K K C G
Sbjct: 552 YVGPLLIQLQELDGTFKHTL--------QIENIVVKSDITCHSKSRRNKKKKIPLC--TG 601
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ ESP+ WIR DPEM L +H QP W QL + DV AQ +IAAL
Sbjct: 602 EEV-DMDLSPMDESPVLWIRLDPEMTLLRSVHIEQPDFQWQFQLRHERDVTAQLDSIAAL 660
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFYKS 290
E A P + +T+ N + F+ VR +AA L A S T W G ++ +K
Sbjct: 661 ERYASPATRLALTDTIEN----EQIFYEVRCKAALCLTKVANSMVTSWQGPPAMLTIFK- 715
Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
+F + P R N+F +F YF+ + IP A+A +R A P E + F+L L KYND
Sbjct: 716 -KFFGSFSAPHIIRQNNFTNFQHYFLQKTIPVAMAGLRTAHGICPPEVIRFLLDLFKYND 774
Query: 348 NNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSS--------LLKRIDRLLQFDR 394
N N YSD ++ AALV+++G + Q + S +L+ + R+L ++
Sbjct: 775 NTKNHYSDNYYRAALVEALGNSITPVVSMVHQGMALTSENLSADSKLVLEEVTRILNLEK 834
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+PSY ++I+C++ + + L+ G + + K+ + + ++ VRV A L+D
Sbjct: 835 HLPSYKYTVSIACLKVIRK--LQKCGHLPTNP--KIYRSYAEYGQYIDVRVAAMECLVDY 890
Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLG 485
+G L I +E +P + +LG
Sbjct: 891 -VKIDGRWEDLENLIDLLETDPDPMARHRLG 920
>gi|348517379|ref|XP_003446211.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Oreochromis niloticus]
Length = 1206
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 132/478 (27%), Positives = 231/478 (48%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + LST F + +N G P
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLLSTHAFLKSISNVSGKDIGPL 507
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 508 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 545
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H +L + ++ + +I CHSK + K
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-MLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFY 288
I+ALE P + + L + L + F++VR++A + LA A+ + W G + +
Sbjct: 655 ISALEKFPTPASRLA--LTDILEQEQCFYKVRMQACFCLAKIANAMVSTWQGPPAMKSLF 712
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F +F YF+ + +P A+A++R N P++ + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFINFQSYFLQKTMPVAMALLRDVQNLCPKDVLNFILDLIKYN 771
Query: 347 DNNGNPYSDVFWLAALVQSV-----------GELEFGQQSILFLSSLLKRIDRLLQFDRL 395
DN N +SD ++ A L++++ E+ + +L+ I R L ++L
Sbjct: 772 DNRKNKFSDNYYRADLIEALTNSLTPAISINNEVRTVDNLNADVRLILEEITRFLNMEKL 831
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PS+ +T+SC+R + Q L+ +G I D L K + + VR+ A AL+D
Sbjct: 832 LPSFRNTITVSCLRAIRQ--LQKNGHIPSDP--SLFKSYAQYGHFVDVRIAALEALVD 885
>gi|430812189|emb|CCJ30342.1| unnamed protein product [Pneumocystis jirovecii]
Length = 1052
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 143/508 (28%), Positives = 236/508 (46%), Gaps = 68/508 (13%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVR-TLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K S +I+ + +A G + LST F + KV R L F +W+
Sbjct: 372 LTKSGASWGLSRIIPKLFLQAMGGTLANNALSTSHFLKISEKVS---RVSLDVFAQQWIH 428
Query: 61 TCGCPVLRMGFSYNKRKNIVELAV--LRDCTVKPDSRTPVLSSNTDSENRDGDIG----- 113
G P LR+ +N++K ++E+ + ++D P S T V D++ +I
Sbjct: 429 GRGYPRLRVIQRFNRKKMLIEIGIRQVQDIETPPKSLT-VTDFFNDAKFHANNISNSTFP 487
Query: 114 -WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP----D 167
+ G M+IR+HE DG YDH + +A+ L+IQ + K R + KK K D
Sbjct: 488 IFTGTMTIRIHEADGTPYDHVV--ELKEAFTKLDIQYNIKNKRFRRNQSKKQKKDMMDID 545
Query: 168 GC----DDNGDAVAGL---------------------DMRSSMESPLSWIRADPEMEYLA 202
G +DN +V D E WIR D + E++
Sbjct: 546 GVGDIDNDNDFSVQCFGDILQSEQEIKNWKLEDWLKEDEEMMSEEAFEWIRLDADFEWIC 605
Query: 203 EIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVV--NTLNNFLSDSKAFWRV 260
I NQP M+ +QL +D DV+AQ +AI + + N + + L L DS+ ++ +
Sbjct: 606 IIWVNQPDYMYFSQLRQDRDVIAQYEAI---QYFSNSKGNCIYSSILLRTLMDSRYYYGI 662
Query: 261 RIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHA 320
R EAA AL A+E+ DW G HL K +++ N + + N+F + SEY+V IP A
Sbjct: 663 RCEAALALVKCATEDLDWIGWYHLFKAFQTNYCFPNSSISKSNNFSNLSEYYVQCIIPIA 722
Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE--LEFGQQSILF 378
++ +R S + +F++ LL+YNDN+ N YSD +++ L+ S+ + ++S F
Sbjct: 723 MSKIRDKKGNSLIQIKKFLIDLLRYNDNSSNEYSDCYYICTLLNSLASTLITENKESFEF 782
Query: 379 ------------LSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQ 426
+ + L I+R + D+L+PS+ I++I+ ++ + L +G I L+
Sbjct: 783 NFAMNDPENKNIIDTALSEIERFQRIDQLLPSFQNIISITSLK--IKYLLSKAGIIPLN- 839
Query: 427 VVKLIKPFRDFNTIWQVRVEASRALLDL 454
+ I P+ VR+ A LL L
Sbjct: 840 -FQEILPYTHSANYDLVRLTAFDVLLKL 866
>gi|403283587|ref|XP_003933197.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Saimiri
boliviensis boliviensis]
Length = 1153
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 427 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 486
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 487 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 524
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 525 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 575
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 576 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 633
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 634 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 691
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 692 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 750
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 751 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 810
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 811 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 864
>gi|390605011|gb|EIN14402.1| hypothetical protein PUNSTDRAFT_80735 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1886
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 129/449 (28%), Positives = 201/449 (44%), Gaps = 77/449 (17%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G P LST F KV L+ + F +W+ GCP S+N++K VE+
Sbjct: 512 GDLPSNALSTHYFLRMCRKVSGLDP---RSFAEQWIYGSGCPSFGFSASFNRKKMAVEIT 568
Query: 84 VLRDCTVKPDSRTPVLSSNTDSENRDGDIGWP---GMMSIRVHELDGMYDHPILPMAGDA 140
+ ++ P R D R P G M++R+HE DG +L + +
Sbjct: 569 MRQEA---PAYRL----YEHDEVKRALWKPVPFFEGQMTVRIHEADGTPYEHVLDIRSE- 620
Query: 141 WQLLEIQCHSKLA-----ARRALKPKKGSKPDGCDDNG--------DAVAGLDMRSSME- 186
++ E+ ++K +R L + ++ D D GLD+ E
Sbjct: 621 FKRFEVPFNTKYKRVRRNTKRYLARQAAAQAAAEGDAEAAEAIGMIDMSFGLDIWEKEEE 680
Query: 187 -------------------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQA 227
+ WIR D + E++A I F QP MW++QL++D DVVAQ
Sbjct: 681 RENWKVADWTEEDEQIMSGATYEWIRMDADFEWIAAIAFEQPDFMWVSQLQRDRDVVAQL 740
Query: 228 QAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKF 287
+A+ AL P + V +TL S F+R+R EA AL + A + D+ GL HL K
Sbjct: 741 EAVYALSRQP--TAIVSSTLTKTALVSNYFFRIRCEAIIALVSCAIRKLDYLGLFHLFKL 798
Query: 288 YKSRRFD---------ENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEF 338
+ +D + +PRPNDF D SEYFV +++ A++ VR + K+P F
Sbjct: 799 FLRYCYDPEDPHQDLFTHTYVPRPNDFSDLSEYFVRKSLLTAISRVRFENGKTPAIIRRF 858
Query: 339 VLQLLKYNDNNGNPYSDVFWLAALVQSVG-------------------ELEFGQQSILFL 379
++ LKYNDN NPYSD F++ +++ ++G + E ++ L
Sbjct: 859 LVDQLKYNDNTSNPYSDAFYICSIISALGCAIVSLAPPERGELLPMESKTEQSSDDVVLL 918
Query: 380 SSLLKRIDRLLQFDRLMPSYNGILTISCI 408
L +DR DRL+PS + +TI+ I
Sbjct: 919 RQALAEVDRYRSMDRLIPSPHNAVTIAAI 947
>gi|393240985|gb|EJD48509.1| hypothetical protein AURDEDRAFT_112948 [Auricularia delicata
TFB-10046 SS5]
Length = 1830
Score = 163 bits (412), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 125/479 (26%), Positives = 217/479 (45%), Gaps = 86/479 (17%)
Query: 1 MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++G + R + + +S G P LST F KV + + F
Sbjct: 471 ILDRRLGKSGTSLGLSRVLPKLFLSAISGEMPNNALSTNVFFKMCRKVCGFDS---RGFA 527
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+W+ GCP LR ++N+++ VE+ + +P N D R +
Sbjct: 528 EQWIYASGCPRLRFAANFNRKRMAVEITM--------QQHSPAWEKNKDDPVRLAQLKPV 579
Query: 114 --WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA-ARRALK------PKKG 163
+ G M++R+HE DG Y+H +L + A++ ++ ++K RR K
Sbjct: 580 HIFEGHMTVRIHEADGTPYEH-VLDIQT-AYKKYDVPFNTKYKRVRRNTKRFIARQQAAA 637
Query: 164 SKPDGCDDNGDAVAGLDMRSSME--------------------------SPLSWIRADPE 197
G + +A+ +D+ ++ + WIR D +
Sbjct: 638 LAAQGDTEAAEAIGMIDVGFGLDVWEDAKERENWKVADWTEEEEQVMAGATYEWIRMDAD 697
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
E++A I F QP MW++QL++D DVVAQ +AI AL P + V T + + F
Sbjct: 698 FEWIATIAFEQPDFMWVSQLQRDRDVVAQLEAIRALSRQP--TAIVSGTFTKTVLVTNYF 755
Query: 258 WRVRIEAAYALANTASEETDWAGLLHLVKFY---------KSRRFDENIGLPRPNDFRDF 308
+R+R+EAA L + A+ + + GL HL K + K + + +P NDF DF
Sbjct: 756 YRIRVEAALGLVSCATNKLGFIGLFHLFKIFLRYCHTPTNKDQDLFTHTYVPLQNDFSDF 815
Query: 309 SEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE 368
SEYFV +A+ AV+ VR + K+P F++ L++NDN N YSD +++ ++ ++G
Sbjct: 816 SEYFVRKAVLTAVSKVRFENGKTPPIVRRFLIDQLRFNDNTNNAYSDARYISCIITALGH 875
Query: 369 L-------EFG------------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
E+G Q+ + L ++R DRL+PS + ++T+S +
Sbjct: 876 ACVSTAPPEYGELMQTQGRTDQNQEDVELLDQATAEVERYRTMDRLVPSSHNVVTVSVL 934
>gi|431901704|gb|ELK08581.1| Transcription initiation factor TFIID subunit 2 [Pteropus alecto]
Length = 1252
Score = 162 bits (411), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 460 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 519
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 520 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 557
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 558 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 608
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 609 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 666
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 667 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 724
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 725 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 783
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 784 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 843
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 844 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 897
>gi|392570887|gb|EIW64059.1| hypothetical protein TRAVEDRAFT_55102 [Trametes versicolor
FP-101664 SS1]
Length = 1844
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 97/285 (34%), Positives = 152/285 (53%), Gaps = 34/285 (11%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E++A I F+Q MW++QL++D DVVAQ +AIA L P+ + +TL
Sbjct: 702 WIRIDADFEWIAAIAFDQKDYMWVSQLQRDRDVVAQYEAIAVLSKSPNAI--IASTLTKT 759
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFY-------KSRRFD--ENIGLPR 301
+ S F+R+R EAA AL N A+++ DW GL HL K + + ++ D + +PR
Sbjct: 760 ILVSNYFYRIRCEAAGALVNCATQKLDWLGLFHLFKLFLRYCYEPEDQKADLFSHTYVPR 819
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
PNDF D +EYFV +AI ++A VR K+P F++ L+YNDN NPY+D +++
Sbjct: 820 PNDFSDIAEYFVRKAILRSIAKVRFETGKTPTVIRRFLVDQLRYNDNTTNPYADALYIST 879
Query: 362 LVQSVGEL-------EFGQ------------QSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
++ ++G E G+ Q L L +DR DRL+P+++ +
Sbjct: 880 VISALGYAIVSAVPPERGEFVSSENGPVPDLQDAELLKLALSEVDRYRSMDRLIPTFHNV 939
Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEA 447
+T++ I + L L+G + D +L P QVR+ A
Sbjct: 940 VTVAVIE--FHMLLTLAGLVPHDS--RLFFPLTREGNYTQVRIAA 980
Score = 42.0 bits (97), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 48/106 (45%), Gaps = 16/106 (15%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
+ST F KV ++ + F +W+ GCP S+N++K VE+ + +D
Sbjct: 517 VSTHAFLRMCRKVSGIDP---RSFAEQWIYGSGCPTFGFSASFNRKKMAVEITMRQDA-- 571
Query: 91 KPDSRTPVLSSNTDSENRDGDIG----WPGMMSIRVHELDGM-YDH 131
P ++ +E + + G M+IR+HE DG Y+H
Sbjct: 572 ------PAYKAHETNEIAKLMMKPVPFFEGQMTIRIHEADGTPYEH 611
>gi|395818007|ref|XP_003782430.1| PREDICTED: transcription initiation factor TFIID subunit 2
[Otolemur garnettii]
Length = 1199
Score = 162 bits (410), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|2827282|gb|AAC68502.1| TATA binding protein associated factor [Homo sapiens]
Length = 1189
Score = 162 bits (410), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 449 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 508
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 509 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 546
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 547 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 597
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 598 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 655
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 656 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 713
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 714 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 772
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 773 DNGKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 832
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 833 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 886
>gi|351715354|gb|EHB18273.1| Transcription initiation factor TFIID subunit 2, partial
[Heterocephalus glaber]
Length = 1172
Score = 162 bits (410), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 136/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 431 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 490
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 491 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 528
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 529 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLL------- 579
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ A +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 580 -NGEE-ADMDLSAMDADSPLLWIRIDPDMAVLRKVEFEQADFMWQYQLRYERDVVAQQES 637
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 638 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 695
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + R N+F F YF+ + +P A+A++R N P+E + FVL L+KYN
Sbjct: 696 -TRMFCCKTCPNIVRTNNFLSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFVLDLIKYN 754
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 755 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 814
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 815 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 868
>gi|4507347|ref|NP_003175.1| transcription initiation factor TFIID subunit 2 [Homo sapiens]
gi|3046692|gb|AAC13540.1| TBP-associated factor TAFII150 [Homo sapiens]
gi|148922381|gb|AAI46380.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 150kDa [synthetic construct]
Length = 1199
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|383413613|gb|AFH30020.1| transcription initiation factor TFIID subunit 2 [Macaca mulatta]
Length = 1199
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|45199193|ref|NP_986222.1| AFR674Cp [Ashbya gossypii ATCC 10895]
gi|44985333|gb|AAS54046.1| AFR674Cp [Ashbya gossypii ATCC 10895]
gi|374109455|gb|AEY98361.1| FAFR674Cp [Ashbya gossypii FDAG1]
Length = 1385
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 126/453 (27%), Positives = 214/453 (47%), Gaps = 74/453 (16%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
+LST F++ +V R L++FF +WV G PV R+ +NK++ +VE+ + R
Sbjct: 583 SLSTAHFQYVCERVN---RSKLEQFFAQWVFGSGVPVFRITQRFNKKRMVVEMGI-RQVQ 638
Query: 90 VKPDSRTPVLSSNTDSENRDGDIGWP---------GMMSIRVHELDGM-YDHPILPMAGD 139
++ + ++ + + P G M+IR+HE DG Y+H + D
Sbjct: 639 LQELGQGHIIGKRGFHASAMDYLCHPDKQLTQIFTGSMTIRIHEADGTPYEHIV--ELKD 696
Query: 140 AWQLLEIQCHSKLAARRALKP-KKGSKPDGC------------------DDNGD------ 174
+ L+IQ ++K + + K KPDG +DN D
Sbjct: 697 VFTKLDIQYNTKYKRLKRRRKINKPIKPDGKEASIVEPLPKDEDMKILDEDNDDIVLVNC 756
Query: 175 -------------------AVAGLDMRSSMES-PLSWIRADPEMEYLAEIHFNQPVQMWI 214
A+ G D S ++ WIR D + E++ +++ NQP M+
Sbjct: 757 LGDTLLSREDCQHWNLTDPAIIGEDDESQQQNEAFEWIRVDADFEWICKVYINQPDYMFA 816
Query: 215 NQLEKDGDVVAQAQAIAALEALPHLSFNVVNT------LNNFLSDSKAFWRVRIEAAYAL 268
+QL++D DV AQ ++I E + ++ + VN+ L + D + F+ VR+EA AL
Sbjct: 817 SQLQQDRDVEAQIESIRFFEDV--VASSAVNSQVYSSILTRTVMDDRYFYGVRLEACNAL 874
Query: 269 ANTASEETD----WAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMV 324
+ +E + G HL+ ++ EN + + NDF DFS YF+ AIP +A
Sbjct: 875 SKFILKEVEPNQFSGGARHLIAIFRYYFCYENSNIQKNNDFTDFSRYFIQCAIPRFLANT 934
Query: 325 RAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQ-QSILFLSSLL 383
R K P F+L+LL+YNDN+GNPYSDV ++ A++ SV G + +++ LL
Sbjct: 935 RDTQGKCPTFVKRFLLELLRYNDNSGNPYSDVRYVVAIIDSVVACTLGDPEDKEYINDLL 994
Query: 384 KRIDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
+ R DR +PSY +++ + ++ +A+
Sbjct: 995 NELKRFENLDRWLPSYQLLVSKAILKQYLLLAV 1027
>gi|397504097|ref|XP_003822645.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Pan
paniscus]
gi|410267202|gb|JAA21567.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 150kDa [Pan troglodytes]
gi|410307198|gb|JAA32199.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 150kDa [Pan troglodytes]
Length = 1199
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|332214173|ref|XP_003256205.1| PREDICTED: transcription initiation factor TFIID subunit 2
[Nomascus leucogenys]
Length = 1199
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|297683539|ref|XP_002819433.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Pongo
abelii]
Length = 1199
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|388453833|ref|NP_001253309.1| transcription initiation factor TFIID subunit 2 [Macaca mulatta]
gi|380788081|gb|AFE65916.1| transcription initiation factor TFIID subunit 2 [Macaca mulatta]
gi|384939964|gb|AFI33587.1| transcription initiation factor TFIID subunit 2 [Macaca mulatta]
Length = 1199
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|114621469|ref|XP_519923.2| PREDICTED: transcription initiation factor TFIID subunit 2 [Pan
troglodytes]
gi|410208974|gb|JAA01706.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 150kDa [Pan troglodytes]
gi|410354347|gb|JAA43777.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 150kDa [Pan troglodytes]
Length = 1199
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|145559533|sp|Q6P1X5.3|TAF2_HUMAN RecName: Full=Transcription initiation factor TFIID subunit 2;
AltName: Full=150 kDa cofactor of initiator function;
AltName: Full=RNA polymerase II TBP-associated factor
subunit B; AltName: Full=TBP-associated factor 150 kDa;
AltName: Full=Transcription initiation factor TFIID 150
kDa subunit; Short=TAF(II)150; Short=TAFII-150;
Short=TAFII150
gi|189067574|dbj|BAG38179.1| unnamed protein product [Homo sapiens]
Length = 1199
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|2739087|gb|AAC02966.1| cofactor of initiator function [Homo sapiens]
Length = 1199
Score = 162 bits (409), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|410987700|ref|XP_004000133.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
TFIID subunit 2 [Felis catus]
Length = 1207
Score = 162 bits (409), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 467 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 526
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 527 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 564
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 565 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 615
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 616 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 673
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 674 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 731
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 732 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 790
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 791 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 850
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 851 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 904
>gi|296227297|ref|XP_002759311.1| PREDICTED: transcription initiation factor TFIID subunit 2
[Callithrix jacchus]
Length = 1199
Score = 162 bits (409), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|73974557|ref|XP_532318.2| PREDICTED: transcription initiation factor TFIID subunit 2 [Canis
lupus familiaris]
Length = 1199
Score = 162 bits (409), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|291388465|ref|XP_002710796.1| PREDICTED: TBP-associated factor 2 [Oryctolagus cuniculus]
Length = 1198
Score = 162 bits (409), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|292622006|ref|XP_002664849.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Danio
rerio]
Length = 1191
Score = 161 bits (408), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 137/478 (28%), Positives = 224/478 (46%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G P
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLVSTSGFLKSISNVSGKDIGPL 507
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 508 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 545
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + L + L + F++VR+ A + LA A S + W G + +
Sbjct: 655 ILALEKFP--TPPSRRALTDILEQDQCFYKVRMHACFCLAKIANSMVSTWTGPPAMKSLF 712
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + IP A+A +R N PRE + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFISFQSYFLQKTIPVAMAQLRDVQNLCPREVLSFILDLIKYN 771
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL S+ + + SL L+ I R L ++L
Sbjct: 772 DNRKNKFSDNYYRAELIDALTNSLTPAISINNEVRTVDSLNADVRLILEEITRFLNMEKL 831
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G I D L K + ++ VR+ A A++D
Sbjct: 832 LPSYRNTITVSCLRAIRM--LQKNGHIPSDPT--LFKSYAEYGHFVDVRIAALEAVID 885
>gi|79154077|gb|AAI07958.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Danio rerio]
Length = 1191
Score = 161 bits (408), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 137/478 (28%), Positives = 224/478 (46%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G P
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLVSTSGFLKSISNVSGKDIGPL 507
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 508 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 545
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + L + L + F++VR+ A + LA A S + W G + +
Sbjct: 655 ILALEKFP--TPPSRRALTDILEQDQCFYKVRMHACFCLAKIANSMVSTWTGPPAMKSLF 712
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + IP A+A +R N PRE + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFISFQSYFLQKTIPVAMAQLRDVQNLCPREVLSFILDLIKYN 771
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL S+ + + SL L+ I R L ++L
Sbjct: 772 DNRKNKFSDNYYRAELIDALTNSLTPAISINNEVRTVDSLNADVRLILEEITRFLNMEKL 831
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G I D L K + ++ VR+ A A++D
Sbjct: 832 LPSYRNTITVSCLRAIRM--LQKNGHIPSDPT--LFKSYAEYGHFVDVRIAALEAVID 885
>gi|426360591|ref|XP_004047522.1| PREDICTED: transcription initiation factor TFIID subunit 2-like,
partial [Gorilla gorilla gorilla]
Length = 959
Score = 161 bits (408), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 135/478 (28%), Positives = 229/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T +P G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRL--ALTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|116256083|sp|Q32PW3.2|TAF2_DANRE RecName: Full=Transcription initiation factor TFIID subunit 2;
AltName: Full=TBP-associated factor 150 kDa; AltName:
Full=Transcription initiation factor TFIID 150 kDa
subunit; Short=TAF(II)150; Short=TAFII-150;
Short=TAFII150
Length = 1191
Score = 161 bits (408), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 137/478 (28%), Positives = 224/478 (46%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G P
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLVSTSGFLKSISNVSGKDIGPL 507
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 508 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 545
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + L + L + F++VR+ A + LA A S + W G + +
Sbjct: 655 ILALEKFP--TPPSRRALTDILEQDQCFYKVRMHACFCLAKIANSMVSTWTGPPAMKSLF 712
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + IP A+A +R N PRE + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFISFQSYFLQKTIPVAMAQLRDVQNLCPREVLSFILDLIKYN 771
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL S+ + + SL L+ I R L ++L
Sbjct: 772 DNRKNKFSDNYYRAELIDALTNSLTPAISINNEVRTVDSLNADVRLILEEITRFLNMEKL 831
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G I D L K + ++ VR+ A A++D
Sbjct: 832 LPSYRNTITVSCLRAIRM--LQKNGHIPSDPT--LFKSYAEYGHFVDVRIAALEAVID 885
>gi|119612397|gb|EAW91991.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 150kDa, isoform CRA_a [Homo sapiens]
gi|119612398|gb|EAW91992.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 150kDa, isoform CRA_a [Homo sapiens]
gi|119612399|gb|EAW91993.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor, 150kDa, isoform CRA_a [Homo sapiens]
Length = 1030
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 290 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 349
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 350 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 387
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 388 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 438
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 439 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 496
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 497 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 554
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 555 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 613
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 614 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 673
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 674 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 727
>gi|82081311|sp|Q5ZIT8.1|TAF2_CHICK RecName: Full=Transcription initiation factor TFIID subunit 2;
AltName: Full=TBP-associated factor 150 kDa; AltName:
Full=Transcription initiation factor TFIID 150 kDa
subunit; Short=TAF(II)150; Short=TAFII-150;
Short=TAFII150
gi|53134697|emb|CAG32355.1| hypothetical protein RCJMB04_23j21 [Gallus gallus]
Length = 1168
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 136/478 (28%), Positives = 230/478 (48%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 449 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 508
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T +P G
Sbjct: 509 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 546
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 547 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 597
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +A
Sbjct: 598 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQSDFMWQYQLRYERDVVAQEEA 655
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 656 ILALEKFPTPASRL--ALTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 713
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F +F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 714 -TRMFCCKTCPNIVKTNNFMNFQSYFLQKTMPVAMALLRDVHNLCPKEVLMFILDLIKYN 772
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 773 DNRKNKFSDNYYRAELIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 832
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC++ + L+ +G + D L K + ++ VR+ A A++D
Sbjct: 833 LPSYRHTITVSCLKAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDVRIAALEAVVD 886
>gi|417406103|gb|JAA49727.1| Putative tata binding protein associated factor [Desmodus rotundus]
Length = 1199
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 228/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYTSL------------------G 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I+ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ISALEKFPTPASRLA--LTDILEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|281347100|gb|EFB22684.1| hypothetical protein PANDA_019775 [Ailuropoda melanoleuca]
Length = 1100
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 360 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 419
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 420 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 457
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 458 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 508
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 509 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 566
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 567 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 624
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 625 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 683
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 684 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 743
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 744 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 797
>gi|301788308|ref|XP_002929574.1| PREDICTED: transcription initiation factor TFIID subunit 2-like,
partial [Ailuropoda melanoleuca]
Length = 1099
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 359 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 418
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 419 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 456
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 457 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 507
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 508 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 565
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 566 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 623
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 624 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 682
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 683 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 742
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 743 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 796
>gi|350582943|ref|XP_001924394.4| PREDICTED: transcription initiation factor TFIID subunit 2 [Sus
scrofa]
Length = 1105
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 365 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 424
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 425 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 462
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 463 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 513
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 514 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 571
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 572 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 629
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 630 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 688
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 689 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 748
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 749 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFKSYAEYGHFVDIRIAALEAVVD 802
>gi|363731074|ref|XP_003640904.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Gallus
gallus]
Length = 1200
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 136/478 (28%), Positives = 230/478 (48%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 449 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 508
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T +P G
Sbjct: 509 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 546
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 547 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 597
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +A
Sbjct: 598 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQSDFMWQYQLRYERDVVAQEEA 655
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 656 ILALEKFPTPASRL--ALTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 713
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F +F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 714 -TRMFCCKTCPNIVKTNNFMNFQSYFLQKTMPVAMALLRDVHNLCPKEVLMFILDLIKYN 772
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 773 DNRKNKFSDNYYRAELIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 832
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC++ + L+ +G + D L K + ++ VR+ A A++D
Sbjct: 833 LPSYRHTITVSCLKAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDVRIAALEAVVD 886
>gi|334326275|ref|XP_003340732.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
TFIID subunit 2-like [Monodelphis domestica]
Length = 1200
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQSDFMWQYQLRYERDVVAQEES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|224046665|ref|XP_002199309.1| PREDICTED: transcription initiation factor TFIID subunit 2
[Taeniopygia guttata]
Length = 1200
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 136/478 (28%), Positives = 230/478 (48%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 449 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 508
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T +P G
Sbjct: 509 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 546
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 547 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 597
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +A
Sbjct: 598 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQSDFMWQYQLRYERDVVAQEEA 655
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 656 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 713
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F +F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 714 -TRMFCCKTCPNIVKTNNFMNFQSYFLQKTMPVAMALLRDVHNLCPKEVLMFILDLIKYN 772
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 773 DNRKNKFSDNYYRAELIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 832
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC++ + L+ +G + D L K + ++ VR+ A A++D
Sbjct: 833 LPSYRHTITVSCLKAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDVRIAALEAVVD 886
>gi|410904835|ref|XP_003965897.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
TFIID subunit 2-like [Takifugu rubripes]
Length = 1206
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 132/478 (27%), Positives = 237/478 (49%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G P
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSQMWSQMLVSTHGFLKSISNVSGKDIGPL 507
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV F++N+++N++EL + +D T +P G
Sbjct: 508 IKQ----WVDQSAVVKFFGSFAFNRKRNVLELEIRQDYT------SP------------G 545
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H +L + ++ + +I CHSK + K
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-MLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFY 288
I+ALE P + + L + L + F++VR++A + L+ A+ + W G + +
Sbjct: 655 ISALEKFPTPASRL--ALTDILEQDQCFYKVRMQACFCLSKIANAMVSTWQGPPAMKSLF 712
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F +F YF+ + +P A+A++R N P++ + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFINFQSYFLQKTMPVAMALLRDVQNLCPKDVLNFILDLIKYN 771
Query: 347 DNNGNPYSDVFWLAALVQSVG-------ELEFGQQSILFLSS----LLKRIDRLLQFDRL 395
DN N +SD ++ A L++++ + +++ L+S +L+ I R L ++L
Sbjct: 772 DNRKNKFSDNYYRAELIRALTNSLTXAISISNEVRTVDNLNSDVRLILEEITRYLNMEKL 831
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + + L+ +G I D L K + ++ VRV A AL+D
Sbjct: 832 LPSYRNTITVSCLRAIRE--LQKNGHIPSD--ASLFKSYAEYGHFVDVRVAALEALVD 885
>gi|158294667|ref|XP_001688719.1| AGAP005731-PA [Anopheles gambiae str. PEST]
gi|157015672|gb|EDO63725.1| AGAP005731-PA [Anopheles gambiae str. PEST]
Length = 1272
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 136/511 (26%), Positives = 235/511 (45%), Gaps = 66/511 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ ++ A A+ + + + + H F+ + + +
Sbjct: 460 MLEHRIGQELLLQVFNKQLALASNAAGTK-IGSGLWGHMLISTNVFSKAIFTVTGKDMSV 518
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G M +N ++N +EL + +DC N+ G
Sbjct: 519 FIDQWVRTGGHAKFTMTSVFNSKRNTIELEIRQDCV-----------------NQRGVRK 561
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + I++ ELDG + H + Q+ I S + + K K C G
Sbjct: 562 YVGPLLIQLQELDGTFKHTL--------QIENIIVKSDITCHSKSRRNKKKKIPLC--TG 611
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ ESP+ WIR DPEM L ++ QP W QL + DV AQ +AI AL
Sbjct: 612 EEV-DMDLSPMAESPVLWIRLDPEMTLLRSVNIEQPDFQWQFQLRHERDVTAQLEAIKAL 670
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKS 290
E A P + +T+ + + F+ VR +AA L A+ T W G ++ +K
Sbjct: 671 EKYATPQTRLALTDTIES----ERVFYEVRCQAAICLTKVANAMVTSWQGPPAMLTIFK- 725
Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
+F + P R N+F +F YF+ + IP A+A +R A P E + F+L L KYND
Sbjct: 726 -KFFGSFSAPHIIRQNNFTNFQHYFLQKTIPVAMAGLRTAHGICPPEVIRFLLDLFKYND 784
Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILF----LSS---------LLKRIDRLLQFDR 394
N+ N YSD ++ AALV+++G S++ LSS +L+ + R+L ++
Sbjct: 785 NSKNHYSDNYYRAALVEALGNTITPVISVVHQGRALSSENLSQDAKLVLEEVTRILNLEK 844
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+PSY ++I+C++ + + L+ G + + K+ + + + VR+ A L+D
Sbjct: 845 HLPSYKYTVSIACLKVIRK--LQKCGHLPTNP--KIYRSYAAYGQYIDVRLAAMECLVDY 900
Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLG 485
+G L ++ +E +P + +LG
Sbjct: 901 -VKIDGRWEDLEHLMELLETDPDPMARHQLG 930
>gi|254584730|ref|XP_002497933.1| ZYRO0F16830p [Zygosaccharomyces rouxii]
gi|186929049|emb|CAQ43374.1| Transcription initiation factor TFIID subunit 2 [Zygosaccharomyces
rouxii]
gi|238940826|emb|CAR29000.1| ZYRO0F16830p [Zygosaccharomyces rouxii]
Length = 1478
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/445 (27%), Positives = 206/445 (46%), Gaps = 64/445 (14%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
+IL I +A G P +LS+ F++ +V + L+ FF +W+ G P+ R+
Sbjct: 628 RILPKIFLQAMSGDLPNNSLSSSHFQYVCERVN---KNKLESFFQQWIYGSGVPIFRVTQ 684
Query: 72 SYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWP---------GMMSIRV 122
+NK++ +VE+ + R C + S+ ++ + + P G M+IR+
Sbjct: 685 RFNKKRMMVEMGI-RQCQDQELSQDKIVGKEGFCSSALNHLQHPVRAMTPFFTGSMTIRI 743
Query: 123 HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKG------------------ 163
HE DG Y+H + D + L+IQ ++K R K G
Sbjct: 744 HEADGTPYEHIV--ELKDVFTKLDIQYNTKYRRLRNRKINVGRPSNKAASPDLEEDLEFQ 801
Query: 164 ----------------SKPDGCDD--NGDAVAGLDMRSSME--SPLSWIRADPEMEYLAE 203
S P C D DA+ D + WIR D + E++ +
Sbjct: 802 EEKIDNSAIQRLGTVLSSPKDCKDWDLTDAIVTTDGNEAQRQNEAFEWIRIDSDFEWICK 861
Query: 204 IHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLS----DSKAFWR 259
I+ NQP M+ +QL++D DV AQ ++I E + S N ++ L+ D + F+
Sbjct: 862 IYVNQPDYMFTSQLQQDSDVEAQVESIKYFEDVIIHSPNNSQIYSSILTRTAMDKRYFYG 921
Query: 260 VRIEAAYALANTASEETDWA----GLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
VR+EA ALA ++ D A G +HL+K +++ E +P NDF F EYF+ +
Sbjct: 922 VRLEACKALARYTMKDHDPAHFTGGAMHLIKIFQNLFCYEGSSIPLNNDFTSFQEYFLQK 981
Query: 316 AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG-QQ 374
IP +A+VR N+ P +FVL +L YN+N+ N Y D +L +L+++ + ++
Sbjct: 982 NIPKFLALVRNERNECPHFVKQFVLDILTYNENSENLYDDTHYLCSLIEAAVDCCISNKK 1041
Query: 375 SILFLSSLLKRIDRLLQFDRLMPSY 399
FL+ +L ++ R D MP+Y
Sbjct: 1042 DQEFLNKVLDQLHRYANLDDWMPTY 1066
>gi|118786940|ref|XP_315746.3| AGAP005731-PB [Anopheles gambiae str. PEST]
gi|116126564|gb|EAA10740.3| AGAP005731-PB [Anopheles gambiae str. PEST]
Length = 1233
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 136/511 (26%), Positives = 235/511 (45%), Gaps = 66/511 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ ++ A A+ + + + + H F+ + + +
Sbjct: 460 MLEHRIGQELLLQVFNKQLALASNAAGTK-IGSGLWGHMLISTNVFSKAIFTVTGKDMSV 518
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G M +N ++N +EL + +DC N+ G
Sbjct: 519 FIDQWVRTGGHAKFTMTSVFNSKRNTIELEIRQDCV-----------------NQRGVRK 561
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + I++ ELDG + H + Q+ I S + + K K C G
Sbjct: 562 YVGPLLIQLQELDGTFKHTL--------QIENIIVKSDITCHSKSRRNKKKKIPLC--TG 611
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ ESP+ WIR DPEM L ++ QP W QL + DV AQ +AI AL
Sbjct: 612 EEV-DMDLSPMAESPVLWIRLDPEMTLLRSVNIEQPDFQWQFQLRHERDVTAQLEAIKAL 670
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKS 290
E A P + +T+ + + F+ VR +AA L A+ T W G ++ +K
Sbjct: 671 EKYATPQTRLALTDTIES----ERVFYEVRCQAAICLTKVANAMVTSWQGPPAMLTIFK- 725
Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
+F + P R N+F +F YF+ + IP A+A +R A P E + F+L L KYND
Sbjct: 726 -KFFGSFSAPHIIRQNNFTNFQHYFLQKTIPVAMAGLRTAHGICPPEVIRFLLDLFKYND 784
Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILF----LSS---------LLKRIDRLLQFDR 394
N+ N YSD ++ AALV+++G S++ LSS +L+ + R+L ++
Sbjct: 785 NSKNHYSDNYYRAALVEALGNTITPVISVVHQGRALSSENLSQDAKLVLEEVTRILNLEK 844
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+PSY ++I+C++ + + L+ G + + K+ + + + VR+ A L+D
Sbjct: 845 HLPSYKYTVSIACLKVIRK--LQKCGHLPTNP--KIYRSYAAYGQYIDVRLAAMECLVDY 900
Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLG 485
+G L ++ +E +P + +LG
Sbjct: 901 -VKIDGRWEDLEHLMELLETDPDPMARHQLG 930
>gi|209954626|ref|NP_579853.1| TAF2 RNA polymerase II, TATA box binding protein-associated factor,
150 kD [Rattus norvegicus]
Length = 1200
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 133/478 (27%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW +L + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYELRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFVDIRIAALEAVVD 896
>gi|124486849|ref|NP_001074757.1| transcription initiation factor TFIID subunit 2 [Mus musculus]
gi|223462133|gb|AAI47639.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Mus musculus]
gi|223462641|gb|AAI51080.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Mus musculus]
Length = 1198
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 133/478 (27%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW +L + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYELRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFVDIRIAALEAVVD 896
>gi|389751631|gb|EIM92704.1| hypothetical protein STEHIDRAFT_89926 [Stereum hirsutum FP-91666
SS1]
Length = 1835
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 141/488 (28%), Positives = 219/488 (44%), Gaps = 85/488 (17%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDC-T 89
LST F KV ++ L+ F +W+ GCP S+N++K VE+ + +D
Sbjct: 518 LSTHTFLRTCRKVSGVD---LRSFAEQWIYGSGCPSFGFSASFNRKKMAVEITMRQDAPA 574
Query: 90 VKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC 148
K PV+ + + G M+IR+HE DG Y+H +L + ++ E+
Sbjct: 575 YKELENNPVVKALLKPVP-----FFEGQMTIRIHEADGTPYEH-VLDIRA-PFKRYEVPF 627
Query: 149 HSKLA-ARRALK---------------PKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWI 192
++K RR K + ++ G D G GLD+ + +W
Sbjct: 628 NTKYKRVRRNTKRYLARQAAAQAAAEGDAEAAEAMGMVDMG---FGLDIWENERERENWK 684
Query: 193 RAD--------------------PEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAA 232
AD + E++A + F+QP MW++QL++D DVVAQ +A+ A
Sbjct: 685 VADWSEDDEQVMSGATYEWIRMDADFEWIAAVAFDQPDYMWVSQLQRDRDVVAQLEAVNA 744
Query: 233 LEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRR 292
L +P S V + L + S F+RVR EAA AL A D+ GL HL K +
Sbjct: 745 LSKMP--SAIVSSQLTKTVLVSNYFFRVRCEAALALVTCAIRRLDFLGLFHLFKLFLRYC 802
Query: 293 FD---------ENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
+D + +PRPNDF DF++YFV A+ A++ VR + K P +F+ L
Sbjct: 803 YDPEDPQQDLFTHTYVPRPNDFSDFADYFVRRALIKAISQVRFENGKCPSVVRKFLTDQL 862
Query: 344 KYNDNNGNPYSDVFWLAALVQSV------------GEL-------EFGQQSILFLSSLLK 384
KYNDN NPY+D F++ L+ S+ GEL E + + L L
Sbjct: 863 KYNDNTANPYTDAFYICTLISSLACACVSTSPPERGELFPMEAKNEQSSEDAMILKDALA 922
Query: 385 RIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVR 444
DR DRL+ S + ++TI+ + ++ L ++ I D ++ P QVR
Sbjct: 923 EADRYRNMDRLIASPHNVVTIAALE--FRLILSVANLIPNDP--RVFFPLTREGNYVQVR 978
Query: 445 VEASRALL 452
+ A AL
Sbjct: 979 IAAFDALF 986
>gi|169603133|ref|XP_001794988.1| hypothetical protein SNOG_04573 [Phaeosphaeria nodorum SN15]
gi|111067213|gb|EAT88333.1| hypothetical protein SNOG_04573 [Phaeosphaeria nodorum SN15]
Length = 1480
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 146/564 (25%), Positives = 246/564 (43%), Gaps = 91/564 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L KQ G N + L ++ A QG ++T++F KVG+ + L FF +WV
Sbjct: 506 LVKQSGKNGVDRCLYRLLFNARQGKYDNGAITTEDFLDICEKVGHQK---LDSFFSQWVV 562
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDS-----ENRDGDIG-- 113
GCP S+NK+K +V+L + + ++ DS + + + E ++ G
Sbjct: 563 GSGCPNFECFPSFNKKKQVVQLQIRQ---IQADSTVTTIENALSADDFMREAKESAHGFL 619
Query: 114 -------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
+ G M+IR+HE DG Y+H + + + + K R L ++ +
Sbjct: 620 HGNEFPTFVGPMTIRIHEADGTPYEHIVDINSSNVKVEIPYNTKYKRLKRNRLNKERQAA 679
Query: 166 PDGCDDNGDAVAGLDM-------------------------RSSMES-PLSWIRADPEME 199
G D +G+A + + ME WIR D + E
Sbjct: 680 AAGLDASGEAQEDVTFYMLGDVFGTAKEQEEWRLSDWPAEDEAKMEGEAYEWIRIDADFE 739
Query: 200 YLAEIHFNQ-PVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSK 255
++ H N P M+++QL++D DVVAQ ++I A+ E +S +V T L D++
Sbjct: 740 WICRAHINDMPSYMYVSQLQQDKDVVAQYESIQFLASKEGHRLISTFLVKT----LMDNR 795
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
F +R AA +A +A + T+W GL HL K ++ + R NDF D S Y +
Sbjct: 796 YFHGIRTYAADVMAQSALKRTEWTGLYHLKKAFRELFCIPGSPMTRSNDFSDRSLYLIQC 855
Query: 316 AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL------ 369
AIP A++ ++ D ++P EA F+L +++YNDN GN YSD ++++ L+ + +
Sbjct: 856 AIPKAISKIKGEDGRAPLEAKRFLLDIIRYNDNRGNEYSDDYYISTLMHCLADCLTANKS 915
Query: 370 --------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLTQ---IA 415
E + F ++ + R + D +P+Y + T + C R L
Sbjct: 916 GDMELFDAETERDEFEFTKKAIEELTRHQRLDEWIPTYQNVYTTTALDCARKLIMNRVTP 975
Query: 416 LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEE 475
K S F+ Q+ VR++A L+ L D+ L + + +
Sbjct: 976 FKPSEFLQYAQL----------GNADNVRLKAWDCLVRLGMLRK--DAVLKFLMHDIRSD 1023
Query: 476 PS--LRGQVKLGIHAMRICQIKGG 497
PS R Q+ L + + I QI G
Sbjct: 1024 PSPYFRKQL-LRVFSFAIGQIATG 1046
>gi|402879027|ref|XP_003903157.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
TFIID subunit 2 [Papio anubis]
Length = 1173
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 133/478 (27%), Positives = 227/478 (47%), Gaps = 66/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + + + + DQ+ L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIRVLRKR---HVPXDQL--LFKSYAEYGHFVDIRIAALEAVVD 895
>gi|344272789|ref|XP_003408212.1| PREDICTED: transcription initiation factor TFIID subunit 2
[Loxodonta africana]
Length = 1202
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 133/478 (27%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILDQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFLSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + + +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTVDNLSPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896
>gi|116256085|sp|Q8C176.2|TAF2_MOUSE RecName: Full=Transcription initiation factor TFIID subunit 2;
AltName: Full=TBP-associated factor 150 kDa; AltName:
Full=Transcription initiation factor TFIID 150 kDa
subunit; Short=TAF(II)150; Short=TAFII-150;
Short=TAFII150
Length = 1104
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 136/479 (28%), Positives = 231/479 (48%), Gaps = 67/479 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 449 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 508
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ W+ G F++N+++N++EL + +D T +P G
Sbjct: 509 IKQ----WLDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 546
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 547 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 597
Query: 171 DNGDAVAGLDMRSSME--SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
NG+ V +D+ S+ME SPL WIR DP+M L ++ F Q MW +L + DVVAQ +
Sbjct: 598 -NGEEV-DMDL-SAMEADSPLLWIRIDPDMSVLRKVEFEQADFMWQYELRYERDVVAQQE 654
Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKF 287
+I ALE P + + L + L + F+RVR+ A + LA A S + W G +
Sbjct: 655 SILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSL 712
Query: 288 YKSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKY 345
+ +R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KY
Sbjct: 713 F-TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKY 771
Query: 346 NDNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDR 394
NDN N +SD ++ A AL SV + L +L L+ I R L ++
Sbjct: 772 NDNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEK 831
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
L+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 832 LLPSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFVDIRIAALEAVVD 886
>gi|51467894|ref|NP_001003835.1| transcription initiation factor TFIID subunit 2 [Danio rerio]
gi|49619029|gb|AAT68099.1| TATA box binding protein (TBP)-associated factor 150kDa [Danio
rerio]
Length = 1191
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 137/478 (28%), Positives = 223/478 (46%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G P
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLVSTSGFLKSISNVSGKDIGPL 507
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 508 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 545
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + L + L + F++VR+ A + LA A S + W G + +
Sbjct: 655 ILALEKFP--TPPSRRALTDILEQDQCFYKVRMHACFCLAKIANSMVSTWTGPPAMKSLF 712
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + IP A+A +R N PRE + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFISFQSYFLQKTIPVAMAQLRDVQNLCPREVLSFILDLIKYN 771
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N SD ++ A AL S+ + + SL L+ I R L ++L
Sbjct: 772 DNRKNKLSDNYYRAELIDALTNSLTPAISINNEVRTVDSLNADVRLILEEITRFLNMEKL 831
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G I D L K + ++ VR+ A A++D
Sbjct: 832 LPSYRNTITVSCLRAIRM--LQKNGHIPSDPT--LFKSYAEYGHFVDVRIAALEAVID 885
>gi|145353402|ref|XP_001421003.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|145357369|ref|XP_001422892.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581239|gb|ABO99296.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144583136|gb|ABP01251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1112
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 129/442 (29%), Positives = 215/442 (48%), Gaps = 54/442 (12%)
Query: 1 MLEKQMGSNFFRKILQNII------SRAQG-----------ASPVRTLSTKEFRHFANKV 43
MLEK++G + +K+L+ ++ +G +S R + T +
Sbjct: 403 MLEKRLGPDVMQKVLKYFAGLHVRRNKKEGGTRAGPSAEVLSSNARWIHTLQLFDHCRAT 462
Query: 44 GNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT 103
NL + + F RWV G P L +G+ +RKN++E AV + +V + +
Sbjct: 463 VNLGKGEVNSFLERWVYGAGTPKLCVGYVVKRRKNVMEFAVKLEGSVAAAAADRAALAVA 522
Query: 104 DSENRDGDIGWPGMMSIRVHELD--GMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK 161
+ +++R+ E + DH ++ + AWQL+EI K+ RR PK
Sbjct: 523 RNHRTS--------VTVRMREENRPDANDH-VVSLGHSAWQLMEIPLQPKIKDRR---PK 570
Query: 162 KGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPV----QMWINQL 217
+ GD ++ ++M+ P+ ++R DPE E++ I + M L
Sbjct: 571 TI-----IESGGDP----ELIAAMDCPVRYVRVDPEFEWMGNIEQSAKQVGLESMMAQML 621
Query: 218 EKDGDVVAQAQAIAAL-EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET 276
EK+ D+VAQ A+ L + + S + V L+ L+ F RVR EAA AL +ASE+T
Sbjct: 622 EKEKDIVAQTIAVEFLGRRVANGSVSAVLVLDKCLNSEDTFCRVRAEAALALGKSASEKT 681
Query: 277 DWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNK-SPREA 335
W GL L++ YK + D N G P+ NDFRD ++ V EA+ A+ V DN + ++A
Sbjct: 682 QWGGLNALIRHYKKFQCDANTGKPKQNDFRDLAKVIVDEAVITALGYV--MDNGITHQDA 739
Query: 336 VEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL--EFGQQSILFLSSLLKRIDRLLQFD 393
+E ++ LKYNDN GNP SD ++A + ++G G+Q L ++++ I ++ D
Sbjct: 740 LEAIVARLKYNDNEGNPQSDDGFVATCIAALGRCVPADGKQ----LQTVMQTIHYYIRRD 795
Query: 394 RLMPSYNGILTISCIRTLTQIA 415
PS + +T + IR L +A
Sbjct: 796 DRFPSDDLCVTCAGIRALGLLA 817
>gi|198435612|ref|XP_002126792.1| PREDICTED: similar to TAF2 RNA polymerase II, TATA box binding
protein (TBP)-associated factor [Ciona intestinalis]
Length = 1154
Score = 159 bits (403), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 126/477 (26%), Positives = 234/477 (49%), Gaps = 62/477 (12%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPF------LKEFF 55
+E ++G + ++S A ++ + +S+ + G L+ + +
Sbjct: 444 IENRIGPQLLLQAFNKLLSLANTSAQQKVISSTWSTMLLSTAGFLKSTYSVSGKDISNLI 503
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWP 115
+WV G F +N+++N VEL +KPD +++ T +
Sbjct: 504 DQWVHKGGIVRFNSKFVFNRKRNTVEL------DIKPD----IINKGTQK--------YV 545
Query: 116 GMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDA 175
G +++RV ELDG ++H I + ++ CHSK +RR + K NG+
Sbjct: 546 GPLTVRVQELDGSFNHTI--QIEENTTKHDLPCHSK--SRRNKRKKIPLL------NGEE 595
Query: 176 VAGLDMRSSM---ESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAA 232
V DM S +SP+ WIR DP++ ++F QP MW QL + DV+ Q++AI A
Sbjct: 596 V---DMDLSFMDPDSPVLWIRIDPDVNIPRLVNFEQPDYMWQYQLRYERDVIGQSEAIGA 652
Query: 233 LEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYKSR 291
LEA + N L + + + +++VR++A +LANTA+ + W G ++ + +R
Sbjct: 653 LEAFSTPA--TRNALTDIIENEHYYYKVRMQACKSLANTANAMVNTWTGPPAMLNLF-TR 709
Query: 292 RF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
F D + + R N+F +F +YF+ + +P A++M+R P+E + ++ L+K+NDN+
Sbjct: 710 FFCCDTSPPIVRMNNFSNFQQYFLQKQLPVAMSMLRNTHGVCPKEILHYLFGLIKFNDNS 769
Query: 350 GNPYSDVFWLAALVQSVGE-------LEFGQQSI-----LFLSSLLKRIDRLLQFDRLMP 397
N YSD ++ +ALV ++ L+ Q+ + + +++ + R L ++ +P
Sbjct: 770 KNKYSDNYYRSALVDALNNTITPAVTLQSDQKVTADSMPMDIHTIVDEVTRFLNLEKHLP 829
Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
SY +TISC+R + L+ +G + D ++L K + VR+ A L+D+
Sbjct: 830 SYRYTITISCLRAIRN--LQRNGHLPSD--IELFKSYAAPGFFRDVRLAAVGCLVDV 882
>gi|342321629|gb|EGU13562.1| TATA-binding protein associated factor Taf2 [Rhodotorula glutinis
ATCC 204091]
Length = 1877
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 135/477 (28%), Positives = 228/477 (47%), Gaps = 71/477 (14%)
Query: 11 FRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMG 70
F KI + IS + S +LST F K+ LK FF +WV GCP +
Sbjct: 499 FPKIFLDAIS-GELTSQANSLSTTSFMRTCRKLCGGSSEALKIFFDQWVYGSGCPTFEVT 557
Query: 71 FSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-WPGMMSIRVHELDGM- 128
++N++K VEL V + C S+ + +++ I + G M+IR+HE DG
Sbjct: 558 ANFNRKKMAVELNVTQRCYAYAWSQ----KAPWEAQGHLRPIPLFDGQMTIRIHEADGTP 613
Query: 129 YDHPILPMAGDAWQLLEIQCHSKLAARR----------------ALKPKKGSKPDGCDDN 172
Y+H + D+++ E+ ++K R AL + + G D
Sbjct: 614 YEHVL--TIQDSFKRHEVPFNTKYKRVRRNTKRYQARQAAATAAALGDIEAQEDMGLIDL 671
Query: 173 GDAVAGLDMRSSME-----------------SPLSWIRADPEMEYLAEIHFNQPVQMWIN 215
G ++ + E + WIR D ++E++ + F QP MWI+
Sbjct: 672 GFSLGMWEDEKERERWRVADWTEEDDLVMSQATYEWIRIDADIEWICIVKFTQPDFMWIS 731
Query: 216 QLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
QL++D DVVAQ +AI AL ++ N L S F+R+R+EAA AL + A+ E
Sbjct: 732 QLQRDRDVVAQLEAIHALSSI-PSPIVSSNMCKTVLV-SNYFFRIRMEAALALISCATLE 789
Query: 276 TDWAGLLHLVKFYKSRR-FDENI---------GLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
+ GL HL K +++ F+ ++ +P+ NDF DF+ YF+ +A+ A++M+R
Sbjct: 790 QQYLGLFHLFKTFQTWYCFEPDVETKDPFGFRCIPKTNDFSDFTLYFMKKALVTAISMIR 849
Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---------ELEFGQQSI 376
++P +F++ LL YNDN GN YSD F++ +++ ++ +L + I
Sbjct: 850 DERGQTPPVIQQFLVDLLAYNDNLGNKYSDAFYITSVMNALSHTLVNVVPRDLNGDTEQI 909
Query: 377 LFLSS------LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQV 427
+ S ++ ++R L DRL+PSY+ +TI+ I ++ L L+ I D++
Sbjct: 910 TEVDSNENLAPAVQEVERYLASDRLVPSYHNAVTIAGIE--FKLKLTLASLIPEDRM 964
>gi|329744651|ref|NP_001178070.1| transcription initiation factor TFIID subunit 2 [Bos taurus]
Length = 1199
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ Q MW QL + DVVAQ +A
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMAVLRKVDSEQADFMWQYQLRYERDVVAQQEA 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I+ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ISALERFPTPASRLA--LTDVLEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFKSYAEYGHFVDIRIAALEAVVD 896
>gi|398397863|ref|XP_003852389.1| hypothetical protein MYCGRDRAFT_42345, partial [Zymoseptoria tritici
IPO323]
gi|339472270|gb|EGP87365.1| hypothetical protein MYCGRDRAFT_42345 [Zymoseptoria tritici IPO323]
Length = 1180
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 136/513 (26%), Positives = 246/513 (47%), Gaps = 73/513 (14%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVR-TLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K GS +I+ I A+ S V LST++F+ ++G+ + L+ FF +WVG
Sbjct: 517 LIKSSGSTGVTRIINKIFLNAKTGSLVNGELSTQDFQRTCERLGHNK---LESFFRQWVG 573
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD---------GD 111
GCP+ + +NK+K +VE+ +++ ++ ++ P SN E ++
Sbjct: 574 GAGCPIFYVTQRFNKKKLVVEMTIVQR-QLERKTKPPFEPSNFMREIKEHVNEVYMAGTQ 632
Query: 112 IGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAA-RRALKPKKGSKPDGC 169
+ G M+IR+HE DG Y+H + + LEI ++K +R+ + K+ + +G
Sbjct: 633 NVFTGPMTIRIHEADGTPYEHIV--EIKEPITKLEIPYNTKYKRLKRSRRQKERAAAEG- 689
Query: 170 DDNGDAVAGL-------------------------DMRSSMESPLSWIRADPEMEYLAEI 204
+ G+ L D + WIR D + E++ +I
Sbjct: 690 NPAGEGEEALLYCLGDILDTPEDIQEWELRDWTAEDEDKMGQESYEWIRMDADFEWIGKI 749
Query: 205 HFNQPVQMWINQLEKDGDVVAQAQAI-AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIE 263
H P+ M+++QL++D D+VAQ +++ L + PH ++ L L D + F+ +R+
Sbjct: 750 HLVLPLYMYVSQLQQDRDLVAQHESMRYILGSNPH--HTSLSILLRTLMDHRYFYGIRVM 807
Query: 264 AAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAM 323
AA LA A ++ G HL K ++ DE+ +PRPNDF + + + AIP A+A
Sbjct: 808 AAEGLAVLAKDDFTRIGQKHLTKAFEHLFCDEDSVMPRPNDFSNRVTFIMQCAIPQAMAR 867
Query: 324 VRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAAL--------------VQSVGEL 369
+R AD K PR +F + LK+NDN+ N +SD +++ L V+ V +
Sbjct: 868 LRDADGKVPRAVQQFFVDKLKFNDNSDNEFSDCHYVSTLMTCLTDSLVASHREVKPVYQF 927
Query: 370 EFGQQSIL--------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGF 421
+FG+ L F + + I+R + D + +Y + + + I L ++ +
Sbjct: 928 DFGEDEPLEPENPDAEFEALAVGVIERYRRIDEWVLTYQNVYSTTAIECLQKLT---NAG 984
Query: 422 ISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
I D+ +L++ R +T VR++A R L+++
Sbjct: 985 IVKDKSKELLQYTRP-STAPNVRLQAFRCLIEM 1016
>gi|432908138|ref|XP_004077772.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Oryzias latipes]
Length = 1227
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 132/478 (27%), Positives = 228/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + LST F + +N G P
Sbjct: 463 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLLSTYGFLKSISNVSGKDIGPL 522
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 523 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 560
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 561 TQKYVGPIKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 611
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +A
Sbjct: 612 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 669
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I+ALE P + + L + L + F++VR+ A + LA A S + W G + +
Sbjct: 670 ISALEKFPTPASRLA--LTDILEQEQCFYKVRMLACFCLAKIANSMVSTWQGPPAMKSLF 727
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P++ + F+L L+KYN
Sbjct: 728 -TRMFCCKSCPNIVKTNNFISFQSYFLQKTMPVAMALLRDVQNLCPKDVLNFILDLIKYN 786
Query: 347 DNNGNPYSDVFWLAALVQSV-----------GELEFGQQSILFLSSLLKRIDRLLQFDRL 395
DN N +SD ++ A L++++ E+ + +L+ I R L ++L
Sbjct: 787 DNRKNKFSDNYYRAELIEALTNSLTPAISINNEVRTVDNLNPDVRLILEEITRFLNMEKL 846
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PS+ +T+SC+R + Q L+ +G I D L + ++ VR+ A AL+D
Sbjct: 847 LPSFRNTITVSCLRAIRQ--LQRNGHIPSDP--SLFTSYAEYGHFVDVRIAALEALVD 900
>gi|296480455|tpg|DAA22570.1| TPA: TAF2 RNA polymerase II, TATA box binding protein
(TBP)-associated factor, 150kDa [Bos taurus]
Length = 1203
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ Q MW QL + DVVAQ +A
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMAVLRKVDSEQADFMWQYQLRYERDVVAQQEA 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I+ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ISALERFPTPASRLA--LTDVLEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFKSYAEYGHFVDIRIAALEAVVD 896
>gi|440894490|gb|ELR46931.1| Transcription initiation factor TFIID subunit 2, partial [Bos
grunniens mutus]
Length = 1197
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 457 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 516
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 517 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 554
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 555 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 605
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ Q MW QL + DVVAQ +A
Sbjct: 606 -NGEEV-DMDLSAMDADSPLLWIRIDPDMAVLRKVDSEQADFMWQYQLRYERDVVAQQEA 663
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I+ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 664 ISALERFPTPASRLA--LTDVLEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 721
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 722 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 780
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 781 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 840
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 841 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFKSYAEYGHFVDIRIAALEAVVD 894
>gi|295442960|ref|NP_593331.2| TATA-binding protein associated factor Taf2 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|259016379|sp|P87121.3|TAF2_SCHPO RecName: Full=Transcription initiation factor TFIID subunit 2;
AltName: Full=TBP-associated factor 2
gi|254745512|emb|CAB08750.3| TATA-binding protein associated factor Taf2 (predicted)
[Schizosaccharomyces pombe]
Length = 1172
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 122/432 (28%), Positives = 202/432 (46%), Gaps = 54/432 (12%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST F K ++ L F +W+ G P+ R+ +N++K I+E+ +
Sbjct: 511 LSTSHFLKTCEKASHMR---LDVFAQQWIYGYGYPIFRVVQRFNRKKMIIEMGI-DQVQT 566
Query: 91 KPDSRTPVLSSNTDSE------NRDGDIGWP---GMMSIRVHELDGM-YDHPILPMAGDA 140
K R P+ N S+ N G P G M+IR+HE DG Y+H + D+
Sbjct: 567 KEAPRAPMSDKNFLSDAIRHLNNESIPTGLPVFSGPMTIRIHEADGTPYEHVV--ELKDS 624
Query: 141 WQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGD----AVAGL-DMRSSMES-------- 187
+ L+IQ ++K + K K DG D NGD + L D+ S E
Sbjct: 625 FTKLDIQYNTKYKRISRNRSTKNVKRDG-DHNGDDSDYVIRSLGDVLQSDEDIERWHLYD 683
Query: 188 ------------PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEA 235
WIR D + E++ ++ QP M+++QL++D DVVAQ + I +
Sbjct: 684 YTKEEEDTMATEAFEWIRVDADFEWICDLRVRQPEHMYVSQLQQDRDVVAQLETIRHFTS 743
Query: 236 LPH-LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD 294
+S V L L D++ ++ +R EAA ALA A E DW G HL Y +
Sbjct: 744 ESFTVSQQVSTVLLRTLLDNRYYYGIRQEAARALARCAIPELDWVGYYHLRMAYLEKFCF 803
Query: 295 ENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYS 354
++ +P+ NDF + +EY+V A+ + +R +P + +L LL YNDN N +S
Sbjct: 804 KDSTIPKSNDFSNITEYYVKCAMLESFPNIRDRKGVTPSSVKKLLLDLLCYNDNANNEFS 863
Query: 355 DVFWLAALVQSV--GELEFGQ---------QSILFLSSLLKRIDRLLQFDRLMPSYNGIL 403
D +++ L+ S+ G + G+ + + F++ ++ IDR ++ D MPS+ I+
Sbjct: 864 DAYFICLLIDSLVEGLIPRGETVQYAFSDPEHMEFVNQVINEIDRYMRIDACMPSFKNII 923
Query: 404 TISCIRTLTQIA 415
T ++ ++A
Sbjct: 924 TCKSLKAKLRLA 935
>gi|149410593|ref|XP_001509779.1| PREDICTED: transcription initiation factor TFIID subunit 2, partial
[Ornithorhynchus anatinus]
Length = 1099
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 133/478 (27%), Positives = 226/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 359 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 418
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV F++N+++N++EL + +D T G
Sbjct: 419 IKQ----WVDQSSVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 456
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 457 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 507
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 508 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQEES 565
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 566 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 623
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 624 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 682
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 683 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 742
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 743 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFKSYAEYGHFVDIRLAALEAVVD 796
>gi|391343753|ref|XP_003746170.1| PREDICTED: transcription initiation factor TFIID subunit 2
[Metaseiulus occidentalis]
Length = 1191
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 133/501 (26%), Positives = 218/501 (43%), Gaps = 65/501 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPV------RTLSTKEFRHFANKVGNLERPFLKEF 54
MLE ++G ++ ++S A A+ +ST F+ V E + F
Sbjct: 444 MLENRLGKELLLQVFNKLLSLAGAATKGIGGLENMLVSTNSFQKAVFLVTGKE---INTF 500
Query: 55 FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW 114
WV G F +N+++N VEL + R P + +N G +
Sbjct: 501 IDHWVRQGGHAKFVGSFVFNRKRNTVELEI----------RQPDVGTN-------GIKKY 543
Query: 115 PGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGD 174
G + + + ELDG + H + D +I CHSK + K + G+
Sbjct: 544 NGPLLVTIQELDGTFKHML--QIEDIVSKHDITCHSKSRRNKKKKIPLCT--------GE 593
Query: 175 AVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
V +D+ + +SP+ WIR DPEM L +H NQ W QL + D+ AQ QA+ L
Sbjct: 594 EV-DMDLSAMDADSPVLWIRIDPEMLVLRTVHINQQDFHWHYQLRYERDITAQLQALREL 652
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF 293
E P S + L + + + + F+RVR EA Y L A+ WAG ++ +K +
Sbjct: 653 E--PMCSQDTRKALTDTIENDQCFYRVRCEATYCLKQVANALQTWAGPPAMITIFK--KL 708
Query: 294 DENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNG 350
+ P + N+F +F YF+ +AIP A+A +R P E + F+L L KYN+N+
Sbjct: 709 YGSFASPNMVKQNNFSNFQHYFLQQAIPKAMAGLRTVHRICPPEVLSFLLDLFKYNENSK 768
Query: 351 NPYSDVFWLAALVQSVGELEFGQQSILF---------------LSSLLKRIDRLLQFDRL 395
N YSD ++ AL+ ++ E S++ S+L+ I R L ++L
Sbjct: 769 NRYSDCYYRRALIDALSETVTPVVSVMLNQSNSKVMVDSMSEETKSILEEITRCLNLEKL 828
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
+P Y + T+ C+ + KL + +L + + VR+ A L+D
Sbjct: 829 LPGYKLVCTVGCLHAIR----KLQKMGHIPNSAELFRQYASHQQFEDVRLAALEMLVDFT 884
Query: 456 FHCNGIDSALSLFIKSVEEEP 476
+G L+ + VE +P
Sbjct: 885 -STDGSVEDLNHLLDVVESDP 904
>gi|345566660|gb|EGX49602.1| hypothetical protein AOL_s00078g91 [Arthrobotrys oligospora ATCC
24927]
Length = 1221
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 136/491 (27%), Positives = 227/491 (46%), Gaps = 90/491 (18%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV--LRDC 88
LST +F KVG+ + L FF +WV G P + +NK+K IVE+ + +D
Sbjct: 477 LSTAQFIKQCEKVGHQK---LDSFFNQWVFRSGYPRFEISQRFNKKKMIVEMIIRQTQDK 533
Query: 89 TVKPDSRTPVLSSNTDSENR--DG--DIGWP----GMMSIRVHELDGM-YDHPILPMAGD 139
V TP S D+ NR DG D P G M+IR+HE DG Y+H +
Sbjct: 534 EVAQKGLTP-QSFLRDALNRQTDGVKDASPPQVFSGPMTIRIHEADGTPYEHVV------ 586
Query: 140 AWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMES------------ 187
Q+ E+ +SKL K K+ + + A AG+D+ + ++
Sbjct: 587 --QIHEV--YSKLEIPYNTKYKRLKRTRRQKERLAAFAGMDINAETQAEDVLLYCLGDVL 642
Query: 188 -------------------------PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGD 222
WIR D + E++ + NQP M+++QL++D D
Sbjct: 643 QSDEDTADWRLTDFTKEEEDKMAQEAFEWIRVDADFEWVCTMRLNQPDHMYLSQLQQDRD 702
Query: 223 VVAQAQAIAAL-EALP-HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAG 280
V AQ +++ EA P HL + L + D + ++ +R EAA AL+ A+ E W G
Sbjct: 703 VAAQYESVLHFKEAKPSHL---ISTILVRTVMDRRYYYGIRCEAAKALSRCATAELSWIG 759
Query: 281 LLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVL 340
HL K ++ + +PR NDF DF+ YFV + I +R KSP ++L
Sbjct: 760 EFHLQKAFQEFYCFPDSLIPRANDFSDFTSYFVQQNIITGFGGIRDGMGKSPLSVQRYLL 819
Query: 341 QLLKYNDNNGNPYSDVFWLAALVQSVG---ELEFGQQSILFL--------------SSLL 383
+++YNDN+ N +SD F LA L++S+G + GQ+ ++ + ++
Sbjct: 820 DIIRYNDNSNNSFSDSFHLANLLRSLGSCLQAGSGQRGMMDMDVDETDTEEREKSKEEII 879
Query: 384 KRIDRLLQFDRLMPSYNGILTISCIRTLTQ-IALKLSGFISLDQVVKLIKPFRDFNTIWQ 442
+ I+RL + D+ M SY+ +++ +C+ + I K+ + L+ + + T Q
Sbjct: 880 REIERLQRMDQWMSSYHNVISTTCLDVKNRLIKAKVQEYTPLEFLA-----YSKEGTSEQ 934
Query: 443 VRVEASRALLD 453
+R++A L++
Sbjct: 935 LRIKAINCLVE 945
>gi|426236245|ref|XP_004012081.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Ovis
aries]
Length = 1178
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 225/478 (47%), Gaps = 65/478 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L Q MW QL + DVVAQ +A
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMAVLRRADSEQADFMWQYQLRYERDVVAQQEA 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
IAALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 IAALERFPTPASRLA--LTDVLEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
DN N +SD ++ A AL SV + L +L L+ I R L ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+PSY +T+SC+R + L+ +G + D L + + ++ +R+ A A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFRSYAEYGHFVDIRIAALEAVVD 896
>gi|340712189|ref|XP_003394646.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Bombus terrestris]
Length = 1198
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 132/505 (26%), Positives = 228/505 (45%), Gaps = 73/505 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H F + + +
Sbjct: 457 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 515
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + F +N+++N VEL + +D T N+ G
Sbjct: 516 FIDQWVRTGGHAKFSLSFVFNRKRNTVELEIRQDTT-----------------NQRGIRK 558
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + + + ELDG + H L + G + +I CHSK + K + G
Sbjct: 559 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 608
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L + QP W QL + DV AQ +AI AL
Sbjct: 609 EEV-DMDLSAMDDSPVLWIRLDPEMTILRAVQIEQPDYQWQYQLRHERDVTAQLEAIEAL 667
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
+ A P + +T+ N +++VR+ AA+ L A+ WAG ++ ++
Sbjct: 668 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 723
Query: 291 -------RRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
RR + + N+F +F YF+ + IP A+A +R A P E + F++ L
Sbjct: 724 LFGSASCRR------IIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLF 777
Query: 344 KYNDNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQ 391
KYNDN+ N YSD ++ AAL++++G + Q + + SL L+ + R L
Sbjct: 778 KYNDNSKNRYSDNYYRAALIEALGATVTPVISVQQGTAINAESLSVDTKAILEEVTRNLN 837
Query: 392 FDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
++L+P Y ++++CI+ + L F L L + + + VR+ A AL
Sbjct: 838 LEKLLPCYKYTISVACIKVIR----ILQKFGHLPSNPHLFRAYATYGQFIDVRIAALEAL 893
Query: 452 LDLEFHCNGIDSALSLFIKSVEEEP 476
+D +G L + +E +P
Sbjct: 894 VDFT-RVDGKWEDLEFLLDMIEMDP 917
>gi|365986418|ref|XP_003670041.1| hypothetical protein NDAI_0D04850 [Naumovozyma dairenensis CBS 421]
gi|343768810|emb|CCD24798.1| hypothetical protein NDAI_0D04850 [Naumovozyma dairenensis CBS 421]
Length = 1544
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 136/495 (27%), Positives = 236/495 (47%), Gaps = 85/495 (17%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A GA +L+++ F+H +V + L+ FF +WV G P+LR+
Sbjct: 679 RVLPKIFLQAMSGALSNNSLTSQHFQHVCERVN---KNKLEYFFQQWVYGSGVPILRVTQ 735
Query: 72 SYNKRKNIVELAVLRDCTVKPDSRTPVLSSN---------TDSENRDGDIGWPGMMSIRV 122
+NK+K +VE+ + R C K + V+ + D +R+ + G M+IR+
Sbjct: 736 RFNKKKMVVEMGI-RQCQDKELDQGKVIGEDGFFASALEHIDHPDRNRTSFFTGSMTIRI 794
Query: 123 HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKG--SKPDGC---------- 169
HE DG Y+H + D + +EIQ ++K R + G + P G
Sbjct: 795 HEADGTPYEHIV--ELKDIFTKIEIQYNTKYRRIRNKRLNVGGTNTPSGSGALSSAASRT 852
Query: 170 --------DDNGDAV-AGLDMRSSME---------------------------------S 187
++ DA+ + D+ S +E
Sbjct: 853 NLNSQEFLNEQMDAIPSNRDVDSKIEKLGNVLTTRENRERWNLTDFSKTTEGNEFQRQNE 912
Query: 188 PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL----PHLSFNV 243
W+R D + E++ +IH NQP M+ +QL++DGDV AQ ++I EA+ P S
Sbjct: 913 AFEWLRIDSDSEWICKIHINQPDYMFASQLQQDGDVEAQLESIRYYEAVILDSPVNSMIY 972
Query: 244 VNTLNNFLSDSKAFWRVRIEAAYALANTASEETD----WAGLLHLVKFYKSRRFDENIGL 299
+ L D + F+ +R+EA AL+ A ETD G HL+K ++S +N +
Sbjct: 973 SSILTRTAMDRRYFYGIRLEACRALSKYAMNETDPNNFMGGSRHLIKVFQSLFCLDNSNI 1032
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
P+ N+F DF YF+ + IP ++ VR N+SP +F+L LL YN+N+ N Y+D +++
Sbjct: 1033 PKNNNFSDFQLYFLQKEIPSFLSKVRNERNESPHFIKQFLLDLLTYNENSENNYNDCYYV 1092
Query: 360 AALVQSVGELEFG-QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
A L+ +V + ++ FL +L +I R ++ +P+Y I+T + I ++ L +
Sbjct: 1093 ATLLHAVVDCAIADREDETFLPKVLDQIRRYENLEKWLPTYQLIVTYTIIE--IKLKLHV 1150
Query: 419 SGFISL---DQVVKL 430
+G S D++++L
Sbjct: 1151 AGLYSFENNDEILQL 1165
>gi|196008193|ref|XP_002113962.1| hypothetical protein TRIADDRAFT_27954 [Trichoplax adhaerens]
gi|190582981|gb|EDV23052.1| hypothetical protein TRIADDRAFT_27954 [Trichoplax adhaerens]
Length = 890
Score = 156 bits (395), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 111/443 (25%), Positives = 210/443 (47%), Gaps = 62/443 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTK------EFRHFANKVGNLERPFLKEF 54
M+E ++G + +++ A + + S++ +HF + + +KEF
Sbjct: 343 MIEIRIGQESLLQAFNKLLTLANTGAKHHSHSSQWGNLLISTKHFLKTIHLVSGKDIKEF 402
Query: 55 FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW 114
+WV G P + FS+N+++N +EL + T+ +++ +
Sbjct: 403 INQWVCRSGLPHFKGSFSFNRKRNTLELEL------------------TEKQDKSQFGFY 444
Query: 115 PGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGD 174
G ++I V E+DG Y H + ++ +I CHSK + K NGD
Sbjct: 445 VGPLTICVQEVDGSYTHTL--QIEESTYHYDISCHSKSKKNKKKKIPLS--------NGD 494
Query: 175 AVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE 234
+ +SP+ W+R DP+M +L ++ F+QP MW Q++ + D +AQ A+ AL
Sbjct: 495 ETEINFSSADSDSPVLWVRIDPDMHWLRKVTFDQPDYMWQYQMKYERDSLAQYLAVKALA 554
Query: 235 ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWA--------GLLHLV- 285
++S + V L ++D ++R+R+ A ALA E WA L ++
Sbjct: 555 K--YVSADSVCCLTEIINDRHIYYRIRVAAIKALAKVTGE---WALQRCSIATSLFNIYR 609
Query: 286 KFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKY 345
K + SR +PR NDF YF+ +AIP A+ +R + PRE ++F++ L+KY
Sbjct: 610 KMFGSRSCQT---IPRYNDFSYMIGYFIQKAIPKALGSIRNSHGNCPREILDFIIDLIKY 666
Query: 346 NDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSS-----------LLKRIDRLLQFDR 394
NDN+ N YSD +++++L+ ++ +I +S +L+ + L +++
Sbjct: 667 NDNSQNEYSDNYYMSSLIDALTSAISPATTITGISESNYTLSEESKIILQEVTHALNYEK 726
Query: 395 LMPSYNGILTISCIRTLTQIALK 417
++ S+ +++SCI+ + + K
Sbjct: 727 ILASHRNCISVSCIKNIRMLQKK 749
>gi|194868201|ref|XP_001972244.1| GG13996 [Drosophila erecta]
gi|190654027|gb|EDV51270.1| GG13996 [Drosophila erecta]
Length = 1222
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 144/587 (24%), Positives = 261/587 (44%), Gaps = 76/587 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
MLE ++G ++ ++ A A+ + +ST F + + +
Sbjct: 463 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 519
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 520 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGI 562
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + +++ ELDG + H + Q+ S + + K K C
Sbjct: 563 RKYNGPLMVQLQELDGTFKHTL--------QIESTLVKSDITCHSKSRRNKKKKIPLC-- 612
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
G+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI
Sbjct: 613 TGEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 671
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
AL+ P + + L + + + F++VR EAA++L A++ W+G ++ +
Sbjct: 672 ALQKYPTNATRLA--LTDTIESERCFYKVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 727
Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
R+F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+
Sbjct: 728 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 787
Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
N+ N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++
Sbjct: 788 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 847
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+PSY ++++SC++ + KL F L + + + + ++ +R+ A L+D
Sbjct: 848 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 903
Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLN 514
+G L I +E +P + L + I +S +D LV L
Sbjct: 904 -VKVDGRSEDLEHLITLLETDPDPAARHALA--QLLIDSPPFTRESRSRLDKPNLVDRLW 960
Query: 515 LLESRIAFNNVFLRHHLFGILQILAGRAPTLYGVPRDKLLLLGDGET 561
+R+A++ LR + + LYG R L G+ ++
Sbjct: 961 FSINRLAYDTK-LRCDIVDLYY-------ALYGTKRPNCLQAGENQS 999
>gi|225561939|gb|EEH10219.1| TATA-binding protein associated factor Taf2 [Ajellomyces capsulatus
G186AR]
Length = 1247
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 142/545 (26%), Positives = 230/545 (42%), Gaps = 85/545 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L+T F+ ++G+ + L FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLNARMGDLPNGALATSFFQKTCERLGHTK---LDTFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + +P R +S+ + E R+
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLEMSTFMRDVKEEIRNVYA 627
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 628 GPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687
Query: 167 DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
G D N GD + + R ES WIR D + E
Sbjct: 688 SGIDPNAELQDDVLLYCLGDVLQSEEEIQTWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
++ ++ P M+++QL++D D+VAQ ++I AA P +S V T L D +
Sbjct: 747 WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRRY 802
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
F+ +R AA AL A +E DW GL HL ++ + R NDF D + Y +
Sbjct: 803 FYGIRAAAARALVKHAKDEVDWIGLYHLESAFQELFCLPGSPMTRSNDFSDRAAYILQLT 862
Query: 317 IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG---- 372
IP A+A VR K+P F+ LK+NDN+ N SD +++ AL++S+ E G
Sbjct: 863 IPQAIAQVRDNSGKTPMRVKRFLYDKLKFNDNSNNELSDCYYITALMKSICEALLGRRET 922
Query: 373 -----------------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLT 412
Q + IDR + D S+ I + I C R L
Sbjct: 923 EIDGVGIDFDVERELERQAEQQLDRDCIAEIDRYRRMDEWTTSFQNIYSRTAIECQRRLM 982
Query: 413 QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSV 472
Q + +D+ + P+ T +R++A L+D + N L+ FI +
Sbjct: 983 QANV-------IDRDIMQFLPYTRAGTYEMLRLDAFNILIDFDIFQNS--ELLTWFIYVM 1033
Query: 473 EEEPS 477
+PS
Sbjct: 1034 SADPS 1038
>gi|24662032|ref|NP_729571.1| TBP-associated factor 2, isoform A [Drosophila melanogaster]
gi|442631480|ref|NP_001261668.1| TBP-associated factor 2, isoform B [Drosophila melanogaster]
gi|27924002|sp|Q24325.2|TAF2_DROME RecName: Full=Transcription initiation factor TFIID subunit 2;
AltName: Full=Transcription initiation factor TFIID 150
kDa subunit; Short=TAF(II)150; Short=TAFII-150;
Short=TAFII150
gi|17944283|gb|AAL48035.1| LD43729p [Drosophila melanogaster]
gi|23093758|gb|AAF50190.2| TBP-associated factor 2, isoform A [Drosophila melanogaster]
gi|220945982|gb|ACL85534.1| Taf2-PA [synthetic construct]
gi|323301248|gb|ADX35966.1| LD23043p [Drosophila melanogaster]
gi|440215588|gb|AGB94362.1| TBP-associated factor 2, isoform B [Drosophila melanogaster]
Length = 1221
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 127/502 (25%), Positives = 231/502 (46%), Gaps = 66/502 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
MLE ++G ++ ++ A A+ + +ST F + + +
Sbjct: 462 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 518
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 519 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDYV-----------------NQRGI 561
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + +++ ELDG + H + Q+ S + + K K C
Sbjct: 562 RKYNGPLMVQLQELDGTFKHTL--------QIESTLVKSDITCHSKSRRNKKKKIPLC-- 611
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
G+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI
Sbjct: 612 TGEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 670
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
AL+ P + + L + + + F++VR EAA++L A++ W+G ++ +
Sbjct: 671 ALQKYPTNATRLA--LTDTIESERCFYQVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 726
Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
R+F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+
Sbjct: 727 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 786
Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
N+ N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++
Sbjct: 787 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 846
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+PSY ++++SC++ + KL F L + + + + ++ +R+ A L+D
Sbjct: 847 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 902
Query: 455 EFHCNGIDSALSLFIKSVEEEP 476
+G L I +E +P
Sbjct: 903 -VKVDGRSEDLEHLITLLETDP 923
>gi|240275553|gb|EER39067.1| TATA-binding protein associated factor Taf2 [Ajellomyces capsulatus
H143]
Length = 1125
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 142/545 (26%), Positives = 230/545 (42%), Gaps = 85/545 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L+T F+ ++G+ + L FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLNARMGDLPNGALATSFFQKTCERLGHTK---LDTFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + +P R +S+ + E R+
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLEMSTFMRDVKEEIRNVYA 627
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 628 GPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687
Query: 167 DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
G D N GD + + R ES WIR D + E
Sbjct: 688 SGIDPNAELQDDVLLYCLGDVLQSEEEIQTWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
++ ++ P M+++QL++D D+VAQ ++I AA P +S V T L D +
Sbjct: 747 WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRRY 802
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
F+ +R AA AL A +E DW GL HL ++ + R NDF D + Y +
Sbjct: 803 FYGIRAAAARALVKHAKDEVDWIGLYHLESAFQELFCLPGSPMTRSNDFSDRAAYILQLT 862
Query: 317 IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG---- 372
IP A+A VR K+P F+ LK+NDN+ N SD +++ AL++S+ E G
Sbjct: 863 IPQAIAQVRDNSGKTPMRVKRFLYDKLKFNDNSNNELSDCYYITALMKSICEALLGRRET 922
Query: 373 -----------------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLT 412
Q + IDR + D S+ I + I C R L
Sbjct: 923 EIDGVGIDFDVERELERQAEQQLDRDCIAEIDRYRRMDEWTTSFQNIYSRTAIECQRRLM 982
Query: 413 QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSV 472
Q + +D+ + P+ T +R++A L+D + N L+ FI +
Sbjct: 983 QANV-------IDRDIMQFLPYTRAGTYEMLRLDAFNILIDFDIFQNS--ELLTWFIYVM 1033
Query: 473 EEEPS 477
+PS
Sbjct: 1034 SADPS 1038
>gi|367012451|ref|XP_003680726.1| hypothetical protein TDEL_0C06260 [Torulaspora delbrueckii]
gi|359748385|emb|CCE91515.1| hypothetical protein TDEL_0C06260 [Torulaspora delbrueckii]
Length = 1396
Score = 155 bits (393), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 119/457 (26%), Positives = 213/457 (46%), Gaps = 63/457 (13%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
+LST F++ +V R L+ FF +WV G P+ R+ +NK++ +VE+ + R C
Sbjct: 625 SLSTAHFQYVCERVN---RNKLEPFFQQWVYGSGVPIFRVTQRFNKKRMMVEMGI-RQCQ 680
Query: 90 VKPDSRTPVLSSN---------TDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGD 139
+ + V+ + N++ + G M+IR+HE DG Y+H + D
Sbjct: 681 AQELGQDKVVGEDGFCSSSLGYMQHPNKNLTTCFTGSMTIRIHEADGTPYEHIV--ELKD 738
Query: 140 AWQLLEIQCHSKLAARR-----ALKPK-KGSKPDGCDDN-------------GDAVAGL- 179
+ ++IQ ++K R A KP KG PD +D+ G+ ++G
Sbjct: 739 VFTKIDIQYNTKYRRIRNKRLNATKPTGKGMTPDITEDSFTERVEGGVIQKLGNVLSGPN 798
Query: 180 DMRS----------------SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDV 223
D R WIR D + E++ +++ NQP M+ +QL++D DV
Sbjct: 799 DCREWNLTEHSVTTDGNEAQRQNEAFEWIRIDSDFEWVCKLYVNQPDYMFTSQLQQDSDV 858
Query: 224 VAQAQAIAALEAL----PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-- 277
AQ +++ E P S + L + D + F+ VR+EA ALA + +
Sbjct: 859 EAQVESVRFFEDTIIHSPVSSQIYSSILTRTIMDKRYFYGVRLEACRALARYVVRDFEPN 918
Query: 278 --WAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA 335
G HL+K + +N +P NDF +F EYF+ + IP + +V+ ++P
Sbjct: 919 NFSGGAKHLIKIFHELFCYQNSNIPLNNDFTNFQEYFLQKNIPQYLTLVKNERKETPAFV 978
Query: 336 VEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF-GQQSILFLSSLLKRIDRLLQFDR 394
+F+L LL YN+N+ NPY D + L+QSV + + +L L+++ R +
Sbjct: 979 KQFLLDLLTYNENSENPYDDSHYRCTLIQSVVDCALTNRHDRDYLEKALEQLRRYQNLEE 1038
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLI 431
MP+Y ++ + ++ ++ L + G ++ + K++
Sbjct: 1039 WMPTYQLLVKNAILK--AKLRLGVEGLYDMNDLNKIL 1073
>gi|350413992|ref|XP_003490174.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Bombus impatiens]
Length = 1189
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 129/499 (25%), Positives = 227/499 (45%), Gaps = 61/499 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H F + + +
Sbjct: 447 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 505
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + F +N+++N VEL + +D T N+ G
Sbjct: 506 FIDQWVRTGGHAKFSLSFVFNRKRNTVELEIRQDTT-----------------NQRGIRK 548
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + + + ELDG + H L + G + +I CHSK + K + G
Sbjct: 549 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 598
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L + QP W QL + DV AQ +AI AL
Sbjct: 599 EEV-DMDLSAMDDSPVLWIRLDPEMTLLRAVQIEQPDYQWQYQLRHERDVTAQLEAIEAL 657
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
+ A P + +T+ N +++VR+ AA+ L A+ WAG ++ ++
Sbjct: 658 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 713
Query: 291 RRFDENIG-LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
+ + + N+F +F YF+ + IP A+A +R A P E + F++ L KYNDN+
Sbjct: 714 LFGSASCRRIIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLFKYNDNS 773
Query: 350 GNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQFDRLMP 397
N YSD ++ AAL++++G + Q + + SL L+ + R L ++L+P
Sbjct: 774 KNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSVDTRAILEEVTRNLNLEKLLP 833
Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFH 457
Y ++++C++ + L F L L + + + VR+ A AL+D
Sbjct: 834 CYKYTVSVACLKVIR----ILQKFGHLPSNPHLFRAYATYGQFIDVRIAALEALVDFT-R 888
Query: 458 CNGIDSALSLFIKSVEEEP 476
+G L + +E +P
Sbjct: 889 VDGKWEDLEFLLDMIEMDP 907
>gi|541665|emb|CAA55830.1| dTAF II 150 [Drosophila melanogaster]
Length = 1213
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 127/502 (25%), Positives = 231/502 (46%), Gaps = 66/502 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
MLE ++G ++ ++ A A+ + +ST F + + +
Sbjct: 462 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 518
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 519 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDYV-----------------NQRGI 561
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + +++ ELDG + H + Q+ S + + K K C
Sbjct: 562 RKYNGPLMVQLQELDGTFKHTL--------QIESTLVKSDITCHSKSRRNKKKKIPLC-- 611
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
G+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI
Sbjct: 612 TGEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 670
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
AL+ P + + L + + + F++VR EAA++L A++ W+G ++ +
Sbjct: 671 ALQKYPTNATRLA--LTDTIESERCFYQVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 726
Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
R+F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+
Sbjct: 727 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 786
Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
N+ N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++
Sbjct: 787 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 846
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+PSY ++++SC++ + KL F L + + + + ++ +R+ A L+D
Sbjct: 847 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 902
Query: 455 EFHCNGIDSALSLFIKSVEEEP 476
+G L I +E +P
Sbjct: 903 -VKVDGRSEDLEHLITLLETDP 923
>gi|195326481|ref|XP_002029957.1| GM24832 [Drosophila sechellia]
gi|194118900|gb|EDW40943.1| GM24832 [Drosophila sechellia]
Length = 1197
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 127/502 (25%), Positives = 231/502 (46%), Gaps = 66/502 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
MLE ++G ++ ++ A A+ + +ST F + + +
Sbjct: 440 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 496
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 497 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDYV-----------------NQRGI 539
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + +++ ELDG + H + Q+ S + + K K C
Sbjct: 540 RKYNGPLMVQLQELDGTFKHTL--------QIESTLVKSDITCHSKSRRNKKKKIPLC-- 589
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
G+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI
Sbjct: 590 TGEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 648
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
AL+ P + + L + + + F++VR EAA++L A++ W+G ++ +
Sbjct: 649 ALQKYPTNATRLA--LTDTIESERCFYQVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 704
Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
R+F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+
Sbjct: 705 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 764
Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
N+ N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++
Sbjct: 765 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 824
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+PSY ++++SC++ + KL F L + + + + ++ +R+ A L+D
Sbjct: 825 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 880
Query: 455 EFHCNGIDSALSLFIKSVEEEP 476
+G L I +E +P
Sbjct: 881 -VKVDGRSEDLEHLITLLETDP 901
>gi|195376663|ref|XP_002047112.1| GJ12109 [Drosophila virilis]
gi|194154270|gb|EDW69454.1| GJ12109 [Drosophila virilis]
Length = 1207
Score = 155 bits (392), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 118/440 (26%), Positives = 212/440 (48%), Gaps = 54/440 (12%)
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 522 FMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGVRK 564
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + +++ ELDG + H + + +I CHSK + K + G
Sbjct: 565 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLCT--------G 614
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI AL
Sbjct: 615 EEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 673
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
+ P + + L + + F++VR +AA++L A++ W+G ++ + R+
Sbjct: 674 QKYPTNATRMA--LTDTIESESCFYKVRCQAAHSLTKVANQMVASWSGPPAMLNIF--RK 729
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+N+
Sbjct: 730 FFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 789
Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
N Y+D ++ AALV+++GE + Q ++ LS+ +L + RLL ++ +
Sbjct: 790 RNHYTDAYYRAALVEALGETLTPVVSVAMHGTQITVDSLSTDAKLVLDEVTRLLNMEKHL 849
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
PSY ++++SC++ + KL F L + + + + ++ +R+ A L+D
Sbjct: 850 PSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIFLDLRIAAMECLVDF-V 904
Query: 457 HCNGIDSALSLFIKSVEEEP 476
+G L I +E +P
Sbjct: 905 KVDGRSEDLEHLITLLETDP 924
>gi|325091384|gb|EGC44694.1| TATA-binding protein associated factor Taf2 [Ajellomyces capsulatus
H88]
Length = 1247
Score = 155 bits (392), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 142/545 (26%), Positives = 230/545 (42%), Gaps = 85/545 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L+T F+ ++G+ + L FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLNARMGDLPNGALATSFFQKTCERLGHTK---LDTFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + +P R +S+ + E R+
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLEMSTFMRDVKEEIRNVYA 627
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 628 GPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687
Query: 167 DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
G D N GD + + R ES WIR D + E
Sbjct: 688 SGIDPNAELQDDVLLYCLGDVLQSEEEIQTWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
++ ++ P M+++QL++D D+VAQ ++I AA P +S V T L D +
Sbjct: 747 WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRRY 802
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
F+ +R AA AL A +E DW GL HL ++ + R NDF D + Y +
Sbjct: 803 FYGIRAAAARALVKHAKDEVDWIGLYHLESAFQELFCLPGSPMTRSNDFSDRAAYILQLT 862
Query: 317 IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG---- 372
IP A+A VR K+P F+ LK+NDN+ N SD +++ AL++S+ E G
Sbjct: 863 IPQAIAQVRDNSGKTPMRVKRFLYDKLKFNDNSNNELSDCYYITALMKSICEALLGRRET 922
Query: 373 -----------------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLT 412
Q + IDR + D S+ I + I C R L
Sbjct: 923 EIDGVGIDFDVERELERQAEQQLDRDCIAEIDRYRRMDEWTTSFQNIYSRTAIECQRRLM 982
Query: 413 QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSV 472
Q + +D+ + P+ T +R++A L+D + N L+ FI +
Sbjct: 983 QANV-------IDRDIMQFLPYTRAGTYEMLRLDAFNILIDFDIFQNS--ELLTWFIYVM 1033
Query: 473 EEEPS 477
+PS
Sbjct: 1034 SADPS 1038
>gi|195589193|ref|XP_002084340.1| GD12884 [Drosophila simulans]
gi|194196349|gb|EDX09925.1| GD12884 [Drosophila simulans]
Length = 1224
Score = 155 bits (392), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 127/502 (25%), Positives = 231/502 (46%), Gaps = 66/502 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
MLE ++G ++ ++ A A+ + +ST F + + +
Sbjct: 467 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 523
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 524 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGI 566
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + +++ ELDG + H + Q+ S + + K K C
Sbjct: 567 RKYNGPLMVQLQELDGTFKHTL--------QIESTLVKSDITCHSKSRRNKKKKIPLC-- 616
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
G+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI
Sbjct: 617 TGEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 675
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
AL+ P + + L + + + F++VR EAA++L A++ W+G ++ +
Sbjct: 676 ALQKYPTNATRLA--LTDTIESERCFYQVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 731
Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
R+F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+
Sbjct: 732 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 791
Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
N+ N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++
Sbjct: 792 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 851
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+PSY ++++SC++ + KL F L + + + + ++ +R+ A L+D
Sbjct: 852 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 907
Query: 455 EFHCNGIDSALSLFIKSVEEEP 476
+G L I +E +P
Sbjct: 908 -VKVDGRSEDLEHLITLLETDP 928
>gi|194750911|ref|XP_001957773.1| GF23856 [Drosophila ananassae]
gi|190625055|gb|EDV40579.1| GF23856 [Drosophila ananassae]
Length = 1221
Score = 155 bits (391), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 118/440 (26%), Positives = 211/440 (47%), Gaps = 54/440 (12%)
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 521 FMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGVRK 563
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + +++ ELDG + H + + +I CHSK + K + G
Sbjct: 564 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLCT--------G 613
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM + ++ QP W QL + DV AQ QAI AL
Sbjct: 614 EEV-DMDLSAMDDSPVLWIRLDPEMILMRDLIIEQPDFQWQYQLRHEKDVTAQFQAIQAL 672
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
+ P + + L + + + F++VR EAA++L A++ W+G ++ + R+
Sbjct: 673 QKYPTNATRMA--LTDTIESERCFYKVRCEAAHSLTKVANQMVASWSGPPAMLNIF--RK 728
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+N+
Sbjct: 729 FFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPTEVMRFLFDLFKYNENS 788
Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++ +
Sbjct: 789 RNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEKHL 848
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
PSY ++++SC++ + KL F L + + + + + +R+ A L+D
Sbjct: 849 PSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAAYGIFLDLRIAAMECLVDF-V 903
Query: 457 HCNGIDSALSLFIKSVEEEP 476
+G L I +E +P
Sbjct: 904 KVDGRSEDLEHLITLLETDP 923
>gi|407929393|gb|EKG22223.1| hypothetical protein MPH_00402 [Macrophomina phaseolina MS6]
Length = 1499
Score = 155 bits (391), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 155/586 (26%), Positives = 257/586 (43%), Gaps = 78/586 (13%)
Query: 4 KQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTC 62
K+M KI+ ++ A+ G +ST++F +K LE + +F+ +WV
Sbjct: 530 KRMAKMGVTKIIYKVLLAAKTGKLQNGAISTEDFIQIVDKY--LESKPIPDFWRQWVYGA 587
Query: 63 GCPVLRMGFSYNKRKNIVELAVLR------DCTVKPDSRTPVLSSNTDSENRDGDIG--- 113
GCP + +NK+K+++E A+ + + K P +E++ G
Sbjct: 588 GCPHFYVTQRFNKKKSVIEFAIRQTHGDRSELEEKAQDLDPCTFLKEVNEHQSGMSKKKP 647
Query: 114 ---WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGSKPD 167
+ G M++ VHE DG YDH I+P+ D + ++K R + ++ +
Sbjct: 648 TRVFTGAMTVTVHEADGTPYDH-IIPI-NDMTTRFDAPYNTKYTRLKRNRRQKERAAAAQ 705
Query: 168 GCDDNGDAVAG-----------LDMRSSMES-----------------PLSWIRADPEME 199
G D NGD LD ++ WIR D + E
Sbjct: 706 GIDLNGDGELPDEIPLYCLGDVLDTEKEIQEWNFDEWTKEEEETMSNEQFEWIRVDADFE 765
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
++A+I F+QP+ M + QL++D DVVAQA+A+ +L A S + L + D + F
Sbjct: 766 WIAKITFSQPLWMLVAQLQQDPDVVAQAEALQSLSAQ-RPSKLISTYLTRTVMDRRYFHG 824
Query: 260 VRIEAAYALANTASEETD-----WAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVL 314
+R AA +LA A D W GL HL K ++ + + R NDF D S+Y V
Sbjct: 825 IRTMAAESLAKCAIPRIDDVDLNWIGLSHLEKVFQELFCLPDSQMVRSNDFSDRSQYLVQ 884
Query: 315 EAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL----- 369
AIP A+A VR K A +++L LK+NDN+ N YSD +++A L+ ++ +
Sbjct: 885 CAIPRAIAKVREPSGKGLMRAKKWILDKLKFNDNSHNEYSDCYYVATLLSALAQQLAYKD 944
Query: 370 ---------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSG 420
E Q I F L I+R + D +PSY ++T++ ++ Q L +G
Sbjct: 945 VSTISFDFDEDEQDEIDFQRDALNEIERHRRLDEWIPSYQNVVTVTALQ--CQGLLMENG 1002
Query: 421 FI--SLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSL 478
I + ++ KP VR++A L+DL + +F S + P +
Sbjct: 1003 IIPRKILDFLQYAKP----GAAESVRLKALECLVDLNMMKHPAILRYLIFTLSYDPSPYM 1058
Query: 479 RGQVK--LGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAF 522
R + +G RI D++++V ++ ESR A
Sbjct: 1059 RENLYYLIGRGLARIAIGDKKEDTSNQVFGGLVIEQETSTESRAAM 1104
>gi|195493074|ref|XP_002094263.1| GE20292 [Drosophila yakuba]
gi|194180364|gb|EDW93975.1| GE20292 [Drosophila yakuba]
Length = 1219
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 128/502 (25%), Positives = 233/502 (46%), Gaps = 66/502 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
MLE ++G ++ ++ A A+ + +ST F + + +
Sbjct: 460 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 516
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 517 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGI 559
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + +++ ELDG + H + + +I CHSK + K +
Sbjct: 560 RKYNGPLMVQLQELDGTFKHTL--QIENTLVKSDITCHSKSRRNKKKKIPLCT------- 610
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
G+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI
Sbjct: 611 -GEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 668
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
AL+ P + + L + + + F++VR EAA++L A++ W+G ++ +
Sbjct: 669 ALQKYPTNATRLA--LTDTIESERCFYKVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 724
Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
R+F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+
Sbjct: 725 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 784
Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
N+ N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++
Sbjct: 785 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 844
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+PSY ++++SC++ + KL F L + + + + ++ +R+ A L+D
Sbjct: 845 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 900
Query: 455 EFHCNGIDSALSLFIKSVEEEP 476
+G L I +E +P
Sbjct: 901 -VKVDGRIEDLEHLITLLETDP 921
>gi|156053606|ref|XP_001592729.1| hypothetical protein SS1G_05650 [Sclerotinia sclerotiorum 1980]
gi|154703431|gb|EDO03170.1| hypothetical protein SS1G_05650 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1524
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 150/543 (27%), Positives = 246/543 (45%), Gaps = 81/543 (14%)
Query: 5 QMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGC 64
Q G+ R I + I++ T+ST+ FR K+ FF +WV GC
Sbjct: 551 QGGTGISRVISKMIVNANASGGADSTISTEGFRKSCEKITKYRD--TDSFFTQWVLGAGC 608
Query: 65 PVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDS------ENRDGDIG----- 113
P + +NK++ VE+ + + + +S T L + D E+ G I
Sbjct: 609 PHFTITQKFNKKRLCVEMTITQ---TQANSETE-LKLHKDGFYRSFKESTAGVIAPKAQN 664
Query: 114 -WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSK---LAARRALKPKK--GSKP 166
+ G ++IR+HE DG Y+H + M + +EI ++K L R +K + G
Sbjct: 665 TFTGPLTIRIHEADGTPYEHIV--MIREKTSRIEIPYNTKYKRLKRNRRMKERANAGIAM 722
Query: 167 DGCDDNGDAVAGLDMRSSMESP-----------------------LSWIRADPEMEYLAE 203
D +N D + ++SP WIR D + E++ +
Sbjct: 723 DINAENHDDALIYCLGDVLQSPQDIQEWGLRDWDPATEAKMEAESYEWIRVDADFEWICD 782
Query: 204 IHFN-QPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRI 262
F M+++QL++D D+ AQ +++A L+ P V L L D + F+ +R
Sbjct: 783 KKFTGMEAYMYVSQLQQDRDISAQYESMAYLDNAPPHPL-VATFLIRTLMDRRYFYGIRT 841
Query: 263 EAAYALANTASE-ETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAV 321
AA +LA A E + W GL HL K ++ +PR NDF D Y++ AIP A+
Sbjct: 842 MAAKSLAKHAVEKDVKWIGLTHLEKAFQEFFCYPGTKMPRSNDFSDKQAYWIECAIPEAM 901
Query: 322 AMVRA-ADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG------------- 367
+ VR+ D+ P+ A +F+L L++NDN N YSD F +A L++S+
Sbjct: 902 SRVRSLYDSLCPKRARQFILDQLRFNDNGNNDYSDNFKVANLLKSLADSLIPGEKGVNND 961
Query: 368 --------ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
E+E QQ F + +L+ +DR + D + SY I TI+ I ++ L +
Sbjct: 962 ALQLDEAEEIEEPQQ---FKNIVLEELDRYSRMDEWIDSYQNIYTITVIE--CKLKLMNA 1016
Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
G I +D + + D ++ VR++AS AL DL F N + L + S + P +R
Sbjct: 1017 GVIPVD-ALDFAQYLHDGSSP-NVRIKASWALADLGFLKNTAVACLLVTAMSTDPSPWVR 1074
Query: 480 GQV 482
G++
Sbjct: 1075 GEL 1077
>gi|406865162|gb|EKD18205.1| TATA-binding protein associated factor Taf2 [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 1530
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 150/559 (26%), Positives = 249/559 (44%), Gaps = 79/559 (14%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K GS +I+ ++ A G + +ST+ FR KV + F+ WV
Sbjct: 544 LVKASGSAGLVRIISKMVMIANTGDAKESFVSTEGFRKSCEKVTKYRQ--TDSFWKEWVY 601
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD-----GDIG-- 113
GCP L + +NK++ VEL + + P R DS R+ DI
Sbjct: 602 GSGCPRLSITQKFNKKRLCVELTISQKQDTLPTQR----EMQKDSFLRELKEVVHDIYSL 657
Query: 114 -----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK-- 165
+ G M+IR+HE DG Y+H + + +EI ++K + + +K +
Sbjct: 658 PVQPVFTGPMTIRIHEADGTPYEHIV--EIRELHSKIEIPYNTKYKRLKRTRKQKEQQNA 715
Query: 166 -------PDGCDDN-----GDAVAGLDM--------------RSSMESPLSWIRADPEME 199
+G +D GD + G DM + +ES WIR D + E
Sbjct: 716 MANADPNAEGGEDTLIYCLGDTIQGPDMEEWGVIEWDAENLAKMELES-YEWIRVDADFE 774
Query: 200 YLAEIHFNQ-PVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
++ E F+Q P M+++QL++D DVVAQ ++ L+ + V L L D++ F
Sbjct: 775 WICEKQFSQMPAYMYVSQLQQDRDVVAQQDSMLYLKNMQAHKL-VATFLTRTLMDTRYFH 833
Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
+R AA AL AS W G+ L K Y PRPNDF D Y V +I
Sbjct: 834 GIRSMAAEALNIHASPAVQWVGMKQLEKAYVEFYAYAKDKTPRPNDFSDKRAYTVEMSII 893
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL--------- 369
++A +R ++ P+++ F++ +L++NDN+ N +SD F +A L+ ++ +
Sbjct: 894 KSLAKIRDSNGNCPKQSRLFLMDILRFNDNSNNQFSDNFKIATLLSALADSLIPLEDADN 953
Query: 370 ------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTIS---CIRTLTQIALKLSG 420
E + + F +L+ +DR + D + SY T++ C + L Q +
Sbjct: 954 LDIYDDEEDMEPVEFKKQVLEELDRYARMDEWINSYQNCFTVTVLDCKQRLMQ-----AK 1008
Query: 421 FISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRG 480
I D V+ ++ D + + VR++A R+L+DL F N ++L L + S + P R
Sbjct: 1009 VIPTD-AVEFVQYLHDGTSEF-VRIKAFRSLIDLGFLTNNAVASLLLNVASTDPSPYTRR 1066
Query: 481 QVKLGIHAMRICQIKGGSD 499
++ I + I I G D
Sbjct: 1067 KL-FEIFCLGIASIAFGED 1084
>gi|380013884|ref|XP_003690975.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
TFIID subunit 2-like [Apis florea]
Length = 1181
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 131/505 (25%), Positives = 228/505 (45%), Gaps = 73/505 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H F + + +
Sbjct: 439 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 497
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + F +N+++N VEL + ++ T N+ G
Sbjct: 498 FIDQWVRTGGHAKFSLSFIFNRKRNTVELEIRQETT-----------------NQRGIRK 540
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + + + ELDG + H L + G + +I CHSK + K + G
Sbjct: 541 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 590
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L + QP W QL + DV AQ +AI AL
Sbjct: 591 EEV-DMDLSAMDDSPVLWIRLDPEMTLLRAVQVEQPDYQWQYQLRHERDVTAQLEAIEAL 649
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYK- 289
+ A P + +T+ N +++VR+ AA+ L A+ WAG ++ ++
Sbjct: 650 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 705
Query: 290 ------SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
RR + + N+F +F YF+ + IP A+A +R A P E + F++ L
Sbjct: 706 LFGSASCRR------IIKQNNFTNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLF 759
Query: 344 KYNDNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQ 391
KYNDN+ N YSD ++ AAL++++G + Q + + SL L+ + R L
Sbjct: 760 KYNDNSKNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSVDTRAILEEVTRNLN 819
Query: 392 FDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
++L+P Y +++SC++ + L F L L + + + VR+ A AL
Sbjct: 820 LEKLLPCYKYTVSVSCLKVIR----ILQKFGHLPSNPHLFRAYAAYGQFIDVRIAALEAL 875
Query: 452 LDLEFHCNGIDSALSLFIKSVEEEP 476
+D +G L + +E +P
Sbjct: 876 VDFT-RVDGKWEDLEFLLDMIEMDP 899
>gi|328787388|ref|XP_393397.4| PREDICTED: transcription initiation factor TFIID subunit 2, partial
[Apis mellifera]
Length = 1185
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 131/505 (25%), Positives = 228/505 (45%), Gaps = 73/505 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H F + + +
Sbjct: 443 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 501
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + F +N+++N VEL + ++ T N+ G
Sbjct: 502 FIDQWVRTGGHAKFSLSFIFNRKRNTVELEIRQETT-----------------NQRGIRK 544
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + + + ELDG + H L + G + +I CHSK + K + G
Sbjct: 545 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 594
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L + QP W QL + DV AQ +AI AL
Sbjct: 595 EEV-DMDLSAMDDSPVLWIRLDPEMTLLRAVQVEQPDYQWQYQLRHERDVTAQLEAIEAL 653
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYK- 289
+ A P + +T+ N +++VR+ AA+ L A+ WAG ++ ++
Sbjct: 654 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 709
Query: 290 ------SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
RR + + N+F +F YF+ + IP A+A +R A P E + F++ L
Sbjct: 710 LFGSASCRR------IIKQNNFTNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLF 763
Query: 344 KYNDNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQ 391
KYNDN+ N YSD ++ AAL++++G + Q + + SL L+ + R L
Sbjct: 764 KYNDNSKNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSVDTRAILEEVTRNLN 823
Query: 392 FDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
++L+P Y +++SC++ + L F L L + + + VR+ A AL
Sbjct: 824 LEKLLPCYKYTVSVSCLKVIR----ILQKFGHLPSNPHLFRAYAAYGQFIDVRIAALEAL 879
Query: 452 LDLEFHCNGIDSALSLFIKSVEEEP 476
+D +G L + +E +P
Sbjct: 880 VDFT-RVDGKWEDLEFLLDMIEMDP 903
>gi|383856938|ref|XP_003703963.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Megachile rotundata]
Length = 1199
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 127/499 (25%), Positives = 227/499 (45%), Gaps = 61/499 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H F + + +
Sbjct: 457 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 515
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + F +N+++N VEL + +D T N+ G
Sbjct: 516 FIDQWVRTGGHAKFSLSFVFNRKRNTVELEIRQDTT-----------------NQRGIRK 558
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + + + ELDG + H L + G + +I CHSK + K + G
Sbjct: 559 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 608
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DP+M + + QP W QL + DV AQ +AI AL
Sbjct: 609 EEV-DMDLSAMDDSPVLWIRLDPDMTLMRAVQIEQPDYQWQYQLRHERDVTAQLEAIEAL 667
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
+ A P + +T+ N +++VR+ AA+ L A+ WAG ++ ++
Sbjct: 668 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 723
Query: 291 RRFDENIG-LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
+ + + N+F +F YF+ + IP A+A +R A P E + F++ L KYNDN+
Sbjct: 724 LFGSASCRRIIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLFKYNDNS 783
Query: 350 GNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQFDRLMP 397
N YSD ++ AAL++++G + Q + + SL L+ + R L ++L+P
Sbjct: 784 KNRYSDNYYRAALIEALGATVTPVISIQQGTAITAESLSVDTKAILEEVTRNLNLEKLLP 843
Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFH 457
Y ++++C++ + L F L L + + + VR+ A AL+D
Sbjct: 844 CYKYTVSVACLKVIR----ILQKFGHLPSNPHLFRAYAAYGQFIDVRIAALEALVDFT-R 898
Query: 458 CNGIDSALSLFIKSVEEEP 476
+G L + +E +P
Sbjct: 899 VDGKWEDLEFLLDMIEMDP 917
>gi|303323785|ref|XP_003071884.1| hypothetical protein CPC735_074210 [Coccidioides posadasii C735 delta
SOWgp]
gi|240111586|gb|EER29739.1| hypothetical protein CPC735_074210 [Coccidioides posadasii C735 delta
SOWgp]
Length = 1253
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 140/538 (26%), Positives = 237/538 (44%), Gaps = 98/538 (18%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L+T F+ + G+++ L FF
Sbjct: 496 ILERRLSKASGKATVSRIISRIFLNCRMGDIPNSALTTIFFQKLCERFGHVK---LDAFF 552
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTP------VLSSNTDSENRD 109
+WV GCP +NK+K +VE+ + + V+ + +P + E ++
Sbjct: 553 QQWVYGAGCPRFVATQRFNKKKLVVEMMIRQ---VQSEQSSPRDLEKSTFMRDVKEELQN 609
Query: 110 GDIG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPK 161
G + G M+IR+HE DG Y+H + +A +I ++K R + +
Sbjct: 610 IYAGSLQHVFTGSMTIRIHEADGTPYEHIV--EIKEAVTKFDIPYNTKYKRLKRNKRQRE 667
Query: 162 KGSKPDGCDDNGDAVAGL------DMRSSMESPL--------------------SWIRAD 195
+ + G D N +A + D+ S E L WIR D
Sbjct: 668 RAAASSGVDPNSEAQDDVLLYCLGDVLQSEEEILKWRLVEWTKEDEDRMGQESYEWIRMD 727
Query: 196 PEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA-------IAALEALPHLSFNVVNTLN 248
+ E++ ++ P M+++QL++D DVVAQ +A +AA + + + +
Sbjct: 728 ADFEWICKMSLTMPGYMYLSQLQQDRDVVAQLEASNSRVGFMAAANCIQSIQYMAAQREH 787
Query: 249 NFLS--------DSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
+S DS+ F +R AA+AL A E+ DW GL HL ++ N +
Sbjct: 788 PLISTIFARTLMDSRYFHGIRTAAAHALVKHAKEDVDWIGLFHLETAFQELFCLPNSPMT 847
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
R N+F D S Y + AIP+A+A VR + + P F+ + LK+NDN+ N +SD +++A
Sbjct: 848 RSNNFSDRSSYALQIAIPNAIAKVRNRNGRCPMRVKRFLYEKLKFNDNSNNEFSDCYYVA 907
Query: 361 ALVQSVG--------------------ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYN 400
AL++S+ ELE + L+L + + I+R + D S+
Sbjct: 908 ALMRSIANALIGGKESIHTKSKLDMTQELERQAEDQLYLDCIAE-INRYKRMDEWTSSFQ 966
Query: 401 GILT---ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
+ T + C+R L + LD V P+ T VR++A R LLDL+
Sbjct: 967 NLYTRTALDCLRRL------MKEHRVLDFDVAQFLPYTRIGTFDLVRLDAFRILLDLD 1018
>gi|195012688|ref|XP_001983726.1| GH15413 [Drosophila grimshawi]
gi|193897208|gb|EDV96074.1| GH15413 [Drosophila grimshawi]
Length = 1207
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 117/440 (26%), Positives = 211/440 (47%), Gaps = 54/440 (12%)
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 522 FMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGVRK 564
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + +++ ELDG + H + + +I CHSK + K + G
Sbjct: 565 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLCT--------G 614
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI AL
Sbjct: 615 EEV-DMDLSAMGDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 673
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
+ P + + L + + F++VR +AA++L A++ W+G ++ + R+
Sbjct: 674 QKYPTNATRLA--LTDTIESESCFYKVRCQAAHSLTKVANQMVASWSGPPAMLNIF--RK 729
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
F + P + N+F +F Y++ +AIP A+A +R + P E + F+ L KYN+N+
Sbjct: 730 FFGSFSAPHIIKLNNFSNFQLYYLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 789
Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++ +
Sbjct: 790 RNHYTDAYYRAALVEALGETLTPVVSVAMHGTQITTDSLSTDAKLVLDEVTRLLNMEKHL 849
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
PSY ++++SC++ + KL F L + + + + ++ +R+ A L+D
Sbjct: 850 PSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIFLDLRIAAMECLVDF-V 904
Query: 457 HCNGIDSALSLFIKSVEEEP 476
+G L I +E +P
Sbjct: 905 KVDGRSEDLEHLITLLETDP 924
>gi|260943123|ref|XP_002615860.1| hypothetical protein CLUG_04742 [Clavispora lusitaniae ATCC 42720]
gi|238851150|gb|EEQ40614.1| hypothetical protein CLUG_04742 [Clavispora lusitaniae ATCC 42720]
Length = 1698
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 114/438 (26%), Positives = 205/438 (46%), Gaps = 67/438 (15%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
TLST+ F++ KV R L+ FF +WV G P+ + +NK+++++E+ + + +
Sbjct: 707 TLSTQHFQYVCEKVN---RNRLESFFKQWVYGIGTPIFNITQRFNKKRSMIEVTIRQTQS 763
Query: 90 VKPDSRTP--------VLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDA 140
+P + P L+ D+ + G M+IRVHE DG Y+H + D+
Sbjct: 764 QQPKNVHPQAETFIEDSLTYLNDTPIYPVQQTFLGPMTIRVHEADGTPYEHIV--DIKDS 821
Query: 141 WQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVA-----GLDMRSSME--------- 186
++Q ++K R LK K D+N ++ G ++S E
Sbjct: 822 VVKFDVQYNTKF---RRLKKNK-------DENSESTPIFAKLGDVLQSEQEMQDWNLADW 871
Query: 187 --------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPH 238
P WIR D + E++A + QP M+ QL++D D+ AQ AI +
Sbjct: 872 PKRDEEFLDPFEWIRVDTDFEWIARFNVKQPDYMFGAQLQQDRDIEAQIAAIEYFGSQEK 931
Query: 239 LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG 298
+ +L L DS+ F+ VRI AA +LA + ++AGL +L++ +K EN
Sbjct: 932 PNTIYCTSLLRTLMDSRYFYGVRIAAAKSLAIISKSNNNFAGLGYLIRAFKELYCFENSS 991
Query: 299 LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW 358
+P N F DF + F+ +AIP + ++ D +P+ + +LKYNDN+ N + D ++
Sbjct: 992 VPFSNSFDDFGKLFLQKAIPRFLGSIKDDDGNTPQSIKSLLFNVLKYNDNSNNEFQDCYY 1051
Query: 359 LAALVQSVGE-----------LEFG----------QQSILFLSSLLKRIDRLLQFDRLMP 397
++ L+ S+ E ++F + F+ +++ R+ + D +P
Sbjct: 1052 VSDLIWSLTEALIPANSEQLNVDFHLEREKAEASYSKDAKFIQKVVEEFSRIQKLDGWVP 1111
Query: 398 SYNGILTISCIRTLTQIA 415
SY +++ +CI ++A
Sbjct: 1112 SYQSVVSYACIEQKIRLA 1129
>gi|189241274|ref|XP_974531.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
Length = 1158
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 125/478 (26%), Positives = 219/478 (45%), Gaps = 61/478 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H F + + +
Sbjct: 447 MLEHRIGQELLLQVFNKQLSLATNAAQ-QKIGSGLWGHMLISTNVFTKAIFTVTGKDMAV 505
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + +N+++N +EL + +D T + R V
Sbjct: 506 FIDQWVRTGGHAKFHLTSVFNRKRNTIELEIRQDATTQLGIRKYV--------------- 550
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
G + + + ELDG + H +L + + +I CHSK + K + G
Sbjct: 551 --GPLLVTLQELDGTFKH-MLQIENTVVKA-DITCHSKSRRNKKKKIPLCT--------G 598
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
D V +D+ + +SP+ WIR DPEM L + QP W QL + DV AQ +AI AL
Sbjct: 599 DEV-DMDLSAMDDSPVLWIRLDPEMTLLRSVVIEQPDYQWQYQLRHERDVTAQMEAIYAL 657
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
E P + + L + + + + +++VR AA+ L A+ +WAG ++ + R+
Sbjct: 658 ERYPTPATRLA--LTDTIENEQCYYKVRCRAAHCLTKVANAMVANWAGPPAMLAIF--RK 713
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
+ P R N F +F Y++ + IP A+A +R A P + + F+L L KYNDN+
Sbjct: 714 LFGSFAAPHIIRQNSFTNFQHYYLQKTIPLAMAGLRNAHGICPPDVIRFLLDLFKYNDNS 773
Query: 350 GNPYSDVFWLAALVQSVGELEFGQQSILF---------LSS----LLKRIDRLLQFDRLM 396
N YSD ++ AALV+++G S++ LS+ +L+ + R+L ++ +
Sbjct: 774 KNRYSDNYYRAALVEALGNTVTPVVSVIHQGAPITAESLSTDTKLILEEVTRMLNLEKAL 833
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
P Y ++++C++ + KL F L + K + + VRV A L+D
Sbjct: 834 PCYKYTVSVACLKAIR----KLQKFGHLPSRPTIFKSYASYGHFIDVRVAALECLVDF 887
>gi|270013200|gb|EFA09648.1| hypothetical protein TcasGA2_TC011774 [Tribolium castaneum]
Length = 1169
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 125/478 (26%), Positives = 219/478 (45%), Gaps = 61/478 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H F + + +
Sbjct: 458 MLEHRIGQELLLQVFNKQLSLATNAAQ-QKIGSGLWGHMLISTNVFTKAIFTVTGKDMAV 516
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + +N+++N +EL + +D T + R V
Sbjct: 517 FIDQWVRTGGHAKFHLTSVFNRKRNTIELEIRQDATTQLGIRKYV--------------- 561
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
G + + + ELDG + H +L + + +I CHSK + K + G
Sbjct: 562 --GPLLVTLQELDGTFKH-MLQIENTVVKA-DITCHSKSRRNKKKKIPLCT--------G 609
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
D V +D+ + +SP+ WIR DPEM L + QP W QL + DV AQ +AI AL
Sbjct: 610 DEV-DMDLSAMDDSPVLWIRLDPEMTLLRSVVIEQPDYQWQYQLRHERDVTAQMEAIYAL 668
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
E P + + L + + + + +++VR AA+ L A+ +WAG ++ + R+
Sbjct: 669 ERYPTPATRLA--LTDTIENEQCYYKVRCRAAHCLTKVANAMVANWAGPPAMLAIF--RK 724
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
+ P R N F +F Y++ + IP A+A +R A P + + F+L L KYNDN+
Sbjct: 725 LFGSFAAPHIIRQNSFTNFQHYYLQKTIPLAMAGLRNAHGICPPDVIRFLLDLFKYNDNS 784
Query: 350 GNPYSDVFWLAALVQSVGELEFGQQSILF---------LSS----LLKRIDRLLQFDRLM 396
N YSD ++ AALV+++G S++ LS+ +L+ + R+L ++ +
Sbjct: 785 KNRYSDNYYRAALVEALGNTVTPVVSVIHQGAPITAESLSTDTKLILEEVTRMLNLEKAL 844
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
P Y ++++C++ + KL F L + K + + VRV A L+D
Sbjct: 845 PCYKYTVSVACLKAIR----KLQKFGHLPSRPTIFKSYASYGHFIDVRVAALECLVDF 898
>gi|225679290|gb|EEH17574.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 1253
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 149/550 (27%), Positives = 237/550 (43%), Gaps = 95/550 (17%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P +L+T + ++G+ + L +FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLHARMGDLPNGSLATSFLQKICERLGHTK---LDQFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSR---TPVLSSNTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + +P R + E ++
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLDKLTFMRDVKEEIKNVYA 627
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 628 GAVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687
Query: 167 DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
G D N GD + D R ES WIR D + E
Sbjct: 688 SGLDPNAELQDDVLLYCLGDVLQSEDEIQKWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
++ ++ P M+++QL++D D+VAQ ++I AA P +S + TL D +
Sbjct: 747 WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFIRTL----MDRRY 802
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSEY 311
F+ +R AA AL A E+ DW GL HL F E LP R NDF D S Y
Sbjct: 803 FYGIRAAAAKALVKHAKEDVDWIGLFHL-----ESAFQELFCLPGSPMTRSNDFSDRSSY 857
Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---- 367
+ IP A+A VR + K+P F+ LK+NDN+ + SD +++AAL++S+
Sbjct: 858 ILQLTIPQAIAQVRDSTGKTPMRVKRFLYDKLKFNDNSNSEISDCYYVAALMKSICDALL 917
Query: 368 ----------------ELEFGQQSILFLS-SLLKRIDRLLQFDRLMPSYNGIL---TISC 407
E+E +Q+ L + IDR + D S+ I I C
Sbjct: 918 GRKENEPRNIDMDFDMEMELERQAEQQLDRDCIAEIDRYRRMDEWTSSFQNIYARTAIGC 977
Query: 408 IRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSL 467
R Q +G I LD +V+ ++ R T +R++A L +EF I L+
Sbjct: 978 QRRFMQ-----AGVIELD-LVQFLQYTRA-GTYEMLRLDAFDIL--IEFDIFQIPELLTW 1028
Query: 468 FIKSVEEEPS 477
FI + +PS
Sbjct: 1029 FIYVMSTDPS 1038
>gi|336389953|gb|EGO31096.1| hypothetical protein SERLADRAFT_444675 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1798
Score = 152 bits (385), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 117/373 (31%), Positives = 176/373 (47%), Gaps = 39/373 (10%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E++A I F QP MW++QL++D DVVAQ +AI AL P + V +T
Sbjct: 703 WIRMDADFEWIAAIAFEQPDFMWVSQLQRDRDVVAQLEAIHALAKQP--TAIVSSTFTKT 760
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF---DENIGL------PR 301
+ S F+R+R EAA AL N A D+ GL HL K + + D N L P+
Sbjct: 761 VLVSNYFYRIRCEAALALVNCAIRRLDFLGLFHLFKLFLRNCYEPEDSNQDLFAHTYVPK 820
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
PNDF D SEYFV +++ +A++ VR + KSP +F++ L+YNDN NPYSD F++
Sbjct: 821 PNDFSDLSEYFVRKSLVNAISRVRFENGKSPSVVRQFLIDQLRYNDNTANPYSDAFYICT 880
Query: 362 LVQSV------------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
++ + GEL E + L + IDR DRL+ S + +
Sbjct: 881 IISASACATVSTAPPERGELLPTETRSEQNSEDTSLLKQAISEIDRYRSMDRLITSPHNV 940
Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI- 461
+TI+ + + L ++ D +++ P QVR+ A L +++ I
Sbjct: 941 VTIAALEFY--LVLTVANLAPSD--LRVFFPLTREGNYTQVRIAAFDGLFLSKWYTPAIM 996
Query: 462 DSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIA 521
+L+L + E + +L+ L I K EVD + V L + R
Sbjct: 997 KYSLALLVSMGEMKNALKETESLLIEEDGSMPEKAKEVKKSEVDLMIKV----LRKDREI 1052
Query: 522 FNNVFLRHHLFGI 534
N LR L I
Sbjct: 1053 GKNEVLREFLMPI 1065
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 63/141 (44%), Gaps = 21/141 (14%)
Query: 1 MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++G + R + + +S G P LST F KV +E L+ F
Sbjct: 483 ILDRRLGKSGTSLGLSRVLPKVFLSGISGEMPNNALSTHSFLRTCRKVSGVE---LRSFA 539
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+W+ GCP S+N++K VE+ + ++ P + +E +
Sbjct: 540 EQWIYGSGCPAFGFSASFNRKKMAVEITMRQEA--------PAFKLHESNEVSKALLKPV 591
Query: 114 --WPGMMSIRVHELDGM-YDH 131
+ G M+IR+HE DG Y+H
Sbjct: 592 PFFEGQMTIRIHEADGTPYEH 612
>gi|308453099|ref|XP_003089299.1| hypothetical protein CRE_11590 [Caenorhabditis remanei]
gi|308241221|gb|EFO85173.1| hypothetical protein CRE_11590 [Caenorhabditis remanei]
Length = 1092
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 143/515 (27%), Positives = 249/515 (48%), Gaps = 75/515 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERP-F 50
ML+K++G F ++L +++ S +T +ST+ F + V E P F
Sbjct: 494 MLQKRIGLEPFMRVLHRLLTVGLDMSEKKTTPAAWRHLLISTEAFFRSVSSVTGREIPTF 553
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
L++F V T G + F +N+++NIVE+ V +D D+E +G
Sbjct: 554 LEQF----VRTGGHAAFAVKFDFNRKRNIVEIEVKQD----------------DTEG-NG 592
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDG 168
+ G +S+ V E+DG + H I A EI CHSK ++ K G + +
Sbjct: 593 RTQYTGPLSVVVQEVDGAFSHTI--QIDGAISHAEISCHSKGRKQKKKKVPILTGEEIEI 650
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
N DA ESP+ W+R D + + EI +QP+ W L+ + DV+AQ +
Sbjct: 651 DLTNMDA----------ESPILWLRIDNDYLLIREISISQPMFHWEYMLKYERDVIAQME 700
Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA---NTASEETDWAGLLHLV 285
A+ ++ALP S + + + + +S+ + F+R+R AA+ L N SE + +
Sbjct: 701 ALERIQALP--SAHSRSVIVDAVSNERFFYRIRCRAAFVLTSVQNRKSEAIAVGTPVLIN 758
Query: 286 KFYKSRRFDENIGLPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQ 341
F +S + +P+PN+F ++ +YFV++++P A++ +R + + F+L
Sbjct: 759 MFRESFGCKSSTNIPKPNNFVVTAQNLQQYFVMQSLPQAISRLRKQTGECHEDVQSFLLD 818
Query: 342 LLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-------LSS----LLKRIDRLL 390
L+K+NDN+ N YSD F+ A+L S+ + + F LSS L+K L
Sbjct: 819 LIKFNDNSTNRYSDDFYRASLFNSLAASIHPSEFLPFRVDLPENLSSEFRILIKEFTYAL 878
Query: 391 QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRA 450
D + PSY ++ I+ + L Q L+ G++ LD KL+ F N+ Q+R A
Sbjct: 879 NMDTVNPSYGRVVGIAALSGLYQ--LQKCGYLPLDS--KLLWAFAHRNSCVQMRRCAVTL 934
Query: 451 LLD-LEFHCNGIDSA---LSLFIKSV--EEEPSLR 479
++D + + +D+ LS+ + V E++PS+R
Sbjct: 935 IIDRIVNDIHAVDTKMYDLSMLLNLVLTEKDPSIR 969
>gi|358057254|dbj|GAA96863.1| hypothetical protein E5Q_03536 [Mixia osmundae IAM 14324]
Length = 1605
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 118/454 (25%), Positives = 210/454 (46%), Gaps = 71/454 (15%)
Query: 12 RKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLE---RPFLKEFFPRWVGTCGCPVLR 68
R I + +IS G T+ T F K+ N RP+ + W+ GCP
Sbjct: 516 RVIPKLLISVMAGELKDNTIGTPGFLRMCRKLANAGVDLRPWANQ----WIYGSGCPHFN 571
Query: 69 MGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM 128
+ ++N++K I+E+ + + D+ T S+ + + G ++IR+HE DG
Sbjct: 572 VVINFNRKKMIIEVDITQHLPAANDAATQPWSATAHTNPVQS---FDGALTIRIHEFDGT 628
Query: 129 -YDHPILPMAGDAWQLLEIQCHSKL----AARRALKPKKGSKPDGCDDNGDAVAGLDMRS 183
YDH + AG ++ E+ +++ + R+ P++ D DD G + + R
Sbjct: 629 PYDHVLGVKAG--YKRHEVAYNTRFKRIRSGIRSFIPRQH---DANDDPGMGIKASESRF 683
Query: 184 SME--------------------------SPLSWIRADPEMEYLAEIHFNQPVQMWINQL 217
++ +P WIR D +M+++ + + QP WI QL
Sbjct: 684 HLQLWEDAAQRTRWKVADYTEEEVEAIANTPYEWIRIDADMDWICSVAYVQPEYAWIAQL 743
Query: 218 EKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD 277
E+D DV+AQ A+ L A P S ++L L F+R+RIEA YA+ A++
Sbjct: 744 ERDKDVIAQLDAVHGLLARP--SLMAASSLAKTLLVETYFFRIRIEAIYAMLGCATDALQ 801
Query: 278 WAGLLHLVK-FYK---------SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAA 327
GL +L++ FY+ +E + +P PNDF + +Y + +A AV++VR A
Sbjct: 802 QRGLFYLLRLFYQFFDTGKTPFPADHNEYVCVPSPNDFSNVPDYLIKKAFISAVSLVRDA 861
Query: 328 DNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSIL----FL---- 379
SP A ++ LL YNDN+ N + D F ++ ++ + ++ F+
Sbjct: 862 SGLSPHVAKIMLIDLLAYNDNSRNAFDDSFMISEVISDLAVALLADKTDTSDANFVPTPA 921
Query: 380 -SSLLKR----IDRLLQFDRLMPSYNGILTISCI 408
++L++R ++R DRL+PSY ++T++ +
Sbjct: 922 DTALVQRASDEVERYQSLDRLVPSYQNLVTVAAL 955
>gi|195126212|ref|XP_002007568.1| GI13016 [Drosophila mojavensis]
gi|193919177|gb|EDW18044.1| GI13016 [Drosophila mojavensis]
Length = 1212
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 113/418 (27%), Positives = 203/418 (48%), Gaps = 53/418 (12%)
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 522 FMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGVRK 564
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + +++ ELDG + H + + +I CHSK + K + G
Sbjct: 565 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLCT--------G 614
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI AL
Sbjct: 615 EEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 673
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
+ P + + L + + F++VR +AA++L A++ W+G ++ + R+
Sbjct: 674 QKYPTNATRLA--LTDTIESDHCFYKVRCQAAHSLTKVANQMVASWSGPPAMLNIF--RK 729
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+N+
Sbjct: 730 FFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 789
Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++ +
Sbjct: 790 RNHYTDAYYRAALVEALGETLTPVVSVAMHGTQITTDSLSTDAKLVLDEVTRLLNMEKHL 849
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
PSY ++++SC++ + KL F L + + + + ++ +R+ A L+D
Sbjct: 850 PSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIFLDLRIAAMECLVDF 903
>gi|195428287|ref|XP_002062205.1| GK17418 [Drosophila willistoni]
gi|194158290|gb|EDW73191.1| GK17418 [Drosophila willistoni]
Length = 1224
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 131/503 (26%), Positives = 237/503 (47%), Gaps = 62/503 (12%)
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + +N+++N +EL + +D N+ G
Sbjct: 521 FMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGVRK 563
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + +++ ELDG + H + + +I CHSK + K + G
Sbjct: 564 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLCT--------G 613
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI AL
Sbjct: 614 EEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 672
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYKSRR 292
+ P + + L + + + F++VR +AA++L A++ W+G ++ + R+
Sbjct: 673 QKYPTNATRMA--LTDTIESERCFYKVRCQAAHSLTKVANQMVAAWSGPPAMLNIF--RK 728
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
F + P + N+F +F YF+ ++IP A+A +R + P E + F+ L KYN+N+
Sbjct: 729 FFGSFSAPHIIKLNNFSNFQLYFLQKSIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 788
Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++ +
Sbjct: 789 RNHYTDAYYRAALVEALGETLTPVVSVAMHGTQITTESLSTDAKLVLDEVTRLLNMEKHL 848
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
PSY ++++SC++ + ++ KL SL + + + + +R+ A L+D
Sbjct: 849 PSYKYMVSVSCLKVIRKLQ-KLGHLPSLPHIY---RSYASYGIFLDLRIAAMECLVDF-V 903
Query: 457 HCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALL--- 513
+G L I +E +P + G+ + I +S +D LV L
Sbjct: 904 KVDGRSDDLEDLITLLETDPD--PAARHGLAQLLIDNPPFTRESRSRLDKPNLVERLWYS 961
Query: 514 -NLL--ESRIAFNNVFLRHHLFG 533
N L ++++ + V L H L+G
Sbjct: 962 INTLAYDTKLRCDTVDLYHSLYG 984
>gi|308492512|ref|XP_003108446.1| CRE-TAF-2 protein [Caenorhabditis remanei]
gi|308248186|gb|EFO92138.1| CRE-TAF-2 protein [Caenorhabditis remanei]
Length = 1092
Score = 152 bits (384), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 143/515 (27%), Positives = 249/515 (48%), Gaps = 75/515 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERP-F 50
ML+K++G F ++L +++ S +T +ST+ F + V E P F
Sbjct: 494 MLQKRIGLEPFMRVLHRLLTVGLDMSEKKTTPAAWRHLLISTEAFFRSVSSVTGREIPTF 553
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
L++F V T G + F +N+++NIVE+ V +D D+E +G
Sbjct: 554 LEQF----VRTGGHAAFAVKFDFNRKRNIVEIEVKQD----------------DTEG-NG 592
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDG 168
+ G +S+ V E+DG + H I A EI CHSK ++ K G + +
Sbjct: 593 RTQYTGPLSVVVQEVDGAFSHTI--QIDGAISHAEISCHSKGRKQKKKKVPILTGEEIEI 650
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
N DA ESP+ W+R D + + EI +QP+ W L+ + DV+AQ +
Sbjct: 651 DLTNMDA----------ESPILWLRIDNDYLLIREISISQPMFHWEYMLKYERDVIAQME 700
Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA---NTASEETDWAGLLHLV 285
A+ ++ALP S + + + + +S+ + F+R+R AA+ L N SE + +
Sbjct: 701 ALERIQALP--SAHSRSVIVDAVSNERFFYRIRCRAAFVLTSVQNRKSEAIAVGTPVLIN 758
Query: 286 KFYKSRRFDENIGLPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQ 341
F +S + +P+PN+F ++ +YFV++++P A++ +R + + F+L
Sbjct: 759 MFRESFGCKSSTNIPKPNNFVVTAQNLQQYFVMQSLPQAISRLRKQTGECHEDVQSFLLD 818
Query: 342 LLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-------LSS----LLKRIDRLL 390
L+K+NDN+ N YSD F+ A+L S+ + + F LSS L+K L
Sbjct: 819 LIKFNDNSTNRYSDDFYRASLFNSLAASIHPSEFLPFRVDLPENLSSEFRVLIKEFTYAL 878
Query: 391 QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRA 450
D + PSY ++ I+ + L Q L+ G++ LD KL+ F N+ Q+R A
Sbjct: 879 NMDTVNPSYGRVVGIAALSGLYQ--LQKCGYLPLDS--KLLWAFAHRNSCVQMRRCAVTL 934
Query: 451 LLD-LEFHCNGIDSA---LSLFIKSV--EEEPSLR 479
++D + + +D+ LS+ + V E++PS+R
Sbjct: 935 IIDRIVNDIHAVDTKMYDLSMLLNLVLTEKDPSIR 969
>gi|336376907|gb|EGO05242.1| hypothetical protein SERLA73DRAFT_82815 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1798
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 117/373 (31%), Positives = 175/373 (46%), Gaps = 39/373 (10%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E++A I F QP MW++QL++D DVVAQ +AI AL P + V +T
Sbjct: 703 WIRMDADFEWIAAIAFEQPDFMWVSQLQRDRDVVAQLEAIHALAKQP--TAIVSSTFTKT 760
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF---DENIGL------PR 301
+ S F+R+R EAA AL N A D+ GL HL K + + D N L P+
Sbjct: 761 VLVSNYFYRIRCEAALALVNCAIRRLDFLGLFHLFKLFLRNCYEPEDSNQDLFAHTYVPK 820
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
PNDF D SEYFV +++ +A++ VR + KSP +F++ L+YNDN NPYSD F++
Sbjct: 821 PNDFSDLSEYFVRKSLVNAISRVRFENGKSPSVVRQFLIDQLRYNDNTANPYSDAFYICT 880
Query: 362 LVQSV------------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
++ + GEL E + L + IDR DRL+ S + +
Sbjct: 881 IISASACATVSTAPPERGELLPTETRSEQNSEDTSLLKQAISEIDRYRSMDRLITSPHNV 940
Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI- 461
+TI+ + + L ++ D +++ P QVR+ A L +++ I
Sbjct: 941 VTIAALEFY--LVLTVANLAPSD--LRVFFPLTREGNYTQVRIAAFDGLFLSKWYTPAIM 996
Query: 462 DSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIA 521
L+L + E + +L+ L I K EVD + V L + R
Sbjct: 997 KYVLALLVSMGEMKNALKETESLLIEEDGSMPEKAKEVKKSEVDLMIKV----LRKDREI 1052
Query: 522 FNNVFLRHHLFGI 534
N LR L I
Sbjct: 1053 GKNEVLREFLMPI 1065
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 36/141 (25%), Positives = 63/141 (44%), Gaps = 21/141 (14%)
Query: 1 MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++G + R + + +S G P LST F KV +E L+ F
Sbjct: 483 ILDRRLGKSGTSLGLSRVLPKVFLSGISGEMPNNALSTHSFLRTCRKVSGVE---LRSFA 539
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+W+ GCP S+N++K VE+ + ++ P + +E +
Sbjct: 540 EQWIYGSGCPAFGFSASFNRKKMAVEITMRQEA--------PAFKLHESNEVSKALLKPV 591
Query: 114 --WPGMMSIRVHELDGM-YDH 131
+ G M+IR+HE DG Y+H
Sbjct: 592 PFFEGQMTIRIHEADGTPYEH 612
>gi|358372687|dbj|GAA89289.1| transcription initiation factor TFIID subunit TSM1/127kD [Aspergillus
kawachii IFO 4308]
Length = 1273
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 134/525 (25%), Positives = 231/525 (44%), Gaps = 88/525 (16%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P ++T F+ ++G+ + L FF
Sbjct: 511 ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAVTTSLFQRTCERLGHAK---LDSFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
+WV GCP + +NK+K +VE+ + + D + D + E R
Sbjct: 568 QQWVYGAGCPRFQATQRFNKKKLVVEMMIRQVQADASSTRDLDKNAFMRDVKEEIRGVYA 627
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 628 GVIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAV 687
Query: 167 DGCDDNGDA------------------VAGLDMRS--------SMESPLSWIRADPEMEY 200
G D N +A VA +R WIR D + E+
Sbjct: 688 TGGDPNAEAQEDVLLYCLGDVLQTEEEVAEWKLRDWSEEEEERMSRESYEWIRMDADFEW 747
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFN-----VVNTLNNFLSDSK 255
+ ++ P M+++QL++D DVVAQ LE+L +++ + L D +
Sbjct: 748 ICKLSLAMPGYMYLSQLQQDRDVVAQ------LESLQYMAIQREHELISTIFVRTLMDRR 801
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
F+ +R+ AA AL A+E +W GL HL R F E LP R NDF D +
Sbjct: 802 YFYGIRVAAARALVKHATEAVEWVGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 856
Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL- 369
YF+ AIP +++ VR ++ K+P F+ LK+NDN+ N YSD F++A L++S+
Sbjct: 857 YFIQLAIPESISKVRDSNGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMESLCHAM 916
Query: 370 ------------EFGQQSILFLSS-------LLKRIDRLLQFDRLMPSYNGILTISCIRT 410
+F + +L + + IDR + D S+ I + + +R
Sbjct: 917 LGKPERPSDEMDDFDMERVLAAQAEEQLEKDAIAEIDRYRRMDEWSSSFQNIYSRAALR- 975
Query: 411 LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
Q+ L + +SLD +++ + P+ T +R++A L++L+
Sbjct: 976 -CQVRLMEAKILSLD-IMQFL-PYTRAGTYDLLRLDAFECLIELD 1017
>gi|357612256|gb|EHJ67886.1| hypothetical protein KGM_13825 [Danaus plexippus]
Length = 1189
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 144/575 (25%), Positives = 260/575 (45%), Gaps = 73/575 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE+++G ++ +S A A+ + + + + H F + + +
Sbjct: 446 MLEQRIGQELLLQVFNKQLSLATNAANTK-IGSGLWGHLLLSTNLFVKAIFTVTGKDMAV 504
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G ++ +N+++N VEL + +D S + G
Sbjct: 505 FVDQWVRTGGHAKFQLTSVFNRKRNTVELEIRQD-----------------SVHERGIRK 547
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + +++ ELDG + H + + +I CHSK + K + G
Sbjct: 548 YVGPLLVQLQELDGTFKHTL--QIENTVVKADITCHSKSRRNKKKKIPLCT--------G 597
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L +QP W QL + DV AQ++AI AL
Sbjct: 598 EEV-DMDLSAMDDSPVLWIRLDPEMSLLRSTVISQPDYQWQYQLRHERDVTAQSEAIDAL 656
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKSRR 292
P + L + + + + +++R AA+ L A+ + WAG ++ + R+
Sbjct: 657 HNYPEPATR--KALTDIIENEQTHYKIRCRAAHCLTKVANAMISSWAGPPAMLTIF--RK 712
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
+ P + N+F + YF+ + IP A+A +R P E V F+L L KYNDN+
Sbjct: 713 MFGSFAAPHIIKQNNFDNLQHYFLQKTIPVAMAGLRNIHGICPPEVVRFLLDLFKYNDNS 772
Query: 350 GNPYSDVFWLAALVQSVGELEFGQQSIL---------FLSS----LLKRIDRLLQFDRLM 396
N +SD ++ AALV ++ S+L LS+ +L+ I R+L ++++
Sbjct: 773 KNHFSDNYYKAALVDALAATITPVISVLQPGAPITAESLSADTRLVLEEITRVLNLEKVL 832
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
P Y +T+SC+R + + L+ G L + + + + + VR+ A L+D
Sbjct: 833 PCYKNTVTVSCLRAIRR--LQQCG--HLPSIPTVFRAYAQYGQYIDVRLAAFEGLVDF-V 887
Query: 457 HCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVA-LLNL 515
+G LS + ++E +P V+ G+ + + H +DT ++V L N
Sbjct: 888 RVDGKPEDLSYLLTAIENDPD--PGVRHGLARLMVSMPPFERAQRHRLDTESVVHRLWNN 945
Query: 516 LESRIAFNNVFLRHHLFGILQILAGRAPTLYGVPR 550
+ S+++ N+ LR L + TLYG+ R
Sbjct: 946 INSQLS-NDARLRCDLVDLYY-------TLYGLKR 972
>gi|449541548|gb|EMD32531.1| hypothetical protein CERSUDRAFT_118583 [Ceriporiopsis subvermispora
B]
Length = 1819
Score = 152 bits (383), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 154/300 (51%), Gaps = 36/300 (12%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E++A I F+Q MW++QL++D DVVAQ +AI+ L +P+ + +T
Sbjct: 699 WIRIDADFEWIAAIAFDQKDYMWVSQLQRDRDVVAQYEAISVLSKMPNPI--ISSTFTRT 756
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFY----------KSRRFDENIGLP 300
+ S ++R+R EAA AL A + +W GL HL K + K F+ +P
Sbjct: 757 VLVSNYYFRIRCEAAQALVTCAVQRIEWLGLFHLFKLFLRYCYEPEDPKPDLFNHKY-VP 815
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
RPNDF DF+EYFV +++ A++ VR + K+P +F++ L+YNDN N +SD ++
Sbjct: 816 RPNDFSDFAEYFVRKSLVKAISRVRFENGKTPPIIRQFLIDQLRYNDNTANLFSDAMYIC 875
Query: 361 ALVQSV------------GELEFGQ-------QSILFLSSLLKRIDRLLQFDRLMPSYNG 401
++ ++ GE + Q + L L +DR DRL+PSY+
Sbjct: 876 TVISALACATMSTVPPERGEFISAESQPVQDAQDVDLLKQALAEVDRYRSMDRLIPSYHN 935
Query: 402 ILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
++T++ I + L ++GF+ D ++ P QVR+ A L +++ I
Sbjct: 936 VVTVAVIEFY--LMLSVAGFVPHDP--RMFFPLTREGNYHQVRMAAFDTLFLTKWYTPKI 991
>gi|366999642|ref|XP_003684557.1| hypothetical protein TPHA_0B04540 [Tetrapisispora phaffii CBS 4417]
gi|357522853|emb|CCE62123.1| hypothetical protein TPHA_0B04540 [Tetrapisispora phaffii CBS 4417]
Length = 1523
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 123/454 (27%), Positives = 209/454 (46%), Gaps = 62/454 (13%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
+IL I +A G P +LS F+H +V + L +FF +WV + G P+ R+
Sbjct: 655 RILPKIFLQAMSGDLPNNSLSAAHFQHVCERVN---KSKLGKFFQQWVYSSGVPIFRVTQ 711
Query: 72 SYNKRKNIVELAVLR---------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRV 122
+NK++ ++E+ + + VK + L+ DS R+ + G M+IR+
Sbjct: 712 RFNKKRMVIEMGIRQVQEQKVSGSKNGVKEGFSSRALNYIQDS-TRETIPHFTGSMTIRI 770
Query: 123 HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARR-----ALKPKKGSKPDGCDDNGDAV 176
HE DG Y+H + D + L+IQ ++K R +LK K K + + V
Sbjct: 771 HEPDGTPYEHIV--EIKDVFTKLDIQYNTKYRRSRTKKQSSLKLDKDGKTELNEQPSQVV 828
Query: 177 -----------AGLDMRSSMES--------------------PLSWIRADPEMEYLAEIH 205
L +S ES WIR D ++E++ E+H
Sbjct: 829 KFPNVNIQKFGTVLSTQSDCESWNLADPSLSSEVLEIQRQNEAFQWIRIDSDLEWICELH 888
Query: 206 FNQPVQMWINQLEKDGDVVAQAQAI----AALEALPHLSFNVVNTLNNFLSDSKAFWRVR 261
NQ M+ QL++D D+ AQ Q+I +E S + L + DS+ F+ VR
Sbjct: 889 INQSDYMFAAQLQQDADIEAQIQSIRYYDDTIEQSQVTSLLYSSILTRTVMDSRYFYGVR 948
Query: 262 IEAAYALANTASEETDW----AGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAI 317
+EA AL + G HL+K Y+ +P NDF D +Y++ +AI
Sbjct: 949 LEACRALTKYIVRDLPNRVFPGGPRHLIKIYQELFCYSESNIPLNNDFSDIRKYYLQKAI 1008
Query: 318 PHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF-GQQSI 376
P ++ V + ++SP F+L +L YN+N+GN Y+D++++ L+Q++ E
Sbjct: 1009 PVYLSEVVSETDESPEFVKNFLLDILTYNENSGNNYNDIYFICYLIQALVECATKSANDR 1068
Query: 377 LFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
F+S +L+++ R D+ PSY +LT + + +
Sbjct: 1069 SFISKVLEQLQRYENLDKWTPSYQLLLTKTILES 1102
>gi|134083644|emb|CAK47036.1| unnamed protein product [Aspergillus niger]
Length = 1271
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 134/524 (25%), Positives = 227/524 (43%), Gaps = 79/524 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P ++T F+ ++G+ + L FF
Sbjct: 511 ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAVTTSLFQRTCERLGHAK---LDSFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
+WV GCP + +NK+K +VE+ + + D + D + E R
Sbjct: 568 QQWVYGAGCPRFQATQRFNKKKLVVEMMIRQVQADASSTRDLDKNAFMRDVKEEIRGVYA 627
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 628 GVIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAV 687
Query: 167 DGCDDNGDAVAGL------DMRSSMES--------------------PLSWIRADPEMEY 200
G D N + + D+ S E WIR D + E+
Sbjct: 688 TGGDPNAETQEDVLLYCLGDVLQSEEEVAEWRLRDWSEEEEERMSRESYEWIRMDADFEW 747
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRV 260
+ ++ P M+++QL++D DVVAQ + L +L H + + L D + F+ +
Sbjct: 748 ICKLSLAMPGYMYLSQLQQDRDVVAQLEVSKFLYSLEHELISTI--FVRTLMDRRYFYGI 805
Query: 261 RIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSEYFVLE 315
R+ AA AL A++ W GL HL R F E LP R NDF D + YF+
Sbjct: 806 RVAAARALVKHATDAVGWVGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAAYFIQL 860
Query: 316 AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL------ 369
AIP +++ +R A+ K+P F+ LK+NDN+ N YSD F++A L++S+
Sbjct: 861 AIPESISKIRDANGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMESLCHAMLGKPE 920
Query: 370 -------EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIA 415
+F + +L + IDR + D S+ I + + +R Q+
Sbjct: 921 RPSDEMDDFDMERVLEAQAEEQLEKDAIAEIDRYRRMDEWSSSFQNIYSRAALR--CQVR 978
Query: 416 LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCN 459
L + +SLD + L P+ T +R++A L++L+ N
Sbjct: 979 LMEAKILSLDIMQFL--PYTRAGTYDLLRLDAFECLIELDIFRN 1020
>gi|154299901|ref|XP_001550368.1| hypothetical protein BC1G_10841 [Botryotinia fuckeliana B05.10]
gi|347841569|emb|CCD56141.1| similar to aminopeptidase N [Botryotinia fuckeliana]
Length = 1585
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 145/541 (26%), Positives = 246/541 (45%), Gaps = 74/541 (13%)
Query: 5 QMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGC 64
Q G+ R I + I++ T+ST+ FR K+ FF +WV GC
Sbjct: 551 QGGTGISRVISKMIVNANANGGTDSTISTEGFRKSCEKITKYRD--TDSFFAQWVHGSGC 608
Query: 65 PVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT---DSENRDGDIGWP------ 115
P + +NK++ VE+ + + T P L + + DG + P
Sbjct: 609 PHFTITQKFNKKRLCVEMTITQ--TQAHLETEPKLHKDGFYRAFKELDGGVSQPKTQSTF 666
Query: 116 -GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSK---LAARRALKPKK--GSKPDG 168
G ++IR+HE DG Y+H + M + +EI ++K L R +K + G+ DG
Sbjct: 667 TGPLTIRIHEADGTPYEHIV--MIREKTSRIEIPYNTKYKRLKRNRRMKERANAGTAIDG 724
Query: 169 CDDNGDAV---------AGLDMR------------SSMES-PLSWIRADPEMEYLAEIHF 206
+++ DA+ + DM+ + ME+ WIR D + E++ + F
Sbjct: 725 LENHDDALIYCLGDVLQSPQDMQEWGLRDWDPITEAKMEAESYEWIRVDADFEWICDKKF 784
Query: 207 N-QPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
M+++QL++D D+ AQ +++A L+ P V L L D + F+ +R AA
Sbjct: 785 TGMEAYMYVSQLQQDRDISAQYESMAYLDNAPPHPL-VATFLIRTLMDRRYFYGIRAMAA 843
Query: 266 YALANTASE-ETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMV 324
+LA A E + W GL HL K Y + P+ NDF D Y++ AI AV+ +
Sbjct: 844 KSLAKHAVEKDVKWIGLKHLEKAYLEFFYFPGGKTPKANDFSDKQAYWIECAIVEAVSRI 903
Query: 325 RAA-DNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV----------------- 366
R+ D+ P A F+ +L++NDN NPYSD F +A +++S+
Sbjct: 904 RSMYDSTCPSTARRFIFDVLRFNDNGNNPYSDSFKIANILKSLADSLIPGAKGVNNDVLQ 963
Query: 367 ---GELEFG--QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGF 421
E + G ++ + +L +DR + D + SY I TI+ I ++ L +G
Sbjct: 964 LDDAEEDGGDKKEDPDYRKLILDELDRYSRMDEWIDSYQNIYTITAIE--CKLKLMNAGV 1021
Query: 422 ISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQ 481
I +D + + F D + +VR++A+ AL DL F + + L + S + P +R +
Sbjct: 1022 IQVD-ALDFAQYFHD-GSAQRVRIKAAWALADLGFLKSTAVACLLVTAASTDPSPWVRKE 1079
Query: 482 V 482
+
Sbjct: 1080 L 1080
>gi|226291005|gb|EEH46433.1| transcription initiation factor TFIId 127kD subunit [Paracoccidioides
brasiliensis Pb18]
Length = 1238
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 148/550 (26%), Positives = 237/550 (43%), Gaps = 95/550 (17%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P +L+T + ++G+ + L +FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLHARMGDLPNGSLATSFLQKICERLGHTK---LDQFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSR---TPVLSSNTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + +P R + E ++
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLDKLTFMRDVKEEIKNVYA 627
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 628 GAVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687
Query: 167 DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
G D N GD + + R ES WIR D + E
Sbjct: 688 SGLDPNAELQDDVLLYCLGDVLQSEEEIQKWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
++ ++ P M+++QL++D D+VAQ ++I AA P +S + T L D +
Sbjct: 747 WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFIRT----LMDRRY 802
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSEY 311
F+ +R AA AL A E+ DW GL HL F E LP R NDF D S Y
Sbjct: 803 FYGIRAAAAKALVKHAKEDVDWIGLFHL-----ESAFQELFCLPGSPMTRSNDFSDRSSY 857
Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---- 367
+ IP A+A VR + K+P F+ LK+NDN+ + SD +++AAL++S+
Sbjct: 858 ILQLTIPQAIAQVRDSTGKTPMRVKRFLYDKLKFNDNSNSEISDCYYVAALMKSICDALL 917
Query: 368 ----------------ELEFGQQSILFLS-SLLKRIDRLLQFDRLMPSYNGIL---TISC 407
E+E +Q+ L + IDR + D S+ I I C
Sbjct: 918 GRKENEPRNIDMDFDMEMELERQAEQQLDRDCIAEIDRYRRMDEWTSSFQNIYARTAIGC 977
Query: 408 IRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSL 467
R Q +G I LD +V+ ++ R T +R++A L +EF I L+
Sbjct: 978 QRRFMQ-----AGVIELD-LVQFLQYTRA-GTYEMLRLDAFDIL--IEFDIFQIPELLTW 1028
Query: 468 FIKSVEEEPS 477
FI + +PS
Sbjct: 1029 FIYVMSTDPS 1038
>gi|70983546|ref|XP_747300.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
[Aspergillus fumigatus Af293]
gi|66844926|gb|EAL85262.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
[Aspergillus fumigatus Af293]
gi|159123694|gb|EDP48813.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
[Aspergillus fumigatus A1163]
Length = 1358
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 139/526 (26%), Positives = 232/526 (44%), Gaps = 90/526 (17%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P +++ +F+ ++G+ + L FF
Sbjct: 508 ILDRRLTKASGKATMSRIISRMFLNARMGDIPNGAITSSQFQKICERLGHAK---LDTFF 564
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+WV GCP + +NK+K +VE+ + + +P +R L NT + +I
Sbjct: 565 QQWVYGAGCPRFQATQRFNKKKLVVEMMIKQVQADQPSTRD--LDKNTFMRDVKEEIRGV 622
Query: 114 --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
+ G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 623 YAGNIQPVFSGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 682
Query: 165 KPDGCDDNGDAVAGL---------------------------DMRSSMESPLSWIRADPE 197
G D N + + + R ES WIR D +
Sbjct: 683 AVSGGDPNAETQEDVLLYCLGDVLQTEEEMQEWRLADWSKEDEDRMGQES-YEWIRMDAD 741
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDS 254
E++ ++ P M+++QL++D DVVAQ Q +AA P +S V T L D
Sbjct: 742 FEWICKLSLVMPGYMYLSQLQQDRDVVAQLESLQYMAAQREHPLISTIFVRT----LMDR 797
Query: 255 KAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFS 309
+ F+ +R AA AL A EET+W GL HL R F E LP R NDF D +
Sbjct: 798 RYFYGIREAAARALVKHAKEETNWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRA 852
Query: 310 EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS---- 365
Y + AIP A++ VR D K+P F+ LK+NDN+ N YSD F++A L+QS
Sbjct: 853 AYALQLAIPEAISKVRDNDGKTPLRVKRFLYDKLKFNDNSNNEYSDHFYVATLMQSLCHA 912
Query: 366 -VGELE--------FGQQSILFLSS-------LLKRIDRLLQFDRLMPSYNGILTISCIR 409
+G++E F + +L + + + DR + D S+ + + + +R
Sbjct: 913 MLGKVESRDVDIDDFDMERMLEIQAEEQLEKDAIAEFDRYRRMDEWSSSFQNVYSRAALR 972
Query: 410 TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
Q+ L + + +D + P+ T +R +A L++L+
Sbjct: 973 --CQMQLMRAKILDVD--IMQFLPYTRAGTYDLLRADAFECLVELD 1014
>gi|307172277|gb|EFN63782.1| Transcription initiation factor TFIID subunit 2 [Camponotus
floridanus]
Length = 1192
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 123/479 (25%), Positives = 219/479 (45%), Gaps = 64/479 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS---------PVRTLSTKEFRHFANKVGNLERPFL 51
MLE ++G ++ +S A A+ P +ST F A + + +
Sbjct: 448 MLEHRIGQELLLQVFNKQLSLAANAAQQKIESGLWPHMLISTNVF---AKAIFTVTGKDM 504
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV T G + F +N+++N VEL + +D + ++ G
Sbjct: 505 SVFIDQWVRTGGHAKFSLSFVFNRKRNTVELEIRQD-----------------TAHQRGI 547
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + + + ELDG + H L + G + +I CHSK + K +
Sbjct: 548 RKYVGPLVVNIQELDGTFKH-TLQIEGTMARA-DITCHSKSRRNKKKKIPLCT------- 598
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
G+ V +D+ + +SP+ WIR DPEM + + QP W QL + DV AQ +AI
Sbjct: 599 -GEEV-DMDLSAMDDSPVLWIRLDPEMTIMRAVQIEQPDYQWQYQLRHERDVTAQLEAIV 656
Query: 232 ALE--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFY 288
AL+ + P + +T+ N +++VR+ AA+ L A+ WAG ++ +
Sbjct: 657 ALQHHSTPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIF 712
Query: 289 KSRRFDENIG-LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
+ + + + N+F +F YF+ + IP A+A +R A P E + F++ L KYND
Sbjct: 713 RKLFGSASCRRIIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLSFLMDLFKYND 772
Query: 348 NNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
N+ N YSD ++ AAL++++G + Q + + SL L+ + R L ++L
Sbjct: 773 NSKNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSIDTKAILEEVTRNLNLEKL 832
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+P Y ++++C++ + L F L + + + + VR+ A AL+D
Sbjct: 833 LPCYKYTVSVACLKVIR----ILQKFGHLPSNPHIFRAYAAYGQFIDVRIAALEALVDF 887
>gi|17542624|ref|NP_500378.1| Protein TAF-2 [Caenorhabditis elegans]
gi|351065369|emb|CCD61342.1| Protein TAF-2 [Caenorhabditis elegans]
Length = 1086
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 138/518 (26%), Positives = 239/518 (46%), Gaps = 75/518 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT--------LSTKE--FRHFANKVGNLERPF 50
ML++++G F ++L +++ S +T L+T E FR + G F
Sbjct: 494 MLQRRIGLEPFMRVLHRVLTVGLDMSEKKTTPAAWRHLLTTTESFFRSVISVTGKEIPSF 553
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
L +F V T G + F +N+++NIVE+ + +D D+E +G
Sbjct: 554 LSQF----VRTGGHAAFAVKFDFNRKRNIVEIEIKQD----------------DTEG-NG 592
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDG 168
+ G +S+ V E+DG + H I A EI CHSK +R K G + +
Sbjct: 593 RTQYTGPLSVVVQEVDGAFTHTI--QIDGAVSHAEISCHSKGRKQRKKKVPLLTGEEIEI 650
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
N DA ESP+ W+R DP+ + EI +QP+ W L + DV+AQ +
Sbjct: 651 DLTNMDA----------ESPILWLRIDPDYLLIREITISQPMFHWEYMLRYERDVIAQME 700
Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA---NTASEETDWAGLLHLV 285
A+ ++ALP S + + + + +++ K F+R+R AA+ L N SE + +
Sbjct: 701 ALERIQALP--SAHSRSVIVDAVANEKFFYRIRYRAAFVLTFVQNRKSEALTVGTPVLIN 758
Query: 286 KFYKSRRFDENIGLPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQ 341
F +S +PR N+F ++ +YFV++A+P A+A +R + + F+L
Sbjct: 759 MFRESFGSKAATNIPRSNNFVVTAQNLQQYFVMQALPQAIARLRRQSGECHEDVQPFLLD 818
Query: 342 LLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-----------LSSLLKRIDRLL 390
L+K+NDN+ N YSD F+ AAL S+ F ++ + L+K L
Sbjct: 819 LIKFNDNSTNRYSDDFYRAALYNSLASSVFPHDALPCHVELPENLSNDVRVLIKEFTYAL 878
Query: 391 QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRA 450
D + PS+ ++ + + L Q L+ G++ LD +L+ F N Q+R A
Sbjct: 879 NMDTVSPSWGRVVGAAALTGLYQ--LQKCGYLPLDS--QLLWTFSHPNCCVQMRRCAITL 934
Query: 451 LLDLEFH------CNGIDSALSLFIKSVEEEPSLRGQV 482
++D + +D + L + +E++PS+R +
Sbjct: 935 IIDRIVNDPHAADTRMLDLSRILELAELEQDPSIRRMI 972
>gi|302694747|ref|XP_003037052.1| hypothetical protein SCHCODRAFT_64031 [Schizophyllum commune H4-8]
gi|300110749|gb|EFJ02150.1| hypothetical protein SCHCODRAFT_64031 [Schizophyllum commune H4-8]
Length = 1812
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 94/290 (32%), Positives = 151/290 (52%), Gaps = 34/290 (11%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E++A I F+QP MW++QL++D DVVAQ +AI AL P + V +TL
Sbjct: 695 WIRMDADFEWIAFISFDQPDFMWVSQLQRDRDVVAQLEAIYALSRTP--TAIVSSTLTRT 752
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPR 301
+ + F+R+R EAA AL + A T++ GL HL K + +D ++ +P+
Sbjct: 753 VLVTNYFYRIRCEAALALVHCAVRRTEFLGLFHLFKLFLRYCYDTEDPKQDLFKHSYVPK 812
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
PNDF D +EYFV + + A+A VR + K+P +F++ LK+NDN N YSD F++
Sbjct: 813 PNDFSDIAEYFVRKHLVTAIARVRFENGKTPPVVRQFLIDQLKFNDNTTNAYSDAFYICT 872
Query: 362 LVQSV-------------------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
++ + G E + + L+ + I+R + D+L+PS + +
Sbjct: 873 IIDGLACATTSTAPPERGELLPTEGRAELTTEDVHLLNEAMSEIERYQEMDKLIPSPHNV 932
Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALL 452
+T++ I T L ++ +I + RD N QVR+ A AL
Sbjct: 933 VTVAVIEFYT--VLMVANYIP-SHPYRFFFYTRDGNYT-QVRIAAFDALF 978
Score = 43.9 bits (102), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 52/102 (50%), Gaps = 8/102 (7%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST F KV ++ ++ F +W+ GCP ++N++K VE+ + ++C
Sbjct: 510 LSTSYFLRVCRKVSGVD---VRGFAEQWIYGSGCPSFAFQATFNRKKMAVEITMRQEC-- 564
Query: 91 KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDH 131
P + ++ + + + + + G M+IR+HE DG Y+H
Sbjct: 565 -PAYKAHEGNAFSMEQFKPAPL-FEGQMTIRIHEADGTPYEH 604
>gi|164661735|ref|XP_001731990.1| hypothetical protein MGL_1258 [Malassezia globosa CBS 7966]
gi|159105891|gb|EDP44776.1| hypothetical protein MGL_1258 [Malassezia globosa CBS 7966]
Length = 1113
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/248 (33%), Positives = 137/248 (55%), Gaps = 28/248 (11%)
Query: 187 SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNT 246
+P WIR D + E++A+I F QP MW++QL++D DVVAQ A+ AL +P S +T
Sbjct: 102 APYEWIRLDADFEWMAQIQFEQPDYMWVSQLQRDRDVVAQLAAVHALSQMP--SLITSST 159
Query: 247 LNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYK-SRRFD----------E 295
L + +K F+R+R EAAY LAN A D GL HL ++ S D E
Sbjct: 160 LTRTVLVTKYFYRIRAEAAYGLANCALPHLDLLGLFHLFMLFRTSYCLDVPHEGDSTSLE 219
Query: 296 NIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSD 355
+P+PNDF D ++YFV A+ HA+A VR ++ F++ LL+YNDN+ N + D
Sbjct: 220 APCIPKPNDFSDMADYFVRRALIHAIARVRDHRGRALVIVQRFLIYLLRYNDNSTNRFVD 279
Query: 356 VFWLAALVQSV---------------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYN 400
++LA+++ ++ + + + + + ++ ++RL DRL+PSY+
Sbjct: 280 DYYLASIINALAGTLIPIDSAGYSTHADETYSPEEMELRAVAIQEVERLQVLDRLVPSYH 339
Query: 401 GILTISCI 408
++TI+ +
Sbjct: 340 NVVTIAAL 347
>gi|409051976|gb|EKM61452.1| hypothetical protein PHACADRAFT_156702 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1899
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 152/299 (50%), Gaps = 34/299 (11%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E++A I F+Q MW++QL++D DVVAQ +AIA L P+ + +T
Sbjct: 714 WIRIDADFEWIANIAFDQKDYMWVSQLQRDRDVVAQYEAIAVLAKQPNAI--ISSTFTKT 771
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPR 301
+ S F+R+R EAA AL A + DW GL HL K + +D + +PR
Sbjct: 772 VLVSNYFFRIRCEAAQALVTCAIQRLDWIGLFHLFKLFLRYCYDPEDPNQDLFRHTYVPR 831
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
PNDF DF+EYFV +++ ++ VR + K+P +F++ L+YNDN NP+SD ++ +
Sbjct: 832 PNDFSDFAEYFVRKSLLQTISRVRFENGKTPAVVRQFLIDQLRYNDNTYNPFSDALYICS 891
Query: 362 LVQSV------------GELEFGQ-------QSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
++ ++ GE + + L + +DR DRL+PS + +
Sbjct: 892 IISALACATVSTVPPERGEFVAAEARPPMDPEDDKLLKEAVSEVDRYRSMDRLIPSVHNV 951
Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
++I+ I + L ++ I D +++ P QVR+ A +L +++ I
Sbjct: 952 VSIATIEFY--LLLSVANLIPHDP--RMLIPLTREGNFTQVRMAAFDSLFFTKWYTPAI 1006
>gi|260807981|ref|XP_002598786.1| hypothetical protein BRAFLDRAFT_212923 [Branchiostoma floridae]
gi|229284061|gb|EEN54798.1| hypothetical protein BRAFLDRAFT_212923 [Branchiostoma floridae]
Length = 995
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 127/503 (25%), Positives = 232/503 (46%), Gaps = 67/503 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
++E+++G ++ ++S A A+ + LS+ F + V + L
Sbjct: 448 LVEQKIGFESLLQVFNKLLSLATSAASQKYSPTNWSNMLLSSASFVKLISTVSGKDIHLL 507
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
+ +WV G F +N+++N+VEL + G
Sbjct: 508 LD---QWVYQGGVVQFSGSFVFNRKRNLVELEL------------------NQKVGEHGI 546
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G +++ + ELDG ++H + D E+ CHSK + K
Sbjct: 547 QKYVGPLTVTIQELDGSFNHSV--QIEDNTTRHELVCHSKSRRNKKKKIPLV-------- 596
Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
NG+ V +D+ + +SP+ WIR DP++ + ++ + QP MW QL + D++AQ +I
Sbjct: 597 NGEEV-DMDLSAMDADSPVLWIRVDPQLTLVRKVEWQQPDFMWQYQLRYERDIIAQHDSI 655
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYK 289
ALE P + + L + L + F+RVR+EAA+ L+ A+ + W G ++ +K
Sbjct: 656 MALEQFPTPASRLA--LTDILENESCFYRVRMEAAFCLSKVANAMVSSWMGPPAMMSMFK 713
Query: 290 SRRFDENI--GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
R F + L + N+F + YF+ + +P A+ VR A + P++ ++F+LQL+KYND
Sbjct: 714 -RMFCTQMYTNLIKCNNFSNLQGYFLQKTMPVAMGAVRDAHGRCPKDVLQFILQLIKYND 772
Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILFLSS--------------LLKRIDRLLQFD 393
N N +SD ++ A+L+ ++ ++L S+ +L+ + RLL +
Sbjct: 773 NGKNKFSDNYYRASLIDALAATVTPSVTMLDTSTDSRTEQALLPETKAVLEDVVRLLNLE 832
Query: 394 RLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+L+P Y +T+SC++ + L+ +G L L K + D VR+ A L D
Sbjct: 833 KLLPCYQFTITVSCLKAIR--TLQKNG--HLPSNAALFKSYADPGQYVDVRLVAMEILTD 888
Query: 454 LEFHCNGIDSALSLFIKSVEEEP 476
H + + L + VE +P
Sbjct: 889 F-VHVDQDEEVLGWLLDYVEIDP 910
>gi|119188493|ref|XP_001244853.1| hypothetical protein CIMG_04294 [Coccidioides immitis RS]
gi|392867767|gb|EAS33464.2| transcription initiation factor TFIID complex 127kD subunit
[Coccidioides immitis RS]
Length = 1253
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 138/530 (26%), Positives = 232/530 (43%), Gaps = 88/530 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K G +IL I + G P L+T F+ + G+++ L FF +WV
Sbjct: 501 LSKASGKATVSRILSRIFLNCRMGDIPNSALTTIFFQKLCERFGHVK---LDAFFQQWVY 557
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSR---TPVLSSNTDSENRDGDIG---- 113
GCP +NK+K +VE+ + + + + +R + E ++ G
Sbjct: 558 GAGCPRFVATQRFNKKKLVVEMMIRQVQSEQLSARDLEKSTFMRDVKEELQNIYAGTLQH 617
Query: 114 -WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGSKPDGC 169
+ G M+IR+HE DG Y+H + +A +I ++K R + ++ + G
Sbjct: 618 VFTGSMTIRIHEADGTPYEHIV--EIKEAVTKFDIPYNTKYKRLKRNKRQRERAAASSGV 675
Query: 170 DDNGDAVAGL------DMRSSMESPL--------------------SWIRADPEMEYLAE 203
D N +A + D+ S E L WIR D + E++ +
Sbjct: 676 DPNSEAQDDVLLYCLGDVLQSEEEILKWRLVEWTKEDEDRMGQESYEWIRMDADFEWICK 735
Query: 204 IHFNQPVQMWINQLEKDGDVVAQAQA-------IAALEALPHLSFNVVNTLNNFLS---- 252
+ P M+++QL++D DVVAQ +A +AA + + + + +S
Sbjct: 736 MSLTMPGYMYLSQLQQDRDVVAQLEASNSRVGFMAAANCIQSIQYMAAQREHPLISTIFA 795
Query: 253 ----DSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDF 308
DS+ F +R AA+AL A E+ DW GL HL ++ N + R N+F D
Sbjct: 796 RTLMDSRYFHGIRTAAAHALVKHAKEDVDWIGLFHLETAFQELFCLPNSPMTRSNNFSDR 855
Query: 309 SEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG- 367
S Y + AIP+A++ VR + + P F+ + LK+NDN+ N +SD +++AAL++S+
Sbjct: 856 SSYALQIAIPNAISKVRNRNGRCPMRVKRFLYEKLKFNDNSNNEFSDCYYVAALMRSIAN 915
Query: 368 -------------------ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---I 405
ELE + L+L + + I+R + D S+ + T +
Sbjct: 916 ALIGGKESIHTKSKLDMTQELERQAEDQLYLDCIAE-INRYKRMDEWTSSFQNLYTRTAL 974
Query: 406 SCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
C+R L + LD V P+ T VR++A R LLDL+
Sbjct: 975 DCLRRL------MKEHRVLDFDVAQFLPYTRIGTFDLVRLDAFRILLDLD 1018
>gi|307205887|gb|EFN84045.1| Transcription initiation factor TFIID subunit 2 [Harpegnathos
saltator]
Length = 1103
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 123/483 (25%), Positives = 220/483 (45%), Gaps = 72/483 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H F + + +
Sbjct: 361 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 419
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + F +N+++N VEL + +D T ++ G
Sbjct: 420 FIDQWVRTGGHAKFTLSFVFNRKRNTVELEIKQDTT-----------------HQRGIRK 462
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + + + ELDG + H L + G + +I CHSK + K + G
Sbjct: 463 YVGTLLVNIQELDGTFKH-TLQIEGTLAKA-DITCHSKSRRNKKKKIPLCT--------G 512
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM + + QP W QL + DV AQ +AI AL
Sbjct: 513 EEV-DMDLSAMDDSPVLWIRLDPEMTLMRAVQIEQPDYQWQYQLRHERDVTAQLEAIEAL 571
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYK- 289
+ + P + +T+ N +++VR+ AA+ L A+ WAG ++ ++
Sbjct: 572 QNHSTPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMMAIFRK 627
Query: 290 ------SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
RR + + N+F +F YF+ + IP A+A +R A P E + F++ L
Sbjct: 628 LFGSASCRR------IIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLF 681
Query: 344 KYNDNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQ 391
KYNDN+ N YSD ++ AAL++++G + Q + + SL L+ + R L
Sbjct: 682 KYNDNSKNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSIDTKAILEEVTRNLN 741
Query: 392 FDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
++L+P Y ++++C++ + L F L L + + + +R+ A AL
Sbjct: 742 LEKLLPCYKYTVSVACLKVIR----ILQKFGHLPSNPNLFRAYAAYGQFIDIRIAALEAL 797
Query: 452 LDL 454
+D
Sbjct: 798 VDF 800
>gi|403417076|emb|CCM03776.1| predicted protein [Fibroporia radiculosa]
Length = 1900
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 94/285 (32%), Positives = 144/285 (50%), Gaps = 34/285 (11%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E++A I F+Q MW++QL++D DVVAQ +AI L P+ + +TL
Sbjct: 789 WIRIDADFEWIAGIAFDQKDYMWVSQLQRDRDVVAQYEAILVLSKSPNAI--ISSTLTKT 846
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF---DENIGL------PR 301
+ S ++R+R EAA AL + + +W GL HL K + + D N L PR
Sbjct: 847 VLVSNYYFRIRCEAAQALVSCGIQRLEWLGLFHLFKLFLRYCYEPEDPNPDLFSHTYVPR 906
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
PNDF D +EY+V +++ A+A VR + K+P +F++ L+YNDN NPYSD ++
Sbjct: 907 PNDFSDLTEYYVRKSLLRAIARVRFENGKTPPIIRQFLIDQLRYNDNTANPYSDALYICT 966
Query: 362 LVQ-------SVGELEFGQ------------QSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
++ SV E G+ Q L L +DR DRL+P+Y+ +
Sbjct: 967 IISALATATISVVPPERGEFIATDSQPVQDSQDADLLRQALAEVDRYRSMDRLIPTYHNV 1026
Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEA 447
+T++ I + L + G I D + P QVR+ A
Sbjct: 1027 VTVAVIEFY--LMLSMGGLIPHD--ARTFFPLTREGNYTQVRIAA 1067
Score = 45.1 bits (105), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 32/141 (22%), Positives = 65/141 (46%), Gaps = 21/141 (14%)
Query: 1 MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++G + R + + +S G P +ST F KV ++ + F
Sbjct: 569 ILDRRLGKSGTSLGLSRVLPKVFLSAISGEMPNNAISTHSFLRTCRKVSGIDP---RSFA 625
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+W+ GCP ++N++K VE+++ ++ P ++ ++E +
Sbjct: 626 EQWIYGSGCPTFGFSATFNRKKMAVEISMRQEA--------PAYKAHENNEITRTLMKPV 677
Query: 114 --WPGMMSIRVHELDGM-YDH 131
+ G M+IR+HE DG Y+H
Sbjct: 678 PFFEGQMTIRIHEADGTPYEH 698
>gi|332022415|gb|EGI62723.1| Transcription initiation factor TFIID subunit 2 [Acromyrmex
echinatior]
Length = 1205
Score = 149 bits (375), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 120/477 (25%), Positives = 219/477 (45%), Gaps = 60/477 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H FA + + +
Sbjct: 457 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIESGLWSHMLISTNVFAKAIFTVTGKDMSV 515
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + F +N+++N VEL + +D T ++ G
Sbjct: 516 FIDQWVRTGGHAKFSLSFIFNRKRNTVELEIRQDTT-----------------HQRGIRK 558
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + + + ELDG + H L + G + +I CHSK + K + G
Sbjct: 559 YVGPLVVNIQELDGTFKH-TLQIEGTMARA-DITCHSKSRRNKKKKIPLCT--------G 608
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM + + QP W QL + DV AQ +AI AL
Sbjct: 609 EEV-DMDLSAMDDSPVLWIRLDPEMTIMRAVQIEQPDYQWQYQLRHERDVTAQLEAIEAL 667
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
+ + P + +T+ N +++VR+ AA+ L + W+G ++ ++
Sbjct: 668 QHHSTPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVGNAMVATWSGPPAMLAIFRK 723
Query: 291 RRFDENIG-LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
+ + + N+F +F YF+ + IP A+A +R A P E + F++ L KYNDN+
Sbjct: 724 LFGSASCRRIIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLFKYNDNS 783
Query: 350 GNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQFDRLMP 397
N YSD ++ AAL++++G + Q + + SL L+ + R L ++L+P
Sbjct: 784 KNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSIDTKAILEEVTRNLNLEKLLP 843
Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
Y ++++C++ + L F L L + + + +R+ A AL+D
Sbjct: 844 CYKYTVSVACLKVIR----ILQKFGHLPSNPHLFRAYAAYGQFIDIRIAALEALVDF 896
>gi|154283629|ref|XP_001542610.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
capsulatus NAm1]
gi|150410790|gb|EDN06178.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
capsulatus NAm1]
Length = 1170
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 133/512 (25%), Positives = 216/512 (42%), Gaps = 83/512 (16%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L+T F+ ++G+ + L FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLNARMGDLPNGALATSFFQKTCERLGHTK---LDTFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + +P R +S+ + E R+
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLEMSTFMRDVKEEIRNVYA 627
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 628 GPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687
Query: 167 DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
G D N GD + + R ES WIR D + E
Sbjct: 688 SGIDPNAELQDDVLLYCLGDVLQSEEEIQTWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
++ ++ P M+++QL++D D+VAQ ++I AA P +S V T L D +
Sbjct: 747 WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRRY 802
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
F+ +R AA AL A +E DW GL HL ++ + R NDF D + Y +
Sbjct: 803 FYGIRAAAARALVKHAKDEVDWIGLYHLESAFQELFCLPGSPMTRSNDFSDRAAYILQLT 862
Query: 317 IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG---- 372
IP A+A VR K+P F+ LK+NDN+ N SD +++ AL++S+ E G
Sbjct: 863 IPQAIAQVRDNSGKTPMRVKRFLYDKLKFNDNSNNELSDCYYITALMKSICEALLGRRET 922
Query: 373 -----------------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLT 412
Q + IDR + D S+ I + I C R L
Sbjct: 923 EIDGVGVDFDVERELERQAERQLDRDCIAEIDRYRRMDEWTTSFQNIYSRTAIECQRRLM 982
Query: 413 QIALKLSGFISLDQVVKLIKPFRDFNTIWQVR 444
Q + +D+ + P+ + + +VR
Sbjct: 983 QANV-------IDRDIMQFLPYTELESSTEVR 1007
>gi|213410437|ref|XP_002175988.1| transcription initiation factor TFIID subunit 2
[Schizosaccharomyces japonicus yFS275]
gi|212004035|gb|EEB09695.1| transcription initiation factor TFIID subunit 2
[Schizosaccharomyces japonicus yFS275]
Length = 1147
Score = 148 bits (374), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 134/489 (27%), Positives = 222/489 (45%), Gaps = 74/489 (15%)
Query: 2 LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K GS +++ ++ +A G LST F K +++ L F +W+
Sbjct: 452 LTKTGGSLGMSRVIPKLLLQAMSGDMHNGCLSTTHFLKTCEKASHIK---LDVFAQQWIY 508
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT----DSENR--DGDIGW 114
G P+ R+ +N++K I+E+ + + P LS N+ D R +G+
Sbjct: 509 GFGYPIFRVVQRFNRKKMIIEMGINQ----VQAKEMPKLSRNSLDFMDEATRHLNGETVK 564
Query: 115 P------GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPD 167
P G M+IR+HE DG Y+H + D + L+IQ ++K + K + +
Sbjct: 565 PVIPVFSGPMTIRIHEADGTPYEHVV--ELKDTFTKLDIQYNTKYKRISRNRSSKNVRKE 622
Query: 168 GCDDNGDAVAGLDMRSSMES------------------------------PLSWIRADPE 197
G DD+G GLD ++S WIR D +
Sbjct: 623 G-DDHG----GLDSDYVIKSLGDILQTDEDVENWHLYDYTKEDEDAMATEAFEWIRVDAD 677
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL-SFNVVNTLNNFLSDSKA 256
E++ + QP M+++QL++D DVVAQ +AI + S V L L DS+
Sbjct: 678 FEWICGLRVGQPEHMYVSQLQQDRDVVAQLEAIRYFCSDSFRPSRQVSTVLLRTLMDSRY 737
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
++ +R EAA ALA A+EE +W G HL K + + EN +P+ N F +Y++
Sbjct: 738 YYGIRQEAALALAKCATEELEWIGHFHLRKAFLDKFCFENSTIPKSNVFSSIIDYYLKCT 797
Query: 317 IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV-------GE- 368
+ HA + +R +P + +L LL +NDN N SD ++ L++++ GE
Sbjct: 798 MVHAFSRIRDRKGDTPITVKKLLLDLLCFNDNANNELSDAHYICHLIEALVDALIPRGET 857
Query: 369 LEFG---QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLD 425
++F + F+ +++ IDR L+ D MPSY +++ RT + KLS L
Sbjct: 858 MQFAFSDAGHMQFVKTVIAEIDRYLRIDGWMPSYCNVIS----RTALEGKNKLSQVFHLG 913
Query: 426 QVVKLIKPF 434
VK + P+
Sbjct: 914 FEVKDLFPY 922
>gi|393907247|gb|EFO22550.2| TATA binding protein associated factor [Loa loa]
Length = 1083
Score = 148 bits (374), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 137/485 (28%), Positives = 231/485 (47%), Gaps = 68/485 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEFRHFANKVGNLERPFL 51
ML +++G F +++Q I+S A AS ++ T+ST+ F + V E P
Sbjct: 480 MLNQRLGKEPFLQVIQKILSVAMQASQLQKEPAHWQHMTVSTESFFRTVSSVTGQELP-- 537
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +W+ G + +++N+++N++EL V ++ S NR G
Sbjct: 538 -TFLEQWIHNGGHAKFWVQYAFNRKRNMIELEVKQE----------------PSANR-GR 579
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + + V ELDG + H + + + ++QCHSK RR K KK G +
Sbjct: 580 QSYVGPLIVVVQELDGSFTHTVQIDSNHSKH--DLQCHSK--GRRQ-KKKKIPLSTGEEL 634
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
D +A +D +SP+ WIR DP++ L +++ QPV W L+ + DV+AQ QA+
Sbjct: 635 EID-LANMDP----DSPVLWIRIDPDLLLLRKVNIQQPVYQWEYMLKYERDVLAQLQALD 689
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAG----LLHLVK 286
L+ P S + L + F+RVR AAY LA ++ + W G ++ K
Sbjct: 690 ILQRFP--SPHARAVLIETIETEAFFYRVRCRAAYILAEVVNKLPETWLGPPALMILFKK 747
Query: 287 FYKSRRFDENIGLPRPNDFRDFSE----YFVLEAIPHAVAMVRAADNKSPREAVEFVLQL 342
F+ + +P+PN+F S YF++ AIP A+A +R++ N + E F++ L
Sbjct: 748 FFGCKSAPH---IPKPNNFVATSANLQLYFLMHAIPQALARLRSSSNGALSEVHHFLIGL 804
Query: 343 LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQS--ILFLS---------SLLKRIDRLLQ 391
+KYNDN N YSD + A+L+ S+ +S I+ +S LL L
Sbjct: 805 IKYNDNTVNRYSDDHYRASLINSLAATLIPAESRDIIDMSPLSLDTEMRELLSEFTLALN 864
Query: 392 FDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
D L PS+ I+ I+ ++ + ++ + I LD K+ F N +R+ A L
Sbjct: 865 MDTLKPSFGRIIGITALQAIYH--MQRTFHIPLDP--KVFWSFAQPNMYVPMRLAALTFL 920
Query: 452 LDLEF 456
+D+ +
Sbjct: 921 VDMVY 925
>gi|358335688|dbj|GAA28713.2| transcription initiation factor TFIID subunit 2 [Clonorchis sinensis]
Length = 1343
Score = 148 bits (374), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 121/443 (27%), Positives = 206/443 (46%), Gaps = 58/443 (13%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST FR V + ++ F +WV G L + F +N+++N+VEL + +D
Sbjct: 692 LSTSSFRRIILMVTGQD---IQNFLDQWVFRTGHVRLFVKFHFNRKRNVVELELKQDLQS 748
Query: 91 KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHS 150
+ G + + G +++ + ELDG + H G + ++ CHS
Sbjct: 749 R------------------GTLSYTGPLTVMLQELDGAFMHTFKLEEGRMAR--DLPCHS 788
Query: 151 KLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPV 210
K + K +K P NGD V R ESPL W+R DP++ + I+ QP
Sbjct: 789 K-----SRKHRKKRIPLA---NGDEVDMDLGRIDAESPLLWVRMDPDLAVIHTIYVEQPD 840
Query: 211 QMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN 270
MW L D D + Q +A AL P S + L N +++ + F+RVR +A ++L +
Sbjct: 841 FMWHLMLLHDRDCLGQLEATHALRDFP--SPETRHALANVIANERVFYRVRTDACFSLCH 898
Query: 271 TASEETDWAGL----------LHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHA 320
A+E + G + L F++ GL R N+F+ YF++ AI +
Sbjct: 899 IANELSAMGGAQGTTANAATSVLLPLFWQLFSSPAARGLVRQNNFKHLQHYFLMRAICRS 958
Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL---EFGQQSIL 377
+A +R PRE + F+ +LL++NDN N YSD +LA L++++ ++ + ++
Sbjct: 959 LATLR-VQQVCPREVLPFISELLRHNDNTRNEYSDYCYLADLIRTMKDILTPAIVVRGVM 1017
Query: 378 FLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKL 430
SSL ++ + L D PSY G +T+ C+ T+ + L+ GF+ +D +
Sbjct: 1018 SSSSLPVEARTVIEEVVHHLNLDTQTPSYKGTITVVCLATIRR--LQRLGFLPVDPSLFY 1075
Query: 431 IKPFRDFNTIWQVRVEASRALLD 453
+ F + VR+ L+D
Sbjct: 1076 LYASPGF--YYDVRLAGLECLVD 1096
>gi|310789844|gb|EFQ25377.1| hypothetical protein GLRG_00521 [Glomerella graminicola M1.001]
Length = 1554
Score = 148 bits (374), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 142/527 (26%), Positives = 234/527 (44%), Gaps = 71/527 (13%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
++T+ F+ K R L+ F+ +WV GCP + +NK++ VE+ + R
Sbjct: 612 INTESFQRVCEKNA---RNKLEAFWAQWVYGSGCPRFDVFQRFNKKRLCVEMTI-RQVQD 667
Query: 91 KPDSRTPVLSSN---------TDSENRDGDIG-WPGMMSIRVHELDGM-YDHPILPMAGD 139
K S VL + + D +G + G M+IR+HE DG Y+H I+ + D
Sbjct: 668 KALSEAEVLKKTEFLRAIKEKANGVHVDAAMGLFTGPMTIRIHEADGTPYEH-IVEIRED 726
Query: 140 AWQLLEIQC---------HSKLAARRALKPKKGSKPDGCDDN-----GDAVAGL------ 179
A + + + K A L + + PD ++ GD +
Sbjct: 727 AAKAAKFEIPYNTKYKRLKRKRRATEKLSAQSAADPDNNEEALLYCLGDVLTSPEDMRKW 786
Query: 180 -------DMRSSM-ESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
D+ +M + WIR D + E+ E+ N M+++QL++D DVVAQ ++
Sbjct: 787 EFSEWEPDIEKAMDQESYEWIRMDADFEWACEMKTNLQSYMYVSQLQQDRDVVAQQDSML 846
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR 291
L+ P V L L D + F VR AA L + A+E+ AG+ HL++ +
Sbjct: 847 YLKKSPPHPL-VSTFLTRTLVDRRYFHGVRTMAADMLYHHATEKVAMAGMTHLMRAFTEL 905
Query: 292 RFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGN 351
+ PRPNDF D +Y V A+P A+A VR ++ K P A +L +++N+N N
Sbjct: 906 FCYPDSATPRPNDFSDKKQYLVQSALPMAIAKVRDSNGKCPEPARRLLLNQVQFNNNANN 965
Query: 352 PYSDVFWLAALVQSVG---------------ELEFGQQSILFLSSLLKRIDRLLQFDRLM 396
PYSD F++A L++++ +L+ + FL + IDR + D
Sbjct: 966 PYSDHFYVAKLLEALAHSLIPEKRRDAGDSMDLDTSAEESSFLEEAIVEIDRFRRMDEWA 1025
Query: 397 PSYNGILTIS---CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
SY I T + C R L + +G I + I+ +D +T VRV+A AL+D
Sbjct: 1026 NSYQNIWTTTALECRRKLMK-----AGVIP-KSALDFIQYLQD-DTCDLVRVKAFEALVD 1078
Query: 454 LEFHCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDS 500
L + L + + S + P +R ++ L I + + G S
Sbjct: 1079 LGYMMELPIFRLIVSVMSTDRSPFVRDKL-LKIFCTGLASLAFGEQS 1124
>gi|443429476|gb|AGC92654.1| transcription initiation factor TFIID subunit 2-like protein
[Heliconius erato]
Length = 1364
Score = 148 bits (374), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 133/536 (24%), Positives = 240/536 (44%), Gaps = 68/536 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
MLE+++G ++ +S A A+ + LST F + + +
Sbjct: 434 MLEQRIGQELLLQVFNKQLSLATNAANTKIGSGLWGHLLLSTNLF---VKAIFTVTGKDM 490
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV T G ++ +N+++N VEL + +D S + G
Sbjct: 491 AVFVDQWVRTGGHAKFQLTSVFNRKRNTVELEIRQD-----------------SVHERGI 533
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + +++ ELDG + H + Q+ I + + + K K C
Sbjct: 534 RKYVGPLLVQLQELDGTFKHTL--------QIENIVVKADITCHSKSRRNKKKKIPLC-- 583
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
G+ V +D+ + +SP+ WIR DPEM L +QP W QL + DV AQ++AI
Sbjct: 584 TGEEV-DMDLSAMEDSPVLWIRLDPEMSLLRSTIISQPDYQWQYQLRHERDVTAQSEAID 642
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKS 290
AL P + L + + + + ++++R AA+ L A+ + WAG ++ +
Sbjct: 643 ALHNYPEPA--TRKALTDTIENEQTYYKIRCRAAHCLTKVANAMISSWAGPPAMLTIF-- 698
Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
R+ + P + N+F + YF+ IP A+A +R P E V F+L L KYND
Sbjct: 699 RKMFGSFSAPHIIKQNNFDNLQHYFLQRTIPVAMAGLRNIHGICPPEVVRFLLDLFKYND 758
Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSIL---------FLSS----LLKRIDRLLQFDR 394
N+ N +SD ++ AALV ++ S+L LS+ +L+ I R+L ++
Sbjct: 759 NSKNHFSDNYYKAALVDALAATITPVISVLQPGAPITADSLSADTRLVLEEITRVLNLEK 818
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
++P Y ++++C+R + + L+ G L + + + + ++ VR+ A L+D
Sbjct: 819 VLPCYKNTVSVNCLRAIRR--LQQCG--HLPSIPTIFRAYAEYGQYIDVRLAAFECLVDF 874
Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLV 510
+G LS + +VE +P V+ G+ + + S H +DT +V
Sbjct: 875 -VRVDGKPEDLSSLLTAVENDPD--PGVRHGLARLLVSMPPFEKASRHRLDTEAVV 927
>gi|350633147|gb|EHA21513.1| hypothetical protein ASPNIDRAFT_45548 [Aspergillus niger ATCC 1015]
Length = 1272
Score = 148 bits (374), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 134/529 (25%), Positives = 227/529 (42%), Gaps = 88/529 (16%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P ++T F+ ++G+ + L FF
Sbjct: 511 ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAVTTSLFQRTCERLGHAK---LDSFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
+WV GCP + +NK+K +VE+ + + D + D + E R
Sbjct: 568 QQWVYGAGCPRFQATQRFNKKKLVVEMMIRQVQADASSTRDLDKNAFMRDVKEEIRGVYA 627
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 628 GVIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAV 687
Query: 167 DGCDDNGDA------------------VAGLDMRS--------SMESPLSWIRADPEMEY 200
G D N + VA +R WIR D + E+
Sbjct: 688 TGGDPNAETQEDVLLYCLGDVLQSEEEVAEWRLRDWSEEEEERMSRESYEWIRMDADFEW 747
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFN-----VVNTLNNFLSDSK 255
+ ++ P M+++QL++D DVVAQ LE+L +++ + L D +
Sbjct: 748 ICKLSLAMPGYMYLSQLQQDRDVVAQ------LESLQYMAIQREHELISTIFVRTLMDRR 801
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
F+ +R+ AA AL A++ W GL HL R F E LP R NDF D +
Sbjct: 802 YFYGIRVAAARALVKHATDAVGWVGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 856
Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL- 369
YF+ AIP +++ +R A+ K+P F+ LK+NDN+ N YSD F++A L++S+
Sbjct: 857 YFIQLAIPESISKIRDANGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMESLCHAM 916
Query: 370 ------------EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
+F + +L + IDR + D S+ I + + +R
Sbjct: 917 LGKPERPSDEMDDFDMERVLEAQAEEQLEKDAIAEIDRYRRMDEWSSSFQNIYSRAALR- 975
Query: 411 LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCN 459
Q+ L + +SLD + L P+ T +R++A L++L+ N
Sbjct: 976 -CQVRLMEAKILSLDIMQFL--PYTRAGTYDLLRLDAFECLIELDIFRN 1021
>gi|260808452|ref|XP_002599021.1| hypothetical protein BRAFLDRAFT_130728 [Branchiostoma floridae]
gi|229284297|gb|EEN55033.1| hypothetical protein BRAFLDRAFT_130728 [Branchiostoma floridae]
Length = 1059
Score = 148 bits (374), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 125/503 (24%), Positives = 232/503 (46%), Gaps = 67/503 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
++E+++G ++ ++S A A+ + LS+ F + V + L
Sbjct: 448 LVEQKIGFESLLQVFNKLLSLATSAASQKYTPTNWSNMLLSSASFVKLISTVSGKDIHLL 507
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
+ +WV G F +N+++N+VEL + G
Sbjct: 508 LD---QWVYQGGVVHFSGSFVFNRKRNLVELEL------------------NQKVGEHGI 546
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G +++ + ELDG ++H + D E+ CHSK + K
Sbjct: 547 QKYVGPLTVTIQELDGSFNHSV--QIEDNTTRHELVCHSKSRRNKKKKIPLV-------- 596
Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
NG+ V +D+ + +SP+ WIR DP++ + ++ + QP MW QL + D++AQ +I
Sbjct: 597 NGEEV-DMDLSAMDADSPVLWIRVDPQLTLVRKVEWQQPDFMWQYQLRYERDIIAQHDSI 655
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYK 289
ALE P + + L + L + F+RVR+EAA+ L+ A+ + W G ++ +K
Sbjct: 656 MALEQFPTPASRLA--LTDILENESCFYRVRMEAAFCLSKVANAMVSSWMGPPAMMSMFK 713
Query: 290 SRRFDENI--GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
R F + L + N+F + YF+ + +P A+ VR A + P++ ++F+LQL+KYND
Sbjct: 714 -RMFCTQMYTNLIKCNNFSNLQGYFLQKTMPVAMGAVRDAHGRCPKDVLQFILQLIKYND 772
Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILFLSS--------------LLKRIDRLLQFD 393
N N +SD ++ A+L+ ++ ++L S+ +L+ + R L +
Sbjct: 773 NGKNKFSDNYYRASLIDALAATVTPSVTMLDTSTDSRTEQALLPETKAVLEDVVRHLNLE 832
Query: 394 RLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+L+P Y +T+SC++ + L+ +G + + L K + D VR+ A L D
Sbjct: 833 KLLPCYQFTITVSCLKAIR--TLQKNGHLPSN--AALFKSYADPGQYVDVRLVAMEILTD 888
Query: 454 LEFHCNGIDSALSLFIKSVEEEP 476
H + + L + VE +P
Sbjct: 889 F-VHVDQDEEVLGWLLDYVENDP 910
>gi|317036312|ref|XP_001398085.2| TATA-binding protein associated factor Taf2 [Aspergillus niger CBS
513.88]
Length = 1245
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 133/529 (25%), Positives = 230/529 (43%), Gaps = 88/529 (16%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P ++T F+ ++G+ + L FF
Sbjct: 484 ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAVTTSLFQRTCERLGHAK---LDSFF 540
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
+WV GCP + +NK+K +VE+ + + D + D + E R
Sbjct: 541 QQWVYGAGCPRFQATQRFNKKKLVVEMMIRQVQADASSTRDLDKNAFMRDVKEEIRGVYA 600
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 601 GVIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAV 660
Query: 167 DGCDDNGDAVAGL------DMRSSMES--------------------PLSWIRADPEMEY 200
G D N + + D+ S E WIR D + E+
Sbjct: 661 TGGDPNAETQEDVLLYCLGDVLQSEEEVAEWRLRDWSEEEEERMSRESYEWIRMDADFEW 720
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFN-----VVNTLNNFLSDSK 255
+ ++ P M+++QL++D DVVAQ LE+L +++ + L D +
Sbjct: 721 ICKLSLAMPGYMYLSQLQQDRDVVAQ------LESLQYMAIQREHELISTIFVRTLMDRR 774
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
F+ +R+ AA AL A++ W GL HL R F E LP R NDF D +
Sbjct: 775 YFYGIRVAAARALVKHATDAVGWVGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 829
Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL- 369
YF+ AIP +++ +R A+ K+P F+ LK+NDN+ N YSD F++A L++S+
Sbjct: 830 YFIQLAIPESISKIRDANGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMESLCHAM 889
Query: 370 ------------EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
+F + +L + IDR + D S+ I + + +R
Sbjct: 890 LGKPERPSDEMDDFDMERVLEAQAEEQLEKDAIAEIDRYRRMDEWSSSFQNIYSRAALR- 948
Query: 411 LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCN 459
Q+ L + +SLD +++ + P+ T +R++A L++L+ N
Sbjct: 949 -CQVRLMEAKILSLD-IMQFL-PYTRAGTYDLLRLDAFECLIELDIFRN 994
>gi|83765542|dbj|BAE55685.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391864701|gb|EIT73995.1| TATA binding protein associated factor [Aspergillus oryzae 3.042]
Length = 1269
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 138/525 (26%), Positives = 231/525 (44%), Gaps = 88/525 (16%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P +ST F+ ++G+ + L FF
Sbjct: 511 ILDRRLTKASGKATMSRIISRLFLNARMGDIPNGAVSTMLFQKTCERLGHAK---LDSFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+WV GCP + +NK+K +VE+ + + + +P +R L NT + +I
Sbjct: 568 QQWVLGAGCPRFQATQRFNKKKLVVEMMIKQVQSEQPSTRD--LDKNTFMRDVKEEIRGI 625
Query: 114 --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
+ G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 626 YAGVVQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 685
Query: 165 KPDGCDDN------------GDAV--------------AGLDMRSSMESPLSWIRADPEM 198
G D N GD + + D + WIR D +
Sbjct: 686 AASGGDPNAEVQEDVLLYCLGDVLQSEEEIQEWKLADWSKEDEERMGQESYEWIRMDADF 745
Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDSK 255
E++ ++ P M+++QL++D DV+AQ Q +AA P +S V T L D +
Sbjct: 746 EWICKLSLVMPGYMYLSQLQQDRDVIAQLESLQYMAAQREHPLISTIFVRT----LMDRR 801
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
F+ +R+ AA AL A EE +W GL HL R F E LP R NDF D +
Sbjct: 802 YFYGIRVAAAKALIKHAKEEINWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 856
Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS----- 365
Y + IP A++ VR + K+P F+ L++NDN+ N YSD +++A L+QS
Sbjct: 857 YVLQLVIPEAISKVRDNNGKTPMRVKRFLYDKLRFNDNSNNEYSDNYYVATLMQSLCHAM 916
Query: 366 VGELE--------FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
+G +E F + +L + IDR + D SY + + + +R
Sbjct: 917 LGRVESRSHDLDDFDMEGVLESQAEEKLEKDAIAEIDRYRRMDEWSSSYQNLYSRTTLR- 975
Query: 411 LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
Q+ L + LD +++ + P+ T +R+EA L++ +
Sbjct: 976 -CQMQLMQAKITELD-IMQFL-PYTRAGTYDLLRLEAFECLVETD 1017
>gi|317139688|ref|XP_001817687.2| TATA-binding protein associated factor Taf2 [Aspergillus oryzae
RIB40]
Length = 1242
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 138/525 (26%), Positives = 231/525 (44%), Gaps = 88/525 (16%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P +ST F+ ++G+ + L FF
Sbjct: 484 ILDRRLTKASGKATMSRIISRLFLNARMGDIPNGAVSTMLFQKTCERLGHAK---LDSFF 540
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+WV GCP + +NK+K +VE+ + + + +P +R L NT + +I
Sbjct: 541 QQWVLGAGCPRFQATQRFNKKKLVVEMMIKQVQSEQPSTRD--LDKNTFMRDVKEEIRGI 598
Query: 114 --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
+ G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 599 YAGVVQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 658
Query: 165 KPDGCDDN------------GDAV--------------AGLDMRSSMESPLSWIRADPEM 198
G D N GD + + D + WIR D +
Sbjct: 659 AASGGDPNAEVQEDVLLYCLGDVLQSEEEIQEWKLADWSKEDEERMGQESYEWIRMDADF 718
Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDSK 255
E++ ++ P M+++QL++D DV+AQ Q +AA P +S V T L D +
Sbjct: 719 EWICKLSLVMPGYMYLSQLQQDRDVIAQLESLQYMAAQREHPLISTIFVRT----LMDRR 774
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
F+ +R+ AA AL A EE +W GL HL R F E LP R NDF D +
Sbjct: 775 YFYGIRVAAAKALIKHAKEEINWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 829
Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS----- 365
Y + IP A++ VR + K+P F+ L++NDN+ N YSD +++A L+QS
Sbjct: 830 YVLQLVIPEAISKVRDNNGKTPMRVKRFLYDKLRFNDNSNNEYSDNYYVATLMQSLCHAM 889
Query: 366 VGELE--------FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
+G +E F + +L + IDR + D SY + + + +R
Sbjct: 890 LGRVESRSHDLDDFDMEGVLESQAEEKLEKDAIAEIDRYRRMDEWSSSYQNLYSRTTLR- 948
Query: 411 LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
Q+ L + LD +++ + P+ T +R+EA L++ +
Sbjct: 949 -CQMQLMQAKITELD-IMQFL-PYTRAGTYDLLRLEAFECLVETD 990
>gi|238483151|ref|XP_002372814.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
[Aspergillus flavus NRRL3357]
gi|220700864|gb|EED57202.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
[Aspergillus flavus NRRL3357]
Length = 1290
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 138/525 (26%), Positives = 228/525 (43%), Gaps = 88/525 (16%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P +ST F+ ++G+ + L FF
Sbjct: 511 ILDRRLTKASGKATMSRIISRLFLNARMGDIPNGAVSTMLFQKTCERLGHAK---LDSFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+WV GCP + +NK+K +VE+ + + + +P +R L NT + +I
Sbjct: 568 QQWVLGAGCPRFQATQRFNKKKLVVEMMIKQVQSEQPSTRD--LDKNTFMRDVKEEIRGI 625
Query: 114 --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
+ G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 626 YAGVVQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 685
Query: 165 KPDGCDDN------------GDAV--------------AGLDMRSSMESPLSWIRADPEM 198
G D N GD + + D + WIR D +
Sbjct: 686 AASGGDPNAEVQEDVLLYCLGDVLQSEEEIQEWKLADWSKEDEERMGQESYEWIRMDADF 745
Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDSK 255
E++ ++ P M+++QL++D DV+AQ Q +AA P +S V T L D +
Sbjct: 746 EWICKLSLVMPGYMYLSQLQQDRDVIAQLESLQYMAAQREHPLISTIFVRT----LMDRR 801
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
F+ +R+ AA AL A EE +W GL HL R F E LP R NDF D +
Sbjct: 802 YFYGIRVAAAKALIKHAKEEINWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 856
Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS----- 365
Y + IP A++ VR + K+P F+ L++NDN+ N YSD +++A L+QS
Sbjct: 857 YVLQLVIPEAISKVRDNNGKTPMRVKRFLYDKLRFNDNSNNEYSDNYYVATLMQSLCHAM 916
Query: 366 VGELE--------FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
+G +E F + +L + IDR + D SY + + + +R
Sbjct: 917 LGRVESRSHDLDDFDMEGVLESQAEEKLEKDAIAEIDRYRRMDEWSSSYQNLYSRTTLR- 975
Query: 411 LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
Q+ L + LD + P+ T +R+EA L++ +
Sbjct: 976 -CQMQLMQAKITELD--IMQFLPYTRAGTYDLLRLEAFECLVETD 1017
>gi|403161521|ref|XP_003321845.2| hypothetical protein PGTG_03382 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375171304|gb|EFP77426.2| hypothetical protein PGTG_03382 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 2059
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 120/443 (27%), Positives = 199/443 (44%), Gaps = 69/443 (15%)
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
LK + +W+ GCP+ ++ +N++K EL LR +S + N
Sbjct: 523 LKSWADQWIYASGCPIFQVTAYFNRKKLTHEL-TLRQLNRSKQYYEDDQTSWAEKCNLKP 581
Query: 111 DIGWPGMMSIRVHELDGM-YDHPI-LPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDG 168
+ G MSIR+ E DG+ Y+H + + ++L + K PD
Sbjct: 582 VDRFEGQMSIRIQESDGIPYEHVLDIKEPFKRYELPSNHKNKKFKRSSRRYASSNQAPDL 641
Query: 169 CDDNGDAVAGLDMRSSME--------------------SPLSWIRADPEMEYLAEIHFNQ 208
+ + + S E S + WIR D E+E++ E F
Sbjct: 642 TETSTETFTYPPWERSQEERDRWKVEDWTEEEDLFITQSNVEWIRLDVEVEWICEFRFEM 701
Query: 209 PVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYAL 268
MW+ QL++D DVVAQ +A+ AL+ LP S V + L + S+ F+R+RIEA AL
Sbjct: 702 KEFMWLEQLQRDRDVVAQLEAVRALKTLP--SKVVSSHLCRVVLISEYFFRIRIEAILAL 759
Query: 269 ANTASEETDWAGLLHLVKFYKSR-RFDENI---------GLPRPNDFRDFSEYFVLEAIP 318
+ A+ + GL HL+K ++SR + + +P+PN F +F++YF+ +A+
Sbjct: 760 VSCATSRCGFLGLFHLLKIFQSRFCYPPQVEPDSPWHIRAIPKPNQFDNFADYFIRKALV 819
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV------------ 366
A++ VR +N SP +F L L YNDN+ NPYSD ++A + S+
Sbjct: 820 VALSHVRDQNNNSPSVCHQFFLDQLVYNDNSLNPYSDAHYIATTIGSMTDSLIHLADNAG 879
Query: 367 ------------------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
G+LE L L ++ ++R DRL+PSY ++ ++ I
Sbjct: 880 REQLVEIFGTKALSTGTNGDLEKTCDKRLLL--MINELERCFMADRLVPSYRNLIAMAVI 937
Query: 409 RTLTQIALKLSGFISLDQVVKLI 431
++ L +G + +D V L+
Sbjct: 938 E--AKLKLMFAGLLPVDLVFFLL 958
>gi|444314387|ref|XP_004177851.1| hypothetical protein TBLA_0A05390 [Tetrapisispora blattae CBS 6284]
gi|387510890|emb|CCH58332.1| hypothetical protein TBLA_0A05390 [Tetrapisispora blattae CBS 6284]
Length = 1685
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/467 (25%), Positives = 213/467 (45%), Gaps = 71/467 (15%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +LS F H KV + L++FF +WV + G P+ R+
Sbjct: 802 RVLPKIFLQAMSGDLPNNSLSASHFIHVCEKVN---KNKLEDFFQQWVYSSGVPIFRITQ 858
Query: 72 SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
+NK++ +VE+ + + D + D +NR+ + G M+IR+H
Sbjct: 859 KFNKKRMVVEVGIRQVQNQELSQDKIIGKDGFCSSALHTLQDQNRNLPPFFTGSMTIRIH 918
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP-----DGCDDNGDAVA 177
E DG Y+H + +++ L+IQ ++K R +K +K D D+ + V
Sbjct: 919 EADGAPYEHIV--ELKESFTKLDIQYNTKYK-RTKIKKIVSTKSSLNLKDNAPDDYETVE 975
Query: 178 GLDMRSS-----------MESPLS--------------------------WIRADPEMEY 200
+ + S + SP+ WIR D + E+
Sbjct: 976 IFNEKDSEVIKIQKLGNVLTSPMDCYNWNLVDFNTTTEAHEYQKQADSFEWIRIDSDFEW 1035
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAI----AALEALPHLSFNVVNTLNNFLSDSKA 256
+ +I NQP M+ +QL +DGDV AQ ++I +E S + L + DS+
Sbjct: 1036 ICKIIINQPDYMFASQLRQDGDVEAQLESIRYYEDVIEHSSTSSNVYSSILTRTIMDSRY 1095
Query: 257 FWRVRIEAAYALANTASEETD----WAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYF 312
F+ VRI A AL+ +E D G LHLV+ +++ N +P+ NDF DF Y+
Sbjct: 1096 FYGVRIAACRALSKYIFKEIDPDNFRGGPLHLVRIFQNLFCFNNSNIPKNNDFSDFRNYY 1155
Query: 313 VLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS-VGELEF 371
+ +AIP ++ VR P + +L LL YN+N+ N + D +++ L++S V +
Sbjct: 1156 LRKAIPFFLSSVRDEHGNCPSFIRQLLLDLLTYNENSENIFDDTYYICTLIKSLVNAMLT 1215
Query: 372 GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
Q +F+ + ++ R + +P+Y ++ ++T+ + KL
Sbjct: 1216 EDQDKVFIKHVSEQFHRYYNLEEWLPTYQKLV----LKTIIEERFKL 1258
>gi|119484530|ref|XP_001262044.1| hypothetical protein NFIA_097760 [Neosartorya fischeri NRRL 181]
gi|119410200|gb|EAW20147.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 1272
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/426 (28%), Positives = 191/426 (44%), Gaps = 68/426 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P +++ +F+ ++G+ + L FF
Sbjct: 508 ILDRRLTKASGKATMSRIISRMFLNARMGDIPNGAITSSQFQKICERLGHAK---LDTFF 564
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+WV GCP + +NK+K +VE+ + + +P +R L NT + +I
Sbjct: 565 QQWVYGAGCPRFQATQRFNKKKLVVEMMIKQVQADQPSTRD--LDKNTFMRDVKEEIRGV 622
Query: 114 --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
+ G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 623 YAGNIQPVFSGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 682
Query: 165 KPDGCDDNGDAVAGL---------------------------DMRSSMESPLSWIRADPE 197
G D N + + + R ES WIR D +
Sbjct: 683 AVTGGDPNAETQEDVLLYCLGDVLQTEEEMQEWRLADWSKEDEDRMGQES-YEWIRMDAD 741
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQ-----AQAIAALEALPHLSFNVVNTLNNFLS 252
E++ ++ P M+++QL++D DVVAQ Q +AA P +S V T L
Sbjct: 742 FEWICKLSLVMPGYMYLSQLQQDRDVVAQLEVSSLQYMAAQREHPLISTIFVRT----LM 797
Query: 253 DSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRD 307
D + F+ +R AA AL A EE +W GL HL R F E LP R NDF D
Sbjct: 798 DRRYFYGIREAAARALVKHAKEEINWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSD 852
Query: 308 FSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
+ Y + AIP A++ VR D K+P F+ LK+NDN+ N YSD F++A L+QS+
Sbjct: 853 RAAYVLQLAIPEAISKVRDNDGKTPLRVKRFLYDKLKFNDNSNNEYSDNFYVATLMQSLC 912
Query: 368 ELEFGQ 373
G+
Sbjct: 913 HAMLGK 918
>gi|148697321|gb|EDL29268.1| mCG116451 [Mus musculus]
Length = 1199
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 130/477 (27%), Positives = 220/477 (46%), Gaps = 72/477 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
NG+ V DM LS + DP+M L ++ F Q MW +L + DVVAQ ++I
Sbjct: 608 -NGEEV---DMD------LSAMEIDPDMSVLRKVEFEQADFMWQYELRYERDVVAQQESI 657
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFYK 289
ALE P + + L + L + F+RVR A + LA A S + W G + +
Sbjct: 658 LALEKFPTPASRLA--LTDILEQEQCFYRVRTSACFCLAKIANSMVSTWTGPPAMKSLF- 714
Query: 290 SRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYND
Sbjct: 715 TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYND 774
Query: 348 NNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLM 396
N N +SD ++ A AL SV + L +L L+ I R L ++L+
Sbjct: 775 NRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLL 834
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 835 PSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFVDIRIAALEAVVD 887
>gi|149066382|gb|EDM16255.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
factor [Rattus norvegicus]
Length = 1015
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 130/477 (27%), Positives = 221/477 (46%), Gaps = 72/477 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 290 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 349
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 350 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 387
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 388 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 438
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
NG+ V DM LS + DP+M L ++ F Q MW +L + DVVAQ ++I
Sbjct: 439 -NGEEV---DMD------LSAMEIDPDMSVLRKVEFEQADFMWQYELRYERDVVAQQESI 488
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFYK 289
ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 489 LALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF- 545
Query: 290 SRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYND
Sbjct: 546 TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYND 605
Query: 348 NNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLM 396
N N +SD ++ A AL SV + L +L L+ I R L ++L+
Sbjct: 606 NRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLL 665
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
PSY +T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 666 PSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFVDIRIAALEAVVD 718
>gi|353235918|emb|CCA67923.1| related to TAF2-component of TFIID complex [Piriformospora indica
DSM 11827]
Length = 1782
Score = 147 bits (370), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 122/475 (25%), Positives = 217/475 (45%), Gaps = 84/475 (17%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLE-RPFLKEFFPRWVG 60
+ K S +I+ II A +ST F K ++ RPF ++ WV
Sbjct: 513 MHKTGTSQGLHRIIPKIILDALENESAAAISTHSFLRMCRKTAGIDFRPFAEQ----WVY 568
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG----WPG 116
GCP + +N++K VE+ V ++ +P N + + + + G
Sbjct: 569 GSGCPRFAVRSHFNRKKMAVEMHVKQE--------SPAYEFNKEDPVQLALLRPMPFFDG 620
Query: 117 MMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA-----ARRALKPKKGSKPDGCD 170
M+IR+HE DG Y+H +L + ++ E+ ++K +R + ++ ++ G
Sbjct: 621 QMTIRIHEADGTPYEH-VLDIR-QPYKKFEVPFNTKYKRVRRNTKRFIARQQAAEAAGNQ 678
Query: 171 DNGDAVAGLDMRSSM--------------------------ESPLSWIRADPEMEYLAEI 204
+ + + +DM + + WIR D ++E+L +I
Sbjct: 679 ETAELIGVVDMSFGLGIWEDDASREEWKVADWTEADEAHMATATYEWIRLDTDIEWLTKI 738
Query: 205 HFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEA 264
F+Q MW++QL++D DV+AQ +AI AL +P S V +T + + F+RVR EA
Sbjct: 739 SFDQEPFMWVSQLQRDRDVIAQIEAIHALSRIP--SKIVSSTFTKTVLTTTYFFRVRSEA 796
Query: 265 AYALANTASEETDWAGLLHLVKFYKSRRFD--------ENIGLPRPNDFRDFSEYFVLEA 316
A +L A ++ GL HL K + +D +N +P+ NDF DFSEYF+ +A
Sbjct: 797 ALSLVPCAGAAVEYLGLYHLYKLFFRYCYDPVNKEDIFKNEVVPKANDFSDFSEYFLRKA 856
Query: 317 IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV---------- 366
+ A++ VR + ++ F++ L+YNDN+ N +SD ++LA ++ ++
Sbjct: 857 LITALSQVRVENGETVPIVRRFLIDQLRYNDNSTNNFSDSYYLAHIIGALGCSLVSTTPP 916
Query: 367 --GELE-----------FGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
GEL+ Q + L + +DR DR++PS+ +T++ I
Sbjct: 917 ERGELQGDHIPKIVDDPRAQINKDILDGAIAEVDRYRDMDRVIPSWRNAITVATI 971
>gi|345483769|ref|XP_001599737.2| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Nasonia vitripennis]
Length = 1227
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 124/499 (24%), Positives = 222/499 (44%), Gaps = 61/499 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H F + + +
Sbjct: 457 MLEHRIGQELLLQVFNKQLSLATNAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 515
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + F +N+++N VEL + +D T + G
Sbjct: 516 FIDQWVRTGGHAKFTLSFKFNRKRNTVELEIRQDKT-----------------HERGIRK 558
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + + + ELDG + H L + G + +I CHSK + K + G
Sbjct: 559 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 608
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DP+M + QP W QL + DV AQ +AI AL
Sbjct: 609 EEV-DMDLSAMDDSPVLWIRLDPDMTLMRAAQIEQPDYQWQYQLRHERDVTAQMEAIDAL 667
Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
+ A P + +T+ N +++VR+ AA+ L A+ WAG ++ ++
Sbjct: 668 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 723
Query: 291 RRFDENIG-LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
+ + + N+F +F YF+ + IP A+A +R A P E + F++ L KYNDN+
Sbjct: 724 LFGSASCRRIIKQNNFTNFQHYFLQKTIPVAMAGLRNAHGICPPEVLSFLMDLFKYNDNS 783
Query: 350 GNPYSDVFWLAALVQSVGELEFGQQSILF------------LSSLLKRIDRLLQFDRLMP 397
N YSD ++ AAL++++G SI ++L+ + R L ++L+P
Sbjct: 784 KNRYSDNYYRAALIEALGATVTPVISIQHGAAITAESLSADTKAILEEVTRHLNLEKLLP 843
Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFH 457
Y ++++C++ + L F L + + + + VR+ A AL+D
Sbjct: 844 CYKYTVSVACLKVIR----ILQKFGHLPSTPLIFREYAAYGQFIDVRIAALTALVDFT-K 898
Query: 458 CNGIDSALSLFIKSVEEEP 476
+G L + E +P
Sbjct: 899 VDGKWEDLEFLLDMAENDP 917
>gi|295665422|ref|XP_002793262.1| transcription initiation factor TFIId 127kD subunit
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226278176|gb|EEH33742.1| transcription initiation factor TFIId 127kD subunit
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 1196
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 135/500 (27%), Positives = 213/500 (42%), Gaps = 91/500 (18%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P +L+T + ++G+ + L +FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLHARMGDLPNGSLATSFLQKICERLGHTK---LDQFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKP---DSRTPVLSSNTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + +P D + E ++
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLHKLTFMRDVKEEIKNVYA 627
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 628 GAVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687
Query: 167 DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
G D N GD + + R ES WIR D + E
Sbjct: 688 SGLDPNAELQDDVLLYCLGDVLQSEEEIQKWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
++ ++ P M+++QL++D D+VAQ ++I AA P +S + T L D +
Sbjct: 747 WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFIRT----LMDRRY 802
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSEY 311
F+ +R AA AL A E+ DW GL HL F E LP R NDF D S Y
Sbjct: 803 FYGIRAAAAKALVKHAKEDVDWIGLFHL-----ESAFQELFCLPGSPMTRSNDFSDRSSY 857
Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---- 367
+ IP A+A VR + K+P F+ LK+NDN+ + SD +++AAL++S+
Sbjct: 858 ILQLTIPQAIAQVRDSTGKTPMRVKRFLYDKLKFNDNSNSEISDCYYVAALMKSICDALL 917
Query: 368 ----------------ELEFGQQSILFLS-SLLKRIDRLLQFDRLMPSYNGIL---TISC 407
E+E +Q+ L + IDR + D S+ I I C
Sbjct: 918 GRKENEPGNIDMDFDMEMELERQAEQQLDRDCIAEIDRYRRMDEWTSSFQNIYARTAIGC 977
Query: 408 IRTLTQIALKLSGFISLDQV 427
R Q +G I LD V
Sbjct: 978 QRRFMQ-----AGVIELDLV 992
>gi|452842010|gb|EME43946.1| hypothetical protein DOTSEDRAFT_71677 [Dothistroma septosporum NZE10]
Length = 1498
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 137/516 (26%), Positives = 235/516 (45%), Gaps = 83/516 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K GS +I+ I+ A+ G LST EF+ K+G+ + L+ FF +WV
Sbjct: 528 LIKSSGSTGVMRIINKILLNAKTGTLHNGALSTAEFQRTCEKLGHNK---LEHFFRQWVQ 584
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD-------GDIG 113
GCP+ + +NK+K +VE+ + R + ++ +N E ++ G++
Sbjct: 585 GAGCPIFYVTQRFNKKKLVVEMEI-RQRQHERQTKPLFAPNNFMREVKEHVREVWAGELQ 643
Query: 114 --WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSK---LAARRALKPKKGSKPD 167
+ G M+IR+HE DG Y+H + G LEI ++K L R + + ++
Sbjct: 644 PVFTGPMTIRIHEADGTPYEHIVEIKEGTT--KLEIPYNTKYKRLKRSRRQRERAAAEGQ 701
Query: 168 GCDDNGDAVAGLDMRSSMESP-----------------------LSWIRADPEMEYLAEI 204
+N D + ++ P WIR D + E+L I
Sbjct: 702 ANGENTDEALLYCLGDVLDRPEDMKEWNLMDWSKEDEDKMGGESYEWIRMDADFEWLGRI 761
Query: 205 HFNQPVQMWINQLEKDGDVVAQAQAIAAL--EALPHLSFNVVNTLNNFLSDSKAFWRVRI 262
H QP+ M+I+QL++D D+VAQ +++ L A H+S ++ L L D + F+ +RI
Sbjct: 762 HLTQPLYMYISQLQQDRDIVAQHESMRHLCGAAAHHVSLSI---LLRTLMDERYFYGIRI 818
Query: 263 EAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVA 322
AA LA+ A + G +HL K + +E+ +PRPNDF + + AIP A+
Sbjct: 819 LAAEGLASLARGQLLKIGQMHLTKAFAEMFCEEDGVMPRPNDFTSRVAFIMQCAIPRAMT 878
Query: 323 MVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------------- 367
+R + K P+ +F + LK+NDN+ N YSD +++ L+ +
Sbjct: 879 KLRDDEGKVPKAVEQFFVDKLKFNDNSDNLYSDCHYVSTLMTCLAESLVVSHRIIRPTVA 938
Query: 368 ---ELEFGQQSIL-------------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTL 411
E FG++ + F + I+R + D + +Y I + + IR L
Sbjct: 939 PTYEFNFGEEEPVEEEEQEEFNPDADFEQMAIDEIERYRRIDEWVVTYQNIYSTTAIRCL 998
Query: 412 TQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEA 447
+ L +G + D+ +L++ R +T VR+EA
Sbjct: 999 QK--LTKAGVVK-DKTKELLQYTRP-STADNVRLEA 1030
>gi|448533340|ref|XP_003870613.1| Tsm1 transcription initiation factor TFIID subunit [Candida
orthopsilosis Co 90-125]
gi|380354968|emb|CCG24484.1| Tsm1 transcription initiation factor TFIID subunit [Candida
orthopsilosis]
Length = 1371
Score = 146 bits (369), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 120/445 (26%), Positives = 207/445 (46%), Gaps = 55/445 (12%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G P TLST F++ KV R L FF +WV G P ++ +NK+K ++E+
Sbjct: 617 GDLPNGTLSTDHFQYVCEKVN---RNKLDSFFKQWVFGSGAPSFKISQRFNKKKGMIEMT 673
Query: 84 VLR-----DCTVKPDSRTPVLSS----NTDSENRDGDIGWPGMMSIRVHELDGM-YDHPI 133
+ + + TV+ ++ + + + N + + ++ + G M+IRVHE DG Y+H I
Sbjct: 674 IRQVQHHENKTVRLNAESFIDDALCFINKEPLPQRQNV-FTGPMNIRVHESDGTPYEHII 732
Query: 134 LPMAGDAWQLLEIQCHSKLA----ARRALKPKKGSKPDGCDDN--GDAVAGLDMRSSME- 186
+ E++ ++ R+ K +G P G + N GD + + + E
Sbjct: 733 --------HIKEVKTTIDISLSKKVRKIKKKDEGVDP-GINFNQFGDILISEEDKRKWEL 783
Query: 187 -------------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
P WIR D + E++A+I QP M+ +QL D DV AQ A+
Sbjct: 784 QDWESRDEDSLGAEPFEWIRGDIDFEWIAKIEVLQPDFMYGSQLVYDRDVEAQFDALRYF 843
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF 293
L S L D + F+R+RI AA ALA ++ D+ G+ +L+K YK
Sbjct: 844 GNLEKPSVIYCTALVRTFLDERYFYRIRIAAAEALATCSNASNDFIGMKYLIKAYKHLYC 903
Query: 294 DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPY 353
+ +P+ NDF DF + + A+P + + + + P++ + + +K+NDN N Y
Sbjct: 904 FPDSSIPKANDFNDFKLFLIQRAVPGILCRITDDEGRVPKDVQDLLFNFVKFNDNAMNDY 963
Query: 354 SDVFWLAALVQSV----------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGIL 403
D +L+ L++++ E Q F + I RL + D PSY IL
Sbjct: 964 QDTLYLSDLIKALTTSAISGWRKDTAEGNNQE--FAHKVNAEITRLQKLDEWRPSYQRIL 1021
Query: 404 TISCIRTLTQIALKLSGFISLDQVV 428
+C ++ Q+AL + +SL++++
Sbjct: 1022 QTTCFKSRVQMALYGAITLSLEEMM 1046
>gi|328708804|ref|XP_001952652.2| PREDICTED: transcription initiation factor TFIID subunit 2-like
isoform 1 [Acyrthosiphon pisum]
Length = 1215
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 148/580 (25%), Positives = 263/580 (45%), Gaps = 75/580 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS--PVRT-------LSTKEFRHFANKVGNLERPFL 51
MLE ++G + ++L +S A AS P+ + +ST F V ++ +
Sbjct: 436 MLENRIGYSLLLQVLNKHLSLASNASQNPMNSGLWGHMLISTNIFNKTIFTVSGID---M 492
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV T G F +N+++N +EL + +D + R V
Sbjct: 493 SVFIDQWVRTGGHAKFTCTFVFNRKRNTLELEIKQDAVCQRGIRKYV------------- 539
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
G + + + ELDG + H L + G+ + +I CHSK + K +
Sbjct: 540 ----GPLLVSIQELDGPFKH-TLQVEGNVARA-DITCHSKSRRNKKKKIPLCT------- 586
Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
GD V +D+ + +SP+ WIR DPEM L QP W QL + DV AQ +A+
Sbjct: 587 -GDEV-DIDLNAMDNDSPVLWIRLDPEMTILRSCIIEQPEYQWNYQLRHEKDVTAQMEAV 644
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYK 289
AL+ P ++ N L + + + + + VR +AA+ L A+ T+W G ++ +K
Sbjct: 645 VALDKFPTIA--TRNALCDIIENEQVYIHVRCKAAHCLTKIANAMVTNWTGPPAMLATFK 702
Query: 290 SRRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+ + P + N+F + Y++ + IP A+A +R + P E ++F+L L KYN
Sbjct: 703 --KMFGSFSCPHIIKQNNFSNLQHYYLQKTIPIAMAGLRNSHGICPPEILKFILDLFKYN 760
Query: 347 DNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSS--------LLKRIDRLLQFD 393
DN+ N YSD ++ A+L+ ++G + Q++ + +L+ I + L +
Sbjct: 761 DNSKNRYSDNYYRASLIDALGATITPVISLMQKACDITADSLSGDSKLILEEITQHLNLE 820
Query: 394 RLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+ + +Y +TI+C++ + KL F L + + + + VR+ A AL+D
Sbjct: 821 KRLQTYKYPITITCLKAIR----KLQKFGHLPNNPNIFRSYSSYGNFIGVRLCALEALVD 876
Query: 454 LEFHC-NG--IDSALSLFIKSVEEEPSLRGQ-VKLGIHAMRICQIKGGSDSNHEVDTVTL 509
F C +G D + I + +P +R + V++ K + N++ + L
Sbjct: 877 --FTCLDGRWDDLKYLIDIADSDPDPKVRNELVRMLCKNPPFTPKKPNNRLNNDYLSHHL 934
Query: 510 VALLNLL---ESRIAFNNVFLRHHLFGILQILAGRAPTLY 546
L+N L ++ + N V L + LFG +IL LY
Sbjct: 935 WKLINSLLSYDTTLRCNFVDLYYTLFGRQKILRAEVAALY 974
>gi|328708802|ref|XP_003243802.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
isoform 2 [Acyrthosiphon pisum]
Length = 1249
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 148/580 (25%), Positives = 263/580 (45%), Gaps = 75/580 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS--PVRT-------LSTKEFRHFANKVGNLERPFL 51
MLE ++G + ++L +S A AS P+ + +ST F V ++ +
Sbjct: 470 MLENRIGYSLLLQVLNKHLSLASNASQNPMNSGLWGHMLISTNIFNKTIFTVSGID---M 526
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV T G F +N+++N +EL + +D + R V
Sbjct: 527 SVFIDQWVRTGGHAKFTCTFVFNRKRNTLELEIKQDAVCQRGIRKYV------------- 573
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
G + + + ELDG + H L + G+ + +I CHSK + K +
Sbjct: 574 ----GPLLVSIQELDGPFKH-TLQVEGNVARA-DITCHSKSRRNKKKKIPLCT------- 620
Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
GD V +D+ + +SP+ WIR DPEM L QP W QL + DV AQ +A+
Sbjct: 621 -GDEV-DIDLNAMDNDSPVLWIRLDPEMTILRSCIIEQPEYQWNYQLRHEKDVTAQMEAV 678
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYK 289
AL+ P ++ N L + + + + + VR +AA+ L A+ T+W G ++ +K
Sbjct: 679 VALDKFPTIA--TRNALCDIIENEQVYIHVRCKAAHCLTKIANAMVTNWTGPPAMLATFK 736
Query: 290 SRRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+ + P + N+F + Y++ + IP A+A +R + P E ++F+L L KYN
Sbjct: 737 --KMFGSFSCPHIIKQNNFSNLQHYYLQKTIPIAMAGLRNSHGICPPEILKFILDLFKYN 794
Query: 347 DNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSS--------LLKRIDRLLQFD 393
DN+ N YSD ++ A+L+ ++G + Q++ + +L+ I + L +
Sbjct: 795 DNSKNRYSDNYYRASLIDALGATITPVISLMQKACDITADSLSGDSKLILEEITQHLNLE 854
Query: 394 RLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+ + +Y +TI+C++ + KL F L + + + + VR+ A AL+D
Sbjct: 855 KRLQTYKYPITITCLKAIR----KLQKFGHLPNNPNIFRSYSSYGNFIGVRLCALEALVD 910
Query: 454 LEFHC-NG--IDSALSLFIKSVEEEPSLRGQ-VKLGIHAMRICQIKGGSDSNHEVDTVTL 509
F C +G D + I + +P +R + V++ K + N++ + L
Sbjct: 911 --FTCLDGRWDDLKYLIDIADSDPDPKVRNELVRMLCKNPPFTPKKPNNRLNNDYLSHHL 968
Query: 510 VALLNLL---ESRIAFNNVFLRHHLFGILQILAGRAPTLY 546
L+N L ++ + N V L + LFG +IL LY
Sbjct: 969 WKLINSLLSYDTTLRCNFVDLYYTLFGRQKILRAEVAALY 1008
>gi|121719809|ref|XP_001276603.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
gi|119404815|gb|EAW15177.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
Length = 1269
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 140/524 (26%), Positives = 224/524 (42%), Gaps = 92/524 (17%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K G +I+ I A+ G P +++ +F+ ++G+ + L FF +WV
Sbjct: 513 LTKASGKATMSRIISRIFLNARMGDLPNGAITSSQFQKTCERLGHAK---LDTFFQQWVY 569
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT-----DSENRDGDIG-- 113
GCP + +NK+K +VE+ + + +P +R L NT E R G
Sbjct: 570 GAGCPRFQATQRFNKKKLVVEMMIKQVQADQPSTRD--LDRNTFMRDVKEEIRGVYAGHV 627
Query: 114 ---WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGC 169
+ G M+IR+HE DG Y+H + G + K R + ++ + G
Sbjct: 628 QPVFSGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAATIGG 687
Query: 170 DDNGDAVAGL---------------------------DMRSSMESPLSWIRADPEMEYLA 202
D N + + + R ES WIR D + E++
Sbjct: 688 DPNAEVQEDVLLYCLGDVLQTEEEMQEWRLADWSKEDEDRMGQES-YEWIRMDADFEWIC 746
Query: 203 EIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
++ P M+++QL++D DV+AQ Q +AA P +S V T L D + F+
Sbjct: 747 KLSLVMPGYMYLSQLQQDRDVIAQLESLQYMAAQREHPLISTIFVRT----LMDRRYFYG 802
Query: 260 VRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSEYFVL 314
+R AA AL A EE +W GL HL R F E LP R NDF D + Y +
Sbjct: 803 IRAAAARALVKHAKEEVNWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAAYVLQ 857
Query: 315 EAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS-----VGEL 369
IP A++ VR D K+P F+ LK+NDN+ N YSD F++A L++S +G++
Sbjct: 858 LVIPEAISKVRDNDGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMRSLCHAMLGKV 917
Query: 370 --------EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTL 411
EF + +L + + IDR + D S+ + + + C R
Sbjct: 918 ESRSADMDEFDMERMLEAQAEEELDKAAIAEIDRYRRMDEWSSSFQNVYSRAALHCQRQF 977
Query: 412 TQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
Q + LD + P+ T +R +A L++L+
Sbjct: 978 MQAKI-------LDLDIMQFLPYTRAGTYDLLRADAFEGLVELD 1014
>gi|195035725|ref|XP_001989322.1| GH10122 [Drosophila grimshawi]
gi|193905322|gb|EDW04189.1| GH10122 [Drosophila grimshawi]
Length = 1138
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 129/447 (28%), Positives = 218/447 (48%), Gaps = 54/447 (12%)
Query: 57 RWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPG 116
+WV T G + +N ++N +EL + +D K R + G
Sbjct: 516 QWVRTGGHVKFSVISVFNCKRNTIELDIRQDYIGKRGVRQ-----------------YNG 558
Query: 117 MMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAV 176
+ +++ ELDG + H + D I CHSK + K + G+ +
Sbjct: 559 PLMVQLQELDGTFKHTL--QIEDTLTRSVISCHSKSRRNKKKKIPLWT--------GEEI 608
Query: 177 AGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
+D+ + +SP+ W+R DPEM L ++ QP W QL + DV AQ QAI ALE
Sbjct: 609 -DMDLSAMDDSPVLWVRIDPEMILLRDLIVEQPDFQWQYQLRHERDVTAQFQAIRALER- 666
Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAG----LLHLVKFYKSR 291
+ + + L + + + F+RVR EAA+ L A++ D W+G L +K + S
Sbjct: 667 -YATSATRSALEDTIENECCFYRVRCEAAHCLTKVANQMVDSWSGPPPMLDIFIKMFGSL 725
Query: 292 RFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGN 351
+ + L NDF +F Y++ +AIP A+A +R P + V F+L L+KYN+N+ N
Sbjct: 726 QAPHIVRL---NDFSNFQLYYLQKAIPVAMAGLRTLHGICPPDVVRFLLDLIKYNENSQN 782
Query: 352 PYSDVFWLAALVQSVGE-LEFGQQSILFLSS---------LLKRIDRLLQFDRLMPSYNG 401
PYSD ++ AALV ++G+ L G QS+ SS +L + R L+ + +PSY
Sbjct: 783 PYSDSYYRAALVTALGDTLSPGYQSLTTDSSSSISTDPKLVLSAVTRFLRGEHHLPSYKY 842
Query: 402 ILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL-EFHCNG 460
++ ++C++ + ++ KL SL + + +R + +R+ A L+DL + H +G
Sbjct: 843 MVLVACLKVIRRLQ-KLGHLPSLPHIFRSYSQYRSY---LDLRLSAMECLVDLVQVHESG 898
Query: 461 IDSALSLFIKSVEEEPSLR-GQVKLGI 486
D + + + +P+ R G V+L I
Sbjct: 899 EDLEHLMHLLETDPDPAARYGLVQLLI 925
>gi|410078536|ref|XP_003956849.1| hypothetical protein KAFR_0D00670 [Kazachstania africana CBS 2517]
gi|372463434|emb|CCF57714.1| hypothetical protein KAFR_0D00670 [Kazachstania africana CBS 2517]
Length = 1388
Score = 145 bits (366), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 125/491 (25%), Positives = 221/491 (45%), Gaps = 79/491 (16%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +LS F+H +V + L+ FF +WV G P+ R+
Sbjct: 601 RVLPKIFLQAMSGDLPNNSLSASHFQHICERVN---KNKLEYFFQQWVYGSGVPIFRVTQ 657
Query: 72 SYNKRKNIVELAVLRDCTVKPDSRTPVLSS---------NTDSENRDGDIGWPGMMSIRV 122
+NK++ ++EL + R C + + S + E +D + G M+IR+
Sbjct: 658 RFNKKRMVIELGI-RQCQNTELGQDKTVGSSGFFNSALHHLKHEEKDIVPFFTGSMTIRI 716
Query: 123 HELDGM-YDHPILPMAGDAWQLLEIQCHSK---LAARRALKPKK--GSKPDGCDD--NGD 174
HE DG Y+H + D + ++IQ ++K L RR K S + DD +G
Sbjct: 717 HEADGTPYEHIV--ELKDIFTKIDIQYNTKYKKLRGRRMASTLKLNASTSNLQDDFIDGS 774
Query: 175 AVAGLDMRSSMESPLS------------------------------WIRADPEMEYLAEI 204
V + + LS WIR D + E++ ++
Sbjct: 775 VVDKDEEVQKFGTVLSSEEDCKKWNLENLTKTTEGNELQKQNEVFEWIRIDSDFEWICKM 834
Query: 205 HFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL----------SDS 254
+ NQP M+ +QL++DGDV AQ ++I E + VN N L D
Sbjct: 835 YINQPDYMFASQLQQDGDVEAQFESIKYFEDV------TVNANKNSLIYSSILARTAMDD 888
Query: 255 KAFWRVRIEAAYALAN--TASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYF 312
+ F+ +R+EA ALA EE G HL+K +++ E+ +P+ N F F +Y
Sbjct: 889 RYFYGIRLEACKALAKFIYRPEEIFLGGSRHLIKIFQTLFCFEDSNIPKNNRFDSFQKYL 948
Query: 313 VLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG 372
+ IP ++ VR N P +F+L +L YN+N NP+ D+ +L L+ +V
Sbjct: 949 LQREIPSFLSSVRTEQNDCPSFVKQFLLDILTYNENAENPFDDIIYLTRLINNVVTCTIN 1008
Query: 373 -QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL--SGFISLDQVVK 429
++ FL ++++++R D +P+Y + ++T+ + L+L +G + K
Sbjct: 1009 DKEDKSFLKKVVEQLNRYENLDTWIPTYQ----VKLMKTILEEKLRLNIAGLYDFGNLKK 1064
Query: 430 LIKPFRDFNTI 440
++ +++T+
Sbjct: 1065 YLQYSLNYDTL 1075
>gi|443691774|gb|ELT93532.1| hypothetical protein CAPTEDRAFT_221667 [Capitella teleta]
Length = 1210
Score = 145 bits (366), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 126/470 (26%), Positives = 209/470 (44%), Gaps = 67/470 (14%)
Query: 25 ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV 84
++P + +ST G +PF++E WV G F +N+++NI+EL +
Sbjct: 468 STPAKAISTV--------TGKDIQPFIQE----WVQQGGHARFFGQFVFNRKRNIIELEL 515
Query: 85 LRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLL 144
++ T K G + + G +++ + ELDG ++H +
Sbjct: 516 KQEWTTK------------------GALRYVGPLTVVLQELDGTFNHTF--KIEENKTKF 555
Query: 145 EIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEI 204
EI CHSK + K + + D GD A +SP+ W+R DPEM L +
Sbjct: 556 EITCHSKSRRHKKKKIPLCTGEEIDMDLGDTDA--------DSPVLWMRIDPEMFILRSV 607
Query: 205 HFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEA 264
QP MW L + VV+Q QAI AL P + L + + + F+RVRI A
Sbjct: 608 QLEQPDFMWQYLLRYERCVVSQCQAITALTNYPTPKTRLA--LTDTIESDQCFYRVRIAA 665
Query: 265 AYALANTASEET-DWAGLLHLVKFYKSRRFDENIGLP---RPNDFRDFSEYFVLEAIPHA 320
A L A+ WAG ++ + R+ + P R N F +F YF+ +AIP
Sbjct: 666 AEGLVKVANSMVGSWAGPPAMMTIF--RKLFGSHSCPTIVRQNHFNNFQHYFLQKAIPGL 723
Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLS 380
+A +R P + F+L L KYNDN+ N YSD ++ AAL+ +V E +++ ++
Sbjct: 724 MAGLRTVHGICPPDVHGFILDLFKYNDNSKNKYSDNYYRAALIDAVAETITPAVTMVTVT 783
Query: 381 SL--------------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQ 426
+ L+ + R L ++L+P Y ++T+SC++ + KL F L
Sbjct: 784 GMCPSTDSLTPDTSLILEELTRYLNLEKLLPCYRLVITVSCLKAIR----KLQKFGHLPS 839
Query: 427 VVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEP 476
+ + + + VR+ A AL+D G ++ +EE+P
Sbjct: 840 DAAIFRSYAEQGNFLDVRLAALEALVDFT-KVEGSFEIWQWLLEVIEEDP 888
>gi|50292707|ref|XP_448786.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528099|emb|CAG61756.1| unnamed protein product [Candida glabrata]
Length = 1385
Score = 145 bits (366), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 121/458 (26%), Positives = 211/458 (46%), Gaps = 69/458 (15%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P L++ F+H KV + L+ FF +WV G P+ R+
Sbjct: 580 RVLPKIFLQAMSGDLPNNALTSSHFQHVCEKVN---KSKLESFFQQWVFGSGVPLFRVTQ 636
Query: 72 SYNKRKNIVELAVL-----------RDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
YNK++ +VE+ + RD + D L+S ++ N + + G ++I
Sbjct: 637 RYNKKRMVVEMGIRQYPNDTDEFNERDKSKMEDFFNKALNS-LEAPNVNKTSIFTGSITI 695
Query: 121 RVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDG----------- 168
R+H+ DG Y+H L D + ++IQ ++K R K + K +
Sbjct: 696 RIHDSDGTPYEH--LVEIKDIYTKVDIQYNTKHKKTRTRKNNEDEKTNEIRYGTSGSDIS 753
Query: 169 -----------CDDNG-----DAVAGL---DMRSSMESPLSWIRADPEMEYLAEIHFNQP 209
C D G AVA L +++ + ES + WIR D + E+L+ + NQP
Sbjct: 754 FLGNILDSDTECQDWGIVHPSTAVANLSGDEIQRANESTIEWIRIDSDFEWLSTFYLNQP 813
Query: 210 VQMWINQLEKDGDVVAQAQAIAALEALPHLSFNV---------VNTLNNFLSDSKAFWRV 260
M+ QL++D D+ AQ LE++ + N+ + L L D + F+ +
Sbjct: 814 DYMFATQLQRDSDIEAQ------LESVWYYGDNLRDNMASKIYASQLVRTLMDDRYFYGI 867
Query: 261 RIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHA 320
RI+AA AL+ + TD L + +K +N +P+ N+F F +YF+ + P
Sbjct: 868 RIQAAKALSRYICKPTDKEHFL--LTAFKKLYCHDNSNIPKSNNFSSFQDYFLQMSFPSL 925
Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQ-SVGELEFGQQSILFL 379
+A ++ +P +F+L +L YNDN+ N Y D ++L + + +V G ++ +
Sbjct: 926 IAEIKDHYGDTPDYIGDFLLDILSYNDNSENSYYDTYYLCEIFEFTVNCCNGGNRN--HV 983
Query: 380 SSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALK 417
+ L+++I R + PSYN LT I ++ K
Sbjct: 984 TKLMEQIRRYEHLEYWSPSYNLALTSKLIEIKCRLHTK 1021
>gi|115491151|ref|XP_001210203.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114197063|gb|EAU38763.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1260
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 136/527 (25%), Positives = 231/527 (43%), Gaps = 91/527 (17%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + I+SR G P +++ F+ ++G+ + L FF
Sbjct: 503 ILERRLNKASGKPTMTRIVSRVFLNSRMGDIPNGAITSSHFQKTCERLGHAK---LDTFF 559
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+WV GCP + +NK+K +VE+ + + + +P +R L NT + +I
Sbjct: 560 QQWVYGAGCPRFQATQRFNKKKLVVEMVIRQIQSEQPSTRD--LDRNTFMRDVKEEIRGV 617
Query: 114 --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
+ G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 618 YAGAIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 677
Query: 165 KPDGCDDNGDAVAGL---------------------------DMRSSMESPLSWIRADPE 197
G D N + + + R ES WIR D +
Sbjct: 678 AVTGTDPNAETQEDVLLYCLGDVLQSEEEMQEWRLADWSKEEEERMGQES-YEWIRMDAD 736
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDS 254
E++ ++ P M+++QL++D DVVAQ Q +AA P +S V T L D
Sbjct: 737 FEWICKLSLVMPGYMYLSQLQQDRDVVAQLESLQYMAAQREHPLISTIFVRT----LMDR 792
Query: 255 KAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFS 309
+ F +R AA AL A E+ +W GL HL R F E +P R NDF D +
Sbjct: 793 RYFHGIRAAAARALVKHAKEDINWLGLFHL-----ERAFQELFCIPGSPMTRSNDFSDRA 847
Query: 310 EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS---- 365
Y + IP A++ VR + K+P F+ L++NDN+ N YSD F++A L+QS
Sbjct: 848 AYALQLIIPEAISKVRDNNGKTPMRVKRFLYDKLRFNDNSNNEYSDNFYVATLMQSLCHA 907
Query: 366 -VGELE---------FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
+G++E F + +L L IDR + D SY + + + +
Sbjct: 908 LLGKVETRSEDLDDNFDMERVLEAQAEEQLERDALAEIDRYRRMDEWSSSYQNVYSRAAL 967
Query: 409 RTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
R Q+ L + + +D V++ + P+ T +R++A L++ +
Sbjct: 968 R--CQMQLMQANILDMD-VMQFL-PYTRAGTYDLLRLDAFECLVETD 1010
>gi|261198060|ref|XP_002625432.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
dermatitidis SLH14081]
gi|239595395|gb|EEQ77976.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
dermatitidis SLH14081]
Length = 1250
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 121/424 (28%), Positives = 188/424 (44%), Gaps = 64/424 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L+T F+ ++G+ + L FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLNARMGDLPNGALTTSFFQRTCERLGHTK---LDSFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR----DCTVKPDSRTPVLSSNTDSENRDGD 111
+WV GCP +NK+K +VE+ + + TV+ D E ++
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQATVR-DLEKSTFMREVKEEIKNIY 626
Query: 112 IG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 627 AGPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAA 686
Query: 166 PDGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEM 198
G D N GD + + R ES WIR D +
Sbjct: 687 TSGMDPNAELQDDVLLYCLGDVLQSEEEIQKWKLVEWSKEDEDRMGQES-YEWIRMDADF 745
Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSK 255
E++ ++ P M+++QL++D D+VAQ ++I AA P +S V T L D +
Sbjct: 746 EWICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRR 801
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
F+ +R AA AL A +E DW GL HL F E LP R NDF D +
Sbjct: 802 YFYGIRAAAAKALVKHAKDEVDWIGLYHL-----ETAFQELFCLPGSPMTRSNDFSDRAA 856
Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELE 370
Y + IP A+A VR K+P F+ LK+NDN+ N SD +++AAL++S+ E
Sbjct: 857 YILQLTIPQAIAQVRDNTGKTPMRVKRFLYDKLKFNDNSNNELSDCYYIAALMKSICEAL 916
Query: 371 FGQQ 374
G++
Sbjct: 917 LGRK 920
>gi|239607754|gb|EEQ84741.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
dermatitidis ER-3]
gi|327354600|gb|EGE83457.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
dermatitidis ATCC 18188]
Length = 1250
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 121/424 (28%), Positives = 188/424 (44%), Gaps = 64/424 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L+T F+ ++G+ + L FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLNARMGDLPNGALTTSFFQRTCERLGHTK---LDSFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR----DCTVKPDSRTPVLSSNTDSENRDGD 111
+WV GCP +NK+K +VE+ + + TV+ D E ++
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQATVR-DLEKSTFMREVKEEIKNIY 626
Query: 112 IG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 627 AGPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAA 686
Query: 166 PDGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEM 198
G D N GD + + R ES WIR D +
Sbjct: 687 TSGMDPNAELQDDVLLYCLGDVLQSEEEIQKWKLVEWSKEDEDRMGQES-YEWIRMDADF 745
Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSK 255
E++ ++ P M+++QL++D D+VAQ ++I AA P +S V T L D +
Sbjct: 746 EWICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRR 801
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
F+ +R AA AL A +E DW GL HL F E LP R NDF D +
Sbjct: 802 YFYGIRAAAAKALVKHAKDEVDWIGLYHL-----ETAFQELFCLPGSPMTRSNDFSDRAA 856
Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELE 370
Y + IP A+A VR K+P F+ LK+NDN+ N SD +++AAL++S+ E
Sbjct: 857 YILQLTIPQAIAQVRDNTGKTPMRVKRFLYDKLKFNDNSNNELSDCYYIAALMKSICEAL 916
Query: 371 FGQQ 374
G++
Sbjct: 917 LGRK 920
>gi|346319804|gb|EGX89405.1| transcription initiation factor [Cordyceps militaris CM01]
Length = 1463
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 139/513 (27%), Positives = 231/513 (45%), Gaps = 74/513 (14%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
+ K G + ++LQ +++RAQ G+ L T++F+ K G+ R L+ F+ +WV
Sbjct: 534 MNKASGGHGLTRVLQKLLTRAQIEGSDKSMILDTEKFKTICEK-GSRYR--LETFWNQWV 590
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT---DSENRDGDIG--- 113
GCP + +NK++ VEL + + K ++ P L + + R ++
Sbjct: 591 YGSGCPKFDVKARFNKKRLCVELTLNQVQFQK--AKKPQLEKDDFLREIRERRAEVPLGE 648
Query: 114 ----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKKG 163
+ G M++R+HE DG Y+H IL + DA + + + + +L R +K K
Sbjct: 649 TQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRATKFEIPYNTKYKRLKRTRRMKEKHT 707
Query: 164 SKPDGCDDNGDAVAGLDMRSSMESP-----------------------LSWIRADPEMEY 200
+ DN D + ++SP WIR D + E+
Sbjct: 708 GQNIEITDNNDDALLYCLGDVLQSPEDQREWELIDWDPETERKMDQESYEWIRLDADFEW 767
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRV 260
+++ M+++QL++D DVVAQ A+ L P L L L D + F +
Sbjct: 768 ASDMSRTLEPYMYVSQLQQDRDVVAQQDAMLYLAQGP-LHPIASGFLVRTLIDRRYFHGI 826
Query: 261 RIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPH 319
R AA AL ++ ++ G+ L++ ++ E PRPNDF D +Y V AI
Sbjct: 827 RTMAAAALTRQSNIKDIPLLGMRQLMRAFREMYCYEQTNQPRPNDFTDKKQYNVRCAIIK 886
Query: 320 AVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF-------- 371
A+A VR NK P EA F+L L+ N+N GNPYSD F++A LV+S+
Sbjct: 887 AIAEVRDHTNKCPMEARRFILDQLRLNNNEGNPYSDHFYIAFLVKSLSISLIPSKQDDWL 946
Query: 372 -------GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI---RTLTQIALKLSGF 421
++ FL L++I+R+L+ D SY T++ + + L + + F
Sbjct: 947 TRQNRIPNEEETAFLEDALEQIERVLRRDEWTNSYQNTWTLAGLDAKQRLMKADVIPKNF 1006
Query: 422 ISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
Q L+ RD +R++ AL+DL
Sbjct: 1007 TDFGQY--LLDGTRDL-----IRIKCFEALIDL 1032
>gi|449667464|ref|XP_002158433.2| PREDICTED: transcription initiation factor TFIID subunit 2-like,
partial [Hydra magnipapillata]
Length = 1033
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 103/375 (27%), Positives = 183/375 (48%), Gaps = 61/375 (16%)
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +W+ G + FS+N+++NI+EL V C K G
Sbjct: 506 FLDQWICHSGVAKINASFSFNRKRNIIELEV-SQCVGK------------------GCQK 546
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G M I + ELDG + H + D E+ CHSK + P D
Sbjct: 547 YVGPMIICIQELDGSFPHTV--QVEDIISQHELPCHSKSRRNKKK-----KIPLITGDEV 599
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
D + + S +SP+ W+R DP+M + + QP MW QL + DV +Q +A+ AL
Sbjct: 600 DMDLSVTLDS--DSPVLWVRIDPDMTWPRHVSLQQPDYMWQYQLRYERDVTSQIEAVRAL 657
Query: 234 EALPHLSFN--VVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYKS 290
E P +++T+NN +AF++VR+EAA+ LA A+ E + W G +++ ++
Sbjct: 658 EKYPCSGTRSALIDTINNV----EAFFKVRMEAAFCLAKIANTEVEVWDGESAMIEIFE- 712
Query: 291 RRFDENI--GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDN 348
+ F N +P+ N+F +F+ YF+ + +P +++M++ + ++ R ++F+L+LLKYNDN
Sbjct: 713 KMFGSNACASIPKFNNFSNFTGYFMQKVLPTSISMLKNSQSQCTRRTLQFILELLKYNDN 772
Query: 349 NGNPYSDVFWLAALVQSV-------------------GELEFGQQSILFLSSLLKRIDRL 389
N + D F++++L+ + L + + L LS +++R
Sbjct: 773 RFNQFHDGFYVSSLIDGLKNVIGPALHLASATSFDERKNLPVDEHAKLVLSEVVRR---- 828
Query: 390 LQFDRLMPSYNGILT 404
L D+++PS+ +T
Sbjct: 829 LNMDKIIPSFRYAVT 843
>gi|302924510|ref|XP_003053905.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256734846|gb|EEU48192.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1486
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 151/544 (27%), Positives = 249/544 (45%), Gaps = 79/544 (14%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
L K G + +ILQ ++++ Q + L T++FR K R L+ F+ +WV
Sbjct: 546 LIKASGGHGLTRILQKLLTKVQIEASDKSTILETEKFRAVCEKGA---RYRLESFWNQWV 602
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAV--LRDCTVKPDSRTPVLSSN----TDSENRDG-DI 112
GCP + +NK++ VEL + ++ TVK P L + E R G
Sbjct: 603 YGSGCPRFDVKAKFNKKRLCVELTLNQIQFQTVK----KPALERDDFLRVIKERRAGVKT 658
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPK 161
G + G M++R+HE DG Y+H IL + DA + + + + +L R +K K
Sbjct: 659 GEVQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEK 717
Query: 162 K--GSKPDG--CDDN-----GDAVAGLDMRSSME--------------SPLSWIRADPEM 198
+ G+ D DD GD + D ++ E WIR D +
Sbjct: 718 QNVGASMDAENLDDTLLYCLGDVLQTPDEQAQWELIDWDPDTERKMDQESYEWIRVDADF 777
Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
E+ ++ M+++QL++D DVVAQ A+ L P L L L D + F
Sbjct: 778 EWACDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTHGP-LHPIASGFLTRTLVDRRYFH 836
Query: 259 RVRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAI 317
+R AA AL A+ ++ GL L+K + + P+PNDF D +Y V AI
Sbjct: 837 GIRTMAAEALPRQANIKDISMLGLRQLMKAFSEMFCHKGTNQPKPNDFSDKKQYNVQCAI 896
Query: 318 PHAVAMVRAADN-KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------E 368
A+A VR A+ K P EA +F+L L +N+N NPYSD F++A LV+++ +
Sbjct: 897 IKAIAQVRDANTYKCPLEARQFILDQLLFNNNEDNPYSDHFYIATLVEALATSLIPSKKD 956
Query: 369 LEFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCI---RTLTQIALKL 418
F Q+ + FL +++I+R+L+ D SY + TI+ + + L + +
Sbjct: 957 DWFAMQNKIPNEEETQFLDRAMEQIERVLRRDEWTNSYQNVWTIAGLNAKQRLMKAEVIP 1016
Query: 419 SGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSL 478
+I Q L+ RD +R++A AL+DL + +L ++ + + P +
Sbjct: 1017 KAYIDFGQY--LLDGTRDL-----IRIKAFEALIDLGAMMDPTMFSLLIYSLTTDRSPYV 1069
Query: 479 RGQV 482
R ++
Sbjct: 1070 RNKL 1073
>gi|326477130|gb|EGE01140.1| transcription initiation factor TFIId 127kD subunit [Trichophyton
equinum CBS 127.97]
Length = 1240
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 112/404 (27%), Positives = 188/404 (46%), Gaps = 49/404 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L++ F+ ++G+ + L FF
Sbjct: 514 ILERRLSKASGKATISRIISRIFLNARMGDLPNCELTSAHFQKLCERMGHTK---LDAFF 570
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + T D + E R+
Sbjct: 571 SQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQSEQTTARDLERSTFMRDVKEEIRNVYA 630
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEI-QCHSKLAARRALKPKKGSK 165
G + G M+I++HE DG Y+H ++EI + +K K K+ S
Sbjct: 631 GTVQNVFTGSMTIQIHEADGTPYEH-----------IVEIKEAITKFDIPYNTKYKRLS- 678
Query: 166 PDGCDDNGDAVAGL-DM--------------RSSMESPLSWIRADPEMEYLAEIHFNQPV 210
G D G+ + DM R ES WIR D + E++ ++ P
Sbjct: 679 --GIGDKGNELRPYEDMEKWKIVEWTKEDESRMGQES-YEWIRMDADFEWICKMSLGMPG 735
Query: 211 QMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN 270
M+++QL++D DVVAQ ++I + A + + + + D + F +R A AL
Sbjct: 736 YMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFHGIRTAATKALVK 794
Query: 271 TASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNK 330
A +E DW GL HL ++ N + R NDF D S Y V A+ A++ VR D +
Sbjct: 795 HAKQEVDWIGLFHLENAFQELYCLPNSSMARSNDFSDRSSYAVQLALIEAISEVRDEDGR 854
Query: 331 SPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
+P +F+ + LK+NDN+ N YSD F+++ L++S+ G++
Sbjct: 855 TPMRVKQFLYEKLKFNDNSNNEYSDCFYISTLLKSICNALLGRR 898
>gi|268553097|ref|XP_002634531.1| C. briggsae CBR-TAF-2 protein [Caenorhabditis briggsae]
Length = 1090
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 131/522 (25%), Positives = 243/522 (46%), Gaps = 83/522 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTL---------STKEFRHFANKVGNLERPFL 51
M++K++G F ++L +++ S + L +T F + V E P
Sbjct: 493 MIQKRIGMEPFMRVLHRLLTVGLDMSEKKLLPPAWKHLLITTDSFFRNIHSVTGKEIP-- 550
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +WV G + F +N+++N+VE+ + ++ DSE +G
Sbjct: 551 -SFLDQWVRVGGHASFAIKFDFNRKRNLVEMEIKQE----------------DSEG-NGR 592
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDGC 169
+ + G +S+ V E+DG + H I A EI CHSK ++ K G + +
Sbjct: 593 MQYTGPLSVVVQEVDGAFSHTI--QIDGAVSHAEISCHSKGRKQKKKKVPILTGEEIEID 650
Query: 170 DDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
N DA ESP+ W+R D + + EI +QP+ W L+ + DV+AQ +A
Sbjct: 651 LTNMDA----------ESPILWLRIDHDYLLIREITISQPMFHWEYMLKYERDVIAQMEA 700
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDW--AGLLHLVKF 287
+ ++ALP S + + + + +S+ + F+R+R AA+ L + + +++ G L+
Sbjct: 701 LERIQALP--SAHSRSVIVDAVSNERFFYRIRSRAAFVLTSVQNRKSEAIAVGTPVLINM 758
Query: 288 YKSRRFDENIG------LPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAVE 337
++ EN G +PR N+F +YFV+++IP A++ +R + + +
Sbjct: 759 FR-----ENFGSKAVTNIPRSNNFVVTAEKLQQYFVMQSIPQAISRLRKPSGECNEDVLP 813
Query: 338 FVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-----------LSSLLKRI 386
F+L L+K+NDN+ N YSD F+ A+L S+ + + + + ++K
Sbjct: 814 FLLDLIKFNDNSTNRYSDDFYRASLFNSLAASIYPSEYLPYRVDLPENLSNEFRVMIKEF 873
Query: 387 DRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVE 446
L D + PSY ++ I+ + L Q L+ G++ LD +L+ F + N Q+R
Sbjct: 874 TYALNMDTVNPSYGRVVGIAALSGLYQ--LQKCGYLPLDS--ELLWTFANRNLCIQMRRC 929
Query: 447 ASRALLD-LEFHCNGIDSALS-----LFIKSVEEEPSLRGQV 482
A ++D + N +D+ + L + E +PS+R +
Sbjct: 930 AITLIVDRISNDTNALDTRMEDLSRLLDLVETETDPSIRAMI 971
>gi|308810875|ref|XP_003082746.1| TFIID component TAF2 (ISS) [Ostreococcus tauri]
gi|116061215|emb|CAL56603.1| TFIID component TAF2 (ISS) [Ostreococcus tauri]
Length = 969
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 119/437 (27%), Positives = 206/437 (47%), Gaps = 50/437 (11%)
Query: 1 MLEKQMGSNFFRKILQNI----ISRAQGASPVRTLSTKE--------------FRHFANK 42
MLEK++G + +K+L+ + R + RT + E F H
Sbjct: 400 MLEKRLGPDVMQKVLKYFAGLHVRRNKKEQATRTGPSNEVLAGNARWIHTLQLFDHCRAT 459
Query: 43 VGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSN 102
V NL + + F RWV G P L +G+ ++KN++E AV + + + +
Sbjct: 460 V-NLGKGEVNSFLERWVYGAGTPKLSVGYVVKRKKNVMEFAVKLEGSAAAAAADRAALAV 518
Query: 103 TDSENRDGDIGWPGMMSIRVHELD--GMYDHPILPMAGDAWQLLEIQCHSKLAARRALKP 160
+ +++R+ E + DH ++ + +WQL+EI + RR P
Sbjct: 519 ARNHRSS--------VTVRMREENRADANDH-VVSLGQSSWQLMEIPLQPRPKDRR---P 566
Query: 161 KKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPV----QMWINQ 216
K + GD ++ ++M+ P+ ++R DPE E++ I + M
Sbjct: 567 KTI-----IESGGDP----ELIAAMDCPVRYVRVDPEFEWMGNIEQSARQVGLESMMAQM 617
Query: 217 LEKDGDVVAQAQAIAAL-EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
LEK+ D+VAQ A+ L + + S + V L+ L+ F RVR EAA AL +ASE+
Sbjct: 618 LEKEKDIVAQIVAVEFLGRRVANGSVSAVLVLDKCLNSEDTFCRVRAEAAMALGKSASEK 677
Query: 276 TDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPR-E 334
T W L L++ YK + +E G P+ NDFRD ++ V +A+ A+A V + K +
Sbjct: 678 TQWGSLHALLRHYKKYQCNERTGKPKQNDFRDLAKVIVDKAVITALAHVYDEEAKCTHAD 737
Query: 335 AVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDR 394
A++ ++ LKYN+N GNP SD ++A + ++ ++ L ++ +I ++ D
Sbjct: 738 AIDAIVNRLKYNNNEGNPQSDDGFVATCLSALARCV--PATLGELRAITAQIHYYIRRDS 795
Query: 395 LMPSYNGILTISCIRTL 411
PS + +T + IR L
Sbjct: 796 RFPSVDLCVTCAGIRAL 812
>gi|312077933|ref|XP_003141518.1| TATA binding protein associated factor [Loa loa]
Length = 1086
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 137/491 (27%), Positives = 235/491 (47%), Gaps = 77/491 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEFRHFANKVGNLERPFL 51
ML +++G F +++Q I+S A AS ++ T+ST+ F + V E P
Sbjct: 480 MLNQRLGKEPFLQVIQKILSVAMQASQLQKEPAHWQHMTVSTESFFRTVSSVTGQELP-- 537
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +W+ G + +++N+++N++EL V ++ S NR G
Sbjct: 538 -TFLEQWIHNGGHAKFWVQYAFNRKRNMIELEVKQE----------------PSANR-GR 579
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + + V ELDG + H + + + ++QCHSK RR K KK G +
Sbjct: 580 QSYVGPLIVVVQELDGSFTHTVQIDSNHSKH--DLQCHSK--GRRQ-KKKKIPLSTGEEL 634
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
D +A +D +SP+ WIR DP++ L +++ QPV W L+ + DV+AQ QA+
Sbjct: 635 EID-LANMDP----DSPVLWIRIDPDLLLLRKVNIQQPVYQWEYMLKYERDVLAQLQALD 689
Query: 232 ALEALP--HLSFNVVNTLNNFLSDSKAFWR----VRIEAAYALANTASEETD-WAG---- 280
L+ P H ++ T+ +++AF+ VR AAY LA ++ + W G
Sbjct: 690 ILQRFPSPHARAVLIETI-----ETEAFFYRQALVRCRAAYILAEVVNKLPETWLGPPAL 744
Query: 281 LLHLVKFYKSRRFDENIGLPRPNDFRDFSE----YFVLEAIPHAVAMVRAADNKSPREAV 336
++ KF+ + +P+PN+F S YF++ AIP A+A +R++ N + E
Sbjct: 745 MILFKKFFGCKSAPH---IPKPNNFVATSANLQLYFLMHAIPQALARLRSSSNGALSEVH 801
Query: 337 EFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQS--ILFLS---------SLLKR 385
F++ L+KYNDN N YSD + A+L+ S+ +S I+ +S LL
Sbjct: 802 HFLIGLIKYNDNTVNRYSDDHYRASLINSLAATLIPAESRDIIDMSPLSLDTEMRELLSE 861
Query: 386 IDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRV 445
L D L PS+ I+ I+ ++ + ++ + I LD K+ F N +R+
Sbjct: 862 FTLALNMDTLKPSFGRIIGITALQAIYH--MQRTFHIPLDP--KVFWSFAQPNMYVPMRL 917
Query: 446 EASRALLDLEF 456
A L+D+ +
Sbjct: 918 AALTFLVDMVY 928
>gi|321476028|gb|EFX86989.1| hypothetical protein DAPPUDRAFT_307126 [Daphnia pulex]
Length = 1171
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 121/480 (25%), Positives = 215/480 (44%), Gaps = 59/480 (12%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTK------EFRHFANKVGNLERPFLKEF 54
MLE ++G ++ +S A AS + + + FA + + + F
Sbjct: 479 MLELRIGQPLLIQVFNKQLSLAVQASQQKCQANQWGAMLISTATFAKAIFTVTGKDIGVF 538
Query: 55 FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW 114
WV G ++ F +N+++N VEL + +D + + ++ G +
Sbjct: 539 VDTWVRQGGHARFQLSFVFNRKRNTVELEIRQD-------------AASGAQPARGLRRY 585
Query: 115 PGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGD 174
G + + + ELDG + H + + A L S+ ++ + G + D
Sbjct: 586 VGPLVVALQELDGTFRHTLQIESSVAKNDLTCHSKSRRNKKKKIPLCTGEEVD------- 638
Query: 175 AVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+D+ + +SP+ WIR DP+M L + +QP W QL + DV AQ +A+ AL
Sbjct: 639 ----MDLSAMDADSPVLWIRLDPDMSLLRAVEISQPDYQWQYQLRHERDVTAQMEAVMAL 694
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAG----LLHLVKFY 288
E + + L + + + F++VR +AAY LA A+ + WAG L+ K +
Sbjct: 695 ERFATPATRLA--LTDIIENENCFYKVRCDAAYCLAKVANGMVSSWAGPPAMLMIFRKLF 752
Query: 289 KSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDN 348
S + R N+F +F YF+ + IP A+A +R A P+E + F+L L KYNDN
Sbjct: 753 GSFSCQHIV---RQNNFANFQHYFLQKTIPVAMAQLRNALGVCPQEVLRFLLDLFKYNDN 809
Query: 349 NGNPYSDVFWLAALVQSVGEL-------------EFGQQSILFLS-SLLKRIDRLLQFDR 394
N SD ++ AAL++++G +S+ + +L+ + R L ++
Sbjct: 810 TKNRLSDCWYRAALIEALGNTASPVVAVVAQSAASLTPESVTKETHQVLEEVARALNLEK 869
Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
L+P+Y +T+ C++ + KL L L K + + VRV A AL+++
Sbjct: 870 LLPTYKFAVTVGCLKAIR----KLQKCGQLPSKSDLFKSYAQYGQFHDVRVTALEALVEV 925
>gi|341880460|gb|EGT36395.1| hypothetical protein CAEBREN_08215 [Caenorhabditis brenneri]
Length = 833
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 123/458 (26%), Positives = 218/458 (47%), Gaps = 73/458 (15%)
Query: 1 MLEKQMGSNFFRKILQNIIS--------RAQGASPVRTLSTKE--FRHFANKVGNLERPF 50
ML+K++G F ++L +++ +A A+ L+T E FR ++ G F
Sbjct: 236 MLQKRIGLEPFMRVLHRVLTVGLDMSEKKATPAAWRHLLTTTEAFFRSVSSVTGREIPTF 295
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
L +F V T G + F +N+++NIVE+ V +D D+E +G
Sbjct: 296 LDQF----VRTGGHVAFAVKFDFNRKRNIVEIEVKQD----------------DTEG-NG 334
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDG 168
+ + G +S+ V E+DG + H I A EI CHSK ++ K G + +
Sbjct: 335 RVQYTGPLSVVVQEVDGAFSHTI--QIDGATSHAEISCHSKGRKQKKKKVPILTGEEIEI 392
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
N DA ESP+ W+R D + + EI +QP+ W L+ + DV+AQ +
Sbjct: 393 DLTNMDA----------ESPILWLRIDHDYLLIREITISQPMFHWEYMLKYEKDVIAQME 442
Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFY 288
A+ ++ALP S + + + + +S+ K F+R+R AA+ L + + +++ + V
Sbjct: 443 ALERIQALP--SAHSRSVIVDAVSNDKFFYRIRSRAAFVLTSVLNRKSEAIAVGTPVLI- 499
Query: 289 KSRRFDENIG------LPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEF 338
F EN G +PR N+F ++ +YFV++++P A++ +R + E F
Sbjct: 500 --NMFRENFGSKAATNIPRSNNFVVTAQNLQQYFVMQSLPQAISRLRKQSGECHEEVQPF 557
Query: 339 VLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-----------LSSLLKRID 387
+L L+K+NDN+ N YSD F+ A+L S+ + + + + L+K
Sbjct: 558 LLDLIKFNDNSTNRYSDDFYRASLYNSLAASVYSSEYLPYRVDLPENLNQDFRILIKEFT 617
Query: 388 RLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLD 425
L D + PS+ ++ I+ + + Q L+ G+ LD
Sbjct: 618 HALNMDTVNPSWGRVVGIAALSGIYQ--LQKCGYFPLD 653
>gi|170584208|ref|XP_001896898.1| TATA binding protein associated factor [Brugia malayi]
gi|158595723|gb|EDP34252.1| TATA binding protein associated factor, putative [Brugia malayi]
Length = 1096
Score = 143 bits (361), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 140/491 (28%), Positives = 233/491 (47%), Gaps = 77/491 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-----PVR----TLSTKEFRHFANKVGNLERPFL 51
ML +++G F +++Q I+S A AS PV T+ST+ F + V E P
Sbjct: 490 MLNQRLGKESFLQVIQKILSVAMQASQLQKEPVHWQHMTVSTESFFRTVSSVTGQELP-- 547
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
F +W+ + G + +++N+++N++EL V ++ S NR G
Sbjct: 548 -TFLEQWIHSGGHAKFWVQYAFNRKRNMIELEVKQE----------------PSANR-GR 589
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G + + V ELDG + H + + + ++QCHSK RR K KK G +
Sbjct: 590 QSYVGPLIVVVQELDGSFTHTVQIDSNHSKH--DLQCHSK--GRRQ-KKKKIPLSTGEEL 644
Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
D +A +D +SP+ WIR DP++ L ++ QPV W L+ + DV+AQ QA+
Sbjct: 645 EID-LANMDP----DSPVLWIRIDPDLLLLRKVSIQQPVYQWEYMLKYERDVLAQLQALD 699
Query: 232 ALEALP--HLSFNVVNTLNNFLSDSKAFWR---VRIEAAYALANTASE--ETDWAG---- 280
L+ P H ++ T+ F+R VR AAY LA ++ ET W G
Sbjct: 700 VLQRFPSPHARAILIETIET----EVFFYRQVLVRCRAAYILAEVVNKLPET-WLGPPAL 754
Query: 281 LLHLVKFYKSRRFDENIGLPRPNDFRDFSE----YFVLEAIPHAVAMVRAADNKSPREAV 336
++ KF+ + +P+PN+F S YF++ AIP A+A +R++ N + E
Sbjct: 755 MILFKKFFGCKSAPH---IPKPNNFVATSANLQLYFLMHAIPQALARLRSSSNNALSEVH 811
Query: 337 EFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL-------EFGQQSILFLSS----LLKR 385
F++ L+KYNDN N YSD + A+L+ S+ + S L L++ LL
Sbjct: 812 HFLIGLIKYNDNTVNRYSDDHYRASLINSLAATLVPAENRDIINMSPLSLNTEMKELLSE 871
Query: 386 IDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRV 445
L D L PS+ ++ I+ ++ + ++ + I LD K+ F N +R+
Sbjct: 872 FTLALNMDTLKPSFGRVIGITALQAIYH--MQRTFHIPLDP--KVFWSFAQPNMYVPMRL 927
Query: 446 EASRALLDLEF 456
A L+D+ +
Sbjct: 928 AALTFLIDMVY 938
>gi|346973924|gb|EGY17376.1| transcription initiation factor TFIID 127kD subunit [Verticillium
dahliae VdLs.17]
Length = 1533
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 143/514 (27%), Positives = 233/514 (45%), Gaps = 75/514 (14%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
++T+ FR F K + L+ F+ +W+ GCP + +NK+K VE+ V R V
Sbjct: 599 INTESFRRFCEKNSKSK---LESFWKQWIYGTGCPRFDVQQKFNKKKLCVEMMV-RQVQV 654
Query: 91 KPDSRTPVLSSN-----TDSENRDGDIG-----WPGMMSIRVHELDGM-YDHPILPMAGD 139
K L N + D IG + G M+IR+HE DG Y+H I+ + D
Sbjct: 655 KERDSNRALDKNDFLRLVKEKLYDVAIGEIQPWFTGPMTIRIHEADGTPYEH-IVEVRED 713
Query: 140 AWQL----LEIQCHSKL-----AARRALKPKKGSKPDG--CDDN-----GDAVAG----- 178
A + +I ++K + R+ + + DG +D+ GD +
Sbjct: 714 AAAMKMCKFDIPYNTKYKRLKRSRRQRDRAAAQTTADGEVNEDSLLYCLGDVLQSQDDIQ 773
Query: 179 ----LDMRSSMESPLS-----WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
LD + ME + WIR D + E++ + N M+++QL++D DVVAQ
Sbjct: 774 EWDLLDWPADMEKKMDQESYEWIRMDADFEWICTMKTNSESYMYVSQLQQDRDVVAQQDT 833
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA--NTASEETDWAGLLHLVKF 287
+ L P V + L L D + F +R AA L + + GL HL++
Sbjct: 834 MLYLGRSPPHPL-VSSILTRTLFDKRYFHGIRTMAASQLHRHSVPGPKDSMVGLFHLLRA 892
Query: 288 YKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
+ +PRPNDF D +Y AIP A+A VR ++ K P A F+L+ +++N+
Sbjct: 893 FTQLCCFPGTQMPRPNDFSDRKQYLAQSAIPLAIARVRDSNGKCPERARHFLLEQVQFNN 952
Query: 348 NNGNPYSDVFWLAALVQSVG---------ELEFGQQSIL-------FLSSLLKRIDRLLQ 391
N NP+SD F++A L++++ + E I+ FL + L IDR +
Sbjct: 953 NANNPFSDNFYVAKLLEALATSMIPDKRPDTEVMLDDIIRTDEEDQFLQNALTEIDRYRR 1012
Query: 392 FDRLMPSYNGILTIS---CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEAS 448
D SY+ I TI+ C R L + +G I D + ++ +D T VR++A
Sbjct: 1013 MDEWANSYHNIWTITALDCKRRLMK-----AGVIPKDG-LDFMQYLQD-ETYDLVRIKAF 1065
Query: 449 RALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQV 482
AL+DL F + + L S ++ P +R ++
Sbjct: 1066 EALVDLGFMMDDVVFPHLLRTLSTDKSPFVRHKL 1099
>gi|302416095|ref|XP_003005879.1| transcription initiation factor TFIId 127kD subunit [Verticillium
albo-atrum VaMs.102]
gi|261355295|gb|EEY17723.1| transcription initiation factor TFIId 127kD subunit [Verticillium
albo-atrum VaMs.102]
Length = 1552
Score = 143 bits (360), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 160/602 (26%), Positives = 262/602 (43%), Gaps = 105/602 (17%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
++T+ FR F K + L+ F+ +W+ GCP + +NK+K VE+ V R V
Sbjct: 599 INTESFRRFCEKNSKSK---LESFWKQWIYGTGCPRFDVQQKFNKKKLCVEMMV-RQVQV 654
Query: 91 KPDSRTPVLSSN-----TDSENRDGDIG-----WPGMMSIRVHELDGM-YDHPILPMAGD 139
K L + + D IG + G M+IR+HE DG Y+H I+ + D
Sbjct: 655 KERDSNRALDKDDFLRLVKEKLYDVAIGEIQPWFTGPMTIRIHEADGTPYEH-IVEVRED 713
Query: 140 AWQL----LEIQCHSKL-----AARRALKPKKGSKPDG--CDDN-----GDAVAG----- 178
A + +I ++K + R+ + + DG +D+ GD +
Sbjct: 714 AAAMKMCKFDIPYNTKYKRLKRSRRQRDRAAAQTTADGEVNEDSLLYCLGDVLQSQDDIQ 773
Query: 179 ----LDMRSSMESPLS-----WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
LD + ME + WIR D + E++ + N M+++QL++D DVVAQ
Sbjct: 774 EWDLLDWPADMEKKMDQESYEWIRMDADFEWICTMKTNSESYMYVSQLQQDRDVVAQQDT 833
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA--NTASEETDWAGLLHLVKF 287
+ L P V + L L D + F +R AA L + + GL HL++
Sbjct: 834 MLYLGRSPPHPL-VSSILTRTLFDKRYFHGIRTMAASQLHRHSVPGPKDSMVGLFHLLRA 892
Query: 288 YKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
+ +PRPNDF D +Y AIP A+A VR A+ K P A F+L+ +++N+
Sbjct: 893 FTQLCCFPGTQMPRPNDFSDRKQYLAQSAIPLAIARVRDANGKCPERARHFLLEQVQFNN 952
Query: 348 NNGNPYSDVFWLAALVQSVG---------ELEFGQQSIL-------FLSSLLKRIDRLLQ 391
N NP+SD F++A L++++ + E I+ FL + L IDR +
Sbjct: 953 NANNPFSDNFYVAKLLEALATSMIPDKRPDTEVMLDDIIRTDEEDQFLQNALTEIDRYRR 1012
Query: 392 FDRLMPSYNGILTIS---CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEAS 448
D SY+ I TI+ C R L + +G I D + ++ +D T VR++A
Sbjct: 1013 MDEWANSYHNIWTITALDCKRRLMK-----AGVIPKDG-LDFMQYLQD-ETYDLVRIKAF 1065
Query: 449 RALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVT 508
AL+DL F + + L S ++ P +R H++ V
Sbjct: 1066 EALVDLGFMMDDVVFPHLLRTLSTDKSPFVR----------------------HKLLKVF 1103
Query: 509 LVALLNLLESRIAFNNVFLRHHLFGILQILAGRAPTLYGVPRDKLLLL--GDGETSEQKN 566
+ L L + N +R + L G L P+D +L++ GD ET+E++
Sbjct: 1104 SIGLAGLAFGEFSKPNSAIRPQTGDMF--LDG----LADEPKDDMLVVEAGDKETTERRQ 1157
Query: 567 VF 568
+
Sbjct: 1158 EY 1159
>gi|242003978|ref|XP_002422931.1| transcription initiation factor TFIID subunit, putative [Pediculus
humanus corporis]
gi|212505824|gb|EEB10193.1| transcription initiation factor TFIID subunit, putative [Pediculus
humanus corporis]
Length = 1197
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 125/500 (25%), Positives = 222/500 (44%), Gaps = 71/500 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A AS + + + + H F + + +
Sbjct: 445 MLEHRIGQELLLQVFNKQLSLASNASQ-QKIGSGLWSHLLISTNVFHKAIFTVTGKDMSV 503
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + F +N+++N VEL + +D T N+ G
Sbjct: 504 FIDQWVRTGGHAKFSLTFVFNRKRNTVELEIRQDST-----------------NQRGIRK 546
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + + + ELDG + H L + G + +I CHSK +
Sbjct: 547 YMGPLLVTIQELDGTFKH-TLQVEGTSRA--DITCHSKSRRNKK---------------- 587
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
L ++ LS + DPE+ L + QP W QL + DV+AQ +AIA+L
Sbjct: 588 -KKIPLCTGEEVDMDLSAMELDPEITLLRSTNIEQPDYQWQYQLRHERDVIAQIEAIASL 646
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKSRR 292
E P + + L + + + +++VR +AA+ L A+ +WAG ++ + R+
Sbjct: 647 ERYPTPATRLA--LTDTIENEGTYYKVRCKAAHCLTKVANAMIANWAGPPAMLTIF--RK 702
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
+ P + N+F++F Y++ + IP A+A +R P + + F+L L KYNDN+
Sbjct: 703 LFGSFSCPHIMKQNNFQNFQHYYLQKTIPIAMAGIRNVHGICPPDVIRFLLDLFKYNDNS 762
Query: 350 GNPYSDVFWLAALVQSVGE-----LEFGQQSILF----LSS----LLKRIDRLLQFDRLM 396
N +SD ++ A+L++++ + QQ LSS +L+ I R L ++L+
Sbjct: 763 KNLFSDNYYRASLIEALAASVTPVISVVQQGSAIKAENLSSDARLILEEITRCLNLEKLL 822
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
P Y +T++C++ + KL F L L + + + VR+ A AL+D
Sbjct: 823 PCYKYTVTVACLKAIR----KLQKFGHLPNHPALFRSYAAYGQFIDVRLAALEALVDYT- 877
Query: 457 HCNGIDSALSLFIKSVEEEP 476
+G S L + E +P
Sbjct: 878 KVDGKWSDLEFLLNLAESDP 897
>gi|426201311|gb|EKV51234.1| hypothetical protein AGABI2DRAFT_182197 [Agaricus bisporus var.
bisporus H97]
Length = 1842
Score = 142 bits (359), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 84/246 (34%), Positives = 130/246 (52%), Gaps = 30/246 (12%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E++A I F+QP MW++QL++D DVVAQ +A+ AL P + V +TL
Sbjct: 697 WIRMDADFEWIANIKFDQPDFMWVSQLQRDRDVVAQLEALHALAEKP--TAIVSSTLTKT 754
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPR 301
+ S ++R+R EAA AL A D+ GL HL K + +D + +P+
Sbjct: 755 VLVSNYYYRIRCEAALALVTCAIRRLDFLGLFHLFKLFLRYCYDPEDPTQDLFTHTYVPK 814
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
PN+F D +EYFV +++ A++ VR + KSP +F + L+YNDN NPYSD ++
Sbjct: 815 PNNFSDLAEYFVRKSLITAISRVRFENGKSPSVVRQFFIDQLRYNDNTSNPYSDGLYICT 874
Query: 362 LVQSV------------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
++ + GEL E + + + IDR DRL+PS + I
Sbjct: 875 IISAAAYAAVSVAPPERGELLPAEIRSEQTAEDKELVRQTIAEIDRYRSMDRLIPSPHNI 934
Query: 403 LTISCI 408
+TI+ +
Sbjct: 935 VTIAAL 940
>gi|326472005|gb|EGD96014.1| transcription initiation factor TFIId 127kD subunit [Trichophyton
tonsurans CBS 112818]
Length = 1254
Score = 142 bits (359), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 108/416 (25%), Positives = 191/416 (45%), Gaps = 48/416 (11%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L++ F+ ++G+ + L FF
Sbjct: 503 ILERRLSKASGKATISRIISRIFLNARMGDLPNCELTSAHFQKLCERMGHTK---LDAFF 559
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + T D + E R+
Sbjct: 560 SQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQSEQTTARDLERSTFMRDVKEEIRNVYA 619
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-------------A 153
G + G M+I++HE DG Y+H + +A +I ++K A
Sbjct: 620 GTVQNVFTGSMTIQIHEADGTPYEHIV--EIKEAITKFDIPYNTKYKRLKRNRRQKERAA 677
Query: 154 ARRALKPKKGSKPD----------GCDDNGDAVAGLDMRSSMESPLS-----WIRADPEM 198
A + P ++ D D++ + ++ ES + WIR D +
Sbjct: 678 AMSGIDPSSETQDDVLLYCLGDVLQSDEDMEKWKIVEWTKEDESRMGQESYEWIRMDADF 737
Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
E++ ++ P M+++QL++D DVVAQ ++I + A + + + + D + F
Sbjct: 738 EWICKMSLGMPGYMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFH 796
Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
+R A AL A +E DW GL HL ++ N + R NDF D S Y V A+
Sbjct: 797 GIRTAATKALVKHAKQEVDWIGLFHLENAFQELYCLPNSSMARSNDFSDRSSYAVQLALI 856
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
A++ VR D ++P +F+ + LK+NDN+ N YSD F+++ L++S+ G++
Sbjct: 857 EAISEVRDEDGRTPMRVKQFLYEKLKFNDNSNNEYSDCFYISTLLKSICNALLGRR 912
>gi|448117256|ref|XP_004203211.1| Piso0_000812 [Millerozyma farinosa CBS 7064]
gi|359384079|emb|CCE78783.1| Piso0_000812 [Millerozyma farinosa CBS 7064]
Length = 1539
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 139/510 (27%), Positives = 239/510 (46%), Gaps = 64/510 (12%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVL----- 85
LST F++ KV R L+ FF +WV G PVL + +N++++IVE+++
Sbjct: 657 LSTSHFQYVCEKVN---RNKLESFFKQWVYGSGIPVLYVIQRFNRKRSIVEVSIRQIQHR 713
Query: 86 --RDCTVKPDS-RTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAW 141
R T+KP++ ++ D + + G M+IR+HE DG Y+H + G +
Sbjct: 714 EPRKNTLKPENFLNDAIAYIDDEPVYNSQSVFTGPMTIRIHEADGTPYEHIVYLKEG--Y 771
Query: 142 QLLEIQCHSKLAARR-----ALKPKKG---------SKPDGCDDNGDAVAGLDMRSSMES 187
L+IQ +++ R A +P S+ D + N D +
Sbjct: 772 SKLDIQYNTRFRRLRKNREEAYEPSTAFTGLGDVYSSEADMKEWNFAEFEKNDDENIYND 831
Query: 188 PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
W+R D + E++A+++ NQ QM+ +QL+ D V AQ AI+ + L
Sbjct: 832 TFEWMRVDADSEWIAKVNINQSDQMFASQLQYDRHVEAQYDAISFFGDRERPTNIHCTVL 891
Query: 248 NNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRD 307
L D + ++ +RI AA ALAN + E+ ++ GL +L+K YK N +P NDF +
Sbjct: 892 TRTLLDDRYYYGIRIAAARALANFSKEDNEFIGLTYLLKAYKEMFCYPNSEIPLANDFSN 951
Query: 308 FSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
+ V +P ++ VR + P +F+L L+KYNDN+ N + D +L+ L+ ++
Sbjct: 952 VGRFLVQRELPIIMSSVRDNEGNVPEMIKKFLLNLVKYNDNSNNFFQDPLYLSKLIHALC 1011
Query: 368 EL----------EFGQQSI--------LFLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
+F QQ + + + ++ I+RL + D L+PSY ++ +C++
Sbjct: 1012 SSIINPNKSIIHDFLQQYLGSSDSVNDSYANEVVMEINRLQKLDGLVPSYQSVIYTTCLK 1071
Query: 410 TLTQIALKLSGFISLDQVV--KLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSA--L 465
++AL +S ++++ L K D +R+EA R + F GI + L
Sbjct: 1072 EKVRLALHGQINVSFEELLYYTLDKYCLD------IRIEAFRGI----FLLGGIKNKNIL 1121
Query: 466 SLFIKSV---EEEPSLRGQVKLGIHAMRIC 492
S F+K + P LR Q+ L IC
Sbjct: 1122 SYFLKICLLDDSSPYLRTQL-LEAFKYSIC 1150
>gi|400598146|gb|EJP65866.1| transcription initiation factor [Beauveria bassiana ARSEF 2860]
Length = 1484
Score = 142 bits (358), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 131/465 (28%), Positives = 216/465 (46%), Gaps = 65/465 (13%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
+ K G + ++LQ +++RAQ G+ L T++F+ K G+ R L+ F+ +WV
Sbjct: 535 MNKASGGHGLTRVLQKLLTRAQIEGSDKSTILDTEKFKTLCEK-GSRYR--LETFWNQWV 591
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT--------DSENRDGD 111
GCP + +NK++ VEL + + K ++ P L + +E G+
Sbjct: 592 YGSGCPRFDVKARFNKKRLCVELTLNQVQLHK--AKKPQLEKDDFLREIKERRAEVPRGE 649
Query: 112 IG--WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPK-- 161
I + G M++R+HE DG Y+H IL + DA + + + + +L R +K K
Sbjct: 650 IQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRATKFEIPYNTKYKRLKRTRRMKEKHS 708
Query: 162 KGSKPDGCDDNGDAVAG-----------------LDMRSSMESPL-----SWIRADPEME 199
G + ++N DA+ +D E + WIR D + E
Sbjct: 709 TGQSMEIAENNDDALLYCLGDVLQTPEDQREWELIDWDPETERKMDQESYEWIRLDADFE 768
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
+ +++ M+++QL++D DVVAQ A+ L P L L L D + F
Sbjct: 769 WASDMSRTLEPYMYVSQLQQDRDVVAQQDAMLYLAQGP-LHPIASGFLVRTLIDRRYFHG 827
Query: 260 VRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
+R AA AL ++ ++ G+ L++ ++ E PRPNDF D +Y V AI
Sbjct: 828 IRTMAAAALTRQSNIKDIPLLGMRQLMRAFREMYCYEQTNQPRPNDFTDKKQYNVRCAII 887
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF------- 371
A+A VR NK P EA F+L L+ N+N GNPYSD F++A LV+S+
Sbjct: 888 KAIAEVRDHTNKCPMEARRFILDQLRLNNNEGNPYSDHFYIAFLVESLATSLIPSKQDDW 947
Query: 372 --------GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
++ FL L++I+R+L+ D SY T++ +
Sbjct: 948 LTRQNKIPNEEENAFLEDALEQIERVLRRDEWTNSYRNTWTVAGL 992
>gi|341878003|gb|EGT33938.1| hypothetical protein CAEBREN_01134 [Caenorhabditis brenneri]
Length = 652
Score = 142 bits (358), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 123/460 (26%), Positives = 220/460 (47%), Gaps = 77/460 (16%)
Query: 1 MLEKQMGSNFFRKILQNIIS--------RAQGASPVRTLSTKE--FRHFANKVGNLERPF 50
ML+K++G F ++L +++ +A A+ L+T E FR ++ G F
Sbjct: 55 MLQKRIGLEPFMRVLHRVLTVGLDMSEKKATPAAWRHLLTTTEAFFRSVSSVTGREIPTF 114
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
L +F V T G + F +N+++NIVE+ V +D D+E +G
Sbjct: 115 LDQF----VRTGGHVAFAVKFDFNRKRNIVEIEVKQD----------------DTEG-NG 153
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDG 168
+ + G +S+ V E+DG + H I A EI CHSK ++ K G + +
Sbjct: 154 RVQYTGPLSVVVQEVDGAFSHTI--QIDGATSHAEISCHSKGRKQKKKKVPILTGEEIEI 211
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
N DA ESP+ W+R D + + EI +QP+ W L+ + DV+AQ +
Sbjct: 212 DLTNMDA----------ESPILWLRIDHDYLLIREITISQPMFHWEYMLKYEKDVIAQME 261
Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDW--AGLLHLVK 286
A+ ++ALP S + + + + +S+ K F+R+R AA+ L + + +++ G L+
Sbjct: 262 ALERIQALP--SAHSRSVIVDAVSNDKFFYRIRSRAAFVLTSVLNRKSEAIAVGTPVLIN 319
Query: 287 FYKSRRFDENIG------LPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAV 336
++ EN G +PR N+F ++ +YFV++++P A++ +R + E
Sbjct: 320 MFR-----ENFGSKAATNIPRSNNFVVTAQNLQQYFVMQSLPQAISRLRKQSGECHEEVQ 374
Query: 337 EFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-----------LSSLLKR 385
F+L L+K+NDN+ N YSD F+ A+L S+ + + + + L+K
Sbjct: 375 PFLLDLIKFNDNSTNRYSDDFYRASLYNSLAASVYPSEYLPYRVDLPENLNQDFRILIKE 434
Query: 386 IDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLD 425
L D + PS+ ++ I+ + + Q L+ G+ LD
Sbjct: 435 FTHALNMDTVNPSWGRVVGIAALSGIYQ--LQKCGYFPLD 472
>gi|307104297|gb|EFN52551.1| hypothetical protein CHLNCDRAFT_138525 [Chlorella variabilis]
Length = 1744
Score = 142 bits (358), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 125/440 (28%), Positives = 203/440 (46%), Gaps = 45/440 (10%)
Query: 1 MLEKQMGSNFFRKILQNIIS---------------RAQGAS-PVRTLSTKEFRHFANKVG 44
MLE++ G FRK + ++++ A GA+ P ++ + + A G
Sbjct: 830 MLERRAGEELFRKHVGDVVAGAIAAVQAEQAAVAGTASGAAGPSQSPAARP--SPAAGAG 887
Query: 45 NLERPFLKEFFP---RWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS 101
+ F KE P RWV G P + + ++ R + +ELAV + P+S +
Sbjct: 888 S----FRKEMGPFAVRWVYGRGVPHITAAYIHHSRVSALELAVKQ---AGPESAKRAADA 940
Query: 102 NTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK 161
+ G +R E DHP+ + G+A+ L E++ + ++ + K
Sbjct: 941 AERAAVAGGSNAGIIKAVVREGEPPTSTDHPVH-LGGEAFVLTELKVTPVVEKKQPGRRK 999
Query: 162 KGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDG 221
+ + D+ + ++ E P+ W+R DP E L +I QP + QL +
Sbjct: 1000 RKVEGVEVDEAPKTIV-----ATPEEPVLWVRLDPACELLCDIRLMQPERCLAAQLLQSR 1054
Query: 222 DVVAQAQAIAALEALPHLSFNV------VNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
DVVAQA+AI L L S V+ L L+D AF+RVR +AA ALA E+
Sbjct: 1055 DVVAQAEAIQRLAELRVHSAATWRQQPGVDALRRTLADRAAFYRVRCDAALALAGLRDEQ 1114
Query: 276 TDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA 335
AGL L+ FY+S F +PR D SEYFV +A+ AV++ R D +P +
Sbjct: 1115 GQAAGLPLLLDFYRSHYFLPGSEVPRSIAIADPSEYFVAQAVATAVSLCREPDGSTPIDV 1174
Query: 336 VEFVLQLL-KYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDR 394
V F+ L +Y G P+SD +AAL S+G ++F F SLL +++ L+ D
Sbjct: 1175 VLFLSTCLNEYVSVPGAPFSDYGLVAALCASLGNVQFEG----FGDSLLTLLEKFLERDS 1230
Query: 395 LMPSYNGILTISCIRTLTQI 414
++ S + +C+ T+ +
Sbjct: 1231 VLVSPGSAVGCACLSTMAAL 1250
>gi|344232622|gb|EGV64495.1| hypothetical protein CANTEDRAFT_120010 [Candida tenuis ATCC 10573]
Length = 1370
Score = 142 bits (357), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 115/431 (26%), Positives = 205/431 (47%), Gaps = 53/431 (12%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
TLST+ F++ KV R L+ FF +WV G PV + +N++++++E+ V+R
Sbjct: 624 TLSTQHFQYLCEKVN---RNRLETFFKQWVFGSGTPVFNVSQRFNRKRSMIEV-VVRQAQ 679
Query: 90 VKPDSRT-PVLSSNTDS-----ENRDGDI---GWPGMMSIRVHELDGM-YDHPILPMAGD 139
V+ R+ P++ + D+ +N + G M+IRVHE DG+ Y+H I+ +
Sbjct: 680 VQERKRSAPLVENFIDTSIAFLDNEPAPTVQSAFLGPMTIRVHEADGVPYEH-IVELKDS 738
Query: 140 AWQLLEIQCHSKLAARRALKPKKGSKPDGCDDN-GDAVAG--------------LDMRSS 184
A +L +I +SK RR K K+ + G GD + +D
Sbjct: 739 AVKL-DINYNSKF--RRMKKHKEDAAEGGTYTRFGDIIQSPEELAAWGLQEWPPIDDEEL 795
Query: 185 MESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVV 244
+ WIR D + E++A+++ QP M+ +QL+ D DV AQ +AI
Sbjct: 796 YNNAFEWIRIDTDFEWIAKVNVRQPDYMYSSQLQYDRDVEAQYEAIRFFGNSEKPIIAYC 855
Query: 245 NTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPND 304
+L L D + ++ VRI AA LA A +E ++ G+ L+K Y N +P ND
Sbjct: 856 TSLIRTLMDPRYYYGVRIAAAEGLARIARQENNFMGMNCLIKAYTQLFCFPNSTIPLAND 915
Query: 305 FRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQ 364
F + ++ + P +R + P + +L L+ YN+N+ N + D ++++L++
Sbjct: 916 FGNIPKFMLQREFPRIFCSIRDDNGNVPFALKDIILNLVVYNENSNNSFEDSLYVSSLIE 975
Query: 365 SV-GELEFGQQSIL-------------------FLSSLLKRIDRLLQFDRLMPSYNGILT 404
S+ G + +IL F+ +++ I+R + D L+PS++ L
Sbjct: 976 SLAGSIINRNATILSNLTLPPEQDANLDPKEKEFIEVVVEEINRQQKLDELVPSHHHTLA 1035
Query: 405 ISCIRTLTQIA 415
I+C+ ++A
Sbjct: 1036 ITCLDQKIRLA 1046
>gi|448119681|ref|XP_004203792.1| Piso0_000812 [Millerozyma farinosa CBS 7064]
gi|359384660|emb|CCE78195.1| Piso0_000812 [Millerozyma farinosa CBS 7064]
Length = 1536
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 137/512 (26%), Positives = 237/512 (46%), Gaps = 68/512 (13%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVL----- 85
LST F++ KV R L+ FF +WV G P+L + +N++++IVE+++
Sbjct: 657 LSTSHFQYVCEKVN---RNKLESFFKQWVYGSGIPILYVIQRFNRKRSIVEVSIRQIQHR 713
Query: 86 --RDCTVKPDS-RTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAW 141
R T+KP++ ++ D + + G M+IR+HE DG Y+H + G +
Sbjct: 714 EPRKNTLKPENFLNDAIAYIDDEPVYNSQSVFTGPMTIRIHEADGTPYEHIVYLKEG--Y 771
Query: 142 QLLEIQCHSKLAARR-----ALKPKKG---------SKPDGCDDNGDAVAGLDMRSSMES 187
L+IQ +++ R A +P S+ D + N D +
Sbjct: 772 SKLDIQYNTRFRRLRKNREEAYEPSTAFTGLGDVYSSEADMKEWNFAEFEKNDDENIYND 831
Query: 188 PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
W+R D + E++A+++ NQ QM+ +QL+ D V AQ AI+ + L
Sbjct: 832 TFEWMRVDADSEWIAKVNINQSDQMFASQLQYDRGVEAQYDAISFFGDRERPTNIHCTVL 891
Query: 248 NNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRD 307
L D + ++ +RI AA ALAN + E+ ++ GL +L+K YK N +P NDF +
Sbjct: 892 TRTLLDDRYYYGIRIAAARALANFSKEDNNFIGLTYLLKAYKEMFCYPNSEIPLANDFSN 951
Query: 308 FSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALV---- 363
+FV +P ++ VR + P +F+L L+KYNDN+ N + D +L+ L+
Sbjct: 952 VGRFFVQHELPIIMSSVRDNEGNVPEMIKKFLLNLVKYNDNSNNFFQDPLYLSKLIHALC 1011
Query: 364 ----------------QSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISC 407
Q +G + S + + ++ I+RL + D L+PSY ++ +C
Sbjct: 1012 NSVINPNKSIIHDSLQQYLGSSDSANNS--YANEVVMEINRLQKLDGLVPSYQSVIYTTC 1069
Query: 408 IRTLTQIALKLSGFISLDQVV--KLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSA- 464
++ ++AL +S ++++ L K D +R+EA + + F GI +
Sbjct: 1070 LKEKVRLALHGQINVSFEELLYYTLDKYCLD------IRIEAFKGI----FLLGGIKNKN 1119
Query: 465 -LSLFIKSV---EEEPSLRGQVKLGIHAMRIC 492
L F+K + P LR Q+ L IC
Sbjct: 1120 ILQYFLKICLLDDSSPYLRTQL-LEAFKYSIC 1150
>gi|47213759|emb|CAF95588.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1264
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 95/297 (31%), Positives = 158/297 (53%), Gaps = 23/297 (7%)
Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ +AI
Sbjct: 561 NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEAI 619
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYK 289
+ALE P + + L + L + F++VR++A + L+ A+ + W G + K
Sbjct: 620 SALEKFPTPASRLA--LTDILEQDQCFYKVRMQACFCLSKIANAMVSTWQGPPAM-KILF 676
Query: 290 SRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
+R F + + N+F +F YF+ + +P A+A++R N P+E + F+L L+KYND
Sbjct: 677 TRMFCCKSCPNIVKTNNFINFQSYFLQKTMPVAMALLRDIQNLCPKEVLNFILDLIKYND 736
Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILF-----------LSSLLKRIDRLLQFDRLM 396
N N +SD ++ A L++++ SI + +L+ I R L ++L+
Sbjct: 737 NRKNKFSDNYYRADLIRALTNSLTPAISISNEVRTVDNLNPDVRLILEEITRYLNMEKLL 796
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
PS+ +T+SC+R + + L+ +G I D L K + + VRV A AL+D
Sbjct: 797 PSFRNTITVSCLRAIRE--LQKNGHIPSD--ASLFKSYAQYGHFVDVRVAALEALVD 849
>gi|315046148|ref|XP_003172449.1| hypothetical protein MGYG_05040 [Arthroderma gypseum CBS 118893]
gi|311342835|gb|EFR02038.1| hypothetical protein MGYG_05040 [Arthroderma gypseum CBS 118893]
Length = 1255
Score = 140 bits (354), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 108/416 (25%), Positives = 191/416 (45%), Gaps = 48/416 (11%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L++ F+ ++G+ + L FF
Sbjct: 514 ILERRLSKASGKATVSRIISRIFLNARMGDLPNCELTSAHFQKLCERMGHTK---LDAFF 570
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + T D + E R+
Sbjct: 571 NQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQVEQTTARDLERSTFMRDVKEEIRNVYA 630
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-------------A 153
G + G M+I++HE DG Y+H + +A +I ++K A
Sbjct: 631 GTVQNVFTGSMTIQIHEADGTPYEHIV--EIKEAITKFDIPYNTKYKRLKRNRRQKERAA 688
Query: 154 ARRALKPKKGSKPD----------GCDDNGDAVAGLDMRSSMESPLS-----WIRADPEM 198
A + P ++ D D++ + ++ ES + WIR D +
Sbjct: 689 AMSGIDPSSETQDDVLLYCLGDVLQSDEDMEKWKIVEWTKEDESRMGQESYEWIRMDADF 748
Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
E++ ++ P M+++QL++D DVVAQ ++I + A + + + + D + F
Sbjct: 749 EWICKMSLGMPGYMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFH 807
Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
+R A AL A +E DW GL HL +K N + R NDF D S Y V A+
Sbjct: 808 GIRTAATKALVKHAKQEVDWIGLFHLENAFKELYCLPNSSMARSNDFSDRSSYAVQLALI 867
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
A++ VR D ++P +F+ + LK+NDN+ N +SD F+++ L++S+ G++
Sbjct: 868 EAISEVRDEDGRTPMRVKQFLYEKLKFNDNSNNEFSDCFYISTLLKSICNALLGRK 923
>gi|366990875|ref|XP_003675205.1| hypothetical protein NCAS_0B07500 [Naumovozyma castellii CBS 4309]
gi|342301069|emb|CCC68834.1| hypothetical protein NCAS_0B07500 [Naumovozyma castellii CBS 4309]
Length = 1212
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 113/413 (27%), Positives = 197/413 (47%), Gaps = 53/413 (12%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P TL+ F+H +V + L FF W+ G PV R+
Sbjct: 587 RVLPKIFLQAMSGDLPNNTLTASHFQHVCERVN---KSRLGSFFQEWIYGSGVPVFRVTQ 643
Query: 72 SYNKRKNIVELAVLR-----DCTVKPDS--RTPVLSSNTDSENRDGDIGWPGMMSIRVHE 124
+N+++ +VE+ + + + D + + D+ N+ + G M+IR+HE
Sbjct: 644 RFNRKRMVVEMGIRQVDEEDHAILGKDGFFNSALRYLQNDTLNKTN--VFTGSMTIRIHE 701
Query: 125 LDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP-----DGCDDNGDAVAG 178
DG Y+H + D + +EIQ ++K R+ K +G + +G D G
Sbjct: 702 ADGTPYEHIV--EIKDVFTKIEIQYNTKY--RKMKKDTEGPRLGNVSFEGLTDISQTSDG 757
Query: 179 LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPH 238
+++ E WIR D + E+L + H NQP M+ QL++DGDV AQ A+ E +
Sbjct: 758 KELQMQNEG-FEWIRIDSDFEWLGKFHLNQPDYMYAAQLQQDGDVEAQVDALRYYEDVGV 816
Query: 239 LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG 298
S + L + D K F+ VR+EA LA + G +LV+ +K+
Sbjct: 817 ESDVYCSILTRTILDEKYFYGVRVEACKTLACSVK------GAGYLVRIFKALFCLPGCD 870
Query: 299 LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW 358
+P+ NDF +F+ YF+ + I ++ + D+++ + +F+L +L+YN N+ N Y D +
Sbjct: 871 IPKNNDFTNFTTYFLQKGIVESLGTI---DDENIQ---KFLLSILEYNSNDENKYDDTEY 924
Query: 359 LAALVQSVGELEFGQQSILFLSSLLKR--IDRLLQFDRLMPSYNGILTISCIR 409
L L+ +V + KR + RL+ ++ +PSY+ L I+ R
Sbjct: 925 LILLLNAVKD---------------KRDIVARLVNMEKWVPSYHDKLNIALQR 962
>gi|302506745|ref|XP_003015329.1| hypothetical protein ARB_06452 [Arthroderma benhamiae CBS 112371]
gi|291178901|gb|EFE34689.1| hypothetical protein ARB_06452 [Arthroderma benhamiae CBS 112371]
Length = 1265
Score = 140 bits (353), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 108/416 (25%), Positives = 194/416 (46%), Gaps = 48/416 (11%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L++ F+ ++G+ + L FF
Sbjct: 514 ILERRLSKASGKATVSRIISRIFLNARMGDLPNCELTSAHFQKLCERMGHTK---LDAFF 570
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + + +R S+ + E R+
Sbjct: 571 SQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQSEQATARDLERSTFMRDVKEEIRNVYA 630
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-------------A 153
G + G M+I++HE DG Y+H + +A +I ++K A
Sbjct: 631 GTVQNVFTGSMTIQIHEADGTPYEHIV--EIKEAITKFDIPYNTKYKRLKRNRRQKERAA 688
Query: 154 ARRALKPKKGSKPD----------GCDDNGDAVAGLDMRSSMESPLS-----WIRADPEM 198
A + P ++ D D++ + ++ ES + WIR D +
Sbjct: 689 AMSGIDPSSETQDDVLLYCLGDVLQSDEDMEKWRIVEWTKEDESRMGQESYEWIRMDADF 748
Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
E++ ++ P M+++QL++D DVVAQ ++I + A + + + + D + F
Sbjct: 749 EWICKMSLGMPGYMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFH 807
Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
+R A AL A +E DW GL HL ++ N + R NDF D S Y V A+
Sbjct: 808 GIRTAATKALVKHAKQEVDWIGLFHLENAFQELYCLPNSSMARSNDFSDRSSYAVQLALI 867
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
A++ VR D ++P +F+ + LK+NDN+ N YSD F+++ L++S+ G++
Sbjct: 868 EAISEVRDEDGRTPMRVKQFLYEKLKFNDNSNNEYSDCFYISTLLKSICNALLGRR 923
>gi|67522074|ref|XP_659098.1| hypothetical protein AN1494.2 [Aspergillus nidulans FGSC A4]
gi|40744651|gb|EAA63807.1| hypothetical protein AN1494.2 [Aspergillus nidulans FGSC A4]
gi|259486816|tpe|CBF84980.1| TPA: transcription initiation factor TFIID subunit TSM1/127kD,
putative (AFU_orthologue; AFUA_8G04950) [Aspergillus
nidulans FGSC A4]
Length = 1282
Score = 140 bits (352), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 132/542 (24%), Positives = 237/542 (43%), Gaps = 78/542 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P +++ F+ ++G+ + L FF
Sbjct: 508 ILDRRLTKASGKATMSRIISRLFLNARMGELPNGAVTSSVFQKTCERLGHAK---LDTFF 564
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRT---PVLSSNTDSENRDGDI 112
+W+ GCP + +NK+K +VE+ + + + +P +R + E R+
Sbjct: 565 NQWIYGAGCPRFQATQRFNKKKLVVEMMIKQVQSDQPTARDLERDAFMRDVKEEIRNVYA 624
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
G + G M+IR+HE DG Y+H + G + K R + ++ +
Sbjct: 625 GTIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAV 684
Query: 167 DGCDDNGDAVAGL---------------------------DMRSSMESPLSWIRADPEME 199
G D N + + + R ES WIR D + E
Sbjct: 685 IGADGNAEVQEDILLYCLGDVLQSEEEMQQWRLADWSKEDEERMGQES-YEWIRMDADFE 743
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDSKA 256
++ ++ P M+++QL++D DVVAQ Q +AA P +S + T L D +
Sbjct: 744 WICKLSLVMPGYMYLSQLQQDRDVVAQLESLQYMAAQREHPLISTIFLRT----LMDRRY 799
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
F+ +R+ AA AL A EE +W GL HL K ++ + R N+F D + Y +
Sbjct: 800 FYGIRVAAARALVKHAKEEINWLGLFHLEKAFQELFCLPGSPMTRSNNFSDRAAYVLQLV 859
Query: 317 IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS-----VGELE- 370
IP A++ VR K+P F+ LK+NDN+ N YSD F++A L++S +G +E
Sbjct: 860 IPEAISKVRDVSGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMKSLCHAMLGRVES 919
Query: 371 -------FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
F ++L + IDR + D S+ + + + + Q+
Sbjct: 920 RTDDLNNFDMDAVLEAQAEEQLEKDAIAEIDRYRRMDEWSSSFQNLYSRAALYCQMQL-- 977
Query: 417 KLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEP 476
+ I+ V++ + P+ T +R++A L++L+ + L FI ++ +P
Sbjct: 978 -MQAKIAEVDVMQFL-PYTRVGTYDLLRLDAFECLVELDIFKS--PELLRWFIFTMSNDP 1033
Query: 477 SL 478
S+
Sbjct: 1034 SI 1035
>gi|50304677|ref|XP_452294.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49641427|emb|CAH01145.1| KLLA0C02189p [Kluyveromyces lactis]
Length = 1380
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 116/447 (25%), Positives = 202/447 (45%), Gaps = 68/447 (15%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV----- 84
+LS+ F+ +V N R L+ FF +WV G P+ R+ +NK++ VE+ +
Sbjct: 617 SLSSTHFQRVCERV-NKNR--LETFFKQWVYGSGVPIFRITQRFNKKRMAVEMGIRQVQM 673
Query: 85 ----LRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGD 139
L K + + + D + + + G M+IR+HE DG Y+H + D
Sbjct: 674 QELGLGKIVGKENFNSSAMEYLQDQDKSPTQV-FTGSMTIRIHEADGTPYEHIV--ELKD 730
Query: 140 AWQLLEIQCHSKLA-------ARRALKPKKGS--------------------------KP 166
+ L++Q ++K R K K
Sbjct: 731 VFTKLDVQYNTKYKRLKRRRKVNRVTKEAKEEINEDTQEEENANEDIVLINCMGDVLVSK 790
Query: 167 DGCD-----DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDG 221
D C+ D G D++ E+ WIR D + E+++++H NQP M+ +QL++D
Sbjct: 791 DDCNKWNLTDPMTTSEGDDLQQQNEA-FEWIRIDSDFEWISKVHINQPDYMFASQLQQDR 849
Query: 222 DVVAQAQAIAALEAL----PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN---TASE 274
DV AQ +++ E + S + L D K F VR+ A AL+ A E
Sbjct: 850 DVEAQVESVRFFEDVILNGTSNSLIYSSILTRTAMDPKYFHGVRLAACRALSKFTIKAEE 909
Query: 275 ETDW-AGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPR 333
+ ++ G HLV+ ++ E+ +P+ NDF D+ +YF+ +AIP ++ V+ + P
Sbjct: 910 QQEFKGGPRHLVQIFRELFCYEDSNIPKNNDFSDYPKYFLQKAIPKLLSSVKNDEGVCPA 969
Query: 334 EAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQ-QSILFLSSLLKRIDRLLQF 392
EF+L +++YNDN N YSD F++ L++S+ + ++ ++ I R
Sbjct: 970 FVKEFLLDVVRYNDNTDNIYSDTFYITELLESLVDCSISDINDTKYIDEVMNEIQRFYNL 1029
Query: 393 DRLMPSYNGILTISCIRTLTQIALKLS 419
D+ M SY + T +T+ + LKL+
Sbjct: 1030 DQWMSSYQFLTT----KTIQKQRLKLA 1052
>gi|255945547|ref|XP_002563541.1| Pc20g10490 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211588276|emb|CAP86378.1| Pc20g10490 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1243
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 150/609 (24%), Positives = 260/609 (42%), Gaps = 94/609 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P +++ F+ ++G+ + L FF
Sbjct: 508 ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAVTSSLFQKTCERLGHAK---LDVFF 564
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD------ 109
+WV GCP + +NK+K +VE+ ++R +P + + + DS RD
Sbjct: 565 QQWVYGAGCPRFQATQRFNKKKLVVEM-MIRQVQSEPQAPRDL---HKDSFLRDVKEEVQ 620
Query: 110 ----GDIG--WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKK 162
G + + G M++R+HE DG Y+H + G +I ++K + LK K
Sbjct: 621 HVYAGPVQPVFTGSMTLRIHEADGTPYEHIVEIKEGVT--KFDIPYNTKY---KRLKRNK 675
Query: 163 GSKPDGCDDNGDAVAGLDM-----------------------------RSSMESPLSWIR 193
+ +GDA D+ R ES WIR
Sbjct: 676 KQRERAVA-SGDAEVQEDVLLYCLGDVLQTEEETQAWRLADWTKEDEERMGQES-YEWIR 733
Query: 194 ADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQA---QAIAALEALPHLSFNVVNTLNNF 250
D + E++ ++ P M+++QL++D DVVAQ Q +AA P +S V TL
Sbjct: 734 MDADFEWICKLSLGMPGYMYLSQLQQDRDVVAQLESLQYMAAQREHPLISTIFVRTL--- 790
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSE 310
DS+ F +R AA AL A +E DW GL HL K ++ + R NDF D +
Sbjct: 791 -MDSRYFHGIRTTAARALVKHAKDEVDWIGLYHLEKAFQELFCLPGSPMTRSNDFSDRAA 849
Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELE 370
Y + + IP A++ VR K+P F+ LK+NDN+ N YSD F++A+L++S+
Sbjct: 850 YILQQVIPEAISKVRDNSGKTPLRVKRFLFDKLKFNDNSNNEYSDNFYVASLMESLCHAM 909
Query: 371 FGQQSILFLSSL---------------------LKRIDRLLQFDRLMPSYNGILTISCIR 409
G+ L + DR + D S+ + + + ++
Sbjct: 910 LGKNEARTDEELDHFDMERVLEAQAEEQLDKDAIAEFDRYRRMDEWSSSFQNLYSRAALK 969
Query: 410 TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFI 469
Q L + I LD +++ + P+ T +R++A + L+DL+ N +F
Sbjct: 970 --CQAQLMQAKMIDLD-LMRFV-PYTRAGTYDLLRIQAFQCLVDLDITQNPELLRWLMFT 1025
Query: 470 KSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRH 529
S + +R Q L + + QI G E + + ++I +++ R
Sbjct: 1026 LSADSSAWIRHQ-SLRLLGQTLAQIAFGRPQEAEAAQTDGLIIEQESSTQIRRDDIARRQ 1084
Query: 530 HLFGILQIL 538
+ G ++ L
Sbjct: 1085 TIPGAIEAL 1093
>gi|302660060|ref|XP_003021714.1| hypothetical protein TRV_04172 [Trichophyton verrucosum HKI 0517]
gi|291185624|gb|EFE41096.1| hypothetical protein TRV_04172 [Trichophyton verrucosum HKI 0517]
Length = 1265
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 108/416 (25%), Positives = 194/416 (46%), Gaps = 48/416 (11%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L++ F+ ++G+ + L FF
Sbjct: 514 ILERRLSKASGKATVSRIISRIFLNARMGDLPNCELTSAHFQKLCERMGHTK---LDAFF 570
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + + + +R S+ + E R+
Sbjct: 571 SQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQSEQATARDLERSTFMRDVKEEIRNVYA 630
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-------------A 153
G + G M+I++HE DG Y+H + +A +I ++K A
Sbjct: 631 GTVQNVFTGSMTIQIHEADGTPYEHIV--EIKEAITKFDIPYNTKYKRLKRNRRQKERAA 688
Query: 154 ARRALKPKKGSKPD----------GCDDNGDAVAGLDMRSSMESPLS-----WIRADPEM 198
A + P ++ D D++ + ++ ES + WIR D +
Sbjct: 689 AMSGIDPSSETQDDVLLYCLGDVLQSDEDMEKWRIVEWTKEDESRMGQESYEWIRMDADF 748
Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
E++ ++ P M+++QL++D DVVAQ ++I + A + + + + D + F
Sbjct: 749 EWICKMSLGMPGYMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFH 807
Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
+R A AL A +E DW GL HL ++ N + R NDF D S Y V A+
Sbjct: 808 GIRTAATKALVKHAKQEVDWIGLFHLENAFQELYCLPNSSMARSNDFSDRSSYAVQLALI 867
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
A++ VR D ++P +F+ + LK+NDN+ N YSD F+++ L++S+ G++
Sbjct: 868 EAISEVRDEDGRTPIRVKQFLYEKLKFNDNSNNEYSDCFYISTLLKSICNALLGRR 923
>gi|212541210|ref|XP_002150760.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
[Talaromyces marneffei ATCC 18224]
gi|210068059|gb|EEA22151.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
[Talaromyces marneffei ATCC 18224]
Length = 1295
Score = 139 bits (351), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 144/536 (26%), Positives = 227/536 (42%), Gaps = 107/536 (19%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G +ST F K G+ + L FF
Sbjct: 511 ILERRLVKASGKPTVSRIISRLLLNNRMGDLTHGAVSTTSFIKLCEKFGHCK---LDVFF 567
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
+WV GCP +NK+K +VE+ + K P N + + D+
Sbjct: 568 QQWVFGAGCPRFTASQRFNKKKLVVEIMI------KQSQGDPQPPKNLEKSSFLRDMKEE 621
Query: 114 ------------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKP 160
+ G M+IR+HE DG Y+H + +A E+ ++K +R +
Sbjct: 622 IRSIYAAPIQPVFTGSMTIRIHEADGTPYEHIV--EIKEAITKFEVPYNTKY--KRLKRN 677
Query: 161 KKGSKPDGCDDNGDAVAGL-----------------DM--------------RSSMESPL 189
K+ + GD+ A DM R ES
Sbjct: 678 KRQKERVTHTAGGDSTAETQDDVLLYCLGDVLQTEEDMQEWRLSDWSKEEEERMGQES-Y 736
Query: 190 SWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNT 246
WIR D + E++ ++ P M+++QL++D DVVAQ Q +A +A P +S V T
Sbjct: 737 EWIRMDADFEWICKMSLGMPGYMYLSQLQQDRDVVAQLESMQYMAIQKAHPLISTIFVRT 796
Query: 247 LNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----R 301
L D + F +R AA AL A EE DW GL HL R F E LP R
Sbjct: 797 L----MDKRYFHGIRTAAANALVKHAKEELDWLGLYHL-----ERAFQELFCLPGTHVTR 847
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
NDF D + Y + +AIP A++ VR + +P F+ L+++DN+ N YSD F++A+
Sbjct: 848 DNDFSDQASYILQKAIPEAISRVRDNNGNTPLRVKRFIFDKLRFSDNSNNEYSDNFYIAS 907
Query: 362 LVQSV--------------GELEFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYN 400
L+ S+ E+EF + L ++ IDR + D S+
Sbjct: 908 LMTSLVNAMEGRKAEASYSNEMEFDMERELERQAEEKLEQDVITEIDRYRRMDEWSSSFQ 967
Query: 401 GILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
I + + + Q+ L S + LD ++ ++ R T +RVEA L++L+
Sbjct: 968 NIYSRTALH--CQLQLTKSKIMDLD-LIHFLQYTRS-GTFDMLRVEAFECLVELDL 1019
>gi|299756392|ref|XP_001829302.2| TATA-binding protein associated factor Taf2 [Coprinopsis cinerea
okayama7#130]
gi|298411655|gb|EAU92262.2| TATA-binding protein associated factor Taf2 [Coprinopsis cinerea
okayama7#130]
Length = 1794
Score = 139 bits (350), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 92/299 (30%), Positives = 146/299 (48%), Gaps = 34/299 (11%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E++A + F QP MW++QL++D DVVAQ +AI AL P + V +TL
Sbjct: 621 WIRIDADFEWIANMKFEQPDFMWVSQLQRDRDVVAQLEAIYALAEKP--TAIVSSTLTKT 678
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPR 301
+ S ++RVR EAA AL + A D+ GL HL K + +D + +P+
Sbjct: 679 VLVSNYYYRVRCEAALALVSCAIRRLDFLGLFHLFKLFLRYCYDPEDPNQDLFTHKYVPK 738
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL-- 359
PN+F DF EYFV + + A++ VR + +P +F++ L+YNDN NPY+D +++
Sbjct: 739 PNNFSDFPEYFVRKTLITAISQVRFENGNTPSVVRQFLIDQLRYNDNTSNPYTDGYYICT 798
Query: 360 -----------------AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
L+ S E + + + +DR DRL+PS + I
Sbjct: 799 IIAAAACAIISMAPPERGELMPSDTSTELNAEDLELIKQTRIEVDRYRSMDRLIPSPHNI 858
Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
+TI+ + +AL ++ I K+ P QVR+ A L +++ I
Sbjct: 859 VTIAALEFY--MALGVANLIPSHP--KVFFPLTREGNYTQVRIAAFDGLFMTKWYAPQI 913
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 16/106 (15%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST F KV ++ L+ F +W+ GCP S+N++K VE+ + ++
Sbjct: 436 LSTHAFLRTCRKVSGVD---LRSFAEQWIYGSGCPSFGFSASFNRKKMAVEITMRQEA-- 490
Query: 91 KPDSRTPVLSSNTDSENRDGDIG----WPGMMSIRVHELDGM-YDH 131
P + +E + + G M+IR+HE DG Y+H
Sbjct: 491 ------PAFKALEHNEIGKALLKPVPFFEGQMTIRIHEADGTPYEH 530
>gi|313228775|emb|CBY17926.1| unnamed protein product [Oikopleura dioica]
Length = 984
Score = 139 bits (350), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 122/458 (26%), Positives = 203/458 (44%), Gaps = 63/458 (13%)
Query: 13 KILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFS 72
K+L N R + + + +S+ F F V ++ +K+ WV G + FS
Sbjct: 432 KLLANKSGRFEKSPQI--ISSVSFLKF---VASVTEKDIKDLLDLWVRRGGVANIVASFS 486
Query: 73 YNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHP 132
+ + KN +E+ + +D R G + + G M++RV ELDG ++H
Sbjct: 487 FQRNKNTIEIEIKQD--------------------RSGSVRYVGPMTVRVQELDGTFNHT 526
Query: 133 ILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWI 192
I D + I CHS RA + K+ P NG+ V +S +SP+ WI
Sbjct: 527 I--QVEDKTNKISINCHS-----RARRNKRKKIPLL---NGEEVDIDLSQSDQDSPVLWI 576
Query: 193 RADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLS 252
R DP+ L I QP ++ + +++ Q + I L+ P + N V +L +
Sbjct: 577 RLDPDCNLLRSIELRQPDYCLQYLVKYEREIIGQLEGIEHLKDFP--TENTVKSLKEAVE 634
Query: 253 DSKAFWRVRIEAAYALANTASEETDWA----GLLHLVKFYKSRRFDENIG---LPRPNDF 305
FWRVR+ AA +A A + D LL + K Y E + L R NDF
Sbjct: 635 SLDFFWRVRVSAAEMMARIADQMPDPVPVKDALLSVFKGYFC--LSETVSKERLVRRNDF 692
Query: 306 RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
+F +YFV AIP +++ +R + +P F+ LLK+NDN+ N Y D W A L+ +
Sbjct: 693 SNFQQYFVQCAIPVSLSKIRLSSGMTPAGVFTFIANLLKHNDNSKNAYLDCQWKAYLITA 752
Query: 366 VGELEFGQQSILFLSS-------LLKRIDRLLQFDRLMPSYNG---ILTISCIRTLTQIA 415
+ LSS +++ + R++ + L+P+Y I ++ +RTL +
Sbjct: 753 LSNTLCRAVVAQKLSSVSEEAKEVIEEVMRVMNTEWLLPTYKHVTFIAALTAVRTLQRQQ 812
Query: 416 LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+ +L + F + + VRV A + L+D
Sbjct: 813 -------HIPACPELFRRFSESDQYSTVRVAAIKCLVD 843
>gi|425781951|gb|EKV19885.1| Transcription initiation factor TFIID subunit TSM1/127kD, putative
[Penicillium digitatum PHI26]
gi|425783990|gb|EKV21801.1| Transcription initiation factor TFIID subunit TSM1/127kD, putative
[Penicillium digitatum Pd1]
Length = 1240
Score = 139 bits (350), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 138/523 (26%), Positives = 237/523 (45%), Gaps = 85/523 (16%)
Query: 1 MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+L++++ + + IISR G P +++ F+ ++G+ + L FF
Sbjct: 505 ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAITSSLFQKTCERLGHAK---LDVFF 561
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD------ 109
+WV GCP + +NK+K +VE+ ++R +P P + DS RD
Sbjct: 562 QQWVYGAGCPRFQATQRFNKKKLVVEM-MIRQVQSEPQ---PPRDLDKDSFLRDVKEEVR 617
Query: 110 ----GDIG--WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAA-RRALKPK 161
G + + G M+IR+HE DG Y+H + G +I ++K +R K +
Sbjct: 618 HVYAGAVQPVFTGSMTIRIHEADGTPYEHIVEIKEGVT--KFDIPYNTKYKRLKRNKKQR 675
Query: 162 KGSKPDG-----------CDDN-------------GDAVAGLDMRSSMESPLSWIRADPE 197
+ + G C + D + R ES WIR D +
Sbjct: 676 ERAVASGDAEVQEEVLLYCLGDVLQTEEETQAWRLADWTKEDEERMGQES-YEWIRMDAD 734
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL---PHLSFNVVNTLNNFLSDS 254
E++ ++ P M+++QL++D DVVAQ +++ + A P +S V TL DS
Sbjct: 735 FEWICKLSLGMPGYMYLSQLQQDRDVVAQLESLQYMTAQREHPLISTIFVRTL----MDS 790
Query: 255 KAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVL 314
+ F +R AA AL A +E DW GL HL K ++ + R NDF D + Y +
Sbjct: 791 RYFHGIRTTAARALVKHAKDEIDWVGLHHLEKAFQELFCLPGSPMTRSNDFSDRAAYILQ 850
Query: 315 EAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS-----VGEL 369
+ IP A++ VR K+P F+ LK+NDN+ N YSD F++A+L++S +G+
Sbjct: 851 QVIPDAISKVRDNSGKTPLRVKRFLFDKLKFNDNSNNEYSDNFYVASLMKSLCHAMLGKS 910
Query: 370 E---------FGQQSILFLSS-------LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQ 413
E F + +L + + + DR + D S+ + + + ++ Q
Sbjct: 911 EARTDEDLDHFDMERVLEVQAEEQLDKDAIAEFDRYRRMDEWSSSFQNLYSRAALQ--CQ 968
Query: 414 IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
L + I LD +++ I P+ T +R+EA + L+DL+
Sbjct: 969 AQLMQAKMIDLD-LMRFI-PYTRAGTYDLLRIEAFQCLVDLDI 1009
>gi|320031839|gb|EFW13797.1| TBP-associated factor [Coccidioides posadasii str. Silveira]
Length = 1208
Score = 139 bits (349), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 125/476 (26%), Positives = 209/476 (43%), Gaps = 90/476 (18%)
Query: 58 WVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTP------VLSSNTDSENRDGD 111
WV GCP +NK+K +VE+ + + V+ + +P + E ++
Sbjct: 510 WVYGAGCPRFVATQRFNKKKLVVEMMIRQ---VQSEQSSPRDLEKSTFMRDVKEELQNIY 566
Query: 112 IG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKG 163
G + G M+IR+HE DG Y+H + +A +I ++K R + ++
Sbjct: 567 AGSLQHVFTGSMTIRIHEADGTPYEHIV--EIKEAVTKFDIPYNTKYKRLKRNKRQRERA 624
Query: 164 SKPDGCDDNGDAVAGL------DMRSSMESPL--------------------SWIRADPE 197
+ G D N +A + D+ S E L WIR D +
Sbjct: 625 AASSGVDPNSEAQDDVLLYCLGDVLQSEEEILKWRLVEWTKEDEDRMGQESYEWIRMDAD 684
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA-------IAALEALPHLSFNVVNTLNNF 250
E++ ++ P M+++QL++D DVVAQ +A +AA + + + +
Sbjct: 685 FEWICKMSLTMPGYMYLSQLQQDRDVVAQLEASNSRVGFMAAANCIQSIQYMAAQREHPL 744
Query: 251 LS--------DSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRP 302
+S DS+ F +R AA+AL A E+ DW GL HL ++ N + R
Sbjct: 745 ISTIFARTLMDSRYFHGIRTAAAHALVKHAKEDVDWIGLFHLETAFQELFCLPNSPMTRS 804
Query: 303 NDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAAL 362
N+F D S Y + AIP+A+A VR + + P F+ + LK+NDN+ N +SD +++AAL
Sbjct: 805 NNFSDRSSYALQIAIPNAIAKVRNRNGRCPMRVKRFLYEKLKFNDNSNNEFSDCYYVAAL 864
Query: 363 VQSVG--------------------ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
++S+ ELE + L+L + + I+R + D S+ +
Sbjct: 865 MRSIANALIGGKESIHTKSKLDMTQELERQAEDQLYLDCIAE-INRYKRMDEWTSSFQNL 923
Query: 403 LT---ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
T + C+R L + LD V P+ T VR++A R LLDL+
Sbjct: 924 YTRTALDCLRRL------MKEHRVLDFDVAQFLPYTRIGTFDLVRLDAFRILLDLD 973
>gi|409083644|gb|EKM84001.1| hypothetical protein AGABI1DRAFT_104011 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1854
Score = 138 bits (348), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 84/248 (33%), Positives = 130/248 (52%), Gaps = 32/248 (12%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E++A I F+QP MW++QL++D DVVAQ +A+ AL P + V +TL
Sbjct: 707 WIRMDADFEWIANIKFDQPDFMWVSQLQRDRDVVAQLEALHALAEKP--TAIVSSTLTKT 764
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPR 301
+ S ++R+R EAA AL A D+ GL HL K + +D + +P+
Sbjct: 765 VLVSNYYYRIRCEAALALVTCAIRRLDFLGLFHLFKLFLRYCYDPEDPTQDLFTHTYVPK 824
Query: 302 PNDFRDFSEYFV--LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
PN+F D +EYFV +++ A++ VR + KSP +F + L+YNDN NPYSD ++
Sbjct: 825 PNNFSDLAEYFVRKYKSLITAISRVRFENGKSPSVVRQFFIDQLRYNDNTSNPYSDGLYI 884
Query: 360 AALVQSV------------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYN 400
++ + GEL E + + + IDR DRL+PS +
Sbjct: 885 CTIISAAAYAAVSVAPPERGELLPAEIRSEQTAEDKELVRQTIAEIDRYRSMDRLIPSPH 944
Query: 401 GILTISCI 408
I+TI+ +
Sbjct: 945 NIVTIAAL 952
>gi|449299494|gb|EMC95508.1| hypothetical protein BAUCODRAFT_123936 [Baudoinia compniacensis UAMH
10762]
Length = 1237
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 139/559 (24%), Positives = 260/559 (46%), Gaps = 100/559 (17%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K G+ ++I+ I+ A G LST +F+ K+G+ + L+ FF +WV
Sbjct: 544 LLKASGTTGVQRIINKILVNATTGTLENGELSTVDFQRQCEKLGHNK---LESFFRQWVF 600
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG--DIGWP--- 115
GCP+ ++ +NK+K +VE+ +L+ ++ ++ P +N E ++ +I P
Sbjct: 601 GAGCPIFQVSQRFNKKKLVVEMTILQR-QIERQTKPPFAPNNFMREIKEHVQEIWAPETL 659
Query: 116 ----GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAA-RRALKPKKGSKPDGC 169
G M+IR+HE DG Y+H + + LEI ++K +R+ + K + +
Sbjct: 660 PVFTGPMTIRIHEADGTPYEHIV--EIKEQTTKLEIPYNTKYKRLKRSRRQKDRAMAESN 717
Query: 170 DDNG-------------------------DAVAGLDMRSSMESPLSWIRADPEMEYLAEI 204
+ G D + + R ES WIR D + E++ +I
Sbjct: 718 NGEGGEESLLYCLGDILDSPEECREWNLVDWTSEDEDRMGQES-YEWIRMDADFEWIGKI 776
Query: 205 HFNQPVQMWINQLEKDGDVVAQAQAIAALE-ALP-HLSFNVVNTLNNFLSDSKAFWRVRI 262
H P+ M+++QL++D D+VAQ ++ L A P H+SF++ L L D + F +R
Sbjct: 777 HLTMPLYMYVSQLQQDRDLVAQYDSMRYLTLANPHHVSFSI---LVRTLMDHRYFHGIRA 833
Query: 263 EAAYALA---------NTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFV 313
AA LA ++ ++E G HL+K ++ +PRPND+ + + + V
Sbjct: 834 MAADGLAILVARVDPQSSNADELRTIGQFHLMKAFEELFCFPGENMPRPNDWTNRANFIV 893
Query: 314 LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE----- 368
AIP ++A +R + K P F + L++NDN+ NP+SD ++A L+ + +
Sbjct: 894 QCAIPRSMAKLRDGEGKVPLAIRRFFMDKLRFNDNSDNPFSDCHYVATLLTCLADSLVVS 953
Query: 369 ---------LEFGQQSIL--------FLSSLLKRIDRLLQFDRLMPSYNGILTIS---CI 408
FG+ + + + I+R + D +Y+ I +++ C+
Sbjct: 954 HREPQPTYTFSFGEDEPMEPENPDEPLEAEGINAIERYRRIDEWETTYHNIYSLTALECL 1013
Query: 409 RTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSL- 467
+ LT+ + + D+ ++++ R N VRV A R L+++ G+ +S+
Sbjct: 1014 QKLTKAGM------AKDKTSEVLQYTRPANADL-VRVAAFRCLVEI-----GVARKMSMM 1061
Query: 468 --FIKSVEEEPS--LRGQV 482
+ + +EPS +RG++
Sbjct: 1062 AYLLHCIADEPSPFVRGRL 1080
>gi|294654543|ref|XP_002769993.1| DEHA2A06424p [Debaryomyces hansenii CBS767]
gi|199428965|emb|CAR65370.1| DEHA2A06424p [Debaryomyces hansenii CBS767]
Length = 1628
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 139/541 (25%), Positives = 247/541 (45%), Gaps = 60/541 (11%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV--LRDC 88
LST+ F++ KV R L FF +WV G P+ + +NK+++++E+++ ++
Sbjct: 636 LSTQHFQYVCEKVN---RNKLDTFFKQWVFGSGVPIFNISQKFNKKRSMIEMSIRQVQQS 692
Query: 89 TVKPDSRTPVLSSNTDSENRDGDIGWP------GMMSIRVHELDGM-YDHPILPMAGDAW 141
K P N + D + + G M+IR+HE DG Y+H I+ + D
Sbjct: 693 DTKKLHPNPESFINDSISHLDDEPSFSIQPVFTGPMTIRIHEADGTPYEH-IVDLK-DGH 750
Query: 142 QLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVA--------GLDMRSSMESPL---- 189
L+IQ ++K + + GD + GL+ E +
Sbjct: 751 VKLDIQYNTKFKRLKKNREDLNEPNTTFSKLGDILTTPEEMDEWGLNEWVKHEEDMLYND 810
Query: 190 --SWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
WIR D + E++A+I+ QP M+ +QL+ D DV AQ +A+ + L
Sbjct: 811 AFEWIRVDADYEWIAQINVKQPDYMFGSQLQYDRDVEAQYEAVRYFGDREKPNTLYCTAL 870
Query: 248 NNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRD 307
+ D + ++ +RI AA ALA+ + + ++ G+ +L+K +K +P NDF +
Sbjct: 871 TRTVMDDRYYYGLRIAAAQALADISKPDNNFIGIGYLIKIFKELYCFPGSSIPLSNDFNN 930
Query: 308 FSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
F+ +F+ + IP ++ VR + + P +L L+K+N+N+ N + D F+++ L+ S+
Sbjct: 931 FNRFFLQKQIPQILSEVRDENGEVPFVIKNLLLNLVKFNENSNNNFQDCFYMSDLITSLT 990
Query: 368 ELEFGQQSIL---------------------FLSSLLKRIDRLLQFDRLMPSYNGILTIS 406
+S L FL ++ I+RL + D +PS+ I+ I+
Sbjct: 991 TSALNAKSALTSQDPLNLLDTSDSINSEKQKFLQDVITEINRLQKLDEWVPSHQSIIAIT 1050
Query: 407 CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALS 466
CI+ ++I L G + L L + + +R+EA R LL L N S L
Sbjct: 1051 CIK--SKIRLATHGLLQLSFEDLLYYTLEKYP--FDIRIEAFRGLLILGGLKNA--SVLQ 1104
Query: 467 LFIKSV---EEEPSLRGQ-VKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAF 522
++K P RG+ + L I+A+ + I G + + + TL L + SR+A
Sbjct: 1105 YYLKVCLLGATTPYFRGKLIDLLINAICVAAIDGTPSTLDDPEFKTLEKLFD-ENSRMAN 1163
Query: 523 N 523
N
Sbjct: 1164 N 1164
>gi|452979536|gb|EME79298.1| hypothetical protein MYCFIDRAFT_34831, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 1197
Score = 137 bits (346), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 134/524 (25%), Positives = 239/524 (45%), Gaps = 88/524 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K GS +I+ I A+ GA LST + + K+G+ + L+ FF +WV
Sbjct: 525 LVKSSGSTGVMRIINKIFLNAKTGALTNCELSTIDLQRTCEKLGHNK---LESFFKQWVL 581
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG--DIGWP--- 115
GCP+ + +NK+K +VE+ +++ ++ ++ +N E ++ ++ P
Sbjct: 582 GSGCPIFHVSQRFNKKKLVVEMTIVQR-QLERQTKPEFAPNNFMREIKEHVQEVWAPEIQ 640
Query: 116 ----GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAA-RRALKPKKGSKPDGC 169
G M+IR+HE DG Y+H + + LEI ++K +R+ + K+ + +G
Sbjct: 641 TVFSGPMTIRIHEADGTPYEHIV--EIKEPITKLEIPYNTKYKRLKRSRRAKERAVAEGA 698
Query: 170 --DDNGDAV----------------------AGLDMRSSMESPLSWIRADPEMEYLAEIH 205
++ DA+ + D + WIR D + E++ ++H
Sbjct: 699 AGEEGSDALLYCLGDILDNEQDQQEWNLVDWSAEDEDKMGQESYEWIRMDADFEWIGKMH 758
Query: 206 FNQPVQMWINQLEKDGDVVAQAQAIAAL--EALPHLSFNVVNTLNNFLSDSKAFWRVRIE 263
P+ M+I+QL++D D+VAQ +++ L H+S + L L D + F+ +R+
Sbjct: 759 LVMPLYMYISQLQQDRDLVAQYESMRYLLGSNPHHVSLTI---LVRTLMDERYFYGIRVM 815
Query: 264 AAYALANTASEETDWAGLLHLVKFYKSRRFDENI-GLPRPNDFRDFSEYFVLEAIPHAVA 322
AA L+ A ++ G LHL K + D +I +PRPNDF + + AIP A++
Sbjct: 816 AAEGLSLMAKDKLTEIGTLHLTKAFAHFFCDGDIKSMPRPNDFSSRVSFIMQCAIPRAMS 875
Query: 323 MVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE-------------- 368
+R + K P + F + L YNDN+ N YSD ++A L+ + +
Sbjct: 876 KLRDIEGKVPNKVQRFFVDKLIYNDNSDNAYSDCHYVATLMTCLTDALVASHRIIEPKQT 935
Query: 369 ----------LEFGQQSI----LFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQI 414
+E Q+ I F + I+R + D + +Y I + + I L +
Sbjct: 936 FEFNFDDDMGMEEDQEPINPNAAFEQEAVNVIERYRRLDEWIVTYQNIYSTTAIECLQK- 994
Query: 415 ALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHC 458
L +G + D+ +L++ RD +T VR++A FHC
Sbjct: 995 -LTKAGVVK-DKAKELLQYTRD-STADNVRIQA--------FHC 1027
>gi|378732524|gb|EHY58983.1| transcription initiation factor TFIID subunit D2 [Exophiala
dermatitidis NIH/UT8656]
Length = 1299
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 110/409 (26%), Positives = 188/409 (45%), Gaps = 48/409 (11%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVR-TLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
+ K G++ IL I++RA+ V LST+ F+ A ++G+++ + +F +WV
Sbjct: 517 IAKTQGASKMPGILAKILTRARTGDLVNNALSTELFQKLAERMGHMK---IDDFLNQWVK 573
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD--------GDI 112
GCP + +NK+K +VE+ + + R + S D
Sbjct: 574 GAGCPRFQAFQRFNKKKLVVEMLIKQVQGTTTTERELEIDSFMRDAREDFQSVYAAPAQP 633
Query: 113 GWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-----AARRALKPKKGSKP 166
+ G M+IR+HE DG Y+H + DA +I ++K + ++ + +
Sbjct: 634 VFTGPMTIRIHEADGTPYEHIV--EIKDASTTFDIPYNTKYKRLKRSRKQRARATAKAAA 691
Query: 167 DGCDDNGDAV----------------------AGLDMRSSMESPLSWIRADPEMEYLAEI 204
+G D+ D++ +G D WIR D + E++ I
Sbjct: 692 EGGDEENDSLVYCLGDVLQSEEEIANWKITEWSGEDEEKMNAESYEWIRLDADFEWICRI 751
Query: 205 HFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEA 264
+ P M+ +QL++D DVVAQ +A+ + P V + L D + F VR A
Sbjct: 752 NLQMPGYMFTSQLQQDRDVVAQLEALRHIAVYPPSPL-VSSIFVRTLMDRRYFHGVRAMA 810
Query: 265 AYALANTASEETDWA--GLLHLVKFYKSR---RFDENIGLPRPNDFRDFSEYFVLEAIPH 319
A+ L A +E D GL HL K ++ R ++ + RPNDF D Y + I
Sbjct: 811 AHCLVKHAKDEGDAKNIGLFHLKKAFEELYCVRDEQGSAMTRPNDFSDQLSYMMQCEIVG 870
Query: 320 AVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE 368
A+ VR + +P+E EF+L L++NDN+ N YSD F++A L++++ +
Sbjct: 871 AIGRVRDSRGHTPKEVKEFLLDKLRFNDNSVNEYSDAFYVARLMKALAQ 919
>gi|296806237|ref|XP_002843928.1| transcription initiation factor TFIId 127kD subunit [Arthroderma
otae CBS 113480]
gi|238845230|gb|EEQ34892.1| transcription initiation factor TFIId 127kD subunit [Arthroderma
otae CBS 113480]
Length = 1265
Score = 137 bits (344), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 111/417 (26%), Positives = 191/417 (45%), Gaps = 50/417 (11%)
Query: 1 MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
+LE+++ + + IISR G P L++ F+ ++G+ + L FF
Sbjct: 514 ILERRLSKASGKATVSRIISRIFLNARMGDLPNCELTSAYFQKLCERMGHTK---LDAFF 570
Query: 56 PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
+WV GCP +NK+K +VE+ + + T +R S+ + E R+
Sbjct: 571 NQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQTEHMSARDLERSTFMRDVKEEIRNIYA 630
Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGS 164
G + G M+I++HE DG Y+H + +A +I ++K R + ++ +
Sbjct: 631 GTVQHVFTGSMTIQIHEADGTPYEHIV--EIKEAVTKFDIPYNTKYKRLKRNRRQKERAA 688
Query: 165 KPDGCDDNGDAVAGL-------------DM--------------RSSMESPLSWIRADPE 197
G D N +A + DM R ES WIR D +
Sbjct: 689 AMSGIDPNSEAQDDVLLYCLGDVLQSDEDMEKWKLVEWTKEDESRMGQES-YEWIRMDAD 747
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
E++ ++ P M+++QL++D DVVAQ ++I + A + + + D + F
Sbjct: 748 FEWICKMSLGMPGYMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISTIFVRTILDRRYF 806
Query: 258 WRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAI 317
+R A AL A +E DW GL HL ++ N + R NDF D + Y V A+
Sbjct: 807 HGIRTAATKALVKHAKQEVDWIGLFHLENAFQELYCLPNSTMVRSNDFSDRASYAVQLAL 866
Query: 318 PHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
A++ VR D K+P +F+ LK+NDN+ N +SD F+++ L++S+ G++
Sbjct: 867 IEAISEVRDDDGKTPVRVKQFLYDKLKFNDNSNNEFSDCFYISTLLKSICNALLGKR 923
>gi|322703910|gb|EFY95511.1| transcription initiation factor TFIID subunit TSM1, putative
[Metarhizium anisopliae ARSEF 23]
Length = 1457
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 144/519 (27%), Positives = 233/519 (44%), Gaps = 78/519 (15%)
Query: 1 MLEKQM----GSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEF 54
+L+K+M G + +ILQ +++AQ S R+ L T++FR K G+ R L+ F
Sbjct: 542 ILDKRMIKASGGHGLTRILQKFLTKAQIESSDRSTILDTEKFRAVCEK-GSRYR--LEAF 598
Query: 55 FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSN----TDSENRDG 110
+ +WV GCP + +NK++ VEL L + RT L + E R G
Sbjct: 599 WNQWVYGSGCPRFEVKAKFNKKRLCVEL-TLNQIQHQAAKRTGGLEKDDFLRVVKERRAG 657
Query: 111 -DIG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRAL 158
+G + G M++R+HE DG Y+H IL + DA + + + + +L R +
Sbjct: 658 LKLGEVQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRM 716
Query: 159 KPKKGSK-----PDGCDDN-----GDAVAGLD--------------MRSSMESPLSWIRA 194
K K + DD GD + D R + WIR
Sbjct: 717 KEKHSGGGSMEITENADDALLYCLGDVLQTPDDHRDWELIEWDPETERKMDQESYEWIRV 776
Query: 195 DPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDS 254
D + E+ ++ M+++QL++D DVVAQ A+ L P L L L D
Sbjct: 777 DADFEWACDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTHGP-LHPIASGFLVRTLIDR 835
Query: 255 KAFWRVRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFV 313
+ F +R AA AL ++ + GL L++ ++ E P PNDF D +Y V
Sbjct: 836 RYFHGIRTMAAEALPRQSNIKGIPLLGLRQLMRAFREMFCYEQTNQPIPNDFTDKKQYNV 895
Query: 314 LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF-- 371
AI A++ VR + P EA +F+L L +N+N NPYSD F++A LV+++
Sbjct: 896 RRAIIKAISEVRDHTYRCPLEARQFILNQLLFNNNEDNPYSDHFYIATLVEALATSLIPS 955
Query: 372 -------------GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI---RTLTQIA 415
++ LFL +++I+R+L+ D SY + TI+ + + L +
Sbjct: 956 KKDDWLSRQSKAPDEEERLFLEKAMEQIERVLRRDEWTNSYQNVWTIAGLDAKQRLMKAE 1015
Query: 416 LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+ + Q L+ RD +R++A AL+DL
Sbjct: 1016 VIPKSYPDFAQY--LLDGSRDL-----IRIKAFEALIDL 1047
>gi|254569566|ref|XP_002491893.1| TFIID subunit (150 kDa), involved in RNA polymerase II
transcription initiation [Komagataella pastoris GS115]
gi|238031690|emb|CAY69613.1| TFIID subunit (150 kDa), involved in RNA polymerase II
transcription initiation [Komagataella pastoris GS115]
Length = 1571
Score = 135 bits (340), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 183/388 (47%), Gaps = 58/388 (14%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV------ 84
LST F+H KV + + L FF WV G PVLR+ +NK++ +E+ +
Sbjct: 609 LSTSHFQHVCEKVAHHK---LDPFFNNWVHNSGTPVLRVTQRFNKKRMFIEMGIRQVQGY 665
Query: 85 --LRDCTVKPDSRTPV--LSSNTDSENRDG----DIGWPGMMSIRVHELDGM-YDHPILP 135
R K + PV L+ N+ G + G M+IR+HE DG Y+H +
Sbjct: 666 ELARSDASKSRKKDPVSFLNEACQHVNQTGPGITSQIFTGPMTIRIHEADGTPYEHIVDL 725
Query: 136 MAGDAWQLLEIQCHSKLAARRALK-----PKKGSKPDGCDDNGDAVA-GLDMR------- 182
G + L+IQ ++K + K KK + + GD ++ DM+
Sbjct: 726 KEG--FTKLDIQYNTKYKRIKKNKKDEELTKKDDSENTVNRLGDILSRSKDMQEWHLEDL 783
Query: 183 ------SSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEA- 235
+ + WIR D + E++ +IH NQP M+ +QL++D DV AQ ++
Sbjct: 784 SAEGEEARTQDAFEWIRIDADFEWICQIHLNQPDYMYQSQLQQDRDVEAQLDSVNFFSNS 843
Query: 236 -LPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVK------FY 288
P++ ++ V L L DS+ F+ VR EAA LA + EE + GL HL+K Y
Sbjct: 844 LRPNVFYSSV--LVRTLMDSRYFYGVRCEAAKGLARLSKEENNHIGLHHLLKALKQLFCY 901
Query: 289 KSRRFDENIG---------LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFV 339
E+ LP PNDF DF F+ E IP A++ ++ D+ +
Sbjct: 902 PMHDSHEDATTLINNPKNFLPLPNDFSDFQRMFIQETIPAALSTIKIKDSDLFLNLRRIL 961
Query: 340 LQLLKYNDNNGNPYSDVFWLAALVQSVG 367
L+LL+YNDN N ++D F++ +L++++
Sbjct: 962 LRLLQYNDNLSNDFNDCFYICSLIRALS 989
>gi|340517026|gb|EGR47272.1| predicted protein [Trichoderma reesei QM6a]
Length = 1532
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 128/462 (27%), Positives = 207/462 (44%), Gaps = 63/462 (13%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
L K G + +ILQ ++RAQ G+ L T++FR K L+ F+ +WV
Sbjct: 565 LSKASGGHGLTRILQKQLTRAQIEGSDKFLILDTEKFRATCEKAAKYR---LESFWNQWV 621
Query: 60 GTCGCPVLRMGFSYNKRKNIVEL-------AVLRDCTVKPDSRTPVLSSNTDSENRDGDI 112
GCP + +NK++ VEL ++ + D V+ + + G+I
Sbjct: 622 YGSGCPRFDVKAKFNKKRLCVELMLIQVQHQTIKKMELSKDQFLRVVRERR-AHLKTGEI 680
Query: 113 G--WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKKG- 163
+ G M++R+HE DG Y+H IL + DA + + + + +L R +K K
Sbjct: 681 QPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEKHNV 739
Query: 164 ----SKPDGCDDN-----GDAVAGLD--------------MRSSMESPLSWIRADPEMEY 200
P+ DD GD + + R + WIR D + E+
Sbjct: 740 GASMEIPENTDDALLYCLGDVLQSPEDLRDWELIEWDPETERKMDQESYEWIRVDADFEW 799
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRV 260
++ M+++QL++D DVVAQ A+ L P L L L D + F +
Sbjct: 800 ACSMNRTLEPYMYVSQLQQDRDVVAQQDAMLYLSTGP-LHPIASGFLVRTLVDRRYFHGI 858
Query: 261 RIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPH 319
R AA AL A+ ++ GL L++ Y+ + P+PNDF D +Y V +I
Sbjct: 859 RTMAAQALTRQANIKDIPMLGLRQLMRAYREMFCYDQTNQPQPNDFSDKRQYNVRCSIIR 918
Query: 320 AVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV--------GELEF 371
A+A VR P EA +F+L L +N+N NP+SD ++A LV+++ + F
Sbjct: 919 AIAEVRDGAGYCPVEARQFILDQLLFNNNENNPFSDHHYIAVLVEALATSLIPTQADEWF 978
Query: 372 GQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTIS 406
+Q + FL L++I+R+L+ D SY I TI+
Sbjct: 979 KRQVKIPSDEERHFLDQALEQIERVLRRDEWTSSYQNIWTIA 1020
>gi|342883575|gb|EGU84038.1| hypothetical protein FOXB_05458 [Fusarium oxysporum Fo5176]
Length = 1499
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 145/517 (28%), Positives = 232/517 (44%), Gaps = 81/517 (15%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
L K G + +IL ++++ Q S R L T++FR K L+ F+ +WV
Sbjct: 551 LIKASGGHGLTRILSKLLTKVQIESSDRATILETEKFRTTCEKGAKYR---LESFWSQWV 607
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT--------DSENRDGD 111
GCP + +NK++ VEL L + ++ P L N S + G+
Sbjct: 608 YGSGCPRFDVKAKFNKKRLCVEL-TLNQIQFQ-TAKKPPLDKNDFLRVVKERRSGVKPGE 665
Query: 112 IG--WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKK- 162
+ + G M++R+HE DG Y+H IL + DA + + + + +L R +K K+
Sbjct: 666 VQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEKQN 724
Query: 163 -GSKPDG--CDDN-----GDAVAGLDMRSSME--------------SPLSWIRADPEMEY 200
G+ D DD GD + D ++ E WIR D + E+
Sbjct: 725 VGASMDAENLDDALLYCLGDVLQTPDEQAQWELIDWDPETERKMDQESYEWIRVDADFEW 784
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRV 260
++ M+++QL++D DVVAQ A+ L P L L L D + F +
Sbjct: 785 SCDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTHGP-LHPIASGFLTRTLVDRRYFHGI 843
Query: 261 RIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPH 319
R AA AL A+ ++ GL L+K + + P+PNDF D +Y V AI
Sbjct: 844 RTMAAEALPRQANIKDLSMLGLRQLMKAFSEMFCHKGTNQPKPNDFSDKKQYNVQCAIIK 903
Query: 320 AVAMVRAADN-KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------ELE 370
A+A VR A K P +A +F+L L +N+N NPYSD F++A LV+++ +
Sbjct: 904 AIAQVRDAHTYKCPLDARQFILDQLLFNNNEDNPYSDHFYIATLVEALATSLIPSKQDDW 963
Query: 371 FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFIS 423
F Q+ + FL +++I+R+L+ D SY + TI+ LS
Sbjct: 964 FAMQNKIPNEDEKQFLERAIEQIERVLRRDEWTHSYQNVWTIAG----------LSAKQR 1013
Query: 424 LDQVVKLIKPFRDF------NTIWQVRVEASRALLDL 454
L + + K + DF T +R++A AL+DL
Sbjct: 1014 LMKAEVIPKKYSDFGQYLLDGTRDLIRIKAFEALIDL 1050
>gi|380478690|emb|CCF43451.1| transcription initiation factor TFIId 127kD subunit [Colletotrichum
higginsianum]
Length = 702
Score = 134 bits (338), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 107/375 (28%), Positives = 176/375 (46%), Gaps = 45/375 (12%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
++T+ F+ K R L+ F+ +WV GCP + +NK++ VE+ + R
Sbjct: 332 INTESFQRVCEKNA---RNKLEAFWAQWVYGSGCPRFDVFQRFNKKRLCVEMTI-RQVQD 387
Query: 91 KPDSRTPVLSSN---------TDSENRDGDIG-WPGMMSIRVHELDGM-YDHPILPMAGD 139
+ S + VL + D +G + G M+IR+HE DG Y+H I+ + D
Sbjct: 388 RALSESEVLKKTEFLRAIKEKANGVQVDEAMGLFTGPMTIRIHEADGTPYEH-IVEIRED 446
Query: 140 AWQLLEIQC---------HSKLAARRALKPKKGSKPDGCDDN-----GDAVAG-LDMR-- 182
A + + + K A L + + PD ++ GD + DMR
Sbjct: 447 AAKAAKFEIPYNTKYKRLKRKRRATEKLSAQSAADPDNNEETLLYCLGDVLTNPEDMRKW 506
Query: 183 -----------SSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
+ + WIR D + E+ E+ N M+++QL++D DVVAQ ++
Sbjct: 507 EFSEWEPEIEKAMDQESYEWIRMDADFEWACEMKTNLQSYMYVSQLQQDRDVVAQQDSML 566
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR 291
L+ P V L L D + F +R AA L + A+E+ AG+ HL+K +
Sbjct: 567 YLKKSPPHPL-VSTFLTRTLVDRRYFHGIRTMAADMLYHHATEKVAMAGMTHLLKAFTEL 625
Query: 292 RFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGN 351
+ PRPNDF D +Y V A+P A+A +R A+ + P A +L +++N+N N
Sbjct: 626 FCYPDSATPRPNDFTDKKQYLVQSALPMAIAKIRDANGRCPEPARRLLLNQVQFNNNANN 685
Query: 352 PYSDVFWLAALVQSV 366
PYSD F++A L+++V
Sbjct: 686 PYSDHFYVAKLLEAV 700
>gi|389637227|ref|XP_003716252.1| TATA-binding protein associated factor Taf2 [Magnaporthe oryzae
70-15]
gi|351642071|gb|EHA49933.1| TATA-binding protein associated factor Taf2 [Magnaporthe oryzae
70-15]
Length = 1579
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 136/536 (25%), Positives = 242/536 (45%), Gaps = 87/536 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRAQG-ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
+ K+ N R+++ ++SR+ AS VR +S++ FR KV L+ F+ +WV
Sbjct: 519 MTKKASHNGVRRVVLRLVSRSNTEASDVRFISSESFRKQCEKVHKGAP--LELFWQQWVQ 576
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT------------DSENR 108
+ GC M YNK+ + V+L V R K P + N ++
Sbjct: 577 SAGCARFDMSQRYNKKHSRVDLTV-RQPEEKSQLPQPPFNKNNFWGVLQDGRHVPAAKKE 635
Query: 109 DGDIGWPGMMSIRVHELDGM-YDH--PILPMAGDAWQLLEIQCHSKLAA-RRALKPKKGS 164
+ + + G ++I +HE +G Y+H I ++ +E+ ++K +R + K+ +
Sbjct: 636 NVSVWFTGPITIGIHEANGTPYEHVAEIQRTGPRQFKAIEMPYNTKYKRLKRTRRQKERA 695
Query: 165 KP-----DGCDDNGDAVA----------------GL------DMRSSMESPLSWIRADPE 197
+ +G + +AV GL +++ + W+R D +
Sbjct: 696 QALVQAGEGGEQAEEAVLYSLGDVLQSPEEEKEWGLADWTPEQVKAMEQDSFEWLRIDAD 755
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
E++ + N M+++QL++D DVVAQ +++ L + L L D++ +
Sbjct: 756 FEWICSMKTNMQPWMYVSQLQQDRDVVAQQESVLWLSQKAGGEI-LATVLGRTLMDTRYY 814
Query: 258 WRVRIEAAYALAN---TASEE--------------TDWAGLLHLVKFYKSRRFDENIGLP 300
+R +A AL T EE TD GL L+K +K+ + P
Sbjct: 815 HGIRTQAIKALQKQVWTKEEESLDPDSGIVNKLKKTDAGGLRILMKAFKAFFCYPDSQTP 874
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
RPNDF + +Y V AIP A+A VR + PREA +F+L+ L++N+N GN YSD F++A
Sbjct: 875 RPNDFSNKMQYAVQRAIPEAIARVREK-GRCPREAQQFLLEQLQFNNNTGNEYSDHFYVA 933
Query: 361 ALVQSVGELEFGQQSI-----------------LFLSSLLKRIDRLLQFDRLMPSYNGIL 403
L+ ++ S+ FL + L+ I+R L+ D SY I
Sbjct: 934 GLLDALATSLIPDNSVETVTLRNSDDPETALDEEFLEAALQMIERYLRMDEWTNSYQNIW 993
Query: 404 TISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCN 459
T++ + + L +G I + + ++ +D +T+ + ++A AL+DL F N
Sbjct: 994 TVTALGCKQK--LMKAGVIPIS-AIDFVRYLQD-STLDLISIKAFEALVDLGFLFN 1045
>gi|328351608|emb|CCA38007.1| Transcription initiation factor TFIID subunit 2 [Komagataella
pastoris CBS 7435]
Length = 2310
Score = 134 bits (337), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 115/388 (29%), Positives = 183/388 (47%), Gaps = 58/388 (14%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV------ 84
LST F+H KV + + L FF WV G PVLR+ +NK++ +E+ +
Sbjct: 1348 LSTSHFQHVCEKVAHHK---LDPFFNNWVHNSGTPVLRVTQRFNKKRMFIEMGIRQVQGY 1404
Query: 85 --LRDCTVKPDSRTPV--LSSNTDSENRDG----DIGWPGMMSIRVHELDGM-YDHPILP 135
R K + PV L+ N+ G + G M+IR+HE DG Y+H +
Sbjct: 1405 ELARSDASKSRKKDPVSFLNEACQHVNQTGPGITSQIFTGPMTIRIHEADGTPYEHIVDL 1464
Query: 136 MAGDAWQLLEIQCHSKLAARRALK-----PKKGSKPDGCDDNGDAVA-GLDMR------- 182
G + L+IQ ++K + K KK + + GD ++ DM+
Sbjct: 1465 KEG--FTKLDIQYNTKYKRIKKNKKDEELTKKDDSENTVNRLGDILSRSKDMQEWHLEDL 1522
Query: 183 ------SSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEA- 235
+ + WIR D + E++ +IH NQP M+ +QL++D DV AQ ++
Sbjct: 1523 SAEGEEARTQDAFEWIRIDADFEWICQIHLNQPDYMYQSQLQQDRDVEAQLDSVNFFSNS 1582
Query: 236 -LPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVK------FY 288
P++ ++ V L L DS+ F+ VR EAA LA + EE + GL HL+K Y
Sbjct: 1583 LRPNVFYSSV--LVRTLMDSRYFYGVRCEAAKGLARLSKEENNHIGLHHLLKALKQLFCY 1640
Query: 289 KSRRFDENIG---------LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFV 339
E+ LP PNDF DF F+ E IP A++ ++ D+ +
Sbjct: 1641 PMHDSHEDATTLINNPKNFLPLPNDFSDFQRMFIQETIPAALSTIKIKDSDLFLNLRRIL 1700
Query: 340 LQLLKYNDNNGNPYSDVFWLAALVQSVG 367
L+LL+YNDN N ++D F++ +L++++
Sbjct: 1701 LRLLQYNDNLSNDFNDCFYICSLIRALS 1728
>gi|46135781|ref|XP_389582.1| hypothetical protein FG09406.1 [Gibberella zeae PH-1]
Length = 1505
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 130/466 (27%), Positives = 214/466 (45%), Gaps = 67/466 (14%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
L K G + +IL ++++ Q G+ L T++FR K L+ F+ +WV
Sbjct: 551 LIKASGGHGLTRILSKMLTKVQIEGSDKATILETEKFRATCEKGAKYR---LESFWNQWV 607
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAV--LRDCTVKPDSRTPVLSSN---TDSENRDGDIG- 113
GCP + +NK++ VEL + ++ VK + P+ + E R G
Sbjct: 608 YGSGCPRFDVKAKFNKKRLCVELTLNQIQSMMVK---KAPLEKEDFLRVVKERRSGVKAG 664
Query: 114 -----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKK 162
+ G M++R+HE DG Y+H IL + DA + + + + +L R +K K+
Sbjct: 665 EVQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEKQ 723
Query: 163 --GSKPDG--CDDN-----GDAVAGLDMRSSME--------------SPLSWIRADPEME 199
G+ D DD GD + + ++ E WIR D + E
Sbjct: 724 NVGASMDAENLDDTLLYCLGDVLQTPEEQAQWELIDWDPETERKMDQESYEWIRVDADFE 783
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
+ ++ M+++QL++D DVVAQ A+ L P L L L D + F
Sbjct: 784 WSCDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTQGP-LHPIASGFLTRTLVDRRYFHG 842
Query: 260 VRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
+R AA AL A+ ++ GL L+K + + P+PNDF D +Y V AI
Sbjct: 843 IRTMAAEALPRQANIKDLSMLGLRQLMKAFSEMFCHKGTNQPKPNDFSDKRQYNVQCAII 902
Query: 319 HAVAMVRAADN-KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------EL 369
A+A VR A+ K P EA +F+L L +N+N NP+SD F++A LV+++ +
Sbjct: 903 KAIAQVRDANTYKCPLEARQFILDQLLFNNNEDNPFSDHFYIATLVEALATSLIPSKQDD 962
Query: 370 EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
F Q+ + FL +++I+R+L+ D SY + TI+ +
Sbjct: 963 WFAMQNKIPNEEEKQFLEKAIEQIERVLRRDEWTHSYQNVWTIAGL 1008
>gi|408392483|gb|EKJ71837.1| hypothetical protein FPSE_07938 [Fusarium pseudograminearum CS3096]
Length = 1505
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 130/466 (27%), Positives = 214/466 (45%), Gaps = 67/466 (14%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
L K G + +IL ++++ Q G+ L T++FR K L+ F+ +WV
Sbjct: 551 LIKASGGHGLTRILSKMLTKVQIEGSDKATILETEKFRATCEKGAKYR---LESFWNQWV 607
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAV--LRDCTVKPDSRTPVLSSN---TDSENRDGDIG- 113
GCP + +NK++ VEL + ++ VK + P+ + E R G
Sbjct: 608 YGSGCPRFDVKAKFNKKRLCVELTLNQIQSMMVK---KAPLEKEDFLRVVKERRSGVKAS 664
Query: 114 -----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKK 162
+ G M++R+HE DG Y+H IL + DA + + + + +L R +K K+
Sbjct: 665 EVQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEKQ 723
Query: 163 --GSKPDG--CDDN-----GDAVAGLDMRSSME--------------SPLSWIRADPEME 199
G+ D DD GD + + ++ E WIR D + E
Sbjct: 724 NVGASMDAENLDDTLLYCLGDVLQTPEEQAQWELIDWDPETERKMDQESYEWIRVDADFE 783
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
+ ++ M+++QL++D DVVAQ A+ L P L L L D + F
Sbjct: 784 WSCDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTQGP-LHPIASGFLTRTLVDRRYFHG 842
Query: 260 VRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
+R AA AL A+ ++ GL L+K + + P+PNDF D +Y V AI
Sbjct: 843 IRTMAAEALPRQANIKDLSMLGLRQLMKAFSEMFCHKGTNQPKPNDFSDKRQYNVQCAII 902
Query: 319 HAVAMVRAADN-KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------EL 369
A+A VR A+ K P EA +F+L L +N+N NP+SD F++A LV+++ +
Sbjct: 903 KAIAQVRDANTYKCPLEARQFILDQLLFNNNEDNPFSDHFYIATLVEALATSLIPSKQDD 962
Query: 370 EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
F Q+ + FL +++I+R+L+ D SY + TI+ +
Sbjct: 963 WFAMQNKIPNEEEKQFLEKAIEQIERVLRRDEWTHSYQNVWTIAGL 1008
>gi|429860015|gb|ELA34770.1| transcription initiation factor tfiid 127kd subunit [Colletotrichum
gloeosporioides Nara gc5]
Length = 1551
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 133/508 (26%), Positives = 217/508 (42%), Gaps = 69/508 (13%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVEL-------- 82
++T+ F+ K R L+ F+ +WV GCP + +NK++ VE+
Sbjct: 607 INTESFQRVCEKNA---RNKLEAFWGQWVYGSGCPRFDVFQRFNKKRLCVEMNIRQVQDK 663
Query: 83 AVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-WPGMMSIRVHELDGM-YDHPILPMAGDA 140
A+ T+K + + IG + G M+IR+HE DG Y+H I+ + DA
Sbjct: 664 ALAEAQTLKKTEFLRAIKEKYHGVKTEETIGLFTGPMTIRIHEADGTPYEH-IVEIREDA 722
Query: 141 WQLLEIQC---------HSKLAARRALKPKKGSKPDGCDDN-----GDAVAGL------- 179
+ + + K A L + + PD ++ GD + G
Sbjct: 723 AKAAKFEIPYNTKYKRLKRKRRATEKLSAQSAADPDNNEETLLYCLGDVLTGPEDVRKWE 782
Query: 180 ------DMRSSM-ESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAA 232
D+ +M + WIR D + E+ E+ N M+++QL++D DVVAQ ++
Sbjct: 783 FAEWEPDIEKAMDQESYEWIRMDADFEWACEMKTNLQSYMYVSQLQQDRDVVAQQDSMLY 842
Query: 233 LEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRR 292
L P V L L D + F +R AA L A+++ AG+ HL++ ++
Sbjct: 843 LRKSPPHPL-VSTFLTRTLVDRRYFHGIRTMAAETLHQHATDKVGMAGMTHLLRAFRELF 901
Query: 293 FDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNP 352
DE G PR NDF D +Y V A+P A+A +R + K P A +L+ L N N+ NP
Sbjct: 902 CDEK-GEPRHNDFTDKKQYIVQCALPMAIAKIRDSTGKCPEPAQRLLLKELATNKNDNNP 960
Query: 353 YSDVFWLAALVQSVG---------------ELEFGQQSILFLSSLLKRIDRLLQFDRLMP 397
YS F+ + ++ ++ +L+ L +L IDR + D
Sbjct: 961 YSADFYNSKILDALATSLVPEKRRDSGDQMDLDTSVDERAVLVEVLAEIDRHRRMDEWNS 1020
Query: 398 SYNGILTIS---CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
SY I T + C R L + + I+ Q ++ T VR++A L+DL
Sbjct: 1021 SYQNIYTTTALECWRKLMKAGVIPKSPINFWQYLQ-------DGTYDLVRIKAFECLVDL 1073
Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQV 482
F L L + S + P +R +V
Sbjct: 1074 GFMMEMPVFRLLLSVISTDRSPFVRHKV 1101
>gi|402079196|gb|EJT74461.1| TATA-binding protein associated factor Taf2 [Gaeumannomyces graminis
var. tritici R3-111a-1]
Length = 1592
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 132/532 (24%), Positives = 237/532 (44%), Gaps = 84/532 (15%)
Query: 2 LEKQMGSNFFRKILQNIISRAQG-ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K+ N R+++ II++++ A R +++ FR KV + L+ F+ +WV
Sbjct: 528 LTKRATHNGLRRVIYKIITKSRSYAEESRIVNSDSFRKACEKVAH-NSSSLEPFWQQWVH 586
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSN-----------TDSENRD 109
+ GC M YNK+ + V+L V + S P+ S+ +
Sbjct: 587 SSGCARFEMSQRYNKKHSRVDLTVRQPEDKGSISPLPLQKETFWGALQDRLHVPASKKEN 646
Query: 110 GDIGWPGMMSIRVHELDGM-YDH--PILPMAGDAWQLLEIQCHSKLAARRALKPKK---- 162
+ + G ++I +HE +G Y+H I ++ +E+ ++K + +K +K
Sbjct: 647 VSVWFTGPITIGIHEANGTPYEHVAEIRRTGPKQFKTIEMPYNTKYKRLKRVKRQKERAA 706
Query: 163 --GSKPDGCDDNGDAVAGLDMRSSMESP-----------------------LSWIRADPE 197
+ +G DN + + +++P W+R D +
Sbjct: 707 ATAAHAEGGGDNAEEAVLYSLGDVLQTPEDVELWQFADWTPEQEKSMEHDSFEWLRLDAD 766
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
E++ + N M+++QL++D DVVAQ +I + + + L L D++ F
Sbjct: 767 FEWICSMKTNMQPWMYVSQLQQDRDVVAQQDSILYMSQKAGGEI-LASVLGRTLMDTRYF 825
Query: 258 WRVRIEAAYAL-------------ANT----ASEETDWAGLLHLVKFYKSRRFDENIGLP 300
+R +A AL A+T S++TD GL L+ +KS + P
Sbjct: 826 HGIRTQAVKALQRQTWTKEVETYNADTRVTQKSKKTDLGGLRILMMAFKSFFCYPDTQTP 885
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
RPNDF + +YFV +AIP A+A VR + P EA F+L+ L++N+N N YSD F+++
Sbjct: 886 RPNDFSNKMQYFVQQAIPGAIARVRDK-GRCPPEAQRFLLEQLRFNNNAENEYSDHFYVS 944
Query: 361 ALVQS-----VGELEFG-----------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT 404
+L+ + V + E Q+ FL L I+R L+ D SY I T
Sbjct: 945 SLLDALATSLVPDTESPPLPLRANDPQIQEDKAFLEEALMEIERYLRMDEWTNSYQNIWT 1004
Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
+ + + L +G I + + ++ +D T+ ++++A AL+DL F
Sbjct: 1005 TTALGCKQK--LMKAGVIPI-SAIDFVQYLQD-GTLDLIQIKAFEALVDLGF 1052
>gi|322693301|gb|EFY85166.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
[Metarhizium acridum CQMa 102]
Length = 1457
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 142/519 (27%), Positives = 233/519 (44%), Gaps = 78/519 (15%)
Query: 1 MLEKQM----GSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEF 54
+L+K+M G + +ILQ +++AQ S R+ L T++FR K G+ R L+ F
Sbjct: 542 ILDKRMIKASGGHGLTRILQKFLTKAQIESSDRSTILDTEKFRAVCEK-GSRYR--LEAF 598
Query: 55 FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSN----TDSENRDG 110
+ +WV GCP + +NK++ VEL L + RT L + E R G
Sbjct: 599 WNQWVYGSGCPRFEVKAKFNKKRLCVEL-TLNQIQHQEAKRTGGLEKDDFLRVVKERRAG 657
Query: 111 -DIG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRAL 158
+G + G M++R+HE DG Y+H IL + DA + + + + +L R +
Sbjct: 658 LKLGEVQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRM 716
Query: 159 KPKKGSK-----PDGCDDN-----GDAVAGLD--------------MRSSMESPLSWIRA 194
K K + DD GD + D R + WIR
Sbjct: 717 KEKHSGGGSMEITENADDALLYCLGDVLQTQDDLRDWELIEWDPETERKMDQESYEWIRV 776
Query: 195 DPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDS 254
D + E+ ++ M+++QL++D DVVAQ A+ L P L L L D
Sbjct: 777 DADFEWACDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTHGP-LHPIASGFLVRTLIDR 835
Query: 255 KAFWRVRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFV 313
+ F +R AA AL ++ + GL L++ ++ + P PNDF D +Y V
Sbjct: 836 RYFHGIRTMAAEALPRQSNIKGIPLLGLRQLMRAFREMFCFQQTNQPFPNDFTDKKQYNV 895
Query: 314 LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF-- 371
A+ A++ VR + P EA +F+L L +N+N NPYSD F++A LV+++
Sbjct: 896 RRAVIRAISEVRDHTYRCPLEARQFILDQLLFNNNEDNPYSDHFYIATLVEALATSLIPS 955
Query: 372 -------------GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI---RTLTQIA 415
++ LFL +++I+R+L+ D SY + TI+ + + L +
Sbjct: 956 KQDDWLSRQSKAPDEEERLFLEKAMEQIERVLRRDEWTNSYQNVWTIAGLDAKQRLMKAE 1015
Query: 416 LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+ + Q L+ RD +R++A AL+DL
Sbjct: 1016 VIPKSYPDFAQY--LLDGSRDL-----IRIKAFEALIDL 1047
>gi|296421671|ref|XP_002840388.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636603|emb|CAZ84579.1| unnamed protein product [Tuber melanosporum]
Length = 1269
Score = 133 bits (335), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 128/502 (25%), Positives = 228/502 (45%), Gaps = 65/502 (12%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR-DCT 89
+ST+ F K + + L+ FF +WV G P + +NK++ IVE+ + + +
Sbjct: 537 MSTQHFTRLCEKNSHTK---LESFFQQWVYGSGYPRFEVSQRFNKKRMIVEMGIRQVQSS 593
Query: 90 VKPDSRTPVLSSNTDSENRDGDIG-------WPGMMSIRVHELDGM-YDHPILPMAGDAW 141
P +R + D+ N + I + G M+IR+HE DG Y+H + G +
Sbjct: 594 ETPAARVTAENFMQDAMNHERHIVVGPTRPVYTGPMTIRIHEADGTPYEHVVDIKEG--Y 651
Query: 142 QLLEIQCHSKL-----AARRALKPKKGSKPDGCDDN----------GDAVAGLDM----- 181
L+I ++K + R+ + G+ D D GD + D
Sbjct: 652 TKLDIPYNTKYKRLKRSRRQKERAAAGAGVDVSIDAQNDDVLLYCLGDVLQAEDEVVDWK 711
Query: 182 ----------RSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
+ + ES WIR D + E++ + NQP M+++QL++D DV+AQ +A+
Sbjct: 712 MEDWSKEEEDKMAQES-FEWIRMDADFEWICTLKVNQPDYMFLSQLQQDRDVIAQYEALQ 770
Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR 291
++ + + + L D + +W +R+EAA LA A+ E +W G HL+K ++
Sbjct: 771 YFSSVKESAL-ISSIFVRTLMDRRYYWAIRVEAAMGLAKCATAELNWIGQFHLMKAFQVF 829
Query: 292 RFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGN 351
+P+ NDF D + Y+V +AIP AV+ +R P A ++L +L++NDNN N
Sbjct: 830 FCFPESSVPKSNDFADATAYYVQKAIPTAVSRIRNVSGACPTSAQRWILDVLRWNDNNNN 889
Query: 352 PYSDVFWLAALVQSVGE----LEF---------GQQSILFLSSLLKRIDRLLQFDRLMPS 398
Y D ++A L++++ +F +++I + + I+R L+ D+
Sbjct: 890 NYDDCLYIATLMKALANTITITKFDLNNDSGYEDEEAIQTQNESINEIERYLRIDKWQSV 949
Query: 399 YNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL-EFH 457
Y +L+ + + L LSG D K + T+ +R + L++L H
Sbjct: 950 YQNVLSETAVEILKDWI--LSGIKDKDS--KFFLSYTREGTLDSLRAKCFECLVELGAMH 1005
Query: 458 CNGIDSALSLFIKSVEEEPSLR 479
G L+ + S + P LR
Sbjct: 1006 RPGFVKYLAKVMSS-DPSPYLR 1026
>gi|358387804|gb|EHK25398.1| hypothetical protein TRIVIDRAFT_177367, partial [Trichoderma virens
Gv29-8]
Length = 1429
Score = 132 bits (332), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 127/463 (27%), Positives = 214/463 (46%), Gaps = 61/463 (13%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
L K G + +ILQ ++RAQ G+ L +++FR K L+ F+ +WV
Sbjct: 549 LSKASGGHGLTRILQKHLTRAQIEGSDKFLVLDSEKFRATCEKAAKYR---LESFWNQWV 605
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPD-SRTPVLSSNTD--SENRDGDIG 113
GCP + +NK++ VEL +++ + K + SR L + + ++ + G++
Sbjct: 606 YGSGCPRFDVKAKFNKKRLCVELMLIQVQHQTSKKTELSRDDFLRAIKERRADLKTGEVQ 665
Query: 114 --WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKKGSK 165
+ G M++R+HE DG Y+H IL + DA + + + + +L R +K K +
Sbjct: 666 PLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEKHNAG 724
Query: 166 -----PDGCDDN-----GDAVAGLD--------------MRSSMESPLSWIRADPEMEYL 201
P+ DD GD + + R + WIR D + E+
Sbjct: 725 ASMEIPENTDDALLYCLGDVLQSPEDLRDWELIEWDPETERKMDQESYEWIRVDADFEWA 784
Query: 202 AEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVR 261
++ M+++QL++D DVVAQ A+ L P L L L D + F +R
Sbjct: 785 CSMNRTLEPYMYVSQLQQDRDVVAQQDAMLYLSTGP-LHPIASGFLVRTLVDRRYFHGIR 843
Query: 262 IEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHA 320
AA AL A+ ++ GL L++ ++ + P NDF D +Y V +I A
Sbjct: 844 TMAAQALTRQANIKDIPMLGLRQLMRAFRDMFCYDQTNQPLQNDFSDKRQYNVRCSIIKA 903
Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV--------GELEFG 372
++ VR N P EA +F+L L +N+N NPYSD ++A LV+S+ + F
Sbjct: 904 ISEVRDGTNYCPFEARQFILDQLLFNNNENNPYSDHHYIAMLVESLATSLIPAQADEWFK 963
Query: 373 QQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
+Q + FL +++I+R+L+ D SY I TI+ +
Sbjct: 964 RQVKVPTDEERQFLDEAMEQIERVLRRDEWTNSYQNIWTIAGL 1006
>gi|149236281|ref|XP_001524018.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146452394|gb|EDK46650.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1152
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 118/432 (27%), Positives = 193/432 (44%), Gaps = 53/432 (12%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
G P TLS F+H KV R L FF +W G P + +NK+K ++E+
Sbjct: 162 GELPNGTLSCDHFQHVCEKVN---RNKLDAFFKQWAFGAGAPSFVITQRFNKKKGMLEMT 218
Query: 84 VLRDCTVKPDSRTPVLSSNTDSENRDGDIG----------WPGMMSIRVHELDGM-YDHP 132
+ + + D R L +T + ++ + G M+IRVHE DG Y+H
Sbjct: 219 IRQ--VQQHDRRERKLDKDTYMNDALFELNDEPLLQRQNLFTGPMTIRVHESDGTPYEHI 276
Query: 133 ILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDN------GDAVAG-------- 178
I + +++Q + K+ + DG D GDAV
Sbjct: 277 I--HIKEVKTTIDLQLNKKVRKIKKKD-------DGVDPGIPFNQFGDAVPEEAKKLFAL 327
Query: 179 -----LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
D S P WIR D + E++A I QP M+ +QL D DV AQ AI
Sbjct: 328 QDWERRDEESLATLPFEWIRGDVDFEWIANIEVRQPDYMFGSQLIYDRDVEAQYDAIQYF 387
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTAS-EETDWAGLLHLVKFYKSRR 292
++ L+ L L D + F+ VRI AA ALA+ AS ++T++ G+ +L++ +K
Sbjct: 388 ASVERLNTVYCTALVRTLFDKRYFYGVRIAAAQALASPASNQQTNYLGIKYLIEAFKYLH 447
Query: 293 FDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNP 352
+P+ NDF D + + AIP ++ V ++ K P + + L+KYNDN+ N
Sbjct: 448 CFPGSSIPKSNDFADIQAFLIQTAIPDTLSRVTDSEGKVPTIVRDLLYDLVKYNDNSMNE 507
Query: 353 YSDVFWLAALVQSVGELEFG--------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT 404
+ D +++ L+ S+ ++ F + + + RL + D PS+N IL
Sbjct: 508 FQDSQYVSKLIFSLTRSAISGWRSDMVDEKDKQFANKVFLEVSRLQKLDEWRPSHNRILQ 567
Query: 405 ISCIRTLTQIAL 416
++C + ++ L
Sbjct: 568 VACFKAKMKMVL 579
>gi|358390390|gb|EHK39796.1| hypothetical protein TRIATDRAFT_296783 [Trichoderma atroviride IMI
206040]
Length = 1512
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 128/463 (27%), Positives = 209/463 (45%), Gaps = 65/463 (14%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVR--TLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
L K G + +ILQ ++RAQ S + L +++FR K L+ F+ +WV
Sbjct: 549 LSKASGGHGLTRILQKQLTRAQIESSDKFLILDSEKFRATCEKAAKYR---LESFWNQWV 605
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSN-----TDSENRDGDIG- 113
GCP + +NK++ VEL +++ T + + LS + D IG
Sbjct: 606 YGSGCPRFDVKAKFNKKRLCVELMLIQ--TQQQAIKKTELSKDEFFRAIKERRADLKIGE 663
Query: 114 ----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKKG 163
+ G M+IR+HE DG Y+H IL + DA + + + + +L R +K K
Sbjct: 664 AQPLFTGPMTIRIHEADGTPYEH-ILEIREDAARSTKFEIPYNTKYKRLKRTRRMKEKHN 722
Query: 164 SK-----PDGCDDN-----GDAVAGLD--------------MRSSMESPLSWIRADPEME 199
+ P+ DD GD + + R + WIR D + E
Sbjct: 723 AGASMEIPENTDDALLYCLGDVLQSPEDLRDWELIEWDPETERKMDQESYEWIRVDADFE 782
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
+ ++ M+++QL++D DVVAQ A+ L P L L L D + F
Sbjct: 783 WACSMNRTLEPYMYVSQLQQDRDVVAQQDAMLYLSTGP-LHPIASGFLVRTLVDRRYFHG 841
Query: 260 VRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
+R AA AL A+ ++ GL L++ ++ + P NDF D +Y V +I
Sbjct: 842 IRTMAAKALTRQANIKDIPMLGLRQLMRAFRDMFCYDQTNQPLQNDFSDKRQYNVRCSII 901
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV--------GELE 370
A++ VR N P EA +F+L L +N+N NPYSD +++A LV+++ +
Sbjct: 902 RAISEVRDGTNYCPFEARQFILDQLLFNNNENNPYSDHYYIALLVEALATSLIPAQADEW 961
Query: 371 FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTIS 406
F +Q + FL +++I+R+L+ D SY I TI+
Sbjct: 962 FKRQVKIPTEEERRFLEEAMEQIERVLRRDDWTHSYQNIWTIA 1004
>gi|195169174|ref|XP_002025400.1| GL11884 [Drosophila persimilis]
gi|194108868|gb|EDW30911.1| GL11884 [Drosophila persimilis]
Length = 1217
Score = 131 bits (330), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 114/440 (25%), Positives = 202/440 (45%), Gaps = 54/440 (12%)
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F + V T G + +N+++N +EL + +D N G
Sbjct: 521 FMDQSVRTGGHAKFSLSSVFNRKRNTIELEIRQDFV-----------------NHRGVRK 563
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + +++ ELDG + H + + +I CHSK + K + G
Sbjct: 564 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLST--------G 613
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI AL
Sbjct: 614 EEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 672
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
+ ++ + L A + ++AA++L A++ W+G ++ + R+
Sbjct: 673 QQARQTPRDLPSPTP--LRAIAASIKCGVQAAHSLTKVANQMVASWSGPPAMLNIF--RK 728
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+N+
Sbjct: 729 FFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 788
Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
N Y+D ++ AALV+++GE + Q + LS+ +L + RLL ++ +
Sbjct: 789 RNHYTDAYYRAALVEALGETLTPVVSVSMHGTQITTDSLSTDAKLVLDEMTRLLNMEKSL 848
Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
PSY ++++SC++ + KL F L + + + +R+ A L+D
Sbjct: 849 PSYKYVVSVSCLKVIR----KLQKFGHLPSLPNFYHRYAAYGVFLDLRIAAMECLVDF-V 903
Query: 457 HCNGIDSALSLFIKSVEEEP 476
+G L I +E +P
Sbjct: 904 KVDGRSEDLDHLINLLETDP 923
>gi|395512337|ref|XP_003760397.1| PREDICTED: transcription initiation factor TFIID subunit 2
[Sarcophilus harrisii]
Length = 1071
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 108/379 (28%), Positives = 180/379 (47%), Gaps = 52/379 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L ++ F Q MW QL + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQSDFMWQYQLRYERDVVAQEES 665
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 723
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF+ + +P A+A++R N P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782
Query: 347 DNNGNPYSD--VFWLAALV 363
DN N +FW AA+
Sbjct: 783 DNRKNKQQSLGIFWTAAVT 801
>gi|440475330|gb|ELQ44013.1| TATA-binding protein associated factor Taf2 [Magnaporthe oryzae Y34]
gi|440486210|gb|ELQ66100.1| TATA-binding protein associated factor Taf2 [Magnaporthe oryzae P131]
Length = 1579
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 135/536 (25%), Positives = 241/536 (44%), Gaps = 87/536 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRAQG-ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
+ K+ N R+++ ++SR+ AS VR +S++ FR KV L+ F+ +WV
Sbjct: 519 MTKKASHNGVRRVVLRLVSRSNTEASDVRFISSESFRKQCEKVHKGAP--LELFWQQWVQ 576
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT------------DSENR 108
+ GC M YNK+ + V+L V R K P + N ++
Sbjct: 577 SAGCARFDMSQRYNKKHSRVDLTV-RQPEEKSQLPQPPFNKNNFWGVLQDGRHVPAAKKE 635
Query: 109 DGDIGWPGMMSIRVHELDGM-YDH--PILPMAGDAWQLLEIQCHSKLAA-RRALKPKKGS 164
+ + + G ++I +HE +G Y+H I ++ +E+ ++K +R + K+ +
Sbjct: 636 NVSVWFTGPITIGIHEANGTPYEHVAEIQRTGPRQFKAIEMPYNTKYKRLKRTRRQKERA 695
Query: 165 KP-----DGCDDNGDAVA----------------GL------DMRSSMESPLSWIRADPE 197
+ +G + +AV GL +++ + W+R D +
Sbjct: 696 QALVQAGEGGEQAEEAVLYSLGDVLQSPEEEKEWGLADWTPEQVKAMEQDSFEWLRIDAD 755
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
E++ + N M+++QL++D DVVAQ +++ L + L L D++ +
Sbjct: 756 FEWICSMKTNMQPWMYVSQLQQDRDVVAQQESVLWLSQKAGGEI-LATVLGRTLMDTRYY 814
Query: 258 WRVRIEAAYALAN---TASEE--------------TDWAGLLHLVKFYKSRRFDENIGLP 300
+R +A AL T EE TD GL L+K +K+ + P
Sbjct: 815 HGIRTQAIKALQKQVWTKEEESLDPDSGIVNKLKKTDAGGLRILMKAFKAFFCYPDSQTP 874
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
RPNDF + +Y V AI A+A VR + PREA +F+L+ L++N+N GN YSD F++A
Sbjct: 875 RPNDFSNKMQYAVQRAISEAIARVREK-GRCPREAQQFLLEQLQFNNNTGNEYSDHFYVA 933
Query: 361 ALVQSVGELEFGQQSI-----------------LFLSSLLKRIDRLLQFDRLMPSYNGIL 403
L+ ++ S+ FL + L+ I+R L+ D SY I
Sbjct: 934 GLLDALATSLIPDNSVETVTLRNSDDPETALDEEFLEAALQMIERYLRMDEWTNSYQNIW 993
Query: 404 TISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCN 459
T++ + + L +G I + + ++ +D +T+ + ++A AL+DL F N
Sbjct: 994 TVTALGCKQK--LMKAGVIPIS-AIDFVRYLQD-STLDLISIKAFEALVDLGFLFN 1045
>gi|242799450|ref|XP_002483381.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
[Talaromyces stipitatus ATCC 10500]
gi|218716726|gb|EED16147.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
[Talaromyces stipitatus ATCC 10500]
Length = 1286
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 132/499 (26%), Positives = 211/499 (42%), Gaps = 98/499 (19%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
++ T+ F + G+ + L+ FF +WV GCP +NK+K +VE+ +
Sbjct: 546 SVYTQGFIKLCERFGHCK---LEVFFNQWVFGAGCPRFTATQRFNKKKLVVEIMI----- 597
Query: 90 VKPDSRTPVLSSNTDSENRDGDIG--------------WPGMMSIRVHELDGM-YDHPIL 134
K P N + + D+ + G M+IR+HE DG Y+H +
Sbjct: 598 -KQSQGEPQPPKNLEKSSFLRDMKEEIRSIYAAPVQPIFTGSMTIRIHEADGTPYEHIV- 655
Query: 135 PMAGDAWQLLEIQCHSKLA---------ARRALKPKKGSKPDGCDDN-----GDAV-AGL 179
+A E+ ++K R L S + DD GD +
Sbjct: 656 -EIKEAVTKFEVPYNTKYKRLKRNKRQKERVTLNAGADSTAESQDDVLLYCLGDVLQTEE 714
Query: 180 DM--------------RSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVA 225
DM R ES WIR D + E++ ++ P M+++QL++D DVVA
Sbjct: 715 DMQEWRLSDWSKEEEERMGQES-YEWIRMDADFEWICKMSLVMPGYMYLSQLQQDRDVVA 773
Query: 226 Q---AQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLL 282
Q Q +A + P +S V TL D + F +R AAYAL A EE DW GL
Sbjct: 774 QLESMQYMAIQKPHPLISTIFVRTL----MDKRYFHGIRTAAAYALVKHAKEELDWLGLY 829
Query: 283 HLVKFYKSRRFDENIGLP-----RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVE 337
HL R F E LP R NDF D + Y + + IP A++ +R + K+P +
Sbjct: 830 HL-----ERAFQELFCLPGTHVTRDNDFSDHASYILQKVIPEAISRIRDNNGKTPLRVKQ 884
Query: 338 FVLQLLKYNDNNGNPYSDVFWLAALVQSV--------------GELEFGQQSIL------ 377
F+ L+++DN+ N YSD F++A+L+ S+ ++EF L
Sbjct: 885 FLFDKLRFSDNSNNEYSDNFYIASLMASLVNALEGRIAETPSQDDMEFDMDKELERQAED 944
Query: 378 -FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRD 436
++ IDR + D S+ I + RT L+L+ +D + +
Sbjct: 945 KLEQDVIAEIDRYRRMDEWSSSFQNIYS----RTALHCQLQLTKARIMDLDLIHFLQYTR 1000
Query: 437 FNTIWQVRVEASRALLDLE 455
T +R++A L++L+
Sbjct: 1001 AGTFDMLRIDAFECLVELD 1019
>gi|378754734|gb|EHY64763.1| hypothetical protein NERG_02166 [Nematocida sp. 1 ERTm2]
Length = 867
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 117/500 (23%), Positives = 215/500 (43%), Gaps = 72/500 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
+LE + F +KI++ ++ RT++T++ + + ++ F +V
Sbjct: 356 ILENNLTRAFMQKIMKEVLE-------ARTMNTQDLIRVIKGITGKD---VRALFDAYVY 405
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
G P + N R L VL S + D + G + +
Sbjct: 406 RTGIPTVTAQVEQNSRAGGFSL---------------VLRQKIHSVHPDANRHMAGNICV 450
Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
RV+E D + DH + G A E+ C S+ RR K + S
Sbjct: 451 RVYETDSVLDHILF--LGSAPITHELVC-SQRNHRRKQKEENVS---------------- 491
Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
L W+R DP +E++ Q M+ QL + DV Q +A+ ++ P S
Sbjct: 492 --------LLWVRIDPGIEWMKVAAVEQADYMFAEQLVSEKDVYGQMEALVGVQKNP--S 541
Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
+ + L + D F++V I A LA + +EE+ + G +V+F+ + +N +
Sbjct: 542 ETICSILERVMGDPGVFYKVSIAAGILLAKSVNEESGYFGFQRVVQFFINTYCIQNTTIV 601
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKS-------PREAVE-FVLQLLKYNDNNGNP 352
R NDF YF+ + I ++++ + KS + V F+L L++YNDN GN
Sbjct: 602 RTNDFSQHRMYFMQKNIASSMSLCQLDSTKSLGGRAIRAKNVVSAFLLNLMRYNDNTGNS 661
Query: 353 YSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLT 412
+ D F+LA ++ ++ + + L + + ++RL Q D L PS+ I+T + I+
Sbjct: 662 FEDAFYLADIITALA-VALCSDAHLDPAPFIVEMERLRQKDLLFPSHQNIVTCAAIKAFA 720
Query: 413 QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSV 472
++ ++ +S +++ + P ++VR+ A L+ L H + I + L LF V
Sbjct: 721 RLGVQGYVQVSFKGMLQYVAP----GNFYKVRMAAYECLILL--HADKIGTILDLF---V 771
Query: 473 EEEPSLRGQVKLGIHAMRIC 492
EE +R ++ I +C
Sbjct: 772 NEERPVRIKILRTIRDSAVC 791
>gi|290993096|ref|XP_002679169.1| predicted protein [Naegleria gruberi]
gi|284092785|gb|EFC46425.1| predicted protein [Naegleria gruberi]
Length = 1957
Score = 130 bits (328), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 119/470 (25%), Positives = 217/470 (46%), Gaps = 66/470 (14%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFR-------HFANKVGNLERPFLKE 53
M+++Q+ ++ R +L + A TL +K + F +G LKE
Sbjct: 564 MIDRQISTSNMRNLLNKFLKNAH------TLDSKRYNDRLISSSSFIKTIGKTFATDLKE 617
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAV---LRDCTVKPDSRTPVLSSNTDSENRDG 110
F W+ G P + F Y+ ++ V+L V L ++P
Sbjct: 618 FSSYWIYGTGIPQFAISFKYDDMRSCVDLKVKQILPFPEIQP------------------ 659
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKG-SKPDGC 169
+ G +++R+ E +G + I + + Q E +SK + K + G +P
Sbjct: 660 ---FRGKVTVRILESEGAPNDFIANLDKEENQF-EFHINSKPIKKHHKKVQGGQGQP--- 712
Query: 170 DDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
+AV + PL WIR DPE +++ F+Q +MW QL DVVAQ ++
Sbjct: 713 ---TEAVRVI-------IPLKWIRFDPEQDWIKVQSFSQTQEMWNYQLNDVKDVVAQYES 762
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYK 289
I AL V+ LN L D+ ++++R+ AA+AL+ L L+ F+K
Sbjct: 763 IKALTFFADTE-PVITILNRILRDNSYYYQIRVCAAHALSKIPQPIEKKEALNVLIGFFK 821
Query: 290 SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAV----EFVLQLLKY 345
++ +DE + +PN+F++F++YF+ + IP +A+ D+K E +L+L+KY
Sbjct: 822 TQFYDEQLTYLKPNNFKNFADYFLKKEIP--LALTNFKDDKVNNETFPEVVLLLLELIKY 879
Query: 346 NDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDR-LMPSYNGILT 404
ND++ N +S F++A L++S+ L + F + K + + L D L+PSY+ I T
Sbjct: 880 NDDSENLFSGNFYIANLIKSISLLNTSNPN--FNQKIEKILYKYLVIDSDLIPSYHYIKT 937
Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
++ + ++ + I+LD + + DF VR + R ++++
Sbjct: 938 VTSLESIANLQSSGKSKINLDVFYR----YLDFKKSDHVRFSSWRCVMNI 983
>gi|320581027|gb|EFW95249.1| transcription initiation factor TFIID subunit [Ogataea parapolymorpha
DL-1]
Length = 1543
Score = 129 bits (325), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 144/553 (26%), Positives = 240/553 (43%), Gaps = 119/553 (21%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---- 86
LST F H KV + + L+ FF W+ +CG P+ ++ +NK++ +E+++ +
Sbjct: 623 LSTAHFHHVCEKVAHHK---LESFFQNWIHSCGVPIFQVTQRFNKKRMFIEMSIRQIQKN 679
Query: 87 -----------DCTVKPDSRTP------VLSSN---TDSENRDGDIGWPGMMSIRVHELD 126
D K R V +N ++ E + G ++IR+HE D
Sbjct: 680 NKLETNDELQMDINSKETLRKQHQREHFVDEANKFLSEEETFSAPNVFTGPITIRIHEAD 739
Query: 127 GM-YDHPILPMAGDAWQLLEIQCHSKL--AARRA----------------------LKPK 161
G Y+H + D L+IQ ++K + RR K +
Sbjct: 740 GTPYEHIV--DIKDPVTKLDIQYNTKYRRSKRRKEEDQEEAERKEREREKRERDKLRKER 797
Query: 162 KGSKPDGCDDN------GDAVA--------GLDMRSSME--------SPLSWIRADPEME 199
+ K G DD GD G + +E W+R D + E
Sbjct: 798 EKDKLRGDDDEPTVKKLGDVFMKPKDLEDWGFTEKQKIEENDYEILGDAFEWLRVDADFE 857
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
++ +++ N M+ +QL +D DV AQ +++ H S L + D + F+
Sbjct: 858 WICKVYINLNENMFESQLRQDRDVEAQLESVKFFSESLHPSIYFATVLMRTILDKRYFYG 917
Query: 260 VRIEAAYALANTASEETDWAGLLHLVKFYK---------SRRF---DENIGLPRPNDFRD 307
VR EAA LA + E+ D G+ L+K YK S+ + D N LP PNDF +
Sbjct: 918 VRAEAALGLAKLSKEDNDHLGMRFLLKAYKYFYCYNNTISKGYPELDPNEYLPLPNDFSE 977
Query: 308 FSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
FS+YFV++AI ++ V + SP E + +L +LK+NDNN N + D F+LA + +
Sbjct: 978 FSDYFVMKAIVTGLSTVTNKNGDSPIELKKIMLNILKFNDNNNNYFDDSFYLANFITCLT 1037
Query: 368 EL----------------EFGQQSIL-----FLSSLLKRIDRLLQFDRLMPSYNGILTIS 406
L E Q+++ FL + I+R + D+ PSY+ I+T++
Sbjct: 1038 NLIVSSNKKIPNHNESHVEMDQRNLTDPTEKFLIESIVEINRAAKMDQWSPSYHHIVTVT 1097
Query: 407 CIRTLTQIALKLSGFISLDQV--VKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSA 464
++ ++ L L G + L + + +PF D + +R+ A +LL L N
Sbjct: 1098 MLQ--EKVRLALLGLVDLSYIDLIPYTRPFYDSD----IRLRAFESLLLLGGLRNS--HV 1149
Query: 465 LSLFIKSVEEEPS 477
LSLF +++ E S
Sbjct: 1150 LSLFFTTLKLETS 1162
>gi|388580227|gb|EIM20543.1| hypothetical protein WALSEDRAFT_60853 [Wallemia sebi CBS 633.66]
Length = 1762
Score = 129 bits (323), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 83/265 (31%), Positives = 139/265 (52%), Gaps = 40/265 (15%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR D + E+++ I F QP MW++QL++D DVVAQ +A+ AL+++P S V L
Sbjct: 709 WIRVDADFEWISMITFEQPDFMWVSQLQRDRDVVAQLEAVHALKSIP--SKLVSTQLTRT 766
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR-------RFDENIG---LP 300
+ + F+R+R+EAA L A+ E D+ G+ HL K ++SR + G +P
Sbjct: 767 VLVTNYFFRIRMEAAMVLHTCATPELDYIGMFHLFKIFQSRYCYPPPQEIMDPFGFKCIP 826
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
N+F D SEYFV +A+ ++ +R ++S EF++ L+YNDN+ N YSD ++L+
Sbjct: 827 LANNFDDLSEYFVRKAVIVSLTSIRDQSSRSMLNIREFLVDQLRYNDNSSNRYSDSYYLS 886
Query: 361 --------ALVQSV---------------GELEFGQQSILFLSSLLKRIDRLLQFDRLMP 397
ALV +V + FG ++ ++R DRL+P
Sbjct: 887 GIISCLAIALVPTVQHEASVLQQDEDPDANDHPFGPH---VRRQAIEEVERYRTMDRLVP 943
Query: 398 SYNGILTISCIRTLTQIALKLSGFI 422
SY +++++ + + L ++G I
Sbjct: 944 SYQNVISVATLE--FNVKLMIAGII 966
Score = 47.8 bits (112), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 54/104 (51%), Gaps = 12/104 (11%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
+ST F KV ++ L++F +W+ G P +G ++N++K VE+ V ++
Sbjct: 524 ISTNGFLRTCRKVAGVD---LRQFADQWIFGSGTPRFTIGANFNRKKMCVEMTVKQESPA 580
Query: 91 KPDSRTPVLSSNTDS--ENRDGDIGWPGMMSIRVHELDGM-YDH 131
D+ TP NT NR I + G M++R+HE DG Y+H
Sbjct: 581 Y-DAHTP----NTPEMHYNRPTQI-FEGQMTVRIHEADGTPYEH 618
>gi|198465660|ref|XP_002135019.1| GA23812 [Drosophila pseudoobscura pseudoobscura]
gi|198150260|gb|EDY73646.1| GA23812 [Drosophila pseudoobscura pseudoobscura]
Length = 1153
Score = 129 bits (323), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 159/319 (49%), Gaps = 36/319 (11%)
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F + V T G + +N+++N +EL + +D N G
Sbjct: 521 FMDQSVRTGGHAKFSLSSVFNRKRNTIELEIRQDFV-----------------NHRGVRK 563
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + +++ ELDG + H + + +I CHSK + K + G
Sbjct: 564 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLST--------G 613
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V +D+ + +SP+ WIR DPEM L ++ QP W QL + DV AQ QAI AL
Sbjct: 614 EEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 672
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
+ P + + L + + + F++VR++AA++L A++ W+G ++ + R+
Sbjct: 673 QHYPTNATRLA--LTDTIESDRCFYKVRVQAAHSLTKVANQMVASWSGPPAMLNIF--RK 728
Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
F + P + N+F +F YF+ +AIP A+A +R + P E + F+ L KYN+N+
Sbjct: 729 FFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 788
Query: 350 GNPYSDVFWLAALVQSVGE 368
N Y+D ++ AALV+++GE
Sbjct: 789 RNHYTDAYYRAALVEALGE 807
>gi|444732446|gb|ELW72741.1| Transcription initiation factor TFIID subunit 2 [Tupaia chinensis]
Length = 1407
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 109/396 (27%), Positives = 179/396 (45%), Gaps = 92/396 (23%)
Query: 70 GFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMY 129
F++N+++N++EL + +D T +P G + G + + V ELDG +
Sbjct: 737 SFAFNRKRNVLELEIKQDYT------SP------------GTQKYVGPLKVTVQELDGSF 778
Query: 130 DHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRS-SMESP 188
+H L + ++ + +I CHSK + K NG+ V +D+ + +SP
Sbjct: 779 NH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM--------NGEEV-DMDLSAMDADSP 827
Query: 189 LSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLN 248
L WIR DP+M L ++ F Q MW QL + DVVAQ ++I ALE P + + L
Sbjct: 828 LLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQESILALEKFPTPASRLA--LT 885
Query: 249 NFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDF 308
+ L + F+RVR+ A + LA T
Sbjct: 886 DILEQEQCFYRVRMSACFCLAKT------------------------------------- 908
Query: 309 SEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA----ALVQ 364
+P A+A++R N P+E + F+L L+KYNDN N +SD ++ A AL
Sbjct: 909 --------MPVAMALLRDVHNLCPKEVLTFILDLIKYNDNRKNKFSDNYYRAEMIDALAN 960
Query: 365 SVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALK 417
SV + L +L L+ I R L ++L+PSY +T+SC+R + L+
Sbjct: 961 SVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLLPSYRHTITVSCLRAIR--VLQ 1018
Query: 418 LSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+G + D L K + ++ +R+ A A++D
Sbjct: 1019 KNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 1052
>gi|256274023|gb|EEU08938.1| Taf2p [Saccharomyces cerevisiae JAY291]
Length = 1407
Score = 125 bits (315), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 105/402 (26%), Positives = 190/402 (47%), Gaps = 58/402 (14%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 630 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686
Query: 72 SYNKRKNIVELAVLRDCTVKPDSRTPVLSS---------NTDSENRDGDIGWPGMMSIRV 122
+N+++ ++EL + R + VL + + + + + G M+IR+
Sbjct: 687 RFNRKRMVIELGI-RQVQDEELGHEKVLGEEGFFKSALDHLEHPDLNRTECFTGSMTIRI 745
Query: 123 HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGC 169
HE DG Y+H + D + ++IQ ++K RR K G+ KP
Sbjct: 746 HEHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVV 801
Query: 170 DDN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPV 210
D N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 802 DVNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPD 861
Query: 211 QMWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEA 264
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 862 YMFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAA 919
Query: 265 AYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMV 324
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V
Sbjct: 920 CEALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKV 979
Query: 325 RAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
+ + K P+ +F+L +L YN+N N YSD ++ +L+++V
Sbjct: 980 KNENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021
>gi|387592660|gb|EIJ87684.1| hypothetical protein NEQG_02231 [Nematocida parisii ERTm3]
gi|387595289|gb|EIJ92914.1| hypothetical protein NEPG_02313 [Nematocida parisii ERTm1]
Length = 1348
Score = 125 bits (314), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 109/485 (22%), Positives = 209/485 (43%), Gaps = 71/485 (14%)
Query: 8 SNFFRKILQNIISRA------QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGT 61
S F KIL+N ++RA + RT+ T++ + + ++ F +V
Sbjct: 350 SGAFIKILENNLTRAFMQKIMKEVLEARTIGTQDLIRVVKGITGKD---VRMLFEAYVYR 406
Query: 62 CGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIR 121
G P + N R + +L S + + + G + +R
Sbjct: 407 TGIPTVIAQVEQNSRAGGFSI---------------ILKQKIHSVHPEANRHITGNICVR 451
Query: 122 VHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDM 181
V+E D + DH + G E+ C+ + R+ +KG +
Sbjct: 452 VYETDSVLDHVLF--IGSTPITHELACNQRSQRRK----QKGEES--------------- 490
Query: 182 RSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSF 241
+ L WIR DP +E++ Q M+ QL + DV Q +A+A ++ P S
Sbjct: 491 -----ASLLWIRIDPGLEWMKVSVVEQADYMFAEQLVSEKDVYGQMEALAGVQKNP--SE 543
Query: 242 NVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPR 301
+ L + D++ F++V + A LA + +EE+ + G +V+++ + +N + +
Sbjct: 544 TICGILERVMGDAQMFYKVSVAAGILLAKSVNEESGYFGFQRVVQYFINSHCIQNTTIVK 603
Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKS-------PREAVE-FVLQLLKYNDNNGNPY 353
NDF + YF+ + I ++++ + KS + V F+L L++YNDN GN +
Sbjct: 604 TNDFSQYRTYFMQKNIASSMSLCQLDSTKSLSGRIIRAKNVVSAFLLNLMRYNDNTGNQF 663
Query: 354 SDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQ 413
D F++A ++ S+ + + L + I+RL + D L PS+ I+T + I+ T+
Sbjct: 664 EDSFYIADIITSLS-IALCSDAYLDTGPFVMEIERLRKKDLLFPSHQNIVTCATIKAFTR 722
Query: 414 IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVE 473
++++ +S +++ P ++VR+ A +L+ L L L I+ E
Sbjct: 723 LSIQGCITVSFKGILRYTLP----ENFYKVRMAAYESLILLH------PDQLKLIIEMAE 772
Query: 474 EEPSL 478
E L
Sbjct: 773 SEERL 777
>gi|190406471|gb|EDV09738.1| transcription initiation factor TFIID subunit 2 [Saccharomyces
cerevisiae RM11-1a]
Length = 1407
Score = 125 bits (314), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 630 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686
Query: 72 SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
+N+++ ++EL + + + V + + + + + + G M+IR+H
Sbjct: 687 RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 746
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
E DG Y+H + D + ++IQ ++K RR K G+ KP D
Sbjct: 747 EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 802
Query: 171 DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 803 VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 862
Query: 212 MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 863 MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 920
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V+
Sbjct: 921 EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 980
Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
+ K P+ +F+L +L YN+N N YSD ++ +L+++V
Sbjct: 981 NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021
>gi|207347283|gb|EDZ73510.1| YCR042Cp-like protein [Saccharomyces cerevisiae AWRI1631]
Length = 1407
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 630 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686
Query: 72 SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
+N+++ ++EL + + + V + + + + + + G M+IR+H
Sbjct: 687 RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 746
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
E DG Y+H + D + ++IQ ++K RR K G+ KP D
Sbjct: 747 EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 802
Query: 171 DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 803 VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 862
Query: 212 MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 863 MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 920
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V+
Sbjct: 921 EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 980
Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
+ K P+ +F+L +L YN+N N YSD ++ +L+++V
Sbjct: 981 NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021
>gi|228348|prf||1803221A TSM1 gene
Length = 1410
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/408 (25%), Positives = 193/408 (47%), Gaps = 56/408 (13%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 633 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 689
Query: 72 SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
+N+++ ++EL + + + V + + + + + + G M+IR+H
Sbjct: 690 RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 749
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
E DG Y+H + D + ++IQ ++K RR K G+ KP D
Sbjct: 750 EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 805
Query: 171 DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 806 VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 865
Query: 212 MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 866 MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 923
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V+
Sbjct: 924 EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 983
Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQ 373
+ K P+ +F+L +L YN+N N YSD ++ +L+++V ++ +
Sbjct: 984 NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENVVKVALNE 1031
>gi|173056|gb|AAA35179.1| TSM1 [Saccharomyces cerevisiae]
Length = 1407
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 630 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686
Query: 72 SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
+N+++ ++EL + + + V + + + + + + G M+IR+H
Sbjct: 687 RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 746
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
E DG Y+H + D + ++IQ ++K RR K G+ KP D
Sbjct: 747 EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 802
Query: 171 DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 803 VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 862
Query: 212 MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 863 MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 920
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V+
Sbjct: 921 EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 980
Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
+ K P+ +F+L +L YN+N N YSD ++ +L+++V
Sbjct: 981 NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021
>gi|259144983|emb|CAY78248.1| Taf2p [Saccharomyces cerevisiae EC1118]
Length = 1407
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 105/402 (26%), Positives = 189/402 (47%), Gaps = 58/402 (14%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 630 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686
Query: 72 SYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWP---------GMMSIRV 122
+N+++ ++EL + R + V+ ++ + P G M+IR+
Sbjct: 687 RFNRKRMVIELGI-RQVQDEELGHEKVVGEEGFFKSELDHLEHPDLNRTECFTGSMTIRI 745
Query: 123 HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGC 169
HE DG Y+H + D + ++IQ ++K RR K G+ KP
Sbjct: 746 HEHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVV 801
Query: 170 DDN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPV 210
D N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 802 DVNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPD 861
Query: 211 QMWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEA 264
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 862 YMFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAA 919
Query: 265 AYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMV 324
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V
Sbjct: 920 CEALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKV 979
Query: 325 RAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
+ + K P+ +F+L +L YN+N N YSD ++ +L+++V
Sbjct: 980 KNENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021
>gi|323349526|gb|EGA83748.1| Taf2p [Saccharomyces cerevisiae Lalvin QA23]
Length = 1406
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 629 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 685
Query: 72 SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
+N+++ ++EL + + + V + + + + + + G M+IR+H
Sbjct: 686 RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSXLDHLEHPDLNRTECFTGSMTIRIH 745
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
E DG Y+H + D + ++IQ ++K RR K G+ KP D
Sbjct: 746 EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 801
Query: 171 DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 802 VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 861
Query: 212 MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 862 MFSSQLXQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 919
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V+
Sbjct: 920 EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 979
Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
+ K P+ +F+L +L YN+N N YSD ++ +L+++V
Sbjct: 980 NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1020
>gi|6319890|ref|NP_009971.1| Taf2p [Saccharomyces cerevisiae S288c]
gi|136453|sp|P23255.3|TAF2_YEAST RecName: Full=Transcription initiation factor TFIID subunit 2;
AltName: Full=TAFII-150; AltName: Full=TBP-associated
factor 150 kDa; AltName: Full=TBP-associated factor 2;
AltName: Full=TSM-1
gi|3904|emb|CAA45337.1| YCR724 [Saccharomyces cerevisiae]
gi|1907184|emb|CAA42290.1| component of TFIID complex [Saccharomyces cerevisiae]
gi|285810736|tpg|DAA07520.1| TPA: Taf2p [Saccharomyces cerevisiae S288c]
Length = 1407
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 630 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686
Query: 72 SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
+N+++ ++EL + + + V + + + + + + G M+IR+H
Sbjct: 687 RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 746
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
E DG Y+H + D + ++IQ ++K RR K G+ KP D
Sbjct: 747 EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 802
Query: 171 DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 803 VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 862
Query: 212 MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 863 MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 920
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V+
Sbjct: 921 EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 980
Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
+ K P+ +F+L +L YN+N N YSD ++ +L+++V
Sbjct: 981 NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021
>gi|392300827|gb|EIW11917.1| Taf2p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 1369
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 630 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686
Query: 72 SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
+N+++ ++EL + + + V + + + + + + G M+IR+H
Sbjct: 687 RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 746
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
E DG Y+H + D + ++IQ ++K RR K G+ KP D
Sbjct: 747 EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 802
Query: 171 DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 803 VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 862
Query: 212 MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 863 MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 920
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V+
Sbjct: 921 EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 980
Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
+ K P+ +F+L +L YN+N N YSD ++ +L+++V
Sbjct: 981 NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021
>gi|151943862|gb|EDN62162.1| TATA binding protein-associated factor [Saccharomyces cerevisiae
YJM789]
Length = 1405
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 628 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 684
Query: 72 SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
+N+++ ++EL + + + V + + + + + + G M+IR+H
Sbjct: 685 RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 744
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
E DG Y+H + D + ++IQ ++K RR K G+ KP D
Sbjct: 745 EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 800
Query: 171 DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 801 VNCLGNVYMSPEECSRFNLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 860
Query: 212 MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 861 MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 918
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V+
Sbjct: 919 EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 978
Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
+ K P+ +F+L +L YN+N N YSD ++ +L+++V
Sbjct: 979 NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1019
>gi|349576782|dbj|GAA21952.1| K7_Taf2p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 1405
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)
Query: 13 KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
++L I +A G P +L++ F+H +V + L+ FF WV G P+LR+
Sbjct: 628 RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 684
Query: 72 SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
+N+++ ++EL + + + V + + + + + + G M+IR+H
Sbjct: 685 RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 744
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
E DG Y+H + D + ++IQ ++K RR K G+ KP D
Sbjct: 745 EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 800
Query: 171 DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
N G+ + R+S + L WIR D ++E++ ++H NQP
Sbjct: 801 VNCLGNVYMSPEECSRFNLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 860
Query: 212 MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
M+ +QL +DGD+ AQ +AI E + L ++ + L D + F+ +R+ A
Sbjct: 861 MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 918
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
AL+ + G+ HL++ ++ E+ +P+ N+F + YF+ IP +A V+
Sbjct: 919 EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 978
Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
+ K P+ +F+L +L YN+N N YSD ++ +L+++V
Sbjct: 979 NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1019
>gi|157138539|ref|XP_001664244.1| hypothetical protein AaeL_AAEL014028 [Aedes aegypti]
gi|108869468|gb|EAT33693.1| AAEL014028-PA [Aedes aegypti]
Length = 619
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 92/307 (29%), Positives = 148/307 (48%), Gaps = 30/307 (9%)
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE--ALPHLSFNVVNTLNNFLSDSK 255
M L +H QP W QL + DV AQ +AI ALE A P + +T+ N +
Sbjct: 1 MTILRSVHIEQPDFQWQFQLRHERDVTAQLEAIDALERYASPATRLALTDTIEN----EQ 56
Query: 256 AFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKSRRFDENIGLP---RPNDFRDFSEY 311
F+ VR +AA L A+ T W G ++ +K +F + P R N+F +F Y
Sbjct: 57 VFYEVRCKAALCLTKVATAMVTSWQGPPAMLTIFK--KFFGSFSAPHIIRQNNFTNFQHY 114
Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE--- 368
F+ + IP A+A +R A P E + F+L L KYNDN N YSD ++ AALV+++G
Sbjct: 115 FLQKTIPVAMAGLRTAHGICPPEVIRFLLDLFKYNDNTKNHYSDNYYRAALVEALGNSIT 174
Query: 369 --LEFGQQSILFLSS--------LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
+ Q + S +L+ + R+L ++ +PSY ++I+C++ + + L+
Sbjct: 175 PVISVVHQGMALTSENLSADSKLVLEEVTRILNLEKHLPSYKYTVSIACLKVIRK--LQK 232
Query: 419 SGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSL 478
G + + K+ + + + VRV A L+D +G L I +E +P
Sbjct: 233 CGHLPTNP--KIYRSYAAYGQYIDVRVAAMECLVDF-VKIDGRWEDLEHLIDMLETDPDP 289
Query: 479 RGQVKLG 485
+ KLG
Sbjct: 290 MARHKLG 296
>gi|241095546|ref|XP_002409520.1| transcription initiation factor TFII-D subunit, putative [Ixodes
scapularis]
gi|215492763|gb|EEC02404.1| transcription initiation factor TFII-D subunit, putative [Ixodes
scapularis]
Length = 1137
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 127/509 (24%), Positives = 217/509 (42%), Gaps = 102/509 (20%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
MLE ++G ++ ++S A A+ + LST F+ V + +
Sbjct: 352 MLEDRIGRELLLQVFNKLLSLASSAAQQKVTSNMWHNMLLSTNSFQKAIFIVTGKD---I 408
Query: 52 KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
K F +WV G F +N+++N VEL + TD+ + G
Sbjct: 409 KTFLEQWVRQGGHAKFHGTFVFNRKRNTVELEI----------------KQTDTHSL-GI 451
Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
+ G +++ + ELDG + H L + +A + +I CHSK + K +
Sbjct: 452 KKYVGPLTVTIQELDGTFKH-NLQIEENATKH-DITCHSKSRRNKKKKIPLCT------- 502
Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
G+ V +D+ + +SP+ WIR DP+M L ++ F QP W QL + DV +Q
Sbjct: 503 -GEEV-DMDLTAMDADSPVLWIRIDPDMLVLRQVVFEQPDYQWQYQLRYERDVTSQ---- 556
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS 290
IE A + +T WAG ++ +K
Sbjct: 557 -------------------------------IEVANHMVST------WAGPPAMMTIFKK 579
Query: 291 RRFDENI-GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
+ + R N+F +F YF+ +AIP A+A +R P E ++F+ L KYN+N+
Sbjct: 580 MFGSHSCPNIIRQNNFSNFQHYFLQKAIPVAMAGLRTIHGICPPEVMKFLFDLFKYNENS 639
Query: 350 GNPYSDVFWLAALVQSVGELEFGQQSILF----------LSS----LLKRIDRLLQFDRL 395
N +SD ++ AL++++GE S+L LS+ +L+ I R L ++L
Sbjct: 640 KNKFSDNYYRTALIEALGETVTPVVSVLAHNGQPITAESLSADTKLILEEITRCLNLEKL 699
Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
+P Y +T+ C++ + KL L + L + + + VR+ A AL+D
Sbjct: 700 LPCYKFTITVGCLKAIR----KLQKMGHLPNISALFREYAQYGLFLDVRLAALEALVDYT 755
Query: 456 FHCNGIDSALSLFIKSVEEEPSLRGQVKL 484
+G LS + VE +P R ++ L
Sbjct: 756 -TVDGNADDLSFLLDIVENDPVPRVRLAL 783
>gi|353229094|emb|CCD75265.1| Tata binding protein associated factor (M01 family) [Schistosoma
mansoni]
Length = 1192
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 115/445 (25%), Positives = 203/445 (45%), Gaps = 62/445 (13%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST FR + V + ++ F +W G + F +N+++N+VEL + +D
Sbjct: 563 LSTASFRRIISMVTGQD---IRNFLNQWACHAGHVRIFAKFHFNRKRNVVELELKQDLQS 619
Query: 91 KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHS 150
+ G + + G +++ + ELDG + H G + ++ CHS
Sbjct: 620 R------------------GTLNYTGPITVMLQELDGAFIHTFKLEEGRMSR--DLPCHS 659
Query: 151 KLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPV 210
K R K NGD V R ESPL W+R DP++ + +IH +QP
Sbjct: 660 KSRKHRKKKISLA--------NGDEVDMDLARIDPESPLLWLRIDPDLAIIHDIHVDQPD 711
Query: 211 QMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN 270
MW L D D + Q +A+ AL+ S + L+N + + + F+ VR+EA + L +
Sbjct: 712 FMWHLMLAHDRDCLGQLEAVNALKDFA--SPETRHVLSNIILNERIFYHVRMEACFTLCH 769
Query: 271 ------------TASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
T + T + LL + F++ GL R N+F + YF+ A+
Sbjct: 770 VVNELSTLSNNPTVASATVTSCLLPM--FWQLFSSPAARGLIRQNNFTNLQFYFLQRAMV 827
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL---EFGQQS 375
A++ +R P + + F+ LL++NDN+ NPYSD +++A LV+++ + +
Sbjct: 828 KALSTLR-VQQVCPHDILIFISDLLRHNDNSRNPYSDSYYVADLVKAMKDTLTPAIIVRG 886
Query: 376 ILFLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVV 428
++ S+L ++ + LL + SY G +T+ C+ + + L+ GF+ +D
Sbjct: 887 VMSASTLPFEARTVIEEVSHLLNLESETISYKGTVTVVCLAAIRR--LQRLGFLPIDP-- 942
Query: 429 KLIKPFRDFNTIWQVRVEASRALLD 453
L + N +RV A A++D
Sbjct: 943 SLFYQYALSNFYCDIRVAAIEAVVD 967
>gi|328772360|gb|EGF82398.1| hypothetical protein BATDEDRAFT_86202 [Batrachochytrium
dendrobatidis JAM81]
Length = 1595
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/427 (26%), Positives = 174/427 (40%), Gaps = 82/427 (19%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKV-GNLERPFLKEFFPRWV 59
MLEK+MG N +KI ++ A LST F K+ G LE KEF +W+
Sbjct: 519 MLEKRMGKNLLQKIANKLMVSAMSGELPTGLSTTHFLKMVRKISGKLE---TKEFADQWI 575
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
GCP+ + + +N++K ++EL V +DC S V+ +N + G
Sbjct: 576 FRSGCPIFTIRYHFNRKKMVIELRVKQDC-----SNVDVIGANQR---------FSGAFM 621
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK-------------- 165
+RV E G +D + + + +I H+K R KK K
Sbjct: 622 VRVQEPGGTFDTEV--RIEEKTKQYDIIYHTKYKRIRRRAGKKYKKVADNDDDDDDEDVY 679
Query: 166 ------------PDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMW 213
D N D ++ WIR DP+ ++ F Q MW
Sbjct: 680 YMEEDEKANDAIADAEAQNQDTNGNFNIIEPDRITFEWIRLDPDNVWICTKTFEQEDFMW 739
Query: 214 INQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTAS 273
L K+ D+ AQ++AI AL + L+ +NT+ F+ D F+ +R+ A L+ S
Sbjct: 740 NALLRKEKDIGAQSEAIVALSKIQSLA--SLNTITTFVRDHSHFYHLRLLAVSVLSKFDS 797
Query: 274 EETDWAGLLHLVKFYKS---RRFDENIGLPRPNDFRDFSEY-FVLEAIPHAVAMVRAADN 329
+E L +L+K Y+ +D I +P+ N+F +Y LE + V A
Sbjct: 798 DELYSTALGNLIKIYRDIFCISYDGPI-IPKKNNFSSLQDYKLRLEIVRSLAKFVDARGA 856
Query: 330 KSPREAVEFVLQLLKYNDNNGN----------------------------PYSDVFWLAA 361
P + FV LLKYNDN+GN YSD W++
Sbjct: 857 TLPVVRMMFV-NLLKYNDNSGNEVKIAACTEYILMSVFFNYHYMLYRGQFKYSDGEWVSL 915
Query: 362 LVQSVGE 368
LV+ +G+
Sbjct: 916 LVKLIGD 922
>gi|258575701|ref|XP_002542032.1| predicted protein [Uncinocarpus reesii 1704]
gi|237902298|gb|EEP76699.1| predicted protein [Uncinocarpus reesii 1704]
Length = 667
Score = 122 bits (307), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 115/413 (27%), Positives = 184/413 (44%), Gaps = 88/413 (21%)
Query: 116 GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGSKPDGCDDN 172
G M+IR+HE DG Y+H + +A +I ++K R + ++ + G D N
Sbjct: 42 GSMTIRIHEADGTPYEHIV--EIKEAVTKFDIPYNTKYKRLKRNKRQKERAAASSGFDPN 99
Query: 173 ------------GDAVAGLDM---------------RSSMESPLSWIRADPEMEYLAEIH 205
GD + + R ES WIR D + E++ ++
Sbjct: 100 TEIQDDVLLYCLGDVLQSEEEMRKWRLVEWTKEDEDRMGQES-YEWIRMDADFEWICKMS 158
Query: 206 FNQPVQMWINQLEKDGDVVAQAQAIAALEAL---PHLSFNVVNTLNNFLSDSKAFWRVRI 262
P M+++QL++D DVVAQ ++I + A P +S V TL D + F+ +R
Sbjct: 159 LTMPGYMYLSQLQQDRDVVAQLESIQYMVAQREHPLISTIFVRTL----MDKRYFYGIRS 214
Query: 263 EAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVA 322
AA+AL A EE W GL HL ++ N + R NDF D + Y + +AIP A+A
Sbjct: 215 AAAHALVKHAKEEVHWIGLFHLETAFQELFCLPNSPMSRSNDFSDRASYCLQKAIPEAMA 274
Query: 323 MVRAADNKSPREAVEFVLQLLKYNDNNGNP------------------YSDVFWLAALVQ 364
VR +SP +F+ + LK+NDN+ N +SD +++AAL++
Sbjct: 275 KVRNRHGQSPMRVKKFLYEKLKFNDNSNNKVPLSLGSHTNDCELMLPQFSDCYYVAALIR 334
Query: 365 SVGEL-----------------EFGQQS--ILFLSSLLKRIDRLLQFDRLMPSYNGILT- 404
S+ + E G+Q+ L+L + + IDR + D S+ + +
Sbjct: 335 SITDALINGKENSDGVDFDINRELGRQAENQLYLDCIAE-IDRYKRMDEWTSSFQNLYSR 393
Query: 405 --ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
I C R L + G + LD V P+ T +R++A R LLDL+
Sbjct: 394 TAIDCQRRL------MKGKV-LDVDVAQFLPYARVGTFDLLRLDAFRILLDLD 439
>gi|453085709|gb|EMF13752.1| hypothetical protein SEPMUDRAFT_148949 [Mycosphaerella populorum
SO2202]
Length = 1596
Score = 121 bits (303), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 133/547 (24%), Positives = 232/547 (42%), Gaps = 111/547 (20%)
Query: 2 LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L K GS +I+ I A+ GA LST +F+ K+G+ + L+ FF +WV
Sbjct: 545 LIKASGSTGVLRIINKIFLNAKTGALSNGELSTADFQRTCEKLGHNK---LESFFRQWVW 601
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG--DIGWP--- 115
GCP+ + +NK+K +VE+ + + ++ ++ +SN E ++ ++ P
Sbjct: 602 GSGCPIFYVQQRFNKKKLVVEMTIFQR-QLERQTKPAFEASNFMREVKEHIQEVWAPEVQ 660
Query: 116 ----GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGSKPDG 168
G M+IR+HE DG Y+H + + LEI ++K R L+ + +G
Sbjct: 661 PVFTGPMTIRIHEADGTPYEHIV--EIKEPITKLEIPYNTKYKRLKRSRLQKNRAEAANG 718
Query: 169 CDDNGDAVAGL-------------------------DMRSSMESPLSWIRADPEMEYLAE 203
++ D + L D + WIR D + E++ +
Sbjct: 719 QLNDEDGQSALLYCLGDILATNEDKEEWKLVDWSAEDEEKMNQESYEWIRMDADFEWIGK 778
Query: 204 IHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL--PHLSFNVVNTLNNFLSDSKAFWRVR 261
IH P+ M+I+QL++D D+ AQ +++ L A H+S ++ L L D + F +R
Sbjct: 779 IHLVMPLYMYISQLQQDRDIAAQYESMRYLLASNPHHVSLSI---LLRTLMDERYFHGIR 835
Query: 262 IEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG----LPRPNDFRDFSEYFVLEAI 317
+ AA LA A ++ G L+K ++ F E G +PRPNDF + AI
Sbjct: 836 VMAAEGLAKLAKDKLMDIGKHQLIKAFE-WFFCETEGNGPVMPRPNDFSSRITLIMQCAI 894
Query: 318 PHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE--------- 368
P A+A +R D K P + E LL +NDN SD ++A L+ + +
Sbjct: 895 PEAMAKLRDEDGKVPMDVRELFQALLTHNDNANPDISDCHYVATLMNCLADSLVASKWDG 954
Query: 369 ------------------LEFGQQS--------------------------ILFLSSLLK 384
EFG+ F S +
Sbjct: 955 GKKKEVVAPEPEVPQSYTFEFGEDEEDTGMMDAGGPVKDLIDFDEEPVDPDAPFTKSAID 1014
Query: 385 RIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVR 444
I+R + D + +Y + +++ I+ L ++ + + D+ V+L++ + +T VR
Sbjct: 1015 AIERHRRLDEYVVTYQNLYSVTAIKCLQKL---IKAGVVKDKTVELMQ-YTSHSTADNVR 1070
Query: 445 VEASRAL 451
+EA R +
Sbjct: 1071 LEAFRCM 1077
>gi|402222601|gb|EJU02667.1| hypothetical protein DACRYDRAFT_21692 [Dacryopinax sp. DJM-731 SS1]
Length = 1785
Score = 120 bits (302), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 153/300 (51%), Gaps = 37/300 (12%)
Query: 187 SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNT 246
S WIR D + E++A I F QP MW +QL++D DVVAQ +AI AL +P L + + T
Sbjct: 689 STYEWIRCDADFEWIATIRFEQPDFMWTSQLQRDRDVVAQLEAIHALARMPSLVVSSILT 748
Query: 247 LNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDE-NIG------- 298
+ + F+RVR+EAA AL + A+ E G+ HL K + + E N G
Sbjct: 749 KTVLVENY--FFRVRMEAALALQSCANIE--HIGMFHLFKLFTRYCYSEANHGDRDFREK 804
Query: 299 -LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPRE-AVEFVLQLLKYNDNNGNPYSDV 356
+P PNDF+DF+ YF+ +++ A++ R N + A + +L+YN+N N YSD
Sbjct: 805 FIPEPNDFKDFASYFIRKSVLVAISRCRDDSNGNTWSVARRLFIDVLRYNNNAKNQYSDS 864
Query: 357 FWLA--------ALVQSV-----------GELEFGQQSILFLSSLLKRIDRLLQFDRLMP 397
+L+ ALV V G + + L ++ ++R D+ +P
Sbjct: 865 RYLSTAIVALGGALVPGVPPERGILTDAEGPVPLTESDATILEEAVQEVERYRLRDQYVP 924
Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFH 457
S + ++TI+ + ++ L L G I+ D ++ R+ N + VR+ A ALL ++++
Sbjct: 925 SQHNVITIAALE--FKLMLMLGGLIATDGNF-FLEYTREENFV-PVRLAAFDALLVMKWY 980
Score = 42.0 bits (97), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 28/113 (24%)
Query: 30 TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
TLST F KV ++ K F +W+ G P + S+N++K VE+ + +DC
Sbjct: 507 TLSTHSFLRTCRKVSGVDP---KTFADQWIYGSGTPRFGVTASFNRKKMAVEVTIAQDC- 562
Query: 90 VKPDSRTPVLSSNTDSENRDGD-IG---------WPGMMSIRVHELDGM-YDH 131
P ++ +GD +G + G ++IR+HE DG Y+H
Sbjct: 563 -------PAYMAH------EGDPVGLAVFNPVQLFEGPITIRIHEADGTPYEH 602
>gi|327305057|ref|XP_003237220.1| transcription initiation factor TFIId 127kD subunit [Trichophyton
rubrum CBS 118892]
gi|326460218|gb|EGD85671.1| transcription initiation factor TFIId 127kD subunit [Trichophyton
rubrum CBS 118892]
Length = 1251
Score = 120 bits (300), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/404 (26%), Positives = 182/404 (45%), Gaps = 51/404 (12%)
Query: 14 ILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTC-----GCPVLR 68
IL+ +S+A G + + + ++ F ++G RP L + C G L
Sbjct: 514 ILERRLSKASGKATISRIISRIF--LNARMGR--RPRLLVHYHTAGVACLHTLTGEMALS 569
Query: 69 MGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDIG-----WPGMMSI 120
+NK+K +VE+ + + + T D + E R G + G M+I
Sbjct: 570 SANGFNKKKLVVEMMISQVQSEQTTARDLERSTFMRDVKEEIRSVYAGTVQNVFTGSMTI 629
Query: 121 RVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGSKPDGCDDN----- 172
++HE DG Y+H + +A +I ++K R + ++ + G D N
Sbjct: 630 QIHEADGTPYEHIV--EIKEAITKFDIPYNTKYKRLKRNRRQKERAAAMSGIDPNSETQD 687
Query: 173 -------GDAV-AGLDM--------------RSSMESPLSWIRADPEMEYLAEIHFNQPV 210
GD + + DM R ES WIR D + E++ ++ P
Sbjct: 688 DVLLYCLGDVLQSDEDMEKWRIVEWTKEDESRMGQES-YEWIRMDADFEWICKMSLGMPG 746
Query: 211 QMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN 270
M+++QL++D DVVAQ ++I + A + + + + D + F +R A AL
Sbjct: 747 YMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFHGIRTAATKALVK 805
Query: 271 TASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNK 330
A +E DW GL HL ++ N + R NDF D S Y V A+ A++ VR D +
Sbjct: 806 HAKQEVDWIGLFHLENAFQELYCLPNSSMARSNDFSDRSSYAVQLALIEAISEVRDEDGR 865
Query: 331 SPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
+P +F+ + LK+NDN+ N YSD F+++ L++S+ G++
Sbjct: 866 TPMRVKQFLYEKLKFNDNSNNEYSDCFYISTLLKSICNALLGRR 909
>gi|412991384|emb|CCO16229.1| predicted protein [Bathycoccus prasinos]
Length = 1510
Score = 119 bits (299), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 118/427 (27%), Positives = 184/427 (43%), Gaps = 59/427 (13%)
Query: 25 ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVE--L 82
AS R +S++ F NL + + F RWV CG P + +S KRKN +E L
Sbjct: 807 ASNARWISSQHFLDHCRSAVNLGKGEIASFVERWVYGCGSPTFDVEYSIRKRKNKLEFVL 866
Query: 83 AVLRD-CTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDH---------- 131
V+ T+ D ++ N + ++IR+ E DH
Sbjct: 867 KVIHSPATLAADEAASRVARNHRTS-----------VTIRLQEDATASDHVVALTAGGAT 915
Query: 132 ----PILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP-----DGCDDNGDAVAGLDMR 182
+ P + + + +Q +K + + KK S P G D N D + D
Sbjct: 916 AAPGTVAPSEHEHFAEMPLQVKTKESR---YEKKKFSAPVVESGTGADINNDLLTAKD-- 970
Query: 183 SSMESPLSWIRADPEMEYLAEIHF-NQPVQMWINQ---LEKDGDVVAQAQAIAALEALPH 238
+ +SW+R DP+ E+L I + V + Q L+K+ DVVAQ+ A+ L
Sbjct: 971 ----NQVSWVRVDPDGEWLCNIRLPTEQVGLESMQALLLDKEMDVVAQSSAVRFLAERAK 1026
Query: 239 L-SFNVVNTLNNFL-SDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
+ S L L S+ F RVR EAA AL + ++ + GL LV +Y+ + D N
Sbjct: 1027 IGSQTACAALEKCLRSNKTTFCRVRAEAAMALGKSTGVKSKFLGLSALVSYYRRTQCDPN 1086
Query: 297 IGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADN------KSPREAVEFVLQLLKYNDNNG 350
G P+PND RD SE V EA+ A+ N +P A + ++ L NDN G
Sbjct: 1087 TGFPKPNDTRDVSETIVDEAVLLALGASVDDSNSMNGVISTPTSAFDCLVDRLSRNDNTG 1146
Query: 351 NPYSDVFWLAALVQ--SVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
NP SDV + A ++ S+ ++ L ++ ++ R F + S ++ SC
Sbjct: 1147 NPNSDVGLIVAALEALSISMPPDAKRRDLAMAQAVRYAHRDPVF---IKSDRNVVARSCY 1203
Query: 409 RTLTQIA 415
R L +A
Sbjct: 1204 RALANLA 1210
>gi|312384150|gb|EFR28947.1| hypothetical protein AND_02477 [Anopheles darlingi]
Length = 657
Score = 118 bits (296), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 92/307 (29%), Positives = 148/307 (48%), Gaps = 30/307 (9%)
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE--ALPHLSFNVVNTLNNFLSDSK 255
M L ++ QP W QL + DV AQ AI ALE A P + +T+ N +
Sbjct: 1 MTLLRSVNIEQPDFQWQFQLRHERDVTAQLDAIQALEKYATPQARLALTDTIEN----ER 56
Query: 256 AFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKSRRFDENIGLP---RPNDFRDFSEY 311
F+ VR +AA L A+ T W G ++ +K +F + P R N+F +F Y
Sbjct: 57 IFYEVRCQAALCLTKVANAMVTSWQGPPAMLTIFK--KFFGSYSAPHIIRQNNFTNFQHY 114
Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF 371
F+ + IP A+A +R A P E + F+L L KYNDN+ N YSD ++ AALV+++G
Sbjct: 115 FLQKTIPVAMAGLRTAHGICPPEVIRFLLDLFKYNDNSKNHYSDNYYRAALVEALGNTIT 174
Query: 372 GQQSILF----LSS---------LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
S++ LSS +L+ + R+L ++ +PSY ++I+C++ + KL
Sbjct: 175 PIISVVHQGRALSSENLSADAKLVLEEVTRILNLEKHLPSYKYTVSIACLKVIR----KL 230
Query: 419 SGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSL 478
L K+ + + + VR+ A L+D +G L I+ +E +P
Sbjct: 231 QKCGHLPTNPKIYRSYAAYGQYIDVRLAAMECLVDY-VKIDGRWEDLEHLIELLETDPDP 289
Query: 479 RGQVKLG 485
+ +LG
Sbjct: 290 MARHQLG 296
>gi|303283346|ref|XP_003060964.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457315|gb|EEH54614.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1697
Score = 116 bits (291), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 120/450 (26%), Positives = 198/450 (44%), Gaps = 67/450 (14%)
Query: 26 SPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNK--RKNIVELA 83
SP R L F + + + + F RW+ CGCP L +G+++ K R+ ++
Sbjct: 755 SPARFLRQSVFLTSCRQNATMTKQEMAAFTARWIEGCGCPRLTVGYTFRKSRRQELLFAI 814
Query: 84 VLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELD-GMYDHPILPMAGD-AW 141
L C + N +++RV E D DH + A D A+
Sbjct: 815 KLDGCRAAAAADKIAFRKNPKIS-----------VTVRVQETDLPPSDHAVSLSAHDRAY 863
Query: 142 QLLEIQCHSKLAARR--------ALKPKKGSKPDGCDDNGDAVAGLDMRSSM-----ESP 188
L+ +Q +K RR AL + ++ G + D LD ++ E P
Sbjct: 864 CLMPLQLVTKPKDRRTIARQQQAALDRQARARETGNQNPDD----LDAAAAAEVLQWECP 919
Query: 189 LSWIRADPEMEYLAEIHFNQPVQ------MWINQLEKD--GDVVAQAQAIAALEALPHL- 239
++W+R DPE E+LAE+ P++ M QL K+ DV AQ AIA L A
Sbjct: 920 VNWVRIDPEGEWLAEVRV--PIEQIGLEGMITAQLRKERPADVGAQLHAIAYLRARAEAG 977
Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGL--LHLVKFYKSRRFDENI 297
S + VN L + + D+K F RVR +AA AL A ++T + L +++ Y+ RR D
Sbjct: 978 STSAVNALLHCVEDAKTFCRVRADAAKALGACACDDTQRSNLAMTSVMRTYRKRRCDPET 1037
Query: 298 GLPRPNDFRDFSEYFVLEAIPHAVAMVRA----------ADNKSPREAVEFVLQLLKYND 347
GLP P D R++++ V E + A+ + R D +P E V+ ++ +L +
Sbjct: 1038 GLPAPTDLRNYADALVDEGLVAALGLPRVPAREAAANDMVDYTTPSECVDLLVDMLNNLE 1097
Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNG------ 401
++G+P+ +A ++++G + L S L I R L D + G
Sbjct: 1098 SDGDPHDMSSLVATALRALGAASL--EKTDALQSSLDAISRWLTKDVNICGVGGRTHAGG 1155
Query: 402 -ILTISCIRTLTQIALKLSGFISLDQVVKL 430
+T + + L +A +L SL +VV +
Sbjct: 1156 RRVTTAALEALRAVATRLP---SLARVVAV 1182
>gi|402589753|gb|EJW83684.1| hypothetical protein WUBG_05405 [Wuchereria bancrofti]
Length = 523
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 113/378 (29%), Positives = 182/378 (48%), Gaps = 49/378 (12%)
Query: 105 SENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
S NR G + G + + V ELDG + H + + + ++QCHSK RR K KK
Sbjct: 11 SANR-GRQSYVGPLIVVVQELDGSFTHTVQIDSNHSKH--DLQCHSK--GRRQ-KKKKIP 64
Query: 165 KPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVV 224
G + D +A +D +SP+ WIR DP++ L ++ QPV W L+ + DV+
Sbjct: 65 LSTGEELEID-LANMDP----DSPVLWIRIDPDLLLLRKVSIQQPVYQWEYMLKYERDVL 119
Query: 225 AQAQAIAALEALP--HLSFNVVNTLNNFLSDSKAFWR---VRIEAAYALANTASE--ETD 277
AQ QA+ L+ P H ++ T+ + F+R VR AAY L ++ ET
Sbjct: 120 AQLQALDVLQRFPSPHARAILIETIETEV----FFYRQVLVRCRAAYILVEVVNKLPET- 174
Query: 278 WAG----LLHLVKFYKSRRFDENIGLPRPNDFRDFSE----YFVLEAIPHAVAMVRAADN 329
W G ++ KF+ + +P+PN+F S YF++ AIP A+A +R++ N
Sbjct: 175 WLGPPALMILFKKFFGCKSAPH---IPKPNNFVATSANLQLYFLMHAIPQALARLRSSSN 231
Query: 330 KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL-------EFGQQSILFLSS- 381
+ E F++ L+KYNDN N YSD + A+L+ S+ + S L LS+
Sbjct: 232 NALSEVHHFLIGLIKYNDNTVNRYSDDHYRASLINSLAATLVPAENRDITNMSPLSLSTE 291
Query: 382 ---LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFN 438
LL L D L PS+ ++ I+ ++ + ++ + I LD K+ F N
Sbjct: 292 MKELLSEFTLALNMDTLKPSFGRVIGITALQAIYH--MQRTFHIPLDP--KVFWSFAQPN 347
Query: 439 TIWQVRVEASRALLDLEF 456
+R+ A L+D+ +
Sbjct: 348 MYVPMRLAALTFLIDMVY 365
>gi|340905010|gb|EGS17378.1| hypothetical protein CTHT_0067030 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1589
Score = 114 bits (284), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 129/525 (24%), Positives = 234/525 (44%), Gaps = 84/525 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
+ K GS+ +++ ++S A + + T LS +FR K RP E + +WV
Sbjct: 492 MSKIPGSSGIVRVISQLVSNANINTTIETTSLSEADFRKACEKKSQY-RP--DELWEQWV 548
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG------ 113
GCP L + +NK+ V++ + + K S+ ++ + ++
Sbjct: 549 HGAGCPKLSVKQRFNKKNLNVDITITQ-VQAKDSSQPKAITKDVFQRELFEEVHEVWAGP 607
Query: 114 ----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKK------ 162
+ G ++IRVHE DG Y+H L ++ ++Q +R + KK
Sbjct: 608 VPKYFTGPITIRVHEADGTPYEH-YLSISDKDKGHTQLQIAYNTKYKRMKRTKKAIMAAA 666
Query: 163 ----GSKPDGCDDN-------GDAVA--------GL-DMRSSMESPLS-----WIRADPE 197
G K D +D+ GD + GL D S++ + WIR D
Sbjct: 667 AASTGDKHDIQEDDIVYFNMLGDVLMSPQDAIDWGLQDWSESVQQQMDQESYEWIRFDCN 726
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLN-NFLSDSKA 256
E+L +H + P M+ QL++D DVVA A+ H + V +T++ + D +
Sbjct: 727 FEWLCALHSDMPGYMYSAQLQQDRDVVAHQDAMLFFSKQRHHA--VTSTIDVRTVMDRRY 784
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL-------PRPNDFRDFS 309
F+ +R+ AA+ L A+ + + G HL+ Y+ D+ IG P PNDF D +
Sbjct: 785 FYGIRVMAAHDLTKQANPDLNCIGKAHLILLYRHFFCDKVIGKNGQVTYPPAPNDFYDKA 844
Query: 310 EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS---- 365
+Y V AIP A+A +R + PR+A F+L LL N YSD F++ L+Q+
Sbjct: 845 QYEVQCAIPQAIARIR-ENGFCPRDARRFLLDLLVLNAETEGEYSDHFYICKLLQALTTS 903
Query: 366 -VGELEFGQQSIL---------------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
+ E G+ +L F+ ++L I++ + D +++ I T + +
Sbjct: 904 IIPEKYTGETDLLSSLRSDDPVESELKDFIKAVLDEIEKYRRKDEWELTWHNIWTTTTLD 963
Query: 410 TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
++ L + I ++ ++ ++ +D N + +R++A L++L
Sbjct: 964 --CKMRLMKARVIPVEP-IEFVRYLQDEN-LDLIRIKAHECLVEL 1004
>gi|238583594|ref|XP_002390290.1| hypothetical protein MPER_10455 [Moniliophthora perniciosa FA553]
gi|215453537|gb|EEB91220.1| hypothetical protein MPER_10455 [Moniliophthora perniciosa FA553]
Length = 672
Score = 112 bits (281), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 91/321 (28%), Positives = 150/321 (46%), Gaps = 41/321 (12%)
Query: 18 IISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRK 77
+S G LST F KV ++ + F +W+ GCP + ++N++K
Sbjct: 365 FLSAISGEMQNNCLSTHSFLRMCRKVSGVDP---RSFAEQWIYGSGCPSFGVTATFNRKK 421
Query: 78 NIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW-PGMMSIRVHELDGM-YDHPILP 135
VE+ + ++C P ++ L N ++ + + G M+IR+HE DG Y+H +L
Sbjct: 422 MAVEITMRQEC---PAYQS--LEHNEFAKVMYKPVQFFEGTMTIRIHEADGTPYEH-VLD 475
Query: 136 MAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSME--------- 186
+ ++ E+ ++K R K+ +P G D G + + E
Sbjct: 476 IR-QPFKRFEVPFNTKYKRVRR-NTKRPLRPMGMVDMGFGLEIWEKEQERENWKVADWTE 533
Query: 187 --------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPH 238
+ WIR D + E++A + F QP MW++QL++D DVVAQ +A+ AL P
Sbjct: 534 EDENVMSGATYEWIRMDADFEWIANLKFEQPDFMWVSQLQRDRDVVAQMEAVHALARQP- 592
Query: 239 LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---- 294
+ V +TL + S +R+R+EAA AL N A D+ GL HL K + ++
Sbjct: 593 -TAIVSSTLTKTVLVSNYHYRIRVEAALALVNCAIHRLDFLGLFHLFKLFLRYCYEPEDP 651
Query: 295 -----ENIGLPRPNDFRDFSE 310
+ +P+PNDF D SE
Sbjct: 652 KQNLFTHTYVPKPNDFSDLSE 672
>gi|348588273|ref|XP_003479891.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Cavia porcellus]
Length = 1111
Score = 112 bits (281), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 120/474 (25%), Positives = 202/474 (42%), Gaps = 94/474 (19%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 407 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 466
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 467 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 504
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G + + V ELDG ++H L + ++ + +I CHSK + K
Sbjct: 505 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 555
Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
NG+ V +D+ + +SPL WIR DP+M L + F Q MW QL + DVVAQ ++
Sbjct: 556 -NGEEV-DMDLSAMDADSPLLWIRIDPDMAVLRRVEFEQADFMWQYQLRYERDVVAQQES 613
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
I ALE P + + L + L + F+RVR+ A + LA A S + W G + +
Sbjct: 614 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 671
Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
+R F + + N+F F YF L K++
Sbjct: 672 -TRMFCCKTCPNIVKTNNFMSFQSYF-----------------------------LQKFS 701
Query: 347 DNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLMPSY 399
DN Y + AL SV + L +L L+ I R L ++L+PSY
Sbjct: 702 DN----YYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLLPSY 757
Query: 400 NGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+T+SC+R + L+ +G + D L K + ++ +R+ A A++D
Sbjct: 758 RHTITVSCLRAIR--VLQKNGHVPSDPT--LFKSYAEYGHFVDIRIAALEAVVD 807
>gi|255078532|ref|XP_002502846.1| predicted protein [Micromonas sp. RCC299]
gi|226518112|gb|ACO64104.1| predicted protein [Micromonas sp. RCC299]
Length = 1703
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 175/391 (44%), Gaps = 47/391 (12%)
Query: 12 RKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
R+ ++I A SP R L ++ F + + + + + F RW+ CGCP L +
Sbjct: 751 REHQESIAYHALAESPARFLRSRVFLNLCRQNATMTKADMSAFNARWIEGCGCPRLTAAY 810
Query: 72 SYNK--RKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELD-GM 128
++ K R+ ++ L C + S N +++R+ E D
Sbjct: 811 TFRKTRRQELLFAIKLDGCLAAAAADRVAWSKNPKVS-----------VTVRIQENDLPP 859
Query: 129 YDHPILPMAGD-AWQLLEIQCHSKLAARR--------ALKPKKGSKPDG--CDDNGDAVA 177
DH + A D A+ L+ +Q +K RR A++ ++ + G DN DA+A
Sbjct: 860 SDHAVSLSAHDRAYVLMPLQLVTKPKDRRTIARQQEAAMRRQQQAMEAGKANPDNLDAMA 919
Query: 178 GLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQ----MWINQL--EKDGDVVAQAQAIA 231
++ E P+SW+R DPE E+LA++ Q M QL E+ D AQ AI
Sbjct: 920 AAEVL-QWECPVSWVRVDPEGEWLAKVVLPSAQQGLEGMITKQLMSERPADPAAQMWAIR 978
Query: 232 AL-EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLL--HLVKFY 288
L E + S + VN L D K F RVR AA AL A + T + L + + Y
Sbjct: 979 FLRERAENGSTSAVNALLQCAEDPKTFCRVRALAAMALGACACDGTQRSNLALSSVTRTY 1038
Query: 289 KSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRA----------ADNK--SPREAV 336
+ RR D G P+P D DF+ V E + A+ + RA AD+ +P E V
Sbjct: 1039 RRRRCDPRTGKPKPTDLADFAAAIVDEGLLTALGLPRAPPESTSGWEDADDAWVTPTECV 1098
Query: 337 EFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
+ ++ L + +++G+P+ +A + ++G
Sbjct: 1099 DLLVDALNHTESDGDPHDSSSLIATAIHALG 1129
>gi|406700376|gb|EKD03547.1| hypothetical protein A1Q2_02130 [Trichosporon asahii var. asahii
CBS 8904]
Length = 1847
Score = 110 bits (275), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 75/241 (31%), Positives = 125/241 (51%), Gaps = 27/241 (11%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR DP+ E+LA + WI+QLE + DVVAQ Q + + S + + L
Sbjct: 659 WIRVDPDCEWLATFDIQEKPWFWISQLEGESDVVAQLQRMCM-----YPSPVIASELAKV 713
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS---RRFDE----NI-GLPRP 302
+ F+RVR+EAA AL S E D+ GL L+K + + DE N+ +P P
Sbjct: 714 VLVENYFYRVRMEAARALVLYNSAEADYMGLFLLLKLFHTFYCEPTDEEEPSNVKAVPLP 773
Query: 303 NDFRDFSEYFVLEAIPHAVAMVR-AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
NDF +F+ YF+ +++ A++ +R A + R E +L LL+ NDN+ N YSD ++A
Sbjct: 774 NDFSNFANYFLKKSMVTALSDLRDPATRRVWRHVRELLLDLLRMNDNSQNEYSDSHYVAT 833
Query: 362 LVQSVGEL---------EFGQQSILFLSSLLKRI----DRLLQFDRLMPSYNGILTISCI 408
++ ++G ++ + L K +R + DRL+PSY+ ++T++ +
Sbjct: 834 VISAIGSAFSAGVNVGGNLSEEDQVMDGQLFKEASEANERAITVDRLVPSYHNVVTLAGL 893
Query: 409 R 409
+
Sbjct: 894 Q 894
>gi|367044074|ref|XP_003652417.1| hypothetical protein THITE_161557 [Thielavia terrestris NRRL 8126]
gi|346999679|gb|AEO66081.1| hypothetical protein THITE_161557 [Thielavia terrestris NRRL 8126]
Length = 1577
Score = 110 bits (274), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 134/544 (24%), Positives = 233/544 (42%), Gaps = 80/544 (14%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPV--RTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
+ K GS +++ I+S A + +LS EFR K RP E + +WV
Sbjct: 547 MSKVPGSMGIARVISQIVSNANINVTIDSTSLSDAEFRRACEKKSQY-RP--DELWDQWV 603
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAV----LRDCTVKPDSRTPVLSSNTDS--ENRDGDIG 113
GCP LR+ +NK+ V++++ +D KP ++ L + E G +
Sbjct: 604 HGAGCPRLRIKQRFNKKNLNVDISIAQVQTKDVAPKPITKEDFLKELLEEVHEVWAGPVP 663
Query: 114 --WPGMMSIRVHELDGM-YDHPILPMAGD-AWQLLEIQCHSKL----AARRALKPKKGSK 165
+ G ++RVHE DG Y+H + D L+I ++K ++A +
Sbjct: 664 KLFTGPFTVRVHEADGTPYEHYLSITDKDKVSTTLQIAYNTKYKRMKRTKKATAAAATNA 723
Query: 166 PDGCDDNGDAVAGLDM-----------------------RSSME-SPLSWIRADPEMEYL 201
D D D + +M ++SM+ WIR D E+L
Sbjct: 724 ADKHDIQEDDIVYFNMLGDVLMTPKDIEDWGLQDWDEKVQASMDQESYEWIRLDCNFEWL 783
Query: 202 AEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLS-DSKAFWRV 260
E+ + P M++ QL++D DVVA A+ V +T+ + D + + +
Sbjct: 784 CEMVTDMPGYMYLAQLQQDRDVVAHQDAMLFFRR--GRRHGVASTIETRTAMDRRYYHGI 841
Query: 261 RIEAAYALANTASEETDWAGLLHLV----KFYKSRRFDEN---IGLPRPNDFRDFSEYFV 313
R A L A ++ G L+ F+ R +N P PNDF D ++Y V
Sbjct: 842 RTMAIDDLPKQADPALNYIGKAQLILMFRHFFCERMVGKNGIVTSPPMPNDFHDRAQYAV 901
Query: 314 LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV------- 366
AIP A+A R + + R+A F+L+LL +NDN N YSD F++A L+ ++
Sbjct: 902 QCAIPAAIARTRE-NGRCSRQARNFILELLLFNDNTENEYSDHFYIAKLLDALTTSIIPD 960
Query: 367 ---------GELEFGQ----QSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQ 413
+L + + FL ++ ID+ + D +Y I T + + +
Sbjct: 961 KKDGGRDLSSDLRMDEDDDTEPKAFLEKAVEEIDKYRRMDEWTSTYQNIWTTTALN--CK 1018
Query: 414 IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVE 473
+ L +G I++D ++ ++ +D N + VR++A L++L S + L + +
Sbjct: 1019 MRLMKAGVIAVDP-IEFVQYLQDGNADF-VRIKAFDCLVELGLLAKA--SIMKLLLACLS 1074
Query: 474 EEPS 477
+PS
Sbjct: 1075 TDPS 1078
>gi|392578386|gb|EIW71514.1| hypothetical protein TREMEDRAFT_73329 [Tremella mesenterica DSM 1558]
Length = 2067
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/241 (31%), Positives = 123/241 (51%), Gaps = 18/241 (7%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR DPE E++A F + WI+QL+ D DV+AQ +AI L P S V + L
Sbjct: 1051 WIRFDPECEWIATFDFAEKPWYWISQLQGDRDVIAQVEAIQNLTMYP--SPVVASELART 1108
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS----RRFDENIGLPRPNDFR 306
+ F+RVR+EAA AL + E D+ G LVK Y++ D+ P+ NDF
Sbjct: 1109 ILVKNYFYRVRMEAAKALTVYNTSECDYIGHFLLVKLYRALFCLPGDDDAPIRPKRNDFS 1168
Query: 307 DFSEYFVLEAIPHAVAMVR-AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
D ++Y L + ++A +R A + R+ +L L++YNDN GN + D A + +
Sbjct: 1169 DVADYLSLVS---SLADLRDPATRNTRRDVRRLLLDLIRYNDNTGNLFDDSTMRATYITA 1225
Query: 366 VGEL----EFGQQSI--LFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
+G + + + + + +DR + DRL PS++ +TI+ + L Q+ + L
Sbjct: 1226 LGNAFSLPAWNAEEVDRVMFDEANEAVDRAITMDRLAPSFHNSVTIAGL--LAQMKMILG 1283
Query: 420 G 420
G
Sbjct: 1284 G 1284
>gi|301100182|ref|XP_002899181.1| transcription initiation factor TFIID subunit 2, putative
[Phytophthora infestans T30-4]
gi|262104098|gb|EEY62150.1| transcription initiation factor TFIID subunit 2, putative
[Phytophthora infestans T30-4]
Length = 1293
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 108/452 (23%), Positives = 188/452 (41%), Gaps = 87/452 (19%)
Query: 22 AQGASPVRTLSTKEFRHFANKV-GNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIV 80
A GA+ L+T F + G + K F W+ G P +G+ YN+++
Sbjct: 563 ANGATATEPLTTLSFLKTVKTIAGAAGQDLTKSFLTSWIIEPGMPFFTVGYWYNRKQTQA 622
Query: 81 ELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDA 140
E+ + ++ + P + + G ++I + E Y +
Sbjct: 623 EVVLQQE--IPPGGKL-----------------YTGPITITIVEDTSEYTYQ--KRIEHK 661
Query: 141 WQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEY 200
+ CHSK+ +R + + + PD G GL+ ++P+ W++ D +
Sbjct: 662 RHKFDFPCHSKVRKKRRRR-QGLADPDDSVSVGGPGMGLN-----DTPVFWVKIDTGCAW 715
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL--PHLSFNVVN------TLNNFLS 252
L + +QP W+ QL D V ++ A AL PH NV++ ++ +
Sbjct: 716 LRHVVIHQPDFNWMEQLLSDKKVGSRVHAARALALFHRPHEKPNVMSCRVLTECMSGLTT 775
Query: 253 DSKAFWRVRIEAAYALA----------NTASEETDWAGLLHLVKFYKSRRFDENIGLPRP 302
S+ R+R EAA +L N ++ W + +LV+ +K FD +P P
Sbjct: 776 HSR---RLRAEAAQSLGIWQSIHAPLTNANTQLPIWKAMHNLVRIFKEHFFDRATDMPLP 832
Query: 303 NDF--------------------------RDFS--EYFVLEAIPHAVAMVRAADNKSPRE 334
N F +D++ EY + ++IP A+AMVRA + KSP E
Sbjct: 833 NYFLPSGGKVILEALGAQQASISSKQIQVQDYAEGEYEIKKSIPKALAMVRAQNGKSPAE 892
Query: 335 AVEFVLQLLKYNDNNGNPYS---------DVFWLAALVQSVGELEFGQQSIL-FLSSLLK 384
F+LQLL NDN+ N D ++L +V S+ L + + +++
Sbjct: 893 IETFLLQLLTENDNSKNYVEMDDQSQVADDCYYLGNIVLSLSVLTLDRPARSDDGDAVVL 952
Query: 385 RIDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
I R L +D++ PSYN +T+ C+ L + L
Sbjct: 953 EILRYLHYDQVQPSYNKTITVCCLEALCNLVL 984
>gi|452823940|gb|EME30946.1| transcription initiation factor TFIID subunit D2 [Galdieria
sulphuraria]
Length = 1202
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 101/402 (25%), Positives = 182/402 (45%), Gaps = 37/402 (9%)
Query: 53 EFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDI 112
+F RW +C LR G+ Y+ +K VELAV + + + D +++
Sbjct: 505 DFISRWTRSCDILKLRCGYRYDSKKKFVELAVKQSIITFRSQDGKRIERDLDGDSKVS-- 562
Query: 113 GWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDN 172
+ G ++++V E++G +DH +L M D L+E+ ++ + K K S +
Sbjct: 563 MFKGGLTVQVMEMEGAHDH-LLEMDSDIL-LVELPTSTRRMKQHHRKRSKSSLKESDTVI 620
Query: 173 GDAVAGLDMRSSM--ESPLSWIRADPEMEYLAE-IHFNQPVQMWINQLEKDGDVVAQAQA 229
A LD + + P+ W+R DP+ E++ + + F + + I+ L DV Q +A
Sbjct: 621 PPTEANLDEEGVIIEKDPVLWVRIDPDQEWILKFVEFRRSERDSISCLRNCRDVWGQLEA 680
Query: 230 IAALEALP--HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKF 287
+ L + +S + L L+DS+ +++VR E A LA + GL L+K
Sbjct: 681 LETLMSRTKGRVSVTSIRCLEQILTDSRYYFQVRGECARCLAKCYTTNDRLMGLEALLKL 740
Query: 288 YKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAM--------VRAADNKSPREAVEF- 338
+++ FD++ G N F D SEYFV A+ A+A + D++ F
Sbjct: 741 LRNKYFDKH-GAVLSNHFDDISEYFVKCAVIQAIAQAIHPISFPIYGYDSQFSLNLASFQ 799
Query: 339 ---VLQLLKY------------NDNNGNP-YSDVFWLAALVQSVGELEFGQQSILFLSSL 382
+ + L Y ND P + D F+L L+ + + + +
Sbjct: 800 MTHIQESLPYDVLQLLLSLLEQNDTAYPPLFDDCFYLHNLIIACAICSVYAKDQITRHRV 859
Query: 383 LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISL 424
K++ R L+ + MPSY+ I+T + ++ LT L+L GF+ +
Sbjct: 860 YKQLFRYLKREEAMPSYHNIITGASLKGLT--ILQLGGFLRI 899
>gi|390358718|ref|XP_793421.2| PREDICTED: transcription initiation factor TFIID subunit 2-like
[Strongylocentrotus purpuratus]
Length = 1402
Score = 108 bits (269), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 93/328 (28%), Positives = 148/328 (45%), Gaps = 44/328 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTK------EFRHFANKVGNLERPFLKEF 54
++E ++G K+ ++S A AS + +S F + + LK F
Sbjct: 493 LIEIRVGQELLLKVFNKLLSLATSASHQKFMSCAWSNMLISTSGFLKSISTVSGKDLKLF 552
Query: 55 FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW 114
RWV G R F +N+++NIVEL + +D R G I +
Sbjct: 553 INRWVCQGGMAQFRGTFHFNRKRNIVELELRQD------------------PGRPGIIKY 594
Query: 115 PGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALK-PKKGSKPDGCDDNG 173
G + + + ELDG ++H + D EI CHSK + K P K NG
Sbjct: 595 VGPLKVTIQELDGSFEHTL--QVEDNLTKGEITCHSKSRRHKKKKIPLK---------NG 643
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
+ V + +SP+ WIR D EM L E+ F+QP W QL + DVVAQ +A+ L
Sbjct: 644 EEVDMDLSKMDPDSPVLWIRVDTEMTLLREVIFDQPDYQWQCQLRYERDVVAQLEAVECL 703
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYK--- 289
+ S + L++ + S +++VR +AAY A ++E + ++G L+ +K
Sbjct: 704 KKFA--SLETTHALSDIILHSGCYYKVRTQAAYCYAQVSNELAETFSGHTALIAMFKKLF 761
Query: 290 SRRFDENIGLPRPNDFRDFSEYFVLEAI 317
S E+I R N+F DF Y + +A+
Sbjct: 762 SSFSSEDIV--RRNNFMDFRAYLIQKAV 787
>gi|405121726|gb|AFR96494.1| transcription initiation factor TFIId subunit [Cryptococcus
neoformans var. grubii H99]
Length = 1728
Score = 104 bits (259), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/238 (34%), Positives = 125/238 (52%), Gaps = 26/238 (10%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR DP+ E+LA F + WI+QL+ D DV+AQ +AI + P S + + L
Sbjct: 693 WIRVDPDCEWLAIFEFAEKPWCWISQLQGDRDVIAQLEAIQNMRMYP--SPVIASELART 750
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG----------LP 300
+ F+RVR+EAA ALA S E D+ G ++K ++ + G LP
Sbjct: 751 VLVKNYFYRVRMEAARALAMYNSSECDYIGYFLIMKLFQVFYCQPSSGNETEPPEIQCLP 810
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFV-LQLLKYNDNNGNPYSDVFWL 359
PNDF +F++YFV +++ A+A +R + V L LL+ NDN N YSD F+L
Sbjct: 811 LPNDFSNFADYFVKKSLIAAMADLRDPATRVVWRHVRITLLDLLRLNDNTSNQYSDSFYL 870
Query: 360 AALVQ------SVGELEFGQQSILFLSS--LLKR-----IDRLLQFDRLMPSYNGILT 404
AA++ +VG G S + S LL++ +DR + DRL+PSY+ ++T
Sbjct: 871 AAVITAISNAFTVGSAAQGMLSETEIESEQLLQKEATDVLDRAMTVDRLVPSYHNVVT 928
Score = 39.3 bits (90), Expect = 7.5, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 18/113 (15%)
Query: 27 PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR 86
P TL+T F KV L+ L+ F +W+ GCP L + ++ K+K VE V +
Sbjct: 491 PGNTLTTSYFIRQCRKVSGLD---LQTFQDQWIFGSGCPRLSIRTNFIKKKFTVEFMVSQ 547
Query: 87 DCTVKPDSRTPVLSSNTDSENRDGDIGW-------PGMMSIRVHELDGM-YDH 131
++ V + N SE + W G +++R+HE DG ++H
Sbjct: 548 -------TQPAVEAINEMSEKQRKVAVWRRPTPFFEGSLTVRIHEADGAPFEH 593
>gi|134113885|ref|XP_774190.1| hypothetical protein CNBG1720 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256825|gb|EAL19543.1| hypothetical protein CNBG1720 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1729
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/238 (34%), Positives = 126/238 (52%), Gaps = 26/238 (10%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR DP+ E+LA F + WI+QL+ D DV+AQ +AI + P S + + L
Sbjct: 693 WIRVDPDCEWLAIFEFAEKPWCWISQLQGDRDVIAQLEAIQNMRMYP--SPVIASELART 750
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG----------LP 300
+ F+RVR+EAA ALA S E D+ G ++K +++ + G P
Sbjct: 751 VLVKNYFYRVRMEAARALAMYNSSECDYIGYFLIMKLFQAFYCQPSSGNETEPPEIQCRP 810
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFV-LQLLKYNDNNGNPYSDVFWL 359
PNDF +F++YFV +++ A+A +R ++ V L LL+ NDN N YSD F+L
Sbjct: 811 LPNDFSNFADYFVKKSLIAAMADLRDPATRAVWRHVRITLLDLLRLNDNTSNQYSDSFYL 870
Query: 360 AALVQ------SVGELEFGQQSILFLSS--LLKR-----IDRLLQFDRLMPSYNGILT 404
AA++ +VG G S + S LL++ +DR + DRL+PSY+ ++T
Sbjct: 871 AAVITAISNAFTVGSAAQGMLSETEIESEQLLQKEATDVLDRAMTVDRLVPSYHNVVT 928
Score = 39.7 bits (91), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 18/113 (15%)
Query: 27 PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR 86
P TL+T F KV L+ L+ F +W+ GCP L + ++ K+K VE V +
Sbjct: 491 PGNTLTTSYFLRQCRKVSGLD---LQTFQDQWIFGSGCPRLSVRTNFIKKKFTVEFMVSQ 547
Query: 87 DCTVKPDSRTPVLSSNTDSENRDGDIGW-------PGMMSIRVHELDGM-YDH 131
++ V + N SE + W G +++R+HE DG ++H
Sbjct: 548 -------TQPAVEAINEMSEKQRKVAVWRRPTPFFEGSLTVRIHEADGAPFEH 593
>gi|321260951|ref|XP_003195195.1| TFIID subunit (150 kDa); Taf2p [Cryptococcus gattii WM276]
gi|317461668|gb|ADV23408.1| TFIID subunit (150 kDa); Taf2p [Cryptococcus gattii WM276]
Length = 1729
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/238 (34%), Positives = 127/238 (53%), Gaps = 26/238 (10%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR DP+ E+LA F + WI+QL+ D DVVAQ +AI + P S + + L
Sbjct: 693 WIRVDPDCEWLAVFEFIEKPWCWISQLQGDRDVVAQLEAIQNMRMYP--SPVIASELART 750
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG----------LP 300
+ F+RVR+EAA ALA + E D+ G ++K ++ + G LP
Sbjct: 751 ILVKNYFYRVRMEAARALAMYNNSECDYIGYFLIMKIFQVFYCQPSSGNETVPTEIRCLP 810
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFV-LQLLKYNDNNGNPYSDVFWL 359
PNDF +F++YFV +++ A+A +R ++ + V L LL+ NDN N YSD F+L
Sbjct: 811 LPNDFSNFADYFVKKSLIAAMADLRDPATRAVWKHVRVTLLDLLRLNDNTSNLYSDSFYL 870
Query: 360 AALVQ------SVGELEFGQQSILFLSS--LLKR-----IDRLLQFDRLMPSYNGILT 404
AA++ +VG G S + S LL++ +DR + DRL+PSY+ ++T
Sbjct: 871 AAIITAISNAFTVGSAAQGMLSETEIESEQLLQKEANDVLDRAMTVDRLVPSYHNVVT 928
>gi|325184532|emb|CCA19025.1| transcription initiation factor TFIID subunit putati [Albugo
laibachii Nc14]
Length = 1236
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 107/452 (23%), Positives = 188/452 (41%), Gaps = 98/452 (21%)
Query: 31 LSTKEFRHFANKV-GNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
L+T F + G + K F W+ G P +G+ YN+++ E+ V ++
Sbjct: 525 LTTSSFLKTVKAIAGASGQDLTKSFLSMWIVDSGMPFFTVGYWYNRKQTQAEVVVQQE-- 582
Query: 90 VKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCH 149
V P + + + + + R H+ +D P CH
Sbjct: 583 VPPGGKLYTGAITVTIVEDTSEYTYQKRIESRRHK----WDFP---------------CH 623
Query: 150 SKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQP 209
SK+ +R + D NG + GL+ ++P+ W++ D +L I +QP
Sbjct: 624 SKVRKKRRRRQGLSDLDDSVSVNGPGM-GLN-----DTPVFWVKIDTGQNWLRHIVLHQP 677
Query: 210 VQMWINQLEKDGDVVAQAQAIAALEAL--PHLSFNVVNT------LNNFLSDSKAFWRVR 261
W+ QL D V ++ A AL PH N+++ ++ + S+ R+R
Sbjct: 678 DFNWMEQLLSDKKVGSRVHAARALALFHRPHEKPNIMSCRVLTECMSGLTTHSR---RLR 734
Query: 262 IEAAYAL------------ANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDF---- 305
EAA +L ANT+ W + +L + +K FD N +P PN F
Sbjct: 735 AEAALSLGVWQSIHAPLTVANTSLP--IWKAMHNLFRIFKEHFFDRNTDMPLPNYFVPSG 792
Query: 306 ----------------------RDFS--EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQ 341
+D++ EY + +++P A+AM+RA + KSP E +F+LQ
Sbjct: 793 GKVMLESLGAQQASLSSKQIQIQDYAAGEYEIKKSVPRALAMIRAQNEKSPPEIEQFLLQ 852
Query: 342 LLKYNDNNGNPY---------SDVFWLAALVQSVGELEFGQQSILFLS--------SLLK 384
LL NDN+ N D F++ ++V S+ L ++ S +++
Sbjct: 853 LLCENDNSKNYVKMDDQSQVADDCFYIGSIVLSLSLLNTPAENRRTSSMGKTPEDDRVVQ 912
Query: 385 RIDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
I R L++D++ PS+ I+T+ C+ L + L
Sbjct: 913 EILRYLRYDQVQPSHGKIITVCCLEALCNLIL 944
>gi|449329934|gb|AGE96201.1| transcription initiation factor TFIId150kDa subunit
[Encephalitozoon cuniculi]
Length = 1137
Score = 103 bits (256), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 167/378 (44%), Gaps = 68/378 (17%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
+LE + F KIL + R + F V ++ L+ F +V
Sbjct: 395 LLEGNLSRAFMEKILNYTMKRRNDVKEDYSF------EFVRLVKDVTGKDLRSLFETYVF 448
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
G +++GF+ NK+ N V+ ++ + T + S +G+ G ++I
Sbjct: 449 KPGVVRVKLGFTINKKSNRVDFTAMQ-------APTSIFS--------NGNRTVCGPVTI 493
Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
+E++G+++H + + + H K ++
Sbjct: 494 CSYEMEGVFEHVFMLDRDNHFYY-----HPKTKKKKKS---------------------- 526
Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
PL W+RADP+ E+LA+I QP M+I QL D +VV Q +A+ +L P S
Sbjct: 527 EEEEEVMPLLWVRADPKGEHLAKIIVEQPDYMFIEQL-LDKNVVGQMEALESLSTKP--S 583
Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
V L L ++ F+++RI Y L+ T ++ G L++F+ + + +
Sbjct: 584 TQVCEILERVLENTHIFYKIRIHILYILSRTTV--GNYYGFQRLIQFFVKKYCVQTSTVV 641
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAAD-----NKSPREA------VEFVLQLLKYNDNN 349
+PNDF F YF+ + H V + AD + S RE F++ +LK+NDN+
Sbjct: 642 KPNDF-SFIPYFIQK---HLVRALSIADPFVFRSYSGREVGSANIISAFIINILKFNDNS 697
Query: 350 GNPYSDVFWLAALVQSVG 367
N YSD ++++++++S+G
Sbjct: 698 FNSYSDGWYISSVIESIG 715
>gi|301092610|ref|XP_002997159.1| transcription initiation factor TFIID subunit, putative
[Phytophthora infestans T30-4]
gi|262111546|gb|EEY69598.1| transcription initiation factor TFIID subunit, putative
[Phytophthora infestans T30-4]
Length = 1129
Score = 102 bits (255), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 108/451 (23%), Positives = 190/451 (42%), Gaps = 68/451 (15%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGA------SPVRTLS-TKEFRHFANKVGNLERPFLKE 53
M+E+++G +Q +S GA P+ TLS K + A G + K
Sbjct: 431 MIEQRVGPKHLGIAIQRAVSGGAGAAATTATEPLTTLSFLKTVKTIAGAAG---QDLTKS 487
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F W+ G P +G+ YN+++ E+A+ ++ + P +
Sbjct: 488 FLTSWIIEPGMPFFTVGYWYNRKQTQAEVALQQE--IPPGGKL----------------- 528
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G ++I + E Y + + CHSK+ +R + + + PD G
Sbjct: 529 YTGPITITIVEDTSEYTYQ--KRIEHKRHKFDFPCHSKVRKKRRRR-QGLADPDDSVSVG 585
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
GL+ ++P+ W++ D +L + +QP W+ QL D V ++ A AL
Sbjct: 586 GPGMGLN-----DTPVFWVKIDTGCAWLRHVVMHQPDFNWMEQLLSDKKVGSRVHAARAL 640
Query: 234 EAL--PHLSFNVVNT------LNNFLSDSKAFWRVRIEAAYALA----------NTASEE 275
PH NV++ ++ + S+ R+R EAA +L N ++
Sbjct: 641 ALFHRPHEKPNVMSCRVLTECMSGLTTHSR---RLRAEAAQSLGIWQSIHAPLTNANTQL 697
Query: 276 TDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA 335
W + +LV+ +K FD +P PN F +LEA+ A + + +SP E
Sbjct: 698 PIWKAMHNLVRIFKEHFFDRATDMPLPNYFLPSGGKVILEALGAQQASISSKQIQSPAEI 757
Query: 336 VEFVLQLLKYNDNNGNPYS---------DVFWLAALVQSVGELEFGQQSIL-FLSSLLKR 385
F+LQLL NDN+ N D ++L +V S+ L + + +++
Sbjct: 758 ETFLLQLLTENDNSKNYVEMDDQSQVADDCYYLGNIVLSLSVLTLDRPARSDDGDAVVLE 817
Query: 386 IDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
I R L +D++ PSYN +T+ C+ L + L
Sbjct: 818 ILRYLHYDQVQPSYNKTITVCCLEALCNLVL 848
>gi|348666883|gb|EGZ06709.1| hypothetical protein PHYSODRAFT_530724 [Phytophthora sojae]
Length = 1140
Score = 102 bits (254), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 106/453 (23%), Positives = 189/453 (41%), Gaps = 70/453 (15%)
Query: 1 MLEKQMGSNFFRKILQNI-----ISRAQGAS----PVRTLS-TKEFRHFANKVGNLERPF 50
M+E+++G +Q ++ G S P+ TLS K + A G +
Sbjct: 431 MIEQRVGPKHLSIAIQRANASASTGKSTGTSASTEPLTTLSFLKTVKTIAGAPG---QDL 487
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
K F W+ G P +G+ YN+++ E+ + ++ + P +
Sbjct: 488 TKSFLTSWIIEPGMPFFTVGYWYNRKQTQAEVVLQQE--IPPGGKL-------------- 531
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
+ G ++I + E Y + + CHSK+ +R + + + PD
Sbjct: 532 ---YTGPITITIVEDTSEYTYQ--KRIEHKRHKFDFPCHSKVRKKRRKR-QGLADPDDSV 585
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
G GL+ ++P+ W++ D +L + +QP W+ QL D V ++ A
Sbjct: 586 SVGGPGMGLN-----DTPVFWVKIDTGCAWLRHVVMHQPDFNWMEQLLSDKKVGSRVHAA 640
Query: 231 AALEAL--PHLSFNVVNT------LNNFLSDSKAFWRVRIEAAYALA----------NTA 272
AL PH NV++ ++ + S+ R+R EAA +L N
Sbjct: 641 RALALFHRPHEKPNVMSCRVLTECMSGLTTHSR---RLRAEAAQSLGIWQSIHAPLTNAN 697
Query: 273 SEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSP 332
++ W + +LV+ +K FD +P PN F +LEA+ A + + +SP
Sbjct: 698 TQLPIWKAMHNLVRIFKEHFFDRATDMPLPNYFLPSGGKVILEALGAQQASISSKQIQSP 757
Query: 333 REAVEFVLQLLKYNDNNGNPYS---------DVFWLAALVQSVGELEFGQQSILFLSSLL 383
E F+LQLL NDN+ N D +++ LV S+ L + + ++
Sbjct: 758 PEIETFLLQLLTENDNSKNYVEMDDQSQVADDCYYIGNLVLSLSVLTVDRPARSEEGDVV 817
Query: 384 KRIDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
+ I R L +D++ PSYN +T+ C+ L + L
Sbjct: 818 REILRYLHYDQVQPSYNKTITVCCLEALCNLVL 850
>gi|320586964|gb|EFW99627.1| transcription initiation factor tfiid subunit tsm1 [Grosmannia
clavigera kw1407]
Length = 1826
Score = 101 bits (252), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 105/438 (23%), Positives = 181/438 (41%), Gaps = 99/438 (22%)
Query: 13 KILQNIISRAQ----GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLR 68
+++ IIS+A G + +S++ FR K + L+ F+ +WV GCP L
Sbjct: 602 RVISKIISKANTSGWGDNNNEIISSETFRRVCEKTSQYQ---LEAFWNQWVLGSGCPWLS 658
Query: 69 MGFSYNKRKNIVELAVLRD---CTVKP------DSRTPVLSSNTDSENRDGDIGWPGMMS 119
+ +N+++ VE+++L+ + KP D ++ + + +R + G M+
Sbjct: 659 IKQRFNRKRLCVEISILQTQEAGSAKPQPIRHEDFWRELVETKYGTYSRKAQRLFTGPMT 718
Query: 120 IRVHELDGM-YDHPILPMAGDA----WQLLEIQCHSKLA--------------------- 153
IR+HE +G Y+H I+ + DA +I ++K
Sbjct: 719 IRIHEANGTPYEH-IVEIREDAAFTGGAKFDIPYNTKYKRLKRNRRHRERAAAAAAADAN 777
Query: 154 ------------ARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYL 201
R P+ +K D D +D S WIR D + E+L
Sbjct: 778 GEHSEEMIVFSLGDRLDSPEDMAKWDLIDWTEQDEKNMDQES-----YEWIRIDSDFEWL 832
Query: 202 AEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVR 261
++H N P M+++QL++D DV AQ + L + + L L D + F +R
Sbjct: 833 VKMHTNMPSWMYLSQLQQDRDVAAQQDTLLYLSRIREHAM-AATILTRTLYDERYFHGLR 891
Query: 262 IEAAYALANTASEETD------------------------------------WAGLLHLV 285
AA L+ A + +D + G+ L+
Sbjct: 892 TMAASVLSRQAWKASDSHADADGAENGGQETEGSRNGGAGGNAGSSGGDTVVFRGMTQLM 951
Query: 286 KFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKY 345
+ ++ F RPNDF D Y V A+ A++ + D+K PREA F+L LL++
Sbjct: 952 RVFR-EFFCYPDATSRPNDFSDHQRYLVQCAMIEAISKAQ-YDDKHPREAQRFLLDLLRF 1009
Query: 346 NDNNGNPYSDVFWLAALV 363
NDN+ N +SD ++A L+
Sbjct: 1010 NDNSQNSFSDYIYVAKLL 1027
>gi|85014131|ref|XP_955561.1| hypothetical protein ECU09_0090 [Encephalitozoon cuniculi GB-M1]
gi|19171255|emb|CAD26980.1| TRANSCRIPTION INITIATION FACTOR TFIID 150kDa SUBUNIT
[Encephalitozoon cuniculi GB-M1]
Length = 1137
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/378 (24%), Positives = 166/378 (43%), Gaps = 68/378 (17%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
+LE + F KIL + R + F V ++ L+ F +V
Sbjct: 395 LLEGNLSRAFMEKILNYTMKRRNDVKEDYSF------EFVRLVKDVTGKDLRSLFETYVF 448
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
G +++GF+ NK+ N V+ ++ + T + S +G+ G ++I
Sbjct: 449 KPGVVRVKLGFTINKKSNRVDFTAMQ-------APTSIFS--------NGNRTVCGPVTI 493
Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
+E++G+++H + + + H K ++
Sbjct: 494 CSYEMEGVFEHVFMLDRDNHFYY-----HPKTKKKKKS---------------------- 526
Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
PL W+RADP+ E+LA+I QP M+I QL D +VV Q +A+ +L P S
Sbjct: 527 EEEEEVMPLLWVRADPKGEHLAKIIVEQPDYMFIEQL-LDKNVVGQMEALESLSTKP--S 583
Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
V L L ++ F+++RI Y L+ T ++ G L++F+ + + +
Sbjct: 584 TQVCEILERVLENTHIFYKIRIHILYILSRTTV--GNYYGFQRLIQFFVKKYCVQTSTVV 641
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAAD-----NKSPREA------VEFVLQLLKYNDNN 349
+PNDF F YF+ + H V + AD + S RE F++ +LK+NDN+
Sbjct: 642 KPNDF-SFIPYFIQK---HLVRALSIADPFVFRSYSGREVGSANIISAFIINILKFNDNS 697
Query: 350 GNPYSDVFWLAALVQSVG 367
N YSD ++++++++S+
Sbjct: 698 FNSYSDGWYISSVIESIS 715
>gi|402468467|gb|EJW03623.1| hypothetical protein EDEG_02054 [Edhazardia aedis USNM 41457]
Length = 2793
Score = 100 bits (248), Expect = 4e-18, Method: Composition-based stats.
Identities = 84/307 (27%), Positives = 143/307 (46%), Gaps = 33/307 (10%)
Query: 187 SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNT 246
+PL +IR D E + +++ QP M+I QL D +V++Q +AI L P S NV++
Sbjct: 1136 NPLLFIRIDAGNENIVKVNLQQPDFMYIEQLNTDKNVISQIEAIQVLAKNP--SENVLHI 1193
Query: 247 LNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF-DENIGLPRPNDF 305
L+DS+ F+RVRI L+ E+ + H + + RRF +P+PNDF
Sbjct: 1194 FEKVLNDSRVFYRVRILLIQTLSKMLFEDK---CIYHQIIHFFIRRFCIHGSTIPKPNDF 1250
Query: 306 RDFSEYFVLEAI--------PHAVAMVRAADNKSPREAVEFVLQLLKYNDNNG------- 350
FS YF+ +A P V + A D K+ F L L+KYN +
Sbjct: 1251 TSFSSYFIQKATVKSLAESDPEIVRNINAKDLKTKNIVSAFFLNLIKYNSVDEFNYMEPN 1310
Query: 351 ----NPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTIS 406
N + F++A +++++ ++ +S + + +K+++ D L PS+ I+TIS
Sbjct: 1311 KFTINSVGNSFYIADIIKAL-SIQLSSKSSSDIQAYIKQLEICRLKDLLFPSFQNIVTIS 1369
Query: 407 CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALS 466
+ +++ + +D +VK P VR A AL+ H ID+ +
Sbjct: 1370 VLNAYSRLCFYGHLEVDVDLLVKYCSP----KHFVGVREAALEALIVNYIH---IDNVFA 1422
Query: 467 LFIKSVE 473
+K E
Sbjct: 1423 FIVKFYE 1429
>gi|401828341|ref|XP_003887884.1| hypothetical protein EHEL_090070 [Encephalitozoon hellem ATCC
50504]
gi|392998892|gb|AFM98903.1| hypothetical protein EHEL_090070 [Encephalitozoon hellem ATCC
50504]
Length = 1129
Score = 99.0 bits (245), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 164/378 (43%), Gaps = 68/378 (17%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
+LE + F KIL I + S VR + EF V + L+ F +V
Sbjct: 395 LLEGNLSRAFMEKILNYTIKKK---SDVRENYSFEFVKLVKDVTGKD---LRSLFETYVF 448
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
G +R+GF+ NK+ N V+ S+ P S +G+ G ++I
Sbjct: 449 RPGVVKVRLGFTINKKNNRVDFTA---------SQIPT------SIFSNGNKTISGPVTI 493
Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
+E++G+++H + L+ + H PK
Sbjct: 494 CSYEMEGIFEHVFM---------LDRENHF------YYHPKT------------KKKKKS 526
Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
PL WIRAD + E+LA+I QP M+I QL D +++ Q +A+ L P S
Sbjct: 527 EEEEEVMPLLWIRADTKGEHLAKIVIEQPDYMFIEQL-LDKNIIGQMEALENLSIKP--S 583
Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
+V L ++ F+++RI Y L+ T + G L++++ + + +
Sbjct: 584 THVCEIFERMLENTHIFYKIRIHILYILSRTIV--GSYYGFQRLIQYFVKKYCVQTSTIV 641
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAAD-----NKSPREA------VEFVLQLLKYNDNN 349
+PNDF F YF+ + H V + AD N S RE F++ +LK+NDN+
Sbjct: 642 KPNDF-SFIPYFIQK---HLVKALSYADPFVFKNYSGREVGSASIISAFIINILKFNDNS 697
Query: 350 GNPYSDVFWLAALVQSVG 367
N YSD ++++++++S+
Sbjct: 698 FNSYSDGWYISSVIESLS 715
>gi|58269688|ref|XP_572000.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57228236|gb|AAW44693.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1711
Score = 99.0 bits (245), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 74/237 (31%), Positives = 120/237 (50%), Gaps = 42/237 (17%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR DP+ E+LA F + WI+QL+ D DV+AQ +AI + P S + + L
Sbjct: 693 WIRVDPDCEWLAIFEFAEKPWCWISQLQGDRDVIAQLEAIQNMRMYP--SPVIASELART 750
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG----------LP 300
+ F+RVR+EAA ALA S E D+ G ++K +++ + G P
Sbjct: 751 VLVKNYFYRVRMEAARALAMYNSSECDYIGYFLIMKLFQAFYCQPSSGNETEPPEIQCRP 810
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
PNDF +F++YFV +++ A+A +R ++ + ++ YSD F+LA
Sbjct: 811 LPNDFSNFADYFVKKSLIAAMADLRDPATRA----------VWRH-------YSDSFYLA 853
Query: 361 ALVQ------SVGELEFGQQSILFLSS--LLKR-----IDRLLQFDRLMPSYNGILT 404
A++ +VG G S + S LL++ +DR + DRL+PSY+ ++T
Sbjct: 854 AVITAISNAFTVGSAAQGMLSETEIESEQLLQKEATDVLDRAMTVDRLVPSYHNVVT 910
Score = 39.7 bits (91), Expect = 6.4, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 18/113 (15%)
Query: 27 PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR 86
P TL+T F KV L+ L+ F +W+ GCP L + ++ K+K VE V +
Sbjct: 491 PGNTLTTSYFLRQCRKVSGLD---LQTFQDQWIFGSGCPRLSVRTNFIKKKFTVEFMVSQ 547
Query: 87 DCTVKPDSRTPVLSSNTDSENRDGDIGW-------PGMMSIRVHELDGM-YDH 131
++ V + N SE + W G +++R+HE DG ++H
Sbjct: 548 -------TQPAVEAINEMSEKQRKVAVWRRPTPFFEGSLTVRIHEADGAPFEH 593
>gi|336276159|ref|XP_003352833.1| hypothetical protein SMAC_04947 [Sordaria macrospora k-hell]
gi|380092951|emb|CCC09188.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1837
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 125/561 (22%), Positives = 233/561 (41%), Gaps = 98/561 (17%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGT 61
+ K GS ++L ++S A ++ ++ F+ F + + E + ++V
Sbjct: 545 MSKIPGSTGVVRVLSQMVSTANISNTDPKATSLSFQDFRRQCEKRSQYVPDELWDQYVFA 604
Query: 62 CGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSE-----------NRDG 110
GCP L + +NK+ V++ +L+ + PV S + E G
Sbjct: 605 AGCPKLDIHQKFNKKNLNVDIQILQ-------YQGPVKKSISKDEFWRELQEEIHNVYAG 657
Query: 111 DIG--WPGMMSIRVHELDGM-YDHPILPMAGDAWQL-LEIQCHSKLAARRALKPKK---- 162
D+ + G ++R+HE DG Y+H L D L+I ++K +R + KK
Sbjct: 658 DVPKLFTGPFTVRIHEADGTPYEHYQLITEKDKGGTNLQIAYNTKY--KRLKRTKKAMAA 715
Query: 163 --GSKPDGCDDNGDAVAGLDM-------RSSME-----------------SPLSWIRADP 196
S D + D V +M + ME WIR D
Sbjct: 716 AANSNSDKHEIQEDDVVYFNMLGDVLTSQKDMEDWGLQDWAQEVQTKMDQESYEWIRLDA 775
Query: 197 EMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP-HLSFNVVNTLNNFLSDSK 255
ME+L + + M++ QL++D DVVA A+ A + H + V T + D +
Sbjct: 776 NMEWLCFMKTDMQEYMYLAQLQQDRDVVAHQDAMLAFKREKRHPVHSTVETRT--VMDKR 833
Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP----------RPNDF 305
+ VR+ A LA A + ++ G++HL+ ++ + F + + +P PNDF
Sbjct: 834 YYHGVRVMAVEDLAKQAHPDLNYIGMVHLILCFR-KFFCKKVPIPGKKDAYNYPIAPNDF 892
Query: 306 RDFSEYFVLEAIPHAVAMVRAAD----NKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
D +Y + AIP +A R D + + A +F++++L +NDN+ N +SD F++A
Sbjct: 893 TDKGQYAIQCAIPAVLA--RTTDLQEKGRCSKRARQFIMEMLMFNDNSENEFSDQFYIAK 950
Query: 362 LVQSVGELEFGQQSIL--------------------FLSSLLKRIDRLLQFDRLMPSYNG 401
L+ ++ Q+ I F+ ++ ID+ + D +Y
Sbjct: 951 LLDALTTSLIPQKEITEGYLIKNLDPFDDDDAEFKSFIEKTIEEIDKFRRMDEWTLTYQN 1010
Query: 402 ILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
I T + + ++ + + + + ++ +D N I VR++A L++L
Sbjct: 1011 IWTTTALHCKMRL---MKARVIPESALDFVQYLQDDN-IDLVRIKAFECLVELGMISKAP 1066
Query: 462 DSALSLFIKSVEEEPSLRGQV 482
L L S + P +R ++
Sbjct: 1067 ILKLLLSYMSTDSSPFVRDRL 1087
>gi|303390535|ref|XP_003073498.1| hypothetical protein Eint_090080 [Encephalitozoon intestinalis ATCC
50506]
gi|303302645|gb|ADM12138.1| hypothetical protein Eint_090080 [Encephalitozoon intestinalis ATCC
50506]
Length = 1134
Score = 97.8 bits (242), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 162/378 (42%), Gaps = 68/378 (17%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
+LE + F KIL I R V+ + EF V + FL F +V
Sbjct: 395 LLEGNLSRAFMEKILNYTIKRK---GDVKEDYSFEFVKLVKDVTGKDLRFL---FETYVF 448
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
G +R+GF+ NK+ N V+ +TP S +G+ G ++I
Sbjct: 449 KPGVVKVRLGFTINKKSNRVDFTA---------EQTPT------SIFSNGNKTICGPVTI 493
Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
+E++G+++H + L+ + H PK
Sbjct: 494 CSYEMEGIFEHVFM---------LDRENHF------YYHPKT------------KKKKKS 526
Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
PL WIRAD + E+LA I QP M+I QL D +VV Q +A+ L P S
Sbjct: 527 EEEEEVMPLLWIRADIKGEHLARIVIEQPDYMFIEQL-LDKNVVGQMEALENLSIKP--S 583
Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
V L L ++ F+++R+ Y L+ T + G L++++ + + +
Sbjct: 584 VQVCEILERMLENTHIFYKIRVHILYILSRTTI--GSYYGFQRLIQYFVKKYCVQTSTIV 641
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAAD-----NKSPREA------VEFVLQLLKYNDNN 349
+PNDF F YF+ + H V + AD N S RE F++ +LK+NDN+
Sbjct: 642 KPNDF-SFIPYFIQK---HLVRALSLADPFIFRNYSGREVGSASIISAFIINILKFNDNS 697
Query: 350 GNPYSDVFWLAALVQSVG 367
N YSD ++++++++S+
Sbjct: 698 FNSYSDGWYISSVIESLS 715
>gi|350297324|gb|EGZ78301.1| hypothetical protein NEUTE2DRAFT_80725 [Neurospora tetrasperma FGSC
2509]
Length = 1870
Score = 96.7 bits (239), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 123/529 (23%), Positives = 226/529 (42%), Gaps = 90/529 (17%)
Query: 2 LEKQMGSNFFRKILQNIISRA--QGASP-VRTLSTKEFRHFANKVGNLERPFLKEFFPRW 58
+ K GS ++L ++S A P TLS ++FR + K E + ++
Sbjct: 545 MSKIPGSTGVVRVLSQMVSAANISNTDPKATTLSFQDFRRWCEKRSQY---VPDELWDQY 601
Query: 59 VGTCGCPVLRMGFSYNKRKNIVELAVLR--DCTVKPDSRTPVLSSNTDSENR--DGDIG- 113
V GCP L + +NK+ V++ +L+ T K S+ + ++ GD+
Sbjct: 602 VFGAGCPRLDIHQKFNKKNLNVDIQILQYQGPTKKSISKDEFWRELQEEIHQVYAGDVPK 661
Query: 114 -WPGMMSIRVHELDGM-YDHPILPMAGDAWQL-LEIQCHSKLAARRALKPKK------GS 164
+ G ++R+HE DG Y+H L D L+I ++K +R + KK S
Sbjct: 662 LFTGPFTVRIHEADGTPYEHYQLITEKDKGGTNLQIAYNTKY--KRLKRTKKAMAAAANS 719
Query: 165 KPDGCDDNGDAVAGLDM-------RSSME-----------------SPLSWIRADPEMEY 200
D + D V +M + ME WIR D ME+
Sbjct: 720 NSDKHEIQEDDVVYFNMLGDVLTSQKDMEDWGLQDWASEVQTKMDQESYEWIRLDANMEW 779
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP-HLSFNVVNTLNNFLSDSKAFWR 259
L + + M++ QL++D DVVA A+ A + H + + T + D + +
Sbjct: 780 LCFMKTDMQEYMYLAQLQQDRDVVAHQDAMLAFKREKRHAVHSTIETRT--VMDRRYYHG 837
Query: 260 VRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP----------RPNDFRDFS 309
VR+ A LA A + ++ G++HL+ ++ + F + + +P PNDF D
Sbjct: 838 VRVMALEDLAKQAHPDLNYIGMVHLILCFR-KFFCKKVPIPGREGGYHYPIAPNDFTDKG 896
Query: 310 EYFVLEAIPHAVAMVRAAD----NKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
+Y + AIP +A R D + + A +F++++L +NDN+ N +SD F++A L+ +
Sbjct: 897 QYAIQCAIPAVLA--RTTDLQEKGRCSKRARQFIMEMLLFNDNSENEFSDQFYIAKLLDA 954
Query: 366 VGELEFGQQSIL--------------------FLSSLLKRIDRLLQFDRLMPSYNGILTI 405
+ Q+ I F+ ++ ID+ + D +Y I T
Sbjct: 955 LTTSVIPQKEITEGYLIKNLDPFDEDDAEFKSFIEKTIEEIDKFRRMDEWTLTYQNIWTT 1014
Query: 406 SCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+ + ++ + + + + ++ +D N I VR++A L++L
Sbjct: 1015 TALHCKMRL---MKARVIPESALDFVQYLQDDN-IDLVRIKAFECLVEL 1059
>gi|336463459|gb|EGO51699.1| hypothetical protein NEUTE1DRAFT_70654 [Neurospora tetrasperma FGSC
2508]
Length = 1870
Score = 96.3 bits (238), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 123/529 (23%), Positives = 226/529 (42%), Gaps = 90/529 (17%)
Query: 2 LEKQMGSNFFRKILQNIISRA--QGASP-VRTLSTKEFRHFANKVGNLERPFLKEFFPRW 58
+ K GS ++L ++S A P TLS ++FR + K E + ++
Sbjct: 545 MSKIPGSTGVVRVLSQMVSAANISNTDPKATTLSFQDFRRWCEKRSQY---VPDELWDQY 601
Query: 59 VGTCGCPVLRMGFSYNKRKNIVELAVLR--DCTVKPDSRTPVLSSNTDSENR--DGDIG- 113
V GCP L + +NK+ V++ +L+ T K S+ + ++ GD+
Sbjct: 602 VFGAGCPRLDIHQKFNKKNLNVDIQILQYQGPTKKSISKDEFWRELQEEIHQVYAGDVPK 661
Query: 114 -WPGMMSIRVHELDGM-YDHPILPMAGDAWQL-LEIQCHSKLAARRALKPKK------GS 164
+ G ++R+HE DG Y+H L D L+I ++K +R + KK S
Sbjct: 662 LFTGPFTVRIHEADGTPYEHYQLITEKDKGGTNLQIAYNTKY--KRLKRTKKAMAAAANS 719
Query: 165 KPDGCDDNGDAVAGLDM-------RSSME-----------------SPLSWIRADPEMEY 200
D + D V +M + ME WIR D ME+
Sbjct: 720 NSDKHEIQEDDVVYFNMLGDVLTSQKDMEDWGLQDWASEVQTKMDQESYEWIRLDANMEW 779
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP-HLSFNVVNTLNNFLSDSKAFWR 259
L + + M++ QL++D DVVA A+ A + H + + T + D + +
Sbjct: 780 LCFMKTDMQEYMYLAQLQQDRDVVAHQDAMLAFKREKRHAVHSTIETRT--VMDRRYYHG 837
Query: 260 VRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP----------RPNDFRDFS 309
VR+ A LA A + ++ G++HL+ ++ + F + + +P PNDF D
Sbjct: 838 VRVMALEDLAKQAHPDLNYIGMVHLILCFR-KFFCKKVPIPGREGGYHYPIAPNDFTDKG 896
Query: 310 EYFVLEAIPHAVAMVRAAD----NKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
+Y + AIP +A R D + + A +F++++L +NDN+ N +SD F++A L+ +
Sbjct: 897 QYAIQCAIPAVLA--RTTDLQEKGRCSKRARQFIMEMLLFNDNSENEFSDQFYIAKLLDA 954
Query: 366 VGELEFGQQSIL--------------------FLSSLLKRIDRLLQFDRLMPSYNGILTI 405
+ Q+ I F+ ++ ID+ + D +Y I T
Sbjct: 955 LTTSVIPQKEITEGYLIKNLDPFDEDDAEFKSFIEKTIEEIDKFRRMDEWTLTYQNIWTT 1014
Query: 406 SCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+ + ++ + + + + ++ +D N I VR++A L++L
Sbjct: 1015 TALHCKMRL---MKARVIPESALDFVQYLQDDN-IDLVRIKAFECLVEL 1059
>gi|159471369|ref|XP_001693829.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283332|gb|EDP09083.1| predicted protein [Chlamydomonas reinhardtii]
Length = 758
Score = 96.3 bits (238), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 73/235 (31%), Positives = 114/235 (48%), Gaps = 18/235 (7%)
Query: 188 PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
P+ ++R DP E+L E+ Q MW QL DV+ + A S +V+ L
Sbjct: 445 PVQYLRVDPLQEWLCEVVVLQGEAMWAAQLRSSKDVLESSGAAD--------SSSVIAAL 496
Query: 248 NNFLSDSKAFWRVRIEA-AYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFR 306
+ + + RVR+EA A +AS T A L FY R D + RPN +
Sbjct: 497 LEAVRGERLYCRVRVEAARALAAFSASMSTQPAAPSLLSYFYDCCR-DPATAVLRPNYWS 555
Query: 307 DFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
D ++ VL+AIP +A +R S EA++ V +L NDN+ N Y D +L+A++++
Sbjct: 556 DLGQHLVLQAIPACLASIRDVSFNSTPEALQLVTDVLSNNDNSENLYDDASYLSAVIEAA 615
Query: 367 GEL---EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
G L GQ L LL +D + + L PS+N + +C+ +T +AL +
Sbjct: 616 GLLRPPHAGQ-----LCRLLGLLDDYMLRECLQPSHNYAVGCACLAAMTSLALSV 665
>gi|85111588|ref|XP_964008.1| hypothetical protein NCU02052 [Neurospora crassa OR74A]
gi|28925766|gb|EAA34772.1| predicted protein [Neurospora crassa OR74A]
gi|38566902|emb|CAE76206.1| related to TSM1-component of TFIID complex [Neurospora crassa]
Length = 1870
Score = 95.9 bits (237), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 123/529 (23%), Positives = 226/529 (42%), Gaps = 90/529 (17%)
Query: 2 LEKQMGSNFFRKILQNIISRA--QGASP-VRTLSTKEFRHFANKVGNLERPFLKEFFPRW 58
+ K GS ++L ++S A P TLS ++FR + K E + ++
Sbjct: 545 MSKIPGSTGVVRVLSQMVSAANISNTDPKATTLSFQDFRRWCEKRSQY---VPDELWNQY 601
Query: 59 VGTCGCPVLRMGFSYNKRKNIVELAVLR--DCTVKPDSRTPVLSSNTDSENR--DGDIG- 113
V GCP L + +NK+ V++ +L+ T K S+ + ++ GD+
Sbjct: 602 VFGAGCPRLDIHQKFNKKNLNVDIQILQYQGPTKKSISKDEFWRELQEEIHQVYAGDVPK 661
Query: 114 -WPGMMSIRVHELDGM-YDHPILPMAGDAWQL-LEIQCHSKLAARRALKPKK------GS 164
+ G ++R+HE DG Y+H L D L+I ++K +R + KK S
Sbjct: 662 LFTGPFTVRIHEADGTPYEHYQLITEKDKGGTNLQIAYNTKY--KRLKRTKKAMAAAANS 719
Query: 165 KPDGCDDNGDAVAGLDM-------RSSME-----------------SPLSWIRADPEMEY 200
D + D V +M + ME WIR D ME+
Sbjct: 720 NSDKHEIQEDDVVYFNMLGDVLTSQKDMEDWGLQDWASEVQTKMDQESYEWIRLDANMEW 779
Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP-HLSFNVVNTLNNFLSDSKAFWR 259
L + + M++ QL++D DVVA A+ A + H + + T + D + +
Sbjct: 780 LCFMKTDMQEYMYLAQLQQDRDVVAHQDAMLAFKREKRHAVHSTIETRT--VMDRRYYHG 837
Query: 260 VRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP----------RPNDFRDFS 309
VR+ A LA A + ++ G++HL+ ++ + F + + +P PNDF D
Sbjct: 838 VRVMALEDLAKQAHPDLNYIGMVHLILCFR-KFFCKKVPIPGREGGYHYPIAPNDFTDKG 896
Query: 310 EYFVLEAIPHAVAMVRAAD----NKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
+Y + AIP +A R D + + A +F++++L +NDN+ N +SD F++A L+ +
Sbjct: 897 QYAIQCAIPAVLA--RTTDLQEKGRCSKRARQFIMEMLLFNDNSENEFSDQFYIAKLLDA 954
Query: 366 VGELEFGQQSIL--------------------FLSSLLKRIDRLLQFDRLMPSYNGILTI 405
+ Q+ I F+ ++ ID+ + D +Y I T
Sbjct: 955 LTTSVIPQKEITEGYLIKNLDPFDEDDAEFKSFIEKTIEEIDKFRRMDEWTLTYQNIWTT 1014
Query: 406 SCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+ + ++ + + + + ++ +D N I VR++A L++L
Sbjct: 1015 TALHCKMRL---MKARVIPESALDFVQYLQDDN-IDLVRIKAFECLVEL 1059
>gi|396082012|gb|AFN83625.1| hypothetical protein EROM_090070 [Encephalitozoon romaleae SJ-2008]
Length = 1131
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 160/375 (42%), Gaps = 68/375 (18%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
+LE + F KIL I + + F V ++ L+ F +V
Sbjct: 395 LLEGNLSRAFMEKILNYTIKKKGDVKENYSF------EFVKLVKDVTGKDLRSLFETYVF 448
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
G +R+GF+ NK+ N V+ D + SN G+ G ++I
Sbjct: 449 RPGVVKVRLGFTINKKSNRVDFT--------SDQIPTSIFSN-------GNKVISGPVTI 493
Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
+E++G+++H + L+ + H PK
Sbjct: 494 CSYEMEGIFEHVFM---------LDRENHF------YYHPKT------------KKKKKS 526
Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
PL WIRAD + E+LA+I QP M+I QL D +++ Q +A+ L P S
Sbjct: 527 EEEEEVMPLLWIRADTKGEHLAKIVVEQPDYMFIEQL-LDKNIIGQMEALENLSIKP--S 583
Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
V L L ++ F+++RI Y L+ T ++ G L++++ + + +
Sbjct: 584 TQVCEILERMLENTHIFYKIRIHILYILSRTIV--GNYYGFQRLIQYFVKKYCVQTSTIV 641
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAAD-----NKSPREA------VEFVLQLLKYNDNN 349
+PNDF F YF+ + H V + AD + S RE F++ +LK+NDN+
Sbjct: 642 KPNDF-SFIPYFIQK---HLVKALSYADPFVFKSYSGREVGSASIISAFIINILKFNDNS 697
Query: 350 GNPYSDVFWLAALVQ 364
NPYSD ++++++++
Sbjct: 698 FNPYSDGWYISSVIE 712
>gi|339248861|ref|XP_003373418.1| transcription initiation factor TFIID subunit 2 [Trichinella
spiralis]
gi|316970443|gb|EFV54377.1| transcription initiation factor TFIID subunit 2 [Trichinella
spiralis]
Length = 1125
Score = 94.4 bits (233), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/337 (27%), Positives = 154/337 (45%), Gaps = 45/337 (13%)
Query: 145 EIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAE 203
E CHSK ARR K K P D D +D+ + ESP+ WI DP + + +
Sbjct: 474 EFSCHSK--ARRHRKRKI---PIWTGDEID----MDLSAMDSESPVLWILVDPRLNTIRK 524
Query: 204 IHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL-SFNVVNTLNNFLSDSKAFWRVRI 262
++ +QP W QL + +VAQ + AL+ LP + + + L+ + S+ ++RVR
Sbjct: 525 VNIDQPDYSWHYQLRYERIIVAQ---LDALKGLPKMRTVHTQAALSEIIESSQCYYRVRC 581
Query: 263 EAAYALANTASEETDWAG-LLHLVKFYKSRRFDENI-GLPRPNDFRDFSEYFVLEAIPHA 320
A +ALA+ +S T LV+ ++ +N L +PN+F +F Y++ +A+ A
Sbjct: 582 AACFALASMSSANTAVCNPHAVLVQLFRKLFCSQNYPTLAKPNNFTNFQLYYLKKALLLA 641
Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL----------- 369
++ ++ + + ++FV L YN N+GN YSD + A +++++G +
Sbjct: 642 ISQLKTVHDTISTDVLQFVYHLSYYNTNSGNRYSDDAYRATIIKAIGNMVQPFAVLNSDA 701
Query: 370 ----------EFGQQSIL----FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIA 415
+F Q L L K I L D PSY +T++ + +L +
Sbjct: 702 MEFSIRHFVDQFAQHDAYRARHLLMKLFKEIIISLALDAKFPSYRDEITVAALHSLAE-- 759
Query: 416 LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALL 452
+ I D +L + DFN VR A A L
Sbjct: 760 FQYWKLIDCDN--RLWLSYLDFNRPPIVRRAAFEAFL 794
>gi|407040812|gb|EKE40341.1| hypothetical protein ENU1_093580 [Entamoeba nuttalli P19]
Length = 946
Score = 93.2 bits (230), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 116/487 (23%), Positives = 208/487 (42%), Gaps = 91/487 (18%)
Query: 37 RHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRT 96
R F ++ +K+F R V G + + F++ +R T++ D +
Sbjct: 450 RQFVKELQTQNSSLMKQFEERLVNGTGYSKIAVSFTFTQR------------TIQTDFKI 497
Query: 97 PVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARR 156
S G ++ +R+HE++ + I + G+ I CHS++ +R
Sbjct: 498 ------HQSRKEKG----YNVVPVRIHEVEQTIEDTI-ELTGEETSK-SIPCHSRIRKQR 545
Query: 157 ALKPKKGSKPDGCDDNGDAVAGLDMRSSMES-------PLSWIRADPEMEYLAEIHFN-Q 208
+P D + LD R S P+ +I DP+ +Y I + Q
Sbjct: 546 -------KRPIKTDTGETIMYDLDSRDCKRSGQGETDVPILYIEIDPDNDYPRRIKRDKQ 598
Query: 209 PVQ--MWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL-SDSKAFWRVRIEAA 265
Q MW+ QLE V Q I LE S +N L +FL ++ + F++V+I AA
Sbjct: 599 SFQEYMWLTQLEFVRTVRGQLVTIEGLEKCK--SEKGINALFDFLINNHRCFYKVQIAAA 656
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
+A+A + E+ + G L +FYK+ ++E++G+ P DF + Y + + + A A VR
Sbjct: 657 HAIARMSCEQNGYLGFHLLKQFYKTNYYEESVGVI-PLDFSNPLRYELQKGLIGAFADVR 715
Query: 326 A------------ADNK-----------SPREAVEFVLQLLKYNDNNGNPYSDVFWLAAL 362
D+K S EAV +L++ + + N N Y D ++ ++
Sbjct: 716 VHSLTDKINWGGLYDDKHKNRIDALSLPSYPEAVSLLLEIAENSTNERNKYQDGVFIKSV 775
Query: 363 VQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
+ ++G++ + + K + R L D+++PS I+T++ I+ +A L
Sbjct: 776 IGALGKIVSSKDED--NERITKFLKRELNKDKIIPSDRNIVTLAVIK----MAPGLCDLT 829
Query: 423 SLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQV 482
SL V ++VRV A RALL + L++F+++ E G+V
Sbjct: 830 SLCSV----------GNFYEVRVAAIRALLK-----SDPQQGLTIFVET--HERGDNGRV 872
Query: 483 KLGIHAM 489
+ I M
Sbjct: 873 RYAICEM 879
>gi|67466781|ref|XP_649532.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56465995|gb|EAL44144.1| hypothetical protein EHI_108470 [Entamoeba histolytica HM-1:IMSS]
gi|449704970|gb|EMD45118.1| Hypothetical protein EHI5A_134100 [Entamoeba histolytica KU27]
Length = 946
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 116/487 (23%), Positives = 208/487 (42%), Gaps = 91/487 (18%)
Query: 37 RHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRT 96
R F ++ +K+F R V G + + F++ +R T++ D +
Sbjct: 450 RQFVKELQTQNSSLMKQFEERLVNGTGYSKIAVSFTFTQR------------TIQTDFKI 497
Query: 97 PVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARR 156
S G ++ +R+HE++ + I + G+ I CHS++ +R
Sbjct: 498 ------HQSRKEKG----YNVVPVRIHEVEQTIEDTI-ELTGEETSK-SIPCHSRIRKQR 545
Query: 157 ALKPKKGSKPDGCDDNGDAVAGLDMRSSMES-------PLSWIRADPEMEYLAEIHFN-Q 208
+P D + LD R S P+ +I DP+ +Y I + Q
Sbjct: 546 -------KRPIKTDTGETIMYDLDSRDCKRSGQGETDVPILYIEIDPDNDYPRRIKRDKQ 598
Query: 209 PVQ--MWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL-SDSKAFWRVRIEAA 265
Q MW+ QLE V Q I LE S +N L +FL ++ + F++V+I AA
Sbjct: 599 SFQEYMWLTQLEFVRTVRGQLVTIEGLEKCK--SEKGINALFDFLINNHRCFYKVQIAAA 656
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
+A+A + E+ + G L +FYK+ ++E++G+ P DF + Y + + + A A VR
Sbjct: 657 HAIARMSCEQNGYLGFHLLKQFYKTNYYEESVGVI-PLDFSNPLRYELQKGLIGAFANVR 715
Query: 326 A------------ADNK-----------SPREAVEFVLQLLKYNDNNGNPYSDVFWLAAL 362
D+K S EAV +L++ + + N N Y D ++ ++
Sbjct: 716 IHSLTDKINWGGLYDDKHKNRIDALSLPSYPEAVSLLLEIAENSTNERNKYQDGVFIKSV 775
Query: 363 VQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
+ ++G++ + + K + R L D+++PS I+T++ I+ +A L
Sbjct: 776 IGALGKIVSSKDED--NERITKFLKRELNKDKIIPSDRNIVTLAVIK----MAPGLCDLT 829
Query: 423 SLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQV 482
SL V ++VRV A RALL + L++F+++ E G+V
Sbjct: 830 SLCSV----------GNFYEVRVAAIRALLK-----SDPQQGLTIFVET--HERGDNGRV 872
Query: 483 KLGIHAM 489
+ I M
Sbjct: 873 RYAICEM 879
>gi|401882976|gb|EJT47215.1| hypothetical protein A1Q1_04073 [Trichosporon asahii var. asahii
CBS 2479]
Length = 1779
Score = 92.8 bits (229), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 68/241 (28%), Positives = 117/241 (48%), Gaps = 38/241 (15%)
Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
WIR DP+ E+LA + WI+QLE + DVVAQ Q + + S + + L
Sbjct: 659 WIRVDPDCEWLATFDIQEKPWFWISQLEGESDVVAQLQRMCM-----YPSPVIASELAKV 713
Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS---RRFDE----NI-GLPRP 302
+ F+R S E D+ GL L+K + + DE N+ +P P
Sbjct: 714 VLVENYFYRYN-----------SAEADYMGLFLLLKLFHTFYCEPTDEEEPSNVKAVPLP 762
Query: 303 NDFRDFSEYFVLEAIPHAVAMVR-AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
NDF +F+ YF+ +++ A++ +R A + R E +L LL+ NDN+ N YSD ++A
Sbjct: 763 NDFSNFANYFLKKSMVTALSDLRDPATRRVWRHVRELLLDLLRMNDNSQNEYSDSHYVAT 822
Query: 362 LVQSVGEL---------EFGQQSILFLSSLLKRI----DRLLQFDRLMPSYNGILTISCI 408
++ ++G ++ + L K +R + DRL+PSY+ ++T++ +
Sbjct: 823 VISAIGSAFSAGVNVGGNLSEEDQVMDGQLFKEASEANERAITVDRLVPSYHNVVTLAGL 882
Query: 409 R 409
+
Sbjct: 883 Q 883
>gi|167385807|ref|XP_001737495.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165899650|gb|EDR26194.1| hypothetical protein EDI_075900 [Entamoeba dispar SAW760]
Length = 946
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 109/450 (24%), Positives = 194/450 (43%), Gaps = 84/450 (18%)
Query: 37 RHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRT 96
R F ++ +K+F R V G + + F++ +R T++ D +
Sbjct: 450 RQFVKELQTQNSSLMKQFEERLVNGTGYSKIAVSFTFTQR------------TIQTDFKI 497
Query: 97 PVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARR 156
S G ++ +R+HE++ + I + G+ I CHS++ +R
Sbjct: 498 ------HQSRKEKG----YNVVPVRIHEVEQTIEDTI-ELTGEETSK-SIPCHSRIRKQR 545
Query: 157 ALKPKKGSKPDGCDDNGDAVAGLDMRSSMES-------PLSWIRADPEMEYLAEIHFN-Q 208
+P D + LD R S P+ +I DP+ +Y I + Q
Sbjct: 546 -------KRPIKTDTGETIMYDLDSRDCKRSGQGETDVPILYIEIDPDNDYPRRIKRDKQ 598
Query: 209 PVQ--MWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL-SDSKAFWRVRIEAA 265
Q MW+ QLE V Q I LE S +N L +FL ++ + F++V+I AA
Sbjct: 599 SFQEYMWLTQLEFVRTVRGQLVTIEGLEKCK--SEKGINALFDFLINNHRCFYKVQIAAA 656
Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
+A+A + E+ + G L +FYK+ ++E++G+ P DF + Y + + + A A VR
Sbjct: 657 HAIARMSCEQNGYLGFHLLKQFYKTNYYEESVGVI-PLDFSNPLRYELQKGLIGAFADVR 715
Query: 326 A------------ADNK-----------SPREAVEFVLQLLKYNDNNGNPYSDVFWLAAL 362
D+K S EAV +L++ + + N N Y D ++ ++
Sbjct: 716 VHNLTDKINWGGLYDDKHKNRIDALSLPSYPEAVSLLLEIAENSTNERNKYQDGVFIKSV 775
Query: 363 VQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
+ ++G++ ++ + K + R L D+++PS I+T++ I+ +A L
Sbjct: 776 IGALGKIVSSKEED--NERITKFLKRELNKDKIIPSDRNIVTLAVIK----MAPGLCDLS 829
Query: 423 SLDQVVKLIKPFRDFNTIWQVRVEASRALL 452
SL V ++VRV A RALL
Sbjct: 830 SLCSV----------GNFYEVRVAAIRALL 849
>gi|440298463|gb|ELP91099.1| hypothetical protein EIN_268550 [Entamoeba invadens IP1]
Length = 936
Score = 90.5 bits (223), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 98/423 (23%), Positives = 184/423 (43%), Gaps = 77/423 (18%)
Query: 29 RTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDC 88
R L T R F ++ +K+F R V G + + FS+ +R
Sbjct: 444 RNLKT---RSFVKELQTQNSSLMKQFEERIVDGTGYSKIAVSFSFTQR------------ 488
Query: 89 TVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQC 148
++ ++ + G+ ++ +R+HE++ + I + G L I C
Sbjct: 489 ---------IVQTDFKIRQSKKEKGY-NVVPVRIHEVEQTIEDSI-ELTG-VETLKSIPC 536
Query: 149 HSKLAA--RRALKPKKGSK------PDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEY 200
HS++ +R +K G C NG +S + + +I DP+ +Y
Sbjct: 537 HSRIRKQRKRTIKTDTGETIMYDLDSRDCKRNG--------QSETDVAILYIEIDPDNDY 588
Query: 201 LAEIHFNQPV---QMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL-SDSKA 256
+I N+ MW+ QLE V Q I LE S + L +FL ++ +
Sbjct: 589 PRKIKRNKQSFLEYMWLTQLEFVRTVRGQLVTIEGLEKCN--SEKGIQALYDFLINNHRC 646
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
F++V+IEAA+A+A + EE + G L FYK+ ++++IG+ P DF + +Y + +
Sbjct: 647 FYKVQIEAAHAIARMSCEENQYLGFNLLKTFYKTNYYEDSIGVV-PLDFSNALKYEIQKG 705
Query: 317 IPHAVAMVR---------------AADNKSPR--------EAVEFVLQLLKYNDNNGNPY 353
+ A A VR + PR EAV+ +L++ + + N N Y
Sbjct: 706 LIGAFADVRLHSINEKINWGRVYKKKQTERPRALSKVNHPEAVQLLLEIAENSTNEKNSY 765
Query: 354 SDVFWLAALVQSVGELEF-GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLT 412
D +L +V+++G++ ++ + + + L+R+ L D+++ S I+T + + +
Sbjct: 766 QDGIFLKEVVKALGKINSPNKEENVRIETFLRRV---LNKDKIIASDRNIVTQAVVLSTP 822
Query: 413 QIA 415
Q A
Sbjct: 823 QFA 825
>gi|300707253|ref|XP_002995843.1| hypothetical protein NCER_101163 [Nosema ceranae BRL01]
gi|239605067|gb|EEQ82172.1| hypothetical protein NCER_101163 [Nosema ceranae BRL01]
Length = 1734
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 73/326 (22%), Positives = 149/326 (45%), Gaps = 55/326 (16%)
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+FF +V G + +S + +KN V+ + + +P S+ + + NT +
Sbjct: 432 MKDFFEFYVFKPGLLKVNFRYSIDTKKNRVDFKIEQ----RPTSK--IFNHNTKA----- 480
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
G +SI +E++G +DH I D H R
Sbjct: 481 ----IGNISIASYEVEGCFDHVINMQNSD---------HFYYHVRTK------------- 514
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
++ PL W+R DP+ E+L + QP M+I Q+ D +V+ Q +A+
Sbjct: 515 ----KRKKIEEEEDEIMPLLWMRMDPKREHLIDGFIEQPDYMFIEQV-LDKNVIGQLEAL 569
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS 290
L ++S + L L S F+++RI+ Y L+ + ++ G L++F+
Sbjct: 570 DQLS--KNISIQICEVLERVLDSSHVFYKIRIKVMYILSKINID--NFIGFQRLIQFFTK 625
Query: 291 RRFDENIGLPRPNDFRDFSEYFVLEAIPHAVA-----MVRAADNK---SPREAVEFVLQL 342
+ F ++ + +PN+F F YF+ + A++ +++ + + S F++ +
Sbjct: 626 KYFVQSSTIVKPNEFL-FVNYFLQKHCVKALSYTNPSIIKGYNGRNVASFSVICSFIINI 684
Query: 343 LKYNDNNGNPYSDVFWLAALVQSVGE 368
LK+NDN+ N Y+D +++A++++ + +
Sbjct: 685 LKFNDNSANLYNDSWYIASVIECLSK 710
>gi|193786061|dbj|BAG50950.1| unnamed protein product [Homo sapiens]
Length = 504
Score = 87.8 bits (216), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 102/206 (49%), Gaps = 19/206 (9%)
Query: 262 IEAAYALANTA-SEETDWAGLLHLVKFYKSRRF--DENIGLPRPNDFRDFSEYFVLEAIP 318
+ A + LA A S + W G + + +R F + + N+F+ F YF+ + P
Sbjct: 1 MSACFCLAKIANSMVSTWTGPPAMKSLF-TRMFCCKSCPNIVKTNNFKSFQSYFLQKTTP 59
Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA----ALVQSVGELEFGQQ 374
A+A++R N P+E + F+L L+KYNDN N +SD ++ A AL SV
Sbjct: 60 VAMALLRDVHNLCPKEVLTFILDLIKYNDNRKNKFSDNYYRAEMIDALANSVTPAVSVNN 119
Query: 375 SILFLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQV 427
+ L +L L+ I R L ++L+PSY +T+SC+R + L+ +G + D
Sbjct: 120 EVRTLDNLNPDVRLILEEITRFLNMEKLLPSYRHTITVSCLRAIR--VLQKNGHVPSDPA 177
Query: 428 VKLIKPFRDFNTIWQVRVEASRALLD 453
L K + ++ +R+ A A++D
Sbjct: 178 --LFKSYAEYGHFVDIRIAALEAVVD 201
>gi|380496248|emb|CCF31831.1| hypothetical protein CH063_16140, partial [Colletotrichum
higginsianum]
Length = 191
Score = 86.3 bits (212), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 83/168 (49%), Gaps = 18/168 (10%)
Query: 264 AAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAM 323
AA L + A+E+ AG+ HL+K + + PRPNDF D +Y V A+P A+A
Sbjct: 2 AADMLYHHATEKVAMAGMTHLLKAFTELFCYPDSATPRPNDFTDKKQYLVQSALPMAIAK 61
Query: 324 VRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---------------E 368
+R A+ + P A +L +++N+N NPYSD F++A L+++V +
Sbjct: 62 IRDANGRCPEPARRLLLNQVQFNNNANNPYSDHFYVAKLLEAVAHSLIPEKRRDAGDSMD 121
Query: 369 LEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLTQ 413
L+ + FL + IDR + D SY I T + C R L +
Sbjct: 122 LDTSIEERSFLGEAIVEIDRFRRMDEWANSYQNIWTTTALECRRKLMK 169
>gi|171687459|ref|XP_001908670.1| hypothetical protein [Podospora anserina S mat+]
gi|170943691|emb|CAP69343.1| unnamed protein product [Podospora anserina S mat+]
Length = 1618
Score = 85.9 bits (211), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 131/548 (23%), Positives = 229/548 (41%), Gaps = 88/548 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
+ K GS +++ I+S A + T LS + FR K+ RP E + +W+
Sbjct: 550 MSKMTGSTGIVRVISKIVSTANIDTRNETTWLSAEPFRTACEKMSTY-RP--DELWAQWI 606
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCT-------VKPDSRTPVLSSNTDSENRDGDI 112
+ GCP L + +NK+ V++ V + + K D ++ D
Sbjct: 607 HSAGCPKLDIKQKFNKKNLNVDITVEQRQSKDTAKEITKEDFYRELVELRHDVWAGSVPK 666
Query: 113 GWPGMMSIRVHELDGM-YDHPILPMAGDA----WQLLEIQCHSKLAARR----------- 156
+ G S+R+ E DG Y+H + D WQ+ + ++ +
Sbjct: 667 HFTGPFSVRIQEADGTPYEHYMAISDKDKVSTNWQIAYNTKYKRMKRTKKATAAAAANSA 726
Query: 157 ------------------ALKPKKGSKPDGCDDNGDAV-AGLDMRSSMESPLSWIRADPE 197
L+ +K + G D A A +D S WIR D
Sbjct: 727 ADKDLQEDDVVYYNALGDVLQTEKDVQDWGLQDWSQATQAQMDQES-----YEWIRLDCN 781
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLS-DSKA 256
E+L + + P M + QL++D DVVAQ ++ L S VV+T+ + D +
Sbjct: 782 FEWLCSMKTDMPPYMIVAQLQQDRDVVAQQDSMLYLRRASRPS-GVVSTIETRTAMDRRY 840
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL-------PRPNDFRDFS 309
++ +R A L A E ++ G HL+ ++ D +G P PNDF D +
Sbjct: 841 YYGIRCMAIEDLPKMADPELNYIGKAHLILIFRHFFCDRMVGKNGAVTFPPMPNDFSDKA 900
Query: 310 EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS---- 365
+Y V A+ A+A R + + ++A F+L LL +NDN+ N YSD ++A L+Q+
Sbjct: 901 QYHVQCAMVSAIAKTR-ENGRCSKDARNFLLDLLLFNDNSENDYSDHPYVACLLQALTTS 959
Query: 366 -VGELEFGQQSIL---------------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
+ + G + +L F+ ++ ID+ + D SY I T + +
Sbjct: 960 LIPKKVDGDRDLLSSLNVHDEADAEFKRFIEKTIEEIDKYRRMDEWTSSYQNIWTTAALE 1019
Query: 410 TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFI 469
++ L + I + ++ I+ +D N I VR++A L++L + + L +
Sbjct: 1020 --CKMRLMKAKVIPV-APLEFIQYLQDDN-IDLVRIKAFECLVELGLLSK--PAIMKLLL 1073
Query: 470 KSVEEEPS 477
SV +PS
Sbjct: 1074 SSVSTDPS 1081
>gi|355759483|gb|EHH61623.1| hypothetical protein EGM_19632, partial [Macaca fascicularis]
Length = 1068
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 86/164 (52%), Gaps = 15/164 (9%)
Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
+ N+F F YF+ + +P A+A++R N P+E + F+L L+KYNDN N +SD ++ A
Sbjct: 606 KTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYNDNRKNKFSDNYYRA 665
Query: 361 ----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIR 409
AL SV + L +L L+ I R L ++L+PSY +T+SC+R
Sbjct: 666 EMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLLPSYRHTITVSCLR 725
Query: 410 TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
+ L+ +G + D L K + ++ +R+ A A++D
Sbjct: 726 AIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 765
Score = 40.0 bits (92), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 31/143 (21%), Positives = 61/143 (42%), Gaps = 32/143 (22%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 413 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 472
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T G
Sbjct: 473 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 510
Query: 111 DIGWPGMMSIRVHELDGMYDHPI 133
+ G + + V ELDG ++H +
Sbjct: 511 TQKYVGPLKVTVQELDGSFNHTL 533
>gi|432094711|gb|ELK26191.1| Transcription initiation factor TFIID subunit 2 [Myotis davidii]
Length = 713
Score = 85.1 bits (209), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/249 (26%), Positives = 108/249 (43%), Gaps = 62/249 (24%)
Query: 216 QLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
QL + DVVAQ ++I ALE P + + L + L + F+RVR+ A + LA T
Sbjct: 254 QLRFERDVVAQQESILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKT---- 307
Query: 276 TDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA 335
+P A+A++R N P+E
Sbjct: 308 -----------------------------------------MPVAMALLRDVHNLCPKEV 326
Query: 336 VEFVLQLLKYNDNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LK 384
+ F+L L+KYNDN N +SD ++ A AL SV + L +L L+
Sbjct: 327 LTFILDLIKYNDNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILE 386
Query: 385 RIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVR 444
I R L ++L+PSY +T+ C+R + L+ +G + D L K + ++ +R
Sbjct: 387 EITRFLNMEKLLPSYRHTITVGCLRAIR--VLQKNGHVPSDPT--LFKSYAEYAHFVDIR 442
Query: 445 VEASRALLD 453
+ A A++D
Sbjct: 443 IAALEAVVD 451
>gi|298713843|emb|CBJ27215.1| TRANSCRIPTION INITIATION FACTOR TFIID [Ectocarpus siliculosus]
Length = 1445
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/384 (23%), Positives = 163/384 (42%), Gaps = 93/384 (24%)
Query: 187 SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP--------- 237
SP+ W++ DP++E++ + +P W+ QL D V AQ +A+ L LP
Sbjct: 809 SPVLWVKVDPQLEWMMSVRMRKPWHQWVEQLFCDESVEAQCEALRGLVELPLPFELRNSE 868
Query: 238 ---------------HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET---DWA 279
H + + + + A W+ + +A S+ET +W
Sbjct: 869 LPVQAIAEAVRGRVSHTQMEHASCVRKEAAVALALWQNQHAPPHAGGFGESKETRAQNWR 928
Query: 280 GLLHLVKFYKSRRFDE-------------NIG--------------LPRPNDFRDFSEYF 312
GL HL+K + + RF++ IG L PN F D +EY
Sbjct: 929 GLEHLLKAF-AERFEQRDSKSPASSGMRLGIGHDEAGSGGGNRKSALLLPNWFEDSTEYR 987
Query: 313 VLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG 372
+ +A+ A A +RA D +P E ++ ++ LL+ NDN N SD +++A ++ S+G++
Sbjct: 988 LKKALIWAAANMRARDGCTPLEVLDLLVVLLRENDNCENSRSDDYYVAQVLTSLGQVSTT 1047
Query: 373 QQSILFLSSLLKRIDR---LLQFD-----------------RLMPSYNGILTISCIRTLT 412
Q++ L SS+ + I + L D R S+NG++ ++ L
Sbjct: 1048 QEAGLPASSVAEAISQARLWLDLDMLGVSSSGHKATAKPATRTTKSHNGVVAACALQCLC 1107
Query: 413 QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALL------------DLEFHCNG 460
Q+ + G +D + + + VR+ A+++++ D E G
Sbjct: 1108 QLEMLRGGRPEVD-----YEEYTEGRHPTNVRIAAAQSIIHLYLAYDSGLAEDAEDDGEG 1162
Query: 461 IDSALSLFIKSVE-EEPSLRGQVK 483
+ SAL + VE EEP+ G+ +
Sbjct: 1163 LSSALLWVLDFVESEEPAGDGRPR 1186
>gi|123479579|ref|XP_001322947.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121905802|gb|EAY10724.1| hypothetical protein TVAG_364490 [Trichomonas vaginalis G3]
Length = 980
Score = 83.6 bits (205), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/216 (27%), Positives = 114/216 (52%), Gaps = 10/216 (4%)
Query: 176 VAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEA 235
V G + S+ + W+ DP+ +L + P M N L+ DV +Q + +++LE
Sbjct: 523 VNGEQVEISIHHSILWVAIDPDHTWLMRVRPRLPQFMVHNMLDLVRDVYSQHETLSSLEE 582
Query: 236 LPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFD 294
+ + L +FL ++K F+ VR AA LA A+ ++ + + L+ +YK+ F
Sbjct: 583 WIK-NKDPQQWLKSFLENNKIFYGVRGHAARTLALFATPDDVENTNMSTLISWYKAEFFS 641
Query: 295 ENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYS 354
+ P+ N+F+D ++FV + +++++R +PRE E ++Q+L NDNNGN YS
Sbjct: 642 ND--KPKQNNFQDLQKHFVQLEVIKSMSVIRNNQGYTPREIAELLIQILDQNDNNGNQYS 699
Query: 355 DVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLL 390
D + + ++G + F + ++KR++++L
Sbjct: 700 DDNFRVEIALALGRV-FSDN-----ADIMKRVEQIL 729
>gi|302834603|ref|XP_002948864.1| hypothetical protein VOLCADRAFT_89164 [Volvox carteri f. nagariensis]
gi|300266055|gb|EFJ50244.1| hypothetical protein VOLCADRAFT_89164 [Volvox carteri f. nagariensis]
Length = 2124
Score = 79.0 bits (193), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 78/286 (27%), Positives = 125/286 (43%), Gaps = 57/286 (19%)
Query: 188 PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAA---LEALPHL----- 239
P+ + R DP E+L ++ Q MW+ QL DVVAQ+ A+ A + P L
Sbjct: 884 PVQYFRVDPGQEWLCDVVVLQSESMWVAQLRGSKDVVAQSLAVQADSHVSPSPPLFKGLM 943
Query: 240 -------------SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGL----- 281
S +V+ L + + + RVRIEAA ALA + + G+
Sbjct: 944 RLGLQLESSGAADSSSVIGVLLDCVRSGGVYCRVRIEAARALAAFSPSMSTQPGVGLVGA 1003
Query: 282 -LHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE-------AIPHAVAMVRAADNKSPR 333
+L+ F+ + D + RPN + D + VL+ A+P +A +R S
Sbjct: 1004 APNLLSFFYDQCRDPVSAVLRPNYWADLGTHLVLQARGVQEKAVPACLASIRDVSFNSTP 1063
Query: 334 EAVEFVLQLLKYNDNNG---------------------NPYSDVFWLAALVQSVGELEFG 372
EA++ V +L NDN+ N Y D +LAA++++ G L
Sbjct: 1064 EAMQLVTDILTNNDNSAAYYKQTPPTYASFFPLCTTTENLYDDSSYLAAVIEAAGLLRPT 1123
Query: 373 QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
S+ L+ LL +D L D L PS N ++ +C+ +T +AL +
Sbjct: 1124 HSSV--LARLLGLLDDYLLQDCLHPSPNYVVGSACLAAMTSLALSV 1167
>gi|116180768|ref|XP_001220233.1| hypothetical protein CHGG_01012 [Chaetomium globosum CBS 148.51]
gi|88185309|gb|EAQ92777.1| hypothetical protein CHGG_01012 [Chaetomium globosum CBS 148.51]
Length = 1486
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 126/545 (23%), Positives = 235/545 (43%), Gaps = 78/545 (14%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
+ K GS +++ I+S A + + + LS +FR K RP E + +WV
Sbjct: 546 MSKIPGSMGIVRVISQIVSNANINATITSTSLSDADFRRACEKKSQY-RP--DELWQQWV 602
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVK--PDSRTPVLSSNTDSENRDGDIGWP-- 115
GCP L + +NK+ V++++ + + + P++ T +E P
Sbjct: 603 HGAGCPKLLIKQRFNKKNLNVDISIAQTQSKENGPNAITKEEFLKELAEEVHEVWAGPVP 662
Query: 116 ----GMMSIRVHELDGM-YDHPILPMAGD-AWQLLEIQCHSKL-------AARRALKPKK 162
G ++R+HE DG Y+H + D L+I ++K A A
Sbjct: 663 KLFTGPFTVRIHEADGTPYEHYLAITDKDKVATTLQIAYNTKYKRMKRTKKATAAAASTA 722
Query: 163 GSKPDGCDDN-------GDAVA--------GL-----DMRSSM-ESPLSWIRADPEMEYL 201
K D +D+ GD + GL +++++M + WIR D E+L
Sbjct: 723 VDKQDIQEDDIVYFNMLGDVLMTQRDLENWGLQEWSDNVQTAMDQESYEWIRLDCNFEWL 782
Query: 202 AEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE-ALPHLSFNVVNTLNNFLSDSKAFWRV 260
E + P M++ QL++D DVVA A+ + H + + T + D + F +
Sbjct: 783 CETVTDMPGYMYLAQLQQDRDVVAHQDAMLFFQRGKRHGVASSIETRT--VMDRRYFHGI 840
Query: 261 RIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL-------PRPNDFRDFSEYFV 313
R A L A E ++ G L+ ++ D +G P PNDF D ++Y V
Sbjct: 841 RTMAIDDLPKQADPEINYIGTAQLILIFRHFFCDRMVGKNGNVTFPPVPNDFHDKAQYSV 900
Query: 314 LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS-----VGE 368
A+P A+A R + + R+A F+L LL +NDN+ N YSD F++A L+++ + +
Sbjct: 901 QCALPGAIARTRE-NGRCSRQARNFLLDLLLFNDNSDNEYSDHFYIAKLLEALTTSIIPD 959
Query: 369 LEFGQQSIL---------------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQ 413
G++ + F+ ++ ID+ + D +Y + T + + +
Sbjct: 960 KVEGERDLYSRLGADDDDDMELKRFIEKTIEEIDKYRRMDEWTSTYQNVWTTTALHCKMR 1019
Query: 414 IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVE 473
+ + + Q ++ ++ +D N + +R++A L++L G+ + K+ +
Sbjct: 1020 L---MKANVIRTQPLEFVRYLQDGN-LDLIRIKAFECLVELGLLAKGMAAVALGESKAAQ 1075
Query: 474 EEPSL 478
EP +
Sbjct: 1076 LEPEV 1080
>gi|367019792|ref|XP_003659181.1| hypothetical protein MYCTH_2295880 [Myceliophthora thermophila ATCC
42464]
gi|347006448|gb|AEO53936.1| hypothetical protein MYCTH_2295880 [Myceliophthora thermophila ATCC
42464]
Length = 1543
Score = 78.6 bits (192), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 125/527 (23%), Positives = 227/527 (43%), Gaps = 88/527 (16%)
Query: 2 LEKQMGSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
+ K GS +++ I+S A + + + L+ +FR K RP E + +WV
Sbjct: 547 MSKIPGSMGIVRVISQIVSNANINATITSTSLADADFRRACEKKSQY-RP--DELWQQWV 603
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVL----RDCTVKPDSRTPVLSSNTDS--ENRDGDIG 113
GCP L + +NK+ V++++ +D K ++ L + E G +
Sbjct: 604 HGAGCPKLLIKQRFNKKNLNVDISIAQTQSKDNGPKAITKDEFLKELLEEVYEVWAGPVP 663
Query: 114 --WPGMMSIRVHELDGM-YDHPILPMAGD-AWQLLEIQCHSKLAARR---------ALKP 160
+ G ++R+HE DG Y+H + D L+I ++K + A
Sbjct: 664 KLFTGPFTVRIHEADGTPYEHYLAITDKDKVATTLQIAYNTKYKRMKRTKKATAAAAAAN 723
Query: 161 KKGSKPDGCDDN-------GDAVA--------GL-DMRSSMESPLS-----WIRADPEME 199
G K D +D+ GD + GL D S+++ + WIR D E
Sbjct: 724 SAGDKHDIQEDDIVYFNMLGDVLMSPRDIENWGLQDWSESVQAAMDQESYEWIRLDCNFE 783
Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE-ALPHLSFNVVNTLNNFLSDSKAFW 258
+L E+ + P M++ QL++D DVVA A+ + H + + T + D + +
Sbjct: 784 WLCEMITDMPGYMYLAQLQQDRDVVAHQDAMLFFKRGKRHGVASSIETRT--VMDRRYYH 841
Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL-------PRPNDFRDFSEY 311
+R A L A E ++ G L+ Y+ +G P PNDF D ++Y
Sbjct: 842 GIRTMAIDDLPKQADPEINYIGTAQLILLYRHFFCHRIVGKNGSVTFPPAPNDFYDKAQY 901
Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV----- 366
V A+P A+A R + + R A EF+L LL +NDN+ N YSD F++A L++++
Sbjct: 902 AVQCALPAAIARTRE-NGRCSRRAREFLLDLLLFNDNSDNEYSDQFYIAKLLEALTTSII 960
Query: 367 -------------------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISC 407
+LEF F+ ++ I++ + D +Y+ I T +
Sbjct: 961 PDKWESERDLYSSLRTDDDDDLEFK----TFIEKAIEEIEKYRRMDEWTSTYHNIWTTTA 1016
Query: 408 IRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
+ ++ + + + ++ ++ +D N + +R+EA L++L
Sbjct: 1017 LHCKMRL---MKANVIRTEPLEFVRYLQDEN-LDLIRIEAFDCLVEL 1059
>gi|452819327|gb|EME26388.1| transcription initiation factor TFIID subunit D2, partial
[Galdieria sulphuraria]
Length = 580
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/265 (26%), Positives = 122/265 (46%), Gaps = 31/265 (11%)
Query: 188 PLSWIRADPEMEYLAE-IHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP--HLSFNVV 244
P+ W+R DP+ E++ + + F + + I+ L DV Q +A+ L + +S +
Sbjct: 16 PVLWVRIDPDQEWILKFVEFRRSERDSISCLRNCRDVWGQLEALETLMSRTKGRVSVTSI 75
Query: 245 NTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPND 304
L L+DS+ +++VR E A LA + GL L+K +++ FD++ G N
Sbjct: 76 RCLEQILTDSRYYFQVRGECARCLAKCYTTNDRLMGLEALLKLLRNKYFDKH-GAVLSNH 134
Query: 305 FRDFSEYFVLEAIPHAVAM--------VRAADNKSPREAVEF----VLQLLKY------- 345
F D SEYFV A+ A+A + D++ F + + L Y
Sbjct: 135 FDDISEYFVKCAVIQAIAQAIHPISFPIYGYDSQFSLNLASFQMTHIQESLPYDVLQLLL 194
Query: 346 -----NDNNGNP-YSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSY 399
ND P + D F+L L+ + + + + K++ R L+ + MPSY
Sbjct: 195 SLLEQNDTAYPPLFDDCFYLHNLIIACAICSVYAKDQITRHRVYKQLFRYLKREEAMPSY 254
Query: 400 NGILTISCIRTLTQIALKLSGFISL 424
+ I+T + ++ LT L+L GF+ +
Sbjct: 255 HNIITGASLKGLT--ILQLGGFLRI 277
>gi|320166290|gb|EFW43189.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 1514
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 70/287 (24%), Positives = 124/287 (43%), Gaps = 65/287 (22%)
Query: 222 DVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGL 281
D+ Q +AI L P L+ TL L + + F R+R+ A +ALA+ AS + + G+
Sbjct: 846 DINVQHEAIKKLADRPSLA--TWTTLVKVLHNERVFHRLRLAAVFALASMASLDPNLNGI 903
Query: 282 LHLVKFYKSR-------RFDE--------NIG------------------------LPRP 302
+L+ ++ R R E N+ L P
Sbjct: 904 RYLMDEFRKRFCITSHQRDKETQHAFAYLNVKAPPSPPPPTAAVPNPPVVSAFTDYLVEP 963
Query: 303 NDFRDFSEYFVLEAIPHAVAMVRAA-----DNKSPREAVEFVLQLLKYNDNNGNPYSDVF 357
NDF +F+ Y + +A A+ +R + P V F+L +L+ NDN+ NPY+D
Sbjct: 964 NDFTNFANYHLKKAFVRALGTIRLRRPGLRNPPCPWGVVRFLLDVLEMNDNSVNPYTDAH 1023
Query: 358 WLAALVQSVGE-------------LEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT 404
++AA++ + E L F + + + +L RLL ++L PSY ++
Sbjct: 1024 YIAAIIDGLAETVTDDYDLKPTDTLPFPYEHPIHV--ILAEFTRLLSLEKLNPSYRHVVA 1081
Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
+SC+ ++ I + G +++D + + + VR+ A R L
Sbjct: 1082 VSCLNGMSLIQAR--GVLTID--IGVFSSHARYGHFTDVRLAAMRGL 1124
>gi|443920353|gb|ELU40290.1| transcription initiation factor TFIID subunit 2 [Rhizoctonia solani
AG-1 IA]
Length = 1574
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/201 (29%), Positives = 94/201 (46%), Gaps = 55/201 (27%)
Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF--DENIG----------LPRPND 304
F+RVR AA ALA+ + GL HL+K + + D N G +PRPND
Sbjct: 851 FFRVRCAAARALASQGAP-----GLFHLLKIWTKWCYEPDSNNGNGGDGFAQRFIPRPND 905
Query: 305 FRDFSEYFVLEAIPHAVAMVRAADNKSPREAV-------EFVLQLLKYNDNNGNPYSDVF 357
F DF EYFV +A+ A++ +R D S A ++ L YNDNNGN Y+D
Sbjct: 906 FSDFQEYFVRKAVITALSQLR--DGASGTGAWLGGALVRGTIVDQLVYNDNNGNTYADDM 963
Query: 358 WLAALVQSVG--------------------ELEFGQ---------QSILFLSSLLKRIDR 388
++A L+ ++G E+E + ++ L + +++R
Sbjct: 964 YIATLITALGTALIPPDVSERNELAHNHTVEMEMEEAMRRQMEEDENAEVLRRAVTQVER 1023
Query: 389 LLQFDRLMPSYNGILTISCIR 409
DRL+PS + +++I+ I+
Sbjct: 1024 YRARDRLVPSRHNVISIAAIQ 1044
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 44/96 (45%), Gaps = 14/96 (14%)
Query: 135 PMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRA 194
P A AW + E + +P + P + N A AG + E+ WIR
Sbjct: 693 PAARAAWHVEEFE---------DAEPPPHTGPIPFNPNSGAAAG-----NEETAYEWIRL 738
Query: 195 DPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
DPEM ++A Q MW +QL ++ DV AQ +AI
Sbjct: 739 DPEMMWVARTELKQSGVMWASQLARERDVGAQMEAI 774
>gi|403348202|gb|EJY73534.1| hypothetical protein OXYTRI_05334 [Oxytricha trifallax]
Length = 1887
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 85/161 (52%), Gaps = 8/161 (4%)
Query: 184 SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL-PHLSFN 242
++ESP+ WIR DP+ME++ ++ Q W+ QL ++ D++ Q +AI L L +
Sbjct: 1135 NLESPVLWIRVDPDMEFIRKVKVVQEQNNWLFQLLQEKDIIGQIEAINELNKFNSELVYE 1194
Query: 243 VVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLH---LVKFYKSRRFDENIGL 299
++ +++ ++ F++VR A L + + + L H L+K + R FD++IG
Sbjct: 1195 ILKSVS---TNENYFFKVRKIAIKKLQQMQTSQFN-KYLSHEKFLIKVFNQRNFDKSIGF 1250
Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVL 340
+ NDF EY++ ++I ++ + + E E++L
Sbjct: 1251 YKSNDFSKVLEYYIDKSILKTLSKCKEVPLELKEEIKEYLL 1291
Score = 46.6 bits (109), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 48/91 (52%), Gaps = 8/91 (8%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
M+E + +++ +I++ I A+P T+S F + +V + LK+FF W+
Sbjct: 884 MIESMIDKSYYERIIRQIYD---SANP--TISQGLFHNIFKQVCGFK---LKQFFQNWII 935
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVK 91
+ CP L F+YNK+ N ++L + + +K
Sbjct: 936 STSCPKLTASFNYNKKNNSLDLNIAQTSAMK 966
>gi|449020077|dbj|BAM83479.1| TATA-box binding protein-associated factor 2 [Cyanidioschyzon merolae
strain 10D]
Length = 1444
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 68/287 (23%), Positives = 117/287 (40%), Gaps = 62/287 (21%)
Query: 188 PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
P+ WIR DP+ME++ + + Q + W+ QL + DV +Q AL L + V
Sbjct: 857 PIMWIRVDPDMEWVRGVFWRQNERAWVRQLAGERDVSSQMDCCDAL-YLHATAIGVRGLA 915
Query: 248 NNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF-DENIGLPR----- 301
+ L + + +RVR AA ++A ET + GL LV FY +R E+ LPR
Sbjct: 916 EHGLLNEQVHYRVRSAAAASIARWGGSETRFLGLEILVDFYANRYLVGEDQVLPRSDAKE 975
Query: 302 ------------------------------------------------PNDF-RDFSEYF 312
P DF R+ +EYF
Sbjct: 976 DNHLANVDASDALVGSQADAATTAATAPGRLQEKRARLGREWLQRLPRPLDFSRNLAEYF 1035
Query: 313 VLEAIPHAVAMVRAADN-----KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
+ A+ A+ +R A ++P + + +L++++N N + D ++L + +
Sbjct: 1036 LQRAMLRALGGIRVAQTGEGAQRTPTDLFALLDHVLRFHNNEANMFDDAYFLRDALLAFA 1095
Query: 368 ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQI 414
+ + + + + I+R L DRL PS+ LT CI ++
Sbjct: 1096 SASSTNEGLRTM-EVARCIERFLNRDRLFPSHQRALTAGCIEAAAEL 1141
>gi|326437301|gb|EGD82871.1| hypothetical protein PTSG_03502 [Salpingoeca sp. ATCC 50818]
Length = 723
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/247 (24%), Positives = 111/247 (44%), Gaps = 20/247 (8%)
Query: 187 SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL-EALPHLSFNVVN 245
SP+ W++ D + + LA I Q W Q D V+ Q AI A+ + + H S VV
Sbjct: 113 SPILWVKFDHDSKLLASITVEQEHWAWGYQARLDRHVIFQLDAIKAMGKHVGHRS--VVF 170
Query: 246 TLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF--DENIG-LPRP 302
TL + + + ++ VR +A ALA + + D G++ ++ + R + + G +P
Sbjct: 171 TLLDAAENEQLYFHVRTQAVLALAQCENVDEDQHGIVAALRAFYERHYCAKSDFGFMPLE 230
Query: 303 NDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW---- 358
NDF +F YF+ +A A++ ++ P E VE + ++ D N Y D +
Sbjct: 231 NDFSNFPAYFLQQATLTALSRLKNTYGVPPLEIVEMLHTIVLATDQGSNQYDDATFRRHT 290
Query: 359 LAALVQSV--------GELEFGQQSIL--FLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
LA ++ ++ QQ+ ++ + + +D + SY +T+ C+
Sbjct: 291 LACFAHALRPHKRRNAQDMPADQQAKRQEWVRRAVSLFAHCISYDHVSASYKKRVTVGCV 350
Query: 409 RTLTQIA 415
LT A
Sbjct: 351 DALTSFA 357
>gi|322796793|gb|EFZ19220.1| hypothetical protein SINV_09096 [Solenopsis invicta]
Length = 1089
Score = 68.6 bits (166), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 87/179 (48%), Gaps = 17/179 (9%)
Query: 310 EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE- 368
E ++ IP A+A +R P E + F++ L KYNDN+ N YSD ++ AAL++++G
Sbjct: 631 EMTIMRTIPVAMAGLRNNHGICPPEVLSFLMDLFKYNDNSKNRYSDNYYRAALIEALGAT 690
Query: 369 ----LEFGQQSIL---FLS----SLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALK 417
+ Q + + FLS ++L+ + R L ++L+P Y ++++C++ +
Sbjct: 691 VTPVISVQQGTAITAEFLSIDTKAILEEVTRNLNLEKLLPCYKYTVSVACLKVIR----I 746
Query: 418 LSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEP 476
L F L L + + + +R+ A AL+D +G L + E +P
Sbjct: 747 LQKFGHLPSNPHLFRAYAAYGQFEDIRIAALEALVDFT-RVDGKWEDLEFLLDMAENDP 804
Score = 50.4 bits (119), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/219 (24%), Positives = 93/219 (42%), Gaps = 40/219 (18%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
MLE ++G ++ +S A A+ + + + + H FA + +
Sbjct: 457 MLEHRIGQEVLLQVFNKQLSLAANAAQ-QKIESGLWSHMLISTNVFAKATFTVTGKDMSV 515
Query: 54 FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
F +WV T G + F +N+++N VEL + +D T ++ G
Sbjct: 516 FIDQWVRTGGHTKFSLSFIFNRKRNTVELEIRQDTT-----------------HQRGIRK 558
Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
+ G + + + ELDG + H L + G + +I CHSK + K + G
Sbjct: 559 YVGPLVVNIQELDGTFKH-TLQIEGTMAR-ADITCHSKSRRNKKKKIPLCT--------G 608
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQM 212
+ V +D+ + +SP+ WIR DPEM + I PV M
Sbjct: 609 EEV-DMDLSAMDDSPVLWIRLDPEMTIMRTI----PVAM 642
>gi|429962342|gb|ELA41886.1| hypothetical protein VICG_01070 [Vittaforma corneae ATCC 50505]
Length = 1504
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 70/314 (22%), Positives = 133/314 (42%), Gaps = 54/314 (17%)
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVEL-AVLRDCTVKPDSRTPVLSSNTDSENRD 109
L+ FF +V G +++ F NK+KN V++ A ++ P + +L
Sbjct: 561 LRSFFDFYVFRPGLMRVKLQFQINKKKNSVKVNASYTSTSLLPGANKRLL---------- 610
Query: 110 GDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGC 169
G + I+ EL+G +DH I + + H++ ++
Sbjct: 611 ------GNVDIKSVELEGGFDHSIAFGNENVFYY-----HTRTKKKKK------------ 647
Query: 170 DDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
PL +IR DP+ + E QP M I QL+ D V+ Q +A
Sbjct: 648 -----------DDEEETMPLLYIRVDPKRLNIFEYTVEQPDYMHIEQLQ-DKSVIGQLEA 695
Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYK 289
I L P LS L L + F+++R + AY L + E D+ GL +++++
Sbjct: 696 IYNLGTKPSLS--SCEALERLLDNFHIFYKIRAKIAYILRSIKIE--DYDGLQRIIQYFI 751
Query: 290 SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADN---KSPREAVEFVLQLLKYN 346
N + + N+F S YF+ + + +++ + + + + + F+ +LK+N
Sbjct: 752 RTYCVPNSTILKSNEFGLIS-YFIQKHLVDSISQINFKEECVISNSKIVLAFLENILKFN 810
Query: 347 DNNGNPYSDVFWLA 360
DN+ + + D +++A
Sbjct: 811 DNSLSQFEDSWYVA 824
>gi|322796774|gb|EFZ19201.1| hypothetical protein SINV_00189 [Solenopsis invicta]
Length = 451
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 71/146 (48%), Gaps = 16/146 (10%)
Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE-----LEFGQQS 375
+A +R P E + F++ L KYNDN+ N YSD ++ AAL++++G + Q +
Sbjct: 1 MAGLRNNHGICPPEVLSFLMDLFKYNDNSKNRYSDNYYRAALIEALGATVTPVISVQQGT 60
Query: 376 ILFLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVV 428
+ SL L+ + R L ++L+P Y ++++C++ + L F L
Sbjct: 61 AITAESLSIDTKAILEEVTRNLNLEKLLPCYKYTVSVACLKVIRI----LQKFGHLPSNP 116
Query: 429 KLIKPFRDFNTIWQVRVEASRALLDL 454
L + + + +R+ A AL+D
Sbjct: 117 HLFRAYAAYGQFIDIRIAALEALVDF 142
>gi|189203565|ref|XP_001938118.1| TATA-binding protein associated factor Taf2 [Pyrenophora
tritici-repentis Pt-1C-BFP]
gi|187985217|gb|EDU50705.1| TATA-binding protein associated factor Taf2 [Pyrenophora
tritici-repentis Pt-1C-BFP]
Length = 1359
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 72/146 (49%), Gaps = 22/146 (15%)
Query: 2 LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
L KQ G N + L ++ A QG P ++T +F KVG+ + L+ FF +W+
Sbjct: 506 LVKQSGKNGVDRCLYRMLFNARQGKYPNGAITTDDFLDICEKVGHQK---LESFFQQWIV 562
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSEN---------RD-- 109
GCP ++NK+K +V+L + + ++ DS L + +++ RD
Sbjct: 563 GSGCPTFHCLPTFNKKKQVVQLTIRQ---LQADSSITTLENGLSADDFMREAKEKSRDLR 619
Query: 110 GDIGWP---GMMSIRVHELDGM-YDH 131
G+P G M+IR+HE DG Y+H
Sbjct: 620 SATGYPTFVGPMTIRIHEADGTPYEH 645
>gi|422293197|gb|EKU20497.1| hypothetical protein NGA_2071100, partial [Nannochloropsis gaditana
CCMP526]
Length = 290
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 34/158 (21%)
Query: 188 PLSWIRADPEMEYLAEIHF-NQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNT 246
P+ W+R DP+++++AE + N P +++ QL +D D AQ A+ AL + P + +
Sbjct: 131 PVVWLRLDPDLQWIAEWEWENLPEYIFLEQLHRDPDAAAQCLALRALTSYPKATQDTTAL 190
Query: 247 LNNFLSDSK-------------------AFW-----RVRIEAAYALANTASE-------- 274
+ + S + W VR+EAA ALA S
Sbjct: 191 VGSVQSGTTLRPSLAAQAMSDCVRGKQGPGWVEPSLNVRMEAARALAAWQSNHAPGSMIT 250
Query: 275 -ETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEY 311
++ W GL L Y+ R FD +P P DF D +EY
Sbjct: 251 ADSGWRGLDFLFCAYRERYFDATSRVPLPADFTDEAEY 288
>gi|269861284|ref|XP_002650354.1| transcription initiation factor TFIID subunit [Enterocytozoon
bieneusi H348]
gi|220066214|gb|EED43707.1| transcription initiation factor TFIID subunit [Enterocytozoon
bieneusi H348]
Length = 1331
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 58/283 (20%), Positives = 124/283 (43%), Gaps = 26/283 (9%)
Query: 182 RSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSF 241
+ S + ++RAD + E +I NQP ++I QL D V+ Q +A+ + + S
Sbjct: 545 KKSDNGTVLFVRADCKREIFGKIRINQPEALYIEQL-LDKSVIGQFEAVEFFKN--NTSE 601
Query: 242 NVVNTLNNFLSDSKAFWRVRIEAAYALANTA--------SEETDWAGLLHLVKFYKSRRF 293
+ L + F RV+ E L N ++E+++ GL+ L++F+ +
Sbjct: 602 KAIEALERVAENPHTFHRVKNEIYKVLKNIYFQHYGNDNNKESNYNGLMRLIQFFIKLKC 661
Query: 294 DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA-----VEFVLQLLKYNDN 348
N + + + + + + +++ + N E + F+ ++ YND
Sbjct: 662 VPNSTIVKNTELGILN-FLIQKSLVKNITDFENEQNVKSNEKNINVIIAFLNNIINYNDT 720
Query: 349 NGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
++D F+L+ ++ + + S L +L I+R D + PS + +++ +C+
Sbjct: 721 TKEHFNDPFYLSNILNKLCIYKIHLSSTLDDQNLFSEIERFRFLDMVFPSPHNVISKTCL 780
Query: 409 RTLTQIA----LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEA 447
L ++A + LS LD I + +F ++ Q+ +E
Sbjct: 781 ICLLRLAYAKKINLSKKFLLD-----ISKYPNFISLRQIAIEG 818
>gi|167526722|ref|XP_001747694.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773798|gb|EDQ87434.1| predicted protein [Monosiga brevicollis MX1]
Length = 1159
Score = 59.3 bits (142), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 54/247 (21%), Positives = 106/247 (42%), Gaps = 36/247 (14%)
Query: 183 SSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFN 242
+S +S + W++AD M+++A + Q W E + V Q AI AL +
Sbjct: 601 NSTKSTIMWVQADVNMKWMARVEMKQEWWAWGYAAEHERHVGWQVDAIHALRQFK--DKD 658
Query: 243 VVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF--------- 293
+ L+ ++ + + R R +A +LA + +LVK+ + R
Sbjct: 659 AIKLLDEIAANERYYHRARRQAILSLAQMSDAPVK-ERFQYLVKYLRKRYCAPPSMELIR 717
Query: 294 ---DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNG 350
E +P NDF+D Y VL + NK+ +A++ ++ +L+Y+DN+
Sbjct: 718 AGKTEIPVIPMENDFKDIRNYLVLRLV--------ETPNKTCEDALQVLIDMLQYSDNSD 769
Query: 351 NPYSDVFWLAALVQSVGELEF------GQQSILFLSSLLKRIDRLLQ-------FDRLMP 397
N + D + A ++ ++ + I+ L ++ + ++Q +D + P
Sbjct: 770 NMFDDGEYRAEIINAMVNVLLRSWRMPSGSRIMTEEQLCEKTETVIQLLVHQLSYDTVDP 829
Query: 398 SYNGILT 404
SYN +T
Sbjct: 830 SYNQPVT 836
>gi|355723103|gb|AES07784.1| TAF2 RNA polymerase II, TATA box binding protein -associated
factor, 150kDa [Mustela putorius furo]
Length = 424
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 63/126 (50%), Gaps = 15/126 (11%)
Query: 339 VLQLLKYNDNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRID 387
+L L+KYNDN N +SD ++ A AL SV + L +L L+ I
Sbjct: 1 ILDLIKYNDNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEIT 60
Query: 388 RLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEA 447
R L ++L+PSY +T+SC+R + L+ +G + D L K + ++ +R+ A
Sbjct: 61 RFLNMEKLLPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAA 116
Query: 448 SRALLD 453
A++D
Sbjct: 117 LEAVVD 122
>gi|331214257|ref|XP_003319810.1| hypothetical protein PGTG_01984 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309298800|gb|EFP75391.1| hypothetical protein PGTG_01984 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 279
Score = 57.0 bits (136), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 64/124 (51%), Gaps = 12/124 (9%)
Query: 192 IRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL 251
+R D E+E++ E MW+ QL++D DVVAQ +A+ AL+ +P S V + L
Sbjct: 115 VRLDVEVEWIFEFSIEMKEFMWLEQLQRDHDVVAQVEAVRALKTIP--SKVVSSHLCRVA 172
Query: 252 SDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR-----RFDENI-----GLPR 301
+ + +R EA AL + A+ GL HL+K ++SR R + + +P+
Sbjct: 173 PVLEYIFHMRNEAVLALDSCATCRCGVLGLFHLLKLFQSRYCYPPRVEPDSPWHIRPIPK 232
Query: 302 PNDF 305
PN F
Sbjct: 233 PNQF 236
>gi|26341738|dbj|BAC34531.1| unnamed protein product [Mus musculus]
Length = 119
Score = 53.5 bits (127), Expect = 4e-04, Method: Composition-based stats.
Identities = 37/119 (31%), Positives = 58/119 (48%), Gaps = 6/119 (5%)
Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
M L ++ F Q MW +L + DVVAQ ++I ALE P + + L + L + F
Sbjct: 1 MSVLRKVEFEQADFMWQYELRYERDVVAQQESILALEKFPTPASRL--ALTDILEQEQCF 58
Query: 258 WRVRIEAAYALANTA-SEETDWAGLLHLVKFYKSRRF--DENIGLPRPNDFRDFSEYFV 313
+RVR+ A + LA A S + W G + + +R F + + N+F F YF+
Sbjct: 59 YRVRMSACFCLAKIANSMVSTWTGPPAMKSLF-TRMFCCKTCPNIVKTNNFMSFQSYFL 116
>gi|326435506|gb|EGD81076.1| hypothetical protein PTSG_11021 [Salpingoeca sp. ATCC 50818]
Length = 498
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 43/191 (22%), Positives = 85/191 (44%), Gaps = 17/191 (8%)
Query: 242 NVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF--DENIG- 298
+VV TL + + + ++ VR +A ALA + + D G++ ++ + R + + G
Sbjct: 9 SVVFTLLDAAENEQLYFHVRTQAVLALAQCENVDEDQHGIVAALRAFYERHYCAKSDFGF 68
Query: 299 LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW 358
+P NDF +F YF+ +A A++ ++ P E VE + ++ D N Y D +
Sbjct: 69 VPLENDFSNFPAYFLQQATLTALSRLKNTYGVPPLEIVEMLHTIVLATDQGSNQYDDATF 128
Query: 359 ----LAALVQSV--------GELEFGQQSIL--FLSSLLKRIDRLLQFDRLMPSYNGILT 404
LA ++ ++ QQ+ ++ + + +D + SY +T
Sbjct: 129 RRHTLACFAHALRPHKRRNAQDMPADQQAKRQEWVRRAVSLFAHCISYDHVSASYKKRVT 188
Query: 405 ISCIRTLTQIA 415
+ C+ LT A
Sbjct: 189 VGCVDALTSFA 199
>gi|298243623|ref|ZP_06967430.1| Peptidase M1 membrane alanine aminopeptidase [Ktedonobacter
racemifer DSM 44963]
gi|297556677|gb|EFH90541.1| Peptidase M1 membrane alanine aminopeptidase [Ktedonobacter
racemifer DSM 44963]
Length = 889
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/278 (21%), Positives = 106/278 (38%), Gaps = 62/278 (22%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEF-RHFANKVGNLERPFLKEFFPRWV 59
ML Q+G + FR+ +++ + R + R + T + R G L FF +W+
Sbjct: 421 MLRNQLGDSAFRRGIKSYLERNR----TREVITADLERALEEATGRS----LAHFFHQWI 472
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
+ G P + +S++ + + ++ V + K D TP P ++
Sbjct: 473 YSGGYPEFEVSYSWDGQHKLAKVKVRQ--AQKVDDLTPCFKV-------------PTEIA 517
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
+ +G + +P+ H L G D + + L
Sbjct: 518 FTIAASEGSDETRTIPL------------HVTL---------------GEDGEVEQIFYL 550
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL--EALP 237
+E +R DPE L + F +P + + QLE D D++ + +A AL P
Sbjct: 551 ----PLEREPVMVRFDPEGRLLKSLKFERPTHLLVYQLEHDADILGRVEAAEALADRREP 606
Query: 238 HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
++N LN S FW VR AA A+A + E
Sbjct: 607 QALTALINALN-----SDTFWGVRTTAAEAIAKHGTAE 639
>gi|296004883|ref|XP_002808789.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
3D7]
gi|225632177|emb|CAX64062.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
3D7]
Length = 3489
Score = 50.1 bits (118), Expect = 0.005, Method: Composition-based stats.
Identities = 29/84 (34%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
C DN VA + ++ + P+ WIR D + L I +Q + MWI QL D +V++Q +
Sbjct: 2075 CLDNRKLVAKICSKNKI--PILWIRIDNDFVLLGRIRRSQSISMWIQQLYNDNNVLSQLE 2132
Query: 229 AIAALEALP--HLSFNVVNTLNNF 250
+ AL +P L+ NV + N F
Sbjct: 2133 SSFALAFIPFFFLNKNVFSNYNEF 2156
>gi|83317854|ref|XP_731343.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23491352|gb|EAA22908.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 1684
Score = 49.7 bits (117), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 37/136 (27%), Positives = 64/136 (47%), Gaps = 6/136 (4%)
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
C DN VA + ++ + PL WIR D + + I Q V MWI QL D +V+AQ +
Sbjct: 1329 CLDNRKLVAKICSKNKI--PLLWIRVDNDFYIIGRIRRTQSVSMWIQQLLNDNNVLAQLE 1386
Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFY 288
+ +L +P N N +NF + ++ +++ + N + + L++ +
Sbjct: 1387 SSFSLAFIPFFFLN-KNITSNFNENKNNYFLGPXKSSTNINNNS---VHYGDLVNNITSI 1442
Query: 289 KSRRFDENIGLPRPND 304
S DENI P++
Sbjct: 1443 GSVDSDENINHLDPDN 1458
>gi|385303226|gb|EIF47315.1| tfiid subunit (150 kda) involved in rna polymerase ii transcription
initiation [Dekkera bruxellensis AWRI1499]
Length = 394
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 62/284 (21%), Positives = 108/284 (38%), Gaps = 68/284 (23%)
Query: 31 LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
LST +H KV + + L FF WV G P + +NK++ +E+++ + +
Sbjct: 119 LSTSHXQHVCEKVVHHK---LDGFFENWVHNPGVPRFTISQKFNKKRMFIEMSIRQTQSS 175
Query: 91 K---------------PDSRTPVLSSN------------TDSENRDGDIGWPGMMSIRVH 123
P+ R V T E+ + + G ++IR+H
Sbjct: 176 TKMKRLAXENDEZLTLPEMRXYVQQFKNKYFIDEADQMITGEESFEPLPAFTGPITIRIH 235
Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKP---------------------- 160
E DG Y+H + + + +IQ ++K R K
Sbjct: 236 EADGTPYEHILY--IREPYTKFDIQYNTKYRRTRRRKEGLLDDSEENKSSNSEGYLGDIL 293
Query: 161 -------KKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMW 213
K G K D D++ + +M+S W+R D + E++ E H N +M
Sbjct: 294 MTSSEAXKWGLKEDAXDESTNIA---EMQSY---AFEWMRFDADNEWICEKHINVSDEMQ 347
Query: 214 INQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
+ L++D DV AQ +A+ + + N L L D++ F
Sbjct: 348 ESMLQQDRDVDAQMEAVQYFGBIVRPKLHFANILLRTLLDTRYF 391
>gi|70949844|ref|XP_744296.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56524191|emb|CAH78218.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 1353
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 62/130 (47%), Gaps = 7/130 (5%)
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
C DN VA + ++ + PL WIR D + + I Q V MWI QL D +V+AQ +
Sbjct: 521 CLDNRKLVAKICSKNKI--PLLWIRVDNDFYIIGRIRRTQSVSMWIQQLLNDNNVLAQLE 578
Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFY 288
+ +L +P N N +NF +D K + + + L N ++ + GL + +
Sbjct: 579 SSFSLAFIPFFFLN-KNITSNF-NDGKNNYFMGPDKGPNLNNNSAH---YGGLANNITSM 633
Query: 289 KSRRFDENIG 298
S DEN+
Sbjct: 634 GSDGSDENMN 643
>gi|147858618|emb|CAN81008.1| hypothetical protein VITISV_011583 [Vitis vinifera]
Length = 307
Score = 47.0 bits (110), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 37/65 (56%), Gaps = 9/65 (13%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRH--------FANKVGNLERPFL 51
MLEKQMG FRKILQ I+ RAQ + +RTLSTKE + FA ++ +L
Sbjct: 233 MLEKQMGPESFRKILQTIVFRAQDTTRSLRTLSTKEVQRICSQEHIWFAGIYAXEQKSYL 292
Query: 52 KEFFP 56
++ P
Sbjct: 293 RKCIP 297
>gi|68067458|ref|XP_675693.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56495025|emb|CAH97801.1| hypothetical protein PB000540.02.0 [Plasmodium berghei]
Length = 926
Score = 46.6 bits (109), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 2/74 (2%)
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
C DN V+ + ++ + PL WIR D + + I Q V MWI QL D +++AQ +
Sbjct: 473 CLDNRKLVSKICSKNKI--PLLWIRVDNDFYIIGRIRRTQSVSMWIQQLLNDNNILAQLE 530
Query: 229 AIAALEALPHLSFN 242
+ +L +P N
Sbjct: 531 SSFSLAFIPFFFLN 544
>gi|221052886|ref|XP_002261166.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|194247170|emb|CAQ38354.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 3228
Score = 46.2 bits (108), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 35/69 (50%), Gaps = 2/69 (2%)
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
C DN VA + + + PL WIR D + L I Q MWI QL D +V+AQ +
Sbjct: 1913 CLDNRKLVAKICSKGKI--PLLWIRIDNDFNILGRIRRTQSASMWIQQLCNDNNVLAQLE 1970
Query: 229 AIAALEALP 237
+ AL +P
Sbjct: 1971 SSFALAFIP 1979
>gi|219115988|ref|XP_002178789.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217409556|gb|EEC49487.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1836
Score = 45.8 bits (107), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 61/248 (24%), Positives = 86/248 (34%), Gaps = 87/248 (35%)
Query: 187 SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP--------- 237
SPL W+R DP Y I QP QL DGD AQ +AI L P
Sbjct: 1103 SPLLWVRVDPMGLYAGRISVCQPDACLAEQLFHDGDAAAQVEAIRTLAERPIRIQPSSKV 1162
Query: 238 HLSFNV------VNTLNNFLSDSKAFWR-------VRIEAAYALA----NTASEETD--- 277
+ ++V V L + L S A +R +AA A+A N A D
Sbjct: 1163 NAVYDVSVMELPVRVLGDCLRGSPALHSSLPHTPAIRSQAALAIAQWQNNKAPPSKDSVD 1222
Query: 278 ---WAGLLHLVKFYKSRRFDENIGLP--------RPND-------------------FRD 307
W G+ L+ +++ R + +I +P + +D F D
Sbjct: 1223 ADSWIGINLLLHYFRERFYSNSIIMPTKFSRLALKKSDVEMNQAAAANESAGANQPTFDD 1282
Query: 308 FSEYF----------------------------VLEAIPHAVAMVRAADNKSPREAVEFV 339
EY V A+ A+A RA D +P AVEF+
Sbjct: 1283 AYEYLDILEAGEERAAALEDADQVEIEEDEEYRVRSAVVTAIACTRAKDGLTPSIAVEFL 1342
Query: 340 LQLLKYND 347
+L+ D
Sbjct: 1343 ETVLEAED 1350
>gi|156093478|ref|XP_001612778.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148801652|gb|EDL43051.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 3314
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 2/69 (2%)
Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
C DN VA + + + PL WIR D + L I Q MWI QL D +V AQ +
Sbjct: 1980 CLDNRKLVAKICSKGKI--PLLWIRIDNDFSILGRIRRTQSASMWIQQLFNDNNVFAQLE 2037
Query: 229 AIAALEALP 237
+ AL +P
Sbjct: 2038 SSFALAFIP 2046
>gi|118375202|ref|XP_001020786.1| hypothetical protein TTHERM_00408940 [Tetrahymena thermophila]
gi|89302553|gb|EAS00541.1| hypothetical protein TTHERM_00408940 [Tetrahymena thermophila SB210]
Length = 1956
Score = 44.3 bits (103), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 59/276 (21%), Positives = 110/276 (39%), Gaps = 36/276 (13%)
Query: 152 LAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHF-NQPV 210
+A R + G K + +D+ + E WI+ DP +YL + F Q +
Sbjct: 1226 VAYRSETRKNNGKKKEDDEDDSKKKQQGSKKKQDELEKVWIKIDPSNKYLLSLDFPQQNL 1285
Query: 211 QMWINQLEKDGDVVAQAQAIAALEALPHL--------SFNVVNTLNNFLSDSKAFWRVRI 262
+ QL KD + ++ L L L S+++V L + +++
Sbjct: 1286 HSLLTQLMKDKKHRYKYISVT-LYLLRQLRNKNYDQDSYDIVKNLKLIIQKDDLETDLQV 1344
Query: 263 EAAYALANTASE-----ETDWAGLLHLVKFYKSRRFDENIGLPR---PNDFRDFSEYFVL 314
E + +S+ + + L L+K IG + PN+F+ + +Y +L
Sbjct: 1345 EILDVIIRNSSKVKKARQECFNLLEELIK-------KNVIGQDKSLLPNNFQSYKKYMIL 1397
Query: 315 EAIPHAVAM-------VRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
I + V A P + ++L+L+K NDN+ N + D + A L +G
Sbjct: 1398 MTIIRGLPYCYDENISVDADKQYPPENTIRYLLKLIKENDNSINRFDDSIYRANL---IG 1454
Query: 368 ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGIL 403
L F ++LK + R L+ + + S N ++
Sbjct: 1455 SLLLTNHP-YFKKTILKEVIRCLELEYIECSLNDVV 1489
>gi|60688329|gb|AAH91307.1| Taf2 protein, partial [Rattus norvegicus]
Length = 379
Score = 43.1 bits (100), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 40/72 (55%), Gaps = 4/72 (5%)
Query: 382 LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIW 441
+L+ I R L ++L+PSY +T+SC+R + L+ +G + D L K + ++
Sbjct: 8 ILEEITRFLNMEKLLPSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFV 63
Query: 442 QVRVEASRALLD 453
+R+ A A++D
Sbjct: 64 DIRIAALEAVVD 75
>gi|403224177|dbj|BAM42307.1| uncharacterized protein TOT_040000676 [Theileria orientalis strain
Shintoku]
Length = 1773
Score = 43.1 bits (100), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 45/182 (24%), Positives = 76/182 (41%), Gaps = 24/182 (13%)
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
DN VA + ++ + PL WIR D LA I Q MW QL+ D ++ A +A
Sbjct: 1142 DNRSIVAKICSKTRL--PLLWIRVDSAFSMLARIRRCQSGSMWEQQLQSDNNIYALIEAC 1199
Query: 231 AALEALPH--------LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLL 282
AL ++ L+ + L+ + KA +R Y+L + + G L
Sbjct: 1200 QALGSIGKSEYVSDNPLTESACKRLDEVVKKQKAHPMIRARCVYSLVCLYNRDPRQHGYL 1259
Query: 283 H-LVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQ 341
+ Y +R + I P + R +E++ A+ ++R +P +FVL
Sbjct: 1260 YGFFTKYLTRLANLTI---HPAEARFIAEFY------KAMCLLRNDQGFTP----DFVLD 1306
Query: 342 LL 343
+L
Sbjct: 1307 IL 1308
>gi|45383890|ref|NP_989441.1| protein disulfide-isomerase A3 precursor [Gallus gallus]
gi|82243463|sp|Q8JG64.1|PDIA3_CHICK RecName: Full=Protein disulfide-isomerase A3; AltName:
Full=Endoplasmic reticulum resident protein 57; Short=ER
protein 57; Short=ERp57; AltName: Full=Glucose-regulated
thiol oxidoreductase 58 kDa protein; Flags: Precursor
gi|22651801|gb|AAM82759.1| glucose regulated thiol oxidoreductase protein precursor [Gallus
gallus]
Length = 505
Score = 41.6 bits (96), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/111 (33%), Positives = 47/111 (42%), Gaps = 19/111 (17%)
Query: 24 GASPVRTLSTKEFRHFANKVGNLERP--FLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVE 81
GAS V LS +F G ERP L EFF W G C + + K IV
Sbjct: 23 GASDVVELSDADFES-----GLAERPGLVLVEFFAPWCGHCKRLAPEYEAAATRLKGIVP 77
Query: 82 LAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHP 132
L V DCT +N+++ N+ G G+P + R E G YD P
Sbjct: 78 L-VKVDCT-----------ANSNTCNKYGVSGYPTLKIFRDGEESGTYDGP 116
>gi|315426927|dbj|BAJ48546.1| aminopeptidase N [Candidatus Caldiarchaeum subterraneum]
gi|315426988|dbj|BAJ48606.1| aminopeptidase N [Candidatus Caldiarchaeum subterraneum]
gi|343485662|dbj|BAJ51316.1| aminopeptidase N [Candidatus Caldiarchaeum subterraneum]
Length = 822
Score = 41.6 bits (96), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 5/111 (4%)
Query: 184 SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNV 243
SM++ + DPE++ L + +P++ ++ +EK G VV + + I +L H V
Sbjct: 505 SMKNRPEHVCVDPELKVLRVLDIERPLEEALSMVEKCGSVVCRIEMIDSLAK--HGGRRV 562
Query: 244 VNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGL--LHLVKFYKSRR 292
V L +S FW V AA AL +EE A L L VK K RR
Sbjct: 563 VEVLERVIS-GDPFWGVSYRAAKALGEIKTEEAKAALLRVLQKVKHPKIRR 612
>gi|315428074|dbj|BAJ49661.1| aminopeptidase N [Candidatus Caldiarchaeum subterraneum]
Length = 822
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 5/111 (4%)
Query: 184 SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNV 243
SM++ + DPE++ L + +P++ ++ +EK G VV + + I +L H V
Sbjct: 505 SMKNRPEHVCVDPELKVLRVLDIERPLEEALSMVEKCGSVVCRIEMIDSLAK--HGGRRV 562
Query: 244 VNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGL--LHLVKFYKSRR 292
V L +S FW V AA AL +EE A L L VK K RR
Sbjct: 563 VEVLERVIS-GDPFWGVSYRAAKALGEIKTEEAKAALLRVLQKVKHPKIRR 612
>gi|344173724|emb|CCA88897.1| D-glycero-D-manno-heptose 1,7-bisphosphate phosphatase [Ralstonia
syzygii R24]
Length = 199
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 67/154 (43%), Gaps = 24/154 (15%)
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
D V LD ++SP WI D +E +AE+ NQ + + G +A A
Sbjct: 13 DGVINLDSDQFIKSPDEWIAIDGSLEAIAEL--NQAGYQVVVATNQSGIGRGLFEAAA-- 68
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDW----AGLLHLVKFYK 289
+N ++ + + A R++A + +TA+E D AG+ H +
Sbjct: 69 ----------LNAMHEKMYKALAAQGGRVDAVFFCPHTAAEGCDCRKPKAGMFHEI---- 114
Query: 290 SRRFDENI-GLPRPND-FRDFSEYFVLEAIPHAV 321
+RRFD ++ G+P D RD + A+PH V
Sbjct: 115 ARRFDVDLTGMPVVGDSLRDLQAGVEVGAVPHLV 148
>gi|84996353|ref|XP_952898.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65303895|emb|CAI76274.1| hypothetical protein, conserved [Theileria annulata]
Length = 1684
Score = 40.8 bits (94), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 46/181 (25%), Positives = 75/181 (41%), Gaps = 22/181 (12%)
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDV---VAQA 227
DN VA + ++ + PL WIR D LA I Q MW QL D ++ + +
Sbjct: 1196 DNRSIVARICSKTRL--PLLWIRVDSSFSLLARIRRCQSGSMWEQQLYSDNNIYSLIEAS 1253
Query: 228 QAIAALEALPHLSFN-----VVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLL 282
QA+ ++ ++S N L+N + K+ +R Y+L + + L
Sbjct: 1254 QALGSIGKSEYISDNPLIESACKRLDNMIKKHKSHPVIRARCMYSLMCLYNRDPRQHKYL 1313
Query: 283 HLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQL 342
H V F K N+ P + R EY+ A+ ++R +P +FVL +
Sbjct: 1314 HGV-FTKYLTSFSNLA-THPTETRFIMEYY------KALCLLRNEFGFTP----DFVLDI 1361
Query: 343 L 343
L
Sbjct: 1362 L 1362
>gi|300692466|ref|YP_003753461.1| D-glycero-D-manno-heptose 1,7-bisphosphate phosphatase [Ralstonia
solanacearum PSI07]
gi|299079526|emb|CBJ52204.1| D-glycero-D-manno-heptose 1,7-bisphosphate phosphatase [Ralstonia
solanacearum PSI07]
Length = 199
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 67/154 (43%), Gaps = 24/154 (15%)
Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
D V LD ++SP WI D +E +AE+ NQ + + G +A A
Sbjct: 13 DGVINLDSDQFIKSPDEWIAIDGSLEAIAEL--NQAGYQVVVATNQSGIGRGLFEAAA-- 68
Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDW----AGLLHLVKFYK 289
+N ++ + + A R++A + +TA+E D AG+ H +
Sbjct: 69 ----------LNAMHEKMYKALAAQGGRVDAVFFCPHTAAEGCDCRKPKAGMFHEI---- 114
Query: 290 SRRFDENI-GLPRPND-FRDFSEYFVLEAIPHAV 321
+RRFD ++ G+P D RD + A+PH V
Sbjct: 115 ARRFDVDLAGVPVVGDSLRDLQAGVEVGAVPHLV 148
>gi|154284686|ref|XP_001543138.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150406779|gb|EDN02320.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 927
Score = 40.8 bits (94), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 3/70 (4%)
Query: 46 LERPFL--KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT 103
L PFL E+ WV + P+LR + Y+ +N VEL++ ++ S +L S T
Sbjct: 146 LLEPFLHTSEYLQTWVISTALPLLRFNYEYHW-ENDVELSLDIIQSLDSGSAVNILLSKT 204
Query: 104 DSENRDGDIG 113
+S NR GD+G
Sbjct: 205 ESRNRGGDVG 214
>gi|429329327|gb|AFZ81086.1| bromodomain-containing protein [Babesia equi]
Length = 1659
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 2/63 (3%)
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
DN VA + ++ + PL WIR DP ++ + Q MW QL D ++ AQ +A
Sbjct: 998 DNRSVVAKVCSKTRL--PLLWIRIDPNFSTISRMRRCQSSGMWEQQLLSDNNIFAQMEAA 1055
Query: 231 AAL 233
AL
Sbjct: 1056 CAL 1058
>gi|375148747|ref|YP_005011188.1| Peptidase M1 membrane alanine aminopeptidase [Niastella koreensis
GR20-10]
gi|361062793|gb|AEW01785.1| Peptidase M1 membrane alanine aminopeptidase [Niastella koreensis
GR20-10]
Length = 531
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 7/86 (8%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
ML +Q+G F K +Q SR G + V K + K L+ FF +W+
Sbjct: 392 MLRRQLGDELFWKGIQTYYSRFSGKNAVTGDLCKAMEDVSGKS-------LQPFFQQWLF 444
Query: 61 TCGCPVLRMGFSYNKRKNIVELAVLR 86
G P L + ++YN K V + V++
Sbjct: 445 RAGQPKLDLTWTYNATKKEVNITVIQ 470
>gi|408406094|ref|YP_006864078.1| membrane alanyl aminopeptidase [Candidatus Nitrososphaera gargensis
Ga9.2]
gi|408366690|gb|AFU60420.1| putative membrane alanyl aminopeptidase [Candidatus Nitrososphaera
gargensis Ga9.2]
Length = 843
Score = 40.4 bits (93), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 64/262 (24%), Positives = 99/262 (37%), Gaps = 75/262 (28%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLER-PFLKEFFPRWV 59
ML +G +FR+ L+ + R + T T + R K+ LE L++FF +W+
Sbjct: 412 MLRHHVGDKYFRRSLKTYLQRFANS----TAETDDLR----KIFELETGKSLQQFFDQWI 463
Query: 60 GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
G P L++ FS +DGDI
Sbjct: 464 FREGHPELKVDFS-----------------------------------QDGDIA-----K 483
Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
I+V ++ AGD E LA R K + K ++V +
Sbjct: 484 IKVEQV----------QAGDP---FEFALDVNLAFARGKKTRYTFKIS----EKESVFQI 526
Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
+ S +E W DPE + L I P +M + QL +DGD V + + A AL
Sbjct: 527 PVDSEVE----WFSVDPEFKILKTISIRAPKEMLVRQL-RDGDTV--IERVEAARALKDK 579
Query: 240 SFN-VVNTLNNFLSDSKAFWRV 260
S + V++ L + + K FW V
Sbjct: 580 SSDAVIDALKDAVLHDK-FWGV 600
>gi|442317662|ref|YP_007357683.1| M1 family peptidase [Myxococcus stipitatus DSM 14675]
gi|441485304|gb|AGC41999.1| M1 family peptidase [Myxococcus stipitatus DSM 14675]
Length = 873
Score = 40.0 bits (92), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 2/84 (2%)
Query: 192 IRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL 251
+R DP + L + ++ V +W+ +L + A+ +A AL F V L L
Sbjct: 530 VRVDPRRDVLGTLDVDKSVGLWMEELRSAPESRARTEAAHALGR--DGGFRAVEALGAAL 587
Query: 252 SDSKAFWRVRIEAAYALANTASEE 275
D++ FW R +A AL + E
Sbjct: 588 GDARLFWGTRAASARALGRIRTPE 611
>gi|401410606|ref|XP_003884751.1| hypothetical protein NCLIV_051480 [Neospora caninum Liverpool]
gi|325119169|emb|CBZ54721.1| hypothetical protein NCLIV_051480 [Neospora caninum Liverpool]
Length = 3109
Score = 40.0 bits (92), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 3/87 (3%)
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
DN VA ++ ++ + P+ W+RADP ++ I Q MW QL D ++ AQ +A
Sbjct: 1814 DNRSLVAKVNSKTKL--PVLWLRADPMQVWIGRIRRCQSGSMWEQQLLNDVNISAQLEAA 1871
Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAF 257
AL ++ +++ TL++ + S A
Sbjct: 1872 LALGSI-RSGEDILGTLSDPAAGSGAL 1897
>gi|399217801|emb|CCF74688.1| unnamed protein product [Babesia microti strain RI]
Length = 1529
Score = 39.7 bits (91), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 2/59 (3%)
Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
DN VA + +S + P+ W RADP+ + + Q MW QL D DVV Q +A
Sbjct: 887 DNRSLVAKVCSKSRL--PVLWTRADPQFALIGRLRRCQSPAMWEQQLLADNDVVGQMEA 943
>gi|28839716|gb|AAH47732.1| TAF2 protein, partial [Homo sapiens]
Length = 604
Score = 39.3 bits (90), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 32/143 (22%), Positives = 62/143 (43%), Gaps = 32/143 (22%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL +K D +P G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELE------IKQDYTSP------------G 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPI 133
+ G + + V ELDG ++H +
Sbjct: 557 TQKYVGPLKVTVQELDGSFNHTL 579
>gi|27370696|gb|AAH35673.1| TAF2 protein, partial [Homo sapiens]
Length = 604
Score = 39.3 bits (90), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 32/143 (22%), Positives = 62/143 (43%), Gaps = 32/143 (22%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL +K D +P G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELE------IKQDYTSP------------G 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPI 133
+ G + + V ELDG ++H +
Sbjct: 557 TQKYVGPLKVTVQELDGSFNHTL 579
>gi|40674143|gb|AAH64830.1| TAF2 protein, partial [Homo sapiens]
Length = 603
Score = 39.3 bits (90), Expect = 7.3, Method: Compositional matrix adjust.
Identities = 32/143 (22%), Positives = 62/143 (43%), Gaps = 32/143 (22%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL +K D +P G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELE------IKQDYTSP------------G 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPI 133
+ G + + V ELDG ++H +
Sbjct: 557 TQKYVGPLKVTVQELDGSFNHTL 579
>gi|76779336|gb|AAI05766.1| Taf2 protein, partial [Rattus norvegicus]
Length = 600
Score = 39.3 bits (90), Expect = 8.2, Method: Compositional matrix adjust.
Identities = 37/160 (23%), Positives = 72/160 (45%), Gaps = 34/160 (21%)
Query: 1 MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
++E ++ F ++ ++S A AS + +ST F + +N G +P
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 518
Query: 51 LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
+K+ WV G F++N+++N++EL + +D T +P G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 556
Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHS 150
+ G + + V ELDG ++H L + ++ + +I CHS
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLK-HDIPCHS 594
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.322 0.137 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,756,617,159
Number of Sequences: 23463169
Number of extensions: 441596528
Number of successful extensions: 970388
Number of sequences better than 100.0: 404
Number of HSP's better than 100.0 without gapping: 339
Number of HSP's successfully gapped in prelim test: 65
Number of HSP's that attempted gapping in prelim test: 968614
Number of HSP's gapped (non-prelim): 635
length of query: 707
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 557
effective length of database: 8,839,720,017
effective search space: 4923724049469
effective search space used: 4923724049469
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 81 (35.8 bits)