BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 005234
         (707 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225455842|ref|XP_002273382.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
            isoform 2 [Vitis vinifera]
          Length = 1345

 Score = 1027 bits (2656), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 494/617 (80%), Positives = 540/617 (87%), Gaps = 3/617 (0%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLEKQMG   FRKILQ I+ RAQ  +  +RTLSTKEFRHFANKVGNLERPFLKEFFPRWV
Sbjct: 449  MLEKQMGPESFRKILQTIVFRAQDTTRSLRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 508

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
            G+CGCPVLR G SYNKRKN+VELAVLR CT  PD+ T VL+ N DSENR+ DIGWPGMMS
Sbjct: 509  GSCGCPVLRAGLSYNKRKNLVELAVLRGCTAAPDTNTMVLNGNIDSENREVDIGWPGMMS 568

Query: 120  IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
            IRVHELDGMYDHPILPMAG+ WQLLEIQCHSKLAARR  KPKKGSKPDG DDNGD V  +
Sbjct: 569  IRVHELDGMYDHPILPMAGETWQLLEIQCHSKLAARRFQKPKKGSKPDGSDDNGD-VPAV 627

Query: 180  DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            DMRS+ ESPL W+R DPE+EYLAEIHFNQP QMWINQLE+D DVVAQAQAIA LEALP L
Sbjct: 628  DMRSNTESPLLWLRVDPELEYLAEIHFNQPAQMWINQLERDKDVVAQAQAIATLEALPQL 687

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
            SF+VVN LNNFLSDSKAFWRVRIEAA+ALANTASEETDWAGLLHLVKFYKSRRFD NIGL
Sbjct: 688  SFSVVNALNNFLSDSKAFWRVRIEAAFALANTASEETDWAGLLHLVKFYKSRRFDANIGL 747

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+PNDF DF EYFVLEAIPHA+AMVRAAD KSPREAVEFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 748  PKPNDFHDFPEYFVLEAIPHAIAMVRAADKKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 807

Query: 360  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQI LKLS
Sbjct: 808  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIGLKLS 867

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
            GFI LD+V++L+KPFRDF  IWQVR+EASRALL LEFH  GID+ALSLFIK VEEEPS+R
Sbjct: 868  GFIPLDRVIELVKPFRDFQAIWQVRIEASRALLGLEFHFKGIDAALSLFIKYVEEEPSIR 927

Query: 480  GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            GQVKLG+HAMR+CQIKGGS+S++++ + TLVALL LLESRIAFNNVFLRHHLF IL+ILA
Sbjct: 928  GQVKLGVHAMRLCQIKGGSESDNDIKSSTLVALLRLLESRIAFNNVFLRHHLFCILRILA 987

Query: 540  GRAPTLYGVPRDKLLLLGDGE-TSEQKNVFASFVTEMRRAEPPMDVPNLSQDNLAVRDAS 598
            GR PTLYGVPRD++  +   E  SEQKN F + V E +  EPP+D PN+S D LA+ +AS
Sbjct: 988  GRLPTLYGVPRDQIPQMDPAEICSEQKNGFITIVKETKSLEPPVDTPNVSHDGLALPEAS 1047

Query: 599  KEVDCVANGHAENILAV 615
            +E D V+N H   +  V
Sbjct: 1048 READTVSNSHERKMPVV 1064


>gi|225455844|ref|XP_002273351.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
            isoform 1 [Vitis vinifera]
          Length = 1325

 Score =  984 bits (2544), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 477/617 (77%), Positives = 522/617 (84%), Gaps = 23/617 (3%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLEKQMG   FRKILQ I+ RAQ  +  +RTLSTKEFRHFANKVGNLERPFLKEFFPRWV
Sbjct: 449  MLEKQMGPESFRKILQTIVFRAQDTTRSLRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 508

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
            G+CGCPVLR G SYNKRKN+VELAVLR CT  PD+ T VL+ N DSENR+ DIGWPGMMS
Sbjct: 509  GSCGCPVLRAGLSYNKRKNLVELAVLRGCTAAPDTNTMVLNGNIDSENREVDIGWPGMMS 568

Query: 120  IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
            IRVHELDGMYDHPILPMAG+ WQLLEIQCHSKLAARR  KPKKGSKPDG DDNGD V  +
Sbjct: 569  IRVHELDGMYDHPILPMAGETWQLLEIQCHSKLAARRFQKPKKGSKPDGSDDNGD-VPAV 627

Query: 180  DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            DMRS+ ESPL W+R DPE+EYLAEIHFNQP QMWINQLE+D DVVAQAQAIA LEALP L
Sbjct: 628  DMRSNTESPLLWLRVDPELEYLAEIHFNQPAQMWINQLERDKDVVAQAQAIATLEALPQL 687

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
            SF+VVN LNNFLSDSKAFWRVRIEAA+ALANTASEETDWAGLLHLVKFYKSRRFD NIGL
Sbjct: 688  SFSVVNALNNFLSDSKAFWRVRIEAAFALANTASEETDWAGLLHLVKFYKSRRFDANIGL 747

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+PNDF DF EYFVLEAIPHA+AMVRAAD KSPREAVEFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 748  PKPNDFHDFPEYFVLEAIPHAIAMVRAADKKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 807

Query: 360  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQI LKLS
Sbjct: 808  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIGLKLS 867

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
            GFI LD+V++L+KPFRDF  IWQVR+EASRALL LEFH                     +
Sbjct: 868  GFIPLDRVIELVKPFRDFQAIWQVRIEASRALLGLEFH--------------------FK 907

Query: 480  GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            GQVKLG+HAMR+CQIKGGS+S++++ + TLVALL LLESRIAFNNVFLRHHLF IL+ILA
Sbjct: 908  GQVKLGVHAMRLCQIKGGSESDNDIKSSTLVALLRLLESRIAFNNVFLRHHLFCILRILA 967

Query: 540  GRAPTLYGVPRDKLLLLGDGE-TSEQKNVFASFVTEMRRAEPPMDVPNLSQDNLAVRDAS 598
            GR PTLYGVPRD++  +   E  SEQKN F + V E +  EPP+D PN+S D LA+ +AS
Sbjct: 968  GRLPTLYGVPRDQIPQMDPAEICSEQKNGFITIVKETKSLEPPVDTPNVSHDGLALPEAS 1027

Query: 599  KEVDCVANGHAENILAV 615
            +E D V+N H   +  V
Sbjct: 1028 READTVSNSHERKMPVV 1044


>gi|297734162|emb|CBI15409.3| unnamed protein product [Vitis vinifera]
          Length = 1245

 Score =  973 bits (2516), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 467/554 (84%), Positives = 503/554 (90%), Gaps = 2/554 (0%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLEKQMG   FRKILQ I+ RAQ  +  +RTLSTKEFRHFANKVGNLERPFLKEFFPRWV
Sbjct: 670  MLEKQMGPESFRKILQTIVFRAQDTTRSLRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 729

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
            G+CGCPVLR G SYNKRKN+VELAVLR CT  PD+ T VL+ N DSENR+ DIGWPGMMS
Sbjct: 730  GSCGCPVLRAGLSYNKRKNLVELAVLRGCTAAPDTNTMVLNGNIDSENREVDIGWPGMMS 789

Query: 120  IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
            IRVHELDGMYDHPILPMAG+ WQLLEIQCHSKLAARR  KPKKGSKPDG DDNGD V  +
Sbjct: 790  IRVHELDGMYDHPILPMAGETWQLLEIQCHSKLAARRFQKPKKGSKPDGSDDNGD-VPAV 848

Query: 180  DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            DMRS+ ESPL W+R DPE+EYLAEIHFNQP QMWINQLE+D DVVAQAQAIA LEALP L
Sbjct: 849  DMRSNTESPLLWLRVDPELEYLAEIHFNQPAQMWINQLERDKDVVAQAQAIATLEALPQL 908

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
            SF+VVN LNNFLSDSKAFWRVRIEAA+ALANTASEETDWAGLLHLVKFYKSRRFD NIGL
Sbjct: 909  SFSVVNALNNFLSDSKAFWRVRIEAAFALANTASEETDWAGLLHLVKFYKSRRFDANIGL 968

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+PNDF DF EYFVLEAIPHA+AMVRAAD KSPREAVEFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 969  PKPNDFHDFPEYFVLEAIPHAIAMVRAADKKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 1028

Query: 360  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQI LKLS
Sbjct: 1029 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIGLKLS 1088

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
            GFI LD+V++L+KPFRDF  IWQVR+EASRALL LEFH  GID+ALSLFIK VEEEPS+R
Sbjct: 1089 GFIPLDRVIELVKPFRDFQAIWQVRIEASRALLGLEFHFKGIDAALSLFIKYVEEEPSIR 1148

Query: 480  GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            GQVKLG+HAMR+CQIKGGS+S++++ + TLVALL LLESRIAFNNVFLRHHLF IL+ILA
Sbjct: 1149 GQVKLGVHAMRLCQIKGGSESDNDIKSSTLVALLRLLESRIAFNNVFLRHHLFCILRILA 1208

Query: 540  GRAPTLYGVPRDKL 553
            GR PTLYGVPRD++
Sbjct: 1209 GRLPTLYGVPRDQI 1222


>gi|224132980|ref|XP_002321457.1| predicted protein [Populus trichocarpa]
 gi|222868453|gb|EEF05584.1| predicted protein [Populus trichocarpa]
          Length = 1359

 Score =  962 bits (2488), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 472/628 (75%), Positives = 529/628 (84%), Gaps = 5/628 (0%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            MLEKQMG  FFRKILQ +ISRA+   PVR+LSTKEFRHFA KVGNLERPF+KEFF RWV 
Sbjct: 447  MLEKQMGPEFFRKILQKVISRARDTIPVRSLSTKEFRHFATKVGNLERPFVKEFFLRWVC 506

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
            +CGCPVLRMGFSYNKRKN+VELAVLR+ T  PD+    L  N DSENR+GDIGWPGMMSI
Sbjct: 507  SCGCPVLRMGFSYNKRKNMVELAVLREFTAAPDANASFL--NLDSENREGDIGWPGMMSI 564

Query: 121  RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
            RV+ELDGMYDHP+LP+AG+ WQLLEIQCHSKLAARR  KPKK SKPDG D+NGD  A  D
Sbjct: 565  RVYELDGMYDHPVLPLAGEMWQLLEIQCHSKLAARRFQKPKKSSKPDGFDENGDVPAS-D 623

Query: 181  MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
            MRSS+ESPLSWIRADPEMEYLAEIHFNQP+QMWINQLE+D DVVAQAQAIAAL+ LP LS
Sbjct: 624  MRSSLESPLSWIRADPEMEYLAEIHFNQPIQMWINQLERDEDVVAQAQAIAALKTLPQLS 683

Query: 241  FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
            F+V N +NNFL+D+KAFWRVRIE A+ALANTASEE DWAGLLHLVKFYKSRRFD  IGLP
Sbjct: 684  FSVTNAMNNFLNDTKAFWRVRIETAFALANTASEENDWAGLLHLVKFYKSRRFDAAIGLP 743

Query: 301  RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
            +PNDF DF EYFVLEAIPHAVA VRAAD KSPREAVEF+LQLLKYNDN GNPYSDVFWLA
Sbjct: 744  KPNDFHDFPEYFVLEAIPHAVAKVRAADKKSPREAVEFILQLLKYNDNTGNPYSDVFWLA 803

Query: 361  ALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSG 420
            ALVQSVGELEFGQQ++LFLSSLLKRID LLQFDRLM SYNGILTISCIRTLTQIALKLSG
Sbjct: 804  ALVQSVGELEFGQQTVLFLSSLLKRIDCLLQFDRLMLSYNGILTISCIRTLTQIALKLSG 863

Query: 421  FISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRG 480
             I  D V +LIKPFRDF TIWQ+R+EASRALLDLEFHC G+D+ALSLFI  +EEEPSLRG
Sbjct: 864  SIHHDHVFELIKPFRDFKTIWQIRIEASRALLDLEFHCKGMDAALSLFITYLEEEPSLRG 923

Query: 481  QVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILAG 540
            Q KLG HAMR+CQI+  SDS   +   TL+AL+ LLE  I FNN  LRHHLF ILQILAG
Sbjct: 924  QAKLGAHAMRLCQIQDESDSEDAIKCTTLLALIRLLEGHIGFNNTILRHHLFCILQILAG 983

Query: 541  RAPTLYGVPRDKLLLLGDGET-SEQKNVFASFVTEMRRAEPPMDVPNLSQDNLAVRDASK 599
            RA TLYG+PRD+ L +GD ET S+ +N+FA  VTE +  EPPM++P L+QDN A  +A K
Sbjct: 984  RAATLYGIPRDRTLCIGDSETCSDPRNIFAGLVTETKPLEPPMEIPKLAQDNFAFPEAIK 1043

Query: 600  EVDCVANGHAENI-LAVPEAQRMQMSFP 626
            E D ++N     + +A+PE    +++ P
Sbjct: 1044 EADIISNKDQHKMDMAIPEEASEEVAVP 1071


>gi|356563105|ref|XP_003549806.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
            isoform 1 [Glycine max]
          Length = 1388

 Score =  908 bits (2346), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 455/593 (76%), Positives = 512/593 (86%), Gaps = 5/593 (0%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLEKQMG   FR+ILQ I+SRAQ  +  ++TLSTKEFRHFANKVGNLERPFLK+FFPRWV
Sbjct: 450  MLEKQMGPESFRRILQTIVSRAQDKTRSIKTLSTKEFRHFANKVGNLERPFLKDFFPRWV 509

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
            G+CGCPVLRMGFSYNKRKN+VELAVLR CT    S T +L  N D+E RDGD GWPGMMS
Sbjct: 510  GSCGCPVLRMGFSYNKRKNMVELAVLRGCTALQTSNTSILDINPDTETRDGDTGWPGMMS 569

Query: 120  IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
            IRV+ELDGMYDHPILPMAGDAWQLLEIQCHSKLAARR  KPKKG K DG DDNGD V  +
Sbjct: 570  IRVYELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRFQKPKKGLKLDGSDDNGD-VPSM 628

Query: 180  DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            DMRS+ ESPL WIRADP+MEYLAE+HFNQPVQMWINQLEKD DV+AQAQAIAALEA P L
Sbjct: 629  DMRSNTESPLLWIRADPDMEYLAEVHFNQPVQMWINQLEKDKDVIAQAQAIAALEASPQL 688

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
            SF++VN LNNFLSDSKAFWRVRIEAA+ALAN+ASEETD++GLLHL+KFYKSRRFD +IGL
Sbjct: 689  SFSIVNALNNFLSDSKAFWRVRIEAAFALANSASEETDFSGLLHLMKFYKSRRFDTDIGL 748

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+PNDF DF+EYFVLEAIPHAVAMVRAAD KSPREA+EFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 749  PKPNDFHDFAEYFVLEAIPHAVAMVRAADKKSPREAIEFVLQLLKYNDNNGNPYSDVFWL 808

Query: 360  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            AALVQSVGELEFGQQSIL LSSLLKRIDRLLQFD LMPSYNGILTISCIRTLTQIALKLS
Sbjct: 809  AALVQSVGELEFGQQSILLLSSLLKRIDRLLQFDSLMPSYNGILTISCIRTLTQIALKLS 868

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
            GFI LD+V  L+KPFRD   +WQVR+EASRALLDLEFHC G+DSAL LFIK +EEE SLR
Sbjct: 869  GFIPLDRVYGLVKPFRDIKALWQVRIEASRALLDLEFHCKGMDSALLLFIKYIEEEHSLR 928

Query: 480  GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            GQ+KL  H MR+CQ++ G +SN E+ + TLV++LNLLE RIAFNN FLRH+LF ILQILA
Sbjct: 929  GQLKLATHVMRLCQMRDGLNSNDEITSQTLVSMLNLLEGRIAFNNAFLRHYLFCILQILA 988

Query: 540  GRAPTLYGVPRD-KLLLLGDGETSE-QKNVFASFVTEMRRAEPPMDVPNLSQD 590
             R PTL+G+PR+ ++L +   E S  QKN+ A   +E +  + P  + +L+Q+
Sbjct: 989  RRHPTLHGIPRENRMLHMSLTEASNYQKNMLA-LDSESKPLDLPSSIDDLTQN 1040


>gi|356513902|ref|XP_003525647.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
            [Glycine max]
          Length = 1388

 Score =  905 bits (2338), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 461/618 (74%), Positives = 522/618 (84%), Gaps = 15/618 (2%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLEKQMG   FR+ILQ I+SRAQ  +  ++TLSTKEFRHFANKVGNLERPFLK+FFPRWV
Sbjct: 450  MLEKQMGPESFRRILQTIVSRAQDKTRSIKTLSTKEFRHFANKVGNLERPFLKDFFPRWV 509

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
             +CGCPVLRMGFSYNKRKN+VELAVLR CT    S T +L  N D+E RDGDIGWPGMMS
Sbjct: 510  SSCGCPVLRMGFSYNKRKNMVELAVLRGCTTLQTSSTSILDINPDTETRDGDIGWPGMMS 569

Query: 120  IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
            IRV+ELDGMYDHPILPMAG+AWQLLEIQCHSKLAARR  KPKKG K DG DDNGD V  +
Sbjct: 570  IRVYELDGMYDHPILPMAGEAWQLLEIQCHSKLAARRFQKPKKGLKLDGSDDNGD-VPSM 628

Query: 180  DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            DMR + ESPL WIRADP+MEYLAE+HFNQPVQMWINQLEKD DV+AQAQAIAALEA P L
Sbjct: 629  DMRLNTESPLLWIRADPDMEYLAEVHFNQPVQMWINQLEKDKDVIAQAQAIAALEASPQL 688

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
            SF++VN LNNFLSDSKAFWRVRIEAA+ALAN+ASEETD++GLLHLVKFYKSRRFD +IGL
Sbjct: 689  SFSIVNALNNFLSDSKAFWRVRIEAAFALANSASEETDFSGLLHLVKFYKSRRFDPDIGL 748

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+PNDF+DF+EYFVLEAIPHAVAMVRAAD KSPREA+EFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 749  PKPNDFQDFAEYFVLEAIPHAVAMVRAADKKSPREAIEFVLQLLKYNDNNGNPYSDVFWL 808

Query: 360  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            AALVQSVGELEFGQQSIL LSSLLKRIDRLLQFD LMPSYNGILTISCIRTLTQIALKLS
Sbjct: 809  AALVQSVGELEFGQQSILLLSSLLKRIDRLLQFDSLMPSYNGILTISCIRTLTQIALKLS 868

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
            GFI LD+V +L+KPFRD   +WQV++EAS+ALLDLEFHC G+DSAL LFIK +EEE SLR
Sbjct: 869  GFIPLDRVYELVKPFRDLKALWQVQIEASKALLDLEFHCKGMDSALLLFIKYIEEEHSLR 928

Query: 480  GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            GQ+KL  H MR+CQ++ G +SN E+ + TLV++LNLLE RIAFNNV LRH+LF ILQILA
Sbjct: 929  GQLKLATHVMRLCQMRDGLNSNDEITSQTLVSMLNLLEGRIAFNNVSLRHYLFCILQILA 988

Query: 540  GRAPTLYGVPR-DKLLLLGDGETSE-QKNVFASFVTEMRRAEPPMDVPNLSQDNL----- 592
             R PTL+G+PR +++L +   E    QKN+FA   +E +  + P    NL+Q NL     
Sbjct: 989  RRPPTLHGIPRGNRMLHMSLAEACNYQKNIFA-LDSESKPLDLPSSTKNLTQ-NLGPTME 1046

Query: 593  ----AVRDASKEVDCVAN 606
                AV +A K+  C A+
Sbjct: 1047 GLRDAVDEAPKDQPCEAS 1064


>gi|449439379|ref|XP_004137463.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
            [Cucumis sativus]
          Length = 1362

 Score =  903 bits (2333), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 449/646 (69%), Positives = 517/646 (80%), Gaps = 33/646 (5%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQG-ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLEKQMG   FRKILQNI+S A+   S  + LSTKEFR  ANK+GNLERPFLKEFFPRWV
Sbjct: 448  MLEKQMGPESFRKILQNIVSHAKDTGSTSQLLSTKEFRQLANKIGNLERPFLKEFFPRWV 507

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
             +CGCP+LRMGFSYNKRKN+VE+AV R+CT  P          T+ ENRD D GWPGMMS
Sbjct: 508  ESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP---------ATNVENRDSDAGWPGMMS 558

Query: 120  IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
            IR++ELDG++DHP+LPM G++WQLLEIQCHSKLAARR  K KKGSKPDG DDN D +  L
Sbjct: 559  IRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTKKGSKPDGSDDNAD-IPAL 617

Query: 180  DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            D+RSS+ESPL W+RADPEMEYLAEIHF+QPVQMWINQLEKD DV+AQAQAIA LE LP  
Sbjct: 618  DIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKDVIAQAQAIATLEMLPQP 677

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
            SF++VN LNNFL D KAFWRVRIEAA A+A TASE+TDWAGLL+L+KF+KS+RFD + GL
Sbjct: 678  SFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLLNLIKFFKSQRFDADTGL 737

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+PN+FRDF EYFVLEAIPHAVAMVR  D KSPREAVEFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 738  PKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 797

Query: 360  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            AALVQSVGELEFGQQSILFL+SLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS
Sbjct: 798  AALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 857

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
            G +SLD++++LI+PFRDFN++WQVR+EA+R+LLDLE+HCNGID+ L LFIK +EEE SLR
Sbjct: 858  GLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGIDATLLLFIKYLEEENSLR 917

Query: 480  GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            GQVKL +H MR+CQI   S SN  V+  TLVALL LLE  +AFNNV+LRH+LF ILQ+L+
Sbjct: 918  GQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAFNNVYLRHYLFSILQVLS 977

Query: 540  GRAPTLYGVPRD-KLLLLGD-GETSEQKNVFASFVTEMRRAEP----------------- 580
            GR+PTLYGVPR+ K L +GD G  SEQK +  S + E    EP                 
Sbjct: 978  GRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSIIPEFNPPEPSSVSAVAPMPCIPATLS 1037

Query: 581  --PMDVPNLSQDNLAVRDASKEVDCVANGHAENILAVPEAQRMQMS 624
              P+ VP    DNLAV + SKE   +A    +  +A+ EA R   S
Sbjct: 1038 SEPLHVPTPRPDNLAVPELSKEEGAIAED-PKQAMAIVEAPREAAS 1082


>gi|449516818|ref|XP_004165443.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
            [Cucumis sativus]
          Length = 1362

 Score =  901 bits (2329), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 449/646 (69%), Positives = 517/646 (80%), Gaps = 33/646 (5%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQG-ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLEKQMG   FRKILQNI+S A+   S  + LSTKEFR  ANK+GNLERPFLKEFFPRWV
Sbjct: 448  MLEKQMGPESFRKILQNIVSHAKDTGSTSQLLSTKEFRQLANKIGNLERPFLKEFFPRWV 507

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
             +CGCP+LRMGFSYNKRKN+VE+AV R+CT  P          T+ ENRD D GWPGMMS
Sbjct: 508  ESCGCPLLRMGFSYNKRKNMVEMAVSRECTATP---------ATNVENRDSDAGWPGMMS 558

Query: 120  IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
            IR++ELDG++DHP+LPM G++WQLLEIQCHSKLAARR  K KKGSKPDG DDN D +  L
Sbjct: 559  IRIYELDGVFDHPVLPMTGESWQLLEIQCHSKLAARRLQKTKKGSKPDGSDDNAD-IPAL 617

Query: 180  DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            D+RSS+ESPL W+RADPEMEYLAEIHF+QPVQMWINQLEKD DV+AQAQAIA LE LP  
Sbjct: 618  DIRSSVESPLLWLRADPEMEYLAEIHFHQPVQMWINQLEKDKDVIAQAQAIATLEMLPQP 677

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
            SF++VN LNNFL D KAFWRVRIEAA A+A TASE+TDWAGLL+L+KF+KS+RFD + GL
Sbjct: 678  SFSIVNALNNFLKDPKAFWRVRIEAALAMAKTASEDTDWAGLLNLIKFFKSQRFDADTGL 737

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+PN+FRDF EYFVLEAIPHAVAMVR  D KSPREAVEFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 738  PKPNEFRDFPEYFVLEAIPHAVAMVRGTDQKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 797

Query: 360  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            AALVQSVGELEFGQQSILFL+SLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS
Sbjct: 798  AALVQSVGELEFGQQSILFLASLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 857

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
            G +SLD++++LI+PFRDFN++WQVR+EA+R+LLDLE+HCNGID+ L LFIK +EEE SLR
Sbjct: 858  GLLSLDRIIELIRPFRDFNSMWQVRIEATRSLLDLEYHCNGIDATLLLFIKYLEEENSLR 917

Query: 480  GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            GQVKL +H MR+CQI   S SN  V+  TLVALL LLE  +AFNNV+LRH+LF ILQ+L+
Sbjct: 918  GQVKLAVHVMRLCQIMRRSGSNDVVNNDTLVALLLLLEGNMAFNNVYLRHYLFCILQVLS 977

Query: 540  GRAPTLYGVPRD-KLLLLGD-GETSEQKNVFASFVTEMRRAEP----------------- 580
            GR+PTLYGVPR+ K L +GD G  SEQK +  S + E    EP                 
Sbjct: 978  GRSPTLYGVPREYKTLHMGDTGTFSEQKRMLTSLIPEFNPPEPSSVSAVAPMPCIPATLS 1037

Query: 581  --PMDVPNLSQDNLAVRDASKEVDCVANGHAENILAVPEAQRMQMS 624
              P+ VP    DNLAV + SKE   +A    +  +A+ EA R   S
Sbjct: 1038 SEPLHVPTPRPDNLAVPELSKEEGAIAED-PKQAMAIVEAPREAAS 1082


>gi|356563107|ref|XP_003549807.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
            isoform 2 [Glycine max]
          Length = 1368

 Score =  871 bits (2250), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 440/593 (74%), Positives = 496/593 (83%), Gaps = 25/593 (4%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLEKQMG   FR+ILQ I+SRAQ  +  ++TLSTKEFRHFANKVGNLERPFLK+FFPRWV
Sbjct: 450  MLEKQMGPESFRRILQTIVSRAQDKTRSIKTLSTKEFRHFANKVGNLERPFLKDFFPRWV 509

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
            G+CGCPVLRMGFSYNKRKN+VELAVLR CT    S T +L  N D+E RDGD GWPGMMS
Sbjct: 510  GSCGCPVLRMGFSYNKRKNMVELAVLRGCTALQTSNTSILDINPDTETRDGDTGWPGMMS 569

Query: 120  IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
            IRV+ELDGMYDHPILPMAGDAWQLLEIQCHSKLAARR  KPKKG K DG DDNGD V  +
Sbjct: 570  IRVYELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRFQKPKKGLKLDGSDDNGD-VPSM 628

Query: 180  DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            DMRS+ ESPL WIRADP+MEYLAE+HFNQPVQMWINQLEKD DV+AQAQAIAALEA P L
Sbjct: 629  DMRSNTESPLLWIRADPDMEYLAEVHFNQPVQMWINQLEKDKDVIAQAQAIAALEASPQL 688

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
            SF++VN LNNFLSDSKAFWRVRIEAA+ALAN+ASEETD++GLLHL+KFYKSRRFD +IGL
Sbjct: 689  SFSIVNALNNFLSDSKAFWRVRIEAAFALANSASEETDFSGLLHLMKFYKSRRFDTDIGL 748

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+PNDF DF+EYFVLEAIPHAVAMVRAAD KSPREA+EFVLQLLKYNDNNGNPYSDVFWL
Sbjct: 749  PKPNDFHDFAEYFVLEAIPHAVAMVRAADKKSPREAIEFVLQLLKYNDNNGNPYSDVFWL 808

Query: 360  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            AALVQSVGELEFGQQSIL LSSLLKRIDRLLQFD LMPSYNGILTISCIRTLTQIALKLS
Sbjct: 809  AALVQSVGELEFGQQSILLLSSLLKRIDRLLQFDSLMPSYNGILTISCIRTLTQIALKLS 868

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
            GFI LD+V  L+KPFRD   +WQVR+EASRALLDLEFHC                    +
Sbjct: 869  GFIPLDRVYGLVKPFRDIKALWQVRIEASRALLDLEFHC--------------------K 908

Query: 480  GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            GQ+KL  H MR+CQ++ G +SN E+ + TLV++LNLLE RIAFNN FLRH+LF ILQILA
Sbjct: 909  GQLKLATHVMRLCQMRDGLNSNDEITSQTLVSMLNLLEGRIAFNNAFLRHYLFCILQILA 968

Query: 540  GRAPTLYGVPRD-KLLLLGDGETSE-QKNVFASFVTEMRRAEPPMDVPNLSQD 590
             R PTL+G+PR+ ++L +   E S  QKN+ A   +E +  + P  + +L+Q+
Sbjct: 969  RRHPTLHGIPRENRMLHMSLTEASNYQKNMLA-LDSESKPLDLPSSIDDLTQN 1020


>gi|297842145|ref|XP_002888954.1| membrane alanyl aminopeptidase [Arabidopsis lyrata subsp. lyrata]
 gi|297334795|gb|EFH65213.1| membrane alanyl aminopeptidase [Arabidopsis lyrata subsp. lyrata]
          Length = 1390

 Score =  843 bits (2178), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 410/596 (68%), Positives = 483/596 (81%), Gaps = 4/596 (0%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLEKQMGS+ FRKILQ IISRA+  S  +R+LSTKEFR FANK+GNLERPFLKEFF RWV
Sbjct: 450  MLEKQMGSDSFRKILQKIISRAKDPSNSIRSLSTKEFRQFANKIGNLERPFLKEFFQRWV 509

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
             +CGCPVLR+G SYNKRKN VE+A LR+CT   D+R  V+ + +DSE+RD D GWPG+MS
Sbjct: 510  SSCGCPVLRIGLSYNKRKNNVEMAALRECTAALDARLSVIGATSDSESRDVDAGWPGIMS 569

Query: 120  IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
            IRV+ELDGM DHP LPMAGD WQLLE+ CHSKLAA+R  KPKKG KPDG +DN DA+A L
Sbjct: 570  IRVYELDGMSDHPKLPMAGDRWQLLELPCHSKLAAKRYQKPKKGGKPDGAEDNVDAIAPL 629

Query: 180  DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            + ++S+ESPL+WI+ADPEMEY+AEIH +QP+QMW+NQLEKDGDVVAQAQAIA+LEAL   
Sbjct: 630  ENKTSIESPLAWIKADPEMEYIAEIHLHQPLQMWVNQLEKDGDVVAQAQAIASLEALKQH 689

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
            SF++VN L N LSDSK FWR+RI AA+ALA TASEETDWAGL HL+KFYKSRRFD  IGL
Sbjct: 690  SFSIVNALKNVLSDSKVFWRIRIGAAFALAKTASEETDWAGLQHLIKFYKSRRFDAEIGL 749

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+PN+F DF EYFVLEAIPHA+A VR A+ KSPREAVEF+LQLLKYNDN+GN YSDVFWL
Sbjct: 750  PKPNNFHDFPEYFVLEAIPHAIARVRGAEGKSPREAVEFILQLLKYNDNSGNRYSDVFWL 809

Query: 360  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            A LVQSVG+LEF QQS+ FL+ LLKRIDRLLQFDRLMPSYNGILTISCIRTL Q ALKLS
Sbjct: 810  AVLVQSVGDLEFCQQSLTFLAPLLKRIDRLLQFDRLMPSYNGILTISCIRTLAQTALKLS 869

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
              IS D + KLI+PFR+ +TI Q+R+EASRALLD+E+   GI S L LF+K V EE SLR
Sbjct: 870  DSISFDHICKLIEPFRNSDTILQIRIEASRALLDIEYQSKGISSTLLLFMKYVVEESSLR 929

Query: 480  GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            GQVKL +H MR+CQI  G DS+  VDTV+L+ LL+L +S + FNN  LR++LF I QILA
Sbjct: 930  GQVKLCVHTMRLCQIAVGCDSDDCVDTVSLLDLLHLFKSHVVFNNELLRYYLFCIFQILA 989

Query: 540  GRAPTLYGVPRDKLLLLGD-GETSEQKNVFASFVTEMRRAEPPMDVPNLSQDNLAV 594
            GR PTL+GVP++K L L D     E KNVF+  V      EP + + + +  +L V
Sbjct: 990  GRPPTLFGVPKEKPLQLVDVAACIEPKNVFS--VPGAEAGEPSLALGDANGQSLDV 1043


>gi|22330618|ref|NP_177536.2| TBP-associated factor 2 [Arabidopsis thaliana]
 gi|20856938|gb|AAM26691.1| At1g73960/F2P9_17 [Arabidopsis thaliana]
 gi|332197409|gb|AEE35530.1| TBP-associated factor 2 [Arabidopsis thaliana]
          Length = 1390

 Score =  828 bits (2139), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 405/570 (71%), Positives = 474/570 (83%), Gaps = 2/570 (0%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLEKQMGS+ FRKILQ IISRA+  S  +R+LSTKEFR FANK+GNLERPFLKEFF RWV
Sbjct: 450  MLEKQMGSDSFRKILQKIISRAKDPSNSIRSLSTKEFRQFANKIGNLERPFLKEFFQRWV 509

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
             + GCPVLR+G SYNKRKN VE+A LR+CT   D+R  V+ + +DSE+RD D GWPG+MS
Sbjct: 510  ASYGCPVLRIGLSYNKRKNNVEMAALRECTAALDARLSVIGATSDSESRDVDAGWPGIMS 569

Query: 120  IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
            IRV+ELDGM DHP LPMAGD WQLLE+ CHSKLAA+R  KPKKG KPDG +DN DA+A L
Sbjct: 570  IRVYELDGMSDHPKLPMAGDRWQLLELPCHSKLAAKRYQKPKKGGKPDGAEDNVDAIAPL 629

Query: 180  DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            + ++S+ESPL+WI+ADPEMEY+AEIH +QP+QMW+NQLEKDGDVVAQAQAIA+LEAL   
Sbjct: 630  ENKTSIESPLAWIKADPEMEYIAEIHLHQPLQMWVNQLEKDGDVVAQAQAIASLEALKQH 689

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
            SF++VN L N L+DSK FWR+RI AA+ALA TASEE+DWAGL HL+KFYKSRRFD  IGL
Sbjct: 690  SFSIVNALKNVLTDSKVFWRIRIAAAFALAKTASEESDWAGLQHLIKFYKSRRFDAEIGL 749

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+PNDFRDF EYFVLEAIPHA+A+VR A+ KSPREAVEF+LQLLKYNDN+GN YSDVFWL
Sbjct: 750  PKPNDFRDFPEYFVLEAIPHAIAIVRGAEGKSPREAVEFILQLLKYNDNSGNSYSDVFWL 809

Query: 360  AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            A LVQSVG+LEF QQS+ FL+ LLKRIDRLLQFDRLMPSYNGILTISCIRTL Q ALKLS
Sbjct: 810  AVLVQSVGDLEFCQQSLTFLAPLLKRIDRLLQFDRLMPSYNGILTISCIRTLAQTALKLS 869

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
              IS D + KLI+PFR+ +TI Q+R+E SRALLD+E+   GI SAL LF+K + EE SLR
Sbjct: 870  DSISFDHICKLIEPFRNSDTILQIRIEGSRALLDIEYQSKGISSALLLFMKYLVEESSLR 929

Query: 480  GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            GQVKL +H MR+CQI  G DS+  VDTVTL+ LL+L +S + FNN  LR++LF I QILA
Sbjct: 930  GQVKLCVHTMRLCQIAVGCDSDDCVDTVTLLDLLHLFKSHVVFNNELLRYYLFCIFQILA 989

Query: 540  GRAPTLYGVPRDKLLLLGDGETS-EQKNVF 568
            GR PTL+GVP++K L L D E   E KNVF
Sbjct: 990  GRPPTLFGVPKEKPLQLVDVEACIEPKNVF 1019


>gi|79321265|ref|NP_001031279.1| TBP-associated factor 2 [Arabidopsis thaliana]
 gi|332197410|gb|AEE35531.1| TBP-associated factor 2 [Arabidopsis thaliana]
          Length = 1370

 Score =  796 bits (2055), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/570 (68%), Positives = 460/570 (80%), Gaps = 22/570 (3%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
           MLEKQMGS+ FRKILQ IISRA+  S  +R+LSTKEFR FANK+GNLERPFLKEFF RWV
Sbjct: 450 MLEKQMGSDSFRKILQKIISRAKDPSNSIRSLSTKEFRQFANKIGNLERPFLKEFFQRWV 509

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
            + GCPVLR+G SYNKRKN VE+A LR+CT   D+R  V+ + +DSE+RD D GWPG+MS
Sbjct: 510 ASYGCPVLRIGLSYNKRKNNVEMAALRECTAALDARLSVIGATSDSESRDVDAGWPGIMS 569

Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
           IRV+ELDGM DHP LPMAGD WQLLE+ CHSKLAA+R  KPKKG KPDG +DN DA+A L
Sbjct: 570 IRVYELDGMSDHPKLPMAGDRWQLLELPCHSKLAAKRYQKPKKGGKPDGAEDNVDAIAPL 629

Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
           + ++S+ESPL+WI+ADPEMEY+AEIH +QP+QMW+NQLEKDGDVVAQAQAIA+LEAL   
Sbjct: 630 ENKTSIESPLAWIKADPEMEYIAEIHLHQPLQMWVNQLEKDGDVVAQAQAIASLEALKQH 689

Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
           SF++VN L N L+DSK FWR+RI AA+ALA TASEE+DWAGL HL+KFYKSRRFD  IGL
Sbjct: 690 SFSIVNALKNVLTDSKVFWRIRIAAAFALAKTASEESDWAGLQHLIKFYKSRRFDAEIGL 749

Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
           P+PNDFRDF EYFVLEAIPHA+A+VR A+ KSPREAVEF+LQLLKYNDN+GN YSDVFWL
Sbjct: 750 PKPNDFRDFPEYFVLEAIPHAIAIVRGAEGKSPREAVEFILQLLKYNDNSGNSYSDVFWL 809

Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
           A LVQSVG+LEF QQS+ FL+ LLKRIDRLLQFDRLMPSYNGILTISCIRTL Q ALKLS
Sbjct: 810 AVLVQSVGDLEFCQQSLTFLAPLLKRIDRLLQFDRLMPSYNGILTISCIRTLAQTALKLS 869

Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
             IS D + KLI+PFR+ +TI Q+R+E SRALLD+E+                      +
Sbjct: 870 DSISFDHICKLIEPFRNSDTILQIRIEGSRALLDIEYQS--------------------K 909

Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
           GQVKL +H MR+CQI  G DS+  VDTVTL+ LL+L +S + FNN  LR++LF I QILA
Sbjct: 910 GQVKLCVHTMRLCQIAVGCDSDDCVDTVTLLDLLHLFKSHVVFNNELLRYYLFCIFQILA 969

Query: 540 GRAPTLYGVPRDKLLLLGDGETS-EQKNVF 568
           GR PTL+GVP++K L L D E   E KNVF
Sbjct: 970 GRPPTLFGVPKEKPLQLVDVEACIEPKNVF 999


>gi|38490055|gb|AAR21621.1| TFIID component TAF2 [Arabidopsis thaliana]
          Length = 1370

 Score =  795 bits (2054), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/570 (68%), Positives = 460/570 (80%), Gaps = 22/570 (3%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
           MLEKQMGS+ FRKILQ IISRA+  S  +R+LSTKEFR FANK+GNLERPFLKEFF RWV
Sbjct: 450 MLEKQMGSDSFRKILQKIISRAKDPSNSIRSLSTKEFRQFANKIGNLERPFLKEFFQRWV 509

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
            + GCPVLR+G SYNKRKN VE+A LR+CT   D+R  V+ + +DSE+RD D GWPG+MS
Sbjct: 510 ASYGCPVLRIGLSYNKRKNNVEMAALRECTAALDARLSVIGATSDSESRDVDAGWPGIMS 569

Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
           IRV+ELDGM DHP LPMAGD WQLLE+ CHSKLAA+R  KPKKG KPDG +DN DA+A L
Sbjct: 570 IRVYELDGMSDHPKLPMAGDRWQLLELPCHSKLAAKRYQKPKKGGKPDGAEDNVDAIAPL 629

Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
           + ++S+ESPL+WI+ADPEMEY+AEIH +QP+QMW+NQLEKDGDVVAQAQAIA+LEAL   
Sbjct: 630 ENKTSIESPLAWIKADPEMEYIAEIHLHQPLQMWVNQLEKDGDVVAQAQAIASLEALKQH 689

Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
           SF++VN L N L+DSK FWR+RI AA+ALA TASEE+DWAGL HL+KFYKSRRFD  IGL
Sbjct: 690 SFSIVNALKNVLTDSKVFWRIRIAAAFALAKTASEESDWAGLQHLIKFYKSRRFDAEIGL 749

Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
           P+PNDFRDF EYFVLEAIPHA+A+VR A+ KSPREAVEF+LQLLKYNDN+GN YSDVFWL
Sbjct: 750 PKPNDFRDFPEYFVLEAIPHAIAIVRGAEGKSPREAVEFILQLLKYNDNSGNSYSDVFWL 809

Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
           A LVQSVG+LEF QQS+ FL+ LLKRIDRLLQFDRLMPSYNGILTISCIRTL Q ALKLS
Sbjct: 810 AVLVQSVGDLEFCQQSLTFLAPLLKRIDRLLQFDRLMPSYNGILTISCIRTLAQTALKLS 869

Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
             IS D + KLI+PFR+ +TI Q+R+E SRALLD+E+                      +
Sbjct: 870 DSISFDHICKLIEPFRNSDTILQIRIEGSRALLDIEYQS--------------------K 909

Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
           GQVKL +H MR+CQI  G DS+  VDTVTL+ LL+L +S + FNN  LR++LF I QILA
Sbjct: 910 GQVKLCVHTMRLCQIAVGCDSDDCVDTVTLLDLLHLFKSHVVFNNELLRYYLFCIFQILA 969

Query: 540 GRAPTLYGVPRDKLLLLGDGETS-EQKNVF 568
           GR PTL+GVP++K L L D E   E KNVF
Sbjct: 970 GRPPTLFGVPKEKPLQLVDVEACIEPKNVF 999


>gi|222641555|gb|EEE69687.1| hypothetical protein OsJ_29324 [Oryza sativa Japonica Group]
          Length = 1503

 Score =  779 bits (2012), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/640 (60%), Positives = 480/640 (75%), Gaps = 21/640 (3%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            MLEKQMG + FRKILQ I++  + +   RTLSTKEFRH ANKVGNLERPFLKEFFPRWV 
Sbjct: 597  MLEKQMGPDSFRKILQMIVAPTRAS---RTLSTKEFRHLANKVGNLERPFLKEFFPRWVE 653

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
            + GCPV+R+G SY+KR+N+VELAV R CT K D   P + +N DS  R+GD GWPGMMS+
Sbjct: 654  SSGCPVMRLGISYSKRRNLVELAVSRGCTTKVDP-GPDIRTNGDS--REGDTGWPGMMSV 710

Query: 121  RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
            RVHE DG+YDHPI+PMAG+A Q++EIQCHSK+AA+R  K KKGSKPDG D+N DA +  D
Sbjct: 711  RVHETDGVYDHPIVPMAGEALQVVEIQCHSKVAAKRFQKTKKGSKPDGSDENIDA-SNQD 769

Query: 181  MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
             R+SM++PL WIR DPEMEYLAEIHF+QPVQMWINQLEKD DV++Q+QAI+ LE  P L+
Sbjct: 770  NRASMDAPLLWIRVDPEMEYLAEIHFHQPVQMWINQLEKDKDVISQSQAISVLEKSPQLT 829

Query: 241  FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
            F V N LNNFL+D+KAFWRVR+EAAYALA TASE T+  GLLHLVKFYKSRRFD +IGLP
Sbjct: 830  FAVTNALNNFLNDTKAFWRVRVEAAYALAVTASEGTELTGLLHLVKFYKSRRFDADIGLP 889

Query: 301  RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
            RPNDF D  EYFVLEAIPHAVA+VR+AD  SP+EA+EF+LQLLKYNDNNGN YSDV+WL+
Sbjct: 890  RPNDFHDIPEYFVLEAIPHAVALVRSADKSSPKEAIEFILQLLKYNDNNGNVYSDVYWLS 949

Query: 361  ALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSG 420
            A+VQ++GELEFGQQ +  LSSLLKRIDRLLQFD  MP YNG+LT+SCIRTL +IA ++S 
Sbjct: 950  AMVQAIGELEFGQQGVGLLSSLLKRIDRLLQFDNFMPGYNGVLTVSCIRTLARIAQRVSS 1009

Query: 421  FISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRG 480
             I LD+V +LI P+R+ +  W+VR+EA R L+DLEFH  G+D+AL LF+K   EE SLRG
Sbjct: 1010 SICLDRVCELIVPYRNMDKPWKVRMEAGRVLIDLEFHHKGLDAALLLFLKYANEERSLRG 1069

Query: 481  QVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILAG 540
              KL +H +R+CQ    S  N+++   TLV LL LL  + A+NNV+LRH++F ILQI AG
Sbjct: 1070 GTKLAVHVLRLCQANIESHDNNQIQLPTLVGLLCLLAGKKAYNNVYLRHNVFCILQIAAG 1129

Query: 541  RAPTLYGVPR--DKLLLLGDGETSEQKNVFASFVTEMRRAEPPMDVPNLSQ--------- 589
            R+PTL+GVP+      ++ +  + +     +S   + R  EP    P++ +         
Sbjct: 1130 RSPTLHGVPKVVTPPQVVLEISSDQHTKADSSVPQQSRPQEPSTSTPSVREVLPTSGPLK 1189

Query: 590  --DNLAVRDASKEVDCVANGHAENILAVPEAQRMQMSFPT 627
              DN++     + V  +    A+NI    E +R  +S PT
Sbjct: 1190 DADNISNCSERRNVIFIPTKDADNISNCSE-RRNVISIPT 1228


>gi|218202136|gb|EEC84563.1| hypothetical protein OsI_31331 [Oryza sativa Indica Group]
          Length = 1511

 Score =  779 bits (2012), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/640 (60%), Positives = 479/640 (74%), Gaps = 21/640 (3%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            MLEKQMG + FRKILQ I++  + +   RTLSTKEFRH ANKVGNLERPFLKEFFPRWV 
Sbjct: 604  MLEKQMGPDSFRKILQMIVAPTRAS---RTLSTKEFRHLANKVGNLERPFLKEFFPRWVE 660

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
            + GCPV+R+G SY+KR+N+VELAV R CT K D   P + +N DS  R+GD GWPGMMS+
Sbjct: 661  SSGCPVMRLGISYSKRRNLVELAVSRGCTTKVDP-GPDIRTNGDS--REGDTGWPGMMSV 717

Query: 121  RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
            RVHE DG+YDHPI+PMAG+A Q++EIQCHSK+AA+R  K KKGSKPDG D+N DA +  D
Sbjct: 718  RVHETDGVYDHPIVPMAGEALQVVEIQCHSKVAAKRFQKTKKGSKPDGSDENIDA-SNQD 776

Query: 181  MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
             R+SM++PL WIR DPEMEYLAEIHF+QPVQMWINQLEKD DV++Q+QAI+ LE  P L+
Sbjct: 777  NRASMDAPLLWIRVDPEMEYLAEIHFHQPVQMWINQLEKDKDVISQSQAISVLEKSPQLT 836

Query: 241  FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
            F V N LNNFL+D+KAFWRVR+EAAYALA TASE T+  GLLHLVKFYKSRRFD +IGLP
Sbjct: 837  FAVTNALNNFLNDTKAFWRVRVEAAYALAVTASEGTELTGLLHLVKFYKSRRFDADIGLP 896

Query: 301  RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
            RPNDF D  EYFVLEAIPHAVA+VR+AD  SP+EA+EF+LQLLKYNDNNGN YSDV+WL+
Sbjct: 897  RPNDFHDIPEYFVLEAIPHAVALVRSADKSSPKEAIEFILQLLKYNDNNGNVYSDVYWLS 956

Query: 361  ALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSG 420
            A+VQ++GELEFGQQ +  LSSLLKRIDRLLQFD  MP YNG+LT+SCIRTL +IA ++S 
Sbjct: 957  AMVQAIGELEFGQQGVGLLSSLLKRIDRLLQFDNFMPGYNGVLTVSCIRTLARIAQRVSS 1016

Query: 421  FISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRG 480
             I LD+V +LI P+R+ +  W+VR+EA R L+DLEFH  G+D+AL LF+K   EE SLRG
Sbjct: 1017 SICLDRVCELIVPYRNMDKPWKVRMEAGRVLIDLEFHHKGLDAALLLFLKYANEERSLRG 1076

Query: 481  QVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILAG 540
              KL +H +R+CQ    S  N+++   TLV LL LL  + A+NNV+LRH++F ILQI AG
Sbjct: 1077 GTKLAVHVLRLCQANIESHDNNQIQLPTLVGLLCLLAGKKAYNNVYLRHNVFCILQIAAG 1136

Query: 541  RAPTLYGVPR--DKLLLLGDGETSEQKNVFASFVTEMRRAEPPMDVPNLSQ--------- 589
            R+PTL+GVP+      ++ +    +     +S   + R  EP    P++ +         
Sbjct: 1137 RSPTLHGVPKVVTPPQVVLEISNDQHTKADSSVPQQSRPQEPSTSTPSVREVLPTSGPLK 1196

Query: 590  --DNLAVRDASKEVDCVANGHAENILAVPEAQRMQMSFPT 627
              DN++     + V  +    A+NI    E +R  +S PT
Sbjct: 1197 DADNISNCSERRNVIFIPTKDADNISNCSE-RRNVISIPT 1235


>gi|357153634|ref|XP_003576516.1| PREDICTED: uncharacterized protein LOC100821849 [Brachypodium
            distachyon]
          Length = 1407

 Score =  772 bits (1994), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/608 (62%), Positives = 469/608 (77%), Gaps = 12/608 (1%)

Query: 2    LEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGT 61
            LEKQMG + FRKILQ I++  + +   RTLSTKEFRH ANKVGNLERPFLKEFFPRWV +
Sbjct: 541  LEKQMGPDSFRKILQMIVASTRAS---RTLSTKEFRHLANKVGNLERPFLKEFFPRWVES 597

Query: 62   CGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIR 121
            CGCPV+R+G SY+KR+N+VELAV R CT K D   P   S+ + + +D   G+PGMMS+R
Sbjct: 598  CGCPVMRLGISYSKRRNVVELAVSRGCTAKAD---PGSDSHVNDDIQDSANGFPGMMSVR 654

Query: 122  VHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDM 181
            VHE DG+YDHPILPMAG+A Q++E+QCHSKLAA+R  K KKGSKPDG D+N +A +  + 
Sbjct: 655  VHETDGVYDHPILPMAGEALQVVELQCHSKLAAKRFQKTKKGSKPDGSDENVEA-STQEN 713

Query: 182  RSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSF 241
            R+SM+SPL WIR DPEMEYLAEIHF+QP+QMWINQLEKD DV++Q+QAIA LE LP +SF
Sbjct: 714  RTSMDSPLLWIRVDPEMEYLAEIHFHQPIQMWINQLEKDKDVISQSQAIAVLEKLPQISF 773

Query: 242  NVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPR 301
             V+N LNNFL+D+KAFWRVR+EAAYALA TASE+TD  GLLHLVKFYKSRRFD +IGLPR
Sbjct: 774  AVINALNNFLNDTKAFWRVRVEAAYALAVTASEDTDLTGLLHLVKFYKSRRFDADIGLPR 833

Query: 302  PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
            PNDF D  EYFVLEAIPHAVA+VR++D  SPR+A+EFVLQLLKYNDNNGN YSDV+WLAA
Sbjct: 834  PNDFNDIPEYFVLEAIPHAVALVRSSDKSSPRQAIEFVLQLLKYNDNNGNVYSDVYWLAA 893

Query: 362  LVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGF 421
            +VQ++GELEFGQQ I  LSSLLKRIDRLLQFD  MP YNG+LT+SCIR L +IA ++S  
Sbjct: 894  MVQAIGELEFGQQGIGLLSSLLKRIDRLLQFDNFMPGYNGVLTVSCIRALARIAERVSSS 953

Query: 422  ISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQ 481
            I +D++ +LI PFR+ +  W+V++EASR L+DLE H  G+D+AL LF+K   EE SL+G 
Sbjct: 954  ICIDRICELIAPFRNIDKPWKVQIEASRVLIDLEHHHKGLDAALLLFLKYTYEEKSLQGG 1013

Query: 482  VKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILAGR 541
             KL +H +R+CQ     D   ++   T+V LL+LL SR A+NNVFLRH++F ILQI AGR
Sbjct: 1014 TKLAVHVLRLCQANVEPDLTDQIKLPTVVGLLHLLGSRKAYNNVFLRHNVFCILQIAAGR 1073

Query: 542  APTLYGVPRDKLLL---LGDGETSEQKNVFASFVTEMRRAEPPMDVPNLSQDNLAVRDAS 598
            +PTL+GV   KL+    L    +S+Q     S V ++ R + P       ++ L     S
Sbjct: 1074 SPTLHGV--SKLVAPSPLVQEISSDQHAKADSSVPQLSRPQEPSSSTPSVREVLLTTGPS 1131

Query: 599  KEVDCVAN 606
            K+ D ++N
Sbjct: 1132 KDADNISN 1139


>gi|12325155|gb|AAG52531.1|AC016662_25 unknown protein; 58745-68005 [Arabidopsis thaliana]
          Length = 1273

 Score =  667 bits (1721), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 327/477 (68%), Positives = 382/477 (80%), Gaps = 32/477 (6%)

Query: 11  FRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRM 69
           ++ ILQ IISRA+  S  +R+LSTKEFR FANK+GNLERPFLKEFF RWV + GCPVLR+
Sbjct: 398 WKSILQKIISRAKDPSNSIRSLSTKEFRQFANKIGNLERPFLKEFFQRWVASYGCPVLRI 457

Query: 70  GFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMY 129
           G SYNKRKN VE+A LR+CT   D+R  V+ + +DSE+RD D GWPG+MSIRV+ELDGM 
Sbjct: 458 GLSYNKRKNNVEMAALRECTAALDARLSVIGATSDSESRDVDAGWPGIMSIRVYELDGMS 517

Query: 130 DHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPL 189
           DHP LPMAGD WQLLE+ CHSKLAA+R  KPKKG KPDG +DN DA+A L+ ++S+ESPL
Sbjct: 518 DHPKLPMAGDRWQLLELPCHSKLAAKRYQKPKKGGKPDGAEDNVDAIAPLENKTSIESPL 577

Query: 190 SWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNN 249
           +WI+ADPEMEY+AEIH +QP+QMW+NQLEKDGDVVAQAQAIA+LEAL   SF++VN L N
Sbjct: 578 AWIKADPEMEYIAEIHLHQPLQMWVNQLEKDGDVVAQAQAIASLEALKQHSFSIVNALKN 637

Query: 250 FLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFS 309
            L+DSK FWR+RI AA+ALA TASEE+DWAGL HL+KFYKSRRFD  IGLP+PNDFRDF 
Sbjct: 638 VLTDSKVFWRIRIAAAFALAKTASEESDWAGLQHLIKFYKSRRFDAEIGLPKPNDFRDFP 697

Query: 310 EYFVLE-------------------------------AIPHAVAMVRAADNKSPREAVEF 338
           EYFVLE                               AIPHA+A+VR A+ KSPREAVEF
Sbjct: 698 EYFVLEVGNELDIVSRIDLRSCFPLGVSIVIPLPETMAIPHAIAIVRGAEGKSPREAVEF 757

Query: 339 VLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPS 398
           +LQLLKYNDN+GN YSDVFWLA LVQSVG+LEF QQS+ FL+ LLKRIDRLLQFDRLMPS
Sbjct: 758 ILQLLKYNDNSGNSYSDVFWLAVLVQSVGDLEFCQQSLTFLAPLLKRIDRLLQFDRLMPS 817

Query: 399 YNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
           YNGILTISCIRTL Q ALKLS  IS D + KLI+PFR+ +TI Q+R+E SRALLD+E
Sbjct: 818 YNGILTISCIRTLAQTALKLSDSISFDHICKLIEPFRNSDTILQIRIEGSRALLDIE 874


>gi|168045101|ref|XP_001775017.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673604|gb|EDQ60124.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1068

 Score =  621 bits (1601), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 319/548 (58%), Positives = 402/548 (73%), Gaps = 9/548 (1%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
           MLEKQMG   FRK+LQ II RAQ  +   RTLST EFRHFANK+GNLERPFLK+FFPRWV
Sbjct: 437 MLEKQMGPEPFRKVLQRIIMRAQDPNRKNRTLSTTEFRHFANKLGNLERPFLKDFFPRWV 496

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT-DSENRDGDIGWPGMM 118
            + GCP LRMGF Y+KR+N+VELAV R+ T  P     +      +S+ R  D+GWPGMM
Sbjct: 497 ESSGCPRLRMGFVYSKRRNMVELAVHRESTAMPKHVGVIGKVEAPESKLRLSDMGWPGMM 556

Query: 119 SIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAG 178
           SIR+HELDGMYDHP LPMAGD +QLLE+QCHSKLA RR  +PKKG+K +  +DN DA   
Sbjct: 557 SIRLHELDGMYDHPSLPMAGDCYQLLELQCHSKLAGRRIQRPKKGAKGEAVEDNVDATPA 616

Query: 179 LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPH 238
                S+ESPL WIR DPE+EYLAE++  QP  MWINQLEKD DV+ Q QA+AAL A P 
Sbjct: 617 ----QSLESPLLWIRGDPELEYLAELNLRQPEPMWINQLEKDRDVIGQIQAVAALRAYPR 672

Query: 239 LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG 298
            SF VVN LNN L D K F RVRIEAA ALA+TASE T + GL +L+K+YKSRRFD +IG
Sbjct: 673 TSFAVVNALNNCLIDPKVFCRVRIEAAAALASTASEGTSYMGLQNLLKYYKSRRFDPDIG 732

Query: 299 LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW 358
           LP+PNDF DFSEYFVLEAIP AVA VR +D KSP EA +F+L +LK+NDN+GN YSDVFW
Sbjct: 733 LPKPNDFHDFSEYFVLEAIPGAVAEVRGSDGKSPLEATDFILHILKHNDNSGNAYSDVFW 792

Query: 359 LAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
           LA++++S+G LEFG+Q++  L+ +LK+IDR LQ+DRLMPSYNG++TISCI TL ++A++ 
Sbjct: 793 LASVIESIGSLEFGKQNLRTLARMLKQIDRFLQYDRLMPSYNGVVTISCIHTLAKLAIRF 852

Query: 419 SGFISLDQVVKLIKPFRDFNTI-WQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPS 477
           S  +  ++V +L+  F+D   + WQVRV A +ALLD+E    G+++A++L ++ V  + S
Sbjct: 853 SHALPTERVKQLLNQFKDRQKVQWQVRVAALKALLDIELQSKGLEAAVNLALQQVHGDLS 912

Query: 478 LRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQI 537
             G+ K+  H +R+  I         +  VT+  LL LLES  AF+NV LRH +F  LQ+
Sbjct: 913 FGGESKIMKHVVRMASIT--EKGKCRIGGVTIARLLALLESSKAFHNVMLRHSIFSFLQV 970

Query: 538 LAGRAPTL 545
           LAGR   L
Sbjct: 971 LAGRYDML 978


>gi|302767542|ref|XP_002967191.1| hypothetical protein SELMODRAFT_408008 [Selaginella moellendorffii]
 gi|300165182|gb|EFJ31790.1| hypothetical protein SELMODRAFT_408008 [Selaginella moellendorffii]
          Length = 1670

 Score =  570 bits (1468), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 305/551 (55%), Positives = 380/551 (68%), Gaps = 30/551 (5%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT-LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
           MLEKQMG    RK+LQ I+SRA     + +     +FRHFANK+GNLERPFLKEFFPRWV
Sbjct: 409 MLEKQMGPESLRKVLQRIVSRAGEPCFLSSYFQGLQFRHFANKIGNLERPFLKEFFPRWV 468

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
            + GCP+LRMGF YNKR+NIVE AV RDCT  P+    V+     S     D+GWPGMM+
Sbjct: 469 ESQGCPLLRMGFEYNKRRNIVEFAVHRDCTA-PE----VMMLGNKS---IADVGWPGMMN 520

Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
           I++HELDGMYDHP L ++GD  QLLEI CHSK+A RR  +PKK  K +  DD  D     
Sbjct: 521 IQIHELDGMYDHPNLSLSGDVHQLLEIPCHSKMANRRPQRPKKPGKVEIIDDLADD---- 576

Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
             + S+ESPL W+RADP M+YL +I   QP QMWINQLE+D D VAQ QAIAAL+ LP +
Sbjct: 577 --KPSVESPLLWLRADPGMDYLTKIQLWQPEQMWINQLERDKDAVAQLQAIAALQKLP-V 633

Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
           SF++V            F RVR+EAA+ALANTA E T W GL HL+K+YKSR FD  IGL
Sbjct: 634 SFSIV------------FCRVRVEAAFALANTAGEGTSWLGLGHLMKYYKSRCFDMEIGL 681

Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
           PRPN F DFSEYFVL+AIP AV  VR    KSP EAVEF+L +LK+NDN+GN YSD FWL
Sbjct: 682 PRPNSFHDFSEYFVLQAIPPAVGTVRGNQGKSPPEAVEFILNILKHNDNSGNAYSDSFWL 741

Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
           +++++SVGELE  QQSI   + +LK+IDR  Q+DRLMPSYN ILT SCI+TLT++ALK +
Sbjct: 742 SSVIESVGELEISQQSIPKFARILKQIDRYFQYDRLMPSYNCILTRSCIQTLTRLALKFN 801

Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
             ISL  +  L+ PF+   T WQVRV A +A LD++F    ++S+L + +  V E+P  R
Sbjct: 802 TVISLAPIETLLSPFKVLTTPWQVRVAALQAQLDMKFQSESLNSSLRVALDFVREDPCAR 861

Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            Q K+  +A+RIC  K  S S+  VD+ T+ +LL   ES     N  LRH++F ILQ+LA
Sbjct: 862 VQCKVMSYALRICLRKHASTSS--VDSSTVSSLLEFFESEQCAVNHKLRHYIFSILQVLA 919

Query: 540 GRAPTLYGVPR 550
           G+  TL  V R
Sbjct: 920 GKPATLLRVSR 930


>gi|302754206|ref|XP_002960527.1| hypothetical protein SELMODRAFT_402861 [Selaginella moellendorffii]
 gi|300171466|gb|EFJ38066.1| hypothetical protein SELMODRAFT_402861 [Selaginella moellendorffii]
          Length = 1689

 Score =  565 bits (1455), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 302/546 (55%), Positives = 377/546 (69%), Gaps = 30/546 (5%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT-LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
           MLEKQMG    RK+LQ I+SRA     + +     +FRHFANK+GNLERPFLKEFFPRWV
Sbjct: 446 MLEKQMGPESLRKVLQRIVSRAGEPCFLSSYFQGLQFRHFANKIGNLERPFLKEFFPRWV 505

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
            + GCP+LRMGF YNKR+NIVE AV RDCT  P+    V+     S     D+GWPGMM+
Sbjct: 506 ESQGCPLLRMGFEYNKRRNIVEFAVHRDCTA-PE----VMMLGNKS---IADVGWPGMMN 557

Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
           I++HELDGMYDHP L ++GD  QLLEI CHSK+A RR  +PKK  K +  DD  D     
Sbjct: 558 IQIHELDGMYDHPNLSLSGDVHQLLEIPCHSKMANRRPQRPKKPGKVEIIDDLADD---- 613

Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
             + S+ESPL W+RADP M+YL +I   QP QMWINQLE+D D VAQ QAIAAL+ LP +
Sbjct: 614 --KPSVESPLLWLRADPGMDYLTKIQLWQPEQMWINQLERDKDAVAQLQAIAALQKLP-V 670

Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
           SF++V            F RVR+EAA+ALANTA E T W GL HL+K+YKSR FD  IGL
Sbjct: 671 SFSIV------------FCRVRVEAAFALANTAGEGTSWLGLGHLMKYYKSRCFDMEIGL 718

Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
           PRPN F DFSEYFVL+AIP AV  VR    KSP EAVEF+L +LK+NDN+GN Y D FWL
Sbjct: 719 PRPNSFHDFSEYFVLQAIPPAVGTVRGNQGKSPPEAVEFILNILKHNDNSGNAYLDSFWL 778

Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
           +++++SVGELE  QQSI   + +LK+IDR  Q+DRLMPSYN ILT SCI+TLT++ALK +
Sbjct: 779 SSVIESVGELEISQQSIPKFARILKQIDRYFQYDRLMPSYNCILTRSCIQTLTRLALKFN 838

Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
             ISL  +  L+ PF+   T WQVRV A +A LD++F    ++S+L + +  V E+P  R
Sbjct: 839 TVISLAPIETLLSPFKVLTTPWQVRVAALQAQLDMKFQSESLNSSLRVALDFVREDPCAR 898

Query: 480 GQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRHHLFGILQILA 539
            Q K+  +A+RIC  K  S S+  VD+ T+ +LL   ES     N  LRH++F ILQ+LA
Sbjct: 899 VQCKVMSYALRICLRKHASTSS--VDSSTVSSLLEFFESEQCAVNHKLRHYIFSILQVLA 956

Query: 540 GRAPTL 545
           G+  TL
Sbjct: 957 GKPATL 962


>gi|168042563|ref|XP_001773757.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674872|gb|EDQ61374.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1412

 Score =  500 bits (1288), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 286/573 (49%), Positives = 369/573 (64%), Gaps = 81/573 (14%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGA-SPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            MLE+QMG   FRK+LQ II RAQ   S  +TLST EFRHFANK+GNLERPFLK+FFPRWV
Sbjct: 482  MLERQMGPEPFRKVLQRIIMRAQDPDSKNQTLSTTEFRHFANKLGNLERPFLKDFFPRWV 541

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT-DSENRDGDIGWPGMM 118
             + GCP L      +KR+N+VELAV R+ T  P     V      D + R+ D+GWPGMM
Sbjct: 542  ESSGCPRL------SKRRNMVELAVRRESTSIPKRVGAVGKLEIPDLKLRNSDMGWPGMM 595

Query: 119  SIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAG 178
            SIR+HELDGMYDHP LPMAGD +QLLE+QCHSKLA RR  +PKKG+K +  +D  DA   
Sbjct: 596  SIRLHELDGMYDHPSLPMAGDCYQLLELQCHSKLAGRRMQRPKKGTKGEAVEDRVDATPA 655

Query: 179  LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPH 238
                 S+ESPL W+R DPE+EYLAE++  QP QMWINQLEKD DV+ Q QA+AAL A P 
Sbjct: 656  ----QSLESPLLWVRGDPELEYLAELNLIQPEQMWINQLEKDRDVIGQVQAVAALRAHPR 711

Query: 239  LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG 298
             SF VVN LNN L D K F RVRIEAA ALA+TAS                         
Sbjct: 712  TSFAVVNALNNCLIDPKVFCRVRIEAAAALASTAS------------------------- 746

Query: 299  LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW 358
                            EAIP A+A VR  D KSP EA EF+L +LK+NDNNGN +SDV+W
Sbjct: 747  ----------------EAIPGAIAEVRGNDGKSPLEASEFILHILKHNDNNGNVHSDVYW 790

Query: 359  LAALVQSVGELEFGQQSIL-----------------------FLSSLLKRIDRLLQFDRL 395
            LA+++++VG LEFG+Q +L                        L+ +LK+IDR LQ+DRL
Sbjct: 791  LASVIEAVGSLEFGKQFVLSRSNSGSIVILLGNWGLPIQDLRTLARILKQIDRFLQYDRL 850

Query: 396  MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTI-WQVRVEASRALLDL 454
            MPSYNG++  SCIRTL  IA++ S  +S+++V  L+K F+D + I W+VRV A +ALLD+
Sbjct: 851  MPSYNGVVATSCIRTLANIAIRFSYALSIERVKLLLKQFKDRDKINWRVRVAALKALLDI 910

Query: 455  EFHCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQ-IKGGSDSNHEVDTVTLVALL 513
            E++  G+++A+ L ++ +  E S   + K+  H  RI   I+ GS    ++ +V++  L+
Sbjct: 911  EYYSKGLEAAVILALQLIHSEFSFPVKSKIMKHVGRIASLIEKGS---CKIGSVSVARLM 967

Query: 514  NLLESRIAFNNVFLRHHLFGILQILAGRAPTLY 546
             LL+S  AF+NV LRH +F  LQ+LAGR+ +LY
Sbjct: 968  TLLKSSTAFHNVMLRHSVFSFLQVLAGRSASLY 1000


>gi|242044652|ref|XP_002460197.1| hypothetical protein SORBIDRAFT_02g024430 [Sorghum bicolor]
 gi|241923574|gb|EER96718.1| hypothetical protein SORBIDRAFT_02g024430 [Sorghum bicolor]
          Length = 310

 Score =  457 bits (1177), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 218/317 (68%), Positives = 261/317 (82%), Gaps = 8/317 (2%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           MLEKQMG + FRKILQ I++  + +   RTLSTKEFRH ANKVGNLERPFLKEFFPRWV 
Sbjct: 1   MLEKQMGPDSFRKILQVIVAPNRAS---RTLSTKEFRHLANKVGNLERPFLKEFFPRWVE 57

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
           +CGCP++R+G SY+K++N++ELAV R CT K    TP    +T+ + R+GD GWPGMMS+
Sbjct: 58  SCGCPIMRLGISYSKKRNMIELAVSRGCTAKA---TPDPDIHTNGDTREGDAGWPGMMSV 114

Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
           RVHE DG YDHP+LPMAG+A Q++EIQCHS+LAA+R  K KK +K DG DDN DA +  +
Sbjct: 115 RVHETDGAYDHPVLPMAGEALQVVEIQCHSRLAAKRVWK-KKNTKLDGSDDNIDA-STQE 172

Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
            R+S++SPL WIR DPEMEYLAEIHF+QPVQMWINQLEKD DV++Q+QAI+ LE LP LS
Sbjct: 173 NRTSVDSPLLWIRVDPEMEYLAEIHFHQPVQMWINQLEKDKDVISQSQAISVLEKLPQLS 232

Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
           F V+N LNNFL+D+KAFWRVRIEAAYALA T+SE TD AGLLHLVKFYKSRRFD +IG+P
Sbjct: 233 FAVINALNNFLNDTKAFWRVRIEAAYALAVTSSEATDLAGLLHLVKFYKSRRFDADIGMP 292

Query: 301 RPNDFRDFSEYFVLEAI 317
           RPNDF D  EYFVLE +
Sbjct: 293 RPNDFHDVPEYFVLEVM 309


>gi|147776761|emb|CAN61428.1| hypothetical protein VITISV_006736 [Vitis vinifera]
          Length = 299

 Score =  441 bits (1133), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 211/259 (81%), Positives = 226/259 (87%), Gaps = 2/259 (0%)

Query: 63  GCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRV 122
           G   LR G SYNKRKN+VELAVLR CT  PD+ T VL+ N DSENR+ DIGWPGMMSIRV
Sbjct: 6   GLKFLRAGLSYNKRKNLVELAVLRGCTAAPDTNTXVLNGNIDSENREVDIGWPGMMSIRV 65

Query: 123 HELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMR 182
           HELDGMYDHPILPMAG+ WQLLEIQCHSKLAARR  KPKKGSKPDG DDNGD V  +DMR
Sbjct: 66  HELDGMYDHPILPMAGETWQLLEIQCHSKLAARRFQKPKKGSKPDGSDDNGD-VPAVDMR 124

Query: 183 SSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFN 242
           S+ ESPL W+R DPE+EYLAEIHFNQP QMWINQLE+D DVVAQAQAIA LEALP LSF+
Sbjct: 125 SNTESPLLWLRVDPELEYLAEIHFNQPAQMWINQLERDKDVVAQAQAIATLEALPQLSFS 184

Query: 243 VVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRP 302
           VVN LNNFLSDSKAFWRVRIEAA+ALANTASEETDWAGLLHLVKFYKSRRFD NIGLP+P
Sbjct: 185 VVNALNNFLSDSKAFWRVRIEAAFALANTASEETDWAGLLHLVKFYKSRRFDANIGLPKP 244

Query: 303 NDFRDFSEYFVLEAIPHAV 321
           NDF DF EYFVLE + H+V
Sbjct: 245 NDFHDFPEYFVLE-VAHSV 262


>gi|326491201|dbj|BAK05700.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 391

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 190/310 (61%), Positives = 230/310 (74%), Gaps = 26/310 (8%)

Query: 6   MGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCP 65
           M  + FRKILQ I++  + +   RTLSTKEFRH  NKVGNLERPFLKEFFPRW+ +CGCP
Sbjct: 1   MWPDSFRKILQMIVASTRAS---RTLSTKEFRHLVNKVGNLERPFLKEFFPRWIESCGCP 57

Query: 66  VLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHEL 125
           ++R+G SY+KR+N+VELAV R CT K D   P   ++ + + ++G  GWPGMMS+RVHE 
Sbjct: 58  IMRLGISYSKRRNLVELAVSRGCTAKAD---PGSDNHVNGDIQEGATGWPGMMSVRVHET 114

Query: 126 DGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSM 185
            G YDH ILPMAG+A  ++E+QCHSK+AA+R  K KKGSKPDG DDN DA +  + R+SM
Sbjct: 115 GGAYDHLILPMAGEALHVVELQCHSKVAAKRFQKTKKGSKPDGSDDNVDA-STQENRTSM 173

Query: 186 ESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVN 245
           +SPL WIR D EMEYLAEIHF+QP+QMWINQLEKD DV++Q+Q I  LE LP LSF V+N
Sbjct: 174 DSPLLWIRVDQEMEYLAEIHFHQPIQMWINQLEKDKDVISQSQEITVLEKLPQLSFAVIN 233

Query: 246 TLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDF 305
            LNNFL+D+K                   +TD AGLLHLVKFYKS RFD +IGLPRPNDF
Sbjct: 234 ALNNFLNDTK-------------------DTDLAGLLHLVKFYKSHRFDTDIGLPRPNDF 274

Query: 306 RDFSEYFVLE 315
            D  EYFVLE
Sbjct: 275 HDIPEYFVLE 284


>gi|384253674|gb|EIE27148.1| hypothetical protein COCSUDRAFT_55172 [Coccomyxa subellipsoidea
           C-169]
          Length = 1510

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 186/577 (32%), Positives = 301/577 (52%), Gaps = 49/577 (8%)

Query: 1   MLEKQMGSNFFRKILQNII------SRAQGASPVRTLSTKEFRHFANKVGNLERPF-LKE 53
           MLE++ G + F+++L  I+      + AQGA+P R L T EF +   K G  E    +  
Sbjct: 410 MLEQRAGEDNFKRLLSRIVRASCQPATAQGANP-RLLGTHEFLNQIVKAGGKESKHEISA 468

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  RW+   GCP+L     ++++++ +ELA+ +       +    +++   S        
Sbjct: 469 FHQRWIIGRGCPLLTAAVVFHRKRSTLELAIRQTGPQAARAAAEKVAAKEGSSA------ 522

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK----PDGC 169
             G + + V E +   +HP+  +   A +L+E++  SK A  +  KP +  K     +  
Sbjct: 523 --GFIKVLVRETEAPVEHPVN-LGTAALRLVELKVTSKPAKPKPGKPGRKKKLQVVHEDL 579

Query: 170 DDNGDA-------VAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGD 222
           DD+GDA       VAG   R     P+ W+R DP  E LA +   QP  M   QLE+  D
Sbjct: 580 DDDGDATMADAPDVAGEASRL----PVEWVRLDPHGESLAAVRLAQPEVMASAQLERSKD 635

Query: 223 VVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLL 282
           V AQ+ AIA L AL   S+ VVN L   +  +  + R+R+EAA AL  TA   T W+GL 
Sbjct: 636 VAAQSAAIAQLAALRPTSYGVVNALRACMQHAATYCRIRVEAALALGATAGPSTAWSGLE 695

Query: 283 HLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQL 342
           HL+K+Y++R FD +IGLPRP  F D S+++V +A+PHA+A VR A   S  EA E +L+ 
Sbjct: 696 HLLKYYRTRIFDSDIGLPRPWQFTDLSQHYVDQALPHAIAAVRDAAGHSFPEAAEALLEH 755

Query: 343 LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
           L + DN+GN YSD   LAAL+ ++G L    QS   ++ +L R++R LQ + +MPS++ I
Sbjct: 756 LDHADNSGNAYSDDNALAALLDALGSLHL--QSPDAMAGILDRLERYLQREAVMPSFHSI 813

Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKL-IKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
           +  + + +LT +A+ +         ++L ++ +   +    +R+ A++  + +    +G 
Sbjct: 814 VGCTALTSLTSLAVSMRSPPKQKARLRLFLQRYSRPDAPHHLRLTAAQCSVRIAACTDGA 873

Query: 462 DSALS--LFIKSVEEEPSLRGQV----------KLGIHAMRICQIKGGSDSNHEVDTVTL 509
           D A+   L   + E+ P+LRG +               A      K G+ +  ++ ++ +
Sbjct: 874 DGAVQAALAALAAEKSPALRGAILESLLGLLHRPAAAAARSGGPAKSGTAAKTKIVSMAV 933

Query: 510 VALLNLLESRIAFNNVFLRHHLFGILQILAGRAPTLY 546
           +A L+ L S  A  +V LRH  F  L  L G APTL+
Sbjct: 934 LAQLHALLS--AHGDVGLRHLAFMALMRLGGLAPTLF 968


>gi|328874793|gb|EGG23158.1| hypothetical protein DFA_05290 [Dictyostelium fasciculatum]
          Length = 1643

 Score =  227 bits (578), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 146/457 (31%), Positives = 249/457 (54%), Gaps = 43/457 (9%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT-LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
           M+EK++  +  RK+L +++  +    PV T + TK+F      +  L    LK+F  +W+
Sbjct: 532 MIEKRITEDGLRKVLNSLLV-SPTNEPVETNVPTKKFIKL---IKTLTGNDLKQFSEKWI 587

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
              G P L  GF +NK+K+ +E A+ +  T   D    + SS                ++
Sbjct: 588 YGKGSPNLVCGFYFNKKKHQIEFAIKQVFTRSEDK---IASS----------------LT 628

Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
           IR++EL+G Y+H I     D  Q  ++ C+S++   R  + +         D+G+ +  +
Sbjct: 629 IRIYELEGTYEHII--NFDDDMQEFDLTCYSRIRKSRKKRIRA--------DDGEEME-I 677

Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
           D+    ESPL  IR DPEME++ +I F QP  MWI QLE D DVVAQ++A+  L   P  
Sbjct: 678 DLSQKRESPLLHIRIDPEMEWIHKITFKQPEYMWIYQLELDKDVVAQSEALRGLCTFP-- 735

Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
           + N +  +  F ++ K ++RVR+EAA  LA    +ET++ GL  L+ ++K   +D +  L
Sbjct: 736 TTNSLRAVFKFFANPKHYYRVRMEAAIVLAKLTVKETNYEGLDLLINYFKQNFYDPHSRL 795

Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
              N+F DFS YFV + IP AV+M++ +  ++P E ++F+L LLK NDN+ N YSD +++
Sbjct: 796 LAENNFSDFSAYFVKKIIPQAVSMIKDSQGRTPIEGIDFLLDLLKDNDNSLNAYSDNYYV 855

Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
           A+++ S+  +     +   +S +  ++ R L+FD+L P Y+  +T +C+ ++ Q  L+ S
Sbjct: 856 ASVLSSLANVHTTNST--HISEIHGQLTRYLRFDQLYPCYHNTITSACLTSICQ--LQAS 911

Query: 420 GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
             I LD  +K  K   ++    ++R  A +    L +
Sbjct: 912 KQIGLD--LKFFKDHFNYTYYEEIRKTAFQCYYRLAY 946


>gi|268637510|ref|XP_002649090.1| transcription initiation factor TFIID subunit [Dictyostelium
           discoideum AX4]
 gi|256012812|gb|EEU04038.1| transcription initiation factor TFIID subunit [Dictyostelium
           discoideum AX4]
          Length = 2044

 Score =  225 bits (574), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 137/431 (31%), Positives = 235/431 (54%), Gaps = 39/431 (9%)

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           LK F  +W+   G P L  GF++N++K + E A L+    +P+ +               
Sbjct: 602 LKHFSEKWIYGRGSPNLICGFNFNRKKFMTEFA-LKQVLNRPEDKIS------------- 647

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
                G + IR+ ELDG ++  ++    D ++  +  CHS+    R  K KK    DG +
Sbjct: 648 -----GTLMIRIVELDGSFEQ-VINFEDDMFEY-DFPCHSRC---RKTKKKKLLSEDGEE 697

Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
                   +D+    E+PL W R DPE+E++  I F QP  MWI+QLE D DV+AQ +AI
Sbjct: 698 ------MEIDLSQKRETPLLWFRIDPELEWIHRITFRQPEYMWIHQLELDRDVIAQMEAI 751

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS 290
             L+   ++SFN V  +  F+ +S+ F+RVR +AA  L+  +++ET++ GL  LV ++K 
Sbjct: 752 KGLKD--YISFNSVRAVFKFIQNSRYFYRVRCDAAMLLSKLSTKETNFEGLNVLVSYFKG 809

Query: 291 RRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNG 350
             FD +    +PN F+DF+ YF+ ++IP ++++++  + ++P E +EF+++LLK NDN+ 
Sbjct: 810 LFFDRDSNQLKPNSFKDFNSYFMQKSIPLSISLIKDKEGRTPIECIEFLIELLKDNDNSL 869

Query: 351 NPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
           N YSD ++L AL+ +   +    +  L L  + K++ R L++D+L PSY   +TI C++ 
Sbjct: 870 NTYSDCYYLEALLSAFSNIHTTNE--LLLKKIKKQVARYLKYDQLYPSYRNCVTIGCLKA 927

Query: 411 LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL--EFHCN-GIDSALSL 467
             Q+    +    LD    L K +  ++   +VR  A +++  L  +F C+ G  S    
Sbjct: 928 YCQLYSIKNSKDDLD--FTLFKEYSTYSHFEEVRKVALQSIYKLAYQFQCSPGTGSINDY 985

Query: 468 FIKSVEEEPSL 478
           F+   + E +L
Sbjct: 986 FLDLFDNEQTL 996


>gi|281210428|gb|EFA84594.1| hypothetical protein PPL_01584 [Polysphondylium pallidum PN500]
          Length = 1764

 Score =  219 bits (558), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 126/412 (30%), Positives = 230/412 (55%), Gaps = 38/412 (9%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT-LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
           M+E+ +  +  RK+L +++    G  P+ T + TK+F      +  L    LK+F  RW+
Sbjct: 401 MIERLISEDGLRKVLNSMLVSQTGNEPIETIMPTKKF---LKMIKTLTGQDLKQFSERWI 457

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
              G P L  GF +N++++  E A ++    +P+ +                I  P  ++
Sbjct: 458 FGKGTPHLLCGFYFNRKRHQTEFA-MKQLAARPEDK----------------ITSP--IT 498

Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
           +R+ ELD  Y+H I     D W   +  CHS+    +  K +         D+G+ +  +
Sbjct: 499 LRITELDRTYEHII--NFDDDWNQYDFPCHSRNRKSKKKKIRH--------DDGEEME-I 547

Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
           D+    E+PL W R DPE+E++ +I F QP  MW++QLE D DV+AQ++A+  L + P L
Sbjct: 548 DLSQKRETPLLWFRIDPEIEWIHKITFKQPEYMWVHQLELDRDVIAQSEALCGLASFPTL 607

Query: 240 SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL 299
               + ++  F  + K +++VR+EAA  +A  + +ET++ GL  ++ ++KS+ +D+  GL
Sbjct: 608 Y--SLRSVFKFFCNQKQYYKVRMEAALVIAKISCKETNYEGLELMISYFKSQFYDQESGL 665

Query: 300 PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
              N+F DF  Y++ + IP +V++++  D ++P EA+EF++ LLK N+N+ NP+SD ++L
Sbjct: 666 LDENNFTDFINYYIRKIIPVSVSLMKDKDGRTPAEAIEFLVDLLKNNENSSNPFSDNYYL 725

Query: 360 AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTL 411
           A+++ S+  +      +  + S+  +I R L+FD+L P Y   +TI+ ++ L
Sbjct: 726 ASIMSSLANVNTNNDQL--IKSMHNQIVRYLKFDQLYPCYRNTITIAALQAL 775


>gi|440798321|gb|ELR19389.1| PBS lyase HEATlike repeat domain containing protein [Acanthamoeba
           castellanii str. Neff]
          Length = 981

 Score =  214 bits (544), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 150/461 (32%), Positives = 249/461 (54%), Gaps = 49/461 (10%)

Query: 1   MLEKQMGSNFFRKILQNIIS--RAQGASPVRTLSTKEFRHFANKVGNLERPFLKE--FFP 56
           +LEK +G +  RK++Q +I+   AQ  +   TLS   F     K+   +   LK+  F  
Sbjct: 351 LLEKLLGLDAMRKVVQQLIAPPEAQAGTLPPTLSLHRFLKLIAKLTGHD---LKKDGFDK 407

Query: 57  RWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPG 116
           RW+   GC  +  GF +N++++++E AV                       R G      
Sbjct: 408 RWIHGSGCVEVECGFWFNRKRHLIEFAV-----------------------RQGGKAESN 444

Query: 117 MMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAV 176
            +++RVHE+DG +DH +L +  D +   ++ C+S+L   R  K + G  P+  +D    V
Sbjct: 445 KLAVRVHEVDGAHDH-LLDLEEDLYGT-DLPCYSRLRKNRRKKGQPGEGPEEKEDEKPDV 502

Query: 177 AGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
                +   ++P+ WIR DPE E L ++ F QP +MWINQ+E D DVVAQ + +  +   
Sbjct: 503 P----QPRSDTPILWIRIDPEFEMLRKVQFRQPEKMWINQIELDRDVVAQYEGVQGVGQC 558

Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF--- 293
              S   V  L + + + + F+R+R  AA AL +   E +  A L  L+++++S  F   
Sbjct: 559 N--SAAAVAALASLMENPRMFYRIRALAAEALGSIRGEASA-AALEALLRYFRSAYFYDG 615

Query: 294 DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPY 353
           DEN  + RPNDF DF+ Y V +AIP A++ VR  +  +P E +EF+L LLK NDN GNPY
Sbjct: 616 DENSQM-RPNDFTDFAAYHVQKAIPVALSQVRDTEGFTPDEVLEFLLHLLKQNDNTGNPY 674

Query: 354 SDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQ 413
           SD  +LA +++++G  +   +S     S+LK++ R +  +RL+PSY+  +T++C++ + +
Sbjct: 675 SDYSYLATVIEALGNTQ--PKSNKGWDSILKQVQRFMHLERLLPSYHNRVTVACLKVMKK 732

Query: 414 IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
             ++ SG   LD  + L   +  +    +VR+ A +AL +L
Sbjct: 733 --MQASGRARLD--LDLFTSYTSYGYYRKVRIVALKALGEL 769


>gi|330801236|ref|XP_003288635.1| hypothetical protein DICPUDRAFT_152902 [Dictyostelium purpureum]
 gi|325081308|gb|EGC34828.1| hypothetical protein DICPUDRAFT_152902 [Dictyostelium purpureum]
          Length = 1853

 Score =  207 bits (528), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 121/385 (31%), Positives = 216/385 (56%), Gaps = 37/385 (9%)

Query: 30  TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
           T+ TK+F      +   +   LK+F  +W+   G P L  GF++N++K   E A L+   
Sbjct: 544 TIQTKKFLKLIKSMTGQD---LKQFAEKWIYGRGSPNLICGFNFNRKKFQTEFA-LKQVL 599

Query: 90  VKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCH 149
            +P+ +   +SS+               + IR+ EL+G ++  ++    D ++  +  CH
Sbjct: 600 NRPEDK---ISSS---------------LMIRITELEGSFEQ-VINFEDDMFE-YDFPCH 639

Query: 150 SKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQP 209
           S+    R  K KK    DG +        +D+    E+PL W R DP++E++ +I F QP
Sbjct: 640 SRC---RKTKKKKLLSEDGEE------MEIDLSQKRETPLLWFRIDPDLEWIHKITFRQP 690

Query: 210 VQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA 269
             MWI+QLE D DV+AQ +AI  L+    +S N V  +  F+ +++ F+RVR +A   L+
Sbjct: 691 EYMWIHQLELDKDVIAQIEAIKGLKDF--VSHNSVRAVFKFIQNTRFFYRVRCDATLLLS 748

Query: 270 NTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADN 329
             +++ET++ GL  LV ++K+  +D +    R N+F+DF+ YF+ +AIP ++++++  + 
Sbjct: 749 RLSTKETNFEGLNVLVSYFKNLFYDPDSNQLRSNNFKDFNSYFIQKAIPLSISLIKDKEG 808

Query: 330 KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRL 389
           ++P E +EF+L+LLK NDN+ N YSD ++L  L+ S+  +    + +  +  + K++ R 
Sbjct: 809 RTPLECIEFLLELLKDNDNSLNTYSDCYYLETLLISLSNIHTTNEQL--IKKIRKQVSRY 866

Query: 390 LQFDRLMPSYNGILTISCIRTLTQI 414
           L++D+L PSY   +T  C+R   Q+
Sbjct: 867 LKYDQLYPSYRNCVTAGCLRAFCQL 891


>gi|406604797|emb|CCH43782.1| Transcription initiation factor TFIID subunit 2 [Wickerhamomyces
            ciferrii]
          Length = 1347

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 148/518 (28%), Positives = 252/518 (48%), Gaps = 65/518 (12%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
            LST  F+H   KV N  R  L+ FF +WV   G P+ R+   +NK++  +E+ + +    
Sbjct: 561  LSTYHFQHVCEKV-NHNR--LESFFRQWVYGSGVPIFRVTQRFNKKRMFIEMGIRQVQHQ 617

Query: 91   KPDSRTP-----VLSSNTDSENRDGDIG---WPGMMSIRVHELDGM-YDHPILPMAGDAW 141
            + +++ P     +  +    E  D  +    + G M+IR+HE DG  Y+H +    G  +
Sbjct: 618  ETENQHPSPDTFISEARRQQETTDAQVPATLFTGPMTIRIHEADGTPYEHIVDLKEG--F 675

Query: 142  QLLEIQCHSKLAA-RRALKPKKGSKPDGCDDNGDAVA----GLDMRSSME---------- 186
              L+IQ ++K    +R+ K +K  K     D  + V     G  ++S  E          
Sbjct: 676  TKLDIQYNTKYKRLKRSQKQQKSDKDKPEPDEDETVLLHCLGDILQSDQEIKEWDLTEWT 735

Query: 187  ---------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE-AL 236
                         WIR D + E++ +IH NQP  M+ +QL++D DV AQ ++I     A 
Sbjct: 736  KEEEEKMNNEAFEWIRVDADFEWICKIHINQPDYMYASQLQQDRDVEAQLESINYFSIAN 795

Query: 237  PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
            P   ++ +  L   L DS+ ++ VR+EAA  +A  A E+++W GL HL+  +K     EN
Sbjct: 796  PSTLYSSI--LLRTLLDSRYYYGVRLEAAKGMAKYAKEKSNWIGLKHLLTAFKRLYCFEN 853

Query: 297  IGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDV 356
              +P PN+F DF  YF+ +AIP A++ ++  +N  P E   F+L +LK+NDN  NPYSD 
Sbjct: 854  SSIPFPNEFSDFPSYFLKKAIPEALSTIQDKNNNCPLEVRRFLLDILKFNDNLNNPYSDA 913

Query: 357  FWLAALVQSV-----------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTI 405
            F+++ L+  +              +       FL++ +  + R  + D  +PSYN ++ +
Sbjct: 914  FYVSHLITCLVRSLTSNVLIDNNSKLNSTDREFLNNAVAELSRHQKMDEWIPSYNSLIGV 973

Query: 406  SCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSAL 465
            S      ++A K  G + LD    L +  ++ N    VR+ A + L    F   G+ ++ 
Sbjct: 974  SAFNEKLKLARK--GLLELDPDY-LFEHTKE-NYFPDVRITAFQGL----FQSGGLKNSR 1025

Query: 466  SL----FIKSVEEEPSLRGQ-VKLGIHAMRICQIKGGS 498
             L     + S E+ P +R + ++    +  +  I+GGS
Sbjct: 1026 ILHYFFLLTSFEKSPYMRHKLIEAFAQSFGVVVIEGGS 1063


>gi|405952443|gb|EKC20254.1| Transcription initiation factor TFIID subunit 2 [Crassostrea gigas]
          Length = 1252

 Score =  184 bits (468), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 134/482 (27%), Positives = 229/482 (47%), Gaps = 68/482 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKV-GNLERPF 50
           MLE+++G     +++  +++ A   +  +          LST  F    + V G    PF
Sbjct: 439 MLEQRIGEQLLIQVVNKLVALATTGASQKVGANTWGNLLLSTSSFLKIISTVTGKDISPF 498

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           L+     WV   GC      F +N+++N+VEL + +D   K                  G
Sbjct: 499 LE----LWVSQSGCARFFGNFVFNRKRNVVELELKQDLGAK------------------G 536

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
            + + G +++ + ELDG ++H       +     +I CHSK    R  K KK     G +
Sbjct: 537 ALKYVGPLTVTIQELDGSFNHNF--KIEENKTKFDITCHSKC---RRNKKKKIPLMTGEE 591

Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
            + D ++ +D     +SP+ W+R DP+M  L+ + + QP  MW  QL  + D+VAQ +AI
Sbjct: 592 VDMD-LSAMDA----DSPVLWLRVDPDMNLLSHVTWEQPDYMWQYQLRYERDIVAQREAI 646

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYK 289
            ALE  P  +      L + L D   F+R+R+EAA+ LA  A+     WAG   ++  ++
Sbjct: 647 RALENYP--TAGTRRALTDILEDDHCFYRIRMEAAHCLAKVANSMVGVWAGPPAMMTIFR 704

Query: 290 SRRFDENI-GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDN 348
                 +   + R N+F +F  YF+L+ IP+A+A++R      P E  +F+++L KYNDN
Sbjct: 705 KMFGSHSCPAIIRQNNFSNFQNYFLLKTIPNAMALLRNIHTICPPEVFKFIMELFKYNDN 764

Query: 349 NGNPYSDVFWLAALVQSVGELEFGQQSILFLSS----------------LLKRIDRLLQF 392
           + N +SD ++ AALV++V +      +I  +S+                +L+ I R L  
Sbjct: 765 SRNKFSDNYYRAALVEAVSKT--VTPAITLMSTPGQGPTADNLSAETKQILEEITRYLNL 822

Query: 393 DRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALL 452
           ++L+P Y   +T+SC+  +      L  F  L   V+  K +        +R+ A  A++
Sbjct: 823 EKLLPCYRHTVTVSCLSAIR----TLQKFGHLPSEVEFFKSYAQHGVFRDIRLAAIDAIV 878

Query: 453 DL 454
           D 
Sbjct: 879 DF 880


>gi|330924409|ref|XP_003300628.1| hypothetical protein PTT_11925 [Pyrenophora teres f. teres 0-1]
 gi|311325144|gb|EFQ91276.1| hypothetical protein PTT_11925 [Pyrenophora teres f. teres 0-1]
          Length = 1526

 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 166/608 (27%), Positives = 269/608 (44%), Gaps = 97/608 (15%)

Query: 2    LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L KQ G N   + L  ++  A QG  P   ++T +F     KVG+ +   L+ FF +W+ 
Sbjct: 506  LVKQSGKNGVDRCLYRMLFNARQGKYPNGAITTDDFLDICEKVGHQK---LESFFQQWIV 562

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSEN---------RD-- 109
              GCP      ++NK+K +V+L + +   ++ DS    L +   +++         RD  
Sbjct: 563  GSGCPTFHCLPTFNKKKQVVQLTIRQ---LQADSSITTLENGLSADDFMREAKEKSRDLR 619

Query: 110  GDIGWP---GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
               G+P   G M+IR+HE DG  Y+H +   + +    +      K   R  L  ++ + 
Sbjct: 620  SATGYPTFVGPMTIRIHEADGTPYEHIVDINSNNVKVEIPYNTKYKRLKRNRLNKERMAA 679

Query: 166  PDGCDDNGDAVAGL--------------------------DMRSSMESPLSWIRADPEME 199
              G D +G+    +                          D          WIR D + E
Sbjct: 680  AAGLDASGETQEDVTFYMLGDLFTTPQEHNEWRLDDWPAEDEAKQEGEAYEWIRIDADFE 739

Query: 200  YLAEIHF-NQPVQMWINQLEKDGDVVAQAQAIAAL---EALPHLSFNVVNTLNNFLSDSK 255
            ++   +  + P  M+++QL++D DVVAQA++I  L   E    +S  +V T    L D++
Sbjct: 740  WICRTNIEDMPSYMYVSQLQQDKDVVAQAESIQFLANKEGHKLISTFLVKT----LMDNR 795

Query: 256  AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
             F  +R  AA  LA +A + TDW GL HL K ++      N G+ R NDF D S Y +  
Sbjct: 796  YFHGIRTMAASVLAQSALKRTDWVGLFHLKKAFQELFCLPNSGMTRSNDFSDRSMYLIQC 855

Query: 316  AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL------ 369
            AIP A+A ++  D +SP EA  F+L L++YNDN GN YSD  ++A L++ + E       
Sbjct: 856  AIPQAIARIKGEDGRSPLEAKRFLLDLVRYNDNRGNDYSDDHYIATLMRCIAETITKSNT 915

Query: 370  -------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
                   E+ Q+   F S  +  + R  + D  +P+Y  + T + +      ALKL    
Sbjct: 916  TSELSHEEWAQEQD-FTSKAIGELTRHQRLDEWIPTYQNVYTTTAL----DCALKL---- 966

Query: 423  SLDQVVKLIKPFR--DFNTIWQ------VRVEASRALLDLEFHCNGIDSALSLFIKSVEE 474
                V+  + PF+  +F    Q      VR++A   L+ L       D+ L   +  +  
Sbjct: 967  ----VMSRVTPFKPSEFLQYAQLGNADNVRLKAWECLIRLGMFRK--DTVLKFLVHEIRS 1020

Query: 475  EPS--LRGQVKLGIHAMRICQIKGGS--DSNHEVDTVTLVALLNLLESRIAFNNVFLRHH 530
            +PS   R Q+ L +  + I Q+  G        VDT     ++    S     +   R +
Sbjct: 1021 DPSPYFRKQL-LRLFGIAIGQVATGDVWAPEKAVDTTADTLVVENEASNAERASALARQN 1079

Query: 531  LFGILQIL 538
            L G L+ L
Sbjct: 1080 LNGALRGL 1087


>gi|156359773|ref|XP_001624939.1| predicted protein [Nematostella vectensis]
 gi|156211747|gb|EDO32839.1| predicted protein [Nematostella vectensis]
          Length = 997

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 140/499 (28%), Positives = 235/499 (47%), Gaps = 62/499 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           M+E ++G     ++   ++S A   +  +          LST  F    + V   +   +
Sbjct: 444 MIEIRIGQEPLLQVFNKLLSLANTTAQPKVDSSLWRNLLLSTSGFLKSISTVSGKD---I 500

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +W+   G       F +N+++NIVEL + +D +                    G 
Sbjct: 501 NVFIDQWICHSGIAKFHGSFVFNRKRNIVELDIKQDLS-------------------KGT 541

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
           + + G +++ V ELDG ++H +     D     E+ CHSK    +  K    +  +  D 
Sbjct: 542 VKYVGPLTVCVQELDGSFNHTV--QIEDITSRHELPCHSKSRRNKKKKIPLMTGEE-VDM 598

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
           N DA+         +SP+ W+R DPE+ +L ++ F QP  MW  QL  + D++AQ Q++ 
Sbjct: 599 NLDAMDS-------DSPVLWLRIDPEVTWLRQVSFEQPDYMWQYQLRHERDIMAQVQSLV 651

Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYKS 290
           ALE  P  +    + L + ++  + F+RVR EA   LA  AS   D W G   ++  +K+
Sbjct: 652 ALERFP--TAPTRDALMDIITQKECFYRVRQEACMCLAKVASASADSWTGQTAMLNIFKN 709

Query: 291 RRFDENI-GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
               ++   + R NDF +F  YFVL+ +P A+A VR   ++ PRE V F++ LLKYNDN 
Sbjct: 710 MFGSKSAPQIIRYNDFSNFISYFVLKTLPIAIASVRNIHSQCPREVVLFLMDLLKYNDNR 769

Query: 350 GNPYSDVFWLAALVQSV------GELEFGQQSILFLS------SLLKRIDRLLQFDRLMP 397
            N YSD + LAAL++++      G      Q +L +        +L  + R L  ++L P
Sbjct: 770 LNKYSDSYHLAALIEALSATITPGVAMVSTQRLLLVKLKDDTQLVLAEVIRRLNLEKLQP 829

Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFH 457
           SY   +T+SC++ L    L+++G +  D V    + +  F     VR  A   L D    
Sbjct: 830 SYRYCVTVSCLKALRN--LQVNGQVPADSVA--FEFYAGFGHFEDVRCSAIECLADYT-K 884

Query: 458 CNGIDSALSLFIKSVEEEP 476
               +  L+  ++ VE++P
Sbjct: 885 VEVSEKTLNFLMEIVEKDP 903


>gi|68487713|ref|XP_712267.1| hypothetical protein CaO19.9682 [Candida albicans SC5314]
 gi|46433641|gb|EAK93074.1| hypothetical protein CaO19.9682 [Candida albicans SC5314]
          Length = 917

 Score =  176 bits (447), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 142/470 (30%), Positives = 226/470 (48%), Gaps = 57/470 (12%)

Query: 24  GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
           G+ P  TL+T+ FR+   KV    R  L+ FF +WV   G PV R+   +NK+K ++E+ 
Sbjct: 103 GSLPNGTLNTEHFRYVCEKVN---RNKLENFFKQWVFGAGAPVFRISQRFNKKKGMIEMN 159

Query: 84  VLRDCTVKPDSRTPVLSSNTDSENRDGDIG--------------WPGMMSIRVHELDGM- 128
           + +   V+   RTP      DS +  GD                + G M+IRVHE DG  
Sbjct: 160 IRQ---VQHHERTP---KKLDSSSFIGDAVAYLDNEPSPPQQHVFTGPMTIRVHESDGSP 213

Query: 129 YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPD-GCDDNGDAVAG--------- 178
           Y+H +     +A   L+IQ +SK   R+  K   G  P    +  GD +           
Sbjct: 214 YEHIV--ELKEANTKLDIQYNSKF--RKIKKKDDGVDPGIAFNQFGDVLLSHAEMQEWNL 269

Query: 179 -----LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
                 D  S    P  WIR D + E++A I   QP  M+ +QL  D D+ AQ +A+   
Sbjct: 270 RDWDKRDEESLSSEPFEWIRGDVDFEWIARIDIKQPDFMFGSQLVYDRDIEAQLEAVRYF 329

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVK-FYKSRR 292
             L   +      L   L D + F+ VRI AA ALA+ ++E  ++ GL +L+K F K   
Sbjct: 330 GELEKTNTTYCTALTRTLMDDRYFYGVRIAAAQALADCSNEANNFIGLEYLLKAFGKLYC 389

Query: 293 FDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNP 352
           FD++  +P+ NDF DF  + +  A P  +  V+  D K P+     +L L+K+NDN+ N 
Sbjct: 390 FDDSF-IPKSNDFSDFRSFMLQSAFPRILCGVKDNDGKVPKRIQNLLLNLVKFNDNSMNA 448

Query: 353 YSDVFWLAALVQS-VGELEFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILT 404
           + D  +++ LV S V     G ++ +       F   +   + RL + DR +PSY  I+ 
Sbjct: 449 FVDSLYVSELVVSLVTSTVSGWRNDIEEYNDTEFAGKVSSELFRLQKLDRWIPSYQNIIE 508

Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
           ++CI+   ++AL  +  +S ++++ +           + RVEA R +L+L
Sbjct: 509 VTCIQQNIRMALVGARKLSFEELLYMTLD----KYPMEARVEAFRGILEL 554


>gi|68487423|ref|XP_712410.1| hypothetical protein CaO19.2135 [Candida albicans SC5314]
 gi|46433795|gb|EAK93224.1| hypothetical protein CaO19.2135 [Candida albicans SC5314]
          Length = 917

 Score =  176 bits (447), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 142/470 (30%), Positives = 226/470 (48%), Gaps = 57/470 (12%)

Query: 24  GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
           G+ P  TL+T+ FR+   KV    R  L+ FF +WV   G PV R+   +NK+K ++E+ 
Sbjct: 103 GSLPNGTLNTEHFRYVCEKVN---RNKLENFFKQWVFGAGAPVFRISQRFNKKKGMIEMN 159

Query: 84  VLRDCTVKPDSRTPVLSSNTDSENRDGDIG--------------WPGMMSIRVHELDGM- 128
           + +   V+   RTP      DS +  GD                + G M+IRVHE DG  
Sbjct: 160 IRQ---VQHHERTP---KKLDSSSFIGDAVAYLDNEPSPPQQHVFTGPMTIRVHESDGSP 213

Query: 129 YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPD-GCDDNGDAVAG--------- 178
           Y+H +     +A   L+IQ +SK   R+  K   G  P    +  GD +           
Sbjct: 214 YEHIV--ELKEANTKLDIQYNSKF--RKIKKKDDGVDPGIAFNQFGDVLLSHAEMQEWNL 269

Query: 179 -----LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
                 D  S    P  WIR D + E++A I   QP  M+ +QL  D D+ AQ +A+   
Sbjct: 270 RDWDKRDEESLSSEPFEWIRGDVDFEWIARIDIKQPDFMFGSQLVYDRDIEAQLEAVRYF 329

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVK-FYKSRR 292
             L   +      L   L D + F+ VRI AA ALA+ ++E  ++ GL +L+K F K   
Sbjct: 330 GELEKTNTTYCTALTRTLMDDRYFYGVRIAAAQALADCSNEANNFIGLEYLLKAFGKLYC 389

Query: 293 FDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNP 352
           FD++  +P+ NDF DF  + +  A P  +  V+  D K P+     +L L+K+NDN+ N 
Sbjct: 390 FDDSF-IPKSNDFSDFRSFMLQSAFPRILCGVKDNDGKVPKRIQNLLLNLVKFNDNSMNA 448

Query: 353 YSDVFWLAALVQS-VGELEFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILT 404
           + D  +++ LV S V     G ++ +       F   +   + RL + DR +PSY  I+ 
Sbjct: 449 FVDSLYVSELVVSLVTSTVSGWRNDIEEYNDTEFAGKVSSELFRLQKLDRWIPSYQNIIE 508

Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
           ++CI+   ++AL  +  +S ++++ +           + RVEA R +L+L
Sbjct: 509 VTCIQQNIRMALVGARKLSFEELLYMTLD----KYPMEARVEAFRGILEL 554


>gi|241956976|ref|XP_002421208.1| TATA binding protein-associated factor, putative; transcription
            initiation factor TFIID subunit, putative [Candida
            dubliniensis CD36]
 gi|223644551|emb|CAX41369.1| TATA binding protein-associated factor, putative [Candida
            dubliniensis CD36]
          Length = 1294

 Score =  176 bits (446), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 134/459 (29%), Positives = 218/459 (47%), Gaps = 42/459 (9%)

Query: 24   GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
            G+ P  TLST+ FR+   KV    R  L  FF +WV   G P+ R+   +NK+K ++E+ 
Sbjct: 563  GSLPNGTLSTEHFRYVCEKVN---RNKLDGFFKQWVFGAGAPIFRISQRFNKKKGMIEMN 619

Query: 84   V--LRDCTVKP---------DSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDH 131
            +  ++    KP         D     L   T    +     + G M+IRVHE DG  Y+H
Sbjct: 620  IRQVQHYERKPRKLDSCGFIDDAIAYLDEETVPPQQHV---FTGPMTIRVHESDGSPYEH 676

Query: 132  PILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPD-GCDDNGDAVAGL----------- 179
             +     +A   L+IQ +SK   R+  K  +G  P    +  GD +              
Sbjct: 677  IV--ELKEANTKLDIQYNSKF--RKIKKKDEGVDPGIAFNQFGDVLVSAAEMQEWNLRDW 732

Query: 180  ---DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
               D  S    P  WIR D + E++A I   QP  M+ +QL  D D+ AQ +A+     L
Sbjct: 733  DKRDEESLSSEPFEWIRGDVDFEWIARIDIKQPDFMFGSQLVYDRDIEAQLEAVRYFGEL 792

Query: 237  PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
               +      L   L D + F+ VRI AA ALA+  SE  ++ GL +L+K +      + 
Sbjct: 793  EKTNVTYCTALTRTLMDERYFYGVRIAAAQALADCCSEANNFIGLEYLLKAFGELYCFDG 852

Query: 297  IGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDV 356
              +P+ NDF DF  + +  A P  +  ++  D + P+   E +L L+KYNDN+GN + D 
Sbjct: 853  SFIPKSNDFSDFRGFMLQRAFPKILCGIKDNDGRVPKRVQELLLNLVKYNDNSGNVFVDS 912

Query: 357  FWLAALVQS-VGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIA 415
             +++ LV++ V     G +   F+  ++  + RL + DR + SY  ++ ++CI+   ++A
Sbjct: 913  LYVSELVEALVSSTVSGGRDDGFVDKVVGELFRLQKMDRWVVSYQNVIEVTCIQQSIRMA 972

Query: 416  LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            L     +SL++++ +           + RVEA   +L L
Sbjct: 973  LAGVRKVSLEELLYMTLD----KYPMEARVEAFSGILGL 1007


>gi|156848157|ref|XP_001646961.1| hypothetical protein Kpol_2000p71 [Vanderwaltozyma polyspora DSM
            70294]
 gi|156117643|gb|EDO19103.1| hypothetical protein Kpol_2000p71 [Vanderwaltozyma polyspora DSM
            70294]
          Length = 1495

 Score =  176 bits (445), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 134/486 (27%), Positives = 229/486 (47%), Gaps = 76/486 (15%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +LS+  F+H   +V    +  L+ FF +W+   G P+ R+  
Sbjct: 687  RVLPKIFLQAMSGDLPNNSLSSAHFQHICERVN---KNKLENFFQQWIYGSGVPIFRVTQ 743

Query: 72   SYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSEN---------RDGDIGWPGMMSIRV 122
             +NK++ +VE+ + R C  +      V  S+  S +         R+    + G M+IR+
Sbjct: 744  RFNKKRMVVEMGI-RQCQAQELGSGKVTGSSGFSSSALNFLEHPTREMAPHFSGSMTIRI 802

Query: 123  HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKP-----KKGSKPDGCDDNGDAV 176
            HE DG  Y+H +     D +  L+IQ ++K    RA KP     KK + P+  +D  +  
Sbjct: 803  HEPDGTPYEHIV--EVKDVFTKLDIQYNTKYRRSRARKPLGARSKKENTPEVSNDINEKY 860

Query: 177  AGLDMRSSME-------SP--------------------------LSWIRADPEMEYLAE 203
               D  ++++       SP                            WIR D + E++++
Sbjct: 861  HERDNNTNIQKLGTVLTSPQDCSEWNLTDISVSSEGHELQRQGEAFEWIRIDSDFEWISK 920

Query: 204  IHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLS----DSKAFWR 259
            I+ NQP  M+ +QL++DGDV AQ ++I   E +   S N     ++ L+    DS+ F+ 
Sbjct: 921  IYINQPDYMFTSQLQQDGDVEAQIESIRYFEDVVMSSQNNSKLYSSILTRTIVDSRYFYG 980

Query: 260  VRIEAAYALANTASEETDW----AGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
            VR+EA  AL+     + +      G  HL+K Y+     EN  +P  N F DF  Y+V +
Sbjct: 981  VRLEACRALSKFVVRDNESLSFRGGAQHLIKIYQKLFCYENSNIPLNNCFSDFKRYYVQQ 1040

Query: 316  AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQS 375
            +IP  ++ V+   +++P     F++ LL YN+N GNPY D F++  L+++  +      S
Sbjct: 1041 SIPQNLSNVKDERDETPDFIKSFIIDLLTYNENTGNPYDDTFFVCTLIKAAVKCAINSPS 1100

Query: 376  -ILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPF 434
               F+  LL+++ R    D+  P+Y  ++T     T+    LKL+         K +  F
Sbjct: 1101 DTNFIGKLLEQLYRYESLDKWTPTYQLLIT----NTILSEKLKLNS--------KKLDRF 1148

Query: 435  RDFNTI 440
             D N+I
Sbjct: 1149 EDLNSI 1154


>gi|451849770|gb|EMD63073.1| hypothetical protein COCSADRAFT_44824, partial [Cochliobolus sativus
            ND90Pr]
          Length = 1157

 Score =  176 bits (445), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 156/565 (27%), Positives = 256/565 (45%), Gaps = 95/565 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L KQ G N   + L  ++  A QG  P   ++T +F     KVG+ +   L+ FF +W+ 
Sbjct: 498  LVKQSGKNGVDRCLYRMLFNARQGKYPNGAVTTDDFLDICEKVGHQK---LESFFQQWIV 554

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSEN---------RD-- 109
              GCP      S+NK+K +V+L + +   ++ DS    + +   +++         RD  
Sbjct: 555  GAGCPTFHCVPSFNKKKQVVQLTIKQ---LQADSSITTIENGLAADDFMREAKQKSRDLR 611

Query: 110  GDIGWP---GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
               G+P   G M+IR+HE DG  Y+H +   + +    +      K   R  L  ++ + 
Sbjct: 612  SATGYPTFVGPMTIRIHEADGTPYEHIVDINSNNVKVEIPYNTKYKRLKRNRLNKERMAA 671

Query: 166  PDGCDDNGDAVAGL------DMRSSME--------------------SPLSWIRADPEME 199
              G D +G+    +      D+ +S +                        WIR D + E
Sbjct: 672  AAGLDASGETQEDVTFYMLGDLFTSPQEHEEWRLDDWPQKDEAKQEGEAYEWIRIDADFE 731

Query: 200  YLAEIHF-NQPVQMWINQLEKDGDVVAQAQAIAAL---EALPHLSFNVVNTLNNFLSDSK 255
            ++   +  + P  M+++QL++D DVVAQA++I  L   E  P +S  +V T    L D++
Sbjct: 732  WICRTNIEDMPSYMYVSQLQQDKDVVAQAESIQFLANKEGHPLISTFLVKT----LMDNR 787

Query: 256  AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
             F  +R  AA  LA +A + T W GL HL K ++      N  + R NDF D S Y +  
Sbjct: 788  YFHGIRTMAAMVLAQSALKRTQWVGLFHLKKAFRELFCLPNSPMTRSNDFSDRSMYLIQC 847

Query: 316  AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL------ 369
            AIP A+A ++  D +SP EA  F+L L++YNDN GN YSD  ++A L++ + E       
Sbjct: 848  AIPRAIAKIKGEDGRSPLEAKRFLLDLVRYNDNRGNDYSDDHYIATLMRCLAETLTKTNI 907

Query: 370  -------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
                   E+ Q+   F +  +  + R  + D  +P+Y  + T + +      A KL    
Sbjct: 908  GSELSHEEWAQEQD-FTNKAIGELTRHQRLDEWIPTYQNVYTTTAL----DCAFKL---- 958

Query: 423  SLDQVVKLIKPFR--DFNTIWQ------VRVEASRALLDLEFHCNGIDSALSLFIKSVEE 474
                V   + PF+  +F    Q      VR++A   L+ L       DS +   +  +  
Sbjct: 959  ----VTSRVTPFKPTEFLQYAQLGNADNVRLKAWECLIRLGMFRK--DSVMRFLVHEIRS 1012

Query: 475  EPS--LRGQVKLGIHAMRICQIKGG 497
            +PS   R Q+ L I  + I Q+  G
Sbjct: 1013 DPSPYFRKQM-LRILGLAIGQVATG 1036


>gi|344300178|gb|EGW30518.1| hypothetical protein SPAPADRAFT_157654 [Spathaspora passalidarum NRRL
            Y-27907]
          Length = 1507

 Score =  174 bits (442), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 129/438 (29%), Positives = 204/438 (46%), Gaps = 65/438 (14%)

Query: 24   GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
            G  P  T++T  F++   KV    R  L  FF +WV + G P LR+   +NK++ ++E+ 
Sbjct: 599  GDLPNGTITTTHFQYLCEKVN---RNKLDGFFKQWVYSTGTPHLRIQQKFNKKRGMIEMQ 655

Query: 84   V-----LRDCTVKPDSRTPVLSSNTDSENRDGDIGWP---GMMSIRVHELDGM-YDHPIL 134
            +     L     +   R+ +  + +  E+  G    P   G ++IRVHE DG  Y+H + 
Sbjct: 656  IRQVQHLEKRQARLTPRSFITDATSYIEDEPGHPLLPVFTGPITIRVHEADGTPYEHIVE 715

Query: 135  PMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMES------- 187
               G+    ++IQ +SK+      KPKK           D  AGLDM S   S       
Sbjct: 716  LKEGNV--KIDIQYNSKMR-----KPKKK----------DIEAGLDMMSPFTSRLGDVLT 758

Query: 188  ------------------------PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDV 223
                                    P  WIR D + E++A I   QP  M+  QL  D D+
Sbjct: 759  SPADMSSWGLSEFEKRDEEMMFNDPFEWIRVDVDYEWIAVIDIKQPDYMFAAQLCHDRDI 818

Query: 224  VAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLH 283
             AQ +++     L   S      L   L D + F+ VRI AA ALA  ++   ++ GL +
Sbjct: 819  EAQLESVRYFRGLDKPSVVHCTVLVRTLMDDRYFYGVRIAAAEALAKFSNSSNNFIGLGY 878

Query: 284  LVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
            L++ YK         +P+ NDF DF +YF+  AIP  ++ VR  D   P    E +L L+
Sbjct: 879  LLRAYKELFCFRGSTIPKSNDFADFRQYFIQVAIPKILSKVRDQDGIVPPVIQELLLNLV 938

Query: 344  KYNDNNGNPYSDVFWLAALVQSVGELEFG-----QQSILFLSSLLKRIDRLLQFDRLMPS 398
            K+NDN  N ++D  +++ L++++             +  F   +   IDRLL+ D+ MPS
Sbjct: 939  KFNDNTNNSFTDGIYVSKLIEALTTTAVTGRKDEPHTKQFAKHVNTEIDRLLKLDQYMPS 998

Query: 399  YNGILTISCIRTLTQIAL 416
            Y+ I+ I+CI+   ++A+
Sbjct: 999  YHNIIEITCIKQKIRLAV 1016


>gi|452001588|gb|EMD94047.1| hypothetical protein COCHEDRAFT_1192202 [Cochliobolus heterostrophus
            C5]
          Length = 1562

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 156/565 (27%), Positives = 256/565 (45%), Gaps = 95/565 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L KQ G N   + L  ++  A QG  P   ++T +F     KVG+ +   L+ FF +W+ 
Sbjct: 507  LVKQSGKNGVDRCLYRMLFNARQGKYPNGAVTTDDFLDICEKVGHQK---LESFFQQWII 563

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSEN---------RD-- 109
              GCP      S+NK+K +V+L + +   ++ DS    + +   +++         RD  
Sbjct: 564  GAGCPTFHCVPSFNKKKQVVQLTIKQ---LQADSSITTIENGLAADDFMREAKQKSRDLR 620

Query: 110  GDIGWP---GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
               G+P   G M+IR+HE DG  Y+H +   + +    +      K   R  L  ++ + 
Sbjct: 621  SATGYPTFVGPMTIRIHEADGTPYEHIVDINSNNVKVEIPYNTKYKRLKRNRLNKERMAA 680

Query: 166  PDGCDDNGDAVAGL------DMRSSME--------------------SPLSWIRADPEME 199
              G D +G+    +      D+ +S +                        WIR D + E
Sbjct: 681  AAGLDASGETQEDVTFYMLGDLFTSPQEHEEWRLDDWPPEDEAKQEGEAYEWIRIDADFE 740

Query: 200  YLAEIHF-NQPVQMWINQLEKDGDVVAQAQAIAAL---EALPHLSFNVVNTLNNFLSDSK 255
            ++   +  + P  M+++QL++D DVVAQA++I  L   E  P +S  +V T    L D++
Sbjct: 741  WICRTNIEDMPSYMYVSQLQQDKDVVAQAESIQFLANKEGHPLISTFLVKT----LMDNR 796

Query: 256  AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
             F  +R  AA  LA +A + T W GL HL K ++      N  + R NDF D S Y +  
Sbjct: 797  YFHGIRTMAAMVLAQSALKRTQWVGLFHLKKAFRELFCLPNSPMTRSNDFSDRSMYLIQC 856

Query: 316  AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL------ 369
            AIP A+A ++  D +SP EA  F+L L++YNDN GN YSD  ++A L++ + E       
Sbjct: 857  AIPRAIAKIKGEDGRSPLEAKRFLLDLVRYNDNRGNDYSDDHYIATLMRCLAETLTKTNI 916

Query: 370  -------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
                   E+ Q+   F +  +  + R  + D  +P+Y  + T + +      A KL    
Sbjct: 917  SSELSHEEWAQEQD-FTNKAIGELTRHQRLDEWIPTYQNVYTTTAL----DCAFKL---- 967

Query: 423  SLDQVVKLIKPFR--DFNTIWQ------VRVEASRALLDLEFHCNGIDSALSLFIKSVEE 474
                V   + PF+  +F    Q      VR++A   L+ L       DS +   +  +  
Sbjct: 968  ----VTSRVTPFKPTEFLQYAQLGNADNVRLKAWECLIRLGMFRK--DSVMRFLVHEIRS 1021

Query: 475  EPS--LRGQVKLGIHAMRICQIKGG 497
            +PS   R Q+ L I  + I Q+  G
Sbjct: 1022 DPSPYFRKQM-LRILGLAIGQVATG 1045


>gi|238882937|gb|EEQ46575.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1423

 Score =  173 bits (438), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 141/470 (30%), Positives = 226/470 (48%), Gaps = 57/470 (12%)

Query: 24   GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
            G+ P  TL+T+ FR+   KV    R  L+ FF +WV   G PV R+   +NK+K ++E+ 
Sbjct: 609  GSLPNGTLNTEHFRYVCEKVN---RNKLENFFKQWVFGAGAPVFRISQRFNKKKGMIEMN 665

Query: 84   VLRDCTVKPDSRTPVLSSNTDSENRDGDIG--------------WPGMMSIRVHELDGM- 128
            + +   V+   RTP      DS +  GD                + G M+IRVHE DG  
Sbjct: 666  IRQ---VQHHERTP---KKLDSSSFIGDAVAYLDNEPSPPQQHVFTGPMTIRVHESDGSP 719

Query: 129  YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPD-GCDDNGDAVAG--------- 178
            Y+H +     +A   L+IQ +SK   R+  K   G  P    +  GD +           
Sbjct: 720  YEHIV--ELKEANTKLDIQYNSKF--RKIKKKDDGVDPGIAFNQFGDVLLSHAEMQEWNL 775

Query: 179  -----LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
                  D  S    P  WIR D + E++A I   QP  M+ +QL  D D+ AQ +A+   
Sbjct: 776  RDWDKRDEESLSSEPFEWIRGDVDFEWIARIDIKQPDFMFGSQLVYDRDIEAQLEAVRYF 835

Query: 234  EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVK-FYKSRR 292
              L   +      L   L D + F+ VRI AA ALA+ ++E  ++ GL +L+K F +   
Sbjct: 836  GELEKTNTTYCTALTRTLMDDRYFYGVRIAAAQALADCSNEANNFIGLEYLLKAFGELYC 895

Query: 293  FDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNP 352
            FD++  +P+ NDF DF  + +  A P  +  V+  D K P+     +L L+K+NDN+ N 
Sbjct: 896  FDDSF-IPKSNDFSDFRGFMLQSAFPRILCGVKDNDGKVPKRIQNLLLNLVKFNDNSMNA 954

Query: 353  YSDVFWLAALVQS-VGELEFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILT 404
            + D  +++ LV S V     G ++ +       F   +   + RL + DR +PSY  I+ 
Sbjct: 955  FVDSLYVSELVVSLVTSTVSGWRNDIEEYNDTEFAGKVSSELFRLQKLDRWIPSYQNIIE 1014

Query: 405  ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            ++CI+   ++AL  +  +S ++++ +           + RVEA R +L+L
Sbjct: 1015 VTCIQQNIRMALVGARKLSFEELLYMTLD----KYPMEARVEAFRGILEL 1060


>gi|388853437|emb|CCF52836.1| related to TAF2-component of TFIID complex [Ustilago hordei]
          Length = 1846

 Score =  171 bits (433), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 129/462 (27%), Positives = 221/462 (47%), Gaps = 87/462 (18%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           LST++F     KV   +   L+ F   W+   GCP      ++N++K ++E+ + ++   
Sbjct: 543 LSTQQFLRTCRKVSGAD---LRLFTEHWIRGSGCPRFICSANFNRKKLLIEMHIRQE--- 596

Query: 91  KPDSRTPVLSSNTDSENRDGDIG-------WPGMMSIRVHELDGM-YDHPILPMAGDAWQ 142
                   + +   +  R GD         + G M++R+HE DG  Y+H +L + G A +
Sbjct: 597 --------VPAAQFAAARPGDALAANPVPLFEGQMTVRIHEADGTPYEH-VLEIKGPA-K 646

Query: 143 LLEIQCHSKLA-ARRALK---------------PKKGSKPDGCDDNGDAVAGLDMRSSME 186
             E+  ++K    RR  K                ++  +  G  D G  +   +   + E
Sbjct: 647 RYEVPFNTKYKRVRRNTKRFQARQAAAAAAAQGDQEAQEAIGLIDLGFGLGMWEDEKARE 706

Query: 187 -----------------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
                            +P  WIR D + E+LA IHF QP  MW++QL++D DVVAQ  A
Sbjct: 707 EWKVADWTEEDEEKMSSAPYEWIRMDADFEWLASIHFEQPDYMWVSQLQRDRDVVAQVSA 766

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYK 289
           + AL  +P  S    + L   +  +K F+RVR EA +AL + A  + D  GL HL+K ++
Sbjct: 767 VHALAQMP--SLVTCSMLTRTVLVNKYFYRVRTEAVHALVHCAIPQLDNLGLFHLLKLFR 824

Query: 290 SRRF-----DENI-------GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVE 337
           S+       D +I        +PR NDF D ++YF+  A+ HA++ VR  D ++P +   
Sbjct: 825 SQFCHDPPDDASIEDPLDVPCIPRANDFSDAADYFLQRALIHAISRVRDPDGRTPPQIKR 884

Query: 338 FVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------------ELEFGQQSILFLSSLL 383
           F++ LL+YNDN+ N + D F+LA  + +V               + +   +    L+  +
Sbjct: 885 FLINLLRYNDNSTNHFVDDFYLAGAINAVASAFIPVDSAMAGSQDPDAANEDSFLLTHAI 944

Query: 384 KRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLD 425
             ++RL + DRL+PSY+ ++T++ +    Q+A+ L+     D
Sbjct: 945 AEVERLQELDRLVPSYHNVITLASLD--FQVAMMLANLKPRD 984


>gi|194215065|ref|XP_001916176.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
           TFIID subunit 2 [Equus caballus]
          Length = 1205

 Score =  171 bits (433), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 137/477 (28%), Positives = 227/477 (47%), Gaps = 57/477 (11%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK + R AL      K     
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSK-SRRXALIYNDKLKKIPLM 613

Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
            NG+ V       S +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++I
Sbjct: 614 -NGEEVDXXVFSFSADSPLLWIRIDPDMAVLRKVEFEQADFMWQYQLRYERDVVAQQESI 672

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFYK 289
            ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   + 
Sbjct: 673 LALEKFPTPASRLA--LTDILEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF- 729

Query: 290 SRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYND
Sbjct: 730 TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYND 789

Query: 348 NNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLM 396
           N  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L+
Sbjct: 790 NRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLL 849

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 850 PSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 902


>gi|255716052|ref|XP_002554307.1| KLTH0F02222p [Lachancea thermotolerans]
 gi|238935690|emb|CAR23870.1| KLTH0F02222p [Lachancea thermotolerans CBS 6340]
          Length = 1443

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 141/512 (27%), Positives = 226/512 (44%), Gaps = 85/512 (16%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  TLST  F+H   +V   +   L+ FF  WV   G P+ R+  
Sbjct: 635  RVLPKIFLQAMSGDLPNSTLSTAHFQHVCERVNKNK---LENFFKEWVYGSGVPIFRITQ 691

Query: 72   SYNKRKNIVELAVLRDC---------TVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRV 122
             +NK++ +VE+ + R C         TV  +  +    ++  S N++    + G M+IR+
Sbjct: 692  RFNKKRMVVEMGI-RQCQVQELGLGKTVGDEGFSSSAMNHLCSPNKNLTPVFTGSMTIRI 750

Query: 123  HELDGM-YDHPILPMAGDAWQLLEIQCHS------------KLAARRALKPKKGSKPD-- 167
            HE DG  Y+H +     D +  L+IQ ++            K A +    P     PD  
Sbjct: 751  HEADGTPYEHIV--ELKDTFTKLDIQYNTKYKRLKRRRKINKTAKQEGKDPTPAENPDVL 808

Query: 168  -GCDDNGDAV---------------------------AGLDMRSSMESPLSWIRADPEME 199
               D+N D V                            G + +   E+   WIR D + E
Sbjct: 809  VAEDENEDIVLVNCLGDVLLSAKDCNQWNLTDPMVTSEGDEFQQQNEA-FEWIRIDADFE 867

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNT----------LNN 249
            ++ +IH NQP  M+ +QL++D DV AQ  +I   E +      VVN+          L  
Sbjct: 868  WVCKIHINQPDYMFASQLQQDRDVEAQIDSIRFFEDV------VVNSNVNSMVYSSILTR 921

Query: 250  FLSDSKAFWRVRIEAAYALANTASEETDW----AGLLHLVKFYKSRRFDENIGLPRPNDF 305
             + D + F+ VRIEA  AL+    ++ +      G  HL+K ++     EN  +P  NDF
Sbjct: 922  TVMDERYFYGVRIEACRALSRFIMQQPNTDNFSGGARHLIKIFQYYFCYENSNIPFDNDF 981

Query: 306  RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
             D+  YF+ +AIP  +A VR  ++  PR    F+L +L+YN+N  N Y D  ++  L+ S
Sbjct: 982  SDYQRYFLQKAIPQLLAEVRNQNDVCPRYVKRFLLDILRYNENGDNSYYDTHYVVTLITS 1041

Query: 366  VGELEFGQ-QSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISL 424
            V        +   F   LL  + R    D+ +PSY  +L   CI  +  + L + GF   
Sbjct: 1042 VVTSALQDPEDASFYQDLLHELRRFKNLDQWIPSYQ-LLVTKCI-MIQHLRLAIDGFYVF 1099

Query: 425  DQVVKLIKPF--RDFNTIWQVRVEASRALLDL 454
            D +  +++       N+  +    A+  L DL
Sbjct: 1100 DDIANILEYTVKEKLNSTSETSFHATEGLQDL 1131


>gi|384496605|gb|EIE87096.1| hypothetical protein RO3G_11807 [Rhizopus delemar RA 99-880]
          Length = 1667

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 143/507 (28%), Positives = 231/507 (45%), Gaps = 95/507 (18%)

Query: 1   MLEKQM----GSNFFRKILQNI-ISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           ML+K+M    G+    ++L  I +S   G      +ST  F     KV   +    K F 
Sbjct: 248 MLDKRMCKGGGTLGLSRVLPKILVSAMSGELAQNAISTHYFLRLCRKVSGFD---TKVFA 304

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT----VKPDSRTPVLSSNTDSENRDGD 111
            +WV   GCP     F +N++K +VE+ + ++ T    + PD+    +S   + +     
Sbjct: 305 EQWVYRSGCPKFSFSFHFNRKKMVVEIFMRQENTNALVIGPDN----MSYGVEYQELMAP 360

Query: 112 IGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA-ARRALK---------- 159
           + + G +++R+HE DG  Y+H IL +  D  Q  E+Q ++K    RR  K          
Sbjct: 361 L-FTGNLTVRIHEADGTPYEH-ILDIHSDK-QKFEVQFNTKYKRIRRNTKRFQAKQAAAA 417

Query: 160 PKKGSKPDGCDDNGDAVAGLDMRSSM-------ESP------------------------ 188
                +    ++ GD    L +  S+       E P                        
Sbjct: 418 AAVAEEEQENEEGGDQTTVLGIIPSLGLGMPIFEDPNRKKEWRIVEWGQDEEDTSGAASA 477

Query: 189 -LSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
              WIR D E E++  + F QP  MW  QL KD DVVAQ +AI AL+ +P  S+    +L
Sbjct: 478 MFDWIRLDAEFEWVCVVEFKQPDYMWAAQLTKDRDVVAQHEAIDALKYMP--SYATSTSL 535

Query: 248 NNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR-------RFDENIG-- 298
              + D K F+++R+EAAY LA+      +W GL  L K +  R       +   N G  
Sbjct: 536 LRAIMDPKCFYKIRMEAAYGLASCGIPSLNWVGLHQLSKMFHQRFCLPISSQSSGNDGKT 595

Query: 299 --------LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNG 350
                   +P+PN+F D SEYF+ +A   A A +R     +P +  +F+L LL+YNDN G
Sbjct: 596 DELPLIRSIPKPNNFSDLSEYFLQKASVVAFAQIRDERGLTPVKIRQFLLDLLRYNDNIG 655

Query: 351 NPYSDVFWLAALVQSVGELEFGQQSILFL----------SSLLKRIDRLLQFDRLMPSYN 400
           N +SD +++A L+ ++G+         F           ++    I+R    D ++PSY+
Sbjct: 656 NEFSDCYYVATLISALGDALIPTADEGFQVDDPEGEQVNAAAKAEIERFRTLDYVIPSYH 715

Query: 401 GILTISCI---RTLTQIALKLSGFISL 424
            I+T++C+   R    + ++L+ F SL
Sbjct: 716 NIITVTCLKYTRYGNYLEVRLAAFDSL 742


>gi|190345979|gb|EDK37961.2| hypothetical protein PGUG_02059 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1601

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 136/474 (28%), Positives = 229/474 (48%), Gaps = 75/474 (15%)

Query: 30   TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
            TLST  F++   KV    R  L+ FF +WV   G P+  +   +NK+++I+E+ + +  +
Sbjct: 616  TLSTLHFQYVCEKVN---RNKLESFFKQWVFNAGVPLFSVAQRFNKKRSIIEMQIRQIQS 672

Query: 90   V-----KPDSRTPVLSSNTDSENRDGDIGWP---GMMSIRVHELDGM-YDHPILPMAGDA 140
                  KP+S   +  S    ++    +  P   G M+IR+HE +G  Y+H I+ +  + 
Sbjct: 673  QIIRKSKPNSNNFMEDSIAYLDDEPSFLVQPVFTGPMTIRIHEANGTPYEH-IIDLKHET 731

Query: 141  WQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAG-------LDMRSSME------- 186
             + L+IQ ++K    R +K K G      D+  + V+        L  ++ ME       
Sbjct: 732  SK-LDIQYNTKF---RRMKKKTG------DEVHETVSSFKQLGDILQTKAEMEEWNLVEW 781

Query: 187  ----------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
                          W+R D + E+++++   QP  M+ +QL+ D DV AQ +A+      
Sbjct: 782  IKDEEDPSFNDAFEWLRVDADFEWISKMSVKQPDYMFASQLQYDRDVEAQYEAVRYFATR 841

Query: 237  PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
               + N    L   + DS+ ++ VRI AA ALA  +     + GL +L+K +K+      
Sbjct: 842  DKPNRNYCTVLTRTVMDSRYYYGVRIAAARALAGFSKTNNLFIGLEYLMKIFKTLYCFPE 901

Query: 297  IGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDV 356
              +P+ NDF D + YF+ + +    A +R  D  SP E   F+L LLKYNDN+ N Y D 
Sbjct: 902  SNVPKTNDFNDLASYFLKKEMISIFATIRDDDGNSPTEVKRFLLSLLKYNDNSNNDYQDG 961

Query: 357  FWLAALVQSV-------GELE-----FGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT 404
            F++AAL++++       G+L      F      F+   L  I+RL + D  +PSY+ +++
Sbjct: 962  FYVAALIKALTSCIIITGDLRDIPDVFDNSKQKFVDDALVEINRLHKLDEWVPSYHNMIS 1021

Query: 405  ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDF--NTIWQ----VRVEASRALL 452
            + C+++  Q+A+   G+  L        PF D    T+ +    VRVEA R L 
Sbjct: 1022 VVCLQSKVQLAVH--GYYEL--------PFEDLLPYTLGKFPLDVRVEAFRGLF 1065


>gi|392597793|gb|EIW87115.1| TATA-binding protein associated factor Taf2 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 1830

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 141/529 (26%), Positives = 243/529 (45%), Gaps = 84/529 (15%)

Query: 1   MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +L++++G +       R + +  +S   G  P   LST  F     KV  ++   L+ F 
Sbjct: 481 ILDRRLGKSGTSLGLSRVLPKIFLSAMTGEMPNNALSTHSFLRTCRKVSGVD---LRTFA 537

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW- 114
            +W+   GCP      ++N++K  VE+ + ++C   P  R   L  N  S+  +  + + 
Sbjct: 538 EQWIYGSGCPAFGFTATFNRKKMAVEITMRQEC---PAYR--ALEQNEVSKLLNKPVPFF 592

Query: 115 PGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA-ARRALK----------PKK 162
            G M++R+HE DG  Y+H +L +   A++  E+  ++K    RR  K             
Sbjct: 593 EGQMTVRIHEADGTPYEH-VLDIRS-AFKRYEVPFNTKYKRIRRNTKRYLARQAAAQAAA 650

Query: 163 GSKPDGCDDNG--DAVAGLDMRSSME--------------------SPLSWIRADPEMEY 200
              P+  +  G  D   GLD+    +                    +   WIR D + E+
Sbjct: 651 EGDPEAVEAMGLIDMGFGLDIWEKEKERENWKVADWTEEEEQVMSGATYEWIRMDADFEW 710

Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRV 260
           +A + F QP  MW++QL++D DVVAQ +A+ AL   P  +  V +T    +  S  ++R+
Sbjct: 711 IAAVAFEQPDFMWVSQLQRDRDVVAQMEAVHALARQP--TAIVSSTFTKTVLVSNYYYRI 768

Query: 261 RIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPRPNDFRDFSEY 311
           R EAA  L N A  + D+ GL H+ K +    +D          +  +P+PNDF D +EY
Sbjct: 769 RCEAAACLVNCAIRKLDFLGLFHMFKLFLRYCYDPEDPNQDLFSHNYVPKPNDFSDLAEY 828

Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV----- 366
           FV +++   +A VR  + K+P    +F++  L+YNDN  NPYSD F++  L+ +      
Sbjct: 829 FVRKSLVDGIAQVRFENGKTPSVVRQFLVDQLRYNDNTANPYSDAFYICTLMAASACANV 888

Query: 367 -------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLT 412
                  GEL       E   +    L+  L+ ++R    DRL+PS + ++T + +    
Sbjct: 889 PTAPPERGELLKEEVRSEQTAEDASLLNQALEEVNRYRSMDRLIPSPHNVVTTAVLEFY- 947

Query: 413 QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
            + L ++  +  D  +K+  P        QVR+ A   L   +++   I
Sbjct: 948 -LLLSVANLVPND--MKVFFPLTREGNYTQVRIAAFDGLFMTKWYSPPI 993


>gi|255733076|ref|XP_002551461.1| hypothetical protein CTRG_05759 [Candida tropicalis MYA-3404]
 gi|240131202|gb|EER30763.1| hypothetical protein CTRG_05759 [Candida tropicalis MYA-3404]
          Length = 1463

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 142/481 (29%), Positives = 231/481 (48%), Gaps = 49/481 (10%)

Query: 24   GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
            G  P   LST+ FR+   KV    R  L  FF +WV   G PV R+   +NK+K ++E+ 
Sbjct: 667  GDLPNGALSTEHFRYVCEKVN---RNKLDNFFKQWVFGSGAPVFRISQRFNKKKGMIEMN 723

Query: 84   VLRDCTVKPDSRTPVLSSN--TDSENRDGDIGWP-------GMMSIRVHELDGM-YDHPI 133
            + R      +++  +   N   D+ +   D   P       G M+IRVHE DG  Y+H I
Sbjct: 724  I-RQIQHHENTKRKLDKDNFVVDAISYLDDESAPPVQQVFTGPMTIRVHESDGSPYEH-I 781

Query: 134  LPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDN--GDAVAG------------- 178
            + +  +A   L+IQ +SK   R+  K  +G  P G   N  GD +               
Sbjct: 782  VDLK-EANTKLDIQYNSKF--RKIKKKDEGVDP-GIPFNQFGDVLTTPFEMQQWNLKDWN 837

Query: 179  -LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP 237
              D  S    P  WIR D ++E++A I   QP  M+ +QL  D D+ AQ +A+     L 
Sbjct: 838  KRDEESLSSEPFEWIRGDVDLEWIARIDIKQPDFMFGSQLIYDRDIEAQLEAVRYFGELE 897

Query: 238  HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENI 297
              +     TL   L D + F+ VRI AA ALA  ++E  ++ G  +L++ ++S       
Sbjct: 898  KTNIVYCTTLVRTLMDERYFYGVRIAAAQALAGCSNEGNNFIGTDYLLQAFRSLYCFPGS 957

Query: 298  GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVF 357
             +PR NDF DF  + + ++ P  ++ V+  D K P    E +L L+K+NDN+ N ++D  
Sbjct: 958  IIPRSNDFSDFRSFMLQKSFPKILSSVKDNDGKVPDRIRELLLNLVKFNDNSMNGFNDSH 1017

Query: 358  WLAALVQSVGELEFG------QQS--ILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
            ++  ++ ++ E          QQS    F+  ++  +DRL + D  MPSY   + I+C++
Sbjct: 1018 YVCTILSALTESAVSGWRKDLQQSNDARFVEKVVTELDRLQKLDHWMPSYQSSVEIACVK 1077

Query: 410  TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFI 469
               ++AL  +  +S + ++ +           + RVEA + LL++    N   S L  F+
Sbjct: 1078 QKIRMALVGAITVSFEDLLHMTLD----KYPMEARVEAFKGLLEIGGLKN--KSVLDYFL 1131

Query: 470  K 470
            K
Sbjct: 1132 K 1132


>gi|343429422|emb|CBQ72995.1| related to TAF2-component of TFIID complex [Sporisorium reilianum
           SRZ2]
          Length = 1838

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 126/455 (27%), Positives = 221/455 (48%), Gaps = 73/455 (16%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           LST+ F     KV   +   L+ F   W+   GCP      ++N++K ++E+ + ++   
Sbjct: 536 LSTQHFLRTCRKVSGAD---LRLFTEHWIRGSGCPRFICSANFNRKKLLIEMHIRQEV-- 590

Query: 91  KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCH 149
            P ++    +   D+ + +    + G M++R+HE DG  Y+H +L + G A +  E+  +
Sbjct: 591 -PAAQF-AAARPGDALSANAVPLFEGQMTVRIHEADGTPYEH-VLEIKGPA-KRYEVPFN 646

Query: 150 SKLA-ARRALK---------------PKKGSKPDGCDDNGDAVAGLDMRSSME------- 186
           +K    RR  K                ++  +  G  D G  +   +   + E       
Sbjct: 647 TKYKRVRRNTKRFQARQAAAAAAAQGDQEAQEAIGLIDLGFGLGMWEDEKAREEWKVADW 706

Query: 187 ----------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
                     +P  WIR D + E+LA IHF QP  MW++QL++D DVVAQ  A+ AL  +
Sbjct: 707 TEEDEEKMSSAPYEWIRMDADFEWLASIHFEQPDYMWVSQLQRDRDVVAQVSAVHALAQM 766

Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
           P  S    + L   +  +K F+RVR EA +AL + A  + D  GL HL+K ++++   ++
Sbjct: 767 P--SLVTCSMLTRTVLVNKYFYRVRTEAVHALVHCAIPQLDNLGLFHLLKLFRTQFCHDS 824

Query: 297 IG------------LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLK 344
                         +PR NDF D ++YF+  A+ HA++ VR  D ++P +   F++ LL+
Sbjct: 825 PDEAAIENPLDVPCIPRANDFSDAADYFLQRALIHAISRVRDPDGRTPPQVKRFLINLLR 884

Query: 345 YNDNNGNPYSDVFWLAALVQSVGELEFGQQSIL--------------FLSSLLKRIDRLL 390
           YNDN+ N + D F+LA  + ++       +S +               LS  +  ++RL 
Sbjct: 885 YNDNSTNHFVDDFYLAGAINALASAFIPVESAIASNQDADAVSEESFLLSHAIAEVERLQ 944

Query: 391 QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLD 425
           + DRL+PSY+ ++T++ +    Q+A+ L+     D
Sbjct: 945 ELDRLVPSYHNVITLASLD--FQVAMMLANLKPRD 977


>gi|395334726|gb|EJF67102.1| hypothetical protein DICSQDRAFT_158655 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 1880

 Score =  169 bits (429), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 137/484 (28%), Positives = 226/484 (46%), Gaps = 87/484 (17%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           +ST  F     KV  ++    + F  +W+   GCP      S+N++K  VE+ + +D   
Sbjct: 527 ISTHSFLRMCRKVSGIDP---RSFAEQWIYGSGCPTFGFSASFNRKKMAVEITMRQDAPA 583

Query: 91  KPDSRTPVLSSNTDSENRDGDIG-WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC 148
                   L +N  ++     +  + G M+IR+HE DG  Y+H +L +    ++  E+  
Sbjct: 584 YKH-----LENNEITKMMLKPVPFFEGQMTIRIHEADGTPYEH-VLDIRS-PFKRYEVPF 636

Query: 149 HSKLA-ARRALK------------PKKGSKPDGCDDNGDAVAGLDMRSSMESPLSW---- 191
           ++K    RR  K             +  +      D  D   GLD+    +   +W    
Sbjct: 637 NTKYKRVRRNTKRYLARQAAAQAAAEGDADAAAAMDMVDMGFGLDIWEKEQERENWKVAD 696

Query: 192 ----------------IRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEA 235
                           IR D + E++A I F+Q   MW++QL++D DVVAQ +AI  L  
Sbjct: 697 WTEEDEQNMAGQTYEWIRIDADFEWIAAIAFDQKDYMWVSQLQRDRDVVAQYEAILVLSK 756

Query: 236 LPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD- 294
            P+    + +T    +  S  ++R+R EAA AL   A++  +W GL HL K +    ++ 
Sbjct: 757 TPNPI--ISSTFTKTVLVSNYYFRIRCEAAAALVTCANQRLEWLGLFHLFKLFLRYCYEP 814

Query: 295 --------ENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
                    +  +PRPNDF DF+EYFV ++I  A++ VR  + KSP    +F++  L+YN
Sbjct: 815 EDPKADLFSHTYVPRPNDFSDFAEYFVRKSILSAISRVRFENGKSPPVIRQFLVDQLRYN 874

Query: 347 DNNGNPYSDVFWLAALVQSVG-------ELEFGQ------------QSILFLSSLLKRID 387
           DN  NPY+D F++  ++ ++G         E G+            Q    L + L  +D
Sbjct: 875 DNTTNPYADAFYICNIISALGCATVSSHPPERGEFISNDAAPAQDPQDAALLQAALSEVD 934

Query: 388 RLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPF----RDFNTIWQV 443
           R    DRL+P+Y+ ++T++ I    Q+ L L+G +  D      +PF    R+ N I QV
Sbjct: 935 RYRSMDRLIPTYHNVVTVAVIE--FQMLLSLAGLVPHDP-----RPFFPLTREGNYI-QV 986

Query: 444 RVEA 447
           R+ A
Sbjct: 987 RIAA 990


>gi|443894493|dbj|GAC71841.1| TATA binding protein associated factor [Pseudozyma antarctica T-34]
          Length = 1845

 Score =  169 bits (427), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 122/437 (27%), Positives = 217/437 (49%), Gaps = 70/437 (16%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           LST++F     KV   +   L+ F   W+   GCP      ++N++K ++E+ + ++   
Sbjct: 543 LSTQQFLRTCRKVSGAD---LRLFTEHWIRGSGCPRFICSANFNRKKLLIEMHIRQEV-- 597

Query: 91  KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCH 149
            P ++    +   D+   +    + G M++R+HE DG  Y+H +L + G A +  E+  +
Sbjct: 598 -PAAQF-AAARPGDALAANAVPLFEGQMTVRIHEADGTPYEH-VLEIKGPA-KRYEVPFN 653

Query: 150 SKLA-ARRALK---------------PKKGSKPDGCDDNGDAVAGLDMRSSME------- 186
           +K    RR  K                ++  +  G  D G  +   +   + E       
Sbjct: 654 TKYKRVRRNTKRFQARQAAAAAAAQGDQEAQEAIGLIDLGFGLGMWEDEQAREEWKVADW 713

Query: 187 ----------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
                     +P  WIR D + E+LA IHF QP  MW++QL++D DVVAQ  A+ AL  +
Sbjct: 714 TEEDEEKMSSAPYEWIRMDADFEWLASIHFEQPDHMWVSQLQRDRDVVAQVSAVHALAQM 773

Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR----- 291
           P  S    + L   +  +K F+RVR EA +AL + A  + D  GL HL+K ++S+     
Sbjct: 774 P--SLVTCSMLTRTVLVNKYFYRVRTEAVHALVHCAIPQLDNLGLFHLLKLFRSQFCHDP 831

Query: 292 ----RFDENIG---LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLK 344
                F++ +    +PR NDF D ++YF+  A+ HA++ VR  D ++P +   F++ LL+
Sbjct: 832 PNEASFEDPLDVPCIPRANDFSDAADYFLQRALIHAISRVRNPDGRTPAQVKRFLINLLR 891

Query: 345 YNDNNGNPYSDVFWLAALVQSVGE----LEFG---------QQSILFLSSLLKRIDRLLQ 391
           YNDN+ N + D F+LA  + ++      ++ G          +    L+  +  ++RL +
Sbjct: 892 YNDNSTNHFVDDFYLAGAINALASAFIPIDSGLAQDSSANADEENFLLAHAIAEVERLQE 951

Query: 392 FDRLMPSYNGILTISCI 408
            DRL+PS++ ++T++ +
Sbjct: 952 LDRLVPSFHNVITLASL 968


>gi|146420870|ref|XP_001486388.1| hypothetical protein PGUG_02059 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1601

 Score =  168 bits (426), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 138/474 (29%), Positives = 229/474 (48%), Gaps = 75/474 (15%)

Query: 30   TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
            TLST  F++   KV    R  L+ FF +WV   G P+  +   +NK+++I+E+ + +   
Sbjct: 616  TLSTLHFQYVCEKVN---RNKLESFFKQWVFNAGVPLFLVAQRFNKKRSIIEMQIRQIQL 672

Query: 90   V-----KPDSRTPVLSSNTDSENRDGDIGWP---GMMSIRVHELDGM-YDHPILPMAGDA 140
                  KP+S   +  S    ++    +  P   G M+IR+HE +G  Y+H I+ +  + 
Sbjct: 673  QIIRKSKPNSNNFMEDSIAYLDDEPSFLVQPVFTGPMTIRIHEANGTPYEH-IIDLKHET 731

Query: 141  WQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAG-------LDMRSSM-------- 185
             + L+IQ ++K    R +K K G      D+  + V+        L  ++ M        
Sbjct: 732  SK-LDIQYNTKF---RRMKKKTG------DEVHETVSSFKQLGDILQTKAEMEEWNLVEW 781

Query: 186  ----ESPL-----SWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
                E PL      W+R D + E+++++   QP  M+ +QL+ D DV AQ +A+      
Sbjct: 782  IKDEEDPLFNDAFEWLRVDADFEWISKMSVKQPDYMFASQLQYDRDVEAQYEAVRYFATR 841

Query: 237  PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
               + N    L   + DS+ ++ VRI AA ALA  +     + GL +L+K +K+      
Sbjct: 842  DKPNRNYCTVLTRTVMDSRYYYGVRIAAARALAGFSKTNNLFIGLEYLMKIFKTLYCFPE 901

Query: 297  IGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDV 356
              +P+ NDF D + YF+ + +    A +R  D  SP E   F+L LLKYNDN+ N Y D 
Sbjct: 902  SNVPKTNDFNDLASYFLKKEMISIFATIRDDDGNSPTEVKRFLLSLLKYNDNSNNDYQDG 961

Query: 357  FWLAALVQSV-------GELE-----FGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT 404
            F++AAL++++       G+L      F      F+   L  I+RL + D  +PSY+ +++
Sbjct: 962  FYVAALIKALTSCIIITGDLRDIPDVFDNSKQKFVDDALVEINRLHKLDEWVPSYHNMIS 1021

Query: 405  ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDF--NTIWQ----VRVEASRALL 452
            + C++   Q+A+   G+  L        PF D    T+ +    VRVEA R L 
Sbjct: 1022 VVCLQLKVQLAVH--GYYEL--------PFEDLLPYTLGKFPLDVRVEAFRGLF 1065


>gi|50548681|ref|XP_501810.1| YALI0C13904p [Yarrowia lipolytica]
 gi|49647677|emb|CAG82120.1| YALI0C13904p [Yarrowia lipolytica CLIB122]
          Length = 1264

 Score =  168 bits (426), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 134/484 (27%), Positives = 231/484 (47%), Gaps = 74/484 (15%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
            L T  F     +V + +   L EFF +WV + G PVL++   +NK++  +E+ + +  T 
Sbjct: 567  LGTNHFYKLCERVSHCK---LDEFFDQWVYSSGVPVLQVTQRFNKKRMFIEMGIRQTQTD 623

Query: 91   KPDSRTPV--------LSSNTDSENRDGDIGWP------GMMSIRVHELDGM-YDHPILP 135
            K   + PV        + S    ++ D + G P      G ++IR+HE DG  Y+H +  
Sbjct: 624  KSGEKKPVEDTFIGDVIRSEEGVDHGDEEEGGPKTPLFTGPITIRIHEADGTPYEHVV-- 681

Query: 136  MAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGC-----------------DDNGDAVAG 178
                 +  L+IQ ++K   +R  + KK  +                     D+ G  ++ 
Sbjct: 682  DIKHIFTKLDIQYNTKY--KRLKRNKKEVEAAATVTATAAPPAASASNNEEDEVGVVISC 739

Query: 179  L----------------------DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQ 216
            L                      +MR S E+   W+R D + E++ +IH NQP  M+ +Q
Sbjct: 740  LGDVLQTDAELEEWELSEWGKDEEMRMSNEA-FEWLRIDADFEWICKIHVNQPDYMYASQ 798

Query: 217  LEKDGDVVAQAQAIAAL-EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
            L++D DVVAQ +++    E+ P   ++ +  L   L D + ++ VR+ AA  L   A  E
Sbjct: 799  LQQDRDVVAQYESVQFFAESRPSKVYSTI--LLRTLIDRRYYYGVRVAAALGLVKMAVSE 856

Query: 276  TDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA 335
            T + G  HL+K +++     N  +PR NDF DF  YFV +AI  A + +R  D + P E 
Sbjct: 857  TSYIGQTHLLKAFQNLFCFPNSSIPRSNDFSDFPTYFVQKAITRAFSQIRNGDGECPAEI 916

Query: 336  VEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL-----EFGQQSILFLSSLLKRIDRLL 390
               +L +L YN+N+ N  SD F+L+ L+  + ++        + +  F +  +  IDR  
Sbjct: 917  RTLILDILAYNENSTNSMSDSFYLSDLITFLADVLCSSDSNDESTREFAARAVNEIDRCG 976

Query: 391  QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRA 450
            + D  +PSY  ++T++ +    ++ LK  G  S  ++  LI+  R  N+  +VR+ A  A
Sbjct: 977  RLDTWIPSYQNVVTVAALEAKKRLFLK--GLFSA-KLQDLIQFTRSVNSS-EVRLAAFEA 1032

Query: 451  LLDL 454
            +L +
Sbjct: 1033 MLHI 1036


>gi|71015037|ref|XP_758766.1| hypothetical protein UM02619.1 [Ustilago maydis 521]
 gi|46098556|gb|EAK83789.1| hypothetical protein UM02619.1 [Ustilago maydis 521]
          Length = 1846

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 130/482 (26%), Positives = 229/482 (47%), Gaps = 75/482 (15%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
            LST+ F     KV   +   L+ F   W+   GCP      ++N++K ++E+ + ++   
Sbjct: 542  LSTQLFLRTCRKVSGAD---LRLFTEHWIRGSGCPRFICSANFNRKKLLIEMNIRQEV-- 596

Query: 91   KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCH 149
             P ++    +   D+   +    + G M++R+HE DG  Y+H +L + G A +  E+  +
Sbjct: 597  -PAAQF-AAARPGDALAANAVPLFEGQMTVRIHEADGTPYEH-VLEIKGPA-KRYEVPFN 652

Query: 150  SKLA-ARRALK---------------PKKGSKPDGCDDNGDAVAGLDMRSSME------- 186
            +K    RR  K                ++  +  G  D G  +   +   + E       
Sbjct: 653  TKYKRVRRNTKRFQARQAAAAAAAQGDQEAQEAIGLIDLGFGLGMWEDEKAREEWKVADW 712

Query: 187  ----------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
                      +P  WIR D + E+LA IHF QP  MW++QL++D DVVAQ  A+ AL  +
Sbjct: 713  TEEDEEKMSSAPYEWIRMDADFEWLASIHFEQPDYMWVSQLQRDRDVVAQVSAVHALAQM 772

Query: 237  PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
            P  S    + L   +  +K F+RVR EA +AL + A  + D  GL HL+K ++++   + 
Sbjct: 773  P--SLVTCSMLTRTVLVNKYFYRVRTEAVHALVHCAIPQLDNLGLFHLLKLFRTQFCHDP 830

Query: 297  IG------------LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLK 344
                          +PR NDF D ++YF+  A+ HA++ VR  D ++P +   F++ LL+
Sbjct: 831  PDEASIENPLDVPCIPRANDFSDAADYFLQRALIHAISRVRNPDGRTPPQVKRFLINLLR 890

Query: 345  YNDNNGNPYSDVFWLAALVQSVG--------------ELEFGQQSILFLSSLLKRIDRLL 390
            YNDN+ N + D F+LA  + ++               + +   +    L+  +  ++RL 
Sbjct: 891  YNDNSTNHFVDDFYLAGAINALASAFIPIESAVAATQDADMASEESFLLAHAVAEVERLQ 950

Query: 391  QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRA 450
            + DRL+PSY+ ++T++ +    Q+A+ L+     D  ++L   +        VR+ A   
Sbjct: 951  ELDRLVPSYHNVITLASLD--FQVAMMLANLKPKD--LQLFFTYTRQGNFTPVRIAALNC 1006

Query: 451  LL 452
            LL
Sbjct: 1007 LL 1008


>gi|440640095|gb|ELR10014.1| hypothetical protein GMDG_00772 [Geomyces destructans 20631-21]
          Length = 1406

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 158/580 (27%), Positives = 253/580 (43%), Gaps = 91/580 (15%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L K  GS    +++  ++  A  G    R ++T+ FR    K+G+ +   L  FF +W+ 
Sbjct: 554  LTKLSGSTGIVRVISRLLYNASIGEVSDRAVTTEGFRRLCEKIGHYK---LDAFFNQWIM 610

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD-------GDIG 113
              GCP  ++   +NK+K  VE+ + +    +P  R  +       E ++       GD+ 
Sbjct: 611  GAGCPRFQVTQKFNKKKLAVEMTISQKQDTQPTQRK-LEKHQFLREFKEEAYGVYAGDVQ 669

Query: 114  --WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-----AARRALKPKKGSK 165
              + G M+IR+HE DG  Y+H +    G   Q +EI  H+K      + R   +   G  
Sbjct: 670  PVFTGPMTIRIHESDGTPYEHIVEIREG--IQKIEIPYHTKYKRLKRSRREKERATAGPG 727

Query: 166  PDGCDDNGDAVA-----------------GL-DMRSSMESPL-----SWIRADPEMEYLA 202
             D   +N D V                  GL D  + ME  +      WIR D + E++ 
Sbjct: 728  VDITTENHDDVLLYCLGDVLQSKEDVQEWGLVDWDTEMEVKMDQESYEWIRMDADFEWIC 787

Query: 203  EIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRI 262
            E++ N P  M+++QL++D DVVAQ  ++  L   P     V   L   L D + FW +R 
Sbjct: 788  ELNLNMPSYMYLSQLQQDRDVVAQQDSMLFLAKSPPHPL-VSTILIRTLMDRRYFWGIRE 846

Query: 263  EAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVA 322
             A   LA    E   W G +HL K ++         +P+ NDF D ++Y +  AIP A+A
Sbjct: 847  MACRYLATHGVEAHSWIGRVHLEKAFQRMFCYPGSSMPKSNDFSDKTDYRMKLAIPKAMA 906

Query: 323  MVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV------------GELE 370
             VR  + K P EA +F+L LL++NDN  N YSD +++  L++++            G + 
Sbjct: 907  RVRNIEGKCPLEARQFLLDLLRFNDNGNNEYSDYYYICTLMKALADSLIPIKNETPGGMS 966

Query: 371  FG------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLTQIALKLSGF 421
            F       ++   F    L+ I+R  + D  + SY  I T   + C R L +  +  S  
Sbjct: 967  FSFADEDEEELQRFQQVALEEIERYRRMDEWILSYQNIYTKTALECKRRLMKSKITPSD- 1025

Query: 422  ISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR-- 479
                   + +  +    T+  VR++A  AL DL +  N       L + S +  P +R  
Sbjct: 1026 ------PRELFAYTHDGTMDLVRIQAFDALTDLGYLSNHHLLRYFLNVMSTDASPFVRDR 1079

Query: 480  ---------GQVKLGIHAMRICQIKGGSDSNHEVDTVTLV 510
                     G V LG H+      K   +   E D  TL+
Sbjct: 1080 LCQILFSGLGAVALGEHS------KAAVEPKQEEDMDTLI 1113


>gi|393213296|gb|EJC98793.1| hypothetical protein FOMMEDRAFT_161628 [Fomitiporia mediterranea
           MF3/22]
          Length = 1807

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 135/502 (26%), Positives = 233/502 (46%), Gaps = 91/502 (18%)

Query: 1   MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +L++++G +       R + +  +S   G      LST  F     KV  ++   L+ F 
Sbjct: 471 ILDRRLGKSGTSLGLSRVLPKIFLSATAGDMAQNCLSTHFFLRTCRKVSGVD---LRSFV 527

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
            +W+   GCP      ++N++K  VE+ + ++C        P  + + +    +  +   
Sbjct: 528 DQWIHGSGCPQFSFLATFNRKKMAVEIQMRQEC--------PAYAIHQNDPISNSLLKPV 579

Query: 114 --WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA-ARRALK------PKKG 163
             + G M++R+HE DG  Y+H +  ++    +  E+  ++K    RR  K          
Sbjct: 580 PFFEGQMTVRIHEADGTPYEHVLDILS--PIKRFEVPFNTKYKRVRRNTKRYLARQAAAQ 637

Query: 164 SKPDGCDDNGDAVAGLDMRSSME--------------------------SPLSWIRADPE 197
           +  +G  +  +A+  +DM   +E                          +   WIR D +
Sbjct: 638 AAAEGDTEAAEAMGMVDMSFGLEIWENEEEREKWKVTDWTEEEEQAMAGATYEWIRMDAD 697

Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
            E++A I F QP  MW++QL++D DVVAQ +A+ AL   P  +  V +TL   +  +  F
Sbjct: 698 FEWIARIQFEQPDYMWVSQLQRDRDVVAQVEAVHALARQP--TPIVSSTLTKTVLVTNYF 755

Query: 258 WRVRIEAAYALANTASEETDWAGLLHLVKFY---------KSRRFDENIGLPRPNDFRDF 308
           +RVR EAA AL N AS   D+ GL HL K +          S+    +  +PRPNDF D 
Sbjct: 756 FRVRCEAALALVNCASRRLDFVGLFHLFKLFLRYCYEPEDSSQDLFSHKFVPRPNDFSDV 815

Query: 309 SEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV-- 366
           +EYF   A+ +A++ VR  + K+P E  +F +  L+YNDN  N YSD +++A+L+ ++  
Sbjct: 816 AEYF---ALVNAISQVRFENGKTPPEIRQFFIDQLRYNDNTTNAYSDAYYIASLITALAC 872

Query: 367 ----------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
                     GEL       +   +    L + +  +DR    DRL+PS + ++TI+ + 
Sbjct: 873 ATISTAPPERGELTMSVPKNDITGEDHQLLQAAITEVDRYRSMDRLIPSVHNVVTIASLE 932

Query: 410 TLTQIALKLSGFISLDQVVKLI 431
               I L ++  I  D  + L+
Sbjct: 933 --FHIMLAMANLIPSDPKIFLL 952


>gi|150865037|ref|XP_001384087.2| hypothetical protein PICST_58795 [Scheffersomyces stipitis CBS 6054]
 gi|149386293|gb|ABN66058.2| predicted protein, partial [Scheffersomyces stipitis CBS 6054]
          Length = 1408

 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 136/470 (28%), Positives = 223/470 (47%), Gaps = 57/470 (12%)

Query: 30   TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR-DC 88
            TLST+ F++   KV    R  L+ FF +WV   G P+  +   +NK++ ++E+++ +   
Sbjct: 589  TLSTQHFQYVCEKVN---RNKLENFFKQWVYGVGAPIFNITQRFNKKRGVIEMSIRQIQH 645

Query: 89   TVKPDSRTPVLSSNTDS-ENRDGDIGWP------GMMSIRVHELDGM-YDHPILPMAGDA 140
             V   S T   S   DS    + +  +P      G M+IRVHE DG  Y+H +    G+ 
Sbjct: 646  QVTRKSGTNAESFINDSIAYLEDEPTFPVQSIFTGPMTIRVHEADGTPYEHIVDLKEGNT 705

Query: 141  WQLLEIQCHSKLAARRALKPKKGSKPDGCDDN--GDAVAG---------LDMRSSMESPL 189
               L++Q +SK   RR  K +  +  +    +  GD +            D     + P+
Sbjct: 706  --KLDVQYNSKF--RRMKKNRDETSENAVTFSRLGDVLESEKEMEEWNLADWAKVDDDPM 761

Query: 190  S-----WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVV 244
            +     WIR D + E++A     QP  M+ +QL+ D DV AQ  A+  L  +   S    
Sbjct: 762  NIEAFEWIRVDVDFEWIARFDVKQPDYMFGSQLQHDRDVEAQFDAVRYLGNIEKPSTIHC 821

Query: 245  NTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPND 304
              L   + D + ++ VRI AA ALAN ++  T++ G+ +LVK Y+         +P  ND
Sbjct: 822  TALTRTVVDERYYYGVRIAAAEALANFSNSVTNFIGVPYLVKIYRELYCFPGSSIPLSND 881

Query: 305  FRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQ 364
            F DF  +F+ + IP  +  +R +D++ P      +L L+K+NDN  N + D F+++ LVQ
Sbjct: 882  FNDFGRFFLQKEIPKQLCKIRDSDDEVPVVIRNLILNLIKFNDNTNNNFQDSFYISELVQ 941

Query: 365  SVGELEF-------------------GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTI 405
            S+                          +   F+++++  I+RL + D  +PSY+ ++++
Sbjct: 942  SLTTCAVNSSFPNSPKDIFPKSHPHGSSEKKKFVANVITEINRLQKLDEWIPSYHNVVSV 1001

Query: 406  SCIRTLTQ-IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +C   LTQ I L L G + L     L      F    Q+RVEA R L  L
Sbjct: 1002 TC---LTQKIRLALHGHLDLSFEDLLYFTVEKFPI--QIRVEAFRGLFVL 1046


>gi|354543021|emb|CCE39739.1| hypothetical protein CPAR2_601590 [Candida parapsilosis]
          Length = 1364

 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 132/435 (30%), Positives = 204/435 (46%), Gaps = 45/435 (10%)

Query: 24   GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
            G  P  TLST  F++   KV    R  L  FF +WV   G P  ++   +NK+K ++EL 
Sbjct: 621  GDLPNGTLSTDHFQYVCEKVN---RNKLDSFFKQWVFGSGAPSFKISQRFNKKKGMIELT 677

Query: 84   VLR---------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPI 133
            + +            VK      +   N +  ++  ++ + G M+IRVHE DG  Y+H I
Sbjct: 678  IRQVQHHENKNVQLNVKSFIDDAICFLNKEPLSQRQNV-FTGPMNIRVHESDGTPYEHII 736

Query: 134  LPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDN--GDAVA-------------- 177
                 +    ++I    K+  R+  K  +G  P G + N  GD +               
Sbjct: 737  --HIKEMKTTIDISLSKKV--RKIRKKDEGVDP-GLNFNQFGDILTTEEEKRTWYLRDWE 791

Query: 178  GLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP 237
              D  S    P  WIR D + E++A+I   QP  M+ +QL  D DV AQ  A+     L 
Sbjct: 792  SRDEDSLSTEPFEWIRGDIDFEWIAKIEVLQPDFMYGSQLVYDRDVEAQFDALRYFGNLE 851

Query: 238  HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENI 297
              S      L   L D + F+R+RI AA ALA+ ++E  ++ G+ +L+K YK      + 
Sbjct: 852  KPSIIYCTALVRTLLDERYFYRIRIAAAEALASCSNESNNYIGMKYLIKAYKYMYCFPDS 911

Query: 298  GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVF 357
             +P+ NDF DF  + + +AIP  +  +   D + P E    +  LLKYNDN+ N Y D  
Sbjct: 912  SIPKSNDFGDFRSFLIQKAIPGILCKITDNDGRVPAEVQGLLFNLLKYNDNSMNDYQDTL 971

Query: 358  WLAALVQ--SVGELEFGQQSIL------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
            +L+ L++  +V  +    ++ +      F   +   I R+ + D   PSY  IL I+C +
Sbjct: 972  YLSDLIKALTVSAISGWHKNTVEGNNLEFALKVDVEITRIQKLDEWRPSYQRILQITCFK 1031

Query: 410  TLTQIALKLSGFISL 424
            +  QIA  L G ISL
Sbjct: 1032 SRVQIA--LCGAISL 1044


>gi|361131910|gb|EHL03525.1| putative Transcription initiation factor TFIID subunit 2 [Glarea
            lozoyensis 74030]
          Length = 1328

 Score =  165 bits (418), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 152/541 (28%), Positives = 239/541 (44%), Gaps = 82/541 (15%)

Query: 8    SNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVL 67
            S   R I + IIS   G      ++TK+FR    KV        + F+ +W+   GCP L
Sbjct: 559  SGLPRIISKLIISGNTGLPEESVMTTKDFRRLVEKVTKYRA--TESFWNQWIYGAGCPRL 616

Query: 68   RMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD-------GDIG--WPGMM 118
             +   +NK++  VE+ + +  +  P +R P+   N   E ++       G+I   + G +
Sbjct: 617  NITQRFNKKRLCVEMTINQSQSTLPTTR-PLKKENFPRELKEELHGIYAGEIQPLFTGPL 675

Query: 119  SIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAA-----RRALKPKKGSKPDGCDDN 172
            +IR+HE DG  Y+H +     +  Q +EI  ++K        R+  +   G+  DG   +
Sbjct: 676  TIRIHEADGTPYEHIVEIREQN--QKIEIPYNTKYKRLKRNRRQKERLNPGANVDGETGD 733

Query: 173  ------GDAVAG---------------LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQ 211
                  GD + G                + R   ES   W+R D + E++     N P  
Sbjct: 734  TLYYCLGDVLQGEQDVAEWGLKDWDKETEARMDSES-YEWLRIDADFEWICGTTLNVPAY 792

Query: 212  MWINQLEKDGDVVAQAQAIAALE-ALPHLSFNVVNTLN-NFLSDSKAFWRVRIEAAYALA 269
            MWI+QL++D DVVAQ Q++  L  A PH    +V T     L D++ +  +R  AA  L 
Sbjct: 793  MWISQLQQDRDVVAQQQSMQFLALATPH---GLVATFCLRTLMDTRYYHEIRTMAANILK 849

Query: 270  NTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADN 329
              A E   W+GL HL + Y+     E   +PR NDF D   Y+V +AIP A++  R    
Sbjct: 850  KHACEAAHWSGLRHLTRAYQEFFCYEGTKMPRSNDFSDKRSYWVEQAIPTALSQSRTKAG 909

Query: 330  KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---------------------- 367
            K P+ A EF+L  L++NDN  NP+SD   +A L+ ++                       
Sbjct: 910  KCPKAAREFILDTLQFNDNRANPFSDNHKVANLLTALAESLMSSDPGTYKEPANYTGNGP 969

Query: 368  ---ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI-- 422
               E+E   +   F   +L  +DR  + D  + SY  I T + +    +  L  +G I  
Sbjct: 970  SEPEVEDPDEIAQFRDKVLVELDRYRRMDEWINSYQNIYTTTVLEAYRK--LMKAGVIPK 1027

Query: 423  -SLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQ 481
              LD V  L     D      VR++A  +L+DL +  +     L L + S +  P  R Q
Sbjct: 1028 EPLDFVQYLHDGTSDL-----VRIKAFESLIDLGYITHEGICGLLLNVLSTDISPYTRNQ 1082

Query: 482  V 482
            +
Sbjct: 1083 L 1083


>gi|396470788|ref|XP_003838714.1| hypothetical protein LEMA_P023870.1 [Leptosphaeria maculans JN3]
 gi|312215283|emb|CBX95235.1| hypothetical protein LEMA_P023870.1 [Leptosphaeria maculans JN3]
          Length = 1522

 Score =  165 bits (417), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 147/560 (26%), Positives = 242/560 (43%), Gaps = 84/560 (15%)

Query: 2    LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L KQ G N   + L  ++  A QG      ++T +F     KVG+ +   L+ FF +WV 
Sbjct: 508  LVKQSGKNGVDRCLYRMLFNARQGKYSNGAIATDDFLDICEKVGHQK---LESFFAQWVY 564

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG------- 113
              GCP+ +   S+NK+K +V+L + ++      +R     +  D      + G       
Sbjct: 565  GSGCPIFQCFPSFNKKKQVVQLQITQEQAQPSITRIESGLTAEDFMREAKEKGHEVQSGT 624

Query: 114  -WP---GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDG 168
             +P   G M+IR+HE DG  Y+H +   +      +      K   R  L  ++ +   G
Sbjct: 625  EYPTFVGPMTIRIHEADGTPYEHIVDINSASVKVEIPYNTKYKRLKRNRLNKERMAAAAG 684

Query: 169  CDDNGDAVAGL--------------------------DMRSSMESPLSWIRADPEMEYLA 202
             D +G+    +                          D     +    WIR D + E++ 
Sbjct: 685  LDASGETQEDVTFYMLGDLFTTPEEHAGWRLDDWPAEDEAKQEQEAYEWIRIDADFEWIC 744

Query: 203  EIHFNQ-PVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKAFW 258
            +   N  P  M+++QL++D DVVAQA++I   A  E    +S  ++ T    L DS+ F 
Sbjct: 745  KTVINDMPSYMYVSQLQQDKDVVAQAESIQFLAQKEGHKLISTFLIKT----LMDSRYFH 800

Query: 259  RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
             +R  AA  +A +A + T+W GL HL K ++         + R NDF D S Y +  AIP
Sbjct: 801  GIRTMAAEVMAQSALKRTEWIGLFHLKKAFRELFCIPGSPMTRSNDFTDRSSYLIQCAIP 860

Query: 319  HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQS--- 375
             A+A +R  D ++P E   F+L +L+YNDN GN YSD  +++ L++ + +     ++   
Sbjct: 861  RAIAKIRGEDGRAPMEVKRFLLDVLQYNDNRGNDYSDDHYISTLMRCLADCLISSKNNND 920

Query: 376  ----------ILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIAL------KLS 419
                        F    +  + R  + D  +P+Y  I T + +   T++ L      K S
Sbjct: 921  DLSPAEWAAEQRFQEKAIGELTRYQRLDEWIPTYQNIYTTTALECATKLTLNRVTPFKPS 980

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPS-- 477
             F+   Q+               VR++A   L+ L       DS L   I  +  +PS  
Sbjct: 981  EFLQYAQL----------GNADNVRLKAWECLVRLGLFRK--DSVLRFMIHELRSDPSPY 1028

Query: 478  LRGQVKLGIHAMRICQIKGG 497
             R Q+ L I  + + QI  G
Sbjct: 1029 FRKQL-LRIFGVAVGQIATG 1047


>gi|403214688|emb|CCK69188.1| hypothetical protein KNAG_0C00750 [Kazachstania naganishii CBS 8797]
          Length = 1419

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 132/482 (27%), Positives = 221/482 (45%), Gaps = 73/482 (15%)

Query: 24   GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
            G  P  +LS+  F+H   +V N  R  L+ FF +WV   G PV R+   YNK++ ++E+ 
Sbjct: 632  GELPNNSLSSAHFQHVCERV-NKNR--LESFFQQWVYGAGVPVFRITQRYNKKRMVIEMG 688

Query: 84   VLRDCT---VKPDSRTPVLSSN--------TDSENRDGDIGWPGMMSIRVHELDGM-YDH 131
            + R C     +  SR  V S           ++ NR+    + G M+IR+HE DG  Y+H
Sbjct: 689  I-RQCQNQDSESHSRKVVGSDGFFKSAMNYIETPNRNTPSFFTGSMTIRIHESDGTPYEH 747

Query: 132  PILPMAGDAWQLLEIQCHSKLAARRALK----PKK-----------GSKPDGCDDNGDAV 176
             +     D +  ++IQ ++K    R  K    PK+            SK    D++    
Sbjct: 748  IV--ELKDTYTKIDIQYNTKYRRMRGRKVVATPKEESLERSTSNLAASKVVAVDEDDIPK 805

Query: 177  AGLDMRSS----------------------MESPLSWIRADPEMEYLAEIHFNQPVQMWI 214
             G  + S+                      M     WIR D + E+L ++H NQP  M+ 
Sbjct: 806  LGTVLTSAEDCYKWKLTDIAKTTEGNEALLMNEAFEWIRIDSDFEWLCKVHINQPDYMFA 865

Query: 215  NQLEKDGDVVAQAQAIAALEAL----PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN 270
            +QL +DGDV AQ  +I   + +       S    + L     D K F+ VR+EA  AL+ 
Sbjct: 866  SQLRQDGDVEAQIDSIRYYQDVIMDSNQQSLVYSSILTRTAMDEKYFYGVRLEACKALSQ 925

Query: 271  -------TASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAM 323
                   + +E     G  HL+K ++     ++  +P+ NDF    +YF+ + IP  ++ 
Sbjct: 926  YVYKQPESPNESEFVGGSKHLIKIFQELFCFDDSNVPKSNDFSIIQKYFLQKWIPKYLSQ 985

Query: 324  VRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV-GELEFGQQSILFLSSL 382
            V++  N+ P+    F+L +L YN+N+ NP+ D F+LA L+  V   +        FL  +
Sbjct: 986  VKSETNECPKFVKNFLLDILTYNENSENPFDDSFYLANLISCVIDSILHDHDDTEFLHRI 1045

Query: 383  LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS--GFISLDQVVKLIKPFRDFNTI 440
            + ++ R    D+ MPSY  ++T    RT+    LKL+  G  + + +  +++   +   I
Sbjct: 1046 MGQLHRYENLDKWMPSYQVLVT----RTILNQKLKLNIEGLYTFENLKDVLEYSLNIEHI 1101

Query: 441  WQ 442
            W+
Sbjct: 1102 WK 1103


>gi|170044706|ref|XP_001849978.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167867753|gb|EDS31136.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1222

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 138/511 (27%), Positives = 231/511 (45%), Gaps = 66/511 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    ++ A  A+  + +S+  +         F   +  +    +  
Sbjct: 450 MLEHRIGQELLLQVFNKQLALASNAASTK-ISSGLWHQLHISTNIFTKAIFTVTGKDMAV 508

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     M   +N+++N +EL + +D                   N+ G   
Sbjct: 509 FIDQWVRTGGHARFTMTSVFNRKRNTIELEIRQDAV-----------------NQKGVRK 551

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + I++ ELDG + H +        Q+  I   S +      +  K  K   C   G
Sbjct: 552 YVGPLLIQLQELDGTFKHTL--------QIENIVVKSDITCHSKSRRNKKKKIPLC--TG 601

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+    ESP+ WIR DPEM  L  +H  QP   W  QL  + DV AQ  +IAAL
Sbjct: 602 EEV-DMDLSPMDESPVLWIRLDPEMTLLRSVHIEQPDFQWQFQLRHERDVTAQLDSIAAL 660

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFYKS 290
           E  A P     + +T+ N     + F+ VR +AA  L   A S  T W G   ++  +K 
Sbjct: 661 ERYASPATRLALTDTIEN----EQIFYEVRCKAALCLTKVANSMVTSWQGPPAMLTIFK- 715

Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
            +F  +   P   R N+F +F  YF+ + IP A+A +R A    P E + F+L L KYND
Sbjct: 716 -KFFGSFSAPHIIRQNNFTNFQHYFLQKTIPVAMAGLRTAHGICPPEVIRFLLDLFKYND 774

Query: 348 NNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSS--------LLKRIDRLLQFDR 394
           N  N YSD ++ AALV+++G      +    Q +   S         +L+ + R+L  ++
Sbjct: 775 NTKNHYSDNYYRAALVEALGNSITPVVSMVHQGMALTSENLSADSKLVLEEVTRILNLEK 834

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +PSY   ++I+C++ + +  L+  G +  +   K+ + + ++     VRV A   L+D 
Sbjct: 835 HLPSYKYTVSIACLKVIRK--LQKCGHLPTNP--KIYRSYAEYGQYIDVRVAAMECLVDY 890

Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLG 485
               +G    L   I  +E +P    + +LG
Sbjct: 891 -VKIDGRWEDLENLIDLLETDPDPMARHRLG 920


>gi|348517379|ref|XP_003446211.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
           [Oreochromis niloticus]
          Length = 1206

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 132/478 (27%), Positives = 231/478 (48%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          LST  F +  +N  G    P 
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLLSTHAFLKSISNVSGKDIGPL 507

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 508 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 545

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H +L +  ++ +  +I CHSK    +  K           
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-MLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFY 288
           I+ALE  P  +  +   L + L   + F++VR++A + LA  A+   + W G   +   +
Sbjct: 655 ISALEKFPTPASRLA--LTDILEQEQCFYKVRMQACFCLAKIANAMVSTWQGPPAMKSLF 712

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F +F  YF+ + +P A+A++R   N  P++ + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFINFQSYFLQKTMPVAMALLRDVQNLCPKDVLNFILDLIKYN 771

Query: 347 DNNGNPYSDVFWLAALVQSV-----------GELEFGQQSILFLSSLLKRIDRLLQFDRL 395
           DN  N +SD ++ A L++++            E+         +  +L+ I R L  ++L
Sbjct: 772 DNRKNKFSDNYYRADLIEALTNSLTPAISINNEVRTVDNLNADVRLILEEITRFLNMEKL 831

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PS+   +T+SC+R + Q  L+ +G I  D    L K +  +     VR+ A  AL+D
Sbjct: 832 LPSFRNTITVSCLRAIRQ--LQKNGHIPSDP--SLFKSYAQYGHFVDVRIAALEALVD 885


>gi|430812189|emb|CCJ30342.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 1052

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 143/508 (28%), Positives = 236/508 (46%), Gaps = 68/508 (13%)

Query: 2   LEKQMGSNFFRKILQNIISRAQGASPVR-TLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           L K   S    +I+  +  +A G +     LST  F   + KV    R  L  F  +W+ 
Sbjct: 372 LTKSGASWGLSRIIPKLFLQAMGGTLANNALSTSHFLKISEKVS---RVSLDVFAQQWIH 428

Query: 61  TCGCPVLRMGFSYNKRKNIVELAV--LRDCTVKPDSRTPVLSSNTDSENRDGDIG----- 113
             G P LR+   +N++K ++E+ +  ++D    P S T V     D++    +I      
Sbjct: 429 GRGYPRLRVIQRFNRKKMLIEIGIRQVQDIETPPKSLT-VTDFFNDAKFHANNISNSTFP 487

Query: 114 -WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP----D 167
            + G M+IR+HE DG  YDH +     +A+  L+IQ + K    R  + KK  K     D
Sbjct: 488 IFTGTMTIRIHEADGTPYDHVV--ELKEAFTKLDIQYNIKNKRFRRNQSKKQKKDMMDID 545

Query: 168 GC----DDNGDAVAGL---------------------DMRSSMESPLSWIRADPEMEYLA 202
           G     +DN  +V                        D     E    WIR D + E++ 
Sbjct: 546 GVGDIDNDNDFSVQCFGDILQSEQEIKNWKLEDWLKEDEEMMSEEAFEWIRLDADFEWIC 605

Query: 203 EIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVV--NTLNNFLSDSKAFWRV 260
            I  NQP  M+ +QL +D DV+AQ +AI   +   +   N +  + L   L DS+ ++ +
Sbjct: 606 IIWVNQPDYMYFSQLRQDRDVIAQYEAI---QYFSNSKGNCIYSSILLRTLMDSRYYYGI 662

Query: 261 RIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHA 320
           R EAA AL   A+E+ DW G  HL K +++     N  + + N+F + SEY+V   IP A
Sbjct: 663 RCEAALALVKCATEDLDWIGWYHLFKAFQTNYCFPNSSISKSNNFSNLSEYYVQCIIPIA 722

Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE--LEFGQQSILF 378
           ++ +R     S  +  +F++ LL+YNDN+ N YSD +++  L+ S+    +   ++S  F
Sbjct: 723 MSKIRDKKGNSLIQIKKFLIDLLRYNDNSSNEYSDCYYICTLLNSLASTLITENKESFEF 782

Query: 379 ------------LSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQ 426
                       + + L  I+R  + D+L+PS+  I++I+ ++   +  L  +G I L+ 
Sbjct: 783 NFAMNDPENKNIIDTALSEIERFQRIDQLLPSFQNIISITSLK--IKYLLSKAGIIPLN- 839

Query: 427 VVKLIKPFRDFNTIWQVRVEASRALLDL 454
             + I P+        VR+ A   LL L
Sbjct: 840 -FQEILPYTHSANYDLVRLTAFDVLLKL 866


>gi|403283587|ref|XP_003933197.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Saimiri
           boliviensis boliviensis]
          Length = 1153

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 427 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 486

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 487 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 524

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 525 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 575

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 576 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 633

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 634 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 691

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 692 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 750

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 751 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 810

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 811 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 864


>gi|390605011|gb|EIN14402.1| hypothetical protein PUNSTDRAFT_80735 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 1886

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 129/449 (28%), Positives = 201/449 (44%), Gaps = 77/449 (17%)

Query: 24  GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
           G  P   LST  F     KV  L+    + F  +W+   GCP      S+N++K  VE+ 
Sbjct: 512 GDLPSNALSTHYFLRMCRKVSGLDP---RSFAEQWIYGSGCPSFGFSASFNRKKMAVEIT 568

Query: 84  VLRDCTVKPDSRTPVLSSNTDSENRDGDIGWP---GMMSIRVHELDGMYDHPILPMAGDA 140
           + ++    P  R        D   R      P   G M++R+HE DG     +L +  + 
Sbjct: 569 MRQEA---PAYRL----YEHDEVKRALWKPVPFFEGQMTVRIHEADGTPYEHVLDIRSE- 620

Query: 141 WQLLEIQCHSKLA-----ARRALKPKKGSKPDGCDDNG--------DAVAGLDMRSSME- 186
           ++  E+  ++K        +R L  +  ++     D          D   GLD+    E 
Sbjct: 621 FKRFEVPFNTKYKRVRRNTKRYLARQAAAQAAAEGDAEAAEAIGMIDMSFGLDIWEKEEE 680

Query: 187 -------------------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQA 227
                              +   WIR D + E++A I F QP  MW++QL++D DVVAQ 
Sbjct: 681 RENWKVADWTEEDEQIMSGATYEWIRMDADFEWIAAIAFEQPDFMWVSQLQRDRDVVAQL 740

Query: 228 QAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKF 287
           +A+ AL   P  +  V +TL      S  F+R+R EA  AL + A  + D+ GL HL K 
Sbjct: 741 EAVYALSRQP--TAIVSSTLTKTALVSNYFFRIRCEAIIALVSCAIRKLDYLGLFHLFKL 798

Query: 288 YKSRRFD---------ENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEF 338
           +    +D          +  +PRPNDF D SEYFV +++  A++ VR  + K+P     F
Sbjct: 799 FLRYCYDPEDPHQDLFTHTYVPRPNDFSDLSEYFVRKSLLTAISRVRFENGKTPAIIRRF 858

Query: 339 VLQLLKYNDNNGNPYSDVFWLAALVQSVG-------------------ELEFGQQSILFL 379
           ++  LKYNDN  NPYSD F++ +++ ++G                   + E     ++ L
Sbjct: 859 LVDQLKYNDNTSNPYSDAFYICSIISALGCAIVSLAPPERGELLPMESKTEQSSDDVVLL 918

Query: 380 SSLLKRIDRLLQFDRLMPSYNGILTISCI 408
              L  +DR    DRL+PS +  +TI+ I
Sbjct: 919 RQALAEVDRYRSMDRLIPSPHNAVTIAAI 947


>gi|393240985|gb|EJD48509.1| hypothetical protein AURDEDRAFT_112948 [Auricularia delicata
           TFB-10046 SS5]
          Length = 1830

 Score =  163 bits (412), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 125/479 (26%), Positives = 217/479 (45%), Gaps = 86/479 (17%)

Query: 1   MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +L++++G +       R + +  +S   G  P   LST  F     KV   +    + F 
Sbjct: 471 ILDRRLGKSGTSLGLSRVLPKLFLSAISGEMPNNALSTNVFFKMCRKVCGFDS---RGFA 527

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
            +W+   GCP LR   ++N+++  VE+ +           +P    N D   R   +   
Sbjct: 528 EQWIYASGCPRLRFAANFNRKRMAVEITM--------QQHSPAWEKNKDDPVRLAQLKPV 579

Query: 114 --WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA-ARRALK------PKKG 163
             + G M++R+HE DG  Y+H +L +   A++  ++  ++K    RR  K          
Sbjct: 580 HIFEGHMTVRIHEADGTPYEH-VLDIQT-AYKKYDVPFNTKYKRVRRNTKRFIARQQAAA 637

Query: 164 SKPDGCDDNGDAVAGLDMRSSME--------------------------SPLSWIRADPE 197
               G  +  +A+  +D+   ++                          +   WIR D +
Sbjct: 638 LAAQGDTEAAEAIGMIDVGFGLDVWEDAKERENWKVADWTEEEEQVMAGATYEWIRMDAD 697

Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
            E++A I F QP  MW++QL++D DVVAQ +AI AL   P  +  V  T    +  +  F
Sbjct: 698 FEWIATIAFEQPDFMWVSQLQRDRDVVAQLEAIRALSRQP--TAIVSGTFTKTVLVTNYF 755

Query: 258 WRVRIEAAYALANTASEETDWAGLLHLVKFY---------KSRRFDENIGLPRPNDFRDF 308
           +R+R+EAA  L + A+ +  + GL HL K +         K +    +  +P  NDF DF
Sbjct: 756 YRIRVEAALGLVSCATNKLGFIGLFHLFKIFLRYCHTPTNKDQDLFTHTYVPLQNDFSDF 815

Query: 309 SEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE 368
           SEYFV +A+  AV+ VR  + K+P     F++  L++NDN  N YSD  +++ ++ ++G 
Sbjct: 816 SEYFVRKAVLTAVSKVRFENGKTPPIVRRFLIDQLRFNDNTNNAYSDARYISCIITALGH 875

Query: 369 L-------EFG------------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
                   E+G            Q+ +  L      ++R    DRL+PS + ++T+S +
Sbjct: 876 ACVSTAPPEYGELMQTQGRTDQNQEDVELLDQATAEVERYRTMDRLVPSSHNVVTVSVL 934


>gi|431901704|gb|ELK08581.1| Transcription initiation factor TFIID subunit 2 [Pteropus alecto]
          Length = 1252

 Score =  162 bits (411), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 460 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 519

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 520 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 557

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 558 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 608

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 609 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 666

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 667 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 724

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 725 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 783

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 784 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 843

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 844 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 897


>gi|392570887|gb|EIW64059.1| hypothetical protein TRAVEDRAFT_55102 [Trametes versicolor
           FP-101664 SS1]
          Length = 1844

 Score =  162 bits (410), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 97/285 (34%), Positives = 152/285 (53%), Gaps = 34/285 (11%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR D + E++A I F+Q   MW++QL++D DVVAQ +AIA L   P+    + +TL   
Sbjct: 702 WIRIDADFEWIAAIAFDQKDYMWVSQLQRDRDVVAQYEAIAVLSKSPNAI--IASTLTKT 759

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFY-------KSRRFD--ENIGLPR 301
           +  S  F+R+R EAA AL N A+++ DW GL HL K +       + ++ D   +  +PR
Sbjct: 760 ILVSNYFYRIRCEAAGALVNCATQKLDWLGLFHLFKLFLRYCYEPEDQKADLFSHTYVPR 819

Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
           PNDF D +EYFV +AI  ++A VR    K+P     F++  L+YNDN  NPY+D  +++ 
Sbjct: 820 PNDFSDIAEYFVRKAILRSIAKVRFETGKTPTVIRRFLVDQLRYNDNTTNPYADALYIST 879

Query: 362 LVQSVGEL-------EFGQ------------QSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
           ++ ++G         E G+            Q    L   L  +DR    DRL+P+++ +
Sbjct: 880 VISALGYAIVSAVPPERGEFVSSENGPVPDLQDAELLKLALSEVDRYRSMDRLIPTFHNV 939

Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEA 447
           +T++ I     + L L+G +  D   +L  P        QVR+ A
Sbjct: 940 VTVAVIE--FHMLLTLAGLVPHDS--RLFFPLTREGNYTQVRIAA 980



 Score = 42.0 bits (97), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 48/106 (45%), Gaps = 16/106 (15%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           +ST  F     KV  ++    + F  +W+   GCP      S+N++K  VE+ + +D   
Sbjct: 517 VSTHAFLRMCRKVSGIDP---RSFAEQWIYGSGCPTFGFSASFNRKKMAVEITMRQDA-- 571

Query: 91  KPDSRTPVLSSNTDSENRDGDIG----WPGMMSIRVHELDGM-YDH 131
                 P   ++  +E     +     + G M+IR+HE DG  Y+H
Sbjct: 572 ------PAYKAHETNEIAKLMMKPVPFFEGQMTIRIHEADGTPYEH 611


>gi|395818007|ref|XP_003782430.1| PREDICTED: transcription initiation factor TFIID subunit 2
           [Otolemur garnettii]
          Length = 1199

 Score =  162 bits (410), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|2827282|gb|AAC68502.1| TATA binding protein associated factor [Homo sapiens]
          Length = 1189

 Score =  162 bits (410), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 449 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 508

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 509 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 546

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 547 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 597

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 598 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 655

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 656 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 713

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 714 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 772

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 773 DNGKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 832

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 833 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 886


>gi|351715354|gb|EHB18273.1| Transcription initiation factor TFIID subunit 2, partial
           [Heterocephalus glaber]
          Length = 1172

 Score =  162 bits (410), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 136/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 431 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 490

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 491 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 528

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 529 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLL------- 579

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+  A +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 580 -NGEE-ADMDLSAMDADSPLLWIRIDPDMAVLRKVEFEQADFMWQYQLRYERDVVAQQES 637

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 638 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 695

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + R N+F  F  YF+ + +P A+A++R   N  P+E + FVL L+KYN
Sbjct: 696 -TRMFCCKTCPNIVRTNNFLSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFVLDLIKYN 754

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 755 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 814

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 815 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 868


>gi|4507347|ref|NP_003175.1| transcription initiation factor TFIID subunit 2 [Homo sapiens]
 gi|3046692|gb|AAC13540.1| TBP-associated factor TAFII150 [Homo sapiens]
 gi|148922381|gb|AAI46380.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor, 150kDa [synthetic construct]
          Length = 1199

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|383413613|gb|AFH30020.1| transcription initiation factor TFIID subunit 2 [Macaca mulatta]
          Length = 1199

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|45199193|ref|NP_986222.1| AFR674Cp [Ashbya gossypii ATCC 10895]
 gi|44985333|gb|AAS54046.1| AFR674Cp [Ashbya gossypii ATCC 10895]
 gi|374109455|gb|AEY98361.1| FAFR674Cp [Ashbya gossypii FDAG1]
          Length = 1385

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 126/453 (27%), Positives = 214/453 (47%), Gaps = 74/453 (16%)

Query: 30   TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
            +LST  F++   +V    R  L++FF +WV   G PV R+   +NK++ +VE+ + R   
Sbjct: 583  SLSTAHFQYVCERVN---RSKLEQFFAQWVFGSGVPVFRITQRFNKKRMVVEMGI-RQVQ 638

Query: 90   VKPDSRTPVLSSNTDSENRDGDIGWP---------GMMSIRVHELDGM-YDHPILPMAGD 139
            ++   +  ++       +    +  P         G M+IR+HE DG  Y+H +     D
Sbjct: 639  LQELGQGHIIGKRGFHASAMDYLCHPDKQLTQIFTGSMTIRIHEADGTPYEHIV--ELKD 696

Query: 140  AWQLLEIQCHSKLAARRALKP-KKGSKPDGC------------------DDNGD------ 174
             +  L+IQ ++K    +  +   K  KPDG                   +DN D      
Sbjct: 697  VFTKLDIQYNTKYKRLKRRRKINKPIKPDGKEASIVEPLPKDEDMKILDEDNDDIVLVNC 756

Query: 175  -------------------AVAGLDMRSSMES-PLSWIRADPEMEYLAEIHFNQPVQMWI 214
                               A+ G D  S  ++    WIR D + E++ +++ NQP  M+ 
Sbjct: 757  LGDTLLSREDCQHWNLTDPAIIGEDDESQQQNEAFEWIRVDADFEWICKVYINQPDYMFA 816

Query: 215  NQLEKDGDVVAQAQAIAALEALPHLSFNVVNT------LNNFLSDSKAFWRVRIEAAYAL 268
            +QL++D DV AQ ++I   E +  ++ + VN+      L   + D + F+ VR+EA  AL
Sbjct: 817  SQLQQDRDVEAQIESIRFFEDV--VASSAVNSQVYSSILTRTVMDDRYFYGVRLEACNAL 874

Query: 269  ANTASEETD----WAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMV 324
            +    +E +      G  HL+  ++     EN  + + NDF DFS YF+  AIP  +A  
Sbjct: 875  SKFILKEVEPNQFSGGARHLIAIFRYYFCYENSNIQKNNDFTDFSRYFIQCAIPRFLANT 934

Query: 325  RAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQ-QSILFLSSLL 383
            R    K P     F+L+LL+YNDN+GNPYSDV ++ A++ SV     G  +   +++ LL
Sbjct: 935  RDTQGKCPTFVKRFLLELLRYNDNSGNPYSDVRYVVAIIDSVVACTLGDPEDKEYINDLL 994

Query: 384  KRIDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
              + R    DR +PSY  +++ + ++    +A+
Sbjct: 995  NELKRFENLDRWLPSYQLLVSKAILKQYLLLAV 1027


>gi|397504097|ref|XP_003822645.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Pan
           paniscus]
 gi|410267202|gb|JAA21567.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor, 150kDa [Pan troglodytes]
 gi|410307198|gb|JAA32199.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor, 150kDa [Pan troglodytes]
          Length = 1199

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|332214173|ref|XP_003256205.1| PREDICTED: transcription initiation factor TFIID subunit 2
           [Nomascus leucogenys]
          Length = 1199

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|297683539|ref|XP_002819433.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Pongo
           abelii]
          Length = 1199

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|388453833|ref|NP_001253309.1| transcription initiation factor TFIID subunit 2 [Macaca mulatta]
 gi|380788081|gb|AFE65916.1| transcription initiation factor TFIID subunit 2 [Macaca mulatta]
 gi|384939964|gb|AFI33587.1| transcription initiation factor TFIID subunit 2 [Macaca mulatta]
          Length = 1199

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|114621469|ref|XP_519923.2| PREDICTED: transcription initiation factor TFIID subunit 2 [Pan
           troglodytes]
 gi|410208974|gb|JAA01706.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor, 150kDa [Pan troglodytes]
 gi|410354347|gb|JAA43777.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor, 150kDa [Pan troglodytes]
          Length = 1199

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|145559533|sp|Q6P1X5.3|TAF2_HUMAN RecName: Full=Transcription initiation factor TFIID subunit 2;
           AltName: Full=150 kDa cofactor of initiator function;
           AltName: Full=RNA polymerase II TBP-associated factor
           subunit B; AltName: Full=TBP-associated factor 150 kDa;
           AltName: Full=Transcription initiation factor TFIID 150
           kDa subunit; Short=TAF(II)150; Short=TAFII-150;
           Short=TAFII150
 gi|189067574|dbj|BAG38179.1| unnamed protein product [Homo sapiens]
          Length = 1199

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|2739087|gb|AAC02966.1| cofactor of initiator function [Homo sapiens]
          Length = 1199

 Score =  162 bits (409), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|410987700|ref|XP_004000133.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
           TFIID subunit 2 [Felis catus]
          Length = 1207

 Score =  162 bits (409), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 467 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 526

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 527 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 564

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 565 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 615

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 616 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 673

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 674 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 731

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 732 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 790

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 791 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 850

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 851 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 904


>gi|296227297|ref|XP_002759311.1| PREDICTED: transcription initiation factor TFIID subunit 2
           [Callithrix jacchus]
          Length = 1199

 Score =  162 bits (409), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|73974557|ref|XP_532318.2| PREDICTED: transcription initiation factor TFIID subunit 2 [Canis
           lupus familiaris]
          Length = 1199

 Score =  162 bits (409), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|291388465|ref|XP_002710796.1| PREDICTED: TBP-associated factor 2 [Oryctolagus cuniculus]
          Length = 1198

 Score =  162 bits (409), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|292622006|ref|XP_002664849.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Danio
           rerio]
          Length = 1191

 Score =  161 bits (408), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 137/478 (28%), Positives = 224/478 (46%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G    P 
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLVSTSGFLKSISNVSGKDIGPL 507

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 508 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 545

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +      L + L   + F++VR+ A + LA  A S  + W G   +   +
Sbjct: 655 ILALEKFP--TPPSRRALTDILEQDQCFYKVRMHACFCLAKIANSMVSTWTGPPAMKSLF 712

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + IP A+A +R   N  PRE + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFISFQSYFLQKTIPVAMAQLRDVQNLCPREVLSFILDLIKYN 771

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  S+         +  + SL       L+ I R L  ++L
Sbjct: 772 DNRKNKFSDNYYRAELIDALTNSLTPAISINNEVRTVDSLNADVRLILEEITRFLNMEKL 831

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G I  D    L K + ++     VR+ A  A++D
Sbjct: 832 LPSYRNTITVSCLRAIRM--LQKNGHIPSDPT--LFKSYAEYGHFVDVRIAALEAVID 885


>gi|79154077|gb|AAI07958.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor [Danio rerio]
          Length = 1191

 Score =  161 bits (408), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 137/478 (28%), Positives = 224/478 (46%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G    P 
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLVSTSGFLKSISNVSGKDIGPL 507

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 508 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 545

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +      L + L   + F++VR+ A + LA  A S  + W G   +   +
Sbjct: 655 ILALEKFP--TPPSRRALTDILEQDQCFYKVRMHACFCLAKIANSMVSTWTGPPAMKSLF 712

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + IP A+A +R   N  PRE + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFISFQSYFLQKTIPVAMAQLRDVQNLCPREVLSFILDLIKYN 771

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  S+         +  + SL       L+ I R L  ++L
Sbjct: 772 DNRKNKFSDNYYRAELIDALTNSLTPAISINNEVRTVDSLNADVRLILEEITRFLNMEKL 831

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G I  D    L K + ++     VR+ A  A++D
Sbjct: 832 LPSYRNTITVSCLRAIRM--LQKNGHIPSDPT--LFKSYAEYGHFVDVRIAALEAVID 885


>gi|426360591|ref|XP_004047522.1| PREDICTED: transcription initiation factor TFIID subunit 2-like,
           partial [Gorilla gorilla gorilla]
          Length = 959

 Score =  161 bits (408), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 135/478 (28%), Positives = 229/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T      +P            G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRL--ALTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|116256083|sp|Q32PW3.2|TAF2_DANRE RecName: Full=Transcription initiation factor TFIID subunit 2;
           AltName: Full=TBP-associated factor 150 kDa; AltName:
           Full=Transcription initiation factor TFIID 150 kDa
           subunit; Short=TAF(II)150; Short=TAFII-150;
           Short=TAFII150
          Length = 1191

 Score =  161 bits (408), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 137/478 (28%), Positives = 224/478 (46%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G    P 
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLVSTSGFLKSISNVSGKDIGPL 507

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 508 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 545

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +      L + L   + F++VR+ A + LA  A S  + W G   +   +
Sbjct: 655 ILALEKFP--TPPSRRALTDILEQDQCFYKVRMHACFCLAKIANSMVSTWTGPPAMKSLF 712

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + IP A+A +R   N  PRE + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFISFQSYFLQKTIPVAMAQLRDVQNLCPREVLSFILDLIKYN 771

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  S+         +  + SL       L+ I R L  ++L
Sbjct: 772 DNRKNKFSDNYYRAELIDALTNSLTPAISINNEVRTVDSLNADVRLILEEITRFLNMEKL 831

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G I  D    L K + ++     VR+ A  A++D
Sbjct: 832 LPSYRNTITVSCLRAIRM--LQKNGHIPSDPT--LFKSYAEYGHFVDVRIAALEAVID 885


>gi|119612397|gb|EAW91991.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor, 150kDa, isoform CRA_a [Homo sapiens]
 gi|119612398|gb|EAW91992.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor, 150kDa, isoform CRA_a [Homo sapiens]
 gi|119612399|gb|EAW91993.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor, 150kDa, isoform CRA_a [Homo sapiens]
          Length = 1030

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 290 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 349

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 350 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 387

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 388 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 438

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 439 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 496

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 497 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 554

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 555 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 613

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 614 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 673

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 674 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 727


>gi|82081311|sp|Q5ZIT8.1|TAF2_CHICK RecName: Full=Transcription initiation factor TFIID subunit 2;
           AltName: Full=TBP-associated factor 150 kDa; AltName:
           Full=Transcription initiation factor TFIID 150 kDa
           subunit; Short=TAF(II)150; Short=TAFII-150;
           Short=TAFII150
 gi|53134697|emb|CAG32355.1| hypothetical protein RCJMB04_23j21 [Gallus gallus]
          Length = 1168

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 136/478 (28%), Positives = 230/478 (48%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 449 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 508

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T      +P            G
Sbjct: 509 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 546

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 547 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 597

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +A
Sbjct: 598 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQSDFMWQYQLRYERDVVAQEEA 655

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 656 ILALEKFPTPASRL--ALTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 713

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F +F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 714 -TRMFCCKTCPNIVKTNNFMNFQSYFLQKTMPVAMALLRDVHNLCPKEVLMFILDLIKYN 772

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 773 DNRKNKFSDNYYRAELIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 832

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC++ +    L+ +G +  D    L K + ++     VR+ A  A++D
Sbjct: 833 LPSYRHTITVSCLKAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDVRIAALEAVVD 886


>gi|417406103|gb|JAA49727.1| Putative tata binding protein associated factor [Desmodus rotundus]
          Length = 1199

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 228/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYTSL------------------G 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I+ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ISALEKFPTPASRLA--LTDILEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|281347100|gb|EFB22684.1| hypothetical protein PANDA_019775 [Ailuropoda melanoleuca]
          Length = 1100

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 360 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 419

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 420 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 457

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 458 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 508

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 509 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 566

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 567 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 624

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 625 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 683

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 684 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 743

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 744 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 797


>gi|301788308|ref|XP_002929574.1| PREDICTED: transcription initiation factor TFIID subunit 2-like,
           partial [Ailuropoda melanoleuca]
          Length = 1099

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 359 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 418

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 419 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 456

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 457 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 507

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 508 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 565

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 566 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 623

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 624 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 682

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 683 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 742

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 743 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 796


>gi|350582943|ref|XP_001924394.4| PREDICTED: transcription initiation factor TFIID subunit 2 [Sus
           scrofa]
          Length = 1105

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 365 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 424

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 425 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 462

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 463 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 513

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 514 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 571

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 572 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 629

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 630 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 688

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 689 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 748

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 749 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFKSYAEYGHFVDIRIAALEAVVD 802


>gi|363731074|ref|XP_003640904.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Gallus
           gallus]
          Length = 1200

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 136/478 (28%), Positives = 230/478 (48%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 449 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 508

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T      +P            G
Sbjct: 509 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 546

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 547 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 597

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +A
Sbjct: 598 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQSDFMWQYQLRYERDVVAQEEA 655

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 656 ILALEKFPTPASRL--ALTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 713

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F +F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 714 -TRMFCCKTCPNIVKTNNFMNFQSYFLQKTMPVAMALLRDVHNLCPKEVLMFILDLIKYN 772

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 773 DNRKNKFSDNYYRAELIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 832

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC++ +    L+ +G +  D    L K + ++     VR+ A  A++D
Sbjct: 833 LPSYRHTITVSCLKAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDVRIAALEAVVD 886


>gi|334326275|ref|XP_003340732.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
           TFIID subunit 2-like [Monodelphis domestica]
          Length = 1200

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQSDFMWQYQLRYERDVVAQEES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|224046665|ref|XP_002199309.1| PREDICTED: transcription initiation factor TFIID subunit 2
           [Taeniopygia guttata]
          Length = 1200

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 136/478 (28%), Positives = 230/478 (48%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 449 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 508

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T      +P            G
Sbjct: 509 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 546

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 547 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 597

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +A
Sbjct: 598 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQSDFMWQYQLRYERDVVAQEEA 655

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 656 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 713

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F +F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 714 -TRMFCCKTCPNIVKTNNFMNFQSYFLQKTMPVAMALLRDVHNLCPKEVLMFILDLIKYN 772

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 773 DNRKNKFSDNYYRAELIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 832

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC++ +    L+ +G +  D    L K + ++     VR+ A  A++D
Sbjct: 833 LPSYRHTITVSCLKAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDVRIAALEAVVD 886


>gi|410904835|ref|XP_003965897.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
           TFIID subunit 2-like [Takifugu rubripes]
          Length = 1206

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 132/478 (27%), Positives = 237/478 (49%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G    P 
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSQMWSQMLVSTHGFLKSISNVSGKDIGPL 507

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV           F++N+++N++EL + +D T      +P            G
Sbjct: 508 IKQ----WVDQSAVVKFFGSFAFNRKRNVLELEIRQDYT------SP------------G 545

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H +L +  ++ +  +I CHSK    +  K           
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-MLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFY 288
           I+ALE  P  +  +   L + L   + F++VR++A + L+  A+   + W G   +   +
Sbjct: 655 ISALEKFPTPASRL--ALTDILEQDQCFYKVRMQACFCLSKIANAMVSTWQGPPAMKSLF 712

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F +F  YF+ + +P A+A++R   N  P++ + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFINFQSYFLQKTMPVAMALLRDVQNLCPKDVLNFILDLIKYN 771

Query: 347 DNNGNPYSDVFWLAALVQSVG-------ELEFGQQSILFLSS----LLKRIDRLLQFDRL 395
           DN  N +SD ++ A L++++         +    +++  L+S    +L+ I R L  ++L
Sbjct: 772 DNRKNKFSDNYYRAELIRALTNSLTXAISISNEVRTVDNLNSDVRLILEEITRYLNMEKL 831

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R + +  L+ +G I  D    L K + ++     VRV A  AL+D
Sbjct: 832 LPSYRNTITVSCLRAIRE--LQKNGHIPSD--ASLFKSYAEYGHFVDVRVAALEALVD 885


>gi|158294667|ref|XP_001688719.1| AGAP005731-PA [Anopheles gambiae str. PEST]
 gi|157015672|gb|EDO63725.1| AGAP005731-PA [Anopheles gambiae str. PEST]
          Length = 1272

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 136/511 (26%), Positives = 235/511 (45%), Gaps = 66/511 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    ++ A  A+  + + +  + H       F+  +  +    +  
Sbjct: 460 MLEHRIGQELLLQVFNKQLALASNAAGTK-IGSGLWGHMLISTNVFSKAIFTVTGKDMSV 518

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     M   +N ++N +EL + +DC                  N+ G   
Sbjct: 519 FIDQWVRTGGHAKFTMTSVFNSKRNTIELEIRQDCV-----------------NQRGVRK 561

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + I++ ELDG + H +        Q+  I   S +      +  K  K   C   G
Sbjct: 562 YVGPLLIQLQELDGTFKHTL--------QIENIIVKSDITCHSKSRRNKKKKIPLC--TG 611

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+    ESP+ WIR DPEM  L  ++  QP   W  QL  + DV AQ +AI AL
Sbjct: 612 EEV-DMDLSPMAESPVLWIRLDPEMTLLRSVNIEQPDFQWQFQLRHERDVTAQLEAIKAL 670

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKS 290
           E  A P     + +T+ +     + F+ VR +AA  L   A+   T W G   ++  +K 
Sbjct: 671 EKYATPQTRLALTDTIES----ERVFYEVRCQAAICLTKVANAMVTSWQGPPAMLTIFK- 725

Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
            +F  +   P   R N+F +F  YF+ + IP A+A +R A    P E + F+L L KYND
Sbjct: 726 -KFFGSFSAPHIIRQNNFTNFQHYFLQKTIPVAMAGLRTAHGICPPEVIRFLLDLFKYND 784

Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILF----LSS---------LLKRIDRLLQFDR 394
           N+ N YSD ++ AALV+++G       S++     LSS         +L+ + R+L  ++
Sbjct: 785 NSKNHYSDNYYRAALVEALGNTITPVISVVHQGRALSSENLSQDAKLVLEEVTRILNLEK 844

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +PSY   ++I+C++ + +  L+  G +  +   K+ + +  +     VR+ A   L+D 
Sbjct: 845 HLPSYKYTVSIACLKVIRK--LQKCGHLPTNP--KIYRSYAAYGQYIDVRLAAMECLVDY 900

Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLG 485
               +G    L   ++ +E +P    + +LG
Sbjct: 901 -VKIDGRWEDLEHLMELLETDPDPMARHQLG 930


>gi|254584730|ref|XP_002497933.1| ZYRO0F16830p [Zygosaccharomyces rouxii]
 gi|186929049|emb|CAQ43374.1| Transcription initiation factor TFIID subunit 2 [Zygosaccharomyces
            rouxii]
 gi|238940826|emb|CAR29000.1| ZYRO0F16830p [Zygosaccharomyces rouxii]
          Length = 1478

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 123/445 (27%), Positives = 206/445 (46%), Gaps = 64/445 (14%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            +IL  I  +A  G  P  +LS+  F++   +V    +  L+ FF +W+   G P+ R+  
Sbjct: 628  RILPKIFLQAMSGDLPNNSLSSSHFQYVCERVN---KNKLESFFQQWIYGSGVPIFRVTQ 684

Query: 72   SYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWP---------GMMSIRV 122
             +NK++ +VE+ + R C  +  S+  ++       +    +  P         G M+IR+
Sbjct: 685  RFNKKRMMVEMGI-RQCQDQELSQDKIVGKEGFCSSALNHLQHPVRAMTPFFTGSMTIRI 743

Query: 123  HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKG------------------ 163
            HE DG  Y+H +     D +  L+IQ ++K    R  K   G                  
Sbjct: 744  HEADGTPYEHIV--ELKDVFTKLDIQYNTKYRRLRNRKINVGRPSNKAASPDLEEDLEFQ 801

Query: 164  ----------------SKPDGCDD--NGDAVAGLDMRSSME--SPLSWIRADPEMEYLAE 203
                            S P  C D    DA+   D   +        WIR D + E++ +
Sbjct: 802  EEKIDNSAIQRLGTVLSSPKDCKDWDLTDAIVTTDGNEAQRQNEAFEWIRIDSDFEWICK 861

Query: 204  IHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLS----DSKAFWR 259
            I+ NQP  M+ +QL++D DV AQ ++I   E +   S N     ++ L+    D + F+ 
Sbjct: 862  IYVNQPDYMFTSQLQQDSDVEAQVESIKYFEDVIIHSPNNSQIYSSILTRTAMDKRYFYG 921

Query: 260  VRIEAAYALANTASEETDWA----GLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
            VR+EA  ALA    ++ D A    G +HL+K +++    E   +P  NDF  F EYF+ +
Sbjct: 922  VRLEACKALARYTMKDHDPAHFTGGAMHLIKIFQNLFCYEGSSIPLNNDFTSFQEYFLQK 981

Query: 316  AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG-QQ 374
             IP  +A+VR   N+ P    +FVL +L YN+N+ N Y D  +L +L+++  +     ++
Sbjct: 982  NIPKFLALVRNERNECPHFVKQFVLDILTYNENSENLYDDTHYLCSLIEAAVDCCISNKK 1041

Query: 375  SILFLSSLLKRIDRLLQFDRLMPSY 399
               FL+ +L ++ R    D  MP+Y
Sbjct: 1042 DQEFLNKVLDQLHRYANLDDWMPTY 1066


>gi|118786940|ref|XP_315746.3| AGAP005731-PB [Anopheles gambiae str. PEST]
 gi|116126564|gb|EAA10740.3| AGAP005731-PB [Anopheles gambiae str. PEST]
          Length = 1233

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 136/511 (26%), Positives = 235/511 (45%), Gaps = 66/511 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    ++ A  A+  + + +  + H       F+  +  +    +  
Sbjct: 460 MLEHRIGQELLLQVFNKQLALASNAAGTK-IGSGLWGHMLISTNVFSKAIFTVTGKDMSV 518

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     M   +N ++N +EL + +DC                  N+ G   
Sbjct: 519 FIDQWVRTGGHAKFTMTSVFNSKRNTIELEIRQDCV-----------------NQRGVRK 561

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + I++ ELDG + H +        Q+  I   S +      +  K  K   C   G
Sbjct: 562 YVGPLLIQLQELDGTFKHTL--------QIENIIVKSDITCHSKSRRNKKKKIPLC--TG 611

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+    ESP+ WIR DPEM  L  ++  QP   W  QL  + DV AQ +AI AL
Sbjct: 612 EEV-DMDLSPMAESPVLWIRLDPEMTLLRSVNIEQPDFQWQFQLRHERDVTAQLEAIKAL 670

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKS 290
           E  A P     + +T+ +     + F+ VR +AA  L   A+   T W G   ++  +K 
Sbjct: 671 EKYATPQTRLALTDTIES----ERVFYEVRCQAAICLTKVANAMVTSWQGPPAMLTIFK- 725

Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
            +F  +   P   R N+F +F  YF+ + IP A+A +R A    P E + F+L L KYND
Sbjct: 726 -KFFGSFSAPHIIRQNNFTNFQHYFLQKTIPVAMAGLRTAHGICPPEVIRFLLDLFKYND 784

Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILF----LSS---------LLKRIDRLLQFDR 394
           N+ N YSD ++ AALV+++G       S++     LSS         +L+ + R+L  ++
Sbjct: 785 NSKNHYSDNYYRAALVEALGNTITPVISVVHQGRALSSENLSQDAKLVLEEVTRILNLEK 844

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +PSY   ++I+C++ + +  L+  G +  +   K+ + +  +     VR+ A   L+D 
Sbjct: 845 HLPSYKYTVSIACLKVIRK--LQKCGHLPTNP--KIYRSYAAYGQYIDVRLAAMECLVDY 900

Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLG 485
               +G    L   ++ +E +P    + +LG
Sbjct: 901 -VKIDGRWEDLEHLMELLETDPDPMARHQLG 930


>gi|209954626|ref|NP_579853.1| TAF2 RNA polymerase II, TATA box binding protein-associated factor,
           150 kD [Rattus norvegicus]
          Length = 1200

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 133/478 (27%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  +L  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYELRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFVDIRIAALEAVVD 896


>gi|124486849|ref|NP_001074757.1| transcription initiation factor TFIID subunit 2 [Mus musculus]
 gi|223462133|gb|AAI47639.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor [Mus musculus]
 gi|223462641|gb|AAI51080.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor [Mus musculus]
          Length = 1198

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 133/478 (27%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  +L  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYELRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFVDIRIAALEAVVD 896


>gi|389751631|gb|EIM92704.1| hypothetical protein STEHIDRAFT_89926 [Stereum hirsutum FP-91666
           SS1]
          Length = 1835

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 141/488 (28%), Positives = 219/488 (44%), Gaps = 85/488 (17%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDC-T 89
           LST  F     KV  ++   L+ F  +W+   GCP      S+N++K  VE+ + +D   
Sbjct: 518 LSTHTFLRTCRKVSGVD---LRSFAEQWIYGSGCPSFGFSASFNRKKMAVEITMRQDAPA 574

Query: 90  VKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC 148
            K     PV+ +            + G M+IR+HE DG  Y+H +L +    ++  E+  
Sbjct: 575 YKELENNPVVKALLKPVP-----FFEGQMTIRIHEADGTPYEH-VLDIRA-PFKRYEVPF 627

Query: 149 HSKLA-ARRALK---------------PKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWI 192
           ++K    RR  K                 + ++  G  D G    GLD+  +     +W 
Sbjct: 628 NTKYKRVRRNTKRYLARQAAAQAAAEGDAEAAEAMGMVDMG---FGLDIWENERERENWK 684

Query: 193 RAD--------------------PEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAA 232
            AD                     + E++A + F+QP  MW++QL++D DVVAQ +A+ A
Sbjct: 685 VADWSEDDEQVMSGATYEWIRMDADFEWIAAVAFDQPDYMWVSQLQRDRDVVAQLEAVNA 744

Query: 233 LEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRR 292
           L  +P  S  V + L   +  S  F+RVR EAA AL   A    D+ GL HL K +    
Sbjct: 745 LSKMP--SAIVSSQLTKTVLVSNYFFRVRCEAALALVTCAIRRLDFLGLFHLFKLFLRYC 802

Query: 293 FD---------ENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
           +D          +  +PRPNDF DF++YFV  A+  A++ VR  + K P    +F+   L
Sbjct: 803 YDPEDPQQDLFTHTYVPRPNDFSDFADYFVRRALIKAISQVRFENGKCPSVVRKFLTDQL 862

Query: 344 KYNDNNGNPYSDVFWLAALVQSV------------GEL-------EFGQQSILFLSSLLK 384
           KYNDN  NPY+D F++  L+ S+            GEL       E   +  + L   L 
Sbjct: 863 KYNDNTANPYTDAFYICTLISSLACACVSTSPPERGELFPMEAKNEQSSEDAMILKDALA 922

Query: 385 RIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVR 444
             DR    DRL+ S + ++TI+ +    ++ L ++  I  D   ++  P        QVR
Sbjct: 923 EADRYRNMDRLIASPHNVVTIAALE--FRLILSVANLIPNDP--RVFFPLTREGNYVQVR 978

Query: 445 VEASRALL 452
           + A  AL 
Sbjct: 979 IAAFDALF 986


>gi|169603133|ref|XP_001794988.1| hypothetical protein SNOG_04573 [Phaeosphaeria nodorum SN15]
 gi|111067213|gb|EAT88333.1| hypothetical protein SNOG_04573 [Phaeosphaeria nodorum SN15]
          Length = 1480

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 146/564 (25%), Positives = 246/564 (43%), Gaps = 91/564 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L KQ G N   + L  ++  A QG      ++T++F     KVG+ +   L  FF +WV 
Sbjct: 506  LVKQSGKNGVDRCLYRLLFNARQGKYDNGAITTEDFLDICEKVGHQK---LDSFFSQWVV 562

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDS-----ENRDGDIG-- 113
              GCP      S+NK+K +V+L + +   ++ DS    + +   +     E ++   G  
Sbjct: 563  GSGCPNFECFPSFNKKKQVVQLQIRQ---IQADSTVTTIENALSADDFMREAKESAHGFL 619

Query: 114  -------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
                   + G M+IR+HE DG  Y+H +   + +    +      K   R  L  ++ + 
Sbjct: 620  HGNEFPTFVGPMTIRIHEADGTPYEHIVDINSSNVKVEIPYNTKYKRLKRNRLNKERQAA 679

Query: 166  PDGCDDNGDAVAGLDM-------------------------RSSMES-PLSWIRADPEME 199
              G D +G+A   +                            + ME     WIR D + E
Sbjct: 680  AAGLDASGEAQEDVTFYMLGDVFGTAKEQEEWRLSDWPAEDEAKMEGEAYEWIRIDADFE 739

Query: 200  YLAEIHFNQ-PVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSK 255
            ++   H N  P  M+++QL++D DVVAQ ++I   A+ E    +S  +V T    L D++
Sbjct: 740  WICRAHINDMPSYMYVSQLQQDKDVVAQYESIQFLASKEGHRLISTFLVKT----LMDNR 795

Query: 256  AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE 315
             F  +R  AA  +A +A + T+W GL HL K ++         + R NDF D S Y +  
Sbjct: 796  YFHGIRTYAADVMAQSALKRTEWTGLYHLKKAFRELFCIPGSPMTRSNDFSDRSLYLIQC 855

Query: 316  AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL------ 369
            AIP A++ ++  D ++P EA  F+L +++YNDN GN YSD ++++ L+  + +       
Sbjct: 856  AIPKAISKIKGEDGRAPLEAKRFLLDIIRYNDNRGNEYSDDYYISTLMHCLADCLTANKS 915

Query: 370  --------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLTQ---IA 415
                    E  +    F    ++ + R  + D  +P+Y  + T   + C R L       
Sbjct: 916  GDMELFDAETERDEFEFTKKAIEELTRHQRLDEWIPTYQNVYTTTALDCARKLIMNRVTP 975

Query: 416  LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEE 475
             K S F+   Q+               VR++A   L+ L       D+ L   +  +  +
Sbjct: 976  FKPSEFLQYAQL----------GNADNVRLKAWDCLVRLGMLRK--DAVLKFLMHDIRSD 1023

Query: 476  PS--LRGQVKLGIHAMRICQIKGG 497
            PS   R Q+ L + +  I QI  G
Sbjct: 1024 PSPYFRKQL-LRVFSFAIGQIATG 1046


>gi|402879027|ref|XP_003903157.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
           TFIID subunit 2 [Papio anubis]
          Length = 1173

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 133/478 (27%), Positives = 227/478 (47%), Gaps = 66/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +  +  +    +  DQ+  L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIRVLRKR---HVPXDQL--LFKSYAEYGHFVDIRIAALEAVVD 895


>gi|344272789|ref|XP_003408212.1| PREDICTED: transcription initiation factor TFIID subunit 2
           [Loxodonta africana]
          Length = 1202

 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 133/478 (27%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILDQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFLSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  + +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTVDNLSPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 896


>gi|116256085|sp|Q8C176.2|TAF2_MOUSE RecName: Full=Transcription initiation factor TFIID subunit 2;
           AltName: Full=TBP-associated factor 150 kDa; AltName:
           Full=Transcription initiation factor TFIID 150 kDa
           subunit; Short=TAF(II)150; Short=TAFII-150;
           Short=TAFII150
          Length = 1104

 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 136/479 (28%), Positives = 231/479 (48%), Gaps = 67/479 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 449 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 508

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    W+   G       F++N+++N++EL + +D T      +P            G
Sbjct: 509 IKQ----WLDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 546

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 547 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 597

Query: 171 DNGDAVAGLDMRSSME--SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
            NG+ V  +D+ S+ME  SPL WIR DP+M  L ++ F Q   MW  +L  + DVVAQ +
Sbjct: 598 -NGEEV-DMDL-SAMEADSPLLWIRIDPDMSVLRKVEFEQADFMWQYELRYERDVVAQQE 654

Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKF 287
           +I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   
Sbjct: 655 SILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSL 712

Query: 288 YKSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKY 345
           + +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KY
Sbjct: 713 F-TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKY 771

Query: 346 NDNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDR 394
           NDN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++
Sbjct: 772 NDNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEK 831

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           L+PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 832 LLPSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFVDIRIAALEAVVD 886


>gi|51467894|ref|NP_001003835.1| transcription initiation factor TFIID subunit 2 [Danio rerio]
 gi|49619029|gb|AAT68099.1| TATA box binding protein (TBP)-associated factor 150kDa [Danio
           rerio]
          Length = 1191

 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 137/478 (28%), Positives = 223/478 (46%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G    P 
Sbjct: 448 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLVSTSGFLKSISNVSGKDIGPL 507

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 508 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 545

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 546 TQKYVGPIKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 596

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +A
Sbjct: 597 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 654

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +      L + L   + F++VR+ A + LA  A S  + W G   +   +
Sbjct: 655 ILALEKFP--TPPSRRALTDILEQDQCFYKVRMHACFCLAKIANSMVSTWTGPPAMKSLF 712

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + IP A+A +R   N  PRE + F+L L+KYN
Sbjct: 713 -TRMFCCKSCPNIVKTNNFISFQSYFLQKTIPVAMAQLRDVQNLCPREVLSFILDLIKYN 771

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N  SD ++ A    AL  S+         +  + SL       L+ I R L  ++L
Sbjct: 772 DNRKNKLSDNYYRAELIDALTNSLTPAISINNEVRTVDSLNADVRLILEEITRFLNMEKL 831

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G I  D    L K + ++     VR+ A  A++D
Sbjct: 832 LPSYRNTITVSCLRAIRM--LQKNGHIPSDPT--LFKSYAEYGHFVDVRIAALEAVID 885


>gi|145353402|ref|XP_001421003.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|145357369|ref|XP_001422892.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581239|gb|ABO99296.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144583136|gb|ABP01251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1112

 Score =  159 bits (403), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 129/442 (29%), Positives = 215/442 (48%), Gaps = 54/442 (12%)

Query: 1   MLEKQMGSNFFRKILQNII------SRAQG-----------ASPVRTLSTKEFRHFANKV 43
           MLEK++G +  +K+L+         ++ +G           +S  R + T +        
Sbjct: 403 MLEKRLGPDVMQKVLKYFAGLHVRRNKKEGGTRAGPSAEVLSSNARWIHTLQLFDHCRAT 462

Query: 44  GNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT 103
            NL +  +  F  RWV   G P L +G+   +RKN++E AV  + +V   +      +  
Sbjct: 463 VNLGKGEVNSFLERWVYGAGTPKLCVGYVVKRRKNVMEFAVKLEGSVAAAAADRAALAVA 522

Query: 104 DSENRDGDIGWPGMMSIRVHELD--GMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK 161
            +            +++R+ E +     DH ++ +   AWQL+EI    K+  RR   PK
Sbjct: 523 RNHRTS--------VTVRMREENRPDANDH-VVSLGHSAWQLMEIPLQPKIKDRR---PK 570

Query: 162 KGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPV----QMWINQL 217
                   +  GD     ++ ++M+ P+ ++R DPE E++  I  +        M    L
Sbjct: 571 TI-----IESGGDP----ELIAAMDCPVRYVRVDPEFEWMGNIEQSAKQVGLESMMAQML 621

Query: 218 EKDGDVVAQAQAIAAL-EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET 276
           EK+ D+VAQ  A+  L   + + S + V  L+  L+    F RVR EAA AL  +ASE+T
Sbjct: 622 EKEKDIVAQTIAVEFLGRRVANGSVSAVLVLDKCLNSEDTFCRVRAEAALALGKSASEKT 681

Query: 277 DWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNK-SPREA 335
            W GL  L++ YK  + D N G P+ NDFRD ++  V EA+  A+  V   DN  + ++A
Sbjct: 682 QWGGLNALIRHYKKFQCDANTGKPKQNDFRDLAKVIVDEAVITALGYV--MDNGITHQDA 739

Query: 336 VEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL--EFGQQSILFLSSLLKRIDRLLQFD 393
           +E ++  LKYNDN GNP SD  ++A  + ++G      G+Q    L ++++ I   ++ D
Sbjct: 740 LEAIVARLKYNDNEGNPQSDDGFVATCIAALGRCVPADGKQ----LQTVMQTIHYYIRRD 795

Query: 394 RLMPSYNGILTISCIRTLTQIA 415
              PS +  +T + IR L  +A
Sbjct: 796 DRFPSDDLCVTCAGIRALGLLA 817


>gi|198435612|ref|XP_002126792.1| PREDICTED: similar to TAF2 RNA polymerase II, TATA box binding
           protein (TBP)-associated factor [Ciona intestinalis]
          Length = 1154

 Score =  159 bits (403), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 126/477 (26%), Positives = 234/477 (49%), Gaps = 62/477 (12%)

Query: 2   LEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPF------LKEFF 55
           +E ++G     +    ++S A  ++  + +S+       +  G L+  +      +    
Sbjct: 444 IENRIGPQLLLQAFNKLLSLANTSAQQKVISSTWSTMLLSTAGFLKSTYSVSGKDISNLI 503

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWP 115
            +WV   G       F +N+++N VEL       +KPD    +++  T          + 
Sbjct: 504 DQWVHKGGIVRFNSKFVFNRKRNTVEL------DIKPD----IINKGTQK--------YV 545

Query: 116 GMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDA 175
           G +++RV ELDG ++H I     +     ++ CHSK  +RR  + K          NG+ 
Sbjct: 546 GPLTVRVQELDGSFNHTI--QIEENTTKHDLPCHSK--SRRNKRKKIPLL------NGEE 595

Query: 176 VAGLDMRSSM---ESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAA 232
           V   DM  S    +SP+ WIR DP++     ++F QP  MW  QL  + DV+ Q++AI A
Sbjct: 596 V---DMDLSFMDPDSPVLWIRIDPDVNIPRLVNFEQPDYMWQYQLRYERDVIGQSEAIGA 652

Query: 233 LEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYKSR 291
           LEA    +    N L + + +   +++VR++A  +LANTA+   + W G   ++  + +R
Sbjct: 653 LEAFSTPA--TRNALTDIIENEHYYYKVRMQACKSLANTANAMVNTWTGPPAMLNLF-TR 709

Query: 292 RF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
            F  D +  + R N+F +F +YF+ + +P A++M+R      P+E + ++  L+K+NDN+
Sbjct: 710 FFCCDTSPPIVRMNNFSNFQQYFLQKQLPVAMSMLRNTHGVCPKEILHYLFGLIKFNDNS 769

Query: 350 GNPYSDVFWLAALVQSVGE-------LEFGQQSI-----LFLSSLLKRIDRLLQFDRLMP 397
            N YSD ++ +ALV ++         L+  Q+       + + +++  + R L  ++ +P
Sbjct: 770 KNKYSDNYYRSALVDALNNTITPAVTLQSDQKVTADSMPMDIHTIVDEVTRFLNLEKHLP 829

Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
           SY   +TISC+R +    L+ +G +  D  ++L K +        VR+ A   L+D+
Sbjct: 830 SYRYTITISCLRAIRN--LQRNGHLPSD--IELFKSYAAPGFFRDVRLAAVGCLVDV 882


>gi|342321629|gb|EGU13562.1| TATA-binding protein associated factor Taf2 [Rhodotorula glutinis
           ATCC 204091]
          Length = 1877

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 135/477 (28%), Positives = 228/477 (47%), Gaps = 71/477 (14%)

Query: 11  FRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMG 70
           F KI  + IS  +  S   +LST  F     K+       LK FF +WV   GCP   + 
Sbjct: 499 FPKIFLDAIS-GELTSQANSLSTTSFMRTCRKLCGGSSEALKIFFDQWVYGSGCPTFEVT 557

Query: 71  FSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-WPGMMSIRVHELDGM- 128
            ++N++K  VEL V + C     S+     +  +++     I  + G M+IR+HE DG  
Sbjct: 558 ANFNRKKMAVELNVTQRCYAYAWSQ----KAPWEAQGHLRPIPLFDGQMTIRIHEADGTP 613

Query: 129 YDHPILPMAGDAWQLLEIQCHSKLAARR----------------ALKPKKGSKPDGCDDN 172
           Y+H +     D+++  E+  ++K    R                AL   +  +  G  D 
Sbjct: 614 YEHVL--TIQDSFKRHEVPFNTKYKRVRRNTKRYQARQAAATAAALGDIEAQEDMGLIDL 671

Query: 173 GDAVAGLDMRSSME-----------------SPLSWIRADPEMEYLAEIHFNQPVQMWIN 215
           G ++   +     E                 +   WIR D ++E++  + F QP  MWI+
Sbjct: 672 GFSLGMWEDEKERERWRVADWTEEDDLVMSQATYEWIRIDADIEWICIVKFTQPDFMWIS 731

Query: 216 QLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
           QL++D DVVAQ +AI AL ++        N     L  S  F+R+R+EAA AL + A+ E
Sbjct: 732 QLQRDRDVVAQLEAIHALSSI-PSPIVSSNMCKTVLV-SNYFFRIRMEAALALISCATLE 789

Query: 276 TDWAGLLHLVKFYKSRR-FDENI---------GLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
             + GL HL K +++   F+ ++          +P+ NDF DF+ YF+ +A+  A++M+R
Sbjct: 790 QQYLGLFHLFKTFQTWYCFEPDVETKDPFGFRCIPKTNDFSDFTLYFMKKALVTAISMIR 849

Query: 326 AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---------ELEFGQQSI 376
               ++P    +F++ LL YNDN GN YSD F++ +++ ++          +L    + I
Sbjct: 850 DERGQTPPVIQQFLVDLLAYNDNLGNKYSDAFYITSVMNALSHTLVNVVPRDLNGDTEQI 909

Query: 377 LFLSS------LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQV 427
             + S       ++ ++R L  DRL+PSY+  +TI+ I    ++ L L+  I  D++
Sbjct: 910 TEVDSNENLAPAVQEVERYLASDRLVPSYHNAVTIAGIE--FKLKLTLASLIPEDRM 964


>gi|329744651|ref|NP_001178070.1| transcription initiation factor TFIID subunit 2 [Bos taurus]
          Length = 1199

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++   Q   MW  QL  + DVVAQ +A
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMAVLRKVDSEQADFMWQYQLRYERDVVAQQEA 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I+ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ISALERFPTPASRLA--LTDVLEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFKSYAEYGHFVDIRIAALEAVVD 896


>gi|398397863|ref|XP_003852389.1| hypothetical protein MYCGRDRAFT_42345, partial [Zymoseptoria tritici
            IPO323]
 gi|339472270|gb|EGP87365.1| hypothetical protein MYCGRDRAFT_42345 [Zymoseptoria tritici IPO323]
          Length = 1180

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 136/513 (26%), Positives = 246/513 (47%), Gaps = 73/513 (14%)

Query: 2    LEKQMGSNFFRKILQNIISRAQGASPVR-TLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L K  GS    +I+  I   A+  S V   LST++F+    ++G+ +   L+ FF +WVG
Sbjct: 517  LIKSSGSTGVTRIINKIFLNAKTGSLVNGELSTQDFQRTCERLGHNK---LESFFRQWVG 573

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD---------GD 111
              GCP+  +   +NK+K +VE+ +++   ++  ++ P   SN   E ++           
Sbjct: 574  GAGCPIFYVTQRFNKKKLVVEMTIVQR-QLERKTKPPFEPSNFMREIKEHVNEVYMAGTQ 632

Query: 112  IGWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAA-RRALKPKKGSKPDGC 169
              + G M+IR+HE DG  Y+H +     +    LEI  ++K    +R+ + K+ +  +G 
Sbjct: 633  NVFTGPMTIRIHEADGTPYEHIV--EIKEPITKLEIPYNTKYKRLKRSRRQKERAAAEG- 689

Query: 170  DDNGDAVAGL-------------------------DMRSSMESPLSWIRADPEMEYLAEI 204
            +  G+    L                         D     +    WIR D + E++ +I
Sbjct: 690  NPAGEGEEALLYCLGDILDTPEDIQEWELRDWTAEDEDKMGQESYEWIRMDADFEWIGKI 749

Query: 205  HFNQPVQMWINQLEKDGDVVAQAQAI-AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIE 263
            H   P+ M+++QL++D D+VAQ +++   L + PH     ++ L   L D + F+ +R+ 
Sbjct: 750  HLVLPLYMYVSQLQQDRDLVAQHESMRYILGSNPH--HTSLSILLRTLMDHRYFYGIRVM 807

Query: 264  AAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAM 323
            AA  LA  A ++    G  HL K ++    DE+  +PRPNDF +   + +  AIP A+A 
Sbjct: 808  AAEGLAVLAKDDFTRIGQKHLTKAFEHLFCDEDSVMPRPNDFSNRVTFIMQCAIPQAMAR 867

Query: 324  VRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAAL--------------VQSVGEL 369
            +R AD K PR   +F +  LK+NDN+ N +SD  +++ L              V+ V + 
Sbjct: 868  LRDADGKVPRAVQQFFVDKLKFNDNSDNEFSDCHYVSTLMTCLTDSLVASHREVKPVYQF 927

Query: 370  EFGQQSIL--------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGF 421
            +FG+   L        F +  +  I+R  + D  + +Y  + + + I  L ++    +  
Sbjct: 928  DFGEDEPLEPENPDAEFEALAVGVIERYRRIDEWVLTYQNVYSTTAIECLQKLT---NAG 984

Query: 422  ISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            I  D+  +L++  R  +T   VR++A R L+++
Sbjct: 985  IVKDKSKELLQYTRP-STAPNVRLQAFRCLIEM 1016


>gi|432908138|ref|XP_004077772.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
           [Oryzias latipes]
          Length = 1227

 Score =  159 bits (401), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 132/478 (27%), Positives = 228/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          LST  F +  +N  G    P 
Sbjct: 463 LIENRISMEFMLQVFNKLLSLASTASSQKYQSHMWSQMLLSTYGFLKSISNVSGKDIGPL 522

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 523 IKQ----WVDQSGVVKFFGSFAFNRKRNVLELEIRQDYT------------------SSG 560

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 561 TQKYVGPIKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 611

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +A
Sbjct: 612 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEA 669

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I+ALE  P  +  +   L + L   + F++VR+ A + LA  A S  + W G   +   +
Sbjct: 670 ISALEKFPTPASRLA--LTDILEQEQCFYKVRMLACFCLAKIANSMVSTWQGPPAMKSLF 727

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P++ + F+L L+KYN
Sbjct: 728 -TRMFCCKSCPNIVKTNNFISFQSYFLQKTMPVAMALLRDVQNLCPKDVLNFILDLIKYN 786

Query: 347 DNNGNPYSDVFWLAALVQSV-----------GELEFGQQSILFLSSLLKRIDRLLQFDRL 395
           DN  N +SD ++ A L++++            E+         +  +L+ I R L  ++L
Sbjct: 787 DNRKNKFSDNYYRAELIEALTNSLTPAISINNEVRTVDNLNPDVRLILEEITRFLNMEKL 846

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PS+   +T+SC+R + Q  L+ +G I  D    L   + ++     VR+ A  AL+D
Sbjct: 847 LPSFRNTITVSCLRAIRQ--LQRNGHIPSDP--SLFTSYAEYGHFVDVRIAALEALVD 900


>gi|296480455|tpg|DAA22570.1| TPA: TAF2 RNA polymerase II, TATA box binding protein
           (TBP)-associated factor, 150kDa [Bos taurus]
          Length = 1203

 Score =  159 bits (401), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++   Q   MW  QL  + DVVAQ +A
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMAVLRKVDSEQADFMWQYQLRYERDVVAQQEA 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I+ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ISALERFPTPASRLA--LTDVLEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFKSYAEYGHFVDIRIAALEAVVD 896


>gi|440894490|gb|ELR46931.1| Transcription initiation factor TFIID subunit 2, partial [Bos
           grunniens mutus]
          Length = 1197

 Score =  159 bits (401), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 457 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 516

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 517 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 554

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 555 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 605

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++   Q   MW  QL  + DVVAQ +A
Sbjct: 606 -NGEEV-DMDLSAMDADSPLLWIRIDPDMAVLRKVDSEQADFMWQYQLRYERDVVAQQEA 663

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I+ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 664 ISALERFPTPASRLA--LTDVLEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 721

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 722 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 780

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 781 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 840

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 841 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFKSYAEYGHFVDIRIAALEAVVD 894


>gi|295442960|ref|NP_593331.2| TATA-binding protein associated factor Taf2 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|259016379|sp|P87121.3|TAF2_SCHPO RecName: Full=Transcription initiation factor TFIID subunit 2;
           AltName: Full=TBP-associated factor 2
 gi|254745512|emb|CAB08750.3| TATA-binding protein associated factor Taf2 (predicted)
           [Schizosaccharomyces pombe]
          Length = 1172

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/432 (28%), Positives = 202/432 (46%), Gaps = 54/432 (12%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           LST  F     K  ++    L  F  +W+   G P+ R+   +N++K I+E+ +      
Sbjct: 511 LSTSHFLKTCEKASHMR---LDVFAQQWIYGYGYPIFRVVQRFNRKKMIIEMGI-DQVQT 566

Query: 91  KPDSRTPVLSSNTDSE------NRDGDIGWP---GMMSIRVHELDGM-YDHPILPMAGDA 140
           K   R P+   N  S+      N     G P   G M+IR+HE DG  Y+H +     D+
Sbjct: 567 KEAPRAPMSDKNFLSDAIRHLNNESIPTGLPVFSGPMTIRIHEADGTPYEHVV--ELKDS 624

Query: 141 WQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGD----AVAGL-DMRSSMES-------- 187
           +  L+IQ ++K       +  K  K DG D NGD     +  L D+  S E         
Sbjct: 625 FTKLDIQYNTKYKRISRNRSTKNVKRDG-DHNGDDSDYVIRSLGDVLQSDEDIERWHLYD 683

Query: 188 ------------PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEA 235
                          WIR D + E++ ++   QP  M+++QL++D DVVAQ + I    +
Sbjct: 684 YTKEEEDTMATEAFEWIRVDADFEWICDLRVRQPEHMYVSQLQQDRDVVAQLETIRHFTS 743

Query: 236 LPH-LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD 294
               +S  V   L   L D++ ++ +R EAA ALA  A  E DW G  HL   Y  +   
Sbjct: 744 ESFTVSQQVSTVLLRTLLDNRYYYGIRQEAARALARCAIPELDWVGYYHLRMAYLEKFCF 803

Query: 295 ENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYS 354
           ++  +P+ NDF + +EY+V  A+  +   +R     +P    + +L LL YNDN  N +S
Sbjct: 804 KDSTIPKSNDFSNITEYYVKCAMLESFPNIRDRKGVTPSSVKKLLLDLLCYNDNANNEFS 863

Query: 355 DVFWLAALVQSV--GELEFGQ---------QSILFLSSLLKRIDRLLQFDRLMPSYNGIL 403
           D +++  L+ S+  G +  G+         + + F++ ++  IDR ++ D  MPS+  I+
Sbjct: 864 DAYFICLLIDSLVEGLIPRGETVQYAFSDPEHMEFVNQVINEIDRYMRIDACMPSFKNII 923

Query: 404 TISCIRTLTQIA 415
           T   ++   ++A
Sbjct: 924 TCKSLKAKLRLA 935


>gi|149410593|ref|XP_001509779.1| PREDICTED: transcription initiation factor TFIID subunit 2, partial
           [Ornithorhynchus anatinus]
          Length = 1099

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 133/478 (27%), Positives = 226/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 359 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 418

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV           F++N+++N++EL + +D T                    G
Sbjct: 419 IKQ----WVDQSSVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 456

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 457 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 507

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 508 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQEES 565

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 566 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 623

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 624 -TRMFCCKSCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 682

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 683 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 742

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 743 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFKSYAEYGHFVDIRLAALEAVVD 796


>gi|391343753|ref|XP_003746170.1| PREDICTED: transcription initiation factor TFIID subunit 2
           [Metaseiulus occidentalis]
          Length = 1191

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 133/501 (26%), Positives = 218/501 (43%), Gaps = 65/501 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPV------RTLSTKEFRHFANKVGNLERPFLKEF 54
           MLE ++G     ++   ++S A  A+          +ST  F+     V   E   +  F
Sbjct: 444 MLENRLGKELLLQVFNKLLSLAGAATKGIGGLENMLVSTNSFQKAVFLVTGKE---INTF 500

Query: 55  FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW 114
              WV   G       F +N+++N VEL +          R P + +N       G   +
Sbjct: 501 IDHWVRQGGHAKFVGSFVFNRKRNTVELEI----------RQPDVGTN-------GIKKY 543

Query: 115 PGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGD 174
            G + + + ELDG + H +     D     +I CHSK    +  K    +        G+
Sbjct: 544 NGPLLVTIQELDGTFKHML--QIEDIVSKHDITCHSKSRRNKKKKIPLCT--------GE 593

Query: 175 AVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
            V  +D+ +   +SP+ WIR DPEM  L  +H NQ    W  QL  + D+ AQ QA+  L
Sbjct: 594 EV-DMDLSAMDADSPVLWIRIDPEMLVLRTVHINQQDFHWHYQLRYERDITAQLQALREL 652

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF 293
           E  P  S +    L + + + + F+RVR EA Y L   A+    WAG   ++  +K  + 
Sbjct: 653 E--PMCSQDTRKALTDTIENDQCFYRVRCEATYCLKQVANALQTWAGPPAMITIFK--KL 708

Query: 294 DENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNG 350
             +   P   + N+F +F  YF+ +AIP A+A +R      P E + F+L L KYN+N+ 
Sbjct: 709 YGSFASPNMVKQNNFSNFQHYFLQQAIPKAMAGLRTVHRICPPEVLSFLLDLFKYNENSK 768

Query: 351 NPYSDVFWLAALVQSVGELEFGQQSILF---------------LSSLLKRIDRLLQFDRL 395
           N YSD ++  AL+ ++ E      S++                  S+L+ I R L  ++L
Sbjct: 769 NRYSDCYYRRALIDALSETVTPVVSVMLNQSNSKVMVDSMSEETKSILEEITRCLNLEKL 828

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
           +P Y  + T+ C+  +     KL     +    +L + +        VR+ A   L+D  
Sbjct: 829 LPGYKLVCTVGCLHAIR----KLQKMGHIPNSAELFRQYASHQQFEDVRLAALEMLVDFT 884

Query: 456 FHCNGIDSALSLFIKSVEEEP 476
              +G    L+  +  VE +P
Sbjct: 885 -STDGSVEDLNHLLDVVESDP 904


>gi|345566660|gb|EGX49602.1| hypothetical protein AOL_s00078g91 [Arthrobotrys oligospora ATCC
           24927]
          Length = 1221

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 136/491 (27%), Positives = 227/491 (46%), Gaps = 90/491 (18%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV--LRDC 88
           LST +F     KVG+ +   L  FF +WV   G P   +   +NK+K IVE+ +   +D 
Sbjct: 477 LSTAQFIKQCEKVGHQK---LDSFFNQWVFRSGYPRFEISQRFNKKKMIVEMIIRQTQDK 533

Query: 89  TVKPDSRTPVLSSNTDSENR--DG--DIGWP----GMMSIRVHELDGM-YDHPILPMAGD 139
            V     TP  S   D+ NR  DG  D   P    G M+IR+HE DG  Y+H +      
Sbjct: 534 EVAQKGLTP-QSFLRDALNRQTDGVKDASPPQVFSGPMTIRIHEADGTPYEHVV------ 586

Query: 140 AWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMES------------ 187
             Q+ E+  +SKL      K K+  +     +   A AG+D+ +  ++            
Sbjct: 587 --QIHEV--YSKLEIPYNTKYKRLKRTRRQKERLAAFAGMDINAETQAEDVLLYCLGDVL 642

Query: 188 -------------------------PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGD 222
                                       WIR D + E++  +  NQP  M+++QL++D D
Sbjct: 643 QSDEDTADWRLTDFTKEEEDKMAQEAFEWIRVDADFEWVCTMRLNQPDHMYLSQLQQDRD 702

Query: 223 VVAQAQAIAAL-EALP-HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAG 280
           V AQ +++    EA P HL   +   L   + D + ++ +R EAA AL+  A+ E  W G
Sbjct: 703 VAAQYESVLHFKEAKPSHL---ISTILVRTVMDRRYYYGIRCEAAKALSRCATAELSWIG 759

Query: 281 LLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVL 340
             HL K ++      +  +PR NDF DF+ YFV + I      +R    KSP     ++L
Sbjct: 760 EFHLQKAFQEFYCFPDSLIPRANDFSDFTSYFVQQNIITGFGGIRDGMGKSPLSVQRYLL 819

Query: 341 QLLKYNDNNGNPYSDVFWLAALVQSVG---ELEFGQQSILFL--------------SSLL 383
            +++YNDN+ N +SD F LA L++S+G   +   GQ+ ++ +                ++
Sbjct: 820 DIIRYNDNSNNSFSDSFHLANLLRSLGSCLQAGSGQRGMMDMDVDETDTEEREKSKEEII 879

Query: 384 KRIDRLLQFDRLMPSYNGILTISCIRTLTQ-IALKLSGFISLDQVVKLIKPFRDFNTIWQ 442
           + I+RL + D+ M SY+ +++ +C+    + I  K+  +  L+ +      +    T  Q
Sbjct: 880 REIERLQRMDQWMSSYHNVISTTCLDVKNRLIKAKVQEYTPLEFLA-----YSKEGTSEQ 934

Query: 443 VRVEASRALLD 453
           +R++A   L++
Sbjct: 935 LRIKAINCLVE 945


>gi|426236245|ref|XP_004012081.1| PREDICTED: transcription initiation factor TFIID subunit 2 [Ovis
           aries]
          Length = 1178

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 134/478 (28%), Positives = 225/478 (47%), Gaps = 65/478 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L      Q   MW  QL  + DVVAQ +A
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMAVLRRADSEQADFMWQYQLRYERDVVAQQEA 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           IAALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 IAALERFPTPASRLA--LTDVLEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           DN  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L
Sbjct: 783 DNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKL 842

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +PSY   +T+SC+R +    L+ +G +  D    L + + ++     +R+ A  A++D
Sbjct: 843 LPSYRHTITVSCLRAIR--VLQKNGHVPSDP--SLFRSYAEYGHFVDIRIAALEAVVD 896


>gi|340712189|ref|XP_003394646.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
           [Bombus terrestris]
          Length = 1198

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 132/505 (26%), Positives = 228/505 (45%), Gaps = 73/505 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       F   +  +    +  
Sbjct: 457 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 515

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     + F +N+++N VEL + +D T                 N+ G   
Sbjct: 516 FIDQWVRTGGHAKFSLSFVFNRKRNTVELEIRQDTT-----------------NQRGIRK 558

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + + + ELDG + H  L + G   +  +I CHSK    +  K    +        G
Sbjct: 559 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 608

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L  +   QP   W  QL  + DV AQ +AI AL
Sbjct: 609 EEV-DMDLSAMDDSPVLWIRLDPEMTILRAVQIEQPDYQWQYQLRHERDVTAQLEAIEAL 667

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           +  A P     + +T+ N       +++VR+ AA+ L   A+     WAG   ++  ++ 
Sbjct: 668 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 723

Query: 291 -------RRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
                  RR      + + N+F +F  YF+ + IP A+A +R A    P E + F++ L 
Sbjct: 724 LFGSASCRR------IIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLF 777

Query: 344 KYNDNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQ 391
           KYNDN+ N YSD ++ AAL++++G      +   Q + +   SL       L+ + R L 
Sbjct: 778 KYNDNSKNRYSDNYYRAALIEALGATVTPVISVQQGTAINAESLSVDTKAILEEVTRNLN 837

Query: 392 FDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
            ++L+P Y   ++++CI+ +      L  F  L     L + +  +     VR+ A  AL
Sbjct: 838 LEKLLPCYKYTISVACIKVIR----ILQKFGHLPSNPHLFRAYATYGQFIDVRIAALEAL 893

Query: 452 LDLEFHCNGIDSALSLFIKSVEEEP 476
           +D     +G    L   +  +E +P
Sbjct: 894 VDFT-RVDGKWEDLEFLLDMIEMDP 917


>gi|365986418|ref|XP_003670041.1| hypothetical protein NDAI_0D04850 [Naumovozyma dairenensis CBS 421]
 gi|343768810|emb|CCD24798.1| hypothetical protein NDAI_0D04850 [Naumovozyma dairenensis CBS 421]
          Length = 1544

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 136/495 (27%), Positives = 236/495 (47%), Gaps = 85/495 (17%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  GA    +L+++ F+H   +V    +  L+ FF +WV   G P+LR+  
Sbjct: 679  RVLPKIFLQAMSGALSNNSLTSQHFQHVCERVN---KNKLEYFFQQWVYGSGVPILRVTQ 735

Query: 72   SYNKRKNIVELAVLRDCTVKPDSRTPVLSSN---------TDSENRDGDIGWPGMMSIRV 122
             +NK+K +VE+ + R C  K   +  V+  +          D  +R+    + G M+IR+
Sbjct: 736  RFNKKKMVVEMGI-RQCQDKELDQGKVIGEDGFFASALEHIDHPDRNRTSFFTGSMTIRI 794

Query: 123  HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKG--SKPDGC---------- 169
            HE DG  Y+H +     D +  +EIQ ++K    R  +   G  + P G           
Sbjct: 795  HEADGTPYEHIV--ELKDIFTKIEIQYNTKYRRIRNKRLNVGGTNTPSGSGALSSAASRT 852

Query: 170  --------DDNGDAV-AGLDMRSSME---------------------------------S 187
                    ++  DA+ +  D+ S +E                                  
Sbjct: 853  NLNSQEFLNEQMDAIPSNRDVDSKIEKLGNVLTTRENRERWNLTDFSKTTEGNEFQRQNE 912

Query: 188  PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL----PHLSFNV 243
               W+R D + E++ +IH NQP  M+ +QL++DGDV AQ ++I   EA+    P  S   
Sbjct: 913  AFEWLRIDSDSEWICKIHINQPDYMFASQLQQDGDVEAQLESIRYYEAVILDSPVNSMIY 972

Query: 244  VNTLNNFLSDSKAFWRVRIEAAYALANTASEETD----WAGLLHLVKFYKSRRFDENIGL 299
             + L     D + F+ +R+EA  AL+  A  ETD      G  HL+K ++S    +N  +
Sbjct: 973  SSILTRTAMDRRYFYGIRLEACRALSKYAMNETDPNNFMGGSRHLIKVFQSLFCLDNSNI 1032

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
            P+ N+F DF  YF+ + IP  ++ VR   N+SP    +F+L LL YN+N+ N Y+D +++
Sbjct: 1033 PKNNNFSDFQLYFLQKEIPSFLSKVRNERNESPHFIKQFLLDLLTYNENSENNYNDCYYV 1092

Query: 360  AALVQSVGELEFG-QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
            A L+ +V +     ++   FL  +L +I R    ++ +P+Y  I+T + I    ++ L +
Sbjct: 1093 ATLLHAVVDCAIADREDETFLPKVLDQIRRYENLEKWLPTYQLIVTYTIIE--IKLKLHV 1150

Query: 419  SGFISL---DQVVKL 430
            +G  S    D++++L
Sbjct: 1151 AGLYSFENNDEILQL 1165


>gi|196008193|ref|XP_002113962.1| hypothetical protein TRIADDRAFT_27954 [Trichoplax adhaerens]
 gi|190582981|gb|EDV23052.1| hypothetical protein TRIADDRAFT_27954 [Trichoplax adhaerens]
          Length = 890

 Score =  156 bits (395), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 111/443 (25%), Positives = 210/443 (47%), Gaps = 62/443 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTK------EFRHFANKVGNLERPFLKEF 54
           M+E ++G     +    +++ A   +   + S++        +HF   +  +    +KEF
Sbjct: 343 MIEIRIGQESLLQAFNKLLTLANTGAKHHSHSSQWGNLLISTKHFLKTIHLVSGKDIKEF 402

Query: 55  FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW 114
             +WV   G P  +  FS+N+++N +EL +                  T+ +++     +
Sbjct: 403 INQWVCRSGLPHFKGSFSFNRKRNTLELEL------------------TEKQDKSQFGFY 444

Query: 115 PGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGD 174
            G ++I V E+DG Y H +     ++    +I CHSK    +  K            NGD
Sbjct: 445 VGPLTICVQEVDGSYTHTL--QIEESTYHYDISCHSKSKKNKKKKIPLS--------NGD 494

Query: 175 AVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE 234
                   +  +SP+ W+R DP+M +L ++ F+QP  MW  Q++ + D +AQ  A+ AL 
Sbjct: 495 ETEINFSSADSDSPVLWVRIDPDMHWLRKVTFDQPDYMWQYQMKYERDSLAQYLAVKALA 554

Query: 235 ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWA--------GLLHLV- 285
              ++S + V  L   ++D   ++R+R+ A  ALA    E   WA         L ++  
Sbjct: 555 K--YVSADSVCCLTEIINDRHIYYRIRVAAIKALAKVTGE---WALQRCSIATSLFNIYR 609

Query: 286 KFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKY 345
           K + SR       +PR NDF     YF+ +AIP A+  +R +    PRE ++F++ L+KY
Sbjct: 610 KMFGSRSCQT---IPRYNDFSYMIGYFIQKAIPKALGSIRNSHGNCPREILDFIIDLIKY 666

Query: 346 NDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSS-----------LLKRIDRLLQFDR 394
           NDN+ N YSD +++++L+ ++        +I  +S            +L+ +   L +++
Sbjct: 667 NDNSQNEYSDNYYMSSLIDALTSAISPATTITGISESNYTLSEESKIILQEVTHALNYEK 726

Query: 395 LMPSYNGILTISCIRTLTQIALK 417
           ++ S+   +++SCI+ +  +  K
Sbjct: 727 ILASHRNCISVSCIKNIRMLQKK 749


>gi|194868201|ref|XP_001972244.1| GG13996 [Drosophila erecta]
 gi|190654027|gb|EDV51270.1| GG13996 [Drosophila erecta]
          Length = 1222

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 144/587 (24%), Positives = 261/587 (44%), Gaps = 76/587 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           MLE ++G     ++    ++ A  A+  +          +ST  F      +  +    +
Sbjct: 463 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 519

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +WV T G     +   +N+++N +EL + +D                   N+ G 
Sbjct: 520 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGI 562

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + +++ ELDG + H +        Q+      S +      +  K  K   C  
Sbjct: 563 RKYNGPLMVQLQELDGTFKHTL--------QIESTLVKSDITCHSKSRRNKKKKIPLC-- 612

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
            G+ V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI 
Sbjct: 613 TGEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 671

Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           AL+  P  +  +   L + +   + F++VR EAA++L   A++    W+G   ++  +  
Sbjct: 672 ALQKYPTNATRLA--LTDTIESERCFYKVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 727

Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           R+F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+
Sbjct: 728 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 787

Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
           N+ N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++
Sbjct: 788 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 847

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +PSY  ++++SC++ +     KL  F  L  +  + + + ++     +R+ A   L+D 
Sbjct: 848 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 903

Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLN 514
               +G    L   I  +E +P    +  L    + I       +S   +D   LV  L 
Sbjct: 904 -VKVDGRSEDLEHLITLLETDPDPAARHALA--QLLIDSPPFTRESRSRLDKPNLVDRLW 960

Query: 515 LLESRIAFNNVFLRHHLFGILQILAGRAPTLYGVPRDKLLLLGDGET 561
              +R+A++   LR  +  +          LYG  R   L  G+ ++
Sbjct: 961 FSINRLAYDTK-LRCDIVDLYY-------ALYGTKRPNCLQAGENQS 999


>gi|225561939|gb|EEH10219.1| TATA-binding protein associated factor Taf2 [Ajellomyces capsulatus
            G186AR]
          Length = 1247

 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 142/545 (26%), Positives = 230/545 (42%), Gaps = 85/545 (15%)

Query: 1    MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +LE+++     +  +  IISR       G  P   L+T  F+    ++G+ +   L  FF
Sbjct: 511  ILERRLSKASGKATISRIISRIFLNARMGDLPNGALATSFFQKTCERLGHTK---LDTFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
             +WV   GCP       +NK+K +VE+ + +  + +P  R   +S+   +   E R+   
Sbjct: 568  QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLEMSTFMRDVKEEIRNVYA 627

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 628  GPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687

Query: 167  DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
             G D N            GD +   +                R   ES   WIR D + E
Sbjct: 688  SGIDPNAELQDDVLLYCLGDVLQSEEEIQTWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
            ++ ++    P  M+++QL++D D+VAQ ++I   AA    P +S   V T    L D + 
Sbjct: 747  WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRRY 802

Query: 257  FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
            F+ +R  AA AL   A +E DW GL HL   ++         + R NDF D + Y +   
Sbjct: 803  FYGIRAAAARALVKHAKDEVDWIGLYHLESAFQELFCLPGSPMTRSNDFSDRAAYILQLT 862

Query: 317  IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG---- 372
            IP A+A VR    K+P     F+   LK+NDN+ N  SD +++ AL++S+ E   G    
Sbjct: 863  IPQAIAQVRDNSGKTPMRVKRFLYDKLKFNDNSNNELSDCYYITALMKSICEALLGRRET 922

Query: 373  -----------------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLT 412
                             Q         +  IDR  + D    S+  I +   I C R L 
Sbjct: 923  EIDGVGIDFDVERELERQAEQQLDRDCIAEIDRYRRMDEWTTSFQNIYSRTAIECQRRLM 982

Query: 413  QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSV 472
            Q  +       +D+ +    P+    T   +R++A   L+D +   N     L+ FI  +
Sbjct: 983  QANV-------IDRDIMQFLPYTRAGTYEMLRLDAFNILIDFDIFQNS--ELLTWFIYVM 1033

Query: 473  EEEPS 477
              +PS
Sbjct: 1034 SADPS 1038


>gi|24662032|ref|NP_729571.1| TBP-associated factor 2, isoform A [Drosophila melanogaster]
 gi|442631480|ref|NP_001261668.1| TBP-associated factor 2, isoform B [Drosophila melanogaster]
 gi|27924002|sp|Q24325.2|TAF2_DROME RecName: Full=Transcription initiation factor TFIID subunit 2;
           AltName: Full=Transcription initiation factor TFIID 150
           kDa subunit; Short=TAF(II)150; Short=TAFII-150;
           Short=TAFII150
 gi|17944283|gb|AAL48035.1| LD43729p [Drosophila melanogaster]
 gi|23093758|gb|AAF50190.2| TBP-associated factor 2, isoform A [Drosophila melanogaster]
 gi|220945982|gb|ACL85534.1| Taf2-PA [synthetic construct]
 gi|323301248|gb|ADX35966.1| LD23043p [Drosophila melanogaster]
 gi|440215588|gb|AGB94362.1| TBP-associated factor 2, isoform B [Drosophila melanogaster]
          Length = 1221

 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 127/502 (25%), Positives = 231/502 (46%), Gaps = 66/502 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           MLE ++G     ++    ++ A  A+  +          +ST  F      +  +    +
Sbjct: 462 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 518

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +WV T G     +   +N+++N +EL + +D                   N+ G 
Sbjct: 519 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDYV-----------------NQRGI 561

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + +++ ELDG + H +        Q+      S +      +  K  K   C  
Sbjct: 562 RKYNGPLMVQLQELDGTFKHTL--------QIESTLVKSDITCHSKSRRNKKKKIPLC-- 611

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
            G+ V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI 
Sbjct: 612 TGEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 670

Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           AL+  P  +  +   L + +   + F++VR EAA++L   A++    W+G   ++  +  
Sbjct: 671 ALQKYPTNATRLA--LTDTIESERCFYQVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 726

Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           R+F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+
Sbjct: 727 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 786

Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
           N+ N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++
Sbjct: 787 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 846

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +PSY  ++++SC++ +     KL  F  L  +  + + + ++     +R+ A   L+D 
Sbjct: 847 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 902

Query: 455 EFHCNGIDSALSLFIKSVEEEP 476
               +G    L   I  +E +P
Sbjct: 903 -VKVDGRSEDLEHLITLLETDP 923


>gi|240275553|gb|EER39067.1| TATA-binding protein associated factor Taf2 [Ajellomyces capsulatus
            H143]
          Length = 1125

 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 142/545 (26%), Positives = 230/545 (42%), Gaps = 85/545 (15%)

Query: 1    MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +LE+++     +  +  IISR       G  P   L+T  F+    ++G+ +   L  FF
Sbjct: 511  ILERRLSKASGKATISRIISRIFLNARMGDLPNGALATSFFQKTCERLGHTK---LDTFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
             +WV   GCP       +NK+K +VE+ + +  + +P  R   +S+   +   E R+   
Sbjct: 568  QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLEMSTFMRDVKEEIRNVYA 627

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 628  GPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687

Query: 167  DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
             G D N            GD +   +                R   ES   WIR D + E
Sbjct: 688  SGIDPNAELQDDVLLYCLGDVLQSEEEIQTWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
            ++ ++    P  M+++QL++D D+VAQ ++I   AA    P +S   V T    L D + 
Sbjct: 747  WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRRY 802

Query: 257  FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
            F+ +R  AA AL   A +E DW GL HL   ++         + R NDF D + Y +   
Sbjct: 803  FYGIRAAAARALVKHAKDEVDWIGLYHLESAFQELFCLPGSPMTRSNDFSDRAAYILQLT 862

Query: 317  IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG---- 372
            IP A+A VR    K+P     F+   LK+NDN+ N  SD +++ AL++S+ E   G    
Sbjct: 863  IPQAIAQVRDNSGKTPMRVKRFLYDKLKFNDNSNNELSDCYYITALMKSICEALLGRRET 922

Query: 373  -----------------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLT 412
                             Q         +  IDR  + D    S+  I +   I C R L 
Sbjct: 923  EIDGVGIDFDVERELERQAEQQLDRDCIAEIDRYRRMDEWTTSFQNIYSRTAIECQRRLM 982

Query: 413  QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSV 472
            Q  +       +D+ +    P+    T   +R++A   L+D +   N     L+ FI  +
Sbjct: 983  QANV-------IDRDIMQFLPYTRAGTYEMLRLDAFNILIDFDIFQNS--ELLTWFIYVM 1033

Query: 473  EEEPS 477
              +PS
Sbjct: 1034 SADPS 1038


>gi|367012451|ref|XP_003680726.1| hypothetical protein TDEL_0C06260 [Torulaspora delbrueckii]
 gi|359748385|emb|CCE91515.1| hypothetical protein TDEL_0C06260 [Torulaspora delbrueckii]
          Length = 1396

 Score =  155 bits (393), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 119/457 (26%), Positives = 213/457 (46%), Gaps = 63/457 (13%)

Query: 30   TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
            +LST  F++   +V    R  L+ FF +WV   G P+ R+   +NK++ +VE+ + R C 
Sbjct: 625  SLSTAHFQYVCERVN---RNKLEPFFQQWVYGSGVPIFRVTQRFNKKRMMVEMGI-RQCQ 680

Query: 90   VKPDSRTPVLSSN---------TDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGD 139
             +   +  V+  +             N++    + G M+IR+HE DG  Y+H +     D
Sbjct: 681  AQELGQDKVVGEDGFCSSSLGYMQHPNKNLTTCFTGSMTIRIHEADGTPYEHIV--ELKD 738

Query: 140  AWQLLEIQCHSKLAARR-----ALKPK-KGSKPDGCDDN-------------GDAVAGL- 179
             +  ++IQ ++K    R     A KP  KG  PD  +D+             G+ ++G  
Sbjct: 739  VFTKIDIQYNTKYRRIRNKRLNATKPTGKGMTPDITEDSFTERVEGGVIQKLGNVLSGPN 798

Query: 180  DMRS----------------SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDV 223
            D R                        WIR D + E++ +++ NQP  M+ +QL++D DV
Sbjct: 799  DCREWNLTEHSVTTDGNEAQRQNEAFEWIRIDSDFEWVCKLYVNQPDYMFTSQLQQDSDV 858

Query: 224  VAQAQAIAALEAL----PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-- 277
             AQ +++   E      P  S    + L   + D + F+ VR+EA  ALA     + +  
Sbjct: 859  EAQVESVRFFEDTIIHSPVSSQIYSSILTRTIMDKRYFYGVRLEACRALARYVVRDFEPN 918

Query: 278  --WAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA 335
                G  HL+K +      +N  +P  NDF +F EYF+ + IP  + +V+    ++P   
Sbjct: 919  NFSGGAKHLIKIFHELFCYQNSNIPLNNDFTNFQEYFLQKNIPQYLTLVKNERKETPAFV 978

Query: 336  VEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF-GQQSILFLSSLLKRIDRLLQFDR 394
             +F+L LL YN+N+ NPY D  +   L+QSV +     +    +L   L+++ R    + 
Sbjct: 979  KQFLLDLLTYNENSENPYDDSHYRCTLIQSVVDCALTNRHDRDYLEKALEQLRRYQNLEE 1038

Query: 395  LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLI 431
             MP+Y  ++  + ++   ++ L + G   ++ + K++
Sbjct: 1039 WMPTYQLLVKNAILK--AKLRLGVEGLYDMNDLNKIL 1073


>gi|350413992|ref|XP_003490174.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
           [Bombus impatiens]
          Length = 1189

 Score =  155 bits (392), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 129/499 (25%), Positives = 227/499 (45%), Gaps = 61/499 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       F   +  +    +  
Sbjct: 447 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 505

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     + F +N+++N VEL + +D T                 N+ G   
Sbjct: 506 FIDQWVRTGGHAKFSLSFVFNRKRNTVELEIRQDTT-----------------NQRGIRK 548

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + + + ELDG + H  L + G   +  +I CHSK    +  K    +        G
Sbjct: 549 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 598

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L  +   QP   W  QL  + DV AQ +AI AL
Sbjct: 599 EEV-DMDLSAMDDSPVLWIRLDPEMTLLRAVQIEQPDYQWQYQLRHERDVTAQLEAIEAL 657

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           +  A P     + +T+ N       +++VR+ AA+ L   A+     WAG   ++  ++ 
Sbjct: 658 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 713

Query: 291 RRFDENIG-LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
                +   + + N+F +F  YF+ + IP A+A +R A    P E + F++ L KYNDN+
Sbjct: 714 LFGSASCRRIIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLFKYNDNS 773

Query: 350 GNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQFDRLMP 397
            N YSD ++ AAL++++G      +   Q + +   SL       L+ + R L  ++L+P
Sbjct: 774 KNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSVDTRAILEEVTRNLNLEKLLP 833

Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFH 457
            Y   ++++C++ +      L  F  L     L + +  +     VR+ A  AL+D    
Sbjct: 834 CYKYTVSVACLKVIR----ILQKFGHLPSNPHLFRAYATYGQFIDVRIAALEALVDFT-R 888

Query: 458 CNGIDSALSLFIKSVEEEP 476
            +G    L   +  +E +P
Sbjct: 889 VDGKWEDLEFLLDMIEMDP 907


>gi|541665|emb|CAA55830.1| dTAF II 150 [Drosophila melanogaster]
          Length = 1213

 Score =  155 bits (392), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 127/502 (25%), Positives = 231/502 (46%), Gaps = 66/502 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           MLE ++G     ++    ++ A  A+  +          +ST  F      +  +    +
Sbjct: 462 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 518

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +WV T G     +   +N+++N +EL + +D                   N+ G 
Sbjct: 519 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDYV-----------------NQRGI 561

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + +++ ELDG + H +        Q+      S +      +  K  K   C  
Sbjct: 562 RKYNGPLMVQLQELDGTFKHTL--------QIESTLVKSDITCHSKSRRNKKKKIPLC-- 611

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
            G+ V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI 
Sbjct: 612 TGEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 670

Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           AL+  P  +  +   L + +   + F++VR EAA++L   A++    W+G   ++  +  
Sbjct: 671 ALQKYPTNATRLA--LTDTIESERCFYQVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 726

Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           R+F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+
Sbjct: 727 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 786

Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
           N+ N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++
Sbjct: 787 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 846

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +PSY  ++++SC++ +     KL  F  L  +  + + + ++     +R+ A   L+D 
Sbjct: 847 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 902

Query: 455 EFHCNGIDSALSLFIKSVEEEP 476
               +G    L   I  +E +P
Sbjct: 903 -VKVDGRSEDLEHLITLLETDP 923


>gi|195326481|ref|XP_002029957.1| GM24832 [Drosophila sechellia]
 gi|194118900|gb|EDW40943.1| GM24832 [Drosophila sechellia]
          Length = 1197

 Score =  155 bits (392), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 127/502 (25%), Positives = 231/502 (46%), Gaps = 66/502 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           MLE ++G     ++    ++ A  A+  +          +ST  F      +  +    +
Sbjct: 440 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 496

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +WV T G     +   +N+++N +EL + +D                   N+ G 
Sbjct: 497 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDYV-----------------NQRGI 539

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + +++ ELDG + H +        Q+      S +      +  K  K   C  
Sbjct: 540 RKYNGPLMVQLQELDGTFKHTL--------QIESTLVKSDITCHSKSRRNKKKKIPLC-- 589

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
            G+ V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI 
Sbjct: 590 TGEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 648

Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           AL+  P  +  +   L + +   + F++VR EAA++L   A++    W+G   ++  +  
Sbjct: 649 ALQKYPTNATRLA--LTDTIESERCFYQVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 704

Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           R+F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+
Sbjct: 705 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 764

Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
           N+ N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++
Sbjct: 765 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 824

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +PSY  ++++SC++ +     KL  F  L  +  + + + ++     +R+ A   L+D 
Sbjct: 825 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 880

Query: 455 EFHCNGIDSALSLFIKSVEEEP 476
               +G    L   I  +E +P
Sbjct: 881 -VKVDGRSEDLEHLITLLETDP 901


>gi|195376663|ref|XP_002047112.1| GJ12109 [Drosophila virilis]
 gi|194154270|gb|EDW69454.1| GJ12109 [Drosophila virilis]
          Length = 1207

 Score =  155 bits (392), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 118/440 (26%), Positives = 212/440 (48%), Gaps = 54/440 (12%)

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     +   +N+++N +EL + +D                   N+ G   
Sbjct: 522 FMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGVRK 564

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + +++ ELDG + H +     +     +I CHSK    +  K    +        G
Sbjct: 565 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLCT--------G 614

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI AL
Sbjct: 615 EEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 673

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
           +  P  +  +   L + +     F++VR +AA++L   A++    W+G   ++  +  R+
Sbjct: 674 QKYPTNATRMA--LTDTIESESCFYKVRCQAAHSLTKVANQMVASWSGPPAMLNIF--RK 729

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
           F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+N+
Sbjct: 730 FFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 789

Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
            N Y+D ++ AALV+++GE         +   Q ++  LS+    +L  + RLL  ++ +
Sbjct: 790 RNHYTDAYYRAALVEALGETLTPVVSVAMHGTQITVDSLSTDAKLVLDEVTRLLNMEKHL 849

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
           PSY  ++++SC++ +     KL  F  L  +  + + + ++     +R+ A   L+D   
Sbjct: 850 PSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIFLDLRIAAMECLVDF-V 904

Query: 457 HCNGIDSALSLFIKSVEEEP 476
             +G    L   I  +E +P
Sbjct: 905 KVDGRSEDLEHLITLLETDP 924


>gi|325091384|gb|EGC44694.1| TATA-binding protein associated factor Taf2 [Ajellomyces capsulatus
            H88]
          Length = 1247

 Score =  155 bits (392), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 142/545 (26%), Positives = 230/545 (42%), Gaps = 85/545 (15%)

Query: 1    MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +LE+++     +  +  IISR       G  P   L+T  F+    ++G+ +   L  FF
Sbjct: 511  ILERRLSKASGKATISRIISRIFLNARMGDLPNGALATSFFQKTCERLGHTK---LDTFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
             +WV   GCP       +NK+K +VE+ + +  + +P  R   +S+   +   E R+   
Sbjct: 568  QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLEMSTFMRDVKEEIRNVYA 627

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 628  GPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687

Query: 167  DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
             G D N            GD +   +                R   ES   WIR D + E
Sbjct: 688  SGIDPNAELQDDVLLYCLGDVLQSEEEIQTWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
            ++ ++    P  M+++QL++D D+VAQ ++I   AA    P +S   V T    L D + 
Sbjct: 747  WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRRY 802

Query: 257  FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
            F+ +R  AA AL   A +E DW GL HL   ++         + R NDF D + Y +   
Sbjct: 803  FYGIRAAAARALVKHAKDEVDWIGLYHLESAFQELFCLPGSPMTRSNDFSDRAAYILQLT 862

Query: 317  IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG---- 372
            IP A+A VR    K+P     F+   LK+NDN+ N  SD +++ AL++S+ E   G    
Sbjct: 863  IPQAIAQVRDNSGKTPMRVKRFLYDKLKFNDNSNNELSDCYYITALMKSICEALLGRRET 922

Query: 373  -----------------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLT 412
                             Q         +  IDR  + D    S+  I +   I C R L 
Sbjct: 923  EIDGVGIDFDVERELERQAEQQLDRDCIAEIDRYRRMDEWTTSFQNIYSRTAIECQRRLM 982

Query: 413  QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSV 472
            Q  +       +D+ +    P+    T   +R++A   L+D +   N     L+ FI  +
Sbjct: 983  QANV-------IDRDIMQFLPYTRAGTYEMLRLDAFNILIDFDIFQNS--ELLTWFIYVM 1033

Query: 473  EEEPS 477
              +PS
Sbjct: 1034 SADPS 1038


>gi|195589193|ref|XP_002084340.1| GD12884 [Drosophila simulans]
 gi|194196349|gb|EDX09925.1| GD12884 [Drosophila simulans]
          Length = 1224

 Score =  155 bits (392), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 127/502 (25%), Positives = 231/502 (46%), Gaps = 66/502 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           MLE ++G     ++    ++ A  A+  +          +ST  F      +  +    +
Sbjct: 467 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 523

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +WV T G     +   +N+++N +EL + +D                   N+ G 
Sbjct: 524 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGI 566

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + +++ ELDG + H +        Q+      S +      +  K  K   C  
Sbjct: 567 RKYNGPLMVQLQELDGTFKHTL--------QIESTLVKSDITCHSKSRRNKKKKIPLC-- 616

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
            G+ V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI 
Sbjct: 617 TGEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 675

Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           AL+  P  +  +   L + +   + F++VR EAA++L   A++    W+G   ++  +  
Sbjct: 676 ALQKYPTNATRLA--LTDTIESERCFYQVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 731

Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           R+F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+
Sbjct: 732 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 791

Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
           N+ N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++
Sbjct: 792 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 851

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +PSY  ++++SC++ +     KL  F  L  +  + + + ++     +R+ A   L+D 
Sbjct: 852 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 907

Query: 455 EFHCNGIDSALSLFIKSVEEEP 476
               +G    L   I  +E +P
Sbjct: 908 -VKVDGRSEDLEHLITLLETDP 928


>gi|194750911|ref|XP_001957773.1| GF23856 [Drosophila ananassae]
 gi|190625055|gb|EDV40579.1| GF23856 [Drosophila ananassae]
          Length = 1221

 Score =  155 bits (391), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 118/440 (26%), Positives = 211/440 (47%), Gaps = 54/440 (12%)

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     +   +N+++N +EL + +D                   N+ G   
Sbjct: 521 FMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGVRK 563

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + +++ ELDG + H +     +     +I CHSK    +  K    +        G
Sbjct: 564 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLCT--------G 613

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  + ++   QP   W  QL  + DV AQ QAI AL
Sbjct: 614 EEV-DMDLSAMDDSPVLWIRLDPEMILMRDLIIEQPDFQWQYQLRHEKDVTAQFQAIQAL 672

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
           +  P  +  +   L + +   + F++VR EAA++L   A++    W+G   ++  +  R+
Sbjct: 673 QKYPTNATRMA--LTDTIESERCFYKVRCEAAHSLTKVANQMVASWSGPPAMLNIF--RK 728

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
           F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+N+
Sbjct: 729 FFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPTEVMRFLFDLFKYNENS 788

Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
            N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++ +
Sbjct: 789 RNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEKHL 848

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
           PSY  ++++SC++ +     KL  F  L  +  + + +  +     +R+ A   L+D   
Sbjct: 849 PSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAAYGIFLDLRIAAMECLVDF-V 903

Query: 457 HCNGIDSALSLFIKSVEEEP 476
             +G    L   I  +E +P
Sbjct: 904 KVDGRSEDLEHLITLLETDP 923


>gi|407929393|gb|EKG22223.1| hypothetical protein MPH_00402 [Macrophomina phaseolina MS6]
          Length = 1499

 Score =  155 bits (391), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 155/586 (26%), Positives = 257/586 (43%), Gaps = 78/586 (13%)

Query: 4    KQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTC 62
            K+M      KI+  ++  A+ G      +ST++F    +K   LE   + +F+ +WV   
Sbjct: 530  KRMAKMGVTKIIYKVLLAAKTGKLQNGAISTEDFIQIVDKY--LESKPIPDFWRQWVYGA 587

Query: 63   GCPVLRMGFSYNKRKNIVELAVLR------DCTVKPDSRTPVLSSNTDSENRDGDIG--- 113
            GCP   +   +NK+K+++E A+ +      +   K     P       +E++ G      
Sbjct: 588  GCPHFYVTQRFNKKKSVIEFAIRQTHGDRSELEEKAQDLDPCTFLKEVNEHQSGMSKKKP 647

Query: 114  ---WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGSKPD 167
               + G M++ VHE DG  YDH I+P+  D     +   ++K     R   + ++ +   
Sbjct: 648  TRVFTGAMTVTVHEADGTPYDH-IIPI-NDMTTRFDAPYNTKYTRLKRNRRQKERAAAAQ 705

Query: 168  GCDDNGDAVAG-----------LDMRSSMES-----------------PLSWIRADPEME 199
            G D NGD               LD    ++                     WIR D + E
Sbjct: 706  GIDLNGDGELPDEIPLYCLGDVLDTEKEIQEWNFDEWTKEEEETMSNEQFEWIRVDADFE 765

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
            ++A+I F+QP+ M + QL++D DVVAQA+A+ +L A    S  +   L   + D + F  
Sbjct: 766  WIAKITFSQPLWMLVAQLQQDPDVVAQAEALQSLSAQ-RPSKLISTYLTRTVMDRRYFHG 824

Query: 260  VRIEAAYALANTASEETD-----WAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVL 314
            +R  AA +LA  A    D     W GL HL K ++      +  + R NDF D S+Y V 
Sbjct: 825  IRTMAAESLAKCAIPRIDDVDLNWIGLSHLEKVFQELFCLPDSQMVRSNDFSDRSQYLVQ 884

Query: 315  EAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL----- 369
             AIP A+A VR    K    A +++L  LK+NDN+ N YSD +++A L+ ++ +      
Sbjct: 885  CAIPRAIAKVREPSGKGLMRAKKWILDKLKFNDNSHNEYSDCYYVATLLSALAQQLAYKD 944

Query: 370  ---------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSG 420
                     E  Q  I F    L  I+R  + D  +PSY  ++T++ ++   Q  L  +G
Sbjct: 945  VSTISFDFDEDEQDEIDFQRDALNEIERHRRLDEWIPSYQNVVTVTALQ--CQGLLMENG 1002

Query: 421  FI--SLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSL 478
             I   +   ++  KP         VR++A   L+DL    +       +F  S +  P +
Sbjct: 1003 IIPRKILDFLQYAKP----GAAESVRLKALECLVDLNMMKHPAILRYLIFTLSYDPSPYM 1058

Query: 479  RGQVK--LGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAF 522
            R  +   +G    RI       D++++V    ++      ESR A 
Sbjct: 1059 RENLYYLIGRGLARIAIGDKKEDTSNQVFGGLVIEQETSTESRAAM 1104


>gi|195493074|ref|XP_002094263.1| GE20292 [Drosophila yakuba]
 gi|194180364|gb|EDW93975.1| GE20292 [Drosophila yakuba]
          Length = 1219

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 128/502 (25%), Positives = 233/502 (46%), Gaps = 66/502 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           MLE ++G     ++    ++ A  A+  +          +ST  F      +  +    +
Sbjct: 460 MLENRIGQELLIQVFNKQLALASSAATTKIGAGLWSQLLISTNIF---IKAIFTVTGKDM 516

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +WV T G     +   +N+++N +EL + +D                   N+ G 
Sbjct: 517 SVFMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGI 559

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + +++ ELDG + H +     +     +I CHSK    +  K    +       
Sbjct: 560 RKYNGPLMVQLQELDGTFKHTL--QIENTLVKSDITCHSKSRRNKKKKIPLCT------- 610

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
            G+ V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI 
Sbjct: 611 -GEEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQ 668

Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           AL+  P  +  +   L + +   + F++VR EAA++L   A++    W+G   ++  +  
Sbjct: 669 ALQKYPTNATRLA--LTDTIESERCFYKVRCEAAHSLTKVANQMVASWSGPPAMLNIF-- 724

Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           R+F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+
Sbjct: 725 RKFFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNE 784

Query: 348 NNGNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDR 394
           N+ N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++
Sbjct: 785 NSRNHYTDAYYRAALVEALGETLTPVVSVAIHGTQITTDSLSTDAKLVLDEVTRLLNMEK 844

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +PSY  ++++SC++ +     KL  F  L  +  + + + ++     +R+ A   L+D 
Sbjct: 845 HLPSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIYLDLRIAAMECLVDF 900

Query: 455 EFHCNGIDSALSLFIKSVEEEP 476
               +G    L   I  +E +P
Sbjct: 901 -VKVDGRIEDLEHLITLLETDP 921


>gi|156053606|ref|XP_001592729.1| hypothetical protein SS1G_05650 [Sclerotinia sclerotiorum 1980]
 gi|154703431|gb|EDO03170.1| hypothetical protein SS1G_05650 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1524

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 150/543 (27%), Positives = 246/543 (45%), Gaps = 81/543 (14%)

Query: 5    QMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGC 64
            Q G+   R I + I++         T+ST+ FR    K+          FF +WV   GC
Sbjct: 551  QGGTGISRVISKMIVNANASGGADSTISTEGFRKSCEKITKYRD--TDSFFTQWVLGAGC 608

Query: 65   PVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDS------ENRDGDIG----- 113
            P   +   +NK++  VE+ + +    + +S T  L  + D       E+  G I      
Sbjct: 609  PHFTITQKFNKKRLCVEMTITQ---TQANSETE-LKLHKDGFYRSFKESTAGVIAPKAQN 664

Query: 114  -WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSK---LAARRALKPKK--GSKP 166
             + G ++IR+HE DG  Y+H +  M  +    +EI  ++K   L   R +K +   G   
Sbjct: 665  TFTGPLTIRIHEADGTPYEHIV--MIREKTSRIEIPYNTKYKRLKRNRRMKERANAGIAM 722

Query: 167  DGCDDNGDAVAGLDMRSSMESP-----------------------LSWIRADPEMEYLAE 203
            D   +N D      +   ++SP                         WIR D + E++ +
Sbjct: 723  DINAENHDDALIYCLGDVLQSPQDIQEWGLRDWDPATEAKMEAESYEWIRVDADFEWICD 782

Query: 204  IHFN-QPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRI 262
              F      M+++QL++D D+ AQ +++A L+  P     V   L   L D + F+ +R 
Sbjct: 783  KKFTGMEAYMYVSQLQQDRDISAQYESMAYLDNAPPHPL-VATFLIRTLMDRRYFYGIRT 841

Query: 263  EAAYALANTASE-ETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAV 321
             AA +LA  A E +  W GL HL K ++         +PR NDF D   Y++  AIP A+
Sbjct: 842  MAAKSLAKHAVEKDVKWIGLTHLEKAFQEFFCYPGTKMPRSNDFSDKQAYWIECAIPEAM 901

Query: 322  AMVRA-ADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG------------- 367
            + VR+  D+  P+ A +F+L  L++NDN  N YSD F +A L++S+              
Sbjct: 902  SRVRSLYDSLCPKRARQFILDQLRFNDNGNNDYSDNFKVANLLKSLADSLIPGEKGVNND 961

Query: 368  --------ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
                    E+E  QQ   F + +L+ +DR  + D  + SY  I TI+ I    ++ L  +
Sbjct: 962  ALQLDEAEEIEEPQQ---FKNIVLEELDRYSRMDEWIDSYQNIYTITVIE--CKLKLMNA 1016

Query: 420  GFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLR 479
            G I +D  +   +   D ++   VR++AS AL DL F  N   + L +   S +  P +R
Sbjct: 1017 GVIPVD-ALDFAQYLHDGSSP-NVRIKASWALADLGFLKNTAVACLLVTAMSTDPSPWVR 1074

Query: 480  GQV 482
            G++
Sbjct: 1075 GEL 1077


>gi|406865162|gb|EKD18205.1| TATA-binding protein associated factor Taf2 [Marssonina brunnea f.
            sp. 'multigermtubi' MB_m1]
          Length = 1530

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 150/559 (26%), Positives = 249/559 (44%), Gaps = 79/559 (14%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L K  GS    +I+  ++  A  G +    +ST+ FR    KV    +     F+  WV 
Sbjct: 544  LVKASGSAGLVRIISKMVMIANTGDAKESFVSTEGFRKSCEKVTKYRQ--TDSFWKEWVY 601

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD-----GDIG-- 113
              GCP L +   +NK++  VEL + +     P  R        DS  R+      DI   
Sbjct: 602  GSGCPRLSITQKFNKKRLCVELTISQKQDTLPTQR----EMQKDSFLRELKEVVHDIYSL 657

Query: 114  -----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK-- 165
                 + G M+IR+HE DG  Y+H +     +    +EI  ++K    +  + +K  +  
Sbjct: 658  PVQPVFTGPMTIRIHEADGTPYEHIV--EIRELHSKIEIPYNTKYKRLKRTRKQKEQQNA 715

Query: 166  -------PDGCDDN-----GDAVAGLDM--------------RSSMESPLSWIRADPEME 199
                    +G +D      GD + G DM              +  +ES   WIR D + E
Sbjct: 716  MANADPNAEGGEDTLIYCLGDTIQGPDMEEWGVIEWDAENLAKMELES-YEWIRVDADFE 774

Query: 200  YLAEIHFNQ-PVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
            ++ E  F+Q P  M+++QL++D DVVAQ  ++  L+ +      V   L   L D++ F 
Sbjct: 775  WICEKQFSQMPAYMYVSQLQQDRDVVAQQDSMLYLKNMQAHKL-VATFLTRTLMDTRYFH 833

Query: 259  RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
             +R  AA AL   AS    W G+  L K Y           PRPNDF D   Y V  +I 
Sbjct: 834  GIRSMAAEALNIHASPAVQWVGMKQLEKAYVEFYAYAKDKTPRPNDFSDKRAYTVEMSII 893

Query: 319  HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL--------- 369
             ++A +R ++   P+++  F++ +L++NDN+ N +SD F +A L+ ++ +          
Sbjct: 894  KSLAKIRDSNGNCPKQSRLFLMDILRFNDNSNNQFSDNFKIATLLSALADSLIPLEDADN 953

Query: 370  ------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTIS---CIRTLTQIALKLSG 420
                  E   + + F   +L+ +DR  + D  + SY    T++   C + L Q     + 
Sbjct: 954  LDIYDDEEDMEPVEFKKQVLEELDRYARMDEWINSYQNCFTVTVLDCKQRLMQ-----AK 1008

Query: 421  FISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRG 480
             I  D  V+ ++   D  + + VR++A R+L+DL F  N   ++L L + S +  P  R 
Sbjct: 1009 VIPTD-AVEFVQYLHDGTSEF-VRIKAFRSLIDLGFLTNNAVASLLLNVASTDPSPYTRR 1066

Query: 481  QVKLGIHAMRICQIKGGSD 499
            ++   I  + I  I  G D
Sbjct: 1067 KL-FEIFCLGIASIAFGED 1084


>gi|380013884|ref|XP_003690975.1| PREDICTED: LOW QUALITY PROTEIN: transcription initiation factor
           TFIID subunit 2-like [Apis florea]
          Length = 1181

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 131/505 (25%), Positives = 228/505 (45%), Gaps = 73/505 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       F   +  +    +  
Sbjct: 439 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 497

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     + F +N+++N VEL + ++ T                 N+ G   
Sbjct: 498 FIDQWVRTGGHAKFSLSFIFNRKRNTVELEIRQETT-----------------NQRGIRK 540

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + + + ELDG + H  L + G   +  +I CHSK    +  K    +        G
Sbjct: 541 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 590

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L  +   QP   W  QL  + DV AQ +AI AL
Sbjct: 591 EEV-DMDLSAMDDSPVLWIRLDPEMTLLRAVQVEQPDYQWQYQLRHERDVTAQLEAIEAL 649

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYK- 289
           +  A P     + +T+ N       +++VR+ AA+ L   A+     WAG   ++  ++ 
Sbjct: 650 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 705

Query: 290 ------SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
                  RR      + + N+F +F  YF+ + IP A+A +R A    P E + F++ L 
Sbjct: 706 LFGSASCRR------IIKQNNFTNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLF 759

Query: 344 KYNDNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQ 391
           KYNDN+ N YSD ++ AAL++++G      +   Q + +   SL       L+ + R L 
Sbjct: 760 KYNDNSKNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSVDTRAILEEVTRNLN 819

Query: 392 FDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
            ++L+P Y   +++SC++ +      L  F  L     L + +  +     VR+ A  AL
Sbjct: 820 LEKLLPCYKYTVSVSCLKVIR----ILQKFGHLPSNPHLFRAYAAYGQFIDVRIAALEAL 875

Query: 452 LDLEFHCNGIDSALSLFIKSVEEEP 476
           +D     +G    L   +  +E +P
Sbjct: 876 VDFT-RVDGKWEDLEFLLDMIEMDP 899


>gi|328787388|ref|XP_393397.4| PREDICTED: transcription initiation factor TFIID subunit 2, partial
           [Apis mellifera]
          Length = 1185

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 131/505 (25%), Positives = 228/505 (45%), Gaps = 73/505 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       F   +  +    +  
Sbjct: 443 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 501

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     + F +N+++N VEL + ++ T                 N+ G   
Sbjct: 502 FIDQWVRTGGHAKFSLSFIFNRKRNTVELEIRQETT-----------------NQRGIRK 544

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + + + ELDG + H  L + G   +  +I CHSK    +  K    +        G
Sbjct: 545 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 594

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L  +   QP   W  QL  + DV AQ +AI AL
Sbjct: 595 EEV-DMDLSAMDDSPVLWIRLDPEMTLLRAVQVEQPDYQWQYQLRHERDVTAQLEAIEAL 653

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYK- 289
           +  A P     + +T+ N       +++VR+ AA+ L   A+     WAG   ++  ++ 
Sbjct: 654 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 709

Query: 290 ------SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
                  RR      + + N+F +F  YF+ + IP A+A +R A    P E + F++ L 
Sbjct: 710 LFGSASCRR------IIKQNNFTNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLF 763

Query: 344 KYNDNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQ 391
           KYNDN+ N YSD ++ AAL++++G      +   Q + +   SL       L+ + R L 
Sbjct: 764 KYNDNSKNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSVDTRAILEEVTRNLN 823

Query: 392 FDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
            ++L+P Y   +++SC++ +      L  F  L     L + +  +     VR+ A  AL
Sbjct: 824 LEKLLPCYKYTVSVSCLKVIR----ILQKFGHLPSNPHLFRAYAAYGQFIDVRIAALEAL 879

Query: 452 LDLEFHCNGIDSALSLFIKSVEEEP 476
           +D     +G    L   +  +E +P
Sbjct: 880 VDFT-RVDGKWEDLEFLLDMIEMDP 903


>gi|383856938|ref|XP_003703963.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
           [Megachile rotundata]
          Length = 1199

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 127/499 (25%), Positives = 227/499 (45%), Gaps = 61/499 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       F   +  +    +  
Sbjct: 457 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 515

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     + F +N+++N VEL + +D T                 N+ G   
Sbjct: 516 FIDQWVRTGGHAKFSLSFVFNRKRNTVELEIRQDTT-----------------NQRGIRK 558

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + + + ELDG + H  L + G   +  +I CHSK    +  K    +        G
Sbjct: 559 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 608

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DP+M  +  +   QP   W  QL  + DV AQ +AI AL
Sbjct: 609 EEV-DMDLSAMDDSPVLWIRLDPDMTLMRAVQIEQPDYQWQYQLRHERDVTAQLEAIEAL 667

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           +  A P     + +T+ N       +++VR+ AA+ L   A+     WAG   ++  ++ 
Sbjct: 668 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 723

Query: 291 RRFDENIG-LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
                +   + + N+F +F  YF+ + IP A+A +R A    P E + F++ L KYNDN+
Sbjct: 724 LFGSASCRRIIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLFKYNDNS 783

Query: 350 GNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQFDRLMP 397
            N YSD ++ AAL++++G      +   Q + +   SL       L+ + R L  ++L+P
Sbjct: 784 KNRYSDNYYRAALIEALGATVTPVISIQQGTAITAESLSVDTKAILEEVTRNLNLEKLLP 843

Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFH 457
            Y   ++++C++ +      L  F  L     L + +  +     VR+ A  AL+D    
Sbjct: 844 CYKYTVSVACLKVIR----ILQKFGHLPSNPHLFRAYAAYGQFIDVRIAALEALVDFT-R 898

Query: 458 CNGIDSALSLFIKSVEEEP 476
            +G    L   +  +E +P
Sbjct: 899 VDGKWEDLEFLLDMIEMDP 917


>gi|303323785|ref|XP_003071884.1| hypothetical protein CPC735_074210 [Coccidioides posadasii C735 delta
            SOWgp]
 gi|240111586|gb|EER29739.1| hypothetical protein CPC735_074210 [Coccidioides posadasii C735 delta
            SOWgp]
          Length = 1253

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 140/538 (26%), Positives = 237/538 (44%), Gaps = 98/538 (18%)

Query: 1    MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +LE+++     +  +  IISR       G  P   L+T  F+    + G+++   L  FF
Sbjct: 496  ILERRLSKASGKATVSRIISRIFLNCRMGDIPNSALTTIFFQKLCERFGHVK---LDAFF 552

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTP------VLSSNTDSENRD 109
             +WV   GCP       +NK+K +VE+ + +   V+ +  +P          +   E ++
Sbjct: 553  QQWVYGAGCPRFVATQRFNKKKLVVEMMIRQ---VQSEQSSPRDLEKSTFMRDVKEELQN 609

Query: 110  GDIG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPK 161
               G     + G M+IR+HE DG  Y+H +     +A    +I  ++K     R   + +
Sbjct: 610  IYAGSLQHVFTGSMTIRIHEADGTPYEHIV--EIKEAVTKFDIPYNTKYKRLKRNKRQRE 667

Query: 162  KGSKPDGCDDNGDAVAGL------DMRSSMESPL--------------------SWIRAD 195
            + +   G D N +A   +      D+  S E  L                     WIR D
Sbjct: 668  RAAASSGVDPNSEAQDDVLLYCLGDVLQSEEEILKWRLVEWTKEDEDRMGQESYEWIRMD 727

Query: 196  PEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA-------IAALEALPHLSFNVVNTLN 248
             + E++ ++    P  M+++QL++D DVVAQ +A       +AA   +  + +      +
Sbjct: 728  ADFEWICKMSLTMPGYMYLSQLQQDRDVVAQLEASNSRVGFMAAANCIQSIQYMAAQREH 787

Query: 249  NFLS--------DSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
              +S        DS+ F  +R  AA+AL   A E+ DW GL HL   ++      N  + 
Sbjct: 788  PLISTIFARTLMDSRYFHGIRTAAAHALVKHAKEDVDWIGLFHLETAFQELFCLPNSPMT 847

Query: 301  RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
            R N+F D S Y +  AIP+A+A VR  + + P     F+ + LK+NDN+ N +SD +++A
Sbjct: 848  RSNNFSDRSSYALQIAIPNAIAKVRNRNGRCPMRVKRFLYEKLKFNDNSNNEFSDCYYVA 907

Query: 361  ALVQSVG--------------------ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYN 400
            AL++S+                     ELE   +  L+L  + + I+R  + D    S+ 
Sbjct: 908  ALMRSIANALIGGKESIHTKSKLDMTQELERQAEDQLYLDCIAE-INRYKRMDEWTSSFQ 966

Query: 401  GILT---ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
             + T   + C+R L      +     LD  V    P+    T   VR++A R LLDL+
Sbjct: 967  NLYTRTALDCLRRL------MKEHRVLDFDVAQFLPYTRIGTFDLVRLDAFRILLDLD 1018


>gi|195012688|ref|XP_001983726.1| GH15413 [Drosophila grimshawi]
 gi|193897208|gb|EDV96074.1| GH15413 [Drosophila grimshawi]
          Length = 1207

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 117/440 (26%), Positives = 211/440 (47%), Gaps = 54/440 (12%)

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     +   +N+++N +EL + +D                   N+ G   
Sbjct: 522 FMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGVRK 564

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + +++ ELDG + H +     +     +I CHSK    +  K    +        G
Sbjct: 565 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLCT--------G 614

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI AL
Sbjct: 615 EEV-DMDLSAMGDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 673

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
           +  P  +  +   L + +     F++VR +AA++L   A++    W+G   ++  +  R+
Sbjct: 674 QKYPTNATRLA--LTDTIESESCFYKVRCQAAHSLTKVANQMVASWSGPPAMLNIF--RK 729

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
           F  +   P   + N+F +F  Y++ +AIP A+A +R +    P E + F+  L KYN+N+
Sbjct: 730 FFGSFSAPHIIKLNNFSNFQLYYLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 789

Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
            N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++ +
Sbjct: 790 RNHYTDAYYRAALVEALGETLTPVVSVAMHGTQITTDSLSTDAKLVLDEVTRLLNMEKHL 849

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
           PSY  ++++SC++ +     KL  F  L  +  + + + ++     +R+ A   L+D   
Sbjct: 850 PSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIFLDLRIAAMECLVDF-V 904

Query: 457 HCNGIDSALSLFIKSVEEEP 476
             +G    L   I  +E +P
Sbjct: 905 KVDGRSEDLEHLITLLETDP 924


>gi|260943123|ref|XP_002615860.1| hypothetical protein CLUG_04742 [Clavispora lusitaniae ATCC 42720]
 gi|238851150|gb|EEQ40614.1| hypothetical protein CLUG_04742 [Clavispora lusitaniae ATCC 42720]
          Length = 1698

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 114/438 (26%), Positives = 205/438 (46%), Gaps = 67/438 (15%)

Query: 30   TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
            TLST+ F++   KV    R  L+ FF +WV   G P+  +   +NK+++++E+ + +  +
Sbjct: 707  TLSTQHFQYVCEKVN---RNRLESFFKQWVYGIGTPIFNITQRFNKKRSMIEVTIRQTQS 763

Query: 90   VKPDSRTP--------VLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDA 140
             +P +  P         L+   D+        + G M+IRVHE DG  Y+H +     D+
Sbjct: 764  QQPKNVHPQAETFIEDSLTYLNDTPIYPVQQTFLGPMTIRVHEADGTPYEHIV--DIKDS 821

Query: 141  WQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVA-----GLDMRSSME--------- 186
                ++Q ++K    R LK  K       D+N ++       G  ++S  E         
Sbjct: 822  VVKFDVQYNTKF---RRLKKNK-------DENSESTPIFAKLGDVLQSEQEMQDWNLADW 871

Query: 187  --------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPH 238
                     P  WIR D + E++A  +  QP  M+  QL++D D+ AQ  AI    +   
Sbjct: 872  PKRDEEFLDPFEWIRVDTDFEWIARFNVKQPDYMFGAQLQQDRDIEAQIAAIEYFGSQEK 931

Query: 239  LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG 298
             +     +L   L DS+ F+ VRI AA +LA  +    ++AGL +L++ +K     EN  
Sbjct: 932  PNTIYCTSLLRTLMDSRYFYGVRIAAAKSLAIISKSNNNFAGLGYLIRAFKELYCFENSS 991

Query: 299  LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW 358
            +P  N F DF + F+ +AIP  +  ++  D  +P+     +  +LKYNDN+ N + D ++
Sbjct: 992  VPFSNSFDDFGKLFLQKAIPRFLGSIKDDDGNTPQSIKSLLFNVLKYNDNSNNEFQDCYY 1051

Query: 359  LAALVQSVGE-----------LEFG----------QQSILFLSSLLKRIDRLLQFDRLMP 397
            ++ L+ S+ E           ++F            +   F+  +++   R+ + D  +P
Sbjct: 1052 VSDLIWSLTEALIPANSEQLNVDFHLEREKAEASYSKDAKFIQKVVEEFSRIQKLDGWVP 1111

Query: 398  SYNGILTISCIRTLTQIA 415
            SY  +++ +CI    ++A
Sbjct: 1112 SYQSVVSYACIEQKIRLA 1129


>gi|189241274|ref|XP_974531.2| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
          Length = 1158

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 125/478 (26%), Positives = 219/478 (45%), Gaps = 61/478 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       F   +  +    +  
Sbjct: 447 MLEHRIGQELLLQVFNKQLSLATNAAQ-QKIGSGLWGHMLISTNVFTKAIFTVTGKDMAV 505

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     +   +N+++N +EL + +D T +   R  V               
Sbjct: 506 FIDQWVRTGGHAKFHLTSVFNRKRNTIELEIRQDATTQLGIRKYV--------------- 550

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
             G + + + ELDG + H +L +     +  +I CHSK    +  K    +        G
Sbjct: 551 --GPLLVTLQELDGTFKH-MLQIENTVVKA-DITCHSKSRRNKKKKIPLCT--------G 598

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           D V  +D+ +  +SP+ WIR DPEM  L  +   QP   W  QL  + DV AQ +AI AL
Sbjct: 599 DEV-DMDLSAMDDSPVLWIRLDPEMTLLRSVVIEQPDYQWQYQLRHERDVTAQMEAIYAL 657

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
           E  P  +  +   L + + + + +++VR  AA+ L   A+    +WAG   ++  +  R+
Sbjct: 658 ERYPTPATRLA--LTDTIENEQCYYKVRCRAAHCLTKVANAMVANWAGPPAMLAIF--RK 713

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
              +   P   R N F +F  Y++ + IP A+A +R A    P + + F+L L KYNDN+
Sbjct: 714 LFGSFAAPHIIRQNSFTNFQHYYLQKTIPLAMAGLRNAHGICPPDVIRFLLDLFKYNDNS 773

Query: 350 GNPYSDVFWLAALVQSVGELEFGQQSILF---------LSS----LLKRIDRLLQFDRLM 396
            N YSD ++ AALV+++G       S++          LS+    +L+ + R+L  ++ +
Sbjct: 774 KNRYSDNYYRAALVEALGNTVTPVVSVIHQGAPITAESLSTDTKLILEEVTRMLNLEKAL 833

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
           P Y   ++++C++ +     KL  F  L     + K +  +     VRV A   L+D 
Sbjct: 834 PCYKYTVSVACLKAIR----KLQKFGHLPSRPTIFKSYASYGHFIDVRVAALECLVDF 887


>gi|270013200|gb|EFA09648.1| hypothetical protein TcasGA2_TC011774 [Tribolium castaneum]
          Length = 1169

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 125/478 (26%), Positives = 219/478 (45%), Gaps = 61/478 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       F   +  +    +  
Sbjct: 458 MLEHRIGQELLLQVFNKQLSLATNAAQ-QKIGSGLWGHMLISTNVFTKAIFTVTGKDMAV 516

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     +   +N+++N +EL + +D T +   R  V               
Sbjct: 517 FIDQWVRTGGHAKFHLTSVFNRKRNTIELEIRQDATTQLGIRKYV--------------- 561

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
             G + + + ELDG + H +L +     +  +I CHSK    +  K    +        G
Sbjct: 562 --GPLLVTLQELDGTFKH-MLQIENTVVKA-DITCHSKSRRNKKKKIPLCT--------G 609

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           D V  +D+ +  +SP+ WIR DPEM  L  +   QP   W  QL  + DV AQ +AI AL
Sbjct: 610 DEV-DMDLSAMDDSPVLWIRLDPEMTLLRSVVIEQPDYQWQYQLRHERDVTAQMEAIYAL 668

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
           E  P  +  +   L + + + + +++VR  AA+ L   A+    +WAG   ++  +  R+
Sbjct: 669 ERYPTPATRLA--LTDTIENEQCYYKVRCRAAHCLTKVANAMVANWAGPPAMLAIF--RK 724

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
              +   P   R N F +F  Y++ + IP A+A +R A    P + + F+L L KYNDN+
Sbjct: 725 LFGSFAAPHIIRQNSFTNFQHYYLQKTIPLAMAGLRNAHGICPPDVIRFLLDLFKYNDNS 784

Query: 350 GNPYSDVFWLAALVQSVGELEFGQQSILF---------LSS----LLKRIDRLLQFDRLM 396
            N YSD ++ AALV+++G       S++          LS+    +L+ + R+L  ++ +
Sbjct: 785 KNRYSDNYYRAALVEALGNTVTPVVSVIHQGAPITAESLSTDTKLILEEVTRMLNLEKAL 844

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
           P Y   ++++C++ +     KL  F  L     + K +  +     VRV A   L+D 
Sbjct: 845 PCYKYTVSVACLKAIR----KLQKFGHLPSRPTIFKSYASYGHFIDVRVAALECLVDF 898


>gi|225679290|gb|EEH17574.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 1253

 Score =  152 bits (385), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 149/550 (27%), Positives = 237/550 (43%), Gaps = 95/550 (17%)

Query: 1    MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +LE+++     +  +  IISR       G  P  +L+T   +    ++G+ +   L +FF
Sbjct: 511  ILERRLSKASGKATISRIISRIFLHARMGDLPNGSLATSFLQKICERLGHTK---LDQFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSR---TPVLSSNTDSENRDGDI 112
             +WV   GCP       +NK+K +VE+ + +  + +P  R         +   E ++   
Sbjct: 568  QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLDKLTFMRDVKEEIKNVYA 627

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 628  GAVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687

Query: 167  DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
             G D N            GD +   D                R   ES   WIR D + E
Sbjct: 688  SGLDPNAELQDDVLLYCLGDVLQSEDEIQKWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
            ++ ++    P  M+++QL++D D+VAQ ++I   AA    P +S   + TL     D + 
Sbjct: 747  WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFIRTL----MDRRY 802

Query: 257  FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSEY 311
            F+ +R  AA AL   A E+ DW GL HL        F E   LP     R NDF D S Y
Sbjct: 803  FYGIRAAAAKALVKHAKEDVDWIGLFHL-----ESAFQELFCLPGSPMTRSNDFSDRSSY 857

Query: 312  FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---- 367
             +   IP A+A VR +  K+P     F+   LK+NDN+ +  SD +++AAL++S+     
Sbjct: 858  ILQLTIPQAIAQVRDSTGKTPMRVKRFLYDKLKFNDNSNSEISDCYYVAALMKSICDALL 917

Query: 368  ----------------ELEFGQQSILFLS-SLLKRIDRLLQFDRLMPSYNGIL---TISC 407
                            E+E  +Q+   L    +  IDR  + D    S+  I     I C
Sbjct: 918  GRKENEPRNIDMDFDMEMELERQAEQQLDRDCIAEIDRYRRMDEWTSSFQNIYARTAIGC 977

Query: 408  IRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSL 467
             R   Q     +G I LD +V+ ++  R   T   +R++A   L  +EF    I   L+ 
Sbjct: 978  QRRFMQ-----AGVIELD-LVQFLQYTRA-GTYEMLRLDAFDIL--IEFDIFQIPELLTW 1028

Query: 468  FIKSVEEEPS 477
            FI  +  +PS
Sbjct: 1029 FIYVMSTDPS 1038


>gi|336389953|gb|EGO31096.1| hypothetical protein SERLADRAFT_444675 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1798

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 117/373 (31%), Positives = 176/373 (47%), Gaps = 39/373 (10%)

Query: 191  WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
            WIR D + E++A I F QP  MW++QL++D DVVAQ +AI AL   P  +  V +T    
Sbjct: 703  WIRMDADFEWIAAIAFEQPDFMWVSQLQRDRDVVAQLEAIHALAKQP--TAIVSSTFTKT 760

Query: 251  LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF---DENIGL------PR 301
            +  S  F+R+R EAA AL N A    D+ GL HL K +    +   D N  L      P+
Sbjct: 761  VLVSNYFYRIRCEAALALVNCAIRRLDFLGLFHLFKLFLRNCYEPEDSNQDLFAHTYVPK 820

Query: 302  PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
            PNDF D SEYFV +++ +A++ VR  + KSP    +F++  L+YNDN  NPYSD F++  
Sbjct: 821  PNDFSDLSEYFVRKSLVNAISRVRFENGKSPSVVRQFLIDQLRYNDNTANPYSDAFYICT 880

Query: 362  LVQSV------------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
            ++ +             GEL       E   +    L   +  IDR    DRL+ S + +
Sbjct: 881  IISASACATVSTAPPERGELLPTETRSEQNSEDTSLLKQAISEIDRYRSMDRLITSPHNV 940

Query: 403  LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI- 461
            +TI+ +     + L ++     D  +++  P        QVR+ A   L   +++   I 
Sbjct: 941  VTIAALEFY--LVLTVANLAPSD--LRVFFPLTREGNYTQVRIAAFDGLFLSKWYTPAIM 996

Query: 462  DSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIA 521
              +L+L +   E + +L+    L I        K       EVD +  V    L + R  
Sbjct: 997  KYSLALLVSMGEMKNALKETESLLIEEDGSMPEKAKEVKKSEVDLMIKV----LRKDREI 1052

Query: 522  FNNVFLRHHLFGI 534
              N  LR  L  I
Sbjct: 1053 GKNEVLREFLMPI 1065



 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 63/141 (44%), Gaps = 21/141 (14%)

Query: 1   MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +L++++G +       R + +  +S   G  P   LST  F     KV  +E   L+ F 
Sbjct: 483 ILDRRLGKSGTSLGLSRVLPKVFLSGISGEMPNNALSTHSFLRTCRKVSGVE---LRSFA 539

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
            +W+   GCP      S+N++K  VE+ + ++         P    +  +E     +   
Sbjct: 540 EQWIYGSGCPAFGFSASFNRKKMAVEITMRQEA--------PAFKLHESNEVSKALLKPV 591

Query: 114 --WPGMMSIRVHELDGM-YDH 131
             + G M+IR+HE DG  Y+H
Sbjct: 592 PFFEGQMTIRIHEADGTPYEH 612


>gi|308453099|ref|XP_003089299.1| hypothetical protein CRE_11590 [Caenorhabditis remanei]
 gi|308241221|gb|EFO85173.1| hypothetical protein CRE_11590 [Caenorhabditis remanei]
          Length = 1092

 Score =  152 bits (384), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 143/515 (27%), Positives = 249/515 (48%), Gaps = 75/515 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERP-F 50
           ML+K++G   F ++L  +++     S  +T         +ST+ F    + V   E P F
Sbjct: 494 MLQKRIGLEPFMRVLHRLLTVGLDMSEKKTTPAAWRHLLISTEAFFRSVSSVTGREIPTF 553

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           L++F    V T G     + F +N+++NIVE+ V +D                D+E  +G
Sbjct: 554 LEQF----VRTGGHAAFAVKFDFNRKRNIVEIEVKQD----------------DTEG-NG 592

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDG 168
              + G +S+ V E+DG + H I      A    EI CHSK   ++  K     G + + 
Sbjct: 593 RTQYTGPLSVVVQEVDGAFSHTI--QIDGAISHAEISCHSKGRKQKKKKVPILTGEEIEI 650

Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
              N DA          ESP+ W+R D +   + EI  +QP+  W   L+ + DV+AQ +
Sbjct: 651 DLTNMDA----------ESPILWLRIDNDYLLIREISISQPMFHWEYMLKYERDVIAQME 700

Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA---NTASEETDWAGLLHLV 285
           A+  ++ALP  S +  + + + +S+ + F+R+R  AA+ L    N  SE       + + 
Sbjct: 701 ALERIQALP--SAHSRSVIVDAVSNERFFYRIRCRAAFVLTSVQNRKSEAIAVGTPVLIN 758

Query: 286 KFYKSRRFDENIGLPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQ 341
            F +S     +  +P+PN+F    ++  +YFV++++P A++ +R    +   +   F+L 
Sbjct: 759 MFRESFGCKSSTNIPKPNNFVVTAQNLQQYFVMQSLPQAISRLRKQTGECHEDVQSFLLD 818

Query: 342 LLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-------LSS----LLKRIDRLL 390
           L+K+NDN+ N YSD F+ A+L  S+       + + F       LSS    L+K     L
Sbjct: 819 LIKFNDNSTNRYSDDFYRASLFNSLAASIHPSEFLPFRVDLPENLSSEFRILIKEFTYAL 878

Query: 391 QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRA 450
             D + PSY  ++ I+ +  L Q  L+  G++ LD   KL+  F   N+  Q+R  A   
Sbjct: 879 NMDTVNPSYGRVVGIAALSGLYQ--LQKCGYLPLDS--KLLWAFAHRNSCVQMRRCAVTL 934

Query: 451 LLD-LEFHCNGIDSA---LSLFIKSV--EEEPSLR 479
           ++D +    + +D+    LS+ +  V  E++PS+R
Sbjct: 935 IIDRIVNDIHAVDTKMYDLSMLLNLVLTEKDPSIR 969


>gi|358057254|dbj|GAA96863.1| hypothetical protein E5Q_03536 [Mixia osmundae IAM 14324]
          Length = 1605

 Score =  152 bits (384), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 118/454 (25%), Positives = 210/454 (46%), Gaps = 71/454 (15%)

Query: 12  RKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLE---RPFLKEFFPRWVGTCGCPVLR 68
           R I + +IS   G     T+ T  F     K+ N     RP+  +    W+   GCP   
Sbjct: 516 RVIPKLLISVMAGELKDNTIGTPGFLRMCRKLANAGVDLRPWANQ----WIYGSGCPHFN 571

Query: 69  MGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM 128
           +  ++N++K I+E+ + +      D+ T   S+   +        + G ++IR+HE DG 
Sbjct: 572 VVINFNRKKMIIEVDITQHLPAANDAATQPWSATAHTNPVQS---FDGALTIRIHEFDGT 628

Query: 129 -YDHPILPMAGDAWQLLEIQCHSKL----AARRALKPKKGSKPDGCDDNGDAVAGLDMRS 183
            YDH +   AG  ++  E+  +++     +  R+  P++    D  DD G  +   + R 
Sbjct: 629 PYDHVLGVKAG--YKRHEVAYNTRFKRIRSGIRSFIPRQH---DANDDPGMGIKASESRF 683

Query: 184 SME--------------------------SPLSWIRADPEMEYLAEIHFNQPVQMWINQL 217
            ++                          +P  WIR D +M+++  + + QP   WI QL
Sbjct: 684 HLQLWEDAAQRTRWKVADYTEEEVEAIANTPYEWIRIDADMDWICSVAYVQPEYAWIAQL 743

Query: 218 EKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD 277
           E+D DV+AQ  A+  L A P  S    ++L   L     F+R+RIEA YA+   A++   
Sbjct: 744 ERDKDVIAQLDAVHGLLARP--SLMAASSLAKTLLVETYFFRIRIEAIYAMLGCATDALQ 801

Query: 278 WAGLLHLVK-FYK---------SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAA 327
             GL +L++ FY+             +E + +P PNDF +  +Y + +A   AV++VR A
Sbjct: 802 QRGLFYLLRLFYQFFDTGKTPFPADHNEYVCVPSPNDFSNVPDYLIKKAFISAVSLVRDA 861

Query: 328 DNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSIL----FL---- 379
              SP  A   ++ LL YNDN+ N + D F ++ ++  +       ++      F+    
Sbjct: 862 SGLSPHVAKIMLIDLLAYNDNSRNAFDDSFMISEVISDLAVALLADKTDTSDANFVPTPA 921

Query: 380 -SSLLKR----IDRLLQFDRLMPSYNGILTISCI 408
            ++L++R    ++R    DRL+PSY  ++T++ +
Sbjct: 922 DTALVQRASDEVERYQSLDRLVPSYQNLVTVAAL 955


>gi|195126212|ref|XP_002007568.1| GI13016 [Drosophila mojavensis]
 gi|193919177|gb|EDW18044.1| GI13016 [Drosophila mojavensis]
          Length = 1212

 Score =  152 bits (384), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 113/418 (27%), Positives = 203/418 (48%), Gaps = 53/418 (12%)

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     +   +N+++N +EL + +D                   N+ G   
Sbjct: 522 FMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGVRK 564

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + +++ ELDG + H +     +     +I CHSK    +  K    +        G
Sbjct: 565 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLCT--------G 614

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI AL
Sbjct: 615 EEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 673

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
           +  P  +  +   L + +     F++VR +AA++L   A++    W+G   ++  +  R+
Sbjct: 674 QKYPTNATRLA--LTDTIESDHCFYKVRCQAAHSLTKVANQMVASWSGPPAMLNIF--RK 729

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
           F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+N+
Sbjct: 730 FFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 789

Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
            N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++ +
Sbjct: 790 RNHYTDAYYRAALVEALGETLTPVVSVAMHGTQITTDSLSTDAKLVLDEVTRLLNMEKHL 849

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
           PSY  ++++SC++ +     KL  F  L  +  + + + ++     +R+ A   L+D 
Sbjct: 850 PSYKYMVSVSCLKVIR----KLQKFGHLPSLPHIYRSYAEYGIFLDLRIAAMECLVDF 903


>gi|195428287|ref|XP_002062205.1| GK17418 [Drosophila willistoni]
 gi|194158290|gb|EDW73191.1| GK17418 [Drosophila willistoni]
          Length = 1224

 Score =  152 bits (384), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 131/503 (26%), Positives = 237/503 (47%), Gaps = 62/503 (12%)

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     +   +N+++N +EL + +D                   N+ G   
Sbjct: 521 FMDQWVRTGGHAKFSLTSVFNRKRNTIELEIRQDFV-----------------NQRGVRK 563

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + +++ ELDG + H +     +     +I CHSK    +  K    +        G
Sbjct: 564 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLCT--------G 613

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI AL
Sbjct: 614 EEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 672

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYKSRR 292
           +  P  +  +   L + +   + F++VR +AA++L   A++    W+G   ++  +  R+
Sbjct: 673 QKYPTNATRMA--LTDTIESERCFYKVRCQAAHSLTKVANQMVAAWSGPPAMLNIF--RK 728

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
           F  +   P   + N+F +F  YF+ ++IP A+A +R +    P E + F+  L KYN+N+
Sbjct: 729 FFGSFSAPHIIKLNNFSNFQLYFLQKSIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 788

Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
            N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++ +
Sbjct: 789 RNHYTDAYYRAALVEALGETLTPVVSVAMHGTQITTESLSTDAKLVLDEVTRLLNMEKHL 848

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
           PSY  ++++SC++ + ++  KL    SL  +    + +  +     +R+ A   L+D   
Sbjct: 849 PSYKYMVSVSCLKVIRKLQ-KLGHLPSLPHIY---RSYASYGIFLDLRIAAMECLVDF-V 903

Query: 457 HCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALL--- 513
             +G    L   I  +E +P      + G+  + I       +S   +D   LV  L   
Sbjct: 904 KVDGRSDDLEDLITLLETDPD--PAARHGLAQLLIDNPPFTRESRSRLDKPNLVERLWYS 961

Query: 514 -NLL--ESRIAFNNVFLRHHLFG 533
            N L  ++++  + V L H L+G
Sbjct: 962 INTLAYDTKLRCDTVDLYHSLYG 984


>gi|308492512|ref|XP_003108446.1| CRE-TAF-2 protein [Caenorhabditis remanei]
 gi|308248186|gb|EFO92138.1| CRE-TAF-2 protein [Caenorhabditis remanei]
          Length = 1092

 Score =  152 bits (384), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 143/515 (27%), Positives = 249/515 (48%), Gaps = 75/515 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERP-F 50
           ML+K++G   F ++L  +++     S  +T         +ST+ F    + V   E P F
Sbjct: 494 MLQKRIGLEPFMRVLHRLLTVGLDMSEKKTTPAAWRHLLISTEAFFRSVSSVTGREIPTF 553

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           L++F    V T G     + F +N+++NIVE+ V +D                D+E  +G
Sbjct: 554 LEQF----VRTGGHAAFAVKFDFNRKRNIVEIEVKQD----------------DTEG-NG 592

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDG 168
              + G +S+ V E+DG + H I      A    EI CHSK   ++  K     G + + 
Sbjct: 593 RTQYTGPLSVVVQEVDGAFSHTI--QIDGAISHAEISCHSKGRKQKKKKVPILTGEEIEI 650

Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
              N DA          ESP+ W+R D +   + EI  +QP+  W   L+ + DV+AQ +
Sbjct: 651 DLTNMDA----------ESPILWLRIDNDYLLIREISISQPMFHWEYMLKYERDVIAQME 700

Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA---NTASEETDWAGLLHLV 285
           A+  ++ALP  S +  + + + +S+ + F+R+R  AA+ L    N  SE       + + 
Sbjct: 701 ALERIQALP--SAHSRSVIVDAVSNERFFYRIRCRAAFVLTSVQNRKSEAIAVGTPVLIN 758

Query: 286 KFYKSRRFDENIGLPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQ 341
            F +S     +  +P+PN+F    ++  +YFV++++P A++ +R    +   +   F+L 
Sbjct: 759 MFRESFGCKSSTNIPKPNNFVVTAQNLQQYFVMQSLPQAISRLRKQTGECHEDVQSFLLD 818

Query: 342 LLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-------LSS----LLKRIDRLL 390
           L+K+NDN+ N YSD F+ A+L  S+       + + F       LSS    L+K     L
Sbjct: 819 LIKFNDNSTNRYSDDFYRASLFNSLAASIHPSEFLPFRVDLPENLSSEFRVLIKEFTYAL 878

Query: 391 QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRA 450
             D + PSY  ++ I+ +  L Q  L+  G++ LD   KL+  F   N+  Q+R  A   
Sbjct: 879 NMDTVNPSYGRVVGIAALSGLYQ--LQKCGYLPLDS--KLLWAFAHRNSCVQMRRCAVTL 934

Query: 451 LLD-LEFHCNGIDSA---LSLFIKSV--EEEPSLR 479
           ++D +    + +D+    LS+ +  V  E++PS+R
Sbjct: 935 IIDRIVNDIHAVDTKMYDLSMLLNLVLTEKDPSIR 969


>gi|336376907|gb|EGO05242.1| hypothetical protein SERLA73DRAFT_82815 [Serpula lacrymans var.
            lacrymans S7.3]
          Length = 1798

 Score =  152 bits (383), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 117/373 (31%), Positives = 175/373 (46%), Gaps = 39/373 (10%)

Query: 191  WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
            WIR D + E++A I F QP  MW++QL++D DVVAQ +AI AL   P  +  V +T    
Sbjct: 703  WIRMDADFEWIAAIAFEQPDFMWVSQLQRDRDVVAQLEAIHALAKQP--TAIVSSTFTKT 760

Query: 251  LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF---DENIGL------PR 301
            +  S  F+R+R EAA AL N A    D+ GL HL K +    +   D N  L      P+
Sbjct: 761  VLVSNYFYRIRCEAALALVNCAIRRLDFLGLFHLFKLFLRNCYEPEDSNQDLFAHTYVPK 820

Query: 302  PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
            PNDF D SEYFV +++ +A++ VR  + KSP    +F++  L+YNDN  NPYSD F++  
Sbjct: 821  PNDFSDLSEYFVRKSLVNAISRVRFENGKSPSVVRQFLIDQLRYNDNTANPYSDAFYICT 880

Query: 362  LVQSV------------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
            ++ +             GEL       E   +    L   +  IDR    DRL+ S + +
Sbjct: 881  IISASACATVSTAPPERGELLPTETRSEQNSEDTSLLKQAISEIDRYRSMDRLITSPHNV 940

Query: 403  LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI- 461
            +TI+ +     + L ++     D  +++  P        QVR+ A   L   +++   I 
Sbjct: 941  VTIAALEFY--LVLTVANLAPSD--LRVFFPLTREGNYTQVRIAAFDGLFLSKWYTPAIM 996

Query: 462  DSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIA 521
               L+L +   E + +L+    L I        K       EVD +  V    L + R  
Sbjct: 997  KYVLALLVSMGEMKNALKETESLLIEEDGSMPEKAKEVKKSEVDLMIKV----LRKDREI 1052

Query: 522  FNNVFLRHHLFGI 534
              N  LR  L  I
Sbjct: 1053 GKNEVLREFLMPI 1065



 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 63/141 (44%), Gaps = 21/141 (14%)

Query: 1   MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +L++++G +       R + +  +S   G  P   LST  F     KV  +E   L+ F 
Sbjct: 483 ILDRRLGKSGTSLGLSRVLPKVFLSGISGEMPNNALSTHSFLRTCRKVSGVE---LRSFA 539

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
            +W+   GCP      S+N++K  VE+ + ++         P    +  +E     +   
Sbjct: 540 EQWIYGSGCPAFGFSASFNRKKMAVEITMRQEA--------PAFKLHESNEVSKALLKPV 591

Query: 114 --WPGMMSIRVHELDGM-YDH 131
             + G M+IR+HE DG  Y+H
Sbjct: 592 PFFEGQMTIRIHEADGTPYEH 612


>gi|358372687|dbj|GAA89289.1| transcription initiation factor TFIID subunit TSM1/127kD [Aspergillus
            kawachii IFO 4308]
          Length = 1273

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 134/525 (25%), Positives = 231/525 (44%), Gaps = 88/525 (16%)

Query: 1    MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +L++++     +  +  IISR       G  P   ++T  F+    ++G+ +   L  FF
Sbjct: 511  ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAVTTSLFQRTCERLGHAK---LDSFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
             +WV   GCP  +    +NK+K +VE+ + +   D +   D        +   E R    
Sbjct: 568  QQWVYGAGCPRFQATQRFNKKKLVVEMMIRQVQADASSTRDLDKNAFMRDVKEEIRGVYA 627

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 628  GVIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAV 687

Query: 167  DGCDDNGDA------------------VAGLDMRS--------SMESPLSWIRADPEMEY 200
             G D N +A                  VA   +R                WIR D + E+
Sbjct: 688  TGGDPNAEAQEDVLLYCLGDVLQTEEEVAEWKLRDWSEEEEERMSRESYEWIRMDADFEW 747

Query: 201  LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFN-----VVNTLNNFLSDSK 255
            + ++    P  M+++QL++D DVVAQ      LE+L +++       +       L D +
Sbjct: 748  ICKLSLAMPGYMYLSQLQQDRDVVAQ------LESLQYMAIQREHELISTIFVRTLMDRR 801

Query: 256  AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
             F+ +R+ AA AL   A+E  +W GL HL      R F E   LP     R NDF D + 
Sbjct: 802  YFYGIRVAAARALVKHATEAVEWVGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 856

Query: 311  YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL- 369
            YF+  AIP +++ VR ++ K+P     F+   LK+NDN+ N YSD F++A L++S+    
Sbjct: 857  YFIQLAIPESISKVRDSNGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMESLCHAM 916

Query: 370  ------------EFGQQSILFLSS-------LLKRIDRLLQFDRLMPSYNGILTISCIRT 410
                        +F  + +L   +        +  IDR  + D    S+  I + + +R 
Sbjct: 917  LGKPERPSDEMDDFDMERVLAAQAEEQLEKDAIAEIDRYRRMDEWSSSFQNIYSRAALR- 975

Query: 411  LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
              Q+ L  +  +SLD +++ + P+    T   +R++A   L++L+
Sbjct: 976  -CQVRLMEAKILSLD-IMQFL-PYTRAGTYDLLRLDAFECLIELD 1017


>gi|357612256|gb|EHJ67886.1| hypothetical protein KGM_13825 [Danaus plexippus]
          Length = 1189

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 144/575 (25%), Positives = 260/575 (45%), Gaps = 73/575 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE+++G     ++    +S A  A+  + + +  + H       F   +  +    +  
Sbjct: 446 MLEQRIGQELLLQVFNKQLSLATNAANTK-IGSGLWGHLLLSTNLFVKAIFTVTGKDMAV 504

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G    ++   +N+++N VEL + +D                 S +  G   
Sbjct: 505 FVDQWVRTGGHAKFQLTSVFNRKRNTVELEIRQD-----------------SVHERGIRK 547

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + +++ ELDG + H +     +     +I CHSK    +  K    +        G
Sbjct: 548 YVGPLLVQLQELDGTFKHTL--QIENTVVKADITCHSKSRRNKKKKIPLCT--------G 597

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L     +QP   W  QL  + DV AQ++AI AL
Sbjct: 598 EEV-DMDLSAMDDSPVLWIRLDPEMSLLRSTVISQPDYQWQYQLRHERDVTAQSEAIDAL 656

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKSRR 292
              P  +      L + + + +  +++R  AA+ L   A+   + WAG   ++  +  R+
Sbjct: 657 HNYPEPATR--KALTDIIENEQTHYKIRCRAAHCLTKVANAMISSWAGPPAMLTIF--RK 712

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
              +   P   + N+F +   YF+ + IP A+A +R      P E V F+L L KYNDN+
Sbjct: 713 MFGSFAAPHIIKQNNFDNLQHYFLQKTIPVAMAGLRNIHGICPPEVVRFLLDLFKYNDNS 772

Query: 350 GNPYSDVFWLAALVQSVGELEFGQQSIL---------FLSS----LLKRIDRLLQFDRLM 396
            N +SD ++ AALV ++        S+L          LS+    +L+ I R+L  ++++
Sbjct: 773 KNHFSDNYYKAALVDALAATITPVISVLQPGAPITAESLSADTRLVLEEITRVLNLEKVL 832

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
           P Y   +T+SC+R + +  L+  G   L  +  + + +  +     VR+ A   L+D   
Sbjct: 833 PCYKNTVTVSCLRAIRR--LQQCG--HLPSIPTVFRAYAQYGQYIDVRLAAFEGLVDF-V 887

Query: 457 HCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVA-LLNL 515
             +G    LS  + ++E +P     V+ G+  + +          H +DT ++V  L N 
Sbjct: 888 RVDGKPEDLSYLLTAIENDPD--PGVRHGLARLMVSMPPFERAQRHRLDTESVVHRLWNN 945

Query: 516 LESRIAFNNVFLRHHLFGILQILAGRAPTLYGVPR 550
           + S+++ N+  LR  L  +         TLYG+ R
Sbjct: 946 INSQLS-NDARLRCDLVDLYY-------TLYGLKR 972


>gi|449541548|gb|EMD32531.1| hypothetical protein CERSUDRAFT_118583 [Ceriporiopsis subvermispora
           B]
          Length = 1819

 Score =  152 bits (383), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 154/300 (51%), Gaps = 36/300 (12%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR D + E++A I F+Q   MW++QL++D DVVAQ +AI+ L  +P+    + +T    
Sbjct: 699 WIRIDADFEWIAAIAFDQKDYMWVSQLQRDRDVVAQYEAISVLSKMPNPI--ISSTFTRT 756

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFY----------KSRRFDENIGLP 300
           +  S  ++R+R EAA AL   A +  +W GL HL K +          K   F+    +P
Sbjct: 757 VLVSNYYFRIRCEAAQALVTCAVQRIEWLGLFHLFKLFLRYCYEPEDPKPDLFNHKY-VP 815

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
           RPNDF DF+EYFV +++  A++ VR  + K+P    +F++  L+YNDN  N +SD  ++ 
Sbjct: 816 RPNDFSDFAEYFVRKSLVKAISRVRFENGKTPPIIRQFLIDQLRYNDNTANLFSDAMYIC 875

Query: 361 ALVQSV------------GELEFGQ-------QSILFLSSLLKRIDRLLQFDRLMPSYNG 401
            ++ ++            GE    +       Q +  L   L  +DR    DRL+PSY+ 
Sbjct: 876 TVISALACATMSTVPPERGEFISAESQPVQDAQDVDLLKQALAEVDRYRSMDRLIPSYHN 935

Query: 402 ILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
           ++T++ I     + L ++GF+  D   ++  P        QVR+ A   L   +++   I
Sbjct: 936 VVTVAVIEFY--LMLSVAGFVPHDP--RMFFPLTREGNYHQVRMAAFDTLFLTKWYTPKI 991


>gi|366999642|ref|XP_003684557.1| hypothetical protein TPHA_0B04540 [Tetrapisispora phaffii CBS 4417]
 gi|357522853|emb|CCE62123.1| hypothetical protein TPHA_0B04540 [Tetrapisispora phaffii CBS 4417]
          Length = 1523

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/454 (27%), Positives = 209/454 (46%), Gaps = 62/454 (13%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            +IL  I  +A  G  P  +LS   F+H   +V    +  L +FF +WV + G P+ R+  
Sbjct: 655  RILPKIFLQAMSGDLPNNSLSAAHFQHVCERVN---KSKLGKFFQQWVYSSGVPIFRVTQ 711

Query: 72   SYNKRKNIVELAVLR---------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRV 122
             +NK++ ++E+ + +            VK    +  L+   DS  R+    + G M+IR+
Sbjct: 712  RFNKKRMVIEMGIRQVQEQKVSGSKNGVKEGFSSRALNYIQDS-TRETIPHFTGSMTIRI 770

Query: 123  HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARR-----ALKPKKGSKPDGCDDNGDAV 176
            HE DG  Y+H +     D +  L+IQ ++K    R     +LK  K  K +  +     V
Sbjct: 771  HEPDGTPYEHIV--EIKDVFTKLDIQYNTKYRRSRTKKQSSLKLDKDGKTELNEQPSQVV 828

Query: 177  -----------AGLDMRSSMES--------------------PLSWIRADPEMEYLAEIH 205
                         L  +S  ES                       WIR D ++E++ E+H
Sbjct: 829  KFPNVNIQKFGTVLSTQSDCESWNLADPSLSSEVLEIQRQNEAFQWIRIDSDLEWICELH 888

Query: 206  FNQPVQMWINQLEKDGDVVAQAQAI----AALEALPHLSFNVVNTLNNFLSDSKAFWRVR 261
             NQ   M+  QL++D D+ AQ Q+I      +E     S    + L   + DS+ F+ VR
Sbjct: 889  INQSDYMFAAQLQQDADIEAQIQSIRYYDDTIEQSQVTSLLYSSILTRTVMDSRYFYGVR 948

Query: 262  IEAAYALANTASEETDW----AGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAI 317
            +EA  AL      +        G  HL+K Y+         +P  NDF D  +Y++ +AI
Sbjct: 949  LEACRALTKYIVRDLPNRVFPGGPRHLIKIYQELFCYSESNIPLNNDFSDIRKYYLQKAI 1008

Query: 318  PHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF-GQQSI 376
            P  ++ V +  ++SP     F+L +L YN+N+GN Y+D++++  L+Q++ E         
Sbjct: 1009 PVYLSEVVSETDESPEFVKNFLLDILTYNENSGNNYNDIYFICYLIQALVECATKSANDR 1068

Query: 377  LFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
             F+S +L+++ R    D+  PSY  +LT + + +
Sbjct: 1069 SFISKVLEQLQRYENLDKWTPSYQLLLTKTILES 1102


>gi|134083644|emb|CAK47036.1| unnamed protein product [Aspergillus niger]
          Length = 1271

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 134/524 (25%), Positives = 227/524 (43%), Gaps = 79/524 (15%)

Query: 1    MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +L++++     +  +  IISR       G  P   ++T  F+    ++G+ +   L  FF
Sbjct: 511  ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAVTTSLFQRTCERLGHAK---LDSFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
             +WV   GCP  +    +NK+K +VE+ + +   D +   D        +   E R    
Sbjct: 568  QQWVYGAGCPRFQATQRFNKKKLVVEMMIRQVQADASSTRDLDKNAFMRDVKEEIRGVYA 627

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 628  GVIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAV 687

Query: 167  DGCDDNGDAVAGL------DMRSSMES--------------------PLSWIRADPEMEY 200
             G D N +    +      D+  S E                        WIR D + E+
Sbjct: 688  TGGDPNAETQEDVLLYCLGDVLQSEEEVAEWRLRDWSEEEEERMSRESYEWIRMDADFEW 747

Query: 201  LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRV 260
            + ++    P  M+++QL++D DVVAQ +    L +L H   + +      L D + F+ +
Sbjct: 748  ICKLSLAMPGYMYLSQLQQDRDVVAQLEVSKFLYSLEHELISTI--FVRTLMDRRYFYGI 805

Query: 261  RIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSEYFVLE 315
            R+ AA AL   A++   W GL HL      R F E   LP     R NDF D + YF+  
Sbjct: 806  RVAAARALVKHATDAVGWVGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAAYFIQL 860

Query: 316  AIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL------ 369
            AIP +++ +R A+ K+P     F+   LK+NDN+ N YSD F++A L++S+         
Sbjct: 861  AIPESISKIRDANGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMESLCHAMLGKPE 920

Query: 370  -------EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIA 415
                   +F  + +L            +  IDR  + D    S+  I + + +R   Q+ 
Sbjct: 921  RPSDEMDDFDMERVLEAQAEEQLEKDAIAEIDRYRRMDEWSSSFQNIYSRAALR--CQVR 978

Query: 416  LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCN 459
            L  +  +SLD +  L  P+    T   +R++A   L++L+   N
Sbjct: 979  LMEAKILSLDIMQFL--PYTRAGTYDLLRLDAFECLIELDIFRN 1020


>gi|154299901|ref|XP_001550368.1| hypothetical protein BC1G_10841 [Botryotinia fuckeliana B05.10]
 gi|347841569|emb|CCD56141.1| similar to aminopeptidase N [Botryotinia fuckeliana]
          Length = 1585

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 145/541 (26%), Positives = 246/541 (45%), Gaps = 74/541 (13%)

Query: 5    QMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGC 64
            Q G+   R I + I++         T+ST+ FR    K+          FF +WV   GC
Sbjct: 551  QGGTGISRVISKMIVNANANGGTDSTISTEGFRKSCEKITKYRD--TDSFFAQWVHGSGC 608

Query: 65   PVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT---DSENRDGDIGWP------ 115
            P   +   +NK++  VE+ + +  T       P L  +      +  DG +  P      
Sbjct: 609  PHFTITQKFNKKRLCVEMTITQ--TQAHLETEPKLHKDGFYRAFKELDGGVSQPKTQSTF 666

Query: 116  -GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSK---LAARRALKPKK--GSKPDG 168
             G ++IR+HE DG  Y+H +  M  +    +EI  ++K   L   R +K +   G+  DG
Sbjct: 667  TGPLTIRIHEADGTPYEHIV--MIREKTSRIEIPYNTKYKRLKRNRRMKERANAGTAIDG 724

Query: 169  CDDNGDAV---------AGLDMR------------SSMES-PLSWIRADPEMEYLAEIHF 206
             +++ DA+         +  DM+            + ME+    WIR D + E++ +  F
Sbjct: 725  LENHDDALIYCLGDVLQSPQDMQEWGLRDWDPITEAKMEAESYEWIRVDADFEWICDKKF 784

Query: 207  N-QPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
                  M+++QL++D D+ AQ +++A L+  P     V   L   L D + F+ +R  AA
Sbjct: 785  TGMEAYMYVSQLQQDRDISAQYESMAYLDNAPPHPL-VATFLIRTLMDRRYFYGIRAMAA 843

Query: 266  YALANTASE-ETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMV 324
             +LA  A E +  W GL HL K Y    +      P+ NDF D   Y++  AI  AV+ +
Sbjct: 844  KSLAKHAVEKDVKWIGLKHLEKAYLEFFYFPGGKTPKANDFSDKQAYWIECAIVEAVSRI 903

Query: 325  RAA-DNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV----------------- 366
            R+  D+  P  A  F+  +L++NDN  NPYSD F +A +++S+                 
Sbjct: 904  RSMYDSTCPSTARRFIFDVLRFNDNGNNPYSDSFKIANILKSLADSLIPGAKGVNNDVLQ 963

Query: 367  ---GELEFG--QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGF 421
                E + G  ++   +   +L  +DR  + D  + SY  I TI+ I    ++ L  +G 
Sbjct: 964  LDDAEEDGGDKKEDPDYRKLILDELDRYSRMDEWIDSYQNIYTITAIE--CKLKLMNAGV 1021

Query: 422  ISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQ 481
            I +D  +   + F D  +  +VR++A+ AL DL F  +   + L +   S +  P +R +
Sbjct: 1022 IQVD-ALDFAQYFHD-GSAQRVRIKAAWALADLGFLKSTAVACLLVTAASTDPSPWVRKE 1079

Query: 482  V 482
            +
Sbjct: 1080 L 1080


>gi|226291005|gb|EEH46433.1| transcription initiation factor TFIId 127kD subunit [Paracoccidioides
            brasiliensis Pb18]
          Length = 1238

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 148/550 (26%), Positives = 237/550 (43%), Gaps = 95/550 (17%)

Query: 1    MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +LE+++     +  +  IISR       G  P  +L+T   +    ++G+ +   L +FF
Sbjct: 511  ILERRLSKASGKATISRIISRIFLHARMGDLPNGSLATSFLQKICERLGHTK---LDQFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSR---TPVLSSNTDSENRDGDI 112
             +WV   GCP       +NK+K +VE+ + +  + +P  R         +   E ++   
Sbjct: 568  QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLDKLTFMRDVKEEIKNVYA 627

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 628  GAVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687

Query: 167  DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
             G D N            GD +   +                R   ES   WIR D + E
Sbjct: 688  SGLDPNAELQDDVLLYCLGDVLQSEEEIQKWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
            ++ ++    P  M+++QL++D D+VAQ ++I   AA    P +S   + T    L D + 
Sbjct: 747  WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFIRT----LMDRRY 802

Query: 257  FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSEY 311
            F+ +R  AA AL   A E+ DW GL HL        F E   LP     R NDF D S Y
Sbjct: 803  FYGIRAAAAKALVKHAKEDVDWIGLFHL-----ESAFQELFCLPGSPMTRSNDFSDRSSY 857

Query: 312  FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---- 367
             +   IP A+A VR +  K+P     F+   LK+NDN+ +  SD +++AAL++S+     
Sbjct: 858  ILQLTIPQAIAQVRDSTGKTPMRVKRFLYDKLKFNDNSNSEISDCYYVAALMKSICDALL 917

Query: 368  ----------------ELEFGQQSILFLS-SLLKRIDRLLQFDRLMPSYNGIL---TISC 407
                            E+E  +Q+   L    +  IDR  + D    S+  I     I C
Sbjct: 918  GRKENEPRNIDMDFDMEMELERQAEQQLDRDCIAEIDRYRRMDEWTSSFQNIYARTAIGC 977

Query: 408  IRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSL 467
             R   Q     +G I LD +V+ ++  R   T   +R++A   L  +EF    I   L+ 
Sbjct: 978  QRRFMQ-----AGVIELD-LVQFLQYTRA-GTYEMLRLDAFDIL--IEFDIFQIPELLTW 1028

Query: 468  FIKSVEEEPS 477
            FI  +  +PS
Sbjct: 1029 FIYVMSTDPS 1038


>gi|70983546|ref|XP_747300.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Aspergillus fumigatus Af293]
 gi|66844926|gb|EAL85262.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Aspergillus fumigatus Af293]
 gi|159123694|gb|EDP48813.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Aspergillus fumigatus A1163]
          Length = 1358

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 139/526 (26%), Positives = 232/526 (44%), Gaps = 90/526 (17%)

Query: 1    MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +L++++     +  +  IISR       G  P   +++ +F+    ++G+ +   L  FF
Sbjct: 508  ILDRRLTKASGKATMSRIISRMFLNARMGDIPNGAITSSQFQKICERLGHAK---LDTFF 564

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
             +WV   GCP  +    +NK+K +VE+ + +    +P +R   L  NT   +   +I   
Sbjct: 565  QQWVYGAGCPRFQATQRFNKKKLVVEMMIKQVQADQPSTRD--LDKNTFMRDVKEEIRGV 622

Query: 114  --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
                    + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +
Sbjct: 623  YAGNIQPVFSGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 682

Query: 165  KPDGCDDNGDAVAGL---------------------------DMRSSMESPLSWIRADPE 197
               G D N +    +                           + R   ES   WIR D +
Sbjct: 683  AVSGGDPNAETQEDVLLYCLGDVLQTEEEMQEWRLADWSKEDEDRMGQES-YEWIRMDAD 741

Query: 198  MEYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDS 254
             E++ ++    P  M+++QL++D DVVAQ    Q +AA    P +S   V T    L D 
Sbjct: 742  FEWICKLSLVMPGYMYLSQLQQDRDVVAQLESLQYMAAQREHPLISTIFVRT----LMDR 797

Query: 255  KAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFS 309
            + F+ +R  AA AL   A EET+W GL HL      R F E   LP     R NDF D +
Sbjct: 798  RYFYGIREAAARALVKHAKEETNWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRA 852

Query: 310  EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS---- 365
             Y +  AIP A++ VR  D K+P     F+   LK+NDN+ N YSD F++A L+QS    
Sbjct: 853  AYALQLAIPEAISKVRDNDGKTPLRVKRFLYDKLKFNDNSNNEYSDHFYVATLMQSLCHA 912

Query: 366  -VGELE--------FGQQSILFLSS-------LLKRIDRLLQFDRLMPSYNGILTISCIR 409
             +G++E        F  + +L + +        +   DR  + D    S+  + + + +R
Sbjct: 913  MLGKVESRDVDIDDFDMERMLEIQAEEQLEKDAIAEFDRYRRMDEWSSSFQNVYSRAALR 972

Query: 410  TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
               Q+ L  +  + +D  +    P+    T   +R +A   L++L+
Sbjct: 973  --CQMQLMRAKILDVD--IMQFLPYTRAGTYDLLRADAFECLVELD 1014


>gi|307172277|gb|EFN63782.1| Transcription initiation factor TFIID subunit 2 [Camponotus
           floridanus]
          Length = 1192

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 123/479 (25%), Positives = 219/479 (45%), Gaps = 64/479 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGAS---------PVRTLSTKEFRHFANKVGNLERPFL 51
           MLE ++G     ++    +S A  A+         P   +ST  F   A  +  +    +
Sbjct: 448 MLEHRIGQELLLQVFNKQLSLAANAAQQKIESGLWPHMLISTNVF---AKAIFTVTGKDM 504

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +WV T G     + F +N+++N VEL + +D                 + ++ G 
Sbjct: 505 SVFIDQWVRTGGHAKFSLSFVFNRKRNTVELEIRQD-----------------TAHQRGI 547

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + + + ELDG + H  L + G   +  +I CHSK    +  K    +       
Sbjct: 548 RKYVGPLVVNIQELDGTFKH-TLQIEGTMARA-DITCHSKSRRNKKKKIPLCT------- 598

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
            G+ V  +D+ +  +SP+ WIR DPEM  +  +   QP   W  QL  + DV AQ +AI 
Sbjct: 599 -GEEV-DMDLSAMDDSPVLWIRLDPEMTIMRAVQIEQPDYQWQYQLRHERDVTAQLEAIV 656

Query: 232 ALE--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFY 288
           AL+  + P     + +T+ N       +++VR+ AA+ L   A+     WAG   ++  +
Sbjct: 657 ALQHHSTPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIF 712

Query: 289 KSRRFDENIG-LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           +      +   + + N+F +F  YF+ + IP A+A +R A    P E + F++ L KYND
Sbjct: 713 RKLFGSASCRRIIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLSFLMDLFKYND 772

Query: 348 NNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQFDRL 395
           N+ N YSD ++ AAL++++G      +   Q + +   SL       L+ + R L  ++L
Sbjct: 773 NSKNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSIDTKAILEEVTRNLNLEKL 832

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
           +P Y   ++++C++ +      L  F  L     + + +  +     VR+ A  AL+D 
Sbjct: 833 LPCYKYTVSVACLKVIR----ILQKFGHLPSNPHIFRAYAAYGQFIDVRIAALEALVDF 887


>gi|17542624|ref|NP_500378.1| Protein TAF-2 [Caenorhabditis elegans]
 gi|351065369|emb|CCD61342.1| Protein TAF-2 [Caenorhabditis elegans]
          Length = 1086

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 138/518 (26%), Positives = 239/518 (46%), Gaps = 75/518 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT--------LSTKE--FRHFANKVGNLERPF 50
           ML++++G   F ++L  +++     S  +T        L+T E  FR   +  G     F
Sbjct: 494 MLQRRIGLEPFMRVLHRVLTVGLDMSEKKTTPAAWRHLLTTTESFFRSVISVTGKEIPSF 553

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           L +F    V T G     + F +N+++NIVE+ + +D                D+E  +G
Sbjct: 554 LSQF----VRTGGHAAFAVKFDFNRKRNIVEIEIKQD----------------DTEG-NG 592

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDG 168
              + G +S+ V E+DG + H I      A    EI CHSK   +R  K     G + + 
Sbjct: 593 RTQYTGPLSVVVQEVDGAFTHTI--QIDGAVSHAEISCHSKGRKQRKKKVPLLTGEEIEI 650

Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
              N DA          ESP+ W+R DP+   + EI  +QP+  W   L  + DV+AQ +
Sbjct: 651 DLTNMDA----------ESPILWLRIDPDYLLIREITISQPMFHWEYMLRYERDVIAQME 700

Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA---NTASEETDWAGLLHLV 285
           A+  ++ALP  S +  + + + +++ K F+R+R  AA+ L    N  SE       + + 
Sbjct: 701 ALERIQALP--SAHSRSVIVDAVANEKFFYRIRYRAAFVLTFVQNRKSEALTVGTPVLIN 758

Query: 286 KFYKSRRFDENIGLPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQ 341
            F +S        +PR N+F    ++  +YFV++A+P A+A +R    +   +   F+L 
Sbjct: 759 MFRESFGSKAATNIPRSNNFVVTAQNLQQYFVMQALPQAIARLRRQSGECHEDVQPFLLD 818

Query: 342 LLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-----------LSSLLKRIDRLL 390
           L+K+NDN+ N YSD F+ AAL  S+    F   ++             +  L+K     L
Sbjct: 819 LIKFNDNSTNRYSDDFYRAALYNSLASSVFPHDALPCHVELPENLSNDVRVLIKEFTYAL 878

Query: 391 QFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRA 450
             D + PS+  ++  + +  L Q  L+  G++ LD   +L+  F   N   Q+R  A   
Sbjct: 879 NMDTVSPSWGRVVGAAALTGLYQ--LQKCGYLPLDS--QLLWTFSHPNCCVQMRRCAITL 934

Query: 451 LLDLEFH------CNGIDSALSLFIKSVEEEPSLRGQV 482
           ++D   +         +D +  L +  +E++PS+R  +
Sbjct: 935 IIDRIVNDPHAADTRMLDLSRILELAELEQDPSIRRMI 972


>gi|302694747|ref|XP_003037052.1| hypothetical protein SCHCODRAFT_64031 [Schizophyllum commune H4-8]
 gi|300110749|gb|EFJ02150.1| hypothetical protein SCHCODRAFT_64031 [Schizophyllum commune H4-8]
          Length = 1812

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 94/290 (32%), Positives = 151/290 (52%), Gaps = 34/290 (11%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR D + E++A I F+QP  MW++QL++D DVVAQ +AI AL   P  +  V +TL   
Sbjct: 695 WIRMDADFEWIAFISFDQPDFMWVSQLQRDRDVVAQLEAIYALSRTP--TAIVSSTLTRT 752

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPR 301
           +  +  F+R+R EAA AL + A   T++ GL HL K +    +D         ++  +P+
Sbjct: 753 VLVTNYFYRIRCEAALALVHCAVRRTEFLGLFHLFKLFLRYCYDTEDPKQDLFKHSYVPK 812

Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
           PNDF D +EYFV + +  A+A VR  + K+P    +F++  LK+NDN  N YSD F++  
Sbjct: 813 PNDFSDIAEYFVRKHLVTAIARVRFENGKTPPVVRQFLIDQLKFNDNTTNAYSDAFYICT 872

Query: 362 LVQSV-------------------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
           ++  +                   G  E   + +  L+  +  I+R  + D+L+PS + +
Sbjct: 873 IIDGLACATTSTAPPERGELLPTEGRAELTTEDVHLLNEAMSEIERYQEMDKLIPSPHNV 932

Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALL 452
           +T++ I   T   L ++ +I      +     RD N   QVR+ A  AL 
Sbjct: 933 VTVAVIEFYT--VLMVANYIP-SHPYRFFFYTRDGNYT-QVRIAAFDALF 978



 Score = 43.9 bits (102), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 52/102 (50%), Gaps = 8/102 (7%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           LST  F     KV  ++   ++ F  +W+   GCP      ++N++K  VE+ + ++C  
Sbjct: 510 LSTSYFLRVCRKVSGVD---VRGFAEQWIYGSGCPSFAFQATFNRKKMAVEITMRQEC-- 564

Query: 91  KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDH 131
            P  +    ++ +  + +   + + G M+IR+HE DG  Y+H
Sbjct: 565 -PAYKAHEGNAFSMEQFKPAPL-FEGQMTIRIHEADGTPYEH 604


>gi|164661735|ref|XP_001731990.1| hypothetical protein MGL_1258 [Malassezia globosa CBS 7966]
 gi|159105891|gb|EDP44776.1| hypothetical protein MGL_1258 [Malassezia globosa CBS 7966]
          Length = 1113

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 84/248 (33%), Positives = 137/248 (55%), Gaps = 28/248 (11%)

Query: 187 SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNT 246
           +P  WIR D + E++A+I F QP  MW++QL++D DVVAQ  A+ AL  +P  S    +T
Sbjct: 102 APYEWIRLDADFEWMAQIQFEQPDYMWVSQLQRDRDVVAQLAAVHALSQMP--SLITSST 159

Query: 247 LNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYK-SRRFD----------E 295
           L   +  +K F+R+R EAAY LAN A    D  GL HL   ++ S   D          E
Sbjct: 160 LTRTVLVTKYFYRIRAEAAYGLANCALPHLDLLGLFHLFMLFRTSYCLDVPHEGDSTSLE 219

Query: 296 NIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSD 355
              +P+PNDF D ++YFV  A+ HA+A VR    ++      F++ LL+YNDN+ N + D
Sbjct: 220 APCIPKPNDFSDMADYFVRRALIHAIARVRDHRGRALVIVQRFLIYLLRYNDNSTNRFVD 279

Query: 356 VFWLAALVQSV---------------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYN 400
            ++LA+++ ++                +  +  + +   +  ++ ++RL   DRL+PSY+
Sbjct: 280 DYYLASIINALAGTLIPIDSAGYSTHADETYSPEEMELRAVAIQEVERLQVLDRLVPSYH 339

Query: 401 GILTISCI 408
            ++TI+ +
Sbjct: 340 NVVTIAAL 347


>gi|409051976|gb|EKM61452.1| hypothetical protein PHACADRAFT_156702 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1899

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 91/299 (30%), Positives = 152/299 (50%), Gaps = 34/299 (11%)

Query: 191  WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
            WIR D + E++A I F+Q   MW++QL++D DVVAQ +AIA L   P+    + +T    
Sbjct: 714  WIRIDADFEWIANIAFDQKDYMWVSQLQRDRDVVAQYEAIAVLAKQPNAI--ISSTFTKT 771

Query: 251  LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPR 301
            +  S  F+R+R EAA AL   A +  DW GL HL K +    +D          +  +PR
Sbjct: 772  VLVSNYFFRIRCEAAQALVTCAIQRLDWIGLFHLFKLFLRYCYDPEDPNQDLFRHTYVPR 831

Query: 302  PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
            PNDF DF+EYFV +++   ++ VR  + K+P    +F++  L+YNDN  NP+SD  ++ +
Sbjct: 832  PNDFSDFAEYFVRKSLLQTISRVRFENGKTPAVVRQFLIDQLRYNDNTYNPFSDALYICS 891

Query: 362  LVQSV------------GELEFGQ-------QSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
            ++ ++            GE    +       +    L   +  +DR    DRL+PS + +
Sbjct: 892  IISALACATVSTVPPERGEFVAAEARPPMDPEDDKLLKEAVSEVDRYRSMDRLIPSVHNV 951

Query: 403  LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
            ++I+ I     + L ++  I  D   +++ P        QVR+ A  +L   +++   I
Sbjct: 952  VSIATIEFY--LLLSVANLIPHDP--RMLIPLTREGNFTQVRMAAFDSLFFTKWYTPAI 1006


>gi|260807981|ref|XP_002598786.1| hypothetical protein BRAFLDRAFT_212923 [Branchiostoma floridae]
 gi|229284061|gb|EEN54798.1| hypothetical protein BRAFLDRAFT_212923 [Branchiostoma floridae]
          Length = 995

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 127/503 (25%), Positives = 232/503 (46%), Gaps = 67/503 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           ++E+++G     ++   ++S A  A+  +          LS+  F    + V   +   L
Sbjct: 448 LVEQKIGFESLLQVFNKLLSLATSAASQKYSPTNWSNMLLSSASFVKLISTVSGKDIHLL 507

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
            +   +WV   G       F +N+++N+VEL +                         G 
Sbjct: 508 LD---QWVYQGGVVQFSGSFVFNRKRNLVELEL------------------NQKVGEHGI 546

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G +++ + ELDG ++H +     D     E+ CHSK    +  K            
Sbjct: 547 QKYVGPLTVTIQELDGSFNHSV--QIEDNTTRHELVCHSKSRRNKKKKIPLV-------- 596

Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
           NG+ V  +D+ +   +SP+ WIR DP++  + ++ + QP  MW  QL  + D++AQ  +I
Sbjct: 597 NGEEV-DMDLSAMDADSPVLWIRVDPQLTLVRKVEWQQPDFMWQYQLRYERDIIAQHDSI 655

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYK 289
            ALE  P  +  +   L + L +   F+RVR+EAA+ L+  A+   + W G   ++  +K
Sbjct: 656 MALEQFPTPASRLA--LTDILENESCFYRVRMEAAFCLSKVANAMVSSWMGPPAMMSMFK 713

Query: 290 SRRFDENI--GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
            R F   +   L + N+F +   YF+ + +P A+  VR A  + P++ ++F+LQL+KYND
Sbjct: 714 -RMFCTQMYTNLIKCNNFSNLQGYFLQKTMPVAMGAVRDAHGRCPKDVLQFILQLIKYND 772

Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILFLSS--------------LLKRIDRLLQFD 393
           N  N +SD ++ A+L+ ++        ++L  S+              +L+ + RLL  +
Sbjct: 773 NGKNKFSDNYYRASLIDALAATVTPSVTMLDTSTDSRTEQALLPETKAVLEDVVRLLNLE 832

Query: 394 RLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +L+P Y   +T+SC++ +    L+ +G   L     L K + D      VR+ A   L D
Sbjct: 833 KLLPCYQFTITVSCLKAIR--TLQKNG--HLPSNAALFKSYADPGQYVDVRLVAMEILTD 888

Query: 454 LEFHCNGIDSALSLFIKSVEEEP 476
              H +  +  L   +  VE +P
Sbjct: 889 F-VHVDQDEEVLGWLLDYVEIDP 910


>gi|119188493|ref|XP_001244853.1| hypothetical protein CIMG_04294 [Coccidioides immitis RS]
 gi|392867767|gb|EAS33464.2| transcription initiation factor TFIID complex 127kD subunit
            [Coccidioides immitis RS]
          Length = 1253

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 138/530 (26%), Positives = 232/530 (43%), Gaps = 88/530 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L K  G     +IL  I    + G  P   L+T  F+    + G+++   L  FF +WV 
Sbjct: 501  LSKASGKATVSRILSRIFLNCRMGDIPNSALTTIFFQKLCERFGHVK---LDAFFQQWVY 557

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSR---TPVLSSNTDSENRDGDIG---- 113
              GCP       +NK+K +VE+ + +  + +  +R         +   E ++   G    
Sbjct: 558  GAGCPRFVATQRFNKKKLVVEMMIRQVQSEQLSARDLEKSTFMRDVKEELQNIYAGTLQH 617

Query: 114  -WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGSKPDGC 169
             + G M+IR+HE DG  Y+H +     +A    +I  ++K     R   + ++ +   G 
Sbjct: 618  VFTGSMTIRIHEADGTPYEHIV--EIKEAVTKFDIPYNTKYKRLKRNKRQRERAAASSGV 675

Query: 170  DDNGDAVAGL------DMRSSMESPL--------------------SWIRADPEMEYLAE 203
            D N +A   +      D+  S E  L                     WIR D + E++ +
Sbjct: 676  DPNSEAQDDVLLYCLGDVLQSEEEILKWRLVEWTKEDEDRMGQESYEWIRMDADFEWICK 735

Query: 204  IHFNQPVQMWINQLEKDGDVVAQAQA-------IAALEALPHLSFNVVNTLNNFLS---- 252
            +    P  M+++QL++D DVVAQ +A       +AA   +  + +      +  +S    
Sbjct: 736  MSLTMPGYMYLSQLQQDRDVVAQLEASNSRVGFMAAANCIQSIQYMAAQREHPLISTIFA 795

Query: 253  ----DSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDF 308
                DS+ F  +R  AA+AL   A E+ DW GL HL   ++      N  + R N+F D 
Sbjct: 796  RTLMDSRYFHGIRTAAAHALVKHAKEDVDWIGLFHLETAFQELFCLPNSPMTRSNNFSDR 855

Query: 309  SEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG- 367
            S Y +  AIP+A++ VR  + + P     F+ + LK+NDN+ N +SD +++AAL++S+  
Sbjct: 856  SSYALQIAIPNAISKVRNRNGRCPMRVKRFLYEKLKFNDNSNNEFSDCYYVAALMRSIAN 915

Query: 368  -------------------ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---I 405
                               ELE   +  L+L  + + I+R  + D    S+  + T   +
Sbjct: 916  ALIGGKESIHTKSKLDMTQELERQAEDQLYLDCIAE-INRYKRMDEWTSSFQNLYTRTAL 974

Query: 406  SCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
             C+R L      +     LD  V    P+    T   VR++A R LLDL+
Sbjct: 975  DCLRRL------MKEHRVLDFDVAQFLPYTRIGTFDLVRLDAFRILLDLD 1018


>gi|307205887|gb|EFN84045.1| Transcription initiation factor TFIID subunit 2 [Harpegnathos
           saltator]
          Length = 1103

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 123/483 (25%), Positives = 220/483 (45%), Gaps = 72/483 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       F   +  +    +  
Sbjct: 361 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 419

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     + F +N+++N VEL + +D T                 ++ G   
Sbjct: 420 FIDQWVRTGGHAKFTLSFVFNRKRNTVELEIKQDTT-----------------HQRGIRK 462

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + + + ELDG + H  L + G   +  +I CHSK    +  K    +        G
Sbjct: 463 YVGTLLVNIQELDGTFKH-TLQIEGTLAKA-DITCHSKSRRNKKKKIPLCT--------G 512

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  +  +   QP   W  QL  + DV AQ +AI AL
Sbjct: 513 EEV-DMDLSAMDDSPVLWIRLDPEMTLMRAVQIEQPDYQWQYQLRHERDVTAQLEAIEAL 571

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYK- 289
           +  + P     + +T+ N       +++VR+ AA+ L   A+     WAG   ++  ++ 
Sbjct: 572 QNHSTPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMMAIFRK 627

Query: 290 ------SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLL 343
                  RR      + + N+F +F  YF+ + IP A+A +R A    P E + F++ L 
Sbjct: 628 LFGSASCRR------IIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLF 681

Query: 344 KYNDNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQ 391
           KYNDN+ N YSD ++ AAL++++G      +   Q + +   SL       L+ + R L 
Sbjct: 682 KYNDNSKNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSIDTKAILEEVTRNLN 741

Query: 392 FDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
            ++L+P Y   ++++C++ +      L  F  L     L + +  +     +R+ A  AL
Sbjct: 742 LEKLLPCYKYTVSVACLKVIR----ILQKFGHLPSNPNLFRAYAAYGQFIDIRIAALEAL 797

Query: 452 LDL 454
           +D 
Sbjct: 798 VDF 800


>gi|403417076|emb|CCM03776.1| predicted protein [Fibroporia radiculosa]
          Length = 1900

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 94/285 (32%), Positives = 144/285 (50%), Gaps = 34/285 (11%)

Query: 191  WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
            WIR D + E++A I F+Q   MW++QL++D DVVAQ +AI  L   P+    + +TL   
Sbjct: 789  WIRIDADFEWIAGIAFDQKDYMWVSQLQRDRDVVAQYEAILVLSKSPNAI--ISSTLTKT 846

Query: 251  LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF---DENIGL------PR 301
            +  S  ++R+R EAA AL +   +  +W GL HL K +    +   D N  L      PR
Sbjct: 847  VLVSNYYFRIRCEAAQALVSCGIQRLEWLGLFHLFKLFLRYCYEPEDPNPDLFSHTYVPR 906

Query: 302  PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
            PNDF D +EY+V +++  A+A VR  + K+P    +F++  L+YNDN  NPYSD  ++  
Sbjct: 907  PNDFSDLTEYYVRKSLLRAIARVRFENGKTPPIIRQFLIDQLRYNDNTANPYSDALYICT 966

Query: 362  LVQ-------SVGELEFGQ------------QSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
            ++        SV   E G+            Q    L   L  +DR    DRL+P+Y+ +
Sbjct: 967  IISALATATISVVPPERGEFIATDSQPVQDSQDADLLRQALAEVDRYRSMDRLIPTYHNV 1026

Query: 403  LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEA 447
            +T++ I     + L + G I  D   +   P        QVR+ A
Sbjct: 1027 VTVAVIEFY--LMLSMGGLIPHD--ARTFFPLTREGNYTQVRIAA 1067



 Score = 45.1 bits (105), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 32/141 (22%), Positives = 65/141 (46%), Gaps = 21/141 (14%)

Query: 1   MLEKQMGSN-----FFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +L++++G +       R + +  +S   G  P   +ST  F     KV  ++    + F 
Sbjct: 569 ILDRRLGKSGTSLGLSRVLPKVFLSAISGEMPNNAISTHSFLRTCRKVSGIDP---RSFA 625

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
            +W+   GCP      ++N++K  VE+++ ++         P   ++ ++E     +   
Sbjct: 626 EQWIYGSGCPTFGFSATFNRKKMAVEISMRQEA--------PAYKAHENNEITRTLMKPV 677

Query: 114 --WPGMMSIRVHELDGM-YDH 131
             + G M+IR+HE DG  Y+H
Sbjct: 678 PFFEGQMTIRIHEADGTPYEH 698


>gi|332022415|gb|EGI62723.1| Transcription initiation factor TFIID subunit 2 [Acromyrmex
           echinatior]
          Length = 1205

 Score =  149 bits (375), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 120/477 (25%), Positives = 219/477 (45%), Gaps = 60/477 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       FA  +  +    +  
Sbjct: 457 MLEHRIGQELLLQVFNKQLSLAANAAQ-QKIESGLWSHMLISTNVFAKAIFTVTGKDMSV 515

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     + F +N+++N VEL + +D T                 ++ G   
Sbjct: 516 FIDQWVRTGGHAKFSLSFIFNRKRNTVELEIRQDTT-----------------HQRGIRK 558

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + + + ELDG + H  L + G   +  +I CHSK    +  K    +        G
Sbjct: 559 YVGPLVVNIQELDGTFKH-TLQIEGTMARA-DITCHSKSRRNKKKKIPLCT--------G 608

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  +  +   QP   W  QL  + DV AQ +AI AL
Sbjct: 609 EEV-DMDLSAMDDSPVLWIRLDPEMTIMRAVQIEQPDYQWQYQLRHERDVTAQLEAIEAL 667

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           +  + P     + +T+ N       +++VR+ AA+ L    +     W+G   ++  ++ 
Sbjct: 668 QHHSTPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVGNAMVATWSGPPAMLAIFRK 723

Query: 291 RRFDENIG-LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
                +   + + N+F +F  YF+ + IP A+A +R A    P E + F++ L KYNDN+
Sbjct: 724 LFGSASCRRIIKQNNFSNFQHYFLQKTIPVAMAGLRNAHGICPPEVLAFLMDLFKYNDNS 783

Query: 350 GNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSSL-------LKRIDRLLQFDRLMP 397
            N YSD ++ AAL++++G      +   Q + +   SL       L+ + R L  ++L+P
Sbjct: 784 KNRYSDNYYRAALIEALGATVTPVISVQQGTAITAESLSIDTKAILEEVTRNLNLEKLLP 843

Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            Y   ++++C++ +      L  F  L     L + +  +     +R+ A  AL+D 
Sbjct: 844 CYKYTVSVACLKVIR----ILQKFGHLPSNPHLFRAYAAYGQFIDIRIAALEALVDF 896


>gi|154283629|ref|XP_001542610.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
            capsulatus NAm1]
 gi|150410790|gb|EDN06178.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
            capsulatus NAm1]
          Length = 1170

 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 133/512 (25%), Positives = 216/512 (42%), Gaps = 83/512 (16%)

Query: 1    MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +LE+++     +  +  IISR       G  P   L+T  F+    ++G+ +   L  FF
Sbjct: 511  ILERRLSKASGKATISRIISRIFLNARMGDLPNGALATSFFQKTCERLGHTK---LDTFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
             +WV   GCP       +NK+K +VE+ + +  + +P  R   +S+   +   E R+   
Sbjct: 568  QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLEMSTFMRDVKEEIRNVYA 627

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 628  GPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687

Query: 167  DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
             G D N            GD +   +                R   ES   WIR D + E
Sbjct: 688  SGIDPNAELQDDVLLYCLGDVLQSEEEIQTWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
            ++ ++    P  M+++QL++D D+VAQ ++I   AA    P +S   V T    L D + 
Sbjct: 747  WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRRY 802

Query: 257  FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
            F+ +R  AA AL   A +E DW GL HL   ++         + R NDF D + Y +   
Sbjct: 803  FYGIRAAAARALVKHAKDEVDWIGLYHLESAFQELFCLPGSPMTRSNDFSDRAAYILQLT 862

Query: 317  IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG---- 372
            IP A+A VR    K+P     F+   LK+NDN+ N  SD +++ AL++S+ E   G    
Sbjct: 863  IPQAIAQVRDNSGKTPMRVKRFLYDKLKFNDNSNNELSDCYYITALMKSICEALLGRRET 922

Query: 373  -----------------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLT 412
                             Q         +  IDR  + D    S+  I +   I C R L 
Sbjct: 923  EIDGVGVDFDVERELERQAERQLDRDCIAEIDRYRRMDEWTTSFQNIYSRTAIECQRRLM 982

Query: 413  QIALKLSGFISLDQVVKLIKPFRDFNTIWQVR 444
            Q  +       +D+ +    P+ +  +  +VR
Sbjct: 983  QANV-------IDRDIMQFLPYTELESSTEVR 1007


>gi|213410437|ref|XP_002175988.1| transcription initiation factor TFIID subunit 2
           [Schizosaccharomyces japonicus yFS275]
 gi|212004035|gb|EEB09695.1| transcription initiation factor TFIID subunit 2
           [Schizosaccharomyces japonicus yFS275]
          Length = 1147

 Score =  148 bits (374), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 134/489 (27%), Positives = 222/489 (45%), Gaps = 74/489 (15%)

Query: 2   LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           L K  GS    +++  ++ +A  G      LST  F     K  +++   L  F  +W+ 
Sbjct: 452 LTKTGGSLGMSRVIPKLLLQAMSGDMHNGCLSTTHFLKTCEKASHIK---LDVFAQQWIY 508

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT----DSENR--DGDIGW 114
             G P+ R+   +N++K I+E+ + +          P LS N+    D   R  +G+   
Sbjct: 509 GFGYPIFRVVQRFNRKKMIIEMGINQ----VQAKEMPKLSRNSLDFMDEATRHLNGETVK 564

Query: 115 P------GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPD 167
           P      G M+IR+HE DG  Y+H +     D +  L+IQ ++K       +  K  + +
Sbjct: 565 PVIPVFSGPMTIRIHEADGTPYEHVV--ELKDTFTKLDIQYNTKYKRISRNRSSKNVRKE 622

Query: 168 GCDDNGDAVAGLDMRSSMES------------------------------PLSWIRADPE 197
           G DD+G    GLD    ++S                                 WIR D +
Sbjct: 623 G-DDHG----GLDSDYVIKSLGDILQTDEDVENWHLYDYTKEDEDAMATEAFEWIRVDAD 677

Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL-SFNVVNTLNNFLSDSKA 256
            E++  +   QP  M+++QL++D DVVAQ +AI    +     S  V   L   L DS+ 
Sbjct: 678 FEWICGLRVGQPEHMYVSQLQQDRDVVAQLEAIRYFCSDSFRPSRQVSTVLLRTLMDSRY 737

Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
           ++ +R EAA ALA  A+EE +W G  HL K +  +   EN  +P+ N F    +Y++   
Sbjct: 738 YYGIRQEAALALAKCATEELEWIGHFHLRKAFLDKFCFENSTIPKSNVFSSIIDYYLKCT 797

Query: 317 IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV-------GE- 368
           + HA + +R     +P    + +L LL +NDN  N  SD  ++  L++++       GE 
Sbjct: 798 MVHAFSRIRDRKGDTPITVKKLLLDLLCFNDNANNELSDAHYICHLIEALVDALIPRGET 857

Query: 369 LEFG---QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLD 425
           ++F       + F+ +++  IDR L+ D  MPSY  +++    RT  +   KLS    L 
Sbjct: 858 MQFAFSDAGHMQFVKTVIAEIDRYLRIDGWMPSYCNVIS----RTALEGKNKLSQVFHLG 913

Query: 426 QVVKLIKPF 434
             VK + P+
Sbjct: 914 FEVKDLFPY 922


>gi|393907247|gb|EFO22550.2| TATA binding protein associated factor [Loa loa]
          Length = 1083

 Score =  148 bits (374), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 137/485 (28%), Positives = 231/485 (47%), Gaps = 68/485 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEFRHFANKVGNLERPFL 51
           ML +++G   F +++Q I+S A  AS ++         T+ST+ F    + V   E P  
Sbjct: 480 MLNQRLGKEPFLQVIQKILSVAMQASQLQKEPAHWQHMTVSTESFFRTVSSVTGQELP-- 537

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +W+   G     + +++N+++N++EL V ++                 S NR G 
Sbjct: 538 -TFLEQWIHNGGHAKFWVQYAFNRKRNMIELEVKQE----------------PSANR-GR 579

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + + V ELDG + H +   +  +    ++QCHSK   RR  K KK     G + 
Sbjct: 580 QSYVGPLIVVVQELDGSFTHTVQIDSNHSKH--DLQCHSK--GRRQ-KKKKIPLSTGEEL 634

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
             D +A +D     +SP+ WIR DP++  L +++  QPV  W   L+ + DV+AQ QA+ 
Sbjct: 635 EID-LANMDP----DSPVLWIRIDPDLLLLRKVNIQQPVYQWEYMLKYERDVLAQLQALD 689

Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAG----LLHLVK 286
            L+  P  S +    L   +     F+RVR  AAY LA   ++  + W G    ++   K
Sbjct: 690 ILQRFP--SPHARAVLIETIETEAFFYRVRCRAAYILAEVVNKLPETWLGPPALMILFKK 747

Query: 287 FYKSRRFDENIGLPRPNDFRDFSE----YFVLEAIPHAVAMVRAADNKSPREAVEFVLQL 342
           F+  +       +P+PN+F   S     YF++ AIP A+A +R++ N +  E   F++ L
Sbjct: 748 FFGCKSAPH---IPKPNNFVATSANLQLYFLMHAIPQALARLRSSSNGALSEVHHFLIGL 804

Query: 343 LKYNDNNGNPYSDVFWLAALVQSVGELEFGQQS--ILFLS---------SLLKRIDRLLQ 391
           +KYNDN  N YSD  + A+L+ S+       +S  I+ +S          LL      L 
Sbjct: 805 IKYNDNTVNRYSDDHYRASLINSLAATLIPAESRDIIDMSPLSLDTEMRELLSEFTLALN 864

Query: 392 FDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
            D L PS+  I+ I+ ++ +    ++ +  I LD   K+   F   N    +R+ A   L
Sbjct: 865 MDTLKPSFGRIIGITALQAIYH--MQRTFHIPLDP--KVFWSFAQPNMYVPMRLAALTFL 920

Query: 452 LDLEF 456
           +D+ +
Sbjct: 921 VDMVY 925


>gi|358335688|dbj|GAA28713.2| transcription initiation factor TFIID subunit 2 [Clonorchis sinensis]
          Length = 1343

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 121/443 (27%), Positives = 206/443 (46%), Gaps = 58/443 (13%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
            LST  FR     V   +   ++ F  +WV   G   L + F +N+++N+VEL + +D   
Sbjct: 692  LSTSSFRRIILMVTGQD---IQNFLDQWVFRTGHVRLFVKFHFNRKRNVVELELKQDLQS 748

Query: 91   KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHS 150
            +                  G + + G +++ + ELDG + H      G   +  ++ CHS
Sbjct: 749  R------------------GTLSYTGPLTVMLQELDGAFMHTFKLEEGRMAR--DLPCHS 788

Query: 151  KLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPV 210
            K     + K +K   P     NGD V     R   ESPL W+R DP++  +  I+  QP 
Sbjct: 789  K-----SRKHRKKRIPLA---NGDEVDMDLGRIDAESPLLWVRMDPDLAVIHTIYVEQPD 840

Query: 211  QMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN 270
             MW   L  D D + Q +A  AL   P  S    + L N +++ + F+RVR +A ++L +
Sbjct: 841  FMWHLMLLHDRDCLGQLEATHALRDFP--SPETRHALANVIANERVFYRVRTDACFSLCH 898

Query: 271  TASEETDWAGL----------LHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHA 320
             A+E +   G           + L  F++        GL R N+F+    YF++ AI  +
Sbjct: 899  IANELSAMGGAQGTTANAATSVLLPLFWQLFSSPAARGLVRQNNFKHLQHYFLMRAICRS 958

Query: 321  VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL---EFGQQSIL 377
            +A +R      PRE + F+ +LL++NDN  N YSD  +LA L++++ ++       + ++
Sbjct: 959  LATLR-VQQVCPREVLPFISELLRHNDNTRNEYSDYCYLADLIRTMKDILTPAIVVRGVM 1017

Query: 378  FLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKL 430
              SSL       ++ +   L  D   PSY G +T+ C+ T+ +  L+  GF+ +D  +  
Sbjct: 1018 SSSSLPVEARTVIEEVVHHLNLDTQTPSYKGTITVVCLATIRR--LQRLGFLPVDPSLFY 1075

Query: 431  IKPFRDFNTIWQVRVEASRALLD 453
            +     F   + VR+     L+D
Sbjct: 1076 LYASPGF--YYDVRLAGLECLVD 1096


>gi|310789844|gb|EFQ25377.1| hypothetical protein GLRG_00521 [Glomerella graminicola M1.001]
          Length = 1554

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 142/527 (26%), Positives = 234/527 (44%), Gaps = 71/527 (13%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
            ++T+ F+    K     R  L+ F+ +WV   GCP   +   +NK++  VE+ + R    
Sbjct: 612  INTESFQRVCEKNA---RNKLEAFWAQWVYGSGCPRFDVFQRFNKKRLCVEMTI-RQVQD 667

Query: 91   KPDSRTPVLSSN---------TDSENRDGDIG-WPGMMSIRVHELDGM-YDHPILPMAGD 139
            K  S   VL             +  + D  +G + G M+IR+HE DG  Y+H I+ +  D
Sbjct: 668  KALSEAEVLKKTEFLRAIKEKANGVHVDAAMGLFTGPMTIRIHEADGTPYEH-IVEIRED 726

Query: 140  AWQLLEIQC---------HSKLAARRALKPKKGSKPDGCDDN-----GDAVAGL------ 179
            A +  + +            K  A   L  +  + PD  ++      GD +         
Sbjct: 727  AAKAAKFEIPYNTKYKRLKRKRRATEKLSAQSAADPDNNEEALLYCLGDVLTSPEDMRKW 786

Query: 180  -------DMRSSM-ESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
                   D+  +M +    WIR D + E+  E+  N    M+++QL++D DVVAQ  ++ 
Sbjct: 787  EFSEWEPDIEKAMDQESYEWIRMDADFEWACEMKTNLQSYMYVSQLQQDRDVVAQQDSML 846

Query: 232  ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR 291
             L+  P     V   L   L D + F  VR  AA  L + A+E+   AG+ HL++ +   
Sbjct: 847  YLKKSPPHPL-VSTFLTRTLVDRRYFHGVRTMAADMLYHHATEKVAMAGMTHLMRAFTEL 905

Query: 292  RFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGN 351
                +   PRPNDF D  +Y V  A+P A+A VR ++ K P  A   +L  +++N+N  N
Sbjct: 906  FCYPDSATPRPNDFSDKKQYLVQSALPMAIAKVRDSNGKCPEPARRLLLNQVQFNNNANN 965

Query: 352  PYSDVFWLAALVQSVG---------------ELEFGQQSILFLSSLLKRIDRLLQFDRLM 396
            PYSD F++A L++++                +L+   +   FL   +  IDR  + D   
Sbjct: 966  PYSDHFYVAKLLEALAHSLIPEKRRDAGDSMDLDTSAEESSFLEEAIVEIDRFRRMDEWA 1025

Query: 397  PSYNGILTIS---CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
             SY  I T +   C R L +     +G I     +  I+  +D +T   VRV+A  AL+D
Sbjct: 1026 NSYQNIWTTTALECRRKLMK-----AGVIP-KSALDFIQYLQD-DTCDLVRVKAFEALVD 1078

Query: 454  LEFHCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDS 500
            L +        L + + S +  P +R ++ L I    +  +  G  S
Sbjct: 1079 LGYMMELPIFRLIVSVMSTDRSPFVRDKL-LKIFCTGLASLAFGEQS 1124


>gi|443429476|gb|AGC92654.1| transcription initiation factor TFIID subunit 2-like protein
           [Heliconius erato]
          Length = 1364

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 133/536 (24%), Positives = 240/536 (44%), Gaps = 68/536 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           MLE+++G     ++    +S A  A+  +          LST  F      +  +    +
Sbjct: 434 MLEQRIGQELLLQVFNKQLSLATNAANTKIGSGLWGHLLLSTNLF---VKAIFTVTGKDM 490

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +WV T G    ++   +N+++N VEL + +D                 S +  G 
Sbjct: 491 AVFVDQWVRTGGHAKFQLTSVFNRKRNTVELEIRQD-----------------SVHERGI 533

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + +++ ELDG + H +        Q+  I   + +      +  K  K   C  
Sbjct: 534 RKYVGPLLVQLQELDGTFKHTL--------QIENIVVKADITCHSKSRRNKKKKIPLC-- 583

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
            G+ V  +D+ +  +SP+ WIR DPEM  L     +QP   W  QL  + DV AQ++AI 
Sbjct: 584 TGEEV-DMDLSAMEDSPVLWIRLDPEMSLLRSTIISQPDYQWQYQLRHERDVTAQSEAID 642

Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKS 290
           AL   P  +      L + + + + ++++R  AA+ L   A+   + WAG   ++  +  
Sbjct: 643 ALHNYPEPA--TRKALTDTIENEQTYYKIRCRAAHCLTKVANAMISSWAGPPAMLTIF-- 698

Query: 291 RRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           R+   +   P   + N+F +   YF+   IP A+A +R      P E V F+L L KYND
Sbjct: 699 RKMFGSFSAPHIIKQNNFDNLQHYFLQRTIPVAMAGLRNIHGICPPEVVRFLLDLFKYND 758

Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSIL---------FLSS----LLKRIDRLLQFDR 394
           N+ N +SD ++ AALV ++        S+L          LS+    +L+ I R+L  ++
Sbjct: 759 NSKNHFSDNYYKAALVDALAATITPVISVLQPGAPITADSLSADTRLVLEEITRVLNLEK 818

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
           ++P Y   ++++C+R + +  L+  G   L  +  + + + ++     VR+ A   L+D 
Sbjct: 819 VLPCYKNTVSVNCLRAIRR--LQQCG--HLPSIPTIFRAYAEYGQYIDVRLAAFECLVDF 874

Query: 455 EFHCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLV 510
               +G    LS  + +VE +P     V+ G+  + +        S H +DT  +V
Sbjct: 875 -VRVDGKPEDLSSLLTAVENDPD--PGVRHGLARLLVSMPPFEKASRHRLDTEAVV 927


>gi|350633147|gb|EHA21513.1| hypothetical protein ASPNIDRAFT_45548 [Aspergillus niger ATCC 1015]
          Length = 1272

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 134/529 (25%), Positives = 227/529 (42%), Gaps = 88/529 (16%)

Query: 1    MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +L++++     +  +  IISR       G  P   ++T  F+    ++G+ +   L  FF
Sbjct: 511  ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAVTTSLFQRTCERLGHAK---LDSFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
             +WV   GCP  +    +NK+K +VE+ + +   D +   D        +   E R    
Sbjct: 568  QQWVYGAGCPRFQATQRFNKKKLVVEMMIRQVQADASSTRDLDKNAFMRDVKEEIRGVYA 627

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 628  GVIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAV 687

Query: 167  DGCDDNGDA------------------VAGLDMRS--------SMESPLSWIRADPEMEY 200
             G D N +                   VA   +R                WIR D + E+
Sbjct: 688  TGGDPNAETQEDVLLYCLGDVLQSEEEVAEWRLRDWSEEEEERMSRESYEWIRMDADFEW 747

Query: 201  LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFN-----VVNTLNNFLSDSK 255
            + ++    P  M+++QL++D DVVAQ      LE+L +++       +       L D +
Sbjct: 748  ICKLSLAMPGYMYLSQLQQDRDVVAQ------LESLQYMAIQREHELISTIFVRTLMDRR 801

Query: 256  AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
             F+ +R+ AA AL   A++   W GL HL      R F E   LP     R NDF D + 
Sbjct: 802  YFYGIRVAAARALVKHATDAVGWVGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 856

Query: 311  YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL- 369
            YF+  AIP +++ +R A+ K+P     F+   LK+NDN+ N YSD F++A L++S+    
Sbjct: 857  YFIQLAIPESISKIRDANGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMESLCHAM 916

Query: 370  ------------EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
                        +F  + +L            +  IDR  + D    S+  I + + +R 
Sbjct: 917  LGKPERPSDEMDDFDMERVLEAQAEEQLEKDAIAEIDRYRRMDEWSSSFQNIYSRAALR- 975

Query: 411  LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCN 459
              Q+ L  +  +SLD +  L  P+    T   +R++A   L++L+   N
Sbjct: 976  -CQVRLMEAKILSLDIMQFL--PYTRAGTYDLLRLDAFECLIELDIFRN 1021


>gi|260808452|ref|XP_002599021.1| hypothetical protein BRAFLDRAFT_130728 [Branchiostoma floridae]
 gi|229284297|gb|EEN55033.1| hypothetical protein BRAFLDRAFT_130728 [Branchiostoma floridae]
          Length = 1059

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 125/503 (24%), Positives = 232/503 (46%), Gaps = 67/503 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           ++E+++G     ++   ++S A  A+  +          LS+  F    + V   +   L
Sbjct: 448 LVEQKIGFESLLQVFNKLLSLATSAASQKYTPTNWSNMLLSSASFVKLISTVSGKDIHLL 507

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
            +   +WV   G       F +N+++N+VEL +                         G 
Sbjct: 508 LD---QWVYQGGVVHFSGSFVFNRKRNLVELEL------------------NQKVGEHGI 546

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G +++ + ELDG ++H +     D     E+ CHSK    +  K            
Sbjct: 547 QKYVGPLTVTIQELDGSFNHSV--QIEDNTTRHELVCHSKSRRNKKKKIPLV-------- 596

Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
           NG+ V  +D+ +   +SP+ WIR DP++  + ++ + QP  MW  QL  + D++AQ  +I
Sbjct: 597 NGEEV-DMDLSAMDADSPVLWIRVDPQLTLVRKVEWQQPDFMWQYQLRYERDIIAQHDSI 655

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYK 289
            ALE  P  +  +   L + L +   F+RVR+EAA+ L+  A+   + W G   ++  +K
Sbjct: 656 MALEQFPTPASRLA--LTDILENESCFYRVRMEAAFCLSKVANAMVSSWMGPPAMMSMFK 713

Query: 290 SRRFDENI--GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
            R F   +   L + N+F +   YF+ + +P A+  VR A  + P++ ++F+LQL+KYND
Sbjct: 714 -RMFCTQMYTNLIKCNNFSNLQGYFLQKTMPVAMGAVRDAHGRCPKDVLQFILQLIKYND 772

Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILFLSS--------------LLKRIDRLLQFD 393
           N  N +SD ++ A+L+ ++        ++L  S+              +L+ + R L  +
Sbjct: 773 NGKNKFSDNYYRASLIDALAATVTPSVTMLDTSTDSRTEQALLPETKAVLEDVVRHLNLE 832

Query: 394 RLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           +L+P Y   +T+SC++ +    L+ +G +  +    L K + D      VR+ A   L D
Sbjct: 833 KLLPCYQFTITVSCLKAIR--TLQKNGHLPSN--AALFKSYADPGQYVDVRLVAMEILTD 888

Query: 454 LEFHCNGIDSALSLFIKSVEEEP 476
              H +  +  L   +  VE +P
Sbjct: 889 F-VHVDQDEEVLGWLLDYVENDP 910


>gi|317036312|ref|XP_001398085.2| TATA-binding protein associated factor Taf2 [Aspergillus niger CBS
           513.88]
          Length = 1245

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 133/529 (25%), Positives = 230/529 (43%), Gaps = 88/529 (16%)

Query: 1   MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +L++++     +  +  IISR       G  P   ++T  F+    ++G+ +   L  FF
Sbjct: 484 ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAVTTSLFQRTCERLGHAK---LDSFF 540

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
            +WV   GCP  +    +NK+K +VE+ + +   D +   D        +   E R    
Sbjct: 541 QQWVYGAGCPRFQATQRFNKKKLVVEMMIRQVQADASSTRDLDKNAFMRDVKEEIRGVYA 600

Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
           G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 601 GVIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAV 660

Query: 167 DGCDDNGDAVAGL------DMRSSMES--------------------PLSWIRADPEMEY 200
            G D N +    +      D+  S E                        WIR D + E+
Sbjct: 661 TGGDPNAETQEDVLLYCLGDVLQSEEEVAEWRLRDWSEEEEERMSRESYEWIRMDADFEW 720

Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFN-----VVNTLNNFLSDSK 255
           + ++    P  M+++QL++D DVVAQ      LE+L +++       +       L D +
Sbjct: 721 ICKLSLAMPGYMYLSQLQQDRDVVAQ------LESLQYMAIQREHELISTIFVRTLMDRR 774

Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
            F+ +R+ AA AL   A++   W GL HL      R F E   LP     R NDF D + 
Sbjct: 775 YFYGIRVAAARALVKHATDAVGWVGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 829

Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL- 369
           YF+  AIP +++ +R A+ K+P     F+   LK+NDN+ N YSD F++A L++S+    
Sbjct: 830 YFIQLAIPESISKIRDANGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMESLCHAM 889

Query: 370 ------------EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
                       +F  + +L            +  IDR  + D    S+  I + + +R 
Sbjct: 890 LGKPERPSDEMDDFDMERVLEAQAEEQLEKDAIAEIDRYRRMDEWSSSFQNIYSRAALR- 948

Query: 411 LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCN 459
             Q+ L  +  +SLD +++ + P+    T   +R++A   L++L+   N
Sbjct: 949 -CQVRLMEAKILSLD-IMQFL-PYTRAGTYDLLRLDAFECLIELDIFRN 994


>gi|83765542|dbj|BAE55685.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391864701|gb|EIT73995.1| TATA binding protein associated factor [Aspergillus oryzae 3.042]
          Length = 1269

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 138/525 (26%), Positives = 231/525 (44%), Gaps = 88/525 (16%)

Query: 1    MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +L++++     +  +  IISR       G  P   +ST  F+    ++G+ +   L  FF
Sbjct: 511  ILDRRLTKASGKATMSRIISRLFLNARMGDIPNGAVSTMLFQKTCERLGHAK---LDSFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
             +WV   GCP  +    +NK+K +VE+ + +  + +P +R   L  NT   +   +I   
Sbjct: 568  QQWVLGAGCPRFQATQRFNKKKLVVEMMIKQVQSEQPSTRD--LDKNTFMRDVKEEIRGI 625

Query: 114  --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
                    + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +
Sbjct: 626  YAGVVQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 685

Query: 165  KPDGCDDN------------GDAV--------------AGLDMRSSMESPLSWIRADPEM 198
               G D N            GD +              +  D     +    WIR D + 
Sbjct: 686  AASGGDPNAEVQEDVLLYCLGDVLQSEEEIQEWKLADWSKEDEERMGQESYEWIRMDADF 745

Query: 199  EYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDSK 255
            E++ ++    P  M+++QL++D DV+AQ    Q +AA    P +S   V T    L D +
Sbjct: 746  EWICKLSLVMPGYMYLSQLQQDRDVIAQLESLQYMAAQREHPLISTIFVRT----LMDRR 801

Query: 256  AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
             F+ +R+ AA AL   A EE +W GL HL      R F E   LP     R NDF D + 
Sbjct: 802  YFYGIRVAAAKALIKHAKEEINWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 856

Query: 311  YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS----- 365
            Y +   IP A++ VR  + K+P     F+   L++NDN+ N YSD +++A L+QS     
Sbjct: 857  YVLQLVIPEAISKVRDNNGKTPMRVKRFLYDKLRFNDNSNNEYSDNYYVATLMQSLCHAM 916

Query: 366  VGELE--------FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
            +G +E        F  + +L            +  IDR  + D    SY  + + + +R 
Sbjct: 917  LGRVESRSHDLDDFDMEGVLESQAEEKLEKDAIAEIDRYRRMDEWSSSYQNLYSRTTLR- 975

Query: 411  LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
              Q+ L  +    LD +++ + P+    T   +R+EA   L++ +
Sbjct: 976  -CQMQLMQAKITELD-IMQFL-PYTRAGTYDLLRLEAFECLVETD 1017


>gi|317139688|ref|XP_001817687.2| TATA-binding protein associated factor Taf2 [Aspergillus oryzae
           RIB40]
          Length = 1242

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 138/525 (26%), Positives = 231/525 (44%), Gaps = 88/525 (16%)

Query: 1   MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +L++++     +  +  IISR       G  P   +ST  F+    ++G+ +   L  FF
Sbjct: 484 ILDRRLTKASGKATMSRIISRLFLNARMGDIPNGAVSTMLFQKTCERLGHAK---LDSFF 540

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
            +WV   GCP  +    +NK+K +VE+ + +  + +P +R   L  NT   +   +I   
Sbjct: 541 QQWVLGAGCPRFQATQRFNKKKLVVEMMIKQVQSEQPSTRD--LDKNTFMRDVKEEIRGI 598

Query: 114 --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
                   + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +
Sbjct: 599 YAGVVQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 658

Query: 165 KPDGCDDN------------GDAV--------------AGLDMRSSMESPLSWIRADPEM 198
              G D N            GD +              +  D     +    WIR D + 
Sbjct: 659 AASGGDPNAEVQEDVLLYCLGDVLQSEEEIQEWKLADWSKEDEERMGQESYEWIRMDADF 718

Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDSK 255
           E++ ++    P  M+++QL++D DV+AQ    Q +AA    P +S   V T    L D +
Sbjct: 719 EWICKLSLVMPGYMYLSQLQQDRDVIAQLESLQYMAAQREHPLISTIFVRT----LMDRR 774

Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
            F+ +R+ AA AL   A EE +W GL HL      R F E   LP     R NDF D + 
Sbjct: 775 YFYGIRVAAAKALIKHAKEEINWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 829

Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS----- 365
           Y +   IP A++ VR  + K+P     F+   L++NDN+ N YSD +++A L+QS     
Sbjct: 830 YVLQLVIPEAISKVRDNNGKTPMRVKRFLYDKLRFNDNSNNEYSDNYYVATLMQSLCHAM 889

Query: 366 VGELE--------FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
           +G +E        F  + +L            +  IDR  + D    SY  + + + +R 
Sbjct: 890 LGRVESRSHDLDDFDMEGVLESQAEEKLEKDAIAEIDRYRRMDEWSSSYQNLYSRTTLR- 948

Query: 411 LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
             Q+ L  +    LD +++ + P+    T   +R+EA   L++ +
Sbjct: 949 -CQMQLMQAKITELD-IMQFL-PYTRAGTYDLLRLEAFECLVETD 990


>gi|238483151|ref|XP_002372814.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Aspergillus flavus NRRL3357]
 gi|220700864|gb|EED57202.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Aspergillus flavus NRRL3357]
          Length = 1290

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 138/525 (26%), Positives = 228/525 (43%), Gaps = 88/525 (16%)

Query: 1    MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +L++++     +  +  IISR       G  P   +ST  F+    ++G+ +   L  FF
Sbjct: 511  ILDRRLTKASGKATMSRIISRLFLNARMGDIPNGAVSTMLFQKTCERLGHAK---LDSFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
             +WV   GCP  +    +NK+K +VE+ + +  + +P +R   L  NT   +   +I   
Sbjct: 568  QQWVLGAGCPRFQATQRFNKKKLVVEMMIKQVQSEQPSTRD--LDKNTFMRDVKEEIRGI 625

Query: 114  --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
                    + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +
Sbjct: 626  YAGVVQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 685

Query: 165  KPDGCDDN------------GDAV--------------AGLDMRSSMESPLSWIRADPEM 198
               G D N            GD +              +  D     +    WIR D + 
Sbjct: 686  AASGGDPNAEVQEDVLLYCLGDVLQSEEEIQEWKLADWSKEDEERMGQESYEWIRMDADF 745

Query: 199  EYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDSK 255
            E++ ++    P  M+++QL++D DV+AQ    Q +AA    P +S   V T    L D +
Sbjct: 746  EWICKLSLVMPGYMYLSQLQQDRDVIAQLESLQYMAAQREHPLISTIFVRT----LMDRR 801

Query: 256  AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
             F+ +R+ AA AL   A EE +W GL HL      R F E   LP     R NDF D + 
Sbjct: 802  YFYGIRVAAAKALIKHAKEEINWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAA 856

Query: 311  YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS----- 365
            Y +   IP A++ VR  + K+P     F+   L++NDN+ N YSD +++A L+QS     
Sbjct: 857  YVLQLVIPEAISKVRDNNGKTPMRVKRFLYDKLRFNDNSNNEYSDNYYVATLMQSLCHAM 916

Query: 366  VGELE--------FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRT 410
            +G +E        F  + +L            +  IDR  + D    SY  + + + +R 
Sbjct: 917  LGRVESRSHDLDDFDMEGVLESQAEEKLEKDAIAEIDRYRRMDEWSSSYQNLYSRTTLR- 975

Query: 411  LTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
              Q+ L  +    LD  +    P+    T   +R+EA   L++ +
Sbjct: 976  -CQMQLMQAKITELD--IMQFLPYTRAGTYDLLRLEAFECLVETD 1017


>gi|403161521|ref|XP_003321845.2| hypothetical protein PGTG_03382 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375171304|gb|EFP77426.2| hypothetical protein PGTG_03382 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 2059

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 120/443 (27%), Positives = 199/443 (44%), Gaps = 69/443 (15%)

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           LK +  +W+   GCP+ ++   +N++K   EL  LR             +S  +  N   
Sbjct: 523 LKSWADQWIYASGCPIFQVTAYFNRKKLTHEL-TLRQLNRSKQYYEDDQTSWAEKCNLKP 581

Query: 111 DIGWPGMMSIRVHELDGM-YDHPI-LPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDG 168
              + G MSIR+ E DG+ Y+H + +      ++L     + K              PD 
Sbjct: 582 VDRFEGQMSIRIQESDGIPYEHVLDIKEPFKRYELPSNHKNKKFKRSSRRYASSNQAPDL 641

Query: 169 CDDNGDAVAGLDMRSSME--------------------SPLSWIRADPEMEYLAEIHFNQ 208
            + + +         S E                    S + WIR D E+E++ E  F  
Sbjct: 642 TETSTETFTYPPWERSQEERDRWKVEDWTEEEDLFITQSNVEWIRLDVEVEWICEFRFEM 701

Query: 209 PVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYAL 268
              MW+ QL++D DVVAQ +A+ AL+ LP  S  V + L   +  S+ F+R+RIEA  AL
Sbjct: 702 KEFMWLEQLQRDRDVVAQLEAVRALKTLP--SKVVSSHLCRVVLISEYFFRIRIEAILAL 759

Query: 269 ANTASEETDWAGLLHLVKFYKSR-RFDENI---------GLPRPNDFRDFSEYFVLEAIP 318
            + A+    + GL HL+K ++SR  +   +          +P+PN F +F++YF+ +A+ 
Sbjct: 760 VSCATSRCGFLGLFHLLKIFQSRFCYPPQVEPDSPWHIRAIPKPNQFDNFADYFIRKALV 819

Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV------------ 366
            A++ VR  +N SP    +F L  L YNDN+ NPYSD  ++A  + S+            
Sbjct: 820 VALSHVRDQNNNSPSVCHQFFLDQLVYNDNSLNPYSDAHYIATTIGSMTDSLIHLADNAG 879

Query: 367 ------------------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
                             G+LE      L L  ++  ++R    DRL+PSY  ++ ++ I
Sbjct: 880 REQLVEIFGTKALSTGTNGDLEKTCDKRLLL--MINELERCFMADRLVPSYRNLIAMAVI 937

Query: 409 RTLTQIALKLSGFISLDQVVKLI 431
               ++ L  +G + +D V  L+
Sbjct: 938 E--AKLKLMFAGLLPVDLVFFLL 958


>gi|444314387|ref|XP_004177851.1| hypothetical protein TBLA_0A05390 [Tetrapisispora blattae CBS 6284]
 gi|387510890|emb|CCH58332.1| hypothetical protein TBLA_0A05390 [Tetrapisispora blattae CBS 6284]
          Length = 1685

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 121/467 (25%), Positives = 213/467 (45%), Gaps = 71/467 (15%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +LS   F H   KV    +  L++FF +WV + G P+ R+  
Sbjct: 802  RVLPKIFLQAMSGDLPNNSLSASHFIHVCEKVN---KNKLEDFFQQWVYSSGVPIFRITQ 858

Query: 72   SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
             +NK++ +VE+ + +        D  +  D            +NR+    + G M+IR+H
Sbjct: 859  KFNKKRMVVEVGIRQVQNQELSQDKIIGKDGFCSSALHTLQDQNRNLPPFFTGSMTIRIH 918

Query: 124  ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP-----DGCDDNGDAVA 177
            E DG  Y+H +     +++  L+IQ ++K   R  +K    +K      D   D+ + V 
Sbjct: 919  EADGAPYEHIV--ELKESFTKLDIQYNTKYK-RTKIKKIVSTKSSLNLKDNAPDDYETVE 975

Query: 178  GLDMRSS-----------MESPLS--------------------------WIRADPEMEY 200
              + + S           + SP+                           WIR D + E+
Sbjct: 976  IFNEKDSEVIKIQKLGNVLTSPMDCYNWNLVDFNTTTEAHEYQKQADSFEWIRIDSDFEW 1035

Query: 201  LAEIHFNQPVQMWINQLEKDGDVVAQAQAI----AALEALPHLSFNVVNTLNNFLSDSKA 256
            + +I  NQP  M+ +QL +DGDV AQ ++I      +E     S    + L   + DS+ 
Sbjct: 1036 ICKIIINQPDYMFASQLRQDGDVEAQLESIRYYEDVIEHSSTSSNVYSSILTRTIMDSRY 1095

Query: 257  FWRVRIEAAYALANTASEETD----WAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYF 312
            F+ VRI A  AL+    +E D      G LHLV+ +++     N  +P+ NDF DF  Y+
Sbjct: 1096 FYGVRIAACRALSKYIFKEIDPDNFRGGPLHLVRIFQNLFCFNNSNIPKNNDFSDFRNYY 1155

Query: 313  VLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS-VGELEF 371
            + +AIP  ++ VR      P    + +L LL YN+N+ N + D +++  L++S V  +  
Sbjct: 1156 LRKAIPFFLSSVRDEHGNCPSFIRQLLLDLLTYNENSENIFDDTYYICTLIKSLVNAMLT 1215

Query: 372  GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
              Q  +F+  + ++  R    +  +P+Y  ++    ++T+ +   KL
Sbjct: 1216 EDQDKVFIKHVSEQFHRYYNLEEWLPTYQKLV----LKTIIEERFKL 1258


>gi|119484530|ref|XP_001262044.1| hypothetical protein NFIA_097760 [Neosartorya fischeri NRRL 181]
 gi|119410200|gb|EAW20147.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 1272

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 121/426 (28%), Positives = 191/426 (44%), Gaps = 68/426 (15%)

Query: 1   MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +L++++     +  +  IISR       G  P   +++ +F+    ++G+ +   L  FF
Sbjct: 508 ILDRRLTKASGKATMSRIISRMFLNARMGDIPNGAITSSQFQKICERLGHAK---LDTFF 564

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
            +WV   GCP  +    +NK+K +VE+ + +    +P +R   L  NT   +   +I   
Sbjct: 565 QQWVYGAGCPRFQATQRFNKKKLVVEMMIKQVQADQPSTRD--LDKNTFMRDVKEEIRGV 622

Query: 114 --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
                   + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +
Sbjct: 623 YAGNIQPVFSGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 682

Query: 165 KPDGCDDNGDAVAGL---------------------------DMRSSMESPLSWIRADPE 197
              G D N +    +                           + R   ES   WIR D +
Sbjct: 683 AVTGGDPNAETQEDVLLYCLGDVLQTEEEMQEWRLADWSKEDEDRMGQES-YEWIRMDAD 741

Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQ-----AQAIAALEALPHLSFNVVNTLNNFLS 252
            E++ ++    P  M+++QL++D DVVAQ      Q +AA    P +S   V T    L 
Sbjct: 742 FEWICKLSLVMPGYMYLSQLQQDRDVVAQLEVSSLQYMAAQREHPLISTIFVRT----LM 797

Query: 253 DSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRD 307
           D + F+ +R  AA AL   A EE +W GL HL      R F E   LP     R NDF D
Sbjct: 798 DRRYFYGIREAAARALVKHAKEEINWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSD 852

Query: 308 FSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
            + Y +  AIP A++ VR  D K+P     F+   LK+NDN+ N YSD F++A L+QS+ 
Sbjct: 853 RAAYVLQLAIPEAISKVRDNDGKTPLRVKRFLYDKLKFNDNSNNEYSDNFYVATLMQSLC 912

Query: 368 ELEFGQ 373
               G+
Sbjct: 913 HAMLGK 918


>gi|148697321|gb|EDL29268.1| mCG116451 [Mus musculus]
          Length = 1199

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 130/477 (27%), Positives = 220/477 (46%), Gaps = 72/477 (15%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
            NG+ V   DM       LS +  DP+M  L ++ F Q   MW  +L  + DVVAQ ++I
Sbjct: 608 -NGEEV---DMD------LSAMEIDPDMSVLRKVEFEQADFMWQYELRYERDVVAQQESI 657

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFYK 289
            ALE  P  +  +   L + L   + F+RVR  A + LA  A S  + W G   +   + 
Sbjct: 658 LALEKFPTPASRLA--LTDILEQEQCFYRVRTSACFCLAKIANSMVSTWTGPPAMKSLF- 714

Query: 290 SRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYND
Sbjct: 715 TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYND 774

Query: 348 NNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLM 396
           N  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L+
Sbjct: 775 NRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLL 834

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 835 PSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFVDIRIAALEAVVD 887


>gi|149066382|gb|EDM16255.1| TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated
           factor [Rattus norvegicus]
          Length = 1015

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 130/477 (27%), Positives = 221/477 (46%), Gaps = 72/477 (15%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 290 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 349

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 350 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 387

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 388 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 438

Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
            NG+ V   DM       LS +  DP+M  L ++ F Q   MW  +L  + DVVAQ ++I
Sbjct: 439 -NGEEV---DMD------LSAMEIDPDMSVLRKVEFEQADFMWQYELRYERDVVAQQESI 488

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFYK 289
            ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   + 
Sbjct: 489 LALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKIANSMVSTWTGPPAMKSLF- 545

Query: 290 SRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYND
Sbjct: 546 TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYND 605

Query: 348 NNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLM 396
           N  N +SD ++ A    AL  SV         +  L +L       L+ I R L  ++L+
Sbjct: 606 NRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLL 665

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 666 PSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFVDIRIAALEAVVD 718


>gi|353235918|emb|CCA67923.1| related to TAF2-component of TFIID complex [Piriformospora indica
           DSM 11827]
          Length = 1782

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 122/475 (25%), Positives = 217/475 (45%), Gaps = 84/475 (17%)

Query: 2   LEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLE-RPFLKEFFPRWVG 60
           + K   S    +I+  II  A        +ST  F     K   ++ RPF ++    WV 
Sbjct: 513 MHKTGTSQGLHRIIPKIILDALENESAAAISTHSFLRMCRKTAGIDFRPFAEQ----WVY 568

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG----WPG 116
             GCP   +   +N++K  VE+ V ++        +P    N +   +   +     + G
Sbjct: 569 GSGCPRFAVRSHFNRKKMAVEMHVKQE--------SPAYEFNKEDPVQLALLRPMPFFDG 620

Query: 117 MMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA-----ARRALKPKKGSKPDGCD 170
            M+IR+HE DG  Y+H +L +    ++  E+  ++K        +R +  ++ ++  G  
Sbjct: 621 QMTIRIHEADGTPYEH-VLDIR-QPYKKFEVPFNTKYKRVRRNTKRFIARQQAAEAAGNQ 678

Query: 171 DNGDAVAGLDMRSSM--------------------------ESPLSWIRADPEMEYLAEI 204
           +  + +  +DM   +                           +   WIR D ++E+L +I
Sbjct: 679 ETAELIGVVDMSFGLGIWEDDASREEWKVADWTEADEAHMATATYEWIRLDTDIEWLTKI 738

Query: 205 HFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEA 264
            F+Q   MW++QL++D DV+AQ +AI AL  +P  S  V +T    +  +  F+RVR EA
Sbjct: 739 SFDQEPFMWVSQLQRDRDVIAQIEAIHALSRIP--SKIVSSTFTKTVLTTTYFFRVRSEA 796

Query: 265 AYALANTASEETDWAGLLHLVKFYKSRRFD--------ENIGLPRPNDFRDFSEYFVLEA 316
           A +L   A    ++ GL HL K +    +D        +N  +P+ NDF DFSEYF+ +A
Sbjct: 797 ALSLVPCAGAAVEYLGLYHLYKLFFRYCYDPVNKEDIFKNEVVPKANDFSDFSEYFLRKA 856

Query: 317 IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV---------- 366
           +  A++ VR  + ++      F++  L+YNDN+ N +SD ++LA ++ ++          
Sbjct: 857 LITALSQVRVENGETVPIVRRFLIDQLRYNDNSTNNFSDSYYLAHIIGALGCSLVSTTPP 916

Query: 367 --GELE-----------FGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
             GEL+             Q +   L   +  +DR    DR++PS+   +T++ I
Sbjct: 917 ERGELQGDHIPKIVDDPRAQINKDILDGAIAEVDRYRDMDRVIPSWRNAITVATI 971


>gi|345483769|ref|XP_001599737.2| PREDICTED: transcription initiation factor TFIID subunit 2-like
           [Nasonia vitripennis]
          Length = 1227

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 124/499 (24%), Positives = 222/499 (44%), Gaps = 61/499 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       F   +  +    +  
Sbjct: 457 MLEHRIGQELLLQVFNKQLSLATNAAQ-QKIDSGLWSHMLISTNVFTKAIFTVTGKDMAV 515

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     + F +N+++N VEL + +D T                 +  G   
Sbjct: 516 FIDQWVRTGGHAKFTLSFKFNRKRNTVELEIRQDKT-----------------HERGIRK 558

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + + + ELDG + H  L + G   +  +I CHSK    +  K    +        G
Sbjct: 559 YVGPLLVNIQELDGTFKH-TLQIEGTVAKA-DITCHSKSRRNKKKKIPLCT--------G 608

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DP+M  +      QP   W  QL  + DV AQ +AI AL
Sbjct: 609 EEV-DMDLSAMDDSPVLWIRLDPDMTLMRAAQIEQPDYQWQYQLRHERDVTAQMEAIDAL 667

Query: 234 E--ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKS 290
           +  A P     + +T+ N       +++VR+ AA+ L   A+     WAG   ++  ++ 
Sbjct: 668 QNHATPATRLALTDTIEN----EHCYYKVRLRAAHCLTKVANAMVATWAGPPAMLAIFRK 723

Query: 291 RRFDENIG-LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
                +   + + N+F +F  YF+ + IP A+A +R A    P E + F++ L KYNDN+
Sbjct: 724 LFGSASCRRIIKQNNFTNFQHYFLQKTIPVAMAGLRNAHGICPPEVLSFLMDLFKYNDNS 783

Query: 350 GNPYSDVFWLAALVQSVGELEFGQQSILF------------LSSLLKRIDRLLQFDRLMP 397
            N YSD ++ AAL++++G       SI                ++L+ + R L  ++L+P
Sbjct: 784 KNRYSDNYYRAALIEALGATVTPVISIQHGAAITAESLSADTKAILEEVTRHLNLEKLLP 843

Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFH 457
            Y   ++++C++ +      L  F  L     + + +  +     VR+ A  AL+D    
Sbjct: 844 CYKYTVSVACLKVIR----ILQKFGHLPSTPLIFREYAAYGQFIDVRIAALTALVDFT-K 898

Query: 458 CNGIDSALSLFIKSVEEEP 476
            +G    L   +   E +P
Sbjct: 899 VDGKWEDLEFLLDMAENDP 917


>gi|295665422|ref|XP_002793262.1| transcription initiation factor TFIId 127kD subunit
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278176|gb|EEH33742.1| transcription initiation factor TFIId 127kD subunit
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1196

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 135/500 (27%), Positives = 213/500 (42%), Gaps = 91/500 (18%)

Query: 1   MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +LE+++     +  +  IISR       G  P  +L+T   +    ++G+ +   L +FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLHARMGDLPNGSLATSFLQKICERLGHTK---LDQFF 567

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKP---DSRTPVLSSNTDSENRDGDI 112
            +WV   GCP       +NK+K +VE+ + +  + +P   D        +   E ++   
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQPTVRDLHKLTFMRDVKEEIKNVYA 627

Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
           G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 628 GAVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAA 687

Query: 167 DGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEME 199
            G D N            GD +   +                R   ES   WIR D + E
Sbjct: 688 SGLDPNAELQDDVLLYCLGDVLQSEEEIQKWKLVEWSKEDEDRMGQES-YEWIRMDADFE 746

Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSKA 256
           ++ ++    P  M+++QL++D D+VAQ ++I   AA    P +S   + T    L D + 
Sbjct: 747 WICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFIRT----LMDRRY 802

Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSEY 311
           F+ +R  AA AL   A E+ DW GL HL        F E   LP     R NDF D S Y
Sbjct: 803 FYGIRAAAAKALVKHAKEDVDWIGLFHL-----ESAFQELFCLPGSPMTRSNDFSDRSSY 857

Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---- 367
            +   IP A+A VR +  K+P     F+   LK+NDN+ +  SD +++AAL++S+     
Sbjct: 858 ILQLTIPQAIAQVRDSTGKTPMRVKRFLYDKLKFNDNSNSEISDCYYVAALMKSICDALL 917

Query: 368 ----------------ELEFGQQSILFLS-SLLKRIDRLLQFDRLMPSYNGIL---TISC 407
                           E+E  +Q+   L    +  IDR  + D    S+  I     I C
Sbjct: 918 GRKENEPGNIDMDFDMEMELERQAEQQLDRDCIAEIDRYRRMDEWTSSFQNIYARTAIGC 977

Query: 408 IRTLTQIALKLSGFISLDQV 427
            R   Q     +G I LD V
Sbjct: 978 QRRFMQ-----AGVIELDLV 992


>gi|452842010|gb|EME43946.1| hypothetical protein DOTSEDRAFT_71677 [Dothistroma septosporum NZE10]
          Length = 1498

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 137/516 (26%), Positives = 235/516 (45%), Gaps = 83/516 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L K  GS    +I+  I+  A+ G      LST EF+    K+G+ +   L+ FF +WV 
Sbjct: 528  LIKSSGSTGVMRIINKILLNAKTGTLHNGALSTAEFQRTCEKLGHNK---LEHFFRQWVQ 584

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD-------GDIG 113
              GCP+  +   +NK+K +VE+ + R    +  ++     +N   E ++       G++ 
Sbjct: 585  GAGCPIFYVTQRFNKKKLVVEMEI-RQRQHERQTKPLFAPNNFMREVKEHVREVWAGELQ 643

Query: 114  --WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSK---LAARRALKPKKGSKPD 167
              + G M+IR+HE DG  Y+H +    G     LEI  ++K   L   R  + +  ++  
Sbjct: 644  PVFTGPMTIRIHEADGTPYEHIVEIKEGTT--KLEIPYNTKYKRLKRSRRQRERAAAEGQ 701

Query: 168  GCDDNGDAVAGLDMRSSMESP-----------------------LSWIRADPEMEYLAEI 204
               +N D      +   ++ P                         WIR D + E+L  I
Sbjct: 702  ANGENTDEALLYCLGDVLDRPEDMKEWNLMDWSKEDEDKMGGESYEWIRMDADFEWLGRI 761

Query: 205  HFNQPVQMWINQLEKDGDVVAQAQAIAAL--EALPHLSFNVVNTLNNFLSDSKAFWRVRI 262
            H  QP+ M+I+QL++D D+VAQ +++  L   A  H+S ++   L   L D + F+ +RI
Sbjct: 762  HLTQPLYMYISQLQQDRDIVAQHESMRHLCGAAAHHVSLSI---LLRTLMDERYFYGIRI 818

Query: 263  EAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVA 322
             AA  LA+ A  +    G +HL K +     +E+  +PRPNDF     + +  AIP A+ 
Sbjct: 819  LAAEGLASLARGQLLKIGQMHLTKAFAEMFCEEDGVMPRPNDFTSRVAFIMQCAIPRAMT 878

Query: 323  MVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------------- 367
             +R  + K P+   +F +  LK+NDN+ N YSD  +++ L+  +                
Sbjct: 879  KLRDDEGKVPKAVEQFFVDKLKFNDNSDNLYSDCHYVSTLMTCLAESLVVSHRIIRPTVA 938

Query: 368  ---ELEFGQQSIL-------------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTL 411
               E  FG++  +             F    +  I+R  + D  + +Y  I + + IR L
Sbjct: 939  PTYEFNFGEEEPVEEEEQEEFNPDADFEQMAIDEIERYRRIDEWVVTYQNIYSTTAIRCL 998

Query: 412  TQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEA 447
             +  L  +G +  D+  +L++  R  +T   VR+EA
Sbjct: 999  QK--LTKAGVVK-DKTKELLQYTRP-STADNVRLEA 1030


>gi|448533340|ref|XP_003870613.1| Tsm1 transcription initiation factor TFIID subunit [Candida
            orthopsilosis Co 90-125]
 gi|380354968|emb|CCG24484.1| Tsm1 transcription initiation factor TFIID subunit [Candida
            orthopsilosis]
          Length = 1371

 Score =  146 bits (369), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 120/445 (26%), Positives = 207/445 (46%), Gaps = 55/445 (12%)

Query: 24   GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
            G  P  TLST  F++   KV    R  L  FF +WV   G P  ++   +NK+K ++E+ 
Sbjct: 617  GDLPNGTLSTDHFQYVCEKVN---RNKLDSFFKQWVFGSGAPSFKISQRFNKKKGMIEMT 673

Query: 84   VLR-----DCTVKPDSRTPVLSS----NTDSENRDGDIGWPGMMSIRVHELDGM-YDHPI 133
            + +     + TV+ ++ + +  +    N +   +  ++ + G M+IRVHE DG  Y+H I
Sbjct: 674  IRQVQHHENKTVRLNAESFIDDALCFINKEPLPQRQNV-FTGPMNIRVHESDGTPYEHII 732

Query: 134  LPMAGDAWQLLEIQCHSKLA----ARRALKPKKGSKPDGCDDN--GDAVAGLDMRSSME- 186
                     + E++    ++     R+  K  +G  P G + N  GD +   + +   E 
Sbjct: 733  --------HIKEVKTTIDISLSKKVRKIKKKDEGVDP-GINFNQFGDILISEEDKRKWEL 783

Query: 187  -------------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
                          P  WIR D + E++A+I   QP  M+ +QL  D DV AQ  A+   
Sbjct: 784  QDWESRDEDSLGAEPFEWIRGDIDFEWIAKIEVLQPDFMYGSQLVYDRDVEAQFDALRYF 843

Query: 234  EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF 293
              L   S      L     D + F+R+RI AA ALA  ++   D+ G+ +L+K YK    
Sbjct: 844  GNLEKPSVIYCTALVRTFLDERYFYRIRIAAAEALATCSNASNDFIGMKYLIKAYKHLYC 903

Query: 294  DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPY 353
              +  +P+ NDF DF  + +  A+P  +  +   + + P++  + +   +K+NDN  N Y
Sbjct: 904  FPDSSIPKANDFNDFKLFLIQRAVPGILCRITDDEGRVPKDVQDLLFNFVKFNDNAMNDY 963

Query: 354  SDVFWLAALVQSV----------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGIL 403
             D  +L+ L++++             E   Q   F   +   I RL + D   PSY  IL
Sbjct: 964  QDTLYLSDLIKALTTSAISGWRKDTAEGNNQE--FAHKVNAEITRLQKLDEWRPSYQRIL 1021

Query: 404  TISCIRTLTQIALKLSGFISLDQVV 428
              +C ++  Q+AL  +  +SL++++
Sbjct: 1022 QTTCFKSRVQMALYGAITLSLEEMM 1046


>gi|328708804|ref|XP_001952652.2| PREDICTED: transcription initiation factor TFIID subunit 2-like
           isoform 1 [Acyrthosiphon pisum]
          Length = 1215

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 148/580 (25%), Positives = 263/580 (45%), Gaps = 75/580 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGAS--PVRT-------LSTKEFRHFANKVGNLERPFL 51
           MLE ++G +   ++L   +S A  AS  P+ +       +ST  F      V  ++   +
Sbjct: 436 MLENRIGYSLLLQVLNKHLSLASNASQNPMNSGLWGHMLISTNIFNKTIFTVSGID---M 492

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +WV T G       F +N+++N +EL + +D   +   R  V             
Sbjct: 493 SVFIDQWVRTGGHAKFTCTFVFNRKRNTLELEIKQDAVCQRGIRKYV------------- 539

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
               G + + + ELDG + H  L + G+  +  +I CHSK    +  K    +       
Sbjct: 540 ----GPLLVSIQELDGPFKH-TLQVEGNVARA-DITCHSKSRRNKKKKIPLCT------- 586

Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
            GD V  +D+ +   +SP+ WIR DPEM  L      QP   W  QL  + DV AQ +A+
Sbjct: 587 -GDEV-DIDLNAMDNDSPVLWIRLDPEMTILRSCIIEQPEYQWNYQLRHEKDVTAQMEAV 644

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYK 289
            AL+  P ++    N L + + + + +  VR +AA+ L   A+   T+W G   ++  +K
Sbjct: 645 VALDKFPTIA--TRNALCDIIENEQVYIHVRCKAAHCLTKIANAMVTNWTGPPAMLATFK 702

Query: 290 SRRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
             +   +   P   + N+F +   Y++ + IP A+A +R +    P E ++F+L L KYN
Sbjct: 703 --KMFGSFSCPHIIKQNNFSNLQHYYLQKTIPIAMAGLRNSHGICPPEILKFILDLFKYN 760

Query: 347 DNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSS--------LLKRIDRLLQFD 393
           DN+ N YSD ++ A+L+ ++G      +   Q++    +         +L+ I + L  +
Sbjct: 761 DNSKNRYSDNYYRASLIDALGATITPVISLMQKACDITADSLSGDSKLILEEITQHLNLE 820

Query: 394 RLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           + + +Y   +TI+C++ +     KL  F  L     + + +  +     VR+ A  AL+D
Sbjct: 821 KRLQTYKYPITITCLKAIR----KLQKFGHLPNNPNIFRSYSSYGNFIGVRLCALEALVD 876

Query: 454 LEFHC-NG--IDSALSLFIKSVEEEPSLRGQ-VKLGIHAMRICQIKGGSDSNHEVDTVTL 509
             F C +G   D    + I   + +P +R + V++          K  +  N++  +  L
Sbjct: 877 --FTCLDGRWDDLKYLIDIADSDPDPKVRNELVRMLCKNPPFTPKKPNNRLNNDYLSHHL 934

Query: 510 VALLNLL---ESRIAFNNVFLRHHLFGILQILAGRAPTLY 546
             L+N L   ++ +  N V L + LFG  +IL      LY
Sbjct: 935 WKLINSLLSYDTTLRCNFVDLYYTLFGRQKILRAEVAALY 974


>gi|328708802|ref|XP_003243802.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
            isoform 2 [Acyrthosiphon pisum]
          Length = 1249

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 148/580 (25%), Positives = 263/580 (45%), Gaps = 75/580 (12%)

Query: 1    MLEKQMGSNFFRKILQNIISRAQGAS--PVRT-------LSTKEFRHFANKVGNLERPFL 51
            MLE ++G +   ++L   +S A  AS  P+ +       +ST  F      V  ++   +
Sbjct: 470  MLENRIGYSLLLQVLNKHLSLASNASQNPMNSGLWGHMLISTNIFNKTIFTVSGID---M 526

Query: 52   KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
              F  +WV T G       F +N+++N +EL + +D   +   R  V             
Sbjct: 527  SVFIDQWVRTGGHAKFTCTFVFNRKRNTLELEIKQDAVCQRGIRKYV------------- 573

Query: 112  IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
                G + + + ELDG + H  L + G+  +  +I CHSK    +  K    +       
Sbjct: 574  ----GPLLVSIQELDGPFKH-TLQVEGNVARA-DITCHSKSRRNKKKKIPLCT------- 620

Query: 172  NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
             GD V  +D+ +   +SP+ WIR DPEM  L      QP   W  QL  + DV AQ +A+
Sbjct: 621  -GDEV-DIDLNAMDNDSPVLWIRLDPEMTILRSCIIEQPEYQWNYQLRHEKDVTAQMEAV 678

Query: 231  AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYK 289
             AL+  P ++    N L + + + + +  VR +AA+ L   A+   T+W G   ++  +K
Sbjct: 679  VALDKFPTIA--TRNALCDIIENEQVYIHVRCKAAHCLTKIANAMVTNWTGPPAMLATFK 736

Query: 290  SRRFDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
              +   +   P   + N+F +   Y++ + IP A+A +R +    P E ++F+L L KYN
Sbjct: 737  --KMFGSFSCPHIIKQNNFSNLQHYYLQKTIPIAMAGLRNSHGICPPEILKFILDLFKYN 794

Query: 347  DNNGNPYSDVFWLAALVQSVGE-----LEFGQQSILFLSS--------LLKRIDRLLQFD 393
            DN+ N YSD ++ A+L+ ++G      +   Q++    +         +L+ I + L  +
Sbjct: 795  DNSKNRYSDNYYRASLIDALGATITPVISLMQKACDITADSLSGDSKLILEEITQHLNLE 854

Query: 394  RLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
            + + +Y   +TI+C++ +     KL  F  L     + + +  +     VR+ A  AL+D
Sbjct: 855  KRLQTYKYPITITCLKAIR----KLQKFGHLPNNPNIFRSYSSYGNFIGVRLCALEALVD 910

Query: 454  LEFHC-NG--IDSALSLFIKSVEEEPSLRGQ-VKLGIHAMRICQIKGGSDSNHEVDTVTL 509
              F C +G   D    + I   + +P +R + V++          K  +  N++  +  L
Sbjct: 911  --FTCLDGRWDDLKYLIDIADSDPDPKVRNELVRMLCKNPPFTPKKPNNRLNNDYLSHHL 968

Query: 510  VALLNLL---ESRIAFNNVFLRHHLFGILQILAGRAPTLY 546
              L+N L   ++ +  N V L + LFG  +IL      LY
Sbjct: 969  WKLINSLLSYDTTLRCNFVDLYYTLFGRQKILRAEVAALY 1008


>gi|121719809|ref|XP_001276603.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
 gi|119404815|gb|EAW15177.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
          Length = 1269

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 140/524 (26%), Positives = 224/524 (42%), Gaps = 92/524 (17%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L K  G     +I+  I   A+ G  P   +++ +F+    ++G+ +   L  FF +WV 
Sbjct: 513  LTKASGKATMSRIISRIFLNARMGDLPNGAITSSQFQKTCERLGHAK---LDTFFQQWVY 569

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT-----DSENRDGDIG-- 113
              GCP  +    +NK+K +VE+ + +    +P +R   L  NT       E R    G  
Sbjct: 570  GAGCPRFQATQRFNKKKLVVEMMIKQVQADQPSTRD--LDRNTFMRDVKEEIRGVYAGHV 627

Query: 114  ---WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGC 169
               + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +   G 
Sbjct: 628  QPVFSGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAATIGG 687

Query: 170  DDNGDAVAGL---------------------------DMRSSMESPLSWIRADPEMEYLA 202
            D N +    +                           + R   ES   WIR D + E++ 
Sbjct: 688  DPNAEVQEDVLLYCLGDVLQTEEEMQEWRLADWSKEDEDRMGQES-YEWIRMDADFEWIC 746

Query: 203  EIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
            ++    P  M+++QL++D DV+AQ    Q +AA    P +S   V T    L D + F+ 
Sbjct: 747  KLSLVMPGYMYLSQLQQDRDVIAQLESLQYMAAQREHPLISTIFVRT----LMDRRYFYG 802

Query: 260  VRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSEYFVL 314
            +R  AA AL   A EE +W GL HL      R F E   LP     R NDF D + Y + 
Sbjct: 803  IRAAAARALVKHAKEEVNWLGLFHL-----ERAFQELFCLPGSPMTRSNDFSDRAAYVLQ 857

Query: 315  EAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS-----VGEL 369
              IP A++ VR  D K+P     F+   LK+NDN+ N YSD F++A L++S     +G++
Sbjct: 858  LVIPEAISKVRDNDGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMRSLCHAMLGKV 917

Query: 370  --------EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTL 411
                    EF  + +L          + +  IDR  + D    S+  + +   + C R  
Sbjct: 918  ESRSADMDEFDMERMLEAQAEEELDKAAIAEIDRYRRMDEWSSSFQNVYSRAALHCQRQF 977

Query: 412  TQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
             Q  +       LD  +    P+    T   +R +A   L++L+
Sbjct: 978  MQAKI-------LDLDIMQFLPYTRAGTYDLLRADAFEGLVELD 1014


>gi|195035725|ref|XP_001989322.1| GH10122 [Drosophila grimshawi]
 gi|193905322|gb|EDW04189.1| GH10122 [Drosophila grimshawi]
          Length = 1138

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 129/447 (28%), Positives = 218/447 (48%), Gaps = 54/447 (12%)

Query: 57  RWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPG 116
           +WV T G     +   +N ++N +EL + +D   K   R                  + G
Sbjct: 516 QWVRTGGHVKFSVISVFNCKRNTIELDIRQDYIGKRGVRQ-----------------YNG 558

Query: 117 MMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAV 176
            + +++ ELDG + H +     D      I CHSK    +  K    +        G+ +
Sbjct: 559 PLMVQLQELDGTFKHTL--QIEDTLTRSVISCHSKSRRNKKKKIPLWT--------GEEI 608

Query: 177 AGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL 236
             +D+ +  +SP+ W+R DPEM  L ++   QP   W  QL  + DV AQ QAI ALE  
Sbjct: 609 -DMDLSAMDDSPVLWVRIDPEMILLRDLIVEQPDFQWQYQLRHERDVTAQFQAIRALER- 666

Query: 237 PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAG----LLHLVKFYKSR 291
            + +    + L + + +   F+RVR EAA+ L   A++  D W+G    L   +K + S 
Sbjct: 667 -YATSATRSALEDTIENECCFYRVRCEAAHCLTKVANQMVDSWSGPPPMLDIFIKMFGSL 725

Query: 292 RFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGN 351
           +    + L   NDF +F  Y++ +AIP A+A +R      P + V F+L L+KYN+N+ N
Sbjct: 726 QAPHIVRL---NDFSNFQLYYLQKAIPVAMAGLRTLHGICPPDVVRFLLDLIKYNENSQN 782

Query: 352 PYSDVFWLAALVQSVGE-LEFGQQSILFLSS---------LLKRIDRLLQFDRLMPSYNG 401
           PYSD ++ AALV ++G+ L  G QS+   SS         +L  + R L+ +  +PSY  
Sbjct: 783 PYSDSYYRAALVTALGDTLSPGYQSLTTDSSSSISTDPKLVLSAVTRFLRGEHHLPSYKY 842

Query: 402 ILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL-EFHCNG 460
           ++ ++C++ + ++  KL    SL  + +    +R +     +R+ A   L+DL + H +G
Sbjct: 843 MVLVACLKVIRRLQ-KLGHLPSLPHIFRSYSQYRSY---LDLRLSAMECLVDLVQVHESG 898

Query: 461 IDSALSLFIKSVEEEPSLR-GQVKLGI 486
            D    + +   + +P+ R G V+L I
Sbjct: 899 EDLEHLMHLLETDPDPAARYGLVQLLI 925


>gi|410078536|ref|XP_003956849.1| hypothetical protein KAFR_0D00670 [Kazachstania africana CBS 2517]
 gi|372463434|emb|CCF57714.1| hypothetical protein KAFR_0D00670 [Kazachstania africana CBS 2517]
          Length = 1388

 Score =  145 bits (366), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 125/491 (25%), Positives = 221/491 (45%), Gaps = 79/491 (16%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +LS   F+H   +V    +  L+ FF +WV   G P+ R+  
Sbjct: 601  RVLPKIFLQAMSGDLPNNSLSASHFQHICERVN---KNKLEYFFQQWVYGSGVPIFRVTQ 657

Query: 72   SYNKRKNIVELAVLRDCTVKPDSRTPVLSS---------NTDSENRDGDIGWPGMMSIRV 122
             +NK++ ++EL + R C      +   + S         +   E +D    + G M+IR+
Sbjct: 658  RFNKKRMVIELGI-RQCQNTELGQDKTVGSSGFFNSALHHLKHEEKDIVPFFTGSMTIRI 716

Query: 123  HELDGM-YDHPILPMAGDAWQLLEIQCHSK---LAARRALKPKK--GSKPDGCDD--NGD 174
            HE DG  Y+H +     D +  ++IQ ++K   L  RR     K   S  +  DD  +G 
Sbjct: 717  HEADGTPYEHIV--ELKDIFTKIDIQYNTKYKKLRGRRMASTLKLNASTSNLQDDFIDGS 774

Query: 175  AVAGLDMRSSMESPLS------------------------------WIRADPEMEYLAEI 204
             V   +      + LS                              WIR D + E++ ++
Sbjct: 775  VVDKDEEVQKFGTVLSSEEDCKKWNLENLTKTTEGNELQKQNEVFEWIRIDSDFEWICKM 834

Query: 205  HFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL----------SDS 254
            + NQP  M+ +QL++DGDV AQ ++I   E +       VN   N L           D 
Sbjct: 835  YINQPDYMFASQLQQDGDVEAQFESIKYFEDV------TVNANKNSLIYSSILARTAMDD 888

Query: 255  KAFWRVRIEAAYALAN--TASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYF 312
            + F+ +R+EA  ALA      EE    G  HL+K +++    E+  +P+ N F  F +Y 
Sbjct: 889  RYFYGIRLEACKALAKFIYRPEEIFLGGSRHLIKIFQTLFCFEDSNIPKNNRFDSFQKYL 948

Query: 313  VLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG 372
            +   IP  ++ VR   N  P    +F+L +L YN+N  NP+ D+ +L  L+ +V      
Sbjct: 949  LQREIPSFLSSVRTEQNDCPSFVKQFLLDILTYNENAENPFDDIIYLTRLINNVVTCTIN 1008

Query: 373  -QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL--SGFISLDQVVK 429
             ++   FL  ++++++R    D  +P+Y     +  ++T+ +  L+L  +G      + K
Sbjct: 1009 DKEDKSFLKKVVEQLNRYENLDTWIPTYQ----VKLMKTILEEKLRLNIAGLYDFGNLKK 1064

Query: 430  LIKPFRDFNTI 440
             ++   +++T+
Sbjct: 1065 YLQYSLNYDTL 1075


>gi|443691774|gb|ELT93532.1| hypothetical protein CAPTEDRAFT_221667 [Capitella teleta]
          Length = 1210

 Score =  145 bits (366), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 126/470 (26%), Positives = 209/470 (44%), Gaps = 67/470 (14%)

Query: 25  ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV 84
           ++P + +ST          G   +PF++E    WV   G       F +N+++NI+EL +
Sbjct: 468 STPAKAISTV--------TGKDIQPFIQE----WVQQGGHARFFGQFVFNRKRNIIELEL 515

Query: 85  LRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLL 144
            ++ T K                  G + + G +++ + ELDG ++H       +     
Sbjct: 516 KQEWTTK------------------GALRYVGPLTVVLQELDGTFNHTF--KIEENKTKF 555

Query: 145 EIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEI 204
           EI CHSK    +  K    +  +   D GD  A        +SP+ W+R DPEM  L  +
Sbjct: 556 EITCHSKSRRHKKKKIPLCTGEEIDMDLGDTDA--------DSPVLWMRIDPEMFILRSV 607

Query: 205 HFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEA 264
              QP  MW   L  +  VV+Q QAI AL   P     +   L + +   + F+RVRI A
Sbjct: 608 QLEQPDFMWQYLLRYERCVVSQCQAITALTNYPTPKTRLA--LTDTIESDQCFYRVRIAA 665

Query: 265 AYALANTASEET-DWAGLLHLVKFYKSRRFDENIGLP---RPNDFRDFSEYFVLEAIPHA 320
           A  L   A+     WAG   ++  +  R+   +   P   R N F +F  YF+ +AIP  
Sbjct: 666 AEGLVKVANSMVGSWAGPPAMMTIF--RKLFGSHSCPTIVRQNHFNNFQHYFLQKAIPGL 723

Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLS 380
           +A +R      P +   F+L L KYNDN+ N YSD ++ AAL+ +V E      +++ ++
Sbjct: 724 MAGLRTVHGICPPDVHGFILDLFKYNDNSKNKYSDNYYRAALIDAVAETITPAVTMVTVT 783

Query: 381 SL--------------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQ 426
            +              L+ + R L  ++L+P Y  ++T+SC++ +     KL  F  L  
Sbjct: 784 GMCPSTDSLTPDTSLILEELTRYLNLEKLLPCYRLVITVSCLKAIR----KLQKFGHLPS 839

Query: 427 VVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEP 476
              + + + +      VR+ A  AL+D      G        ++ +EE+P
Sbjct: 840 DAAIFRSYAEQGNFLDVRLAALEALVDFT-KVEGSFEIWQWLLEVIEEDP 888


>gi|50292707|ref|XP_448786.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528099|emb|CAG61756.1| unnamed protein product [Candida glabrata]
          Length = 1385

 Score =  145 bits (366), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 121/458 (26%), Positives = 211/458 (46%), Gaps = 69/458 (15%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P   L++  F+H   KV    +  L+ FF +WV   G P+ R+  
Sbjct: 580  RVLPKIFLQAMSGDLPNNALTSSHFQHVCEKVN---KSKLESFFQQWVFGSGVPLFRVTQ 636

Query: 72   SYNKRKNIVELAVL-----------RDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
             YNK++ +VE+ +            RD +   D     L+S  ++ N +    + G ++I
Sbjct: 637  RYNKKRMVVEMGIRQYPNDTDEFNERDKSKMEDFFNKALNS-LEAPNVNKTSIFTGSITI 695

Query: 121  RVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDG----------- 168
            R+H+ DG  Y+H  L    D +  ++IQ ++K    R  K  +  K +            
Sbjct: 696  RIHDSDGTPYEH--LVEIKDIYTKVDIQYNTKHKKTRTRKNNEDEKTNEIRYGTSGSDIS 753

Query: 169  -----------CDDNG-----DAVAGL---DMRSSMESPLSWIRADPEMEYLAEIHFNQP 209
                       C D G      AVA L   +++ + ES + WIR D + E+L+  + NQP
Sbjct: 754  FLGNILDSDTECQDWGIVHPSTAVANLSGDEIQRANESTIEWIRIDSDFEWLSTFYLNQP 813

Query: 210  VQMWINQLEKDGDVVAQAQAIAALEALPHLSFNV---------VNTLNNFLSDSKAFWRV 260
              M+  QL++D D+ AQ      LE++ +   N+          + L   L D + F+ +
Sbjct: 814  DYMFATQLQRDSDIEAQ------LESVWYYGDNLRDNMASKIYASQLVRTLMDDRYFYGI 867

Query: 261  RIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHA 320
            RI+AA AL+    + TD    L  +  +K     +N  +P+ N+F  F +YF+  + P  
Sbjct: 868  RIQAAKALSRYICKPTDKEHFL--LTAFKKLYCHDNSNIPKSNNFSSFQDYFLQMSFPSL 925

Query: 321  VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQ-SVGELEFGQQSILFL 379
            +A ++     +P    +F+L +L YNDN+ N Y D ++L  + + +V     G ++   +
Sbjct: 926  IAEIKDHYGDTPDYIGDFLLDILSYNDNSENSYYDTYYLCEIFEFTVNCCNGGNRN--HV 983

Query: 380  SSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALK 417
            + L+++I R    +   PSYN  LT   I    ++  K
Sbjct: 984  TKLMEQIRRYEHLEYWSPSYNLALTSKLIEIKCRLHTK 1021


>gi|115491151|ref|XP_001210203.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114197063|gb|EAU38763.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 1260

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 136/527 (25%), Positives = 231/527 (43%), Gaps = 91/527 (17%)

Query: 1    MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +LE+++     +  +  I+SR       G  P   +++  F+    ++G+ +   L  FF
Sbjct: 503  ILERRLNKASGKPTMTRIVSRVFLNSRMGDIPNGAITSSHFQKTCERLGHAK---LDTFF 559

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
             +WV   GCP  +    +NK+K +VE+ + +  + +P +R   L  NT   +   +I   
Sbjct: 560  QQWVYGAGCPRFQATQRFNKKKLVVEMVIRQIQSEQPSTRD--LDRNTFMRDVKEEIRGV 617

Query: 114  --------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
                    + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +
Sbjct: 618  YAGAIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAA 677

Query: 165  KPDGCDDNGDAVAGL---------------------------DMRSSMESPLSWIRADPE 197
               G D N +    +                           + R   ES   WIR D +
Sbjct: 678  AVTGTDPNAETQEDVLLYCLGDVLQSEEEMQEWRLADWSKEEEERMGQES-YEWIRMDAD 736

Query: 198  MEYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDS 254
             E++ ++    P  M+++QL++D DVVAQ    Q +AA    P +S   V T    L D 
Sbjct: 737  FEWICKLSLVMPGYMYLSQLQQDRDVVAQLESLQYMAAQREHPLISTIFVRT----LMDR 792

Query: 255  KAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFS 309
            + F  +R  AA AL   A E+ +W GL HL      R F E   +P     R NDF D +
Sbjct: 793  RYFHGIRAAAARALVKHAKEDINWLGLFHL-----ERAFQELFCIPGSPMTRSNDFSDRA 847

Query: 310  EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS---- 365
             Y +   IP A++ VR  + K+P     F+   L++NDN+ N YSD F++A L+QS    
Sbjct: 848  AYALQLIIPEAISKVRDNNGKTPMRVKRFLYDKLRFNDNSNNEYSDNFYVATLMQSLCHA 907

Query: 366  -VGELE---------FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
             +G++E         F  + +L            L  IDR  + D    SY  + + + +
Sbjct: 908  LLGKVETRSEDLDDNFDMERVLEAQAEEQLERDALAEIDRYRRMDEWSSSYQNVYSRAAL 967

Query: 409  RTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
            R   Q+ L  +  + +D V++ + P+    T   +R++A   L++ +
Sbjct: 968  R--CQMQLMQANILDMD-VMQFL-PYTRAGTYDLLRLDAFECLVETD 1010


>gi|261198060|ref|XP_002625432.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
           dermatitidis SLH14081]
 gi|239595395|gb|EEQ77976.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
           dermatitidis SLH14081]
          Length = 1250

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 121/424 (28%), Positives = 188/424 (44%), Gaps = 64/424 (15%)

Query: 1   MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +LE+++     +  +  IISR       G  P   L+T  F+    ++G+ +   L  FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLNARMGDLPNGALTTSFFQRTCERLGHTK---LDSFF 567

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR----DCTVKPDSRTPVLSSNTDSENRDGD 111
            +WV   GCP       +NK+K +VE+ + +      TV+ D            E ++  
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQATVR-DLEKSTFMREVKEEIKNIY 626

Query: 112 IG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ + 
Sbjct: 627 AGPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAA 686

Query: 166 PDGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEM 198
             G D N            GD +   +                R   ES   WIR D + 
Sbjct: 687 TSGMDPNAELQDDVLLYCLGDVLQSEEEIQKWKLVEWSKEDEDRMGQES-YEWIRMDADF 745

Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSK 255
           E++ ++    P  M+++QL++D D+VAQ ++I   AA    P +S   V T    L D +
Sbjct: 746 EWICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRR 801

Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
            F+ +R  AA AL   A +E DW GL HL        F E   LP     R NDF D + 
Sbjct: 802 YFYGIRAAAAKALVKHAKDEVDWIGLYHL-----ETAFQELFCLPGSPMTRSNDFSDRAA 856

Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELE 370
           Y +   IP A+A VR    K+P     F+   LK+NDN+ N  SD +++AAL++S+ E  
Sbjct: 857 YILQLTIPQAIAQVRDNTGKTPMRVKRFLYDKLKFNDNSNNELSDCYYIAALMKSICEAL 916

Query: 371 FGQQ 374
            G++
Sbjct: 917 LGRK 920


>gi|239607754|gb|EEQ84741.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
           dermatitidis ER-3]
 gi|327354600|gb|EGE83457.1| transcription initiation factor TFIId 127kD subunit [Ajellomyces
           dermatitidis ATCC 18188]
          Length = 1250

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 121/424 (28%), Positives = 188/424 (44%), Gaps = 64/424 (15%)

Query: 1   MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +LE+++     +  +  IISR       G  P   L+T  F+    ++G+ +   L  FF
Sbjct: 511 ILERRLSKASGKATISRIISRIFLNARMGDLPNGALTTSFFQRTCERLGHTK---LDSFF 567

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR----DCTVKPDSRTPVLSSNTDSENRDGD 111
            +WV   GCP       +NK+K +VE+ + +      TV+ D            E ++  
Sbjct: 568 QQWVYGAGCPRFLATQRFNKKKLVVEMMIRQVQSEQATVR-DLEKSTFMREVKEEIKNIY 626

Query: 112 IG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK 165
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ + 
Sbjct: 627 AGPVQHVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAA 686

Query: 166 PDGCDDN------------GDAVAGLDM---------------RSSMESPLSWIRADPEM 198
             G D N            GD +   +                R   ES   WIR D + 
Sbjct: 687 TSGMDPNAELQDDVLLYCLGDVLQSEEEIQKWKLVEWSKEDEDRMGQES-YEWIRMDADF 745

Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI---AALEALPHLSFNVVNTLNNFLSDSK 255
           E++ ++    P  M+++QL++D D+VAQ ++I   AA    P +S   V T    L D +
Sbjct: 746 EWICKMSLTMPGYMYLSQLQQDRDIVAQLESIQYMAAQREHPLISTIFVRT----LMDRR 801

Query: 256 AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----RPNDFRDFSE 310
            F+ +R  AA AL   A +E DW GL HL        F E   LP     R NDF D + 
Sbjct: 802 YFYGIRAAAAKALVKHAKDEVDWIGLYHL-----ETAFQELFCLPGSPMTRSNDFSDRAA 856

Query: 311 YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELE 370
           Y +   IP A+A VR    K+P     F+   LK+NDN+ N  SD +++AAL++S+ E  
Sbjct: 857 YILQLTIPQAIAQVRDNTGKTPMRVKRFLYDKLKFNDNSNNELSDCYYIAALMKSICEAL 916

Query: 371 FGQQ 374
            G++
Sbjct: 917 LGRK 920


>gi|346319804|gb|EGX89405.1| transcription initiation factor [Cordyceps militaris CM01]
          Length = 1463

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 139/513 (27%), Positives = 231/513 (45%), Gaps = 74/513 (14%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            + K  G +   ++LQ +++RAQ  G+     L T++F+    K G+  R  L+ F+ +WV
Sbjct: 534  MNKASGGHGLTRVLQKLLTRAQIEGSDKSMILDTEKFKTICEK-GSRYR--LETFWNQWV 590

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT---DSENRDGDIG--- 113
               GCP   +   +NK++  VEL + +    K  ++ P L  +    +   R  ++    
Sbjct: 591  YGSGCPKFDVKARFNKKRLCVELTLNQVQFQK--AKKPQLEKDDFLREIRERRAEVPLGE 648

Query: 114  ----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKKG 163
                + G M++R+HE DG  Y+H IL +  DA +  + +      + +L   R +K K  
Sbjct: 649  TQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRATKFEIPYNTKYKRLKRTRRMKEKHT 707

Query: 164  SKPDGCDDNGDAVAGLDMRSSMESP-----------------------LSWIRADPEMEY 200
             +     DN D      +   ++SP                         WIR D + E+
Sbjct: 708  GQNIEITDNNDDALLYCLGDVLQSPEDQREWELIDWDPETERKMDQESYEWIRLDADFEW 767

Query: 201  LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRV 260
             +++       M+++QL++D DVVAQ  A+  L   P L       L   L D + F  +
Sbjct: 768  ASDMSRTLEPYMYVSQLQQDRDVVAQQDAMLYLAQGP-LHPIASGFLVRTLIDRRYFHGI 826

Query: 261  RIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPH 319
            R  AA AL   ++ ++    G+  L++ ++     E    PRPNDF D  +Y V  AI  
Sbjct: 827  RTMAAAALTRQSNIKDIPLLGMRQLMRAFREMYCYEQTNQPRPNDFTDKKQYNVRCAIIK 886

Query: 320  AVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF-------- 371
            A+A VR   NK P EA  F+L  L+ N+N GNPYSD F++A LV+S+             
Sbjct: 887  AIAEVRDHTNKCPMEARRFILDQLRLNNNEGNPYSDHFYIAFLVKSLSISLIPSKQDDWL 946

Query: 372  -------GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI---RTLTQIALKLSGF 421
                    ++   FL   L++I+R+L+ D    SY    T++ +   + L +  +    F
Sbjct: 947  TRQNRIPNEEETAFLEDALEQIERVLRRDEWTNSYQNTWTLAGLDAKQRLMKADVIPKNF 1006

Query: 422  ISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
                Q   L+   RD      +R++   AL+DL
Sbjct: 1007 TDFGQY--LLDGTRDL-----IRIKCFEALIDL 1032


>gi|449667464|ref|XP_002158433.2| PREDICTED: transcription initiation factor TFIID subunit 2-like,
           partial [Hydra magnipapillata]
          Length = 1033

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 103/375 (27%), Positives = 183/375 (48%), Gaps = 61/375 (16%)

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +W+   G   +   FS+N+++NI+EL V   C  K                  G   
Sbjct: 506 FLDQWICHSGVAKINASFSFNRKRNIIELEV-SQCVGK------------------GCQK 546

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G M I + ELDG + H +     D     E+ CHSK    +         P    D  
Sbjct: 547 YVGPMIICIQELDGSFPHTV--QVEDIISQHELPCHSKSRRNKKK-----KIPLITGDEV 599

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           D    + + S  +SP+ W+R DP+M +   +   QP  MW  QL  + DV +Q +A+ AL
Sbjct: 600 DMDLSVTLDS--DSPVLWVRIDPDMTWPRHVSLQQPDYMWQYQLRYERDVTSQIEAVRAL 657

Query: 234 EALPHLSFN--VVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYKS 290
           E  P       +++T+NN     +AF++VR+EAA+ LA  A+ E + W G   +++ ++ 
Sbjct: 658 EKYPCSGTRSALIDTINNV----EAFFKVRMEAAFCLAKIANTEVEVWDGESAMIEIFE- 712

Query: 291 RRFDENI--GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDN 348
           + F  N    +P+ N+F +F+ YF+ + +P +++M++ + ++  R  ++F+L+LLKYNDN
Sbjct: 713 KMFGSNACASIPKFNNFSNFTGYFMQKVLPTSISMLKNSQSQCTRRTLQFILELLKYNDN 772

Query: 349 NGNPYSDVFWLAALVQSV-------------------GELEFGQQSILFLSSLLKRIDRL 389
             N + D F++++L+  +                     L   + + L LS +++R    
Sbjct: 773 RFNQFHDGFYVSSLIDGLKNVIGPALHLASATSFDERKNLPVDEHAKLVLSEVVRR---- 828

Query: 390 LQFDRLMPSYNGILT 404
           L  D+++PS+   +T
Sbjct: 829 LNMDKIIPSFRYAVT 843


>gi|302924510|ref|XP_003053905.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256734846|gb|EEU48192.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1486

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 151/544 (27%), Positives = 249/544 (45%), Gaps = 79/544 (14%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            L K  G +   +ILQ ++++ Q   +     L T++FR    K     R  L+ F+ +WV
Sbjct: 546  LIKASGGHGLTRILQKLLTKVQIEASDKSTILETEKFRAVCEKGA---RYRLESFWNQWV 602

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAV--LRDCTVKPDSRTPVLSSN----TDSENRDG-DI 112
               GCP   +   +NK++  VEL +  ++  TVK     P L  +       E R G   
Sbjct: 603  YGSGCPRFDVKAKFNKKRLCVELTLNQIQFQTVK----KPALERDDFLRVIKERRAGVKT 658

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPK 161
            G     + G M++R+HE DG  Y+H IL +  DA +  + +      + +L   R +K K
Sbjct: 659  GEVQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEK 717

Query: 162  K--GSKPDG--CDDN-----GDAVAGLDMRSSME--------------SPLSWIRADPEM 198
            +  G+  D    DD      GD +   D ++  E                  WIR D + 
Sbjct: 718  QNVGASMDAENLDDTLLYCLGDVLQTPDEQAQWELIDWDPDTERKMDQESYEWIRVDADF 777

Query: 199  EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
            E+  ++       M+++QL++D DVVAQ  A+  L   P L       L   L D + F 
Sbjct: 778  EWACDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTHGP-LHPIASGFLTRTLVDRRYFH 836

Query: 259  RVRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAI 317
             +R  AA AL   A+ ++    GL  L+K +      +    P+PNDF D  +Y V  AI
Sbjct: 837  GIRTMAAEALPRQANIKDISMLGLRQLMKAFSEMFCHKGTNQPKPNDFSDKKQYNVQCAI 896

Query: 318  PHAVAMVRAADN-KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------E 368
              A+A VR A+  K P EA +F+L  L +N+N  NPYSD F++A LV+++         +
Sbjct: 897  IKAIAQVRDANTYKCPLEARQFILDQLLFNNNEDNPYSDHFYIATLVEALATSLIPSKKD 956

Query: 369  LEFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCI---RTLTQIALKL 418
              F  Q+ +       FL   +++I+R+L+ D    SY  + TI+ +   + L +  +  
Sbjct: 957  DWFAMQNKIPNEEETQFLDRAMEQIERVLRRDEWTNSYQNVWTIAGLNAKQRLMKAEVIP 1016

Query: 419  SGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSL 478
              +I   Q   L+   RD      +R++A  AL+DL    +    +L ++  + +  P +
Sbjct: 1017 KAYIDFGQY--LLDGTRDL-----IRIKAFEALIDLGAMMDPTMFSLLIYSLTTDRSPYV 1069

Query: 479  RGQV 482
            R ++
Sbjct: 1070 RNKL 1073


>gi|326477130|gb|EGE01140.1| transcription initiation factor TFIId 127kD subunit [Trichophyton
           equinum CBS 127.97]
          Length = 1240

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 112/404 (27%), Positives = 188/404 (46%), Gaps = 49/404 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +LE+++     +  +  IISR       G  P   L++  F+    ++G+ +   L  FF
Sbjct: 514 ILERRLSKASGKATISRIISRIFLNARMGDLPNCELTSAHFQKLCERMGHTK---LDAFF 570

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
            +WV   GCP       +NK+K +VE+ + +   + T   D        +   E R+   
Sbjct: 571 SQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQSEQTTARDLERSTFMRDVKEEIRNVYA 630

Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEI-QCHSKLAARRALKPKKGSK 165
           G     + G M+I++HE DG  Y+H           ++EI +  +K       K K+ S 
Sbjct: 631 GTVQNVFTGSMTIQIHEADGTPYEH-----------IVEIKEAITKFDIPYNTKYKRLS- 678

Query: 166 PDGCDDNGDAVAGL-DM--------------RSSMESPLSWIRADPEMEYLAEIHFNQPV 210
             G  D G+ +    DM              R   ES   WIR D + E++ ++    P 
Sbjct: 679 --GIGDKGNELRPYEDMEKWKIVEWTKEDESRMGQES-YEWIRMDADFEWICKMSLGMPG 735

Query: 211 QMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN 270
            M+++QL++D DVVAQ ++I  + A    +  + +     + D + F  +R  A  AL  
Sbjct: 736 YMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFHGIRTAATKALVK 794

Query: 271 TASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNK 330
            A +E DW GL HL   ++      N  + R NDF D S Y V  A+  A++ VR  D +
Sbjct: 795 HAKQEVDWIGLFHLENAFQELYCLPNSSMARSNDFSDRSSYAVQLALIEAISEVRDEDGR 854

Query: 331 SPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
           +P    +F+ + LK+NDN+ N YSD F+++ L++S+     G++
Sbjct: 855 TPMRVKQFLYEKLKFNDNSNNEYSDCFYISTLLKSICNALLGRR 898


>gi|268553097|ref|XP_002634531.1| C. briggsae CBR-TAF-2 protein [Caenorhabditis briggsae]
          Length = 1090

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 131/522 (25%), Positives = 243/522 (46%), Gaps = 83/522 (15%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTL---------STKEFRHFANKVGNLERPFL 51
           M++K++G   F ++L  +++     S  + L         +T  F    + V   E P  
Sbjct: 493 MIQKRIGMEPFMRVLHRLLTVGLDMSEKKLLPPAWKHLLITTDSFFRNIHSVTGKEIP-- 550

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +WV   G     + F +N+++N+VE+ + ++                DSE  +G 
Sbjct: 551 -SFLDQWVRVGGHASFAIKFDFNRKRNLVEMEIKQE----------------DSEG-NGR 592

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDGC 169
           + + G +S+ V E+DG + H I      A    EI CHSK   ++  K     G + +  
Sbjct: 593 MQYTGPLSVVVQEVDGAFSHTI--QIDGAVSHAEISCHSKGRKQKKKKVPILTGEEIEID 650

Query: 170 DDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
             N DA          ESP+ W+R D +   + EI  +QP+  W   L+ + DV+AQ +A
Sbjct: 651 LTNMDA----------ESPILWLRIDHDYLLIREITISQPMFHWEYMLKYERDVIAQMEA 700

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDW--AGLLHLVKF 287
           +  ++ALP  S +  + + + +S+ + F+R+R  AA+ L +  + +++    G   L+  
Sbjct: 701 LERIQALP--SAHSRSVIVDAVSNERFFYRIRSRAAFVLTSVQNRKSEAIAVGTPVLINM 758

Query: 288 YKSRRFDENIG------LPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAVE 337
           ++     EN G      +PR N+F        +YFV+++IP A++ +R    +   + + 
Sbjct: 759 FR-----ENFGSKAVTNIPRSNNFVVTAEKLQQYFVMQSIPQAISRLRKPSGECNEDVLP 813

Query: 338 FVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-----------LSSLLKRI 386
           F+L L+K+NDN+ N YSD F+ A+L  S+    +  + + +              ++K  
Sbjct: 814 FLLDLIKFNDNSTNRYSDDFYRASLFNSLAASIYPSEYLPYRVDLPENLSNEFRVMIKEF 873

Query: 387 DRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVE 446
              L  D + PSY  ++ I+ +  L Q  L+  G++ LD   +L+  F + N   Q+R  
Sbjct: 874 TYALNMDTVNPSYGRVVGIAALSGLYQ--LQKCGYLPLDS--ELLWTFANRNLCIQMRRC 929

Query: 447 ASRALLD-LEFHCNGIDSALS-----LFIKSVEEEPSLRGQV 482
           A   ++D +    N +D+ +      L +   E +PS+R  +
Sbjct: 930 AITLIVDRISNDTNALDTRMEDLSRLLDLVETETDPSIRAMI 971


>gi|308810875|ref|XP_003082746.1| TFIID component TAF2 (ISS) [Ostreococcus tauri]
 gi|116061215|emb|CAL56603.1| TFIID component TAF2 (ISS) [Ostreococcus tauri]
          Length = 969

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 119/437 (27%), Positives = 206/437 (47%), Gaps = 50/437 (11%)

Query: 1   MLEKQMGSNFFRKILQNI----ISRAQGASPVRTLSTKE--------------FRHFANK 42
           MLEK++G +  +K+L+      + R +     RT  + E              F H    
Sbjct: 400 MLEKRLGPDVMQKVLKYFAGLHVRRNKKEQATRTGPSNEVLAGNARWIHTLQLFDHCRAT 459

Query: 43  VGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSN 102
           V NL +  +  F  RWV   G P L +G+   ++KN++E AV  + +    +      + 
Sbjct: 460 V-NLGKGEVNSFLERWVYGAGTPKLSVGYVVKRKKNVMEFAVKLEGSAAAAAADRAALAV 518

Query: 103 TDSENRDGDIGWPGMMSIRVHELD--GMYDHPILPMAGDAWQLLEIQCHSKLAARRALKP 160
             +            +++R+ E +     DH ++ +   +WQL+EI    +   RR   P
Sbjct: 519 ARNHRSS--------VTVRMREENRADANDH-VVSLGQSSWQLMEIPLQPRPKDRR---P 566

Query: 161 KKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPV----QMWINQ 216
           K        +  GD     ++ ++M+ P+ ++R DPE E++  I  +        M    
Sbjct: 567 KTI-----IESGGDP----ELIAAMDCPVRYVRVDPEFEWMGNIEQSARQVGLESMMAQM 617

Query: 217 LEKDGDVVAQAQAIAAL-EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
           LEK+ D+VAQ  A+  L   + + S + V  L+  L+    F RVR EAA AL  +ASE+
Sbjct: 618 LEKEKDIVAQIVAVEFLGRRVANGSVSAVLVLDKCLNSEDTFCRVRAEAAMALGKSASEK 677

Query: 276 TDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPR-E 334
           T W  L  L++ YK  + +E  G P+ NDFRD ++  V +A+  A+A V   + K    +
Sbjct: 678 TQWGSLHALLRHYKKYQCNERTGKPKQNDFRDLAKVIVDKAVITALAHVYDEEAKCTHAD 737

Query: 335 AVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDR 394
           A++ ++  LKYN+N GNP SD  ++A  + ++        ++  L ++  +I   ++ D 
Sbjct: 738 AIDAIVNRLKYNNNEGNPQSDDGFVATCLSALARCV--PATLGELRAITAQIHYYIRRDS 795

Query: 395 LMPSYNGILTISCIRTL 411
             PS +  +T + IR L
Sbjct: 796 RFPSVDLCVTCAGIRAL 812


>gi|312077933|ref|XP_003141518.1| TATA binding protein associated factor [Loa loa]
          Length = 1086

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 137/491 (27%), Positives = 235/491 (47%), Gaps = 77/491 (15%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEFRHFANKVGNLERPFL 51
           ML +++G   F +++Q I+S A  AS ++         T+ST+ F    + V   E P  
Sbjct: 480 MLNQRLGKEPFLQVIQKILSVAMQASQLQKEPAHWQHMTVSTESFFRTVSSVTGQELP-- 537

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +W+   G     + +++N+++N++EL V ++                 S NR G 
Sbjct: 538 -TFLEQWIHNGGHAKFWVQYAFNRKRNMIELEVKQE----------------PSANR-GR 579

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + + V ELDG + H +   +  +    ++QCHSK   RR  K KK     G + 
Sbjct: 580 QSYVGPLIVVVQELDGSFTHTVQIDSNHSKH--DLQCHSK--GRRQ-KKKKIPLSTGEEL 634

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
             D +A +D     +SP+ WIR DP++  L +++  QPV  W   L+ + DV+AQ QA+ 
Sbjct: 635 EID-LANMDP----DSPVLWIRIDPDLLLLRKVNIQQPVYQWEYMLKYERDVLAQLQALD 689

Query: 232 ALEALP--HLSFNVVNTLNNFLSDSKAFWR----VRIEAAYALANTASEETD-WAG---- 280
            L+  P  H    ++ T+     +++AF+     VR  AAY LA   ++  + W G    
Sbjct: 690 ILQRFPSPHARAVLIETI-----ETEAFFYRQALVRCRAAYILAEVVNKLPETWLGPPAL 744

Query: 281 LLHLVKFYKSRRFDENIGLPRPNDFRDFSE----YFVLEAIPHAVAMVRAADNKSPREAV 336
           ++   KF+  +       +P+PN+F   S     YF++ AIP A+A +R++ N +  E  
Sbjct: 745 MILFKKFFGCKSAPH---IPKPNNFVATSANLQLYFLMHAIPQALARLRSSSNGALSEVH 801

Query: 337 EFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQS--ILFLS---------SLLKR 385
            F++ L+KYNDN  N YSD  + A+L+ S+       +S  I+ +S          LL  
Sbjct: 802 HFLIGLIKYNDNTVNRYSDDHYRASLINSLAATLIPAESRDIIDMSPLSLDTEMRELLSE 861

Query: 386 IDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRV 445
               L  D L PS+  I+ I+ ++ +    ++ +  I LD   K+   F   N    +R+
Sbjct: 862 FTLALNMDTLKPSFGRIIGITALQAIYH--MQRTFHIPLDP--KVFWSFAQPNMYVPMRL 917

Query: 446 EASRALLDLEF 456
            A   L+D+ +
Sbjct: 918 AALTFLVDMVY 928


>gi|321476028|gb|EFX86989.1| hypothetical protein DAPPUDRAFT_307126 [Daphnia pulex]
          Length = 1171

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 121/480 (25%), Positives = 215/480 (44%), Gaps = 59/480 (12%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTK------EFRHFANKVGNLERPFLKEF 54
           MLE ++G     ++    +S A  AS  +  + +          FA  +  +    +  F
Sbjct: 479 MLELRIGQPLLIQVFNKQLSLAVQASQQKCQANQWGAMLISTATFAKAIFTVTGKDIGVF 538

Query: 55  FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW 114
              WV   G    ++ F +N+++N VEL + +D             + + ++   G   +
Sbjct: 539 VDTWVRQGGHARFQLSFVFNRKRNTVELEIRQD-------------AASGAQPARGLRRY 585

Query: 115 PGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGD 174
            G + + + ELDG + H +   +  A   L     S+   ++ +    G + D       
Sbjct: 586 VGPLVVALQELDGTFRHTLQIESSVAKNDLTCHSKSRRNKKKKIPLCTGEEVD------- 638

Query: 175 AVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
               +D+ +   +SP+ WIR DP+M  L  +  +QP   W  QL  + DV AQ +A+ AL
Sbjct: 639 ----MDLSAMDADSPVLWIRLDPDMSLLRAVEISQPDYQWQYQLRHERDVTAQMEAVMAL 694

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAG----LLHLVKFY 288
           E     +  +   L + + +   F++VR +AAY LA  A+   + WAG    L+   K +
Sbjct: 695 ERFATPATRLA--LTDIIENENCFYKVRCDAAYCLAKVANGMVSSWAGPPAMLMIFRKLF 752

Query: 289 KSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDN 348
            S      +   R N+F +F  YF+ + IP A+A +R A    P+E + F+L L KYNDN
Sbjct: 753 GSFSCQHIV---RQNNFANFQHYFLQKTIPVAMAQLRNALGVCPQEVLRFLLDLFKYNDN 809

Query: 349 NGNPYSDVFWLAALVQSVGEL-------------EFGQQSILFLS-SLLKRIDRLLQFDR 394
             N  SD ++ AAL++++G                   +S+   +  +L+ + R L  ++
Sbjct: 810 TKNRLSDCWYRAALIEALGNTASPVVAVVAQSAASLTPESVTKETHQVLEEVARALNLEK 869

Query: 395 LMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
           L+P+Y   +T+ C++ +     KL     L     L K +  +     VRV A  AL+++
Sbjct: 870 LLPTYKFAVTVGCLKAIR----KLQKCGQLPSKSDLFKSYAQYGQFHDVRVTALEALVEV 925


>gi|341880460|gb|EGT36395.1| hypothetical protein CAEBREN_08215 [Caenorhabditis brenneri]
          Length = 833

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 123/458 (26%), Positives = 218/458 (47%), Gaps = 73/458 (15%)

Query: 1   MLEKQMGSNFFRKILQNIIS--------RAQGASPVRTLSTKE--FRHFANKVGNLERPF 50
           ML+K++G   F ++L  +++        +A  A+    L+T E  FR  ++  G     F
Sbjct: 236 MLQKRIGLEPFMRVLHRVLTVGLDMSEKKATPAAWRHLLTTTEAFFRSVSSVTGREIPTF 295

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           L +F    V T G     + F +N+++NIVE+ V +D                D+E  +G
Sbjct: 296 LDQF----VRTGGHVAFAVKFDFNRKRNIVEIEVKQD----------------DTEG-NG 334

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDG 168
            + + G +S+ V E+DG + H I      A    EI CHSK   ++  K     G + + 
Sbjct: 335 RVQYTGPLSVVVQEVDGAFSHTI--QIDGATSHAEISCHSKGRKQKKKKVPILTGEEIEI 392

Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
              N DA          ESP+ W+R D +   + EI  +QP+  W   L+ + DV+AQ +
Sbjct: 393 DLTNMDA----------ESPILWLRIDHDYLLIREITISQPMFHWEYMLKYEKDVIAQME 442

Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFY 288
           A+  ++ALP  S +  + + + +S+ K F+R+R  AA+ L +  + +++   +   V   
Sbjct: 443 ALERIQALP--SAHSRSVIVDAVSNDKFFYRIRSRAAFVLTSVLNRKSEAIAVGTPVLI- 499

Query: 289 KSRRFDENIG------LPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEF 338
               F EN G      +PR N+F    ++  +YFV++++P A++ +R    +   E   F
Sbjct: 500 --NMFRENFGSKAATNIPRSNNFVVTAQNLQQYFVMQSLPQAISRLRKQSGECHEEVQPF 557

Query: 339 VLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-----------LSSLLKRID 387
           +L L+K+NDN+ N YSD F+ A+L  S+    +  + + +              L+K   
Sbjct: 558 LLDLIKFNDNSTNRYSDDFYRASLYNSLAASVYSSEYLPYRVDLPENLNQDFRILIKEFT 617

Query: 388 RLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLD 425
             L  D + PS+  ++ I+ +  + Q  L+  G+  LD
Sbjct: 618 HALNMDTVNPSWGRVVGIAALSGIYQ--LQKCGYFPLD 653


>gi|170584208|ref|XP_001896898.1| TATA binding protein associated factor [Brugia malayi]
 gi|158595723|gb|EDP34252.1| TATA binding protein associated factor, putative [Brugia malayi]
          Length = 1096

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 140/491 (28%), Positives = 233/491 (47%), Gaps = 77/491 (15%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGAS-----PVR----TLSTKEFRHFANKVGNLERPFL 51
           ML +++G   F +++Q I+S A  AS     PV     T+ST+ F    + V   E P  
Sbjct: 490 MLNQRLGKESFLQVIQKILSVAMQASQLQKEPVHWQHMTVSTESFFRTVSSVTGQELP-- 547

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
             F  +W+ + G     + +++N+++N++EL V ++                 S NR G 
Sbjct: 548 -TFLEQWIHSGGHAKFWVQYAFNRKRNMIELEVKQE----------------PSANR-GR 589

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G + + V ELDG + H +   +  +    ++QCHSK   RR  K KK     G + 
Sbjct: 590 QSYVGPLIVVVQELDGSFTHTVQIDSNHSKH--DLQCHSK--GRRQ-KKKKIPLSTGEEL 644

Query: 172 NGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
             D +A +D     +SP+ WIR DP++  L ++   QPV  W   L+ + DV+AQ QA+ 
Sbjct: 645 EID-LANMDP----DSPVLWIRIDPDLLLLRKVSIQQPVYQWEYMLKYERDVLAQLQALD 699

Query: 232 ALEALP--HLSFNVVNTLNNFLSDSKAFWR---VRIEAAYALANTASE--ETDWAG---- 280
            L+  P  H    ++ T+         F+R   VR  AAY LA   ++  ET W G    
Sbjct: 700 VLQRFPSPHARAILIETIET----EVFFYRQVLVRCRAAYILAEVVNKLPET-WLGPPAL 754

Query: 281 LLHLVKFYKSRRFDENIGLPRPNDFRDFSE----YFVLEAIPHAVAMVRAADNKSPREAV 336
           ++   KF+  +       +P+PN+F   S     YF++ AIP A+A +R++ N +  E  
Sbjct: 755 MILFKKFFGCKSAPH---IPKPNNFVATSANLQLYFLMHAIPQALARLRSSSNNALSEVH 811

Query: 337 EFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL-------EFGQQSILFLSS----LLKR 385
            F++ L+KYNDN  N YSD  + A+L+ S+          +    S L L++    LL  
Sbjct: 812 HFLIGLIKYNDNTVNRYSDDHYRASLINSLAATLVPAENRDIINMSPLSLNTEMKELLSE 871

Query: 386 IDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRV 445
               L  D L PS+  ++ I+ ++ +    ++ +  I LD   K+   F   N    +R+
Sbjct: 872 FTLALNMDTLKPSFGRVIGITALQAIYH--MQRTFHIPLDP--KVFWSFAQPNMYVPMRL 927

Query: 446 EASRALLDLEF 456
            A   L+D+ +
Sbjct: 928 AALTFLIDMVY 938


>gi|346973924|gb|EGY17376.1| transcription initiation factor TFIID 127kD subunit [Verticillium
            dahliae VdLs.17]
          Length = 1533

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 143/514 (27%), Positives = 233/514 (45%), Gaps = 75/514 (14%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
            ++T+ FR F  K    +   L+ F+ +W+   GCP   +   +NK+K  VE+ V R   V
Sbjct: 599  INTESFRRFCEKNSKSK---LESFWKQWIYGTGCPRFDVQQKFNKKKLCVEMMV-RQVQV 654

Query: 91   KPDSRTPVLSSN-----TDSENRDGDIG-----WPGMMSIRVHELDGM-YDHPILPMAGD 139
            K       L  N        +  D  IG     + G M+IR+HE DG  Y+H I+ +  D
Sbjct: 655  KERDSNRALDKNDFLRLVKEKLYDVAIGEIQPWFTGPMTIRIHEADGTPYEH-IVEVRED 713

Query: 140  AWQL----LEIQCHSKL-----AARRALKPKKGSKPDG--CDDN-----GDAVAG----- 178
            A  +     +I  ++K      + R+  +    +  DG   +D+     GD +       
Sbjct: 714  AAAMKMCKFDIPYNTKYKRLKRSRRQRDRAAAQTTADGEVNEDSLLYCLGDVLQSQDDIQ 773

Query: 179  ----LDMRSSMESPLS-----WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
                LD  + ME  +      WIR D + E++  +  N    M+++QL++D DVVAQ   
Sbjct: 774  EWDLLDWPADMEKKMDQESYEWIRMDADFEWICTMKTNSESYMYVSQLQQDRDVVAQQDT 833

Query: 230  IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA--NTASEETDWAGLLHLVKF 287
            +  L   P     V + L   L D + F  +R  AA  L   +    +    GL HL++ 
Sbjct: 834  MLYLGRSPPHPL-VSSILTRTLFDKRYFHGIRTMAASQLHRHSVPGPKDSMVGLFHLLRA 892

Query: 288  YKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
            +          +PRPNDF D  +Y    AIP A+A VR ++ K P  A  F+L+ +++N+
Sbjct: 893  FTQLCCFPGTQMPRPNDFSDRKQYLAQSAIPLAIARVRDSNGKCPERARHFLLEQVQFNN 952

Query: 348  NNGNPYSDVFWLAALVQSVG---------ELEFGQQSIL-------FLSSLLKRIDRLLQ 391
            N  NP+SD F++A L++++          + E     I+       FL + L  IDR  +
Sbjct: 953  NANNPFSDNFYVAKLLEALATSMIPDKRPDTEVMLDDIIRTDEEDQFLQNALTEIDRYRR 1012

Query: 392  FDRLMPSYNGILTIS---CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEAS 448
             D    SY+ I TI+   C R L +     +G I  D  +  ++  +D  T   VR++A 
Sbjct: 1013 MDEWANSYHNIWTITALDCKRRLMK-----AGVIPKDG-LDFMQYLQD-ETYDLVRIKAF 1065

Query: 449  RALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQV 482
             AL+DL F  + +     L   S ++ P +R ++
Sbjct: 1066 EALVDLGFMMDDVVFPHLLRTLSTDKSPFVRHKL 1099


>gi|302416095|ref|XP_003005879.1| transcription initiation factor TFIId 127kD subunit [Verticillium
            albo-atrum VaMs.102]
 gi|261355295|gb|EEY17723.1| transcription initiation factor TFIId 127kD subunit [Verticillium
            albo-atrum VaMs.102]
          Length = 1552

 Score =  143 bits (360), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 160/602 (26%), Positives = 262/602 (43%), Gaps = 105/602 (17%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
            ++T+ FR F  K    +   L+ F+ +W+   GCP   +   +NK+K  VE+ V R   V
Sbjct: 599  INTESFRRFCEKNSKSK---LESFWKQWIYGTGCPRFDVQQKFNKKKLCVEMMV-RQVQV 654

Query: 91   KPDSRTPVLSSN-----TDSENRDGDIG-----WPGMMSIRVHELDGM-YDHPILPMAGD 139
            K       L  +        +  D  IG     + G M+IR+HE DG  Y+H I+ +  D
Sbjct: 655  KERDSNRALDKDDFLRLVKEKLYDVAIGEIQPWFTGPMTIRIHEADGTPYEH-IVEVRED 713

Query: 140  AWQL----LEIQCHSKL-----AARRALKPKKGSKPDG--CDDN-----GDAVAG----- 178
            A  +     +I  ++K      + R+  +    +  DG   +D+     GD +       
Sbjct: 714  AAAMKMCKFDIPYNTKYKRLKRSRRQRDRAAAQTTADGEVNEDSLLYCLGDVLQSQDDIQ 773

Query: 179  ----LDMRSSMESPLS-----WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
                LD  + ME  +      WIR D + E++  +  N    M+++QL++D DVVAQ   
Sbjct: 774  EWDLLDWPADMEKKMDQESYEWIRMDADFEWICTMKTNSESYMYVSQLQQDRDVVAQQDT 833

Query: 230  IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALA--NTASEETDWAGLLHLVKF 287
            +  L   P     V + L   L D + F  +R  AA  L   +    +    GL HL++ 
Sbjct: 834  MLYLGRSPPHPL-VSSILTRTLFDKRYFHGIRTMAASQLHRHSVPGPKDSMVGLFHLLRA 892

Query: 288  YKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
            +          +PRPNDF D  +Y    AIP A+A VR A+ K P  A  F+L+ +++N+
Sbjct: 893  FTQLCCFPGTQMPRPNDFSDRKQYLAQSAIPLAIARVRDANGKCPERARHFLLEQVQFNN 952

Query: 348  NNGNPYSDVFWLAALVQSVG---------ELEFGQQSIL-------FLSSLLKRIDRLLQ 391
            N  NP+SD F++A L++++          + E     I+       FL + L  IDR  +
Sbjct: 953  NANNPFSDNFYVAKLLEALATSMIPDKRPDTEVMLDDIIRTDEEDQFLQNALTEIDRYRR 1012

Query: 392  FDRLMPSYNGILTIS---CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEAS 448
             D    SY+ I TI+   C R L +     +G I  D  +  ++  +D  T   VR++A 
Sbjct: 1013 MDEWANSYHNIWTITALDCKRRLMK-----AGVIPKDG-LDFMQYLQD-ETYDLVRIKAF 1065

Query: 449  RALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVT 508
             AL+DL F  + +     L   S ++ P +R                      H++  V 
Sbjct: 1066 EALVDLGFMMDDVVFPHLLRTLSTDKSPFVR----------------------HKLLKVF 1103

Query: 509  LVALLNLLESRIAFNNVFLRHHLFGILQILAGRAPTLYGVPRDKLLLL--GDGETSEQKN 566
             + L  L     +  N  +R     +   L G    L   P+D +L++  GD ET+E++ 
Sbjct: 1104 SIGLAGLAFGEFSKPNSAIRPQTGDMF--LDG----LADEPKDDMLVVEAGDKETTERRQ 1157

Query: 567  VF 568
             +
Sbjct: 1158 EY 1159


>gi|242003978|ref|XP_002422931.1| transcription initiation factor TFIID subunit, putative [Pediculus
           humanus corporis]
 gi|212505824|gb|EEB10193.1| transcription initiation factor TFIID subunit, putative [Pediculus
           humanus corporis]
          Length = 1197

 Score =  142 bits (359), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 125/500 (25%), Positives = 222/500 (44%), Gaps = 71/500 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  AS  + + +  + H       F   +  +    +  
Sbjct: 445 MLEHRIGQELLLQVFNKQLSLASNASQ-QKIGSGLWSHLLISTNVFHKAIFTVTGKDMSV 503

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     + F +N+++N VEL + +D T                 N+ G   
Sbjct: 504 FIDQWVRTGGHAKFSLTFVFNRKRNTVELEIRQDST-----------------NQRGIRK 546

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + + + ELDG + H  L + G +    +I CHSK    +                 
Sbjct: 547 YMGPLLVTIQELDGTFKH-TLQVEGTSRA--DITCHSKSRRNKK---------------- 587

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
                L     ++  LS +  DPE+  L   +  QP   W  QL  + DV+AQ +AIA+L
Sbjct: 588 -KKIPLCTGEEVDMDLSAMELDPEITLLRSTNIEQPDYQWQYQLRHERDVIAQIEAIASL 646

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKSRR 292
           E  P  +  +   L + + +   +++VR +AA+ L   A+    +WAG   ++  +  R+
Sbjct: 647 ERYPTPATRLA--LTDTIENEGTYYKVRCKAAHCLTKVANAMIANWAGPPAMLTIF--RK 702

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
              +   P   + N+F++F  Y++ + IP A+A +R      P + + F+L L KYNDN+
Sbjct: 703 LFGSFSCPHIMKQNNFQNFQHYYLQKTIPIAMAGIRNVHGICPPDVIRFLLDLFKYNDNS 762

Query: 350 GNPYSDVFWLAALVQSVGE-----LEFGQQSILF----LSS----LLKRIDRLLQFDRLM 396
            N +SD ++ A+L++++       +   QQ        LSS    +L+ I R L  ++L+
Sbjct: 763 KNLFSDNYYRASLIEALAASVTPVISVVQQGSAIKAENLSSDARLILEEITRCLNLEKLL 822

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
           P Y   +T++C++ +     KL  F  L     L + +  +     VR+ A  AL+D   
Sbjct: 823 PCYKYTVTVACLKAIR----KLQKFGHLPNHPALFRSYAAYGQFIDVRLAALEALVDYT- 877

Query: 457 HCNGIDSALSLFIKSVEEEP 476
             +G  S L   +   E +P
Sbjct: 878 KVDGKWSDLEFLLNLAESDP 897


>gi|426201311|gb|EKV51234.1| hypothetical protein AGABI2DRAFT_182197 [Agaricus bisporus var.
           bisporus H97]
          Length = 1842

 Score =  142 bits (359), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 84/246 (34%), Positives = 130/246 (52%), Gaps = 30/246 (12%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR D + E++A I F+QP  MW++QL++D DVVAQ +A+ AL   P  +  V +TL   
Sbjct: 697 WIRMDADFEWIANIKFDQPDFMWVSQLQRDRDVVAQLEALHALAEKP--TAIVSSTLTKT 754

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPR 301
           +  S  ++R+R EAA AL   A    D+ GL HL K +    +D          +  +P+
Sbjct: 755 VLVSNYYYRIRCEAALALVTCAIRRLDFLGLFHLFKLFLRYCYDPEDPTQDLFTHTYVPK 814

Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
           PN+F D +EYFV +++  A++ VR  + KSP    +F +  L+YNDN  NPYSD  ++  
Sbjct: 815 PNNFSDLAEYFVRKSLITAISRVRFENGKSPSVVRQFFIDQLRYNDNTSNPYSDGLYICT 874

Query: 362 LVQSV------------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
           ++ +             GEL       E   +    +   +  IDR    DRL+PS + I
Sbjct: 875 IISAAAYAAVSVAPPERGELLPAEIRSEQTAEDKELVRQTIAEIDRYRSMDRLIPSPHNI 934

Query: 403 LTISCI 408
           +TI+ +
Sbjct: 935 VTIAAL 940


>gi|326472005|gb|EGD96014.1| transcription initiation factor TFIId 127kD subunit [Trichophyton
           tonsurans CBS 112818]
          Length = 1254

 Score =  142 bits (359), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 108/416 (25%), Positives = 191/416 (45%), Gaps = 48/416 (11%)

Query: 1   MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +LE+++     +  +  IISR       G  P   L++  F+    ++G+ +   L  FF
Sbjct: 503 ILERRLSKASGKATISRIISRIFLNARMGDLPNCELTSAHFQKLCERMGHTK---LDAFF 559

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
            +WV   GCP       +NK+K +VE+ + +   + T   D        +   E R+   
Sbjct: 560 SQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQSEQTTARDLERSTFMRDVKEEIRNVYA 619

Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-------------A 153
           G     + G M+I++HE DG  Y+H +     +A    +I  ++K              A
Sbjct: 620 GTVQNVFTGSMTIQIHEADGTPYEHIV--EIKEAITKFDIPYNTKYKRLKRNRRQKERAA 677

Query: 154 ARRALKPKKGSKPD----------GCDDNGDAVAGLDMRSSMESPLS-----WIRADPEM 198
           A   + P   ++ D            D++ +    ++     ES +      WIR D + 
Sbjct: 678 AMSGIDPSSETQDDVLLYCLGDVLQSDEDMEKWKIVEWTKEDESRMGQESYEWIRMDADF 737

Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
           E++ ++    P  M+++QL++D DVVAQ ++I  + A    +  + +     + D + F 
Sbjct: 738 EWICKMSLGMPGYMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFH 796

Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
            +R  A  AL   A +E DW GL HL   ++      N  + R NDF D S Y V  A+ 
Sbjct: 797 GIRTAATKALVKHAKQEVDWIGLFHLENAFQELYCLPNSSMARSNDFSDRSSYAVQLALI 856

Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
            A++ VR  D ++P    +F+ + LK+NDN+ N YSD F+++ L++S+     G++
Sbjct: 857 EAISEVRDEDGRTPMRVKQFLYEKLKFNDNSNNEYSDCFYISTLLKSICNALLGRR 912


>gi|448117256|ref|XP_004203211.1| Piso0_000812 [Millerozyma farinosa CBS 7064]
 gi|359384079|emb|CCE78783.1| Piso0_000812 [Millerozyma farinosa CBS 7064]
          Length = 1539

 Score =  142 bits (358), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 139/510 (27%), Positives = 239/510 (46%), Gaps = 64/510 (12%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVL----- 85
            LST  F++   KV    R  L+ FF +WV   G PVL +   +N++++IVE+++      
Sbjct: 657  LSTSHFQYVCEKVN---RNKLESFFKQWVYGSGIPVLYVIQRFNRKRSIVEVSIRQIQHR 713

Query: 86   --RDCTVKPDS-RTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAW 141
              R  T+KP++     ++   D    +    + G M+IR+HE DG  Y+H +    G  +
Sbjct: 714  EPRKNTLKPENFLNDAIAYIDDEPVYNSQSVFTGPMTIRIHEADGTPYEHIVYLKEG--Y 771

Query: 142  QLLEIQCHSKLAARR-----ALKPKKG---------SKPDGCDDNGDAVAGLDMRSSMES 187
              L+IQ +++    R     A +P            S+ D  + N       D  +    
Sbjct: 772  SKLDIQYNTRFRRLRKNREEAYEPSTAFTGLGDVYSSEADMKEWNFAEFEKNDDENIYND 831

Query: 188  PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
               W+R D + E++A+++ NQ  QM+ +QL+ D  V AQ  AI+        +      L
Sbjct: 832  TFEWMRVDADSEWIAKVNINQSDQMFASQLQYDRHVEAQYDAISFFGDRERPTNIHCTVL 891

Query: 248  NNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRD 307
               L D + ++ +RI AA ALAN + E+ ++ GL +L+K YK      N  +P  NDF +
Sbjct: 892  TRTLLDDRYYYGIRIAAARALANFSKEDNEFIGLTYLLKAYKEMFCYPNSEIPLANDFSN 951

Query: 308  FSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
               + V   +P  ++ VR  +   P    +F+L L+KYNDN+ N + D  +L+ L+ ++ 
Sbjct: 952  VGRFLVQRELPIIMSSVRDNEGNVPEMIKKFLLNLVKYNDNSNNFFQDPLYLSKLIHALC 1011

Query: 368  EL----------EFGQQSI--------LFLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
                        +F QQ +         + + ++  I+RL + D L+PSY  ++  +C++
Sbjct: 1012 SSIINPNKSIIHDFLQQYLGSSDSVNDSYANEVVMEINRLQKLDGLVPSYQSVIYTTCLK 1071

Query: 410  TLTQIALKLSGFISLDQVV--KLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSA--L 465
               ++AL     +S ++++   L K   D      +R+EA R +    F   GI +   L
Sbjct: 1072 EKVRLALHGQINVSFEELLYYTLDKYCLD------IRIEAFRGI----FLLGGIKNKNIL 1121

Query: 466  SLFIKSV---EEEPSLRGQVKLGIHAMRIC 492
            S F+K     +  P LR Q+ L      IC
Sbjct: 1122 SYFLKICLLDDSSPYLRTQL-LEAFKYSIC 1150


>gi|400598146|gb|EJP65866.1| transcription initiation factor [Beauveria bassiana ARSEF 2860]
          Length = 1484

 Score =  142 bits (358), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 131/465 (28%), Positives = 216/465 (46%), Gaps = 65/465 (13%)

Query: 2   LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
           + K  G +   ++LQ +++RAQ  G+     L T++F+    K G+  R  L+ F+ +WV
Sbjct: 535 MNKASGGHGLTRVLQKLLTRAQIEGSDKSTILDTEKFKTLCEK-GSRYR--LETFWNQWV 591

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT--------DSENRDGD 111
              GCP   +   +NK++  VEL + +    K  ++ P L  +          +E   G+
Sbjct: 592 YGSGCPRFDVKARFNKKRLCVELTLNQVQLHK--AKKPQLEKDDFLREIKERRAEVPRGE 649

Query: 112 IG--WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPK-- 161
           I   + G M++R+HE DG  Y+H IL +  DA +  + +      + +L   R +K K  
Sbjct: 650 IQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRATKFEIPYNTKYKRLKRTRRMKEKHS 708

Query: 162 KGSKPDGCDDNGDAVAG-----------------LDMRSSMESPL-----SWIRADPEME 199
            G   +  ++N DA+                   +D     E  +      WIR D + E
Sbjct: 709 TGQSMEIAENNDDALLYCLGDVLQTPEDQREWELIDWDPETERKMDQESYEWIRLDADFE 768

Query: 200 YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
           + +++       M+++QL++D DVVAQ  A+  L   P L       L   L D + F  
Sbjct: 769 WASDMSRTLEPYMYVSQLQQDRDVVAQQDAMLYLAQGP-LHPIASGFLVRTLIDRRYFHG 827

Query: 260 VRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
           +R  AA AL   ++ ++    G+  L++ ++     E    PRPNDF D  +Y V  AI 
Sbjct: 828 IRTMAAAALTRQSNIKDIPLLGMRQLMRAFREMYCYEQTNQPRPNDFTDKKQYNVRCAII 887

Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF------- 371
            A+A VR   NK P EA  F+L  L+ N+N GNPYSD F++A LV+S+            
Sbjct: 888 KAIAEVRDHTNKCPMEARRFILDQLRLNNNEGNPYSDHFYIAFLVESLATSLIPSKQDDW 947

Query: 372 --------GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
                    ++   FL   L++I+R+L+ D    SY    T++ +
Sbjct: 948 LTRQNKIPNEEENAFLEDALEQIERVLRRDEWTNSYRNTWTVAGL 992


>gi|341878003|gb|EGT33938.1| hypothetical protein CAEBREN_01134 [Caenorhabditis brenneri]
          Length = 652

 Score =  142 bits (358), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 123/460 (26%), Positives = 220/460 (47%), Gaps = 77/460 (16%)

Query: 1   MLEKQMGSNFFRKILQNIIS--------RAQGASPVRTLSTKE--FRHFANKVGNLERPF 50
           ML+K++G   F ++L  +++        +A  A+    L+T E  FR  ++  G     F
Sbjct: 55  MLQKRIGLEPFMRVLHRVLTVGLDMSEKKATPAAWRHLLTTTEAFFRSVSSVTGREIPTF 114

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           L +F    V T G     + F +N+++NIVE+ V +D                D+E  +G
Sbjct: 115 LDQF----VRTGGHVAFAVKFDFNRKRNIVEIEVKQD----------------DTEG-NG 153

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK--KGSKPDG 168
            + + G +S+ V E+DG + H I      A    EI CHSK   ++  K     G + + 
Sbjct: 154 RVQYTGPLSVVVQEVDGAFSHTI--QIDGATSHAEISCHSKGRKQKKKKVPILTGEEIEI 211

Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
              N DA          ESP+ W+R D +   + EI  +QP+  W   L+ + DV+AQ +
Sbjct: 212 DLTNMDA----------ESPILWLRIDHDYLLIREITISQPMFHWEYMLKYEKDVIAQME 261

Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDW--AGLLHLVK 286
           A+  ++ALP  S +  + + + +S+ K F+R+R  AA+ L +  + +++    G   L+ 
Sbjct: 262 ALERIQALP--SAHSRSVIVDAVSNDKFFYRIRSRAAFVLTSVLNRKSEAIAVGTPVLIN 319

Query: 287 FYKSRRFDENIG------LPRPNDF----RDFSEYFVLEAIPHAVAMVRAADNKSPREAV 336
            ++     EN G      +PR N+F    ++  +YFV++++P A++ +R    +   E  
Sbjct: 320 MFR-----ENFGSKAATNIPRSNNFVVTAQNLQQYFVMQSLPQAISRLRKQSGECHEEVQ 374

Query: 337 EFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILF-----------LSSLLKR 385
            F+L L+K+NDN+ N YSD F+ A+L  S+    +  + + +              L+K 
Sbjct: 375 PFLLDLIKFNDNSTNRYSDDFYRASLYNSLAASVYPSEYLPYRVDLPENLNQDFRILIKE 434

Query: 386 IDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLD 425
               L  D + PS+  ++ I+ +  + Q  L+  G+  LD
Sbjct: 435 FTHALNMDTVNPSWGRVVGIAALSGIYQ--LQKCGYFPLD 472


>gi|307104297|gb|EFN52551.1| hypothetical protein CHLNCDRAFT_138525 [Chlorella variabilis]
          Length = 1744

 Score =  142 bits (358), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 125/440 (28%), Positives = 203/440 (46%), Gaps = 45/440 (10%)

Query: 1    MLEKQMGSNFFRKILQNIIS---------------RAQGAS-PVRTLSTKEFRHFANKVG 44
            MLE++ G   FRK + ++++                A GA+ P ++ + +     A   G
Sbjct: 830  MLERRAGEELFRKHVGDVVAGAIAAVQAEQAAVAGTASGAAGPSQSPAARP--SPAAGAG 887

Query: 45   NLERPFLKEFFP---RWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS 101
            +    F KE  P   RWV   G P +   + ++ R + +ELAV +     P+S      +
Sbjct: 888  S----FRKEMGPFAVRWVYGRGVPHITAAYIHHSRVSALELAVKQ---AGPESAKRAADA 940

Query: 102  NTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPK 161
               +    G         +R  E     DHP+  + G+A+ L E++    +  ++  + K
Sbjct: 941  AERAAVAGGSNAGIIKAVVREGEPPTSTDHPVH-LGGEAFVLTELKVTPVVEKKQPGRRK 999

Query: 162  KGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDG 221
            +  +    D+    +      ++ E P+ W+R DP  E L +I   QP +    QL +  
Sbjct: 1000 RKVEGVEVDEAPKTIV-----ATPEEPVLWVRLDPACELLCDIRLMQPERCLAAQLLQSR 1054

Query: 222  DVVAQAQAIAALEALPHLSFNV------VNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
            DVVAQA+AI  L  L   S         V+ L   L+D  AF+RVR +AA ALA    E+
Sbjct: 1055 DVVAQAEAIQRLAELRVHSAATWRQQPGVDALRRTLADRAAFYRVRCDAALALAGLRDEQ 1114

Query: 276  TDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA 335
               AGL  L+ FY+S  F     +PR     D SEYFV +A+  AV++ R  D  +P + 
Sbjct: 1115 GQAAGLPLLLDFYRSHYFLPGSEVPRSIAIADPSEYFVAQAVATAVSLCREPDGSTPIDV 1174

Query: 336  VEFVLQLL-KYNDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDR 394
            V F+   L +Y    G P+SD   +AAL  S+G ++F      F  SLL  +++ L+ D 
Sbjct: 1175 VLFLSTCLNEYVSVPGAPFSDYGLVAALCASLGNVQFEG----FGDSLLTLLEKFLERDS 1230

Query: 395  LMPSYNGILTISCIRTLTQI 414
            ++ S    +  +C+ T+  +
Sbjct: 1231 VLVSPGSAVGCACLSTMAAL 1250


>gi|344232622|gb|EGV64495.1| hypothetical protein CANTEDRAFT_120010 [Candida tenuis ATCC 10573]
          Length = 1370

 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 115/431 (26%), Positives = 205/431 (47%), Gaps = 53/431 (12%)

Query: 30   TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
            TLST+ F++   KV    R  L+ FF +WV   G PV  +   +N++++++E+ V+R   
Sbjct: 624  TLSTQHFQYLCEKVN---RNRLETFFKQWVFGSGTPVFNVSQRFNRKRSMIEV-VVRQAQ 679

Query: 90   VKPDSRT-PVLSSNTDS-----ENRDGDI---GWPGMMSIRVHELDGM-YDHPILPMAGD 139
            V+   R+ P++ +  D+     +N         + G M+IRVHE DG+ Y+H I+ +   
Sbjct: 680  VQERKRSAPLVENFIDTSIAFLDNEPAPTVQSAFLGPMTIRVHEADGVPYEH-IVELKDS 738

Query: 140  AWQLLEIQCHSKLAARRALKPKKGSKPDGCDDN-GDAVAG--------------LDMRSS 184
            A +L +I  +SK   RR  K K+ +   G     GD +                +D    
Sbjct: 739  AVKL-DINYNSKF--RRMKKHKEDAAEGGTYTRFGDIIQSPEELAAWGLQEWPPIDDEEL 795

Query: 185  MESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVV 244
              +   WIR D + E++A+++  QP  M+ +QL+ D DV AQ +AI              
Sbjct: 796  YNNAFEWIRIDTDFEWIAKVNVRQPDYMYSSQLQYDRDVEAQYEAIRFFGNSEKPIIAYC 855

Query: 245  NTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPND 304
             +L   L D + ++ VRI AA  LA  A +E ++ G+  L+K Y       N  +P  ND
Sbjct: 856  TSLIRTLMDPRYYYGVRIAAAEGLARIARQENNFMGMNCLIKAYTQLFCFPNSTIPLAND 915

Query: 305  FRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQ 364
            F +  ++ +    P     +R  +   P    + +L L+ YN+N+ N + D  ++++L++
Sbjct: 916  FGNIPKFMLQREFPRIFCSIRDDNGNVPFALKDIILNLVVYNENSNNSFEDSLYVSSLIE 975

Query: 365  SV-GELEFGQQSIL-------------------FLSSLLKRIDRLLQFDRLMPSYNGILT 404
            S+ G +     +IL                   F+  +++ I+R  + D L+PS++  L 
Sbjct: 976  SLAGSIINRNATILSNLTLPPEQDANLDPKEKEFIEVVVEEINRQQKLDELVPSHHHTLA 1035

Query: 405  ISCIRTLTQIA 415
            I+C+    ++A
Sbjct: 1036 ITCLDQKIRLA 1046


>gi|448119681|ref|XP_004203792.1| Piso0_000812 [Millerozyma farinosa CBS 7064]
 gi|359384660|emb|CCE78195.1| Piso0_000812 [Millerozyma farinosa CBS 7064]
          Length = 1536

 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 137/512 (26%), Positives = 237/512 (46%), Gaps = 68/512 (13%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVL----- 85
            LST  F++   KV    R  L+ FF +WV   G P+L +   +N++++IVE+++      
Sbjct: 657  LSTSHFQYVCEKVN---RNKLESFFKQWVYGSGIPILYVIQRFNRKRSIVEVSIRQIQHR 713

Query: 86   --RDCTVKPDS-RTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGDAW 141
              R  T+KP++     ++   D    +    + G M+IR+HE DG  Y+H +    G  +
Sbjct: 714  EPRKNTLKPENFLNDAIAYIDDEPVYNSQSVFTGPMTIRIHEADGTPYEHIVYLKEG--Y 771

Query: 142  QLLEIQCHSKLAARR-----ALKPKKG---------SKPDGCDDNGDAVAGLDMRSSMES 187
              L+IQ +++    R     A +P            S+ D  + N       D  +    
Sbjct: 772  SKLDIQYNTRFRRLRKNREEAYEPSTAFTGLGDVYSSEADMKEWNFAEFEKNDDENIYND 831

Query: 188  PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
               W+R D + E++A+++ NQ  QM+ +QL+ D  V AQ  AI+        +      L
Sbjct: 832  TFEWMRVDADSEWIAKVNINQSDQMFASQLQYDRGVEAQYDAISFFGDRERPTNIHCTVL 891

Query: 248  NNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRD 307
               L D + ++ +RI AA ALAN + E+ ++ GL +L+K YK      N  +P  NDF +
Sbjct: 892  TRTLLDDRYYYGIRIAAARALANFSKEDNNFIGLTYLLKAYKEMFCYPNSEIPLANDFSN 951

Query: 308  FSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALV---- 363
               +FV   +P  ++ VR  +   P    +F+L L+KYNDN+ N + D  +L+ L+    
Sbjct: 952  VGRFFVQHELPIIMSSVRDNEGNVPEMIKKFLLNLVKYNDNSNNFFQDPLYLSKLIHALC 1011

Query: 364  ----------------QSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISC 407
                            Q +G  +    S  + + ++  I+RL + D L+PSY  ++  +C
Sbjct: 1012 NSVINPNKSIIHDSLQQYLGSSDSANNS--YANEVVMEINRLQKLDGLVPSYQSVIYTTC 1069

Query: 408  IRTLTQIALKLSGFISLDQVV--KLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSA- 464
            ++   ++AL     +S ++++   L K   D      +R+EA + +    F   GI +  
Sbjct: 1070 LKEKVRLALHGQINVSFEELLYYTLDKYCLD------IRIEAFKGI----FLLGGIKNKN 1119

Query: 465  -LSLFIKSV---EEEPSLRGQVKLGIHAMRIC 492
             L  F+K     +  P LR Q+ L      IC
Sbjct: 1120 ILQYFLKICLLDDSSPYLRTQL-LEAFKYSIC 1150


>gi|47213759|emb|CAF95588.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1264

 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 95/297 (31%), Positives = 158/297 (53%), Gaps = 23/297 (7%)

Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
           NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ +AI
Sbjct: 561 NGEEV-DMDLSAMDADSPLLWIRIDPDMSILRKVEFEQADFMWQYQLRYERDVVAQEEAI 619

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYK 289
           +ALE  P  +  +   L + L   + F++VR++A + L+  A+   + W G   + K   
Sbjct: 620 SALEKFPTPASRLA--LTDILEQDQCFYKVRMQACFCLSKIANAMVSTWQGPPAM-KILF 676

Query: 290 SRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYND 347
           +R F       + + N+F +F  YF+ + +P A+A++R   N  P+E + F+L L+KYND
Sbjct: 677 TRMFCCKSCPNIVKTNNFINFQSYFLQKTMPVAMALLRDIQNLCPKEVLNFILDLIKYND 736

Query: 348 NNGNPYSDVFWLAALVQSVGELEFGQQSILF-----------LSSLLKRIDRLLQFDRLM 396
           N  N +SD ++ A L++++        SI             +  +L+ I R L  ++L+
Sbjct: 737 NRKNKFSDNYYRADLIRALTNSLTPAISISNEVRTVDNLNPDVRLILEEITRYLNMEKLL 796

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
           PS+   +T+SC+R + +  L+ +G I  D    L K +  +     VRV A  AL+D
Sbjct: 797 PSFRNTITVSCLRAIRE--LQKNGHIPSD--ASLFKSYAQYGHFVDVRVAALEALVD 849


>gi|315046148|ref|XP_003172449.1| hypothetical protein MGYG_05040 [Arthroderma gypseum CBS 118893]
 gi|311342835|gb|EFR02038.1| hypothetical protein MGYG_05040 [Arthroderma gypseum CBS 118893]
          Length = 1255

 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 108/416 (25%), Positives = 191/416 (45%), Gaps = 48/416 (11%)

Query: 1   MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +LE+++     +  +  IISR       G  P   L++  F+    ++G+ +   L  FF
Sbjct: 514 ILERRLSKASGKATVSRIISRIFLNARMGDLPNCELTSAHFQKLCERMGHTK---LDAFF 570

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDI 112
            +WV   GCP       +NK+K +VE+ + +   + T   D        +   E R+   
Sbjct: 571 NQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQVEQTTARDLERSTFMRDVKEEIRNVYA 630

Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-------------A 153
           G     + G M+I++HE DG  Y+H +     +A    +I  ++K              A
Sbjct: 631 GTVQNVFTGSMTIQIHEADGTPYEHIV--EIKEAITKFDIPYNTKYKRLKRNRRQKERAA 688

Query: 154 ARRALKPKKGSKPD----------GCDDNGDAVAGLDMRSSMESPLS-----WIRADPEM 198
           A   + P   ++ D            D++ +    ++     ES +      WIR D + 
Sbjct: 689 AMSGIDPSSETQDDVLLYCLGDVLQSDEDMEKWKIVEWTKEDESRMGQESYEWIRMDADF 748

Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
           E++ ++    P  M+++QL++D DVVAQ ++I  + A    +  + +     + D + F 
Sbjct: 749 EWICKMSLGMPGYMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFH 807

Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
            +R  A  AL   A +E DW GL HL   +K      N  + R NDF D S Y V  A+ 
Sbjct: 808 GIRTAATKALVKHAKQEVDWIGLFHLENAFKELYCLPNSSMARSNDFSDRSSYAVQLALI 867

Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
            A++ VR  D ++P    +F+ + LK+NDN+ N +SD F+++ L++S+     G++
Sbjct: 868 EAISEVRDEDGRTPMRVKQFLYEKLKFNDNSNNEFSDCFYISTLLKSICNALLGRK 923


>gi|366990875|ref|XP_003675205.1| hypothetical protein NCAS_0B07500 [Naumovozyma castellii CBS 4309]
 gi|342301069|emb|CCC68834.1| hypothetical protein NCAS_0B07500 [Naumovozyma castellii CBS 4309]
          Length = 1212

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 113/413 (27%), Positives = 197/413 (47%), Gaps = 53/413 (12%)

Query: 13  KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
           ++L  I  +A  G  P  TL+   F+H   +V    +  L  FF  W+   G PV R+  
Sbjct: 587 RVLPKIFLQAMSGDLPNNTLTASHFQHVCERVN---KSRLGSFFQEWIYGSGVPVFRVTQ 643

Query: 72  SYNKRKNIVELAVLR-----DCTVKPDS--RTPVLSSNTDSENRDGDIGWPGMMSIRVHE 124
            +N+++ +VE+ + +        +  D    + +     D+ N+     + G M+IR+HE
Sbjct: 644 RFNRKRMVVEMGIRQVDEEDHAILGKDGFFNSALRYLQNDTLNKTN--VFTGSMTIRIHE 701

Query: 125 LDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP-----DGCDDNGDAVAG 178
            DG  Y+H +     D +  +EIQ ++K   R+  K  +G +      +G  D      G
Sbjct: 702 ADGTPYEHIV--EIKDVFTKIEIQYNTKY--RKMKKDTEGPRLGNVSFEGLTDISQTSDG 757

Query: 179 LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPH 238
            +++   E    WIR D + E+L + H NQP  M+  QL++DGDV AQ  A+   E +  
Sbjct: 758 KELQMQNEG-FEWIRIDSDFEWLGKFHLNQPDYMYAAQLQQDGDVEAQVDALRYYEDVGV 816

Query: 239 LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG 298
            S    + L   + D K F+ VR+EA   LA +        G  +LV+ +K+        
Sbjct: 817 ESDVYCSILTRTILDEKYFYGVRVEACKTLACSVK------GAGYLVRIFKALFCLPGCD 870

Query: 299 LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW 358
           +P+ NDF +F+ YF+ + I  ++  +   D+++ +   +F+L +L+YN N+ N Y D  +
Sbjct: 871 IPKNNDFTNFTTYFLQKGIVESLGTI---DDENIQ---KFLLSILEYNSNDENKYDDTEY 924

Query: 359 LAALVQSVGELEFGQQSILFLSSLLKR--IDRLLQFDRLMPSYNGILTISCIR 409
           L  L+ +V +               KR  + RL+  ++ +PSY+  L I+  R
Sbjct: 925 LILLLNAVKD---------------KRDIVARLVNMEKWVPSYHDKLNIALQR 962


>gi|302506745|ref|XP_003015329.1| hypothetical protein ARB_06452 [Arthroderma benhamiae CBS 112371]
 gi|291178901|gb|EFE34689.1| hypothetical protein ARB_06452 [Arthroderma benhamiae CBS 112371]
          Length = 1265

 Score =  140 bits (353), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 108/416 (25%), Positives = 194/416 (46%), Gaps = 48/416 (11%)

Query: 1   MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +LE+++     +  +  IISR       G  P   L++  F+    ++G+ +   L  FF
Sbjct: 514 ILERRLSKASGKATVSRIISRIFLNARMGDLPNCELTSAHFQKLCERMGHTK---LDAFF 570

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
            +WV   GCP       +NK+K +VE+ + +  + +  +R    S+   +   E R+   
Sbjct: 571 SQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQSEQATARDLERSTFMRDVKEEIRNVYA 630

Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-------------A 153
           G     + G M+I++HE DG  Y+H +     +A    +I  ++K              A
Sbjct: 631 GTVQNVFTGSMTIQIHEADGTPYEHIV--EIKEAITKFDIPYNTKYKRLKRNRRQKERAA 688

Query: 154 ARRALKPKKGSKPD----------GCDDNGDAVAGLDMRSSMESPLS-----WIRADPEM 198
           A   + P   ++ D            D++ +    ++     ES +      WIR D + 
Sbjct: 689 AMSGIDPSSETQDDVLLYCLGDVLQSDEDMEKWRIVEWTKEDESRMGQESYEWIRMDADF 748

Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
           E++ ++    P  M+++QL++D DVVAQ ++I  + A    +  + +     + D + F 
Sbjct: 749 EWICKMSLGMPGYMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFH 807

Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
            +R  A  AL   A +E DW GL HL   ++      N  + R NDF D S Y V  A+ 
Sbjct: 808 GIRTAATKALVKHAKQEVDWIGLFHLENAFQELYCLPNSSMARSNDFSDRSSYAVQLALI 867

Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
            A++ VR  D ++P    +F+ + LK+NDN+ N YSD F+++ L++S+     G++
Sbjct: 868 EAISEVRDEDGRTPMRVKQFLYEKLKFNDNSNNEYSDCFYISTLLKSICNALLGRR 923


>gi|67522074|ref|XP_659098.1| hypothetical protein AN1494.2 [Aspergillus nidulans FGSC A4]
 gi|40744651|gb|EAA63807.1| hypothetical protein AN1494.2 [Aspergillus nidulans FGSC A4]
 gi|259486816|tpe|CBF84980.1| TPA: transcription initiation factor TFIID subunit TSM1/127kD,
            putative (AFU_orthologue; AFUA_8G04950) [Aspergillus
            nidulans FGSC A4]
          Length = 1282

 Score =  140 bits (352), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 132/542 (24%), Positives = 237/542 (43%), Gaps = 78/542 (14%)

Query: 1    MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +L++++     +  +  IISR       G  P   +++  F+    ++G+ +   L  FF
Sbjct: 508  ILDRRLTKASGKATMSRIISRLFLNARMGELPNGAVTSSVFQKTCERLGHAK---LDTFF 564

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRT---PVLSSNTDSENRDGDI 112
             +W+   GCP  +    +NK+K +VE+ + +  + +P +R         +   E R+   
Sbjct: 565  NQWIYGAGCPRFQATQRFNKKKLVVEMMIKQVQSDQPTARDLERDAFMRDVKEEIRNVYA 624

Query: 113  G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP 166
            G     + G M+IR+HE DG  Y+H +    G     +      K   R   + ++ +  
Sbjct: 625  GTIQPVFTGSMTIRIHEADGTPYEHIVEIKEGVTKFDIPYNTKYKRLKRNKRQKERAAAV 684

Query: 167  DGCDDNGDAVAGL---------------------------DMRSSMESPLSWIRADPEME 199
             G D N +    +                           + R   ES   WIR D + E
Sbjct: 685  IGADGNAEVQEDILLYCLGDVLQSEEEMQQWRLADWSKEDEERMGQES-YEWIRMDADFE 743

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNTLNNFLSDSKA 256
            ++ ++    P  M+++QL++D DVVAQ    Q +AA    P +S   + T    L D + 
Sbjct: 744  WICKLSLVMPGYMYLSQLQQDRDVVAQLESLQYMAAQREHPLISTIFLRT----LMDRRY 799

Query: 257  FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
            F+ +R+ AA AL   A EE +W GL HL K ++         + R N+F D + Y +   
Sbjct: 800  FYGIRVAAARALVKHAKEEINWLGLFHLEKAFQELFCLPGSPMTRSNNFSDRAAYVLQLV 859

Query: 317  IPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS-----VGELE- 370
            IP A++ VR    K+P     F+   LK+NDN+ N YSD F++A L++S     +G +E 
Sbjct: 860  IPEAISKVRDVSGKTPMRVKRFLYDKLKFNDNSNNEYSDNFYVATLMKSLCHAMLGRVES 919

Query: 371  -------FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
                   F   ++L            +  IDR  + D    S+  + + + +    Q+  
Sbjct: 920  RTDDLNNFDMDAVLEAQAEEQLEKDAIAEIDRYRRMDEWSSSFQNLYSRAALYCQMQL-- 977

Query: 417  KLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEP 476
             +   I+   V++ + P+    T   +R++A   L++L+   +     L  FI ++  +P
Sbjct: 978  -MQAKIAEVDVMQFL-PYTRVGTYDLLRLDAFECLVELDIFKS--PELLRWFIFTMSNDP 1033

Query: 477  SL 478
            S+
Sbjct: 1034 SI 1035


>gi|50304677|ref|XP_452294.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49641427|emb|CAH01145.1| KLLA0C02189p [Kluyveromyces lactis]
          Length = 1380

 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 116/447 (25%), Positives = 202/447 (45%), Gaps = 68/447 (15%)

Query: 30   TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV----- 84
            +LS+  F+    +V N  R  L+ FF +WV   G P+ R+   +NK++  VE+ +     
Sbjct: 617  SLSSTHFQRVCERV-NKNR--LETFFKQWVYGSGVPIFRITQRFNKKRMAVEMGIRQVQM 673

Query: 85   ----LRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGM-YDHPILPMAGD 139
                L     K +  +  +    D +     + + G M+IR+HE DG  Y+H +     D
Sbjct: 674  QELGLGKIVGKENFNSSAMEYLQDQDKSPTQV-FTGSMTIRIHEADGTPYEHIV--ELKD 730

Query: 140  AWQLLEIQCHSKLA-------ARRALKPKKGS--------------------------KP 166
             +  L++Q ++K           R  K  K                              
Sbjct: 731  VFTKLDVQYNTKYKRLKRRRKVNRVTKEAKEEINEDTQEEENANEDIVLINCMGDVLVSK 790

Query: 167  DGCD-----DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDG 221
            D C+     D      G D++   E+   WIR D + E+++++H NQP  M+ +QL++D 
Sbjct: 791  DDCNKWNLTDPMTTSEGDDLQQQNEA-FEWIRIDSDFEWISKVHINQPDYMFASQLQQDR 849

Query: 222  DVVAQAQAIAALEAL----PHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN---TASE 274
            DV AQ +++   E +       S    + L     D K F  VR+ A  AL+     A E
Sbjct: 850  DVEAQVESVRFFEDVILNGTSNSLIYSSILTRTAMDPKYFHGVRLAACRALSKFTIKAEE 909

Query: 275  ETDW-AGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPR 333
            + ++  G  HLV+ ++     E+  +P+ NDF D+ +YF+ +AIP  ++ V+  +   P 
Sbjct: 910  QQEFKGGPRHLVQIFRELFCYEDSNIPKNNDFSDYPKYFLQKAIPKLLSSVKNDEGVCPA 969

Query: 334  EAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQ-QSILFLSSLLKRIDRLLQF 392
               EF+L +++YNDN  N YSD F++  L++S+ +          ++  ++  I R    
Sbjct: 970  FVKEFLLDVVRYNDNTDNIYSDTFYITELLESLVDCSISDINDTKYIDEVMNEIQRFYNL 1029

Query: 393  DRLMPSYNGILTISCIRTLTQIALKLS 419
            D+ M SY  + T    +T+ +  LKL+
Sbjct: 1030 DQWMSSYQFLTT----KTIQKQRLKLA 1052


>gi|255945547|ref|XP_002563541.1| Pc20g10490 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211588276|emb|CAP86378.1| Pc20g10490 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1243

 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 150/609 (24%), Positives = 260/609 (42%), Gaps = 94/609 (15%)

Query: 1    MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +L++++     +  +  IISR       G  P   +++  F+    ++G+ +   L  FF
Sbjct: 508  ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAVTSSLFQKTCERLGHAK---LDVFF 564

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD------ 109
             +WV   GCP  +    +NK+K +VE+ ++R    +P +   +   + DS  RD      
Sbjct: 565  QQWVYGAGCPRFQATQRFNKKKLVVEM-MIRQVQSEPQAPRDL---HKDSFLRDVKEEVQ 620

Query: 110  ----GDIG--WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKK 162
                G +   + G M++R+HE DG  Y+H +    G      +I  ++K    + LK  K
Sbjct: 621  HVYAGPVQPVFTGSMTLRIHEADGTPYEHIVEIKEGVT--KFDIPYNTKY---KRLKRNK 675

Query: 163  GSKPDGCDDNGDAVAGLDM-----------------------------RSSMESPLSWIR 193
              +      +GDA    D+                             R   ES   WIR
Sbjct: 676  KQRERAVA-SGDAEVQEDVLLYCLGDVLQTEEETQAWRLADWTKEDEERMGQES-YEWIR 733

Query: 194  ADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQA---QAIAALEALPHLSFNVVNTLNNF 250
             D + E++ ++    P  M+++QL++D DVVAQ    Q +AA    P +S   V TL   
Sbjct: 734  MDADFEWICKLSLGMPGYMYLSQLQQDRDVVAQLESLQYMAAQREHPLISTIFVRTL--- 790

Query: 251  LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSE 310
              DS+ F  +R  AA AL   A +E DW GL HL K ++         + R NDF D + 
Sbjct: 791  -MDSRYFHGIRTTAARALVKHAKDEVDWIGLYHLEKAFQELFCLPGSPMTRSNDFSDRAA 849

Query: 311  YFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELE 370
            Y + + IP A++ VR    K+P     F+   LK+NDN+ N YSD F++A+L++S+    
Sbjct: 850  YILQQVIPEAISKVRDNSGKTPLRVKRFLFDKLKFNDNSNNEYSDNFYVASLMESLCHAM 909

Query: 371  FGQQSILFLSSL---------------------LKRIDRLLQFDRLMPSYNGILTISCIR 409
             G+        L                     +   DR  + D    S+  + + + ++
Sbjct: 910  LGKNEARTDEELDHFDMERVLEAQAEEQLDKDAIAEFDRYRRMDEWSSSFQNLYSRAALK 969

Query: 410  TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFI 469
               Q  L  +  I LD +++ + P+    T   +R++A + L+DL+   N       +F 
Sbjct: 970  --CQAQLMQAKMIDLD-LMRFV-PYTRAGTYDLLRIQAFQCLVDLDITQNPELLRWLMFT 1025

Query: 470  KSVEEEPSLRGQVKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAFNNVFLRH 529
             S +    +R Q  L +    + QI  G     E      + +     ++I  +++  R 
Sbjct: 1026 LSADSSAWIRHQ-SLRLLGQTLAQIAFGRPQEAEAAQTDGLIIEQESSTQIRRDDIARRQ 1084

Query: 530  HLFGILQIL 538
             + G ++ L
Sbjct: 1085 TIPGAIEAL 1093


>gi|302660060|ref|XP_003021714.1| hypothetical protein TRV_04172 [Trichophyton verrucosum HKI 0517]
 gi|291185624|gb|EFE41096.1| hypothetical protein TRV_04172 [Trichophyton verrucosum HKI 0517]
          Length = 1265

 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 108/416 (25%), Positives = 194/416 (46%), Gaps = 48/416 (11%)

Query: 1   MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +LE+++     +  +  IISR       G  P   L++  F+    ++G+ +   L  FF
Sbjct: 514 ILERRLSKASGKATVSRIISRIFLNARMGDLPNCELTSAHFQKLCERMGHTK---LDAFF 570

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
            +WV   GCP       +NK+K +VE+ + +  + +  +R    S+   +   E R+   
Sbjct: 571 SQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQSEQATARDLERSTFMRDVKEEIRNVYA 630

Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-------------A 153
           G     + G M+I++HE DG  Y+H +     +A    +I  ++K              A
Sbjct: 631 GTVQNVFTGSMTIQIHEADGTPYEHIV--EIKEAITKFDIPYNTKYKRLKRNRRQKERAA 688

Query: 154 ARRALKPKKGSKPD----------GCDDNGDAVAGLDMRSSMESPLS-----WIRADPEM 198
           A   + P   ++ D            D++ +    ++     ES +      WIR D + 
Sbjct: 689 AMSGIDPSSETQDDVLLYCLGDVLQSDEDMEKWRIVEWTKEDESRMGQESYEWIRMDADF 748

Query: 199 EYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFW 258
           E++ ++    P  M+++QL++D DVVAQ ++I  + A    +  + +     + D + F 
Sbjct: 749 EWICKMSLGMPGYMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFH 807

Query: 259 RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
            +R  A  AL   A +E DW GL HL   ++      N  + R NDF D S Y V  A+ 
Sbjct: 808 GIRTAATKALVKHAKQEVDWIGLFHLENAFQELYCLPNSSMARSNDFSDRSSYAVQLALI 867

Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
            A++ VR  D ++P    +F+ + LK+NDN+ N YSD F+++ L++S+     G++
Sbjct: 868 EAISEVRDEDGRTPIRVKQFLYEKLKFNDNSNNEYSDCFYISTLLKSICNALLGRR 923


>gi|212541210|ref|XP_002150760.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Talaromyces marneffei ATCC 18224]
 gi|210068059|gb|EEA22151.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Talaromyces marneffei ATCC 18224]
          Length = 1295

 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 144/536 (26%), Positives = 227/536 (42%), Gaps = 107/536 (19%)

Query: 1    MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +LE+++     +  +  IISR       G      +ST  F     K G+ +   L  FF
Sbjct: 511  ILERRLVKASGKPTVSRIISRLLLNNRMGDLTHGAVSTTSFIKLCEKFGHCK---LDVFF 567

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-- 113
             +WV   GCP       +NK+K +VE+ +      K     P    N +  +   D+   
Sbjct: 568  QQWVFGAGCPRFTASQRFNKKKLVVEIMI------KQSQGDPQPPKNLEKSSFLRDMKEE 621

Query: 114  ------------WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKP 160
                        + G M+IR+HE DG  Y+H +     +A    E+  ++K   +R  + 
Sbjct: 622  IRSIYAAPIQPVFTGSMTIRIHEADGTPYEHIV--EIKEAITKFEVPYNTKY--KRLKRN 677

Query: 161  KKGSKPDGCDDNGDAVAGL-----------------DM--------------RSSMESPL 189
            K+  +       GD+ A                   DM              R   ES  
Sbjct: 678  KRQKERVTHTAGGDSTAETQDDVLLYCLGDVLQTEEDMQEWRLSDWSKEEEERMGQES-Y 736

Query: 190  SWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQ---AQAIAALEALPHLSFNVVNT 246
             WIR D + E++ ++    P  M+++QL++D DVVAQ    Q +A  +A P +S   V T
Sbjct: 737  EWIRMDADFEWICKMSLGMPGYMYLSQLQQDRDVVAQLESMQYMAIQKAHPLISTIFVRT 796

Query: 247  LNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP-----R 301
            L     D + F  +R  AA AL   A EE DW GL HL      R F E   LP     R
Sbjct: 797  L----MDKRYFHGIRTAAANALVKHAKEELDWLGLYHL-----ERAFQELFCLPGTHVTR 847

Query: 302  PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
             NDF D + Y + +AIP A++ VR  +  +P     F+   L+++DN+ N YSD F++A+
Sbjct: 848  DNDFSDQASYILQKAIPEAISRVRDNNGNTPLRVKRFIFDKLRFSDNSNNEYSDNFYIAS 907

Query: 362  LVQSV--------------GELEFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYN 400
            L+ S+               E+EF  +  L           ++  IDR  + D    S+ 
Sbjct: 908  LMTSLVNAMEGRKAEASYSNEMEFDMERELERQAEEKLEQDVITEIDRYRRMDEWSSSFQ 967

Query: 401  GILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
             I + + +    Q+ L  S  + LD ++  ++  R   T   +RVEA   L++L+ 
Sbjct: 968  NIYSRTALH--CQLQLTKSKIMDLD-LIHFLQYTRS-GTFDMLRVEAFECLVELDL 1019


>gi|299756392|ref|XP_001829302.2| TATA-binding protein associated factor Taf2 [Coprinopsis cinerea
           okayama7#130]
 gi|298411655|gb|EAU92262.2| TATA-binding protein associated factor Taf2 [Coprinopsis cinerea
           okayama7#130]
          Length = 1794

 Score =  139 bits (350), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 92/299 (30%), Positives = 146/299 (48%), Gaps = 34/299 (11%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR D + E++A + F QP  MW++QL++D DVVAQ +AI AL   P  +  V +TL   
Sbjct: 621 WIRIDADFEWIANMKFEQPDFMWVSQLQRDRDVVAQLEAIYALAEKP--TAIVSSTLTKT 678

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPR 301
           +  S  ++RVR EAA AL + A    D+ GL HL K +    +D          +  +P+
Sbjct: 679 VLVSNYYYRVRCEAALALVSCAIRRLDFLGLFHLFKLFLRYCYDPEDPNQDLFTHKYVPK 738

Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL-- 359
           PN+F DF EYFV + +  A++ VR  +  +P    +F++  L+YNDN  NPY+D +++  
Sbjct: 739 PNNFSDFPEYFVRKTLITAISQVRFENGNTPSVVRQFLIDQLRYNDNTSNPYTDGYYICT 798

Query: 360 -----------------AALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
                              L+ S    E   + +  +      +DR    DRL+PS + I
Sbjct: 799 IIAAAACAIISMAPPERGELMPSDTSTELNAEDLELIKQTRIEVDRYRSMDRLIPSPHNI 858

Query: 403 LTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
           +TI+ +     +AL ++  I      K+  P        QVR+ A   L   +++   I
Sbjct: 859 VTIAALEFY--MALGVANLIPSHP--KVFFPLTREGNYTQVRIAAFDGLFMTKWYAPQI 913



 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 29/106 (27%), Positives = 48/106 (45%), Gaps = 16/106 (15%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           LST  F     KV  ++   L+ F  +W+   GCP      S+N++K  VE+ + ++   
Sbjct: 436 LSTHAFLRTCRKVSGVD---LRSFAEQWIYGSGCPSFGFSASFNRKKMAVEITMRQEA-- 490

Query: 91  KPDSRTPVLSSNTDSENRDGDIG----WPGMMSIRVHELDGM-YDH 131
                 P   +   +E     +     + G M+IR+HE DG  Y+H
Sbjct: 491 ------PAFKALEHNEIGKALLKPVPFFEGQMTIRIHEADGTPYEH 530


>gi|313228775|emb|CBY17926.1| unnamed protein product [Oikopleura dioica]
          Length = 984

 Score =  139 bits (350), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 122/458 (26%), Positives = 203/458 (44%), Gaps = 63/458 (13%)

Query: 13  KILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFS 72
           K+L N   R + +  +  +S+  F  F   V ++    +K+    WV   G   +   FS
Sbjct: 432 KLLANKSGRFEKSPQI--ISSVSFLKF---VASVTEKDIKDLLDLWVRRGGVANIVASFS 486

Query: 73  YNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHP 132
           + + KN +E+ + +D                    R G + + G M++RV ELDG ++H 
Sbjct: 487 FQRNKNTIEIEIKQD--------------------RSGSVRYVGPMTVRVQELDGTFNHT 526

Query: 133 ILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWI 192
           I     D    + I CHS     RA + K+   P     NG+ V     +S  +SP+ WI
Sbjct: 527 I--QVEDKTNKISINCHS-----RARRNKRKKIPLL---NGEEVDIDLSQSDQDSPVLWI 576

Query: 193 RADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLS 252
           R DP+   L  I   QP       ++ + +++ Q + I  L+  P  + N V +L   + 
Sbjct: 577 RLDPDCNLLRSIELRQPDYCLQYLVKYEREIIGQLEGIEHLKDFP--TENTVKSLKEAVE 634

Query: 253 DSKAFWRVRIEAAYALANTASEETDWA----GLLHLVKFYKSRRFDENIG---LPRPNDF 305
               FWRVR+ AA  +A  A +  D       LL + K Y      E +    L R NDF
Sbjct: 635 SLDFFWRVRVSAAEMMARIADQMPDPVPVKDALLSVFKGYFC--LSETVSKERLVRRNDF 692

Query: 306 RDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
            +F +YFV  AIP +++ +R +   +P     F+  LLK+NDN+ N Y D  W A L+ +
Sbjct: 693 SNFQQYFVQCAIPVSLSKIRLSSGMTPAGVFTFIANLLKHNDNSKNAYLDCQWKAYLITA 752

Query: 366 VGELEFGQQSILFLSS-------LLKRIDRLLQFDRLMPSYNG---ILTISCIRTLTQIA 415
           +            LSS       +++ + R++  + L+P+Y     I  ++ +RTL +  
Sbjct: 753 LSNTLCRAVVAQKLSSVSEEAKEVIEEVMRVMNTEWLLPTYKHVTFIAALTAVRTLQRQQ 812

Query: 416 LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
                   +    +L + F + +    VRV A + L+D
Sbjct: 813 -------HIPACPELFRRFSESDQYSTVRVAAIKCLVD 843


>gi|425781951|gb|EKV19885.1| Transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Penicillium digitatum PHI26]
 gi|425783990|gb|EKV21801.1| Transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Penicillium digitatum Pd1]
          Length = 1240

 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 138/523 (26%), Positives = 237/523 (45%), Gaps = 85/523 (16%)

Query: 1    MLEKQMGSNFFRKILQNIISR-----AQGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
            +L++++     +  +  IISR       G  P   +++  F+    ++G+ +   L  FF
Sbjct: 505  ILDRRLTKASGKATMSRIISRLFLNSRMGDIPNGAITSSLFQKTCERLGHAK---LDVFF 561

Query: 56   PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD------ 109
             +WV   GCP  +    +NK+K +VE+ ++R    +P    P    + DS  RD      
Sbjct: 562  QQWVYGAGCPRFQATQRFNKKKLVVEM-MIRQVQSEPQ---PPRDLDKDSFLRDVKEEVR 617

Query: 110  ----GDIG--WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAA-RRALKPK 161
                G +   + G M+IR+HE DG  Y+H +    G      +I  ++K    +R  K +
Sbjct: 618  HVYAGAVQPVFTGSMTIRIHEADGTPYEHIVEIKEGVT--KFDIPYNTKYKRLKRNKKQR 675

Query: 162  KGSKPDG-----------CDDN-------------GDAVAGLDMRSSMESPLSWIRADPE 197
            + +   G           C  +              D     + R   ES   WIR D +
Sbjct: 676  ERAVASGDAEVQEEVLLYCLGDVLQTEEETQAWRLADWTKEDEERMGQES-YEWIRMDAD 734

Query: 198  MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL---PHLSFNVVNTLNNFLSDS 254
             E++ ++    P  M+++QL++D DVVAQ +++  + A    P +S   V TL     DS
Sbjct: 735  FEWICKLSLGMPGYMYLSQLQQDRDVVAQLESLQYMTAQREHPLISTIFVRTL----MDS 790

Query: 255  KAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVL 314
            + F  +R  AA AL   A +E DW GL HL K ++         + R NDF D + Y + 
Sbjct: 791  RYFHGIRTTAARALVKHAKDEIDWVGLHHLEKAFQELFCLPGSPMTRSNDFSDRAAYILQ 850

Query: 315  EAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS-----VGEL 369
            + IP A++ VR    K+P     F+   LK+NDN+ N YSD F++A+L++S     +G+ 
Sbjct: 851  QVIPDAISKVRDNSGKTPLRVKRFLFDKLKFNDNSNNEYSDNFYVASLMKSLCHAMLGKS 910

Query: 370  E---------FGQQSILFLSS-------LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQ 413
            E         F  + +L + +        +   DR  + D    S+  + + + ++   Q
Sbjct: 911  EARTDEDLDHFDMERVLEVQAEEQLDKDAIAEFDRYRRMDEWSSSFQNLYSRAALQ--CQ 968

Query: 414  IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
              L  +  I LD +++ I P+    T   +R+EA + L+DL+ 
Sbjct: 969  AQLMQAKMIDLD-LMRFI-PYTRAGTYDLLRIEAFQCLVDLDI 1009


>gi|320031839|gb|EFW13797.1| TBP-associated factor [Coccidioides posadasii str. Silveira]
          Length = 1208

 Score =  139 bits (349), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 125/476 (26%), Positives = 209/476 (43%), Gaps = 90/476 (18%)

Query: 58  WVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTP------VLSSNTDSENRDGD 111
           WV   GCP       +NK+K +VE+ + +   V+ +  +P          +   E ++  
Sbjct: 510 WVYGAGCPRFVATQRFNKKKLVVEMMIRQ---VQSEQSSPRDLEKSTFMRDVKEELQNIY 566

Query: 112 IG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKG 163
            G     + G M+IR+HE DG  Y+H +     +A    +I  ++K     R   + ++ 
Sbjct: 567 AGSLQHVFTGSMTIRIHEADGTPYEHIV--EIKEAVTKFDIPYNTKYKRLKRNKRQRERA 624

Query: 164 SKPDGCDDNGDAVAGL------DMRSSMESPL--------------------SWIRADPE 197
           +   G D N +A   +      D+  S E  L                     WIR D +
Sbjct: 625 AASSGVDPNSEAQDDVLLYCLGDVLQSEEEILKWRLVEWTKEDEDRMGQESYEWIRMDAD 684

Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA-------IAALEALPHLSFNVVNTLNNF 250
            E++ ++    P  M+++QL++D DVVAQ +A       +AA   +  + +      +  
Sbjct: 685 FEWICKMSLTMPGYMYLSQLQQDRDVVAQLEASNSRVGFMAAANCIQSIQYMAAQREHPL 744

Query: 251 LS--------DSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRP 302
           +S        DS+ F  +R  AA+AL   A E+ DW GL HL   ++      N  + R 
Sbjct: 745 ISTIFARTLMDSRYFHGIRTAAAHALVKHAKEDVDWIGLFHLETAFQELFCLPNSPMTRS 804

Query: 303 NDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAAL 362
           N+F D S Y +  AIP+A+A VR  + + P     F+ + LK+NDN+ N +SD +++AAL
Sbjct: 805 NNFSDRSSYALQIAIPNAIAKVRNRNGRCPMRVKRFLYEKLKFNDNSNNEFSDCYYVAAL 864

Query: 363 VQSVG--------------------ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGI 402
           ++S+                     ELE   +  L+L  + + I+R  + D    S+  +
Sbjct: 865 MRSIANALIGGKESIHTKSKLDMTQELERQAEDQLYLDCIAE-INRYKRMDEWTSSFQNL 923

Query: 403 LT---ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
            T   + C+R L      +     LD  V    P+    T   VR++A R LLDL+
Sbjct: 924 YTRTALDCLRRL------MKEHRVLDFDVAQFLPYTRIGTFDLVRLDAFRILLDLD 973


>gi|409083644|gb|EKM84001.1| hypothetical protein AGABI1DRAFT_104011 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 1854

 Score =  138 bits (348), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 84/248 (33%), Positives = 130/248 (52%), Gaps = 32/248 (12%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR D + E++A I F+QP  MW++QL++D DVVAQ +A+ AL   P  +  V +TL   
Sbjct: 707 WIRMDADFEWIANIKFDQPDFMWVSQLQRDRDVVAQLEALHALAEKP--TAIVSSTLTKT 764

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---------ENIGLPR 301
           +  S  ++R+R EAA AL   A    D+ GL HL K +    +D          +  +P+
Sbjct: 765 VLVSNYYYRIRCEAALALVTCAIRRLDFLGLFHLFKLFLRYCYDPEDPTQDLFTHTYVPK 824

Query: 302 PNDFRDFSEYFV--LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWL 359
           PN+F D +EYFV   +++  A++ VR  + KSP    +F +  L+YNDN  NPYSD  ++
Sbjct: 825 PNNFSDLAEYFVRKYKSLITAISRVRFENGKSPSVVRQFFIDQLRYNDNTSNPYSDGLYI 884

Query: 360 AALVQSV------------GEL-------EFGQQSILFLSSLLKRIDRLLQFDRLMPSYN 400
             ++ +             GEL       E   +    +   +  IDR    DRL+PS +
Sbjct: 885 CTIISAAAYAAVSVAPPERGELLPAEIRSEQTAEDKELVRQTIAEIDRYRSMDRLIPSPH 944

Query: 401 GILTISCI 408
            I+TI+ +
Sbjct: 945 NIVTIAAL 952


>gi|449299494|gb|EMC95508.1| hypothetical protein BAUCODRAFT_123936 [Baudoinia compniacensis UAMH
            10762]
          Length = 1237

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 139/559 (24%), Positives = 260/559 (46%), Gaps = 100/559 (17%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L K  G+   ++I+  I+  A  G      LST +F+    K+G+ +   L+ FF +WV 
Sbjct: 544  LLKASGTTGVQRIINKILVNATTGTLENGELSTVDFQRQCEKLGHNK---LESFFRQWVF 600

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG--DIGWP--- 115
              GCP+ ++   +NK+K +VE+ +L+   ++  ++ P   +N   E ++   +I  P   
Sbjct: 601  GAGCPIFQVSQRFNKKKLVVEMTILQR-QIERQTKPPFAPNNFMREIKEHVQEIWAPETL 659

Query: 116  ----GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAA-RRALKPKKGSKPDGC 169
                G M+IR+HE DG  Y+H +     +    LEI  ++K    +R+ + K  +  +  
Sbjct: 660  PVFTGPMTIRIHEADGTPYEHIV--EIKEQTTKLEIPYNTKYKRLKRSRRQKDRAMAESN 717

Query: 170  DDNG-------------------------DAVAGLDMRSSMESPLSWIRADPEMEYLAEI 204
            +  G                         D  +  + R   ES   WIR D + E++ +I
Sbjct: 718  NGEGGEESLLYCLGDILDSPEECREWNLVDWTSEDEDRMGQES-YEWIRMDADFEWIGKI 776

Query: 205  HFNQPVQMWINQLEKDGDVVAQAQAIAALE-ALP-HLSFNVVNTLNNFLSDSKAFWRVRI 262
            H   P+ M+++QL++D D+VAQ  ++  L  A P H+SF++   L   L D + F  +R 
Sbjct: 777  HLTMPLYMYVSQLQQDRDLVAQYDSMRYLTLANPHHVSFSI---LVRTLMDHRYFHGIRA 833

Query: 263  EAAYALA---------NTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFV 313
             AA  LA         ++ ++E    G  HL+K ++         +PRPND+ + + + V
Sbjct: 834  MAADGLAILVARVDPQSSNADELRTIGQFHLMKAFEELFCFPGENMPRPNDWTNRANFIV 893

Query: 314  LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE----- 368
              AIP ++A +R  + K P     F +  L++NDN+ NP+SD  ++A L+  + +     
Sbjct: 894  QCAIPRSMAKLRDGEGKVPLAIRRFFMDKLRFNDNSDNPFSDCHYVATLLTCLADSLVVS 953

Query: 369  ---------LEFGQQSIL--------FLSSLLKRIDRLLQFDRLMPSYNGILTIS---CI 408
                       FG+   +          +  +  I+R  + D    +Y+ I +++   C+
Sbjct: 954  HREPQPTYTFSFGEDEPMEPENPDEPLEAEGINAIERYRRIDEWETTYHNIYSLTALECL 1013

Query: 409  RTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSL- 467
            + LT+  +      + D+  ++++  R  N    VRV A R L+++     G+   +S+ 
Sbjct: 1014 QKLTKAGM------AKDKTSEVLQYTRPANADL-VRVAAFRCLVEI-----GVARKMSMM 1061

Query: 468  --FIKSVEEEPS--LRGQV 482
               +  + +EPS  +RG++
Sbjct: 1062 AYLLHCIADEPSPFVRGRL 1080


>gi|294654543|ref|XP_002769993.1| DEHA2A06424p [Debaryomyces hansenii CBS767]
 gi|199428965|emb|CAR65370.1| DEHA2A06424p [Debaryomyces hansenii CBS767]
          Length = 1628

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 139/541 (25%), Positives = 247/541 (45%), Gaps = 60/541 (11%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV--LRDC 88
            LST+ F++   KV    R  L  FF +WV   G P+  +   +NK+++++E+++  ++  
Sbjct: 636  LSTQHFQYVCEKVN---RNKLDTFFKQWVFGSGVPIFNISQKFNKKRSMIEMSIRQVQQS 692

Query: 89   TVKPDSRTPVLSSNTDSENRDGDIGWP------GMMSIRVHELDGM-YDHPILPMAGDAW 141
              K     P    N    + D +  +       G M+IR+HE DG  Y+H I+ +  D  
Sbjct: 693  DTKKLHPNPESFINDSISHLDDEPSFSIQPVFTGPMTIRIHEADGTPYEH-IVDLK-DGH 750

Query: 142  QLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVA--------GLDMRSSMESPL---- 189
              L+IQ ++K    +  +             GD +         GL+     E  +    
Sbjct: 751  VKLDIQYNTKFKRLKKNREDLNEPNTTFSKLGDILTTPEEMDEWGLNEWVKHEEDMLYND 810

Query: 190  --SWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
               WIR D + E++A+I+  QP  M+ +QL+ D DV AQ +A+         +      L
Sbjct: 811  AFEWIRVDADYEWIAQINVKQPDYMFGSQLQYDRDVEAQYEAVRYFGDREKPNTLYCTAL 870

Query: 248  NNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRD 307
               + D + ++ +RI AA ALA+ +  + ++ G+ +L+K +K         +P  NDF +
Sbjct: 871  TRTVMDDRYYYGLRIAAAQALADISKPDNNFIGIGYLIKIFKELYCFPGSSIPLSNDFNN 930

Query: 308  FSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
            F+ +F+ + IP  ++ VR  + + P      +L L+K+N+N+ N + D F+++ L+ S+ 
Sbjct: 931  FNRFFLQKQIPQILSEVRDENGEVPFVIKNLLLNLVKFNENSNNNFQDCFYMSDLITSLT 990

Query: 368  ELEFGQQSIL---------------------FLSSLLKRIDRLLQFDRLMPSYNGILTIS 406
                  +S L                     FL  ++  I+RL + D  +PS+  I+ I+
Sbjct: 991  TSALNAKSALTSQDPLNLLDTSDSINSEKQKFLQDVITEINRLQKLDEWVPSHQSIIAIT 1050

Query: 407  CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALS 466
            CI+  ++I L   G + L     L      +   + +R+EA R LL L    N   S L 
Sbjct: 1051 CIK--SKIRLATHGLLQLSFEDLLYYTLEKYP--FDIRIEAFRGLLILGGLKNA--SVLQ 1104

Query: 467  LFIKSV---EEEPSLRGQ-VKLGIHAMRICQIKGGSDSNHEVDTVTLVALLNLLESRIAF 522
             ++K        P  RG+ + L I+A+ +  I G   +  + +  TL  L +   SR+A 
Sbjct: 1105 YYLKVCLLGATTPYFRGKLIDLLINAICVAAIDGTPSTLDDPEFKTLEKLFD-ENSRMAN 1163

Query: 523  N 523
            N
Sbjct: 1164 N 1164


>gi|452979536|gb|EME79298.1| hypothetical protein MYCFIDRAFT_34831, partial [Pseudocercospora
            fijiensis CIRAD86]
          Length = 1197

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 134/524 (25%), Positives = 239/524 (45%), Gaps = 88/524 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L K  GS    +I+  I   A+ GA     LST + +    K+G+ +   L+ FF +WV 
Sbjct: 525  LVKSSGSTGVMRIINKIFLNAKTGALTNCELSTIDLQRTCEKLGHNK---LESFFKQWVL 581

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG--DIGWP--- 115
              GCP+  +   +NK+K +VE+ +++   ++  ++     +N   E ++   ++  P   
Sbjct: 582  GSGCPIFHVSQRFNKKKLVVEMTIVQR-QLERQTKPEFAPNNFMREIKEHVQEVWAPEIQ 640

Query: 116  ----GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAA-RRALKPKKGSKPDGC 169
                G M+IR+HE DG  Y+H +     +    LEI  ++K    +R+ + K+ +  +G 
Sbjct: 641  TVFSGPMTIRIHEADGTPYEHIV--EIKEPITKLEIPYNTKYKRLKRSRRAKERAVAEGA 698

Query: 170  --DDNGDAV----------------------AGLDMRSSMESPLSWIRADPEMEYLAEIH 205
              ++  DA+                      +  D     +    WIR D + E++ ++H
Sbjct: 699  AGEEGSDALLYCLGDILDNEQDQQEWNLVDWSAEDEDKMGQESYEWIRMDADFEWIGKMH 758

Query: 206  FNQPVQMWINQLEKDGDVVAQAQAIAAL--EALPHLSFNVVNTLNNFLSDSKAFWRVRIE 263
               P+ M+I+QL++D D+VAQ +++  L      H+S  +   L   L D + F+ +R+ 
Sbjct: 759  LVMPLYMYISQLQQDRDLVAQYESMRYLLGSNPHHVSLTI---LVRTLMDERYFYGIRVM 815

Query: 264  AAYALANTASEETDWAGLLHLVKFYKSRRFDENI-GLPRPNDFRDFSEYFVLEAIPHAVA 322
            AA  L+  A ++    G LHL K +     D +I  +PRPNDF     + +  AIP A++
Sbjct: 816  AAEGLSLMAKDKLTEIGTLHLTKAFAHFFCDGDIKSMPRPNDFSSRVSFIMQCAIPRAMS 875

Query: 323  MVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE-------------- 368
             +R  + K P +   F +  L YNDN+ N YSD  ++A L+  + +              
Sbjct: 876  KLRDIEGKVPNKVQRFFVDKLIYNDNSDNAYSDCHYVATLMTCLTDALVASHRIIEPKQT 935

Query: 369  ----------LEFGQQSI----LFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQI 414
                      +E  Q+ I     F    +  I+R  + D  + +Y  I + + I  L + 
Sbjct: 936  FEFNFDDDMGMEEDQEPINPNAAFEQEAVNVIERYRRLDEWIVTYQNIYSTTAIECLQK- 994

Query: 415  ALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHC 458
             L  +G +  D+  +L++  RD +T   VR++A        FHC
Sbjct: 995  -LTKAGVVK-DKAKELLQYTRD-STADNVRIQA--------FHC 1027


>gi|378732524|gb|EHY58983.1| transcription initiation factor TFIID subunit D2 [Exophiala
           dermatitidis NIH/UT8656]
          Length = 1299

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 110/409 (26%), Positives = 188/409 (45%), Gaps = 48/409 (11%)

Query: 2   LEKQMGSNFFRKILQNIISRAQGASPVR-TLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           + K  G++    IL  I++RA+    V   LST+ F+  A ++G+++   + +F  +WV 
Sbjct: 517 IAKTQGASKMPGILAKILTRARTGDLVNNALSTELFQKLAERMGHMK---IDDFLNQWVK 573

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRD--------GDI 112
             GCP  +    +NK+K +VE+ + +        R   + S       D           
Sbjct: 574 GAGCPRFQAFQRFNKKKLVVEMLIKQVQGTTTTERELEIDSFMRDAREDFQSVYAAPAQP 633

Query: 113 GWPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKL-----AARRALKPKKGSKP 166
            + G M+IR+HE DG  Y+H +     DA    +I  ++K      + ++  +    +  
Sbjct: 634 VFTGPMTIRIHEADGTPYEHIV--EIKDASTTFDIPYNTKYKRLKRSRKQRARATAKAAA 691

Query: 167 DGCDDNGDAV----------------------AGLDMRSSMESPLSWIRADPEMEYLAEI 204
           +G D+  D++                      +G D          WIR D + E++  I
Sbjct: 692 EGGDEENDSLVYCLGDVLQSEEEIANWKITEWSGEDEEKMNAESYEWIRLDADFEWICRI 751

Query: 205 HFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEA 264
           +   P  M+ +QL++D DVVAQ +A+  +   P     V +     L D + F  VR  A
Sbjct: 752 NLQMPGYMFTSQLQQDRDVVAQLEALRHIAVYPPSPL-VSSIFVRTLMDRRYFHGVRAMA 810

Query: 265 AYALANTASEETDWA--GLLHLVKFYKSR---RFDENIGLPRPNDFRDFSEYFVLEAIPH 319
           A+ L   A +E D    GL HL K ++     R ++   + RPNDF D   Y +   I  
Sbjct: 811 AHCLVKHAKDEGDAKNIGLFHLKKAFEELYCVRDEQGSAMTRPNDFSDQLSYMMQCEIVG 870

Query: 320 AVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE 368
           A+  VR +   +P+E  EF+L  L++NDN+ N YSD F++A L++++ +
Sbjct: 871 AIGRVRDSRGHTPKEVKEFLLDKLRFNDNSVNEYSDAFYVARLMKALAQ 919


>gi|296806237|ref|XP_002843928.1| transcription initiation factor TFIId 127kD subunit [Arthroderma
           otae CBS 113480]
 gi|238845230|gb|EEQ34892.1| transcription initiation factor TFIId 127kD subunit [Arthroderma
           otae CBS 113480]
          Length = 1265

 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 111/417 (26%), Positives = 191/417 (45%), Gaps = 50/417 (11%)

Query: 1   MLEKQMGSNFFRKILQNIISRA-----QGASPVRTLSTKEFRHFANKVGNLERPFLKEFF 55
           +LE+++     +  +  IISR       G  P   L++  F+    ++G+ +   L  FF
Sbjct: 514 ILERRLSKASGKATVSRIISRIFLNARMGDLPNCELTSAYFQKLCERMGHTK---LDAFF 570

Query: 56  PRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSS---NTDSENRDGDI 112
            +WV   GCP       +NK+K +VE+ + +  T    +R    S+   +   E R+   
Sbjct: 571 NQWVYGAGCPRFTASQRFNKKKLVVEMMISQVQTEHMSARDLERSTFMRDVKEEIRNIYA 630

Query: 113 G-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGS 164
           G     + G M+I++HE DG  Y+H +     +A    +I  ++K     R   + ++ +
Sbjct: 631 GTVQHVFTGSMTIQIHEADGTPYEHIV--EIKEAVTKFDIPYNTKYKRLKRNRRQKERAA 688

Query: 165 KPDGCDDNGDAVAGL-------------DM--------------RSSMESPLSWIRADPE 197
              G D N +A   +             DM              R   ES   WIR D +
Sbjct: 689 AMSGIDPNSEAQDDVLLYCLGDVLQSDEDMEKWKLVEWTKEDESRMGQES-YEWIRMDAD 747

Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
            E++ ++    P  M+++QL++D DVVAQ ++I  + A    +  +       + D + F
Sbjct: 748 FEWICKMSLGMPGYMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISTIFVRTILDRRYF 806

Query: 258 WRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAI 317
             +R  A  AL   A +E DW GL HL   ++      N  + R NDF D + Y V  A+
Sbjct: 807 HGIRTAATKALVKHAKQEVDWIGLFHLENAFQELYCLPNSTMVRSNDFSDRASYAVQLAL 866

Query: 318 PHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
             A++ VR  D K+P    +F+   LK+NDN+ N +SD F+++ L++S+     G++
Sbjct: 867 IEAISEVRDDDGKTPVRVKQFLYDKLKFNDNSNNEFSDCFYISTLLKSICNALLGKR 923


>gi|322703910|gb|EFY95511.1| transcription initiation factor TFIID subunit TSM1, putative
            [Metarhizium anisopliae ARSEF 23]
          Length = 1457

 Score =  135 bits (340), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 144/519 (27%), Positives = 233/519 (44%), Gaps = 78/519 (15%)

Query: 1    MLEKQM----GSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEF 54
            +L+K+M    G +   +ILQ  +++AQ  S  R+  L T++FR    K G+  R  L+ F
Sbjct: 542  ILDKRMIKASGGHGLTRILQKFLTKAQIESSDRSTILDTEKFRAVCEK-GSRYR--LEAF 598

Query: 55   FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSN----TDSENRDG 110
            + +WV   GCP   +   +NK++  VEL  L     +   RT  L  +       E R G
Sbjct: 599  WNQWVYGSGCPRFEVKAKFNKKRLCVEL-TLNQIQHQAAKRTGGLEKDDFLRVVKERRAG 657

Query: 111  -DIG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRAL 158
              +G     + G M++R+HE DG  Y+H IL +  DA +  + +      + +L   R +
Sbjct: 658  LKLGEVQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRM 716

Query: 159  KPKKGSK-----PDGCDDN-----GDAVAGLD--------------MRSSMESPLSWIRA 194
            K K          +  DD      GD +   D               R   +    WIR 
Sbjct: 717  KEKHSGGGSMEITENADDALLYCLGDVLQTPDDHRDWELIEWDPETERKMDQESYEWIRV 776

Query: 195  DPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDS 254
            D + E+  ++       M+++QL++D DVVAQ  A+  L   P L       L   L D 
Sbjct: 777  DADFEWACDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTHGP-LHPIASGFLVRTLIDR 835

Query: 255  KAFWRVRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFV 313
            + F  +R  AA AL   ++ +     GL  L++ ++     E    P PNDF D  +Y V
Sbjct: 836  RYFHGIRTMAAEALPRQSNIKGIPLLGLRQLMRAFREMFCYEQTNQPIPNDFTDKKQYNV 895

Query: 314  LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF-- 371
              AI  A++ VR    + P EA +F+L  L +N+N  NPYSD F++A LV+++       
Sbjct: 896  RRAIIKAISEVRDHTYRCPLEARQFILNQLLFNNNEDNPYSDHFYIATLVEALATSLIPS 955

Query: 372  -------------GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI---RTLTQIA 415
                          ++  LFL   +++I+R+L+ D    SY  + TI+ +   + L +  
Sbjct: 956  KKDDWLSRQSKAPDEEERLFLEKAMEQIERVLRRDEWTNSYQNVWTIAGLDAKQRLMKAE 1015

Query: 416  LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +    +    Q   L+   RD      +R++A  AL+DL
Sbjct: 1016 VIPKSYPDFAQY--LLDGSRDL-----IRIKAFEALIDL 1047


>gi|254569566|ref|XP_002491893.1| TFIID subunit (150 kDa), involved in RNA polymerase II
           transcription initiation [Komagataella pastoris GS115]
 gi|238031690|emb|CAY69613.1| TFIID subunit (150 kDa), involved in RNA polymerase II
           transcription initiation [Komagataella pastoris GS115]
          Length = 1571

 Score =  135 bits (340), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 183/388 (47%), Gaps = 58/388 (14%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV------ 84
           LST  F+H   KV + +   L  FF  WV   G PVLR+   +NK++  +E+ +      
Sbjct: 609 LSTSHFQHVCEKVAHHK---LDPFFNNWVHNSGTPVLRVTQRFNKKRMFIEMGIRQVQGY 665

Query: 85  --LRDCTVKPDSRTPV--LSSNTDSENRDG----DIGWPGMMSIRVHELDGM-YDHPILP 135
              R    K   + PV  L+      N+ G       + G M+IR+HE DG  Y+H +  
Sbjct: 666 ELARSDASKSRKKDPVSFLNEACQHVNQTGPGITSQIFTGPMTIRIHEADGTPYEHIVDL 725

Query: 136 MAGDAWQLLEIQCHSKLAARRALK-----PKKGSKPDGCDDNGDAVA-GLDMR------- 182
             G  +  L+IQ ++K    +  K      KK    +  +  GD ++   DM+       
Sbjct: 726 KEG--FTKLDIQYNTKYKRIKKNKKDEELTKKDDSENTVNRLGDILSRSKDMQEWHLEDL 783

Query: 183 ------SSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEA- 235
                 +  +    WIR D + E++ +IH NQP  M+ +QL++D DV AQ  ++      
Sbjct: 784 SAEGEEARTQDAFEWIRIDADFEWICQIHLNQPDYMYQSQLQQDRDVEAQLDSVNFFSNS 843

Query: 236 -LPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVK------FY 288
             P++ ++ V  L   L DS+ F+ VR EAA  LA  + EE +  GL HL+K       Y
Sbjct: 844 LRPNVFYSSV--LVRTLMDSRYFYGVRCEAAKGLARLSKEENNHIGLHHLLKALKQLFCY 901

Query: 289 KSRRFDENIG---------LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFV 339
                 E+           LP PNDF DF   F+ E IP A++ ++  D+         +
Sbjct: 902 PMHDSHEDATTLINNPKNFLPLPNDFSDFQRMFIQETIPAALSTIKIKDSDLFLNLRRIL 961

Query: 340 LQLLKYNDNNGNPYSDVFWLAALVQSVG 367
           L+LL+YNDN  N ++D F++ +L++++ 
Sbjct: 962 LRLLQYNDNLSNDFNDCFYICSLIRALS 989


>gi|340517026|gb|EGR47272.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1532

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 128/462 (27%), Positives = 207/462 (44%), Gaps = 63/462 (13%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            L K  G +   +ILQ  ++RAQ  G+     L T++FR    K        L+ F+ +WV
Sbjct: 565  LSKASGGHGLTRILQKQLTRAQIEGSDKFLILDTEKFRATCEKAAKYR---LESFWNQWV 621

Query: 60   GTCGCPVLRMGFSYNKRKNIVEL-------AVLRDCTVKPDSRTPVLSSNTDSENRDGDI 112
               GCP   +   +NK++  VEL         ++   +  D    V+     +  + G+I
Sbjct: 622  YGSGCPRFDVKAKFNKKRLCVELMLIQVQHQTIKKMELSKDQFLRVVRERR-AHLKTGEI 680

Query: 113  G--WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKKG- 163
               + G M++R+HE DG  Y+H IL +  DA +  + +      + +L   R +K K   
Sbjct: 681  QPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEKHNV 739

Query: 164  ----SKPDGCDDN-----GDAVAGLD--------------MRSSMESPLSWIRADPEMEY 200
                  P+  DD      GD +   +               R   +    WIR D + E+
Sbjct: 740  GASMEIPENTDDALLYCLGDVLQSPEDLRDWELIEWDPETERKMDQESYEWIRVDADFEW 799

Query: 201  LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRV 260
               ++      M+++QL++D DVVAQ  A+  L   P L       L   L D + F  +
Sbjct: 800  ACSMNRTLEPYMYVSQLQQDRDVVAQQDAMLYLSTGP-LHPIASGFLVRTLVDRRYFHGI 858

Query: 261  RIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPH 319
            R  AA AL   A+ ++    GL  L++ Y+     +    P+PNDF D  +Y V  +I  
Sbjct: 859  RTMAAQALTRQANIKDIPMLGLRQLMRAYREMFCYDQTNQPQPNDFSDKRQYNVRCSIIR 918

Query: 320  AVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV--------GELEF 371
            A+A VR      P EA +F+L  L +N+N  NP+SD  ++A LV+++         +  F
Sbjct: 919  AIAEVRDGAGYCPVEARQFILDQLLFNNNENNPFSDHHYIAVLVEALATSLIPTQADEWF 978

Query: 372  GQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTIS 406
             +Q  +       FL   L++I+R+L+ D    SY  I TI+
Sbjct: 979  KRQVKIPSDEERHFLDQALEQIERVLRRDEWTSSYQNIWTIA 1020


>gi|342883575|gb|EGU84038.1| hypothetical protein FOXB_05458 [Fusarium oxysporum Fo5176]
          Length = 1499

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 145/517 (28%), Positives = 232/517 (44%), Gaps = 81/517 (15%)

Query: 2    LEKQMGSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            L K  G +   +IL  ++++ Q  S  R   L T++FR    K        L+ F+ +WV
Sbjct: 551  LIKASGGHGLTRILSKLLTKVQIESSDRATILETEKFRTTCEKGAKYR---LESFWSQWV 607

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT--------DSENRDGD 111
               GCP   +   +NK++  VEL  L     +  ++ P L  N          S  + G+
Sbjct: 608  YGSGCPRFDVKAKFNKKRLCVEL-TLNQIQFQ-TAKKPPLDKNDFLRVVKERRSGVKPGE 665

Query: 112  IG--WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKK- 162
            +   + G M++R+HE DG  Y+H IL +  DA +  + +      + +L   R +K K+ 
Sbjct: 666  VQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEKQN 724

Query: 163  -GSKPDG--CDDN-----GDAVAGLDMRSSME--------------SPLSWIRADPEMEY 200
             G+  D    DD      GD +   D ++  E                  WIR D + E+
Sbjct: 725  VGASMDAENLDDALLYCLGDVLQTPDEQAQWELIDWDPETERKMDQESYEWIRVDADFEW 784

Query: 201  LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRV 260
              ++       M+++QL++D DVVAQ  A+  L   P L       L   L D + F  +
Sbjct: 785  SCDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTHGP-LHPIASGFLTRTLVDRRYFHGI 843

Query: 261  RIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPH 319
            R  AA AL   A+ ++    GL  L+K +      +    P+PNDF D  +Y V  AI  
Sbjct: 844  RTMAAEALPRQANIKDLSMLGLRQLMKAFSEMFCHKGTNQPKPNDFSDKKQYNVQCAIIK 903

Query: 320  AVAMVRAADN-KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------ELE 370
            A+A VR A   K P +A +F+L  L +N+N  NPYSD F++A LV+++         +  
Sbjct: 904  AIAQVRDAHTYKCPLDARQFILDQLLFNNNEDNPYSDHFYIATLVEALATSLIPSKQDDW 963

Query: 371  FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFIS 423
            F  Q+ +       FL   +++I+R+L+ D    SY  + TI+           LS    
Sbjct: 964  FAMQNKIPNEDEKQFLERAIEQIERVLRRDEWTHSYQNVWTIAG----------LSAKQR 1013

Query: 424  LDQVVKLIKPFRDF------NTIWQVRVEASRALLDL 454
            L +   + K + DF       T   +R++A  AL+DL
Sbjct: 1014 LMKAEVIPKKYSDFGQYLLDGTRDLIRIKAFEALIDL 1050


>gi|380478690|emb|CCF43451.1| transcription initiation factor TFIId 127kD subunit [Colletotrichum
           higginsianum]
          Length = 702

 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 107/375 (28%), Positives = 176/375 (46%), Gaps = 45/375 (12%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           ++T+ F+    K     R  L+ F+ +WV   GCP   +   +NK++  VE+ + R    
Sbjct: 332 INTESFQRVCEKNA---RNKLEAFWAQWVYGSGCPRFDVFQRFNKKRLCVEMTI-RQVQD 387

Query: 91  KPDSRTPVLSSN---------TDSENRDGDIG-WPGMMSIRVHELDGM-YDHPILPMAGD 139
           +  S + VL             +    D  +G + G M+IR+HE DG  Y+H I+ +  D
Sbjct: 388 RALSESEVLKKTEFLRAIKEKANGVQVDEAMGLFTGPMTIRIHEADGTPYEH-IVEIRED 446

Query: 140 AWQLLEIQC---------HSKLAARRALKPKKGSKPDGCDDN-----GDAVAG-LDMR-- 182
           A +  + +            K  A   L  +  + PD  ++      GD +    DMR  
Sbjct: 447 AAKAAKFEIPYNTKYKRLKRKRRATEKLSAQSAADPDNNEETLLYCLGDVLTNPEDMRKW 506

Query: 183 -----------SSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
                      +  +    WIR D + E+  E+  N    M+++QL++D DVVAQ  ++ 
Sbjct: 507 EFSEWEPEIEKAMDQESYEWIRMDADFEWACEMKTNLQSYMYVSQLQQDRDVVAQQDSML 566

Query: 232 ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR 291
            L+  P     V   L   L D + F  +R  AA  L + A+E+   AG+ HL+K +   
Sbjct: 567 YLKKSPPHPL-VSTFLTRTLVDRRYFHGIRTMAADMLYHHATEKVAMAGMTHLLKAFTEL 625

Query: 292 RFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGN 351
               +   PRPNDF D  +Y V  A+P A+A +R A+ + P  A   +L  +++N+N  N
Sbjct: 626 FCYPDSATPRPNDFTDKKQYLVQSALPMAIAKIRDANGRCPEPARRLLLNQVQFNNNANN 685

Query: 352 PYSDVFWLAALVQSV 366
           PYSD F++A L+++V
Sbjct: 686 PYSDHFYVAKLLEAV 700


>gi|389637227|ref|XP_003716252.1| TATA-binding protein associated factor Taf2 [Magnaporthe oryzae
            70-15]
 gi|351642071|gb|EHA49933.1| TATA-binding protein associated factor Taf2 [Magnaporthe oryzae
            70-15]
          Length = 1579

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 136/536 (25%), Positives = 242/536 (45%), Gaps = 87/536 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRAQG-ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            + K+   N  R+++  ++SR+   AS VR +S++ FR    KV       L+ F+ +WV 
Sbjct: 519  MTKKASHNGVRRVVLRLVSRSNTEASDVRFISSESFRKQCEKVHKGAP--LELFWQQWVQ 576

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT------------DSENR 108
            + GC    M   YNK+ + V+L V R    K     P  + N              ++  
Sbjct: 577  SAGCARFDMSQRYNKKHSRVDLTV-RQPEEKSQLPQPPFNKNNFWGVLQDGRHVPAAKKE 635

Query: 109  DGDIGWPGMMSIRVHELDGM-YDH--PILPMAGDAWQLLEIQCHSKLAA-RRALKPKKGS 164
            +  + + G ++I +HE +G  Y+H   I       ++ +E+  ++K    +R  + K+ +
Sbjct: 636  NVSVWFTGPITIGIHEANGTPYEHVAEIQRTGPRQFKAIEMPYNTKYKRLKRTRRQKERA 695

Query: 165  KP-----DGCDDNGDAVA----------------GL------DMRSSMESPLSWIRADPE 197
            +      +G +   +AV                 GL       +++  +    W+R D +
Sbjct: 696  QALVQAGEGGEQAEEAVLYSLGDVLQSPEEEKEWGLADWTPEQVKAMEQDSFEWLRIDAD 755

Query: 198  MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
             E++  +  N    M+++QL++D DVVAQ +++  L         +   L   L D++ +
Sbjct: 756  FEWICSMKTNMQPWMYVSQLQQDRDVVAQQESVLWLSQKAGGEI-LATVLGRTLMDTRYY 814

Query: 258  WRVRIEAAYALAN---TASEE--------------TDWAGLLHLVKFYKSRRFDENIGLP 300
              +R +A  AL     T  EE              TD  GL  L+K +K+     +   P
Sbjct: 815  HGIRTQAIKALQKQVWTKEEESLDPDSGIVNKLKKTDAGGLRILMKAFKAFFCYPDSQTP 874

Query: 301  RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
            RPNDF +  +Y V  AIP A+A VR    + PREA +F+L+ L++N+N GN YSD F++A
Sbjct: 875  RPNDFSNKMQYAVQRAIPEAIARVREK-GRCPREAQQFLLEQLQFNNNTGNEYSDHFYVA 933

Query: 361  ALVQSVGELEFGQQSI-----------------LFLSSLLKRIDRLLQFDRLMPSYNGIL 403
             L+ ++        S+                  FL + L+ I+R L+ D    SY  I 
Sbjct: 934  GLLDALATSLIPDNSVETVTLRNSDDPETALDEEFLEAALQMIERYLRMDEWTNSYQNIW 993

Query: 404  TISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCN 459
            T++ +    +  L  +G I +   +  ++  +D +T+  + ++A  AL+DL F  N
Sbjct: 994  TVTALGCKQK--LMKAGVIPIS-AIDFVRYLQD-STLDLISIKAFEALVDLGFLFN 1045


>gi|328351608|emb|CCA38007.1| Transcription initiation factor TFIID subunit 2 [Komagataella
            pastoris CBS 7435]
          Length = 2310

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 183/388 (47%), Gaps = 58/388 (14%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAV------ 84
            LST  F+H   KV + +   L  FF  WV   G PVLR+   +NK++  +E+ +      
Sbjct: 1348 LSTSHFQHVCEKVAHHK---LDPFFNNWVHNSGTPVLRVTQRFNKKRMFIEMGIRQVQGY 1404

Query: 85   --LRDCTVKPDSRTPV--LSSNTDSENRDG----DIGWPGMMSIRVHELDGM-YDHPILP 135
               R    K   + PV  L+      N+ G       + G M+IR+HE DG  Y+H +  
Sbjct: 1405 ELARSDASKSRKKDPVSFLNEACQHVNQTGPGITSQIFTGPMTIRIHEADGTPYEHIVDL 1464

Query: 136  MAGDAWQLLEIQCHSKLAARRALK-----PKKGSKPDGCDDNGDAVA-GLDMR------- 182
              G  +  L+IQ ++K    +  K      KK    +  +  GD ++   DM+       
Sbjct: 1465 KEG--FTKLDIQYNTKYKRIKKNKKDEELTKKDDSENTVNRLGDILSRSKDMQEWHLEDL 1522

Query: 183  ------SSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEA- 235
                  +  +    WIR D + E++ +IH NQP  M+ +QL++D DV AQ  ++      
Sbjct: 1523 SAEGEEARTQDAFEWIRIDADFEWICQIHLNQPDYMYQSQLQQDRDVEAQLDSVNFFSNS 1582

Query: 236  -LPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVK------FY 288
              P++ ++ V  L   L DS+ F+ VR EAA  LA  + EE +  GL HL+K       Y
Sbjct: 1583 LRPNVFYSSV--LVRTLMDSRYFYGVRCEAAKGLARLSKEENNHIGLHHLLKALKQLFCY 1640

Query: 289  KSRRFDENIG---------LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFV 339
                  E+           LP PNDF DF   F+ E IP A++ ++  D+         +
Sbjct: 1641 PMHDSHEDATTLINNPKNFLPLPNDFSDFQRMFIQETIPAALSTIKIKDSDLFLNLRRIL 1700

Query: 340  LQLLKYNDNNGNPYSDVFWLAALVQSVG 367
            L+LL+YNDN  N ++D F++ +L++++ 
Sbjct: 1701 LRLLQYNDNLSNDFNDCFYICSLIRALS 1728


>gi|46135781|ref|XP_389582.1| hypothetical protein FG09406.1 [Gibberella zeae PH-1]
          Length = 1505

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 130/466 (27%), Positives = 214/466 (45%), Gaps = 67/466 (14%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            L K  G +   +IL  ++++ Q  G+     L T++FR    K        L+ F+ +WV
Sbjct: 551  LIKASGGHGLTRILSKMLTKVQIEGSDKATILETEKFRATCEKGAKYR---LESFWNQWV 607

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAV--LRDCTVKPDSRTPVLSSN---TDSENRDGDIG- 113
               GCP   +   +NK++  VEL +  ++   VK   + P+   +      E R G    
Sbjct: 608  YGSGCPRFDVKAKFNKKRLCVELTLNQIQSMMVK---KAPLEKEDFLRVVKERRSGVKAG 664

Query: 114  -----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKK 162
                 + G M++R+HE DG  Y+H IL +  DA +  + +      + +L   R +K K+
Sbjct: 665  EVQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEKQ 723

Query: 163  --GSKPDG--CDDN-----GDAVAGLDMRSSME--------------SPLSWIRADPEME 199
              G+  D    DD      GD +   + ++  E                  WIR D + E
Sbjct: 724  NVGASMDAENLDDTLLYCLGDVLQTPEEQAQWELIDWDPETERKMDQESYEWIRVDADFE 783

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
            +  ++       M+++QL++D DVVAQ  A+  L   P L       L   L D + F  
Sbjct: 784  WSCDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTQGP-LHPIASGFLTRTLVDRRYFHG 842

Query: 260  VRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
            +R  AA AL   A+ ++    GL  L+K +      +    P+PNDF D  +Y V  AI 
Sbjct: 843  IRTMAAEALPRQANIKDLSMLGLRQLMKAFSEMFCHKGTNQPKPNDFSDKRQYNVQCAII 902

Query: 319  HAVAMVRAADN-KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------EL 369
             A+A VR A+  K P EA +F+L  L +N+N  NP+SD F++A LV+++         + 
Sbjct: 903  KAIAQVRDANTYKCPLEARQFILDQLLFNNNEDNPFSDHFYIATLVEALATSLIPSKQDD 962

Query: 370  EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
             F  Q+ +       FL   +++I+R+L+ D    SY  + TI+ +
Sbjct: 963  WFAMQNKIPNEEEKQFLEKAIEQIERVLRRDEWTHSYQNVWTIAGL 1008


>gi|408392483|gb|EKJ71837.1| hypothetical protein FPSE_07938 [Fusarium pseudograminearum CS3096]
          Length = 1505

 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 130/466 (27%), Positives = 214/466 (45%), Gaps = 67/466 (14%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            L K  G +   +IL  ++++ Q  G+     L T++FR    K        L+ F+ +WV
Sbjct: 551  LIKASGGHGLTRILSKMLTKVQIEGSDKATILETEKFRATCEKGAKYR---LESFWNQWV 607

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAV--LRDCTVKPDSRTPVLSSN---TDSENRDGDIG- 113
               GCP   +   +NK++  VEL +  ++   VK   + P+   +      E R G    
Sbjct: 608  YGSGCPRFDVKAKFNKKRLCVELTLNQIQSMMVK---KAPLEKEDFLRVVKERRSGVKAS 664

Query: 114  -----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKK 162
                 + G M++R+HE DG  Y+H IL +  DA +  + +      + +L   R +K K+
Sbjct: 665  EVQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEKQ 723

Query: 163  --GSKPDG--CDDN-----GDAVAGLDMRSSME--------------SPLSWIRADPEME 199
              G+  D    DD      GD +   + ++  E                  WIR D + E
Sbjct: 724  NVGASMDAENLDDTLLYCLGDVLQTPEEQAQWELIDWDPETERKMDQESYEWIRVDADFE 783

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
            +  ++       M+++QL++D DVVAQ  A+  L   P L       L   L D + F  
Sbjct: 784  WSCDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTQGP-LHPIASGFLTRTLVDRRYFHG 842

Query: 260  VRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
            +R  AA AL   A+ ++    GL  L+K +      +    P+PNDF D  +Y V  AI 
Sbjct: 843  IRTMAAEALPRQANIKDLSMLGLRQLMKAFSEMFCHKGTNQPKPNDFSDKRQYNVQCAII 902

Query: 319  HAVAMVRAADN-KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG--------EL 369
             A+A VR A+  K P EA +F+L  L +N+N  NP+SD F++A LV+++         + 
Sbjct: 903  KAIAQVRDANTYKCPLEARQFILDQLLFNNNEDNPFSDHFYIATLVEALATSLIPSKQDD 962

Query: 370  EFGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
             F  Q+ +       FL   +++I+R+L+ D    SY  + TI+ +
Sbjct: 963  WFAMQNKIPNEEEKQFLEKAIEQIERVLRRDEWTHSYQNVWTIAGL 1008


>gi|429860015|gb|ELA34770.1| transcription initiation factor tfiid 127kd subunit [Colletotrichum
            gloeosporioides Nara gc5]
          Length = 1551

 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 133/508 (26%), Positives = 217/508 (42%), Gaps = 69/508 (13%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVEL-------- 82
            ++T+ F+    K     R  L+ F+ +WV   GCP   +   +NK++  VE+        
Sbjct: 607  INTESFQRVCEKNA---RNKLEAFWGQWVYGSGCPRFDVFQRFNKKRLCVEMNIRQVQDK 663

Query: 83   AVLRDCTVKPDSRTPVLSSNTDSENRDGDIG-WPGMMSIRVHELDGM-YDHPILPMAGDA 140
            A+    T+K       +         +  IG + G M+IR+HE DG  Y+H I+ +  DA
Sbjct: 664  ALAEAQTLKKTEFLRAIKEKYHGVKTEETIGLFTGPMTIRIHEADGTPYEH-IVEIREDA 722

Query: 141  WQLLEIQC---------HSKLAARRALKPKKGSKPDGCDDN-----GDAVAGL------- 179
             +  + +            K  A   L  +  + PD  ++      GD + G        
Sbjct: 723  AKAAKFEIPYNTKYKRLKRKRRATEKLSAQSAADPDNNEETLLYCLGDVLTGPEDVRKWE 782

Query: 180  ------DMRSSM-ESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAA 232
                  D+  +M +    WIR D + E+  E+  N    M+++QL++D DVVAQ  ++  
Sbjct: 783  FAEWEPDIEKAMDQESYEWIRMDADFEWACEMKTNLQSYMYVSQLQQDRDVVAQQDSMLY 842

Query: 233  LEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRR 292
            L   P     V   L   L D + F  +R  AA  L   A+++   AG+ HL++ ++   
Sbjct: 843  LRKSPPHPL-VSTFLTRTLVDRRYFHGIRTMAAETLHQHATDKVGMAGMTHLLRAFRELF 901

Query: 293  FDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNP 352
             DE  G PR NDF D  +Y V  A+P A+A +R +  K P  A   +L+ L  N N+ NP
Sbjct: 902  CDEK-GEPRHNDFTDKKQYIVQCALPMAIAKIRDSTGKCPEPAQRLLLKELATNKNDNNP 960

Query: 353  YSDVFWLAALVQSVG---------------ELEFGQQSILFLSSLLKRIDRLLQFDRLMP 397
            YS  F+ + ++ ++                +L+        L  +L  IDR  + D    
Sbjct: 961  YSADFYNSKILDALATSLVPEKRRDSGDQMDLDTSVDERAVLVEVLAEIDRHRRMDEWNS 1020

Query: 398  SYNGILTIS---CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            SY  I T +   C R L +  +     I+  Q ++         T   VR++A   L+DL
Sbjct: 1021 SYQNIYTTTALECWRKLMKAGVIPKSPINFWQYLQ-------DGTYDLVRIKAFECLVDL 1073

Query: 455  EFHCNGIDSALSLFIKSVEEEPSLRGQV 482
             F        L L + S +  P +R +V
Sbjct: 1074 GFMMEMPVFRLLLSVISTDRSPFVRHKV 1101


>gi|402079196|gb|EJT74461.1| TATA-binding protein associated factor Taf2 [Gaeumannomyces graminis
            var. tritici R3-111a-1]
          Length = 1592

 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 132/532 (24%), Positives = 237/532 (44%), Gaps = 84/532 (15%)

Query: 2    LEKQMGSNFFRKILQNIISRAQG-ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L K+   N  R+++  II++++  A   R +++  FR    KV +     L+ F+ +WV 
Sbjct: 528  LTKRATHNGLRRVIYKIITKSRSYAEESRIVNSDSFRKACEKVAH-NSSSLEPFWQQWVH 586

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSN-----------TDSENRD 109
            + GC    M   YNK+ + V+L V +       S  P+                 S+  +
Sbjct: 587  SSGCARFEMSQRYNKKHSRVDLTVRQPEDKGSISPLPLQKETFWGALQDRLHVPASKKEN 646

Query: 110  GDIGWPGMMSIRVHELDGM-YDH--PILPMAGDAWQLLEIQCHSKLAARRALKPKK---- 162
              + + G ++I +HE +G  Y+H   I       ++ +E+  ++K    + +K +K    
Sbjct: 647  VSVWFTGPITIGIHEANGTPYEHVAEIRRTGPKQFKTIEMPYNTKYKRLKRVKRQKERAA 706

Query: 163  --GSKPDGCDDNGDAVAGLDMRSSMESP-----------------------LSWIRADPE 197
               +  +G  DN +      +   +++P                         W+R D +
Sbjct: 707  ATAAHAEGGGDNAEEAVLYSLGDVLQTPEDVELWQFADWTPEQEKSMEHDSFEWLRLDAD 766

Query: 198  MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
             E++  +  N    M+++QL++D DVVAQ  +I  +         + + L   L D++ F
Sbjct: 767  FEWICSMKTNMQPWMYVSQLQQDRDVVAQQDSILYMSQKAGGEI-LASVLGRTLMDTRYF 825

Query: 258  WRVRIEAAYAL-------------ANT----ASEETDWAGLLHLVKFYKSRRFDENIGLP 300
              +R +A  AL             A+T     S++TD  GL  L+  +KS     +   P
Sbjct: 826  HGIRTQAVKALQRQTWTKEVETYNADTRVTQKSKKTDLGGLRILMMAFKSFFCYPDTQTP 885

Query: 301  RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
            RPNDF +  +YFV +AIP A+A VR    + P EA  F+L+ L++N+N  N YSD F+++
Sbjct: 886  RPNDFSNKMQYFVQQAIPGAIARVRDK-GRCPPEAQRFLLEQLRFNNNAENEYSDHFYVS 944

Query: 361  ALVQS-----VGELEFG-----------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT 404
            +L+ +     V + E             Q+   FL   L  I+R L+ D    SY  I T
Sbjct: 945  SLLDALATSLVPDTESPPLPLRANDPQIQEDKAFLEEALMEIERYLRMDEWTNSYQNIWT 1004

Query: 405  ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
             + +    +  L  +G I +   +  ++  +D  T+  ++++A  AL+DL F
Sbjct: 1005 TTALGCKQK--LMKAGVIPI-SAIDFVQYLQD-GTLDLIQIKAFEALVDLGF 1052


>gi|322693301|gb|EFY85166.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Metarhizium acridum CQMa 102]
          Length = 1457

 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 142/519 (27%), Positives = 233/519 (44%), Gaps = 78/519 (15%)

Query: 1    MLEKQM----GSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEF 54
            +L+K+M    G +   +ILQ  +++AQ  S  R+  L T++FR    K G+  R  L+ F
Sbjct: 542  ILDKRMIKASGGHGLTRILQKFLTKAQIESSDRSTILDTEKFRAVCEK-GSRYR--LEAF 598

Query: 55   FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSN----TDSENRDG 110
            + +WV   GCP   +   +NK++  VEL  L     +   RT  L  +       E R G
Sbjct: 599  WNQWVYGSGCPRFEVKAKFNKKRLCVEL-TLNQIQHQEAKRTGGLEKDDFLRVVKERRAG 657

Query: 111  -DIG-----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRAL 158
              +G     + G M++R+HE DG  Y+H IL +  DA +  + +      + +L   R +
Sbjct: 658  LKLGEVQPLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRM 716

Query: 159  KPKKGSK-----PDGCDDN-----GDAVAGLD--------------MRSSMESPLSWIRA 194
            K K          +  DD      GD +   D               R   +    WIR 
Sbjct: 717  KEKHSGGGSMEITENADDALLYCLGDVLQTQDDLRDWELIEWDPETERKMDQESYEWIRV 776

Query: 195  DPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDS 254
            D + E+  ++       M+++QL++D DVVAQ  A+  L   P L       L   L D 
Sbjct: 777  DADFEWACDMKRTLEPYMYVSQLQQDRDVVAQQDAMLYLTHGP-LHPIASGFLVRTLIDR 835

Query: 255  KAFWRVRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFV 313
            + F  +R  AA AL   ++ +     GL  L++ ++     +    P PNDF D  +Y V
Sbjct: 836  RYFHGIRTMAAEALPRQSNIKGIPLLGLRQLMRAFREMFCFQQTNQPFPNDFTDKKQYNV 895

Query: 314  LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF-- 371
              A+  A++ VR    + P EA +F+L  L +N+N  NPYSD F++A LV+++       
Sbjct: 896  RRAVIRAISEVRDHTYRCPLEARQFILDQLLFNNNEDNPYSDHFYIATLVEALATSLIPS 955

Query: 372  -------------GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI---RTLTQIA 415
                          ++  LFL   +++I+R+L+ D    SY  + TI+ +   + L +  
Sbjct: 956  KQDDWLSRQSKAPDEEERLFLEKAMEQIERVLRRDEWTNSYQNVWTIAGLDAKQRLMKAE 1015

Query: 416  LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +    +    Q   L+   RD      +R++A  AL+DL
Sbjct: 1016 VIPKSYPDFAQY--LLDGSRDL-----IRIKAFEALIDL 1047


>gi|296421671|ref|XP_002840388.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295636603|emb|CAZ84579.1| unnamed protein product [Tuber melanosporum]
          Length = 1269

 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 128/502 (25%), Positives = 228/502 (45%), Gaps = 65/502 (12%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR-DCT 89
            +ST+ F     K  + +   L+ FF +WV   G P   +   +NK++ IVE+ + +   +
Sbjct: 537  MSTQHFTRLCEKNSHTK---LESFFQQWVYGSGYPRFEVSQRFNKKRMIVEMGIRQVQSS 593

Query: 90   VKPDSRTPVLSSNTDSENRDGDIG-------WPGMMSIRVHELDGM-YDHPILPMAGDAW 141
              P +R    +   D+ N +  I        + G M+IR+HE DG  Y+H +    G  +
Sbjct: 594  ETPAARVTAENFMQDAMNHERHIVVGPTRPVYTGPMTIRIHEADGTPYEHVVDIKEG--Y 651

Query: 142  QLLEIQCHSKL-----AARRALKPKKGSKPDGCDDN----------GDAVAGLDM----- 181
              L+I  ++K      + R+  +   G+  D   D           GD +   D      
Sbjct: 652  TKLDIPYNTKYKRLKRSRRQKERAAAGAGVDVSIDAQNDDVLLYCLGDVLQAEDEVVDWK 711

Query: 182  ----------RSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIA 231
                      + + ES   WIR D + E++  +  NQP  M+++QL++D DV+AQ +A+ 
Sbjct: 712  MEDWSKEEEDKMAQES-FEWIRMDADFEWICTLKVNQPDYMFLSQLQQDRDVIAQYEALQ 770

Query: 232  ALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR 291
               ++   +  + +     L D + +W +R+EAA  LA  A+ E +W G  HL+K ++  
Sbjct: 771  YFSSVKESAL-ISSIFVRTLMDRRYYWAIRVEAAMGLAKCATAELNWIGQFHLMKAFQVF 829

Query: 292  RFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGN 351
                   +P+ NDF D + Y+V +AIP AV+ +R      P  A  ++L +L++NDNN N
Sbjct: 830  FCFPESSVPKSNDFADATAYYVQKAIPTAVSRIRNVSGACPTSAQRWILDVLRWNDNNNN 889

Query: 352  PYSDVFWLAALVQSVGE----LEF---------GQQSILFLSSLLKRIDRLLQFDRLMPS 398
             Y D  ++A L++++       +F          +++I   +  +  I+R L+ D+    
Sbjct: 890  NYDDCLYIATLMKALANTITITKFDLNNDSGYEDEEAIQTQNESINEIERYLRIDKWQSV 949

Query: 399  YNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL-EFH 457
            Y  +L+ + +  L      LSG    D   K    +    T+  +R +    L++L   H
Sbjct: 950  YQNVLSETAVEILKDWI--LSGIKDKDS--KFFLSYTREGTLDSLRAKCFECLVELGAMH 1005

Query: 458  CNGIDSALSLFIKSVEEEPSLR 479
              G    L+  + S +  P LR
Sbjct: 1006 RPGFVKYLAKVMSS-DPSPYLR 1026


>gi|358387804|gb|EHK25398.1| hypothetical protein TRIVIDRAFT_177367, partial [Trichoderma virens
            Gv29-8]
          Length = 1429

 Score =  132 bits (332), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 127/463 (27%), Positives = 214/463 (46%), Gaps = 61/463 (13%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ--GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            L K  G +   +ILQ  ++RAQ  G+     L +++FR    K        L+ F+ +WV
Sbjct: 549  LSKASGGHGLTRILQKHLTRAQIEGSDKFLVLDSEKFRATCEKAAKYR---LESFWNQWV 605

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLR---DCTVKPD-SRTPVLSSNTD--SENRDGDIG 113
               GCP   +   +NK++  VEL +++     + K + SR   L +  +  ++ + G++ 
Sbjct: 606  YGSGCPRFDVKAKFNKKRLCVELMLIQVQHQTSKKTELSRDDFLRAIKERRADLKTGEVQ 665

Query: 114  --WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKKGSK 165
              + G M++R+HE DG  Y+H IL +  DA +  + +      + +L   R +K K  + 
Sbjct: 666  PLFTGPMTVRIHEADGTPYEH-ILEIREDATRSTKFEIPYNTKYKRLKRTRRMKEKHNAG 724

Query: 166  -----PDGCDDN-----GDAVAGLD--------------MRSSMESPLSWIRADPEMEYL 201
                 P+  DD      GD +   +               R   +    WIR D + E+ 
Sbjct: 725  ASMEIPENTDDALLYCLGDVLQSPEDLRDWELIEWDPETERKMDQESYEWIRVDADFEWA 784

Query: 202  AEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVR 261
              ++      M+++QL++D DVVAQ  A+  L   P L       L   L D + F  +R
Sbjct: 785  CSMNRTLEPYMYVSQLQQDRDVVAQQDAMLYLSTGP-LHPIASGFLVRTLVDRRYFHGIR 843

Query: 262  IEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHA 320
              AA AL   A+ ++    GL  L++ ++     +    P  NDF D  +Y V  +I  A
Sbjct: 844  TMAAQALTRQANIKDIPMLGLRQLMRAFRDMFCYDQTNQPLQNDFSDKRQYNVRCSIIKA 903

Query: 321  VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV--------GELEFG 372
            ++ VR   N  P EA +F+L  L +N+N  NPYSD  ++A LV+S+         +  F 
Sbjct: 904  ISEVRDGTNYCPFEARQFILDQLLFNNNENNPYSDHHYIAMLVESLATSLIPAQADEWFK 963

Query: 373  QQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
            +Q  +       FL   +++I+R+L+ D    SY  I TI+ +
Sbjct: 964  RQVKVPTDEERQFLDEAMEQIERVLRRDEWTNSYQNIWTIAGL 1006


>gi|149236281|ref|XP_001524018.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146452394|gb|EDK46650.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 1152

 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 118/432 (27%), Positives = 193/432 (44%), Gaps = 53/432 (12%)

Query: 24  GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELA 83
           G  P  TLS   F+H   KV    R  L  FF +W    G P   +   +NK+K ++E+ 
Sbjct: 162 GELPNGTLSCDHFQHVCEKVN---RNKLDAFFKQWAFGAGAPSFVITQRFNKKKGMLEMT 218

Query: 84  VLRDCTVKPDSRTPVLSSNTDSENRDGDIG----------WPGMMSIRVHELDGM-YDHP 132
           + +    + D R   L  +T   +   ++           + G M+IRVHE DG  Y+H 
Sbjct: 219 IRQ--VQQHDRRERKLDKDTYMNDALFELNDEPLLQRQNLFTGPMTIRVHESDGTPYEHI 276

Query: 133 ILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDN------GDAVAG-------- 178
           I     +    +++Q + K+   +          DG D        GDAV          
Sbjct: 277 I--HIKEVKTTIDLQLNKKVRKIKKKD-------DGVDPGIPFNQFGDAVPEEAKKLFAL 327

Query: 179 -----LDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
                 D  S    P  WIR D + E++A I   QP  M+ +QL  D DV AQ  AI   
Sbjct: 328 QDWERRDEESLATLPFEWIRGDVDFEWIANIEVRQPDYMFGSQLIYDRDVEAQYDAIQYF 387

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTAS-EETDWAGLLHLVKFYKSRR 292
            ++  L+      L   L D + F+ VRI AA ALA+ AS ++T++ G+ +L++ +K   
Sbjct: 388 ASVERLNTVYCTALVRTLFDKRYFYGVRIAAAQALASPASNQQTNYLGIKYLIEAFKYLH 447

Query: 293 FDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNP 352
                 +P+ NDF D   + +  AIP  ++ V  ++ K P    + +  L+KYNDN+ N 
Sbjct: 448 CFPGSSIPKSNDFADIQAFLIQTAIPDTLSRVTDSEGKVPTIVRDLLYDLVKYNDNSMNE 507

Query: 353 YSDVFWLAALVQSVGELEFG--------QQSILFLSSLLKRIDRLLQFDRLMPSYNGILT 404
           + D  +++ L+ S+              ++   F + +   + RL + D   PS+N IL 
Sbjct: 508 FQDSQYVSKLIFSLTRSAISGWRSDMVDEKDKQFANKVFLEVSRLQKLDEWRPSHNRILQ 567

Query: 405 ISCIRTLTQIAL 416
           ++C +   ++ L
Sbjct: 568 VACFKAKMKMVL 579


>gi|358390390|gb|EHK39796.1| hypothetical protein TRIATDRAFT_296783 [Trichoderma atroviride IMI
            206040]
          Length = 1512

 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 128/463 (27%), Positives = 209/463 (45%), Gaps = 65/463 (14%)

Query: 2    LEKQMGSNFFRKILQNIISRAQGASPVR--TLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            L K  G +   +ILQ  ++RAQ  S  +   L +++FR    K        L+ F+ +WV
Sbjct: 549  LSKASGGHGLTRILQKQLTRAQIESSDKFLILDSEKFRATCEKAAKYR---LESFWNQWV 605

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSN-----TDSENRDGDIG- 113
               GCP   +   +NK++  VEL +++  T +   +   LS +           D  IG 
Sbjct: 606  YGSGCPRFDVKAKFNKKRLCVELMLIQ--TQQQAIKKTELSKDEFFRAIKERRADLKIGE 663

Query: 114  ----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQC-----HSKLAARRALKPKKG 163
                + G M+IR+HE DG  Y+H IL +  DA +  + +      + +L   R +K K  
Sbjct: 664  AQPLFTGPMTIRIHEADGTPYEH-ILEIREDAARSTKFEIPYNTKYKRLKRTRRMKEKHN 722

Query: 164  SK-----PDGCDDN-----GDAVAGLD--------------MRSSMESPLSWIRADPEME 199
            +      P+  DD      GD +   +               R   +    WIR D + E
Sbjct: 723  AGASMEIPENTDDALLYCLGDVLQSPEDLRDWELIEWDPETERKMDQESYEWIRVDADFE 782

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
            +   ++      M+++QL++D DVVAQ  A+  L   P L       L   L D + F  
Sbjct: 783  WACSMNRTLEPYMYVSQLQQDRDVVAQQDAMLYLSTGP-LHPIASGFLVRTLVDRRYFHG 841

Query: 260  VRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
            +R  AA AL   A+ ++    GL  L++ ++     +    P  NDF D  +Y V  +I 
Sbjct: 842  IRTMAAKALTRQANIKDIPMLGLRQLMRAFRDMFCYDQTNQPLQNDFSDKRQYNVRCSII 901

Query: 319  HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV--------GELE 370
             A++ VR   N  P EA +F+L  L +N+N  NPYSD +++A LV+++         +  
Sbjct: 902  RAISEVRDGTNYCPFEARQFILDQLLFNNNENNPYSDHYYIALLVEALATSLIPAQADEW 961

Query: 371  FGQQSIL-------FLSSLLKRIDRLLQFDRLMPSYNGILTIS 406
            F +Q  +       FL   +++I+R+L+ D    SY  I TI+
Sbjct: 962  FKRQVKIPTEEERRFLEEAMEQIERVLRRDDWTHSYQNIWTIA 1004


>gi|195169174|ref|XP_002025400.1| GL11884 [Drosophila persimilis]
 gi|194108868|gb|EDW30911.1| GL11884 [Drosophila persimilis]
          Length = 1217

 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 114/440 (25%), Positives = 202/440 (45%), Gaps = 54/440 (12%)

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  + V T G     +   +N+++N +EL + +D                   N  G   
Sbjct: 521 FMDQSVRTGGHAKFSLSSVFNRKRNTIELEIRQDFV-----------------NHRGVRK 563

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + +++ ELDG + H +     +     +I CHSK    +  K    +        G
Sbjct: 564 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLST--------G 613

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI AL
Sbjct: 614 EEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 672

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
           +       ++ +     L    A  +  ++AA++L   A++    W+G   ++  +  R+
Sbjct: 673 QQARQTPRDLPSPTP--LRAIAASIKCGVQAAHSLTKVANQMVASWSGPPAMLNIF--RK 728

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
           F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+N+
Sbjct: 729 FFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 788

Query: 350 GNPYSDVFWLAALVQSVGE---------LEFGQQSILFLSS----LLKRIDRLLQFDRLM 396
            N Y+D ++ AALV+++GE         +   Q +   LS+    +L  + RLL  ++ +
Sbjct: 789 RNHYTDAYYRAALVEALGETLTPVVSVSMHGTQITTDSLSTDAKLVLDEMTRLLNMEKSL 848

Query: 397 PSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEF 456
           PSY  ++++SC++ +     KL  F  L  +      +  +     +R+ A   L+D   
Sbjct: 849 PSYKYVVSVSCLKVIR----KLQKFGHLPSLPNFYHRYAAYGVFLDLRIAAMECLVDF-V 903

Query: 457 HCNGIDSALSLFIKSVEEEP 476
             +G    L   I  +E +P
Sbjct: 904 KVDGRSEDLDHLINLLETDP 923


>gi|395512337|ref|XP_003760397.1| PREDICTED: transcription initiation factor TFIID subunit 2
           [Sarcophilus harrisii]
          Length = 1071

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 108/379 (28%), Positives = 180/379 (47%), Gaps = 52/379 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 607

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++
Sbjct: 608 -NGEEV-DMDLSAMDADSPLLWIRIDPDMSVLRKVEFEQSDFMWQYQLRYERDVVAQEES 665

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 666 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMLACFCLAKIANSMVSTWTGPPAMKSLF 723

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYN
Sbjct: 724 -TRMFCCKTCPNIVKTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYN 782

Query: 347 DNNGNPYSD--VFWLAALV 363
           DN  N      +FW AA+ 
Sbjct: 783 DNRKNKQQSLGIFWTAAVT 801


>gi|440475330|gb|ELQ44013.1| TATA-binding protein associated factor Taf2 [Magnaporthe oryzae Y34]
 gi|440486210|gb|ELQ66100.1| TATA-binding protein associated factor Taf2 [Magnaporthe oryzae P131]
          Length = 1579

 Score =  131 bits (329), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 135/536 (25%), Positives = 241/536 (44%), Gaps = 87/536 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRAQG-ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            + K+   N  R+++  ++SR+   AS VR +S++ FR    KV       L+ F+ +WV 
Sbjct: 519  MTKKASHNGVRRVVLRLVSRSNTEASDVRFISSESFRKQCEKVHKGAP--LELFWQQWVQ 576

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT------------DSENR 108
            + GC    M   YNK+ + V+L V R    K     P  + N              ++  
Sbjct: 577  SAGCARFDMSQRYNKKHSRVDLTV-RQPEEKSQLPQPPFNKNNFWGVLQDGRHVPAAKKE 635

Query: 109  DGDIGWPGMMSIRVHELDGM-YDH--PILPMAGDAWQLLEIQCHSKLAA-RRALKPKKGS 164
            +  + + G ++I +HE +G  Y+H   I       ++ +E+  ++K    +R  + K+ +
Sbjct: 636  NVSVWFTGPITIGIHEANGTPYEHVAEIQRTGPRQFKAIEMPYNTKYKRLKRTRRQKERA 695

Query: 165  KP-----DGCDDNGDAVA----------------GL------DMRSSMESPLSWIRADPE 197
            +      +G +   +AV                 GL       +++  +    W+R D +
Sbjct: 696  QALVQAGEGGEQAEEAVLYSLGDVLQSPEEEKEWGLADWTPEQVKAMEQDSFEWLRIDAD 755

Query: 198  MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
             E++  +  N    M+++QL++D DVVAQ +++  L         +   L   L D++ +
Sbjct: 756  FEWICSMKTNMQPWMYVSQLQQDRDVVAQQESVLWLSQKAGGEI-LATVLGRTLMDTRYY 814

Query: 258  WRVRIEAAYALAN---TASEE--------------TDWAGLLHLVKFYKSRRFDENIGLP 300
              +R +A  AL     T  EE              TD  GL  L+K +K+     +   P
Sbjct: 815  HGIRTQAIKALQKQVWTKEEESLDPDSGIVNKLKKTDAGGLRILMKAFKAFFCYPDSQTP 874

Query: 301  RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
            RPNDF +  +Y V  AI  A+A VR    + PREA +F+L+ L++N+N GN YSD F++A
Sbjct: 875  RPNDFSNKMQYAVQRAISEAIARVREK-GRCPREAQQFLLEQLQFNNNTGNEYSDHFYVA 933

Query: 361  ALVQSVGELEFGQQSI-----------------LFLSSLLKRIDRLLQFDRLMPSYNGIL 403
             L+ ++        S+                  FL + L+ I+R L+ D    SY  I 
Sbjct: 934  GLLDALATSLIPDNSVETVTLRNSDDPETALDEEFLEAALQMIERYLRMDEWTNSYQNIW 993

Query: 404  TISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCN 459
            T++ +    +  L  +G I +   +  ++  +D +T+  + ++A  AL+DL F  N
Sbjct: 994  TVTALGCKQK--LMKAGVIPIS-AIDFVRYLQD-STLDLISIKAFEALVDLGFLFN 1045


>gi|242799450|ref|XP_002483381.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Talaromyces stipitatus ATCC 10500]
 gi|218716726|gb|EED16147.1| transcription initiation factor TFIID subunit TSM1/127kD, putative
            [Talaromyces stipitatus ATCC 10500]
          Length = 1286

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 132/499 (26%), Positives = 211/499 (42%), Gaps = 98/499 (19%)

Query: 30   TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
            ++ T+ F     + G+ +   L+ FF +WV   GCP       +NK+K +VE+ +     
Sbjct: 546  SVYTQGFIKLCERFGHCK---LEVFFNQWVFGAGCPRFTATQRFNKKKLVVEIMI----- 597

Query: 90   VKPDSRTPVLSSNTDSENRDGDIG--------------WPGMMSIRVHELDGM-YDHPIL 134
             K     P    N +  +   D+               + G M+IR+HE DG  Y+H + 
Sbjct: 598  -KQSQGEPQPPKNLEKSSFLRDMKEEIRSIYAAPVQPIFTGSMTIRIHEADGTPYEHIV- 655

Query: 135  PMAGDAWQLLEIQCHSKLA---------ARRALKPKKGSKPDGCDDN-----GDAV-AGL 179
                +A    E+  ++K            R  L     S  +  DD      GD +    
Sbjct: 656  -EIKEAVTKFEVPYNTKYKRLKRNKRQKERVTLNAGADSTAESQDDVLLYCLGDVLQTEE 714

Query: 180  DM--------------RSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVA 225
            DM              R   ES   WIR D + E++ ++    P  M+++QL++D DVVA
Sbjct: 715  DMQEWRLSDWSKEEEERMGQES-YEWIRMDADFEWICKMSLVMPGYMYLSQLQQDRDVVA 773

Query: 226  Q---AQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLL 282
            Q    Q +A  +  P +S   V TL     D + F  +R  AAYAL   A EE DW GL 
Sbjct: 774  QLESMQYMAIQKPHPLISTIFVRTL----MDKRYFHGIRTAAAYALVKHAKEELDWLGLY 829

Query: 283  HLVKFYKSRRFDENIGLP-----RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVE 337
            HL      R F E   LP     R NDF D + Y + + IP A++ +R  + K+P    +
Sbjct: 830  HL-----ERAFQELFCLPGTHVTRDNDFSDHASYILQKVIPEAISRIRDNNGKTPLRVKQ 884

Query: 338  FVLQLLKYNDNNGNPYSDVFWLAALVQSV--------------GELEFGQQSIL------ 377
            F+   L+++DN+ N YSD F++A+L+ S+               ++EF     L      
Sbjct: 885  FLFDKLRFSDNSNNEYSDNFYIASLMASLVNALEGRIAETPSQDDMEFDMDKELERQAED 944

Query: 378  -FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRD 436
                 ++  IDR  + D    S+  I +    RT     L+L+    +D  +     +  
Sbjct: 945  KLEQDVIAEIDRYRRMDEWSSSFQNIYS----RTALHCQLQLTKARIMDLDLIHFLQYTR 1000

Query: 437  FNTIWQVRVEASRALLDLE 455
              T   +R++A   L++L+
Sbjct: 1001 AGTFDMLRIDAFECLVELD 1019


>gi|378754734|gb|EHY64763.1| hypothetical protein NERG_02166 [Nematocida sp. 1 ERTm2]
          Length = 867

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 117/500 (23%), Positives = 215/500 (43%), Gaps = 72/500 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           +LE  +   F +KI++ ++         RT++T++       +   +   ++  F  +V 
Sbjct: 356 ILENNLTRAFMQKIMKEVLE-------ARTMNTQDLIRVIKGITGKD---VRALFDAYVY 405

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
             G P +      N R     L               VL     S + D +    G + +
Sbjct: 406 RTGIPTVTAQVEQNSRAGGFSL---------------VLRQKIHSVHPDANRHMAGNICV 450

Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
           RV+E D + DH +    G A    E+ C S+   RR  K +  S                
Sbjct: 451 RVYETDSVLDHILF--LGSAPITHELVC-SQRNHRRKQKEENVS---------------- 491

Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
                   L W+R DP +E++      Q   M+  QL  + DV  Q +A+  ++  P  S
Sbjct: 492 --------LLWVRIDPGIEWMKVAAVEQADYMFAEQLVSEKDVYGQMEALVGVQKNP--S 541

Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
             + + L   + D   F++V I A   LA + +EE+ + G   +V+F+ +    +N  + 
Sbjct: 542 ETICSILERVMGDPGVFYKVSIAAGILLAKSVNEESGYFGFQRVVQFFINTYCIQNTTIV 601

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKS-------PREAVE-FVLQLLKYNDNNGNP 352
           R NDF     YF+ + I  ++++ +    KS        +  V  F+L L++YNDN GN 
Sbjct: 602 RTNDFSQHRMYFMQKNIASSMSLCQLDSTKSLGGRAIRAKNVVSAFLLNLMRYNDNTGNS 661

Query: 353 YSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLT 412
           + D F+LA ++ ++  +     + L  +  +  ++RL Q D L PS+  I+T + I+   
Sbjct: 662 FEDAFYLADIITALA-VALCSDAHLDPAPFIVEMERLRQKDLLFPSHQNIVTCAAIKAFA 720

Query: 413 QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSV 472
           ++ ++    +S   +++ + P       ++VR+ A   L+ L  H + I + L LF   V
Sbjct: 721 RLGVQGYVQVSFKGMLQYVAP----GNFYKVRMAAYECLILL--HADKIGTILDLF---V 771

Query: 473 EEEPSLRGQVKLGIHAMRIC 492
            EE  +R ++   I    +C
Sbjct: 772 NEERPVRIKILRTIRDSAVC 791


>gi|290993096|ref|XP_002679169.1| predicted protein [Naegleria gruberi]
 gi|284092785|gb|EFC46425.1| predicted protein [Naegleria gruberi]
          Length = 1957

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 119/470 (25%), Positives = 217/470 (46%), Gaps = 66/470 (14%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFR-------HFANKVGNLERPFLKE 53
           M+++Q+ ++  R +L   +  A       TL +K +         F   +G      LKE
Sbjct: 564 MIDRQISTSNMRNLLNKFLKNAH------TLDSKRYNDRLISSSSFIKTIGKTFATDLKE 617

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAV---LRDCTVKPDSRTPVLSSNTDSENRDG 110
           F   W+   G P   + F Y+  ++ V+L V   L    ++P                  
Sbjct: 618 FSSYWIYGTGIPQFAISFKYDDMRSCVDLKVKQILPFPEIQP------------------ 659

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKG-SKPDGC 169
              + G +++R+ E +G  +  I  +  +  Q  E   +SK   +   K + G  +P   
Sbjct: 660 ---FRGKVTVRILESEGAPNDFIANLDKEENQF-EFHINSKPIKKHHKKVQGGQGQP--- 712

Query: 170 DDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
               +AV  +        PL WIR DPE +++    F+Q  +MW  QL    DVVAQ ++
Sbjct: 713 ---TEAVRVI-------IPLKWIRFDPEQDWIKVQSFSQTQEMWNYQLNDVKDVVAQYES 762

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYK 289
           I AL         V+  LN  L D+  ++++R+ AA+AL+           L  L+ F+K
Sbjct: 763 IKALTFFADTE-PVITILNRILRDNSYYYQIRVCAAHALSKIPQPIEKKEALNVLIGFFK 821

Query: 290 SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAV----EFVLQLLKY 345
           ++ +DE +   +PN+F++F++YF+ + IP  +A+    D+K   E        +L+L+KY
Sbjct: 822 TQFYDEQLTYLKPNNFKNFADYFLKKEIP--LALTNFKDDKVNNETFPEVVLLLLELIKY 879

Query: 346 NDNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDR-LMPSYNGILT 404
           ND++ N +S  F++A L++S+  L     +  F   + K + + L  D  L+PSY+ I T
Sbjct: 880 NDDSENLFSGNFYIANLIKSISLLNTSNPN--FNQKIEKILYKYLVIDSDLIPSYHYIKT 937

Query: 405 ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
           ++ + ++  +       I+LD   +    + DF     VR  + R ++++
Sbjct: 938 VTSLESIANLQSSGKSKINLDVFYR----YLDFKKSDHVRFSSWRCVMNI 983


>gi|320581027|gb|EFW95249.1| transcription initiation factor TFIID subunit [Ogataea parapolymorpha
            DL-1]
          Length = 1543

 Score =  129 bits (325), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 144/553 (26%), Positives = 240/553 (43%), Gaps = 119/553 (21%)

Query: 31   LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR---- 86
            LST  F H   KV + +   L+ FF  W+ +CG P+ ++   +NK++  +E+++ +    
Sbjct: 623  LSTAHFHHVCEKVAHHK---LESFFQNWIHSCGVPIFQVTQRFNKKRMFIEMSIRQIQKN 679

Query: 87   -----------DCTVKPDSRTP------VLSSN---TDSENRDGDIGWPGMMSIRVHELD 126
                       D   K   R        V  +N   ++ E       + G ++IR+HE D
Sbjct: 680  NKLETNDELQMDINSKETLRKQHQREHFVDEANKFLSEEETFSAPNVFTGPITIRIHEAD 739

Query: 127  GM-YDHPILPMAGDAWQLLEIQCHSKL--AARRA----------------------LKPK 161
            G  Y+H +     D    L+IQ ++K   + RR                        K +
Sbjct: 740  GTPYEHIV--DIKDPVTKLDIQYNTKYRRSKRRKEEDQEEAERKEREREKRERDKLRKER 797

Query: 162  KGSKPDGCDDN------GDAVA--------GLDMRSSME--------SPLSWIRADPEME 199
            +  K  G DD       GD           G   +  +E            W+R D + E
Sbjct: 798  EKDKLRGDDDEPTVKKLGDVFMKPKDLEDWGFTEKQKIEENDYEILGDAFEWLRVDADFE 857

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWR 259
            ++ +++ N    M+ +QL +D DV AQ +++       H S      L   + D + F+ 
Sbjct: 858  WICKVYINLNENMFESQLRQDRDVEAQLESVKFFSESLHPSIYFATVLMRTILDKRYFYG 917

Query: 260  VRIEAAYALANTASEETDWAGLLHLVKFYK---------SRRF---DENIGLPRPNDFRD 307
            VR EAA  LA  + E+ D  G+  L+K YK         S+ +   D N  LP PNDF +
Sbjct: 918  VRAEAALGLAKLSKEDNDHLGMRFLLKAYKYFYCYNNTISKGYPELDPNEYLPLPNDFSE 977

Query: 308  FSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
            FS+YFV++AI   ++ V   +  SP E  + +L +LK+NDNN N + D F+LA  +  + 
Sbjct: 978  FSDYFVMKAIVTGLSTVTNKNGDSPIELKKIMLNILKFNDNNNNYFDDSFYLANFITCLT 1037

Query: 368  EL----------------EFGQQSIL-----FLSSLLKRIDRLLQFDRLMPSYNGILTIS 406
             L                E  Q+++      FL   +  I+R  + D+  PSY+ I+T++
Sbjct: 1038 NLIVSSNKKIPNHNESHVEMDQRNLTDPTEKFLIESIVEINRAAKMDQWSPSYHHIVTVT 1097

Query: 407  CIRTLTQIALKLSGFISLDQV--VKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSA 464
             ++   ++ L L G + L  +  +   +PF D +    +R+ A  +LL L    N     
Sbjct: 1098 MLQ--EKVRLALLGLVDLSYIDLIPYTRPFYDSD----IRLRAFESLLLLGGLRNS--HV 1149

Query: 465  LSLFIKSVEEEPS 477
            LSLF  +++ E S
Sbjct: 1150 LSLFFTTLKLETS 1162


>gi|388580227|gb|EIM20543.1| hypothetical protein WALSEDRAFT_60853 [Wallemia sebi CBS 633.66]
          Length = 1762

 Score =  129 bits (323), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 83/265 (31%), Positives = 139/265 (52%), Gaps = 40/265 (15%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR D + E+++ I F QP  MW++QL++D DVVAQ +A+ AL+++P  S  V   L   
Sbjct: 709 WIRVDADFEWISMITFEQPDFMWVSQLQRDRDVVAQLEAVHALKSIP--SKLVSTQLTRT 766

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR-------RFDENIG---LP 300
           +  +  F+R+R+EAA  L   A+ E D+ G+ HL K ++SR          +  G   +P
Sbjct: 767 VLVTNYFFRIRMEAAMVLHTCATPELDYIGMFHLFKIFQSRYCYPPPQEIMDPFGFKCIP 826

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
             N+F D SEYFV +A+  ++  +R   ++S     EF++  L+YNDN+ N YSD ++L+
Sbjct: 827 LANNFDDLSEYFVRKAVIVSLTSIRDQSSRSMLNIREFLVDQLRYNDNSSNRYSDSYYLS 886

Query: 361 --------ALVQSV---------------GELEFGQQSILFLSSLLKRIDRLLQFDRLMP 397
                   ALV +V                +  FG          ++ ++R    DRL+P
Sbjct: 887 GIISCLAIALVPTVQHEASVLQQDEDPDANDHPFGPH---VRRQAIEEVERYRTMDRLVP 943

Query: 398 SYNGILTISCIRTLTQIALKLSGFI 422
           SY  +++++ +     + L ++G I
Sbjct: 944 SYQNVISVATLE--FNVKLMIAGII 966



 Score = 47.8 bits (112), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 33/104 (31%), Positives = 54/104 (51%), Gaps = 12/104 (11%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           +ST  F     KV  ++   L++F  +W+   G P   +G ++N++K  VE+ V ++   
Sbjct: 524 ISTNGFLRTCRKVAGVD---LRQFADQWIFGSGTPRFTIGANFNRKKMCVEMTVKQESPA 580

Query: 91  KPDSRTPVLSSNTDS--ENRDGDIGWPGMMSIRVHELDGM-YDH 131
             D+ TP    NT     NR   I + G M++R+HE DG  Y+H
Sbjct: 581 Y-DAHTP----NTPEMHYNRPTQI-FEGQMTVRIHEADGTPYEH 618


>gi|198465660|ref|XP_002135019.1| GA23812 [Drosophila pseudoobscura pseudoobscura]
 gi|198150260|gb|EDY73646.1| GA23812 [Drosophila pseudoobscura pseudoobscura]
          Length = 1153

 Score =  129 bits (323), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 92/319 (28%), Positives = 159/319 (49%), Gaps = 36/319 (11%)

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  + V T G     +   +N+++N +EL + +D                   N  G   
Sbjct: 521 FMDQSVRTGGHAKFSLSSVFNRKRNTIELEIRQDFV-----------------NHRGVRK 563

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + +++ ELDG + H +     +     +I CHSK    +  K    +        G
Sbjct: 564 YNGPLLVQLQELDGTFKHTL--QIENTLVKADITCHSKSRRNKKKKIPLST--------G 613

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V  +D+ +  +SP+ WIR DPEM  L ++   QP   W  QL  + DV AQ QAI AL
Sbjct: 614 EEV-DMDLSAMDDSPVLWIRLDPEMILLRDLIIEQPDFQWQYQLRHERDVTAQFQAIQAL 672

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET-DWAGLLHLVKFYKSRR 292
           +  P  +  +   L + +   + F++VR++AA++L   A++    W+G   ++  +  R+
Sbjct: 673 QHYPTNATRLA--LTDTIESDRCFYKVRVQAAHSLTKVANQMVASWSGPPAMLNIF--RK 728

Query: 293 FDENIGLP---RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
           F  +   P   + N+F +F  YF+ +AIP A+A +R +    P E + F+  L KYN+N+
Sbjct: 729 FFGSFSAPHIIKLNNFSNFQLYFLQKAIPVAMAGLRTSHGICPPEVMRFLFDLFKYNENS 788

Query: 350 GNPYSDVFWLAALVQSVGE 368
            N Y+D ++ AALV+++GE
Sbjct: 789 RNHYTDAYYRAALVEALGE 807


>gi|444732446|gb|ELW72741.1| Transcription initiation factor TFIID subunit 2 [Tupaia chinensis]
          Length = 1407

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 109/396 (27%), Positives = 179/396 (45%), Gaps = 92/396 (23%)

Query: 70   GFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMY 129
             F++N+++N++EL + +D T      +P            G   + G + + V ELDG +
Sbjct: 737  SFAFNRKRNVLELEIKQDYT------SP------------GTQKYVGPLKVTVQELDGSF 778

Query: 130  DHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRS-SMESP 188
            +H  L +  ++ +  +I CHSK    +  K            NG+ V  +D+ +   +SP
Sbjct: 779  NH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM--------NGEEV-DMDLSAMDADSP 827

Query: 189  LSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLN 248
            L WIR DP+M  L ++ F Q   MW  QL  + DVVAQ ++I ALE  P  +  +   L 
Sbjct: 828  LLWIRIDPDMSVLRKVEFEQADFMWQYQLRYERDVVAQQESILALEKFPTPASRLA--LT 885

Query: 249  NFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDF 308
            + L   + F+RVR+ A + LA T                                     
Sbjct: 886  DILEQEQCFYRVRMSACFCLAKT------------------------------------- 908

Query: 309  SEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA----ALVQ 364
                    +P A+A++R   N  P+E + F+L L+KYNDN  N +SD ++ A    AL  
Sbjct: 909  --------MPVAMALLRDVHNLCPKEVLTFILDLIKYNDNRKNKFSDNYYRAEMIDALAN 960

Query: 365  SVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALK 417
            SV         +  L +L       L+ I R L  ++L+PSY   +T+SC+R +    L+
Sbjct: 961  SVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLLPSYRHTITVSCLRAIR--VLQ 1018

Query: 418  LSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
             +G +  D    L K + ++     +R+ A  A++D
Sbjct: 1019 KNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 1052


>gi|256274023|gb|EEU08938.1| Taf2p [Saccharomyces cerevisiae JAY291]
          Length = 1407

 Score =  125 bits (315), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 105/402 (26%), Positives = 190/402 (47%), Gaps = 58/402 (14%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 630  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686

Query: 72   SYNKRKNIVELAVLRDCTVKPDSRTPVLSS---------NTDSENRDGDIGWPGMMSIRV 122
             +N+++ ++EL + R    +      VL           + +  + +    + G M+IR+
Sbjct: 687  RFNRKRMVIELGI-RQVQDEELGHEKVLGEEGFFKSALDHLEHPDLNRTECFTGSMTIRI 745

Query: 123  HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGC 169
            HE DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   
Sbjct: 746  HEHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVV 801

Query: 170  DDN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPV 210
            D N  G+     +           R+S  + L        WIR D ++E++ ++H NQP 
Sbjct: 802  DVNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPD 861

Query: 211  QMWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEA 264
             M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A
Sbjct: 862  YMFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAA 919

Query: 265  AYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMV 324
              AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V
Sbjct: 920  CEALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKV 979

Query: 325  RAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
            +  + K P+   +F+L +L YN+N  N YSD  ++ +L+++V
Sbjct: 980  KNENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021


>gi|387592660|gb|EIJ87684.1| hypothetical protein NEQG_02231 [Nematocida parisii ERTm3]
 gi|387595289|gb|EIJ92914.1| hypothetical protein NEPG_02313 [Nematocida parisii ERTm1]
          Length = 1348

 Score =  125 bits (314), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 109/485 (22%), Positives = 209/485 (43%), Gaps = 71/485 (14%)

Query: 8   SNFFRKILQNIISRA------QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGT 61
           S  F KIL+N ++RA      +     RT+ T++       +   +   ++  F  +V  
Sbjct: 350 SGAFIKILENNLTRAFMQKIMKEVLEARTIGTQDLIRVVKGITGKD---VRMLFEAYVYR 406

Query: 62  CGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIR 121
            G P +      N R     +               +L     S + + +    G + +R
Sbjct: 407 TGIPTVIAQVEQNSRAGGFSI---------------ILKQKIHSVHPEANRHITGNICVR 451

Query: 122 VHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDM 181
           V+E D + DH +    G      E+ C+ +   R+    +KG +                
Sbjct: 452 VYETDSVLDHVLF--IGSTPITHELACNQRSQRRK----QKGEES--------------- 490

Query: 182 RSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSF 241
                + L WIR DP +E++      Q   M+  QL  + DV  Q +A+A ++  P  S 
Sbjct: 491 -----ASLLWIRIDPGLEWMKVSVVEQADYMFAEQLVSEKDVYGQMEALAGVQKNP--SE 543

Query: 242 NVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPR 301
            +   L   + D++ F++V + A   LA + +EE+ + G   +V+++ +    +N  + +
Sbjct: 544 TICGILERVMGDAQMFYKVSVAAGILLAKSVNEESGYFGFQRVVQYFINSHCIQNTTIVK 603

Query: 302 PNDFRDFSEYFVLEAIPHAVAMVRAADNKS-------PREAVE-FVLQLLKYNDNNGNPY 353
            NDF  +  YF+ + I  ++++ +    KS        +  V  F+L L++YNDN GN +
Sbjct: 604 TNDFSQYRTYFMQKNIASSMSLCQLDSTKSLSGRIIRAKNVVSAFLLNLMRYNDNTGNQF 663

Query: 354 SDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQ 413
            D F++A ++ S+  +     + L     +  I+RL + D L PS+  I+T + I+  T+
Sbjct: 664 EDSFYIADIITSLS-IALCSDAYLDTGPFVMEIERLRKKDLLFPSHQNIVTCATIKAFTR 722

Query: 414 IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVE 473
           ++++    +S   +++   P       ++VR+ A  +L+ L          L L I+  E
Sbjct: 723 LSIQGCITVSFKGILRYTLP----ENFYKVRMAAYESLILLH------PDQLKLIIEMAE 772

Query: 474 EEPSL 478
            E  L
Sbjct: 773 SEERL 777


>gi|190406471|gb|EDV09738.1| transcription initiation factor TFIID subunit 2 [Saccharomyces
            cerevisiae RM11-1a]
          Length = 1407

 Score =  125 bits (314), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 630  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686

Query: 72   SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
             +N+++ ++EL + +        +  V  +        + +  + +    + G M+IR+H
Sbjct: 687  RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 746

Query: 124  ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
            E DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   D
Sbjct: 747  EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 802

Query: 171  DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
             N  G+     +           R+S  + L        WIR D ++E++ ++H NQP  
Sbjct: 803  VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 862

Query: 212  MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
            M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A 
Sbjct: 863  MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 920

Query: 266  YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
             AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V+
Sbjct: 921  EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 980

Query: 326  AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
              + K P+   +F+L +L YN+N  N YSD  ++ +L+++V
Sbjct: 981  NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021


>gi|207347283|gb|EDZ73510.1| YCR042Cp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 1407

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 630  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686

Query: 72   SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
             +N+++ ++EL + +        +  V  +        + +  + +    + G M+IR+H
Sbjct: 687  RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 746

Query: 124  ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
            E DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   D
Sbjct: 747  EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 802

Query: 171  DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
             N  G+     +           R+S  + L        WIR D ++E++ ++H NQP  
Sbjct: 803  VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 862

Query: 212  MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
            M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A 
Sbjct: 863  MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 920

Query: 266  YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
             AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V+
Sbjct: 921  EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 980

Query: 326  AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
              + K P+   +F+L +L YN+N  N YSD  ++ +L+++V
Sbjct: 981  NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021


>gi|228348|prf||1803221A TSM1 gene
          Length = 1410

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/408 (25%), Positives = 193/408 (47%), Gaps = 56/408 (13%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 633  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 689

Query: 72   SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
             +N+++ ++EL + +        +  V  +        + +  + +    + G M+IR+H
Sbjct: 690  RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 749

Query: 124  ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
            E DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   D
Sbjct: 750  EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 805

Query: 171  DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
             N  G+     +           R+S  + L        WIR D ++E++ ++H NQP  
Sbjct: 806  VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 865

Query: 212  MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
            M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A 
Sbjct: 866  MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 923

Query: 266  YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
             AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V+
Sbjct: 924  EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 983

Query: 326  AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQ 373
              + K P+   +F+L +L YN+N  N YSD  ++ +L+++V ++   +
Sbjct: 984  NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENVVKVALNE 1031


>gi|173056|gb|AAA35179.1| TSM1 [Saccharomyces cerevisiae]
          Length = 1407

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 630  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686

Query: 72   SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
             +N+++ ++EL + +        +  V  +        + +  + +    + G M+IR+H
Sbjct: 687  RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 746

Query: 124  ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
            E DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   D
Sbjct: 747  EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 802

Query: 171  DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
             N  G+     +           R+S  + L        WIR D ++E++ ++H NQP  
Sbjct: 803  VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 862

Query: 212  MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
            M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A 
Sbjct: 863  MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 920

Query: 266  YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
             AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V+
Sbjct: 921  EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 980

Query: 326  AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
              + K P+   +F+L +L YN+N  N YSD  ++ +L+++V
Sbjct: 981  NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021


>gi|259144983|emb|CAY78248.1| Taf2p [Saccharomyces cerevisiae EC1118]
          Length = 1407

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 105/402 (26%), Positives = 189/402 (47%), Gaps = 58/402 (14%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 630  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686

Query: 72   SYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWP---------GMMSIRV 122
             +N+++ ++EL + R    +      V+      ++    +  P         G M+IR+
Sbjct: 687  RFNRKRMVIELGI-RQVQDEELGHEKVVGEEGFFKSELDHLEHPDLNRTECFTGSMTIRI 745

Query: 123  HELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGC 169
            HE DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   
Sbjct: 746  HEHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVV 801

Query: 170  DDN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPV 210
            D N  G+     +           R+S  + L        WIR D ++E++ ++H NQP 
Sbjct: 802  DVNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPD 861

Query: 211  QMWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEA 264
             M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A
Sbjct: 862  YMFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAA 919

Query: 265  AYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMV 324
              AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V
Sbjct: 920  CEALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKV 979

Query: 325  RAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
            +  + K P+   +F+L +L YN+N  N YSD  ++ +L+++V
Sbjct: 980  KNENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021


>gi|323349526|gb|EGA83748.1| Taf2p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 1406

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 629  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 685

Query: 72   SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
             +N+++ ++EL + +        +  V  +        + +  + +    + G M+IR+H
Sbjct: 686  RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSXLDHLEHPDLNRTECFTGSMTIRIH 745

Query: 124  ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
            E DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   D
Sbjct: 746  EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 801

Query: 171  DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
             N  G+     +           R+S  + L        WIR D ++E++ ++H NQP  
Sbjct: 802  VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 861

Query: 212  MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
            M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A 
Sbjct: 862  MFSSQLXQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 919

Query: 266  YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
             AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V+
Sbjct: 920  EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 979

Query: 326  AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
              + K P+   +F+L +L YN+N  N YSD  ++ +L+++V
Sbjct: 980  NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1020


>gi|6319890|ref|NP_009971.1| Taf2p [Saccharomyces cerevisiae S288c]
 gi|136453|sp|P23255.3|TAF2_YEAST RecName: Full=Transcription initiation factor TFIID subunit 2;
            AltName: Full=TAFII-150; AltName: Full=TBP-associated
            factor 150 kDa; AltName: Full=TBP-associated factor 2;
            AltName: Full=TSM-1
 gi|3904|emb|CAA45337.1| YCR724 [Saccharomyces cerevisiae]
 gi|1907184|emb|CAA42290.1| component of TFIID complex [Saccharomyces cerevisiae]
 gi|285810736|tpg|DAA07520.1| TPA: Taf2p [Saccharomyces cerevisiae S288c]
          Length = 1407

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 630  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686

Query: 72   SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
             +N+++ ++EL + +        +  V  +        + +  + +    + G M+IR+H
Sbjct: 687  RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 746

Query: 124  ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
            E DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   D
Sbjct: 747  EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 802

Query: 171  DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
             N  G+     +           R+S  + L        WIR D ++E++ ++H NQP  
Sbjct: 803  VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 862

Query: 212  MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
            M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A 
Sbjct: 863  MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 920

Query: 266  YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
             AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V+
Sbjct: 921  EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 980

Query: 326  AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
              + K P+   +F+L +L YN+N  N YSD  ++ +L+++V
Sbjct: 981  NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021


>gi|392300827|gb|EIW11917.1| Taf2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 1369

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 630  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 686

Query: 72   SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
             +N+++ ++EL + +        +  V  +        + +  + +    + G M+IR+H
Sbjct: 687  RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 746

Query: 124  ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
            E DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   D
Sbjct: 747  EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 802

Query: 171  DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
             N  G+     +           R+S  + L        WIR D ++E++ ++H NQP  
Sbjct: 803  VNCLGNVYMSPEECSRFSLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 862

Query: 212  MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
            M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A 
Sbjct: 863  MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 920

Query: 266  YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
             AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V+
Sbjct: 921  EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 980

Query: 326  AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
              + K P+   +F+L +L YN+N  N YSD  ++ +L+++V
Sbjct: 981  NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1021


>gi|151943862|gb|EDN62162.1| TATA binding protein-associated factor [Saccharomyces cerevisiae
            YJM789]
          Length = 1405

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 628  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 684

Query: 72   SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
             +N+++ ++EL + +        +  V  +        + +  + +    + G M+IR+H
Sbjct: 685  RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 744

Query: 124  ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
            E DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   D
Sbjct: 745  EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 800

Query: 171  DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
             N  G+     +           R+S  + L        WIR D ++E++ ++H NQP  
Sbjct: 801  VNCLGNVYMSPEECSRFNLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 860

Query: 212  MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
            M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A 
Sbjct: 861  MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 918

Query: 266  YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
             AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V+
Sbjct: 919  EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 978

Query: 326  AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
              + K P+   +F+L +L YN+N  N YSD  ++ +L+++V
Sbjct: 979  NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1019


>gi|349576782|dbj|GAA21952.1| K7_Taf2p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 1405

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 103/401 (25%), Positives = 190/401 (47%), Gaps = 56/401 (13%)

Query: 13   KILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            ++L  I  +A  G  P  +L++  F+H   +V    +  L+ FF  WV   G P+LR+  
Sbjct: 628  RVLPKIFLQAMSGDLPNNSLTSSHFQHVCERVN---KSKLENFFNEWVYGSGVPILRVTQ 684

Query: 72   SYNKRKNIVELAVLR--------DCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVH 123
             +N+++ ++EL + +        +  V  +        + +  + +    + G M+IR+H
Sbjct: 685  RFNRKRMVIELGIRQVQDEELGHEKVVGEEGFFKSALDHLEHPDLNRTECFTGSMTIRIH 744

Query: 124  ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS------------KPDGCD 170
            E DG  Y+H +     D +  ++IQ ++K   RR  K   G+            KP   D
Sbjct: 745  EHDGTPYEHIV--EIKDTFTKIDIQYNTKY--RRLRKRGGGANDENGVENNNEEKPIVVD 800

Query: 171  DN--GDAVAGLDM----------RSSMESPL-------SWIRADPEMEYLAEIHFNQPVQ 211
             N  G+     +           R+S  + L        WIR D ++E++ ++H NQP  
Sbjct: 801  VNCLGNVYMSPEECSRFNLTEFNRTSESNELLKQNEAFEWIRIDSDLEWICQMHINQPDY 860

Query: 212  MWINQLEKDGDVVAQAQAIAALE------ALPHLSFNVVNTLNNFLSDSKAFWRVRIEAA 265
            M+ +QL +DGD+ AQ +AI   E       +  L ++ +  L     D + F+ +R+ A 
Sbjct: 861  MFSSQLRQDGDIEAQLEAIRYYEDVVVNGGVKSLVYSSI--LFRTAIDERYFFGIRLAAC 918

Query: 266  YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
             AL+    +     G+ HL++ ++     E+  +P+ N+F +   YF+   IP  +A V+
Sbjct: 919  EALSKYVYDPDFTGGVKHLIQIFQILFCLEDSNIPKSNNFENPKLYFLQCNIPKYLAKVK 978

Query: 326  AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
              + K P+   +F+L +L YN+N  N YSD  ++ +L+++V
Sbjct: 979  NENGKCPKLVKQFLLDILVYNENGENKYSDDAYVRSLIENV 1019


>gi|157138539|ref|XP_001664244.1| hypothetical protein AaeL_AAEL014028 [Aedes aegypti]
 gi|108869468|gb|EAT33693.1| AAEL014028-PA [Aedes aegypti]
          Length = 619

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 92/307 (29%), Positives = 148/307 (48%), Gaps = 30/307 (9%)

Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE--ALPHLSFNVVNTLNNFLSDSK 255
           M  L  +H  QP   W  QL  + DV AQ +AI ALE  A P     + +T+ N     +
Sbjct: 1   MTILRSVHIEQPDFQWQFQLRHERDVTAQLEAIDALERYASPATRLALTDTIEN----EQ 56

Query: 256 AFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKSRRFDENIGLP---RPNDFRDFSEY 311
            F+ VR +AA  L   A+   T W G   ++  +K  +F  +   P   R N+F +F  Y
Sbjct: 57  VFYEVRCKAALCLTKVATAMVTSWQGPPAMLTIFK--KFFGSFSAPHIIRQNNFTNFQHY 114

Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE--- 368
           F+ + IP A+A +R A    P E + F+L L KYNDN  N YSD ++ AALV+++G    
Sbjct: 115 FLQKTIPVAMAGLRTAHGICPPEVIRFLLDLFKYNDNTKNHYSDNYYRAALVEALGNSIT 174

Query: 369 --LEFGQQSILFLSS--------LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
             +    Q +   S         +L+ + R+L  ++ +PSY   ++I+C++ + +  L+ 
Sbjct: 175 PVISVVHQGMALTSENLSADSKLVLEEVTRILNLEKHLPSYKYTVSIACLKVIRK--LQK 232

Query: 419 SGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSL 478
            G +  +   K+ + +  +     VRV A   L+D     +G    L   I  +E +P  
Sbjct: 233 CGHLPTNP--KIYRSYAAYGQYIDVRVAAMECLVDF-VKIDGRWEDLEHLIDMLETDPDP 289

Query: 479 RGQVKLG 485
             + KLG
Sbjct: 290 MARHKLG 296


>gi|241095546|ref|XP_002409520.1| transcription initiation factor TFII-D subunit, putative [Ixodes
           scapularis]
 gi|215492763|gb|EEC02404.1| transcription initiation factor TFII-D subunit, putative [Ixodes
           scapularis]
          Length = 1137

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 127/509 (24%), Positives = 217/509 (42%), Gaps = 102/509 (20%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRT---------LSTKEFRHFANKVGNLERPFL 51
           MLE ++G     ++   ++S A  A+  +          LST  F+     V   +   +
Sbjct: 352 MLEDRIGRELLLQVFNKLLSLASSAAQQKVTSNMWHNMLLSTNSFQKAIFIVTGKD---I 408

Query: 52  KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGD 111
           K F  +WV   G       F +N+++N VEL +                  TD+ +  G 
Sbjct: 409 KTFLEQWVRQGGHAKFHGTFVFNRKRNTVELEI----------------KQTDTHSL-GI 451

Query: 112 IGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDD 171
             + G +++ + ELDG + H  L +  +A +  +I CHSK    +  K    +       
Sbjct: 452 KKYVGPLTVTIQELDGTFKH-NLQIEENATKH-DITCHSKSRRNKKKKIPLCT------- 502

Query: 172 NGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
            G+ V  +D+ +   +SP+ WIR DP+M  L ++ F QP   W  QL  + DV +Q    
Sbjct: 503 -GEEV-DMDLTAMDADSPVLWIRIDPDMLVLRQVVFEQPDYQWQYQLRYERDVTSQ---- 556

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS 290
                                          IE A  + +T      WAG   ++  +K 
Sbjct: 557 -------------------------------IEVANHMVST------WAGPPAMMTIFKK 579

Query: 291 RRFDENI-GLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNN 349
                +   + R N+F +F  YF+ +AIP A+A +R      P E ++F+  L KYN+N+
Sbjct: 580 MFGSHSCPNIIRQNNFSNFQHYFLQKAIPVAMAGLRTIHGICPPEVMKFLFDLFKYNENS 639

Query: 350 GNPYSDVFWLAALVQSVGELEFGQQSILF----------LSS----LLKRIDRLLQFDRL 395
            N +SD ++  AL++++GE      S+L           LS+    +L+ I R L  ++L
Sbjct: 640 KNKFSDNYYRTALIEALGETVTPVVSVLAHNGQPITAESLSADTKLILEEITRCLNLEKL 699

Query: 396 MPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
           +P Y   +T+ C++ +     KL     L  +  L + +  +     VR+ A  AL+D  
Sbjct: 700 LPCYKFTITVGCLKAIR----KLQKMGHLPNISALFREYAQYGLFLDVRLAALEALVDYT 755

Query: 456 FHCNGIDSALSLFIKSVEEEPSLRGQVKL 484
              +G    LS  +  VE +P  R ++ L
Sbjct: 756 -TVDGNADDLSFLLDIVENDPVPRVRLAL 783


>gi|353229094|emb|CCD75265.1| Tata binding protein associated factor (M01 family) [Schistosoma
           mansoni]
          Length = 1192

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 115/445 (25%), Positives = 203/445 (45%), Gaps = 62/445 (13%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           LST  FR   + V   +   ++ F  +W    G   +   F +N+++N+VEL + +D   
Sbjct: 563 LSTASFRRIISMVTGQD---IRNFLNQWACHAGHVRIFAKFHFNRKRNVVELELKQDLQS 619

Query: 91  KPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHS 150
           +                  G + + G +++ + ELDG + H      G   +  ++ CHS
Sbjct: 620 R------------------GTLNYTGPITVMLQELDGAFIHTFKLEEGRMSR--DLPCHS 659

Query: 151 KLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPV 210
           K    R  K            NGD V     R   ESPL W+R DP++  + +IH +QP 
Sbjct: 660 KSRKHRKKKISLA--------NGDEVDMDLARIDPESPLLWLRIDPDLAIIHDIHVDQPD 711

Query: 211 QMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN 270
            MW   L  D D + Q +A+ AL+     S    + L+N + + + F+ VR+EA + L +
Sbjct: 712 FMWHLMLAHDRDCLGQLEAVNALKDFA--SPETRHVLSNIILNERIFYHVRMEACFTLCH 769

Query: 271 ------------TASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIP 318
                       T +  T  + LL +  F++        GL R N+F +   YF+  A+ 
Sbjct: 770 VVNELSTLSNNPTVASATVTSCLLPM--FWQLFSSPAARGLIRQNNFTNLQFYFLQRAMV 827

Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL---EFGQQS 375
            A++ +R      P + + F+  LL++NDN+ NPYSD +++A LV+++ +        + 
Sbjct: 828 KALSTLR-VQQVCPHDILIFISDLLRHNDNSRNPYSDSYYVADLVKAMKDTLTPAIIVRG 886

Query: 376 ILFLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVV 428
           ++  S+L       ++ +  LL  +    SY G +T+ C+  + +  L+  GF+ +D   
Sbjct: 887 VMSASTLPFEARTVIEEVSHLLNLESETISYKGTVTVVCLAAIRR--LQRLGFLPIDP-- 942

Query: 429 KLIKPFRDFNTIWQVRVEASRALLD 453
            L   +   N    +RV A  A++D
Sbjct: 943 SLFYQYALSNFYCDIRVAAIEAVVD 967


>gi|328772360|gb|EGF82398.1| hypothetical protein BATDEDRAFT_86202 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 1595

 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 113/427 (26%), Positives = 174/427 (40%), Gaps = 82/427 (19%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKV-GNLERPFLKEFFPRWV 59
           MLEK+MG N  +KI   ++  A        LST  F     K+ G LE    KEF  +W+
Sbjct: 519 MLEKRMGKNLLQKIANKLMVSAMSGELPTGLSTTHFLKMVRKISGKLE---TKEFADQWI 575

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
              GCP+  + + +N++K ++EL V +DC     S   V+ +N           + G   
Sbjct: 576 FRSGCPIFTIRYHFNRKKMVIELRVKQDC-----SNVDVIGANQR---------FSGAFM 621

Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSK-------------- 165
           +RV E  G +D  +     +  +  +I  H+K    R    KK  K              
Sbjct: 622 VRVQEPGGTFDTEV--RIEEKTKQYDIIYHTKYKRIRRRAGKKYKKVADNDDDDDDEDVY 679

Query: 166 ------------PDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMW 213
                        D    N D     ++         WIR DP+  ++    F Q   MW
Sbjct: 680 YMEEDEKANDAIADAEAQNQDTNGNFNIIEPDRITFEWIRLDPDNVWICTKTFEQEDFMW 739

Query: 214 INQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTAS 273
              L K+ D+ AQ++AI AL  +  L+   +NT+  F+ D   F+ +R+ A   L+   S
Sbjct: 740 NALLRKEKDIGAQSEAIVALSKIQSLA--SLNTITTFVRDHSHFYHLRLLAVSVLSKFDS 797

Query: 274 EETDWAGLLHLVKFYKS---RRFDENIGLPRPNDFRDFSEY-FVLEAIPHAVAMVRAADN 329
           +E     L +L+K Y+      +D  I +P+ N+F    +Y   LE +      V A   
Sbjct: 798 DELYSTALGNLIKIYRDIFCISYDGPI-IPKKNNFSSLQDYKLRLEIVRSLAKFVDARGA 856

Query: 330 KSPREAVEFVLQLLKYNDNNGN----------------------------PYSDVFWLAA 361
             P   + FV  LLKYNDN+GN                             YSD  W++ 
Sbjct: 857 TLPVVRMMFV-NLLKYNDNSGNEVKIAACTEYILMSVFFNYHYMLYRGQFKYSDGEWVSL 915

Query: 362 LVQSVGE 368
           LV+ +G+
Sbjct: 916 LVKLIGD 922


>gi|258575701|ref|XP_002542032.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237902298|gb|EEP76699.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 667

 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 115/413 (27%), Positives = 184/413 (44%), Gaps = 88/413 (21%)

Query: 116 GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGSKPDGCDDN 172
           G M+IR+HE DG  Y+H +     +A    +I  ++K     R   + ++ +   G D N
Sbjct: 42  GSMTIRIHEADGTPYEHIV--EIKEAVTKFDIPYNTKYKRLKRNKRQKERAAASSGFDPN 99

Query: 173 ------------GDAVAGLDM---------------RSSMESPLSWIRADPEMEYLAEIH 205
                       GD +   +                R   ES   WIR D + E++ ++ 
Sbjct: 100 TEIQDDVLLYCLGDVLQSEEEMRKWRLVEWTKEDEDRMGQES-YEWIRMDADFEWICKMS 158

Query: 206 FNQPVQMWINQLEKDGDVVAQAQAIAALEAL---PHLSFNVVNTLNNFLSDSKAFWRVRI 262
              P  M+++QL++D DVVAQ ++I  + A    P +S   V TL     D + F+ +R 
Sbjct: 159 LTMPGYMYLSQLQQDRDVVAQLESIQYMVAQREHPLISTIFVRTL----MDKRYFYGIRS 214

Query: 263 EAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVA 322
            AA+AL   A EE  W GL HL   ++      N  + R NDF D + Y + +AIP A+A
Sbjct: 215 AAAHALVKHAKEEVHWIGLFHLETAFQELFCLPNSPMSRSNDFSDRASYCLQKAIPEAMA 274

Query: 323 MVRAADNKSPREAVEFVLQLLKYNDNNGNP------------------YSDVFWLAALVQ 364
            VR    +SP    +F+ + LK+NDN+ N                   +SD +++AAL++
Sbjct: 275 KVRNRHGQSPMRVKKFLYEKLKFNDNSNNKVPLSLGSHTNDCELMLPQFSDCYYVAALIR 334

Query: 365 SVGEL-----------------EFGQQS--ILFLSSLLKRIDRLLQFDRLMPSYNGILT- 404
           S+ +                  E G+Q+   L+L  + + IDR  + D    S+  + + 
Sbjct: 335 SITDALINGKENSDGVDFDINRELGRQAENQLYLDCIAE-IDRYKRMDEWTSSFQNLYSR 393

Query: 405 --ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLE 455
             I C R L      + G + LD  V    P+    T   +R++A R LLDL+
Sbjct: 394 TAIDCQRRL------MKGKV-LDVDVAQFLPYARVGTFDLLRLDAFRILLDLD 439


>gi|453085709|gb|EMF13752.1| hypothetical protein SEPMUDRAFT_148949 [Mycosphaerella populorum
            SO2202]
          Length = 1596

 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 133/547 (24%), Positives = 232/547 (42%), Gaps = 111/547 (20%)

Query: 2    LEKQMGSNFFRKILQNIISRAQ-GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
            L K  GS    +I+  I   A+ GA     LST +F+    K+G+ +   L+ FF +WV 
Sbjct: 545  LIKASGSTGVLRIINKIFLNAKTGALSNGELSTADFQRTCEKLGHNK---LESFFRQWVW 601

Query: 61   TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG--DIGWP--- 115
              GCP+  +   +NK+K +VE+ + +   ++  ++    +SN   E ++   ++  P   
Sbjct: 602  GSGCPIFYVQQRFNKKKLVVEMTIFQR-QLERQTKPAFEASNFMREVKEHIQEVWAPEVQ 660

Query: 116  ----GMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGSKPDG 168
                G M+IR+HE DG  Y+H +     +    LEI  ++K     R  L+  +    +G
Sbjct: 661  PVFTGPMTIRIHEADGTPYEHIV--EIKEPITKLEIPYNTKYKRLKRSRLQKNRAEAANG 718

Query: 169  CDDNGDAVAGL-------------------------DMRSSMESPLSWIRADPEMEYLAE 203
              ++ D  + L                         D     +    WIR D + E++ +
Sbjct: 719  QLNDEDGQSALLYCLGDILATNEDKEEWKLVDWSAEDEEKMNQESYEWIRMDADFEWIGK 778

Query: 204  IHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL--PHLSFNVVNTLNNFLSDSKAFWRVR 261
            IH   P+ M+I+QL++D D+ AQ +++  L A    H+S ++   L   L D + F  +R
Sbjct: 779  IHLVMPLYMYISQLQQDRDIAAQYESMRYLLASNPHHVSLSI---LLRTLMDERYFHGIR 835

Query: 262  IEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG----LPRPNDFRDFSEYFVLEAI 317
            + AA  LA  A ++    G   L+K ++   F E  G    +PRPNDF       +  AI
Sbjct: 836  VMAAEGLAKLAKDKLMDIGKHQLIKAFE-WFFCETEGNGPVMPRPNDFSSRITLIMQCAI 894

Query: 318  PHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE--------- 368
            P A+A +R  D K P +  E    LL +NDN     SD  ++A L+  + +         
Sbjct: 895  PEAMAKLRDEDGKVPMDVRELFQALLTHNDNANPDISDCHYVATLMNCLADSLVASKWDG 954

Query: 369  ------------------LEFGQQS--------------------------ILFLSSLLK 384
                               EFG+                              F  S + 
Sbjct: 955  GKKKEVVAPEPEVPQSYTFEFGEDEEDTGMMDAGGPVKDLIDFDEEPVDPDAPFTKSAID 1014

Query: 385  RIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVR 444
             I+R  + D  + +Y  + +++ I+ L ++   +   +  D+ V+L++ +   +T   VR
Sbjct: 1015 AIERHRRLDEYVVTYQNLYSVTAIKCLQKL---IKAGVVKDKTVELMQ-YTSHSTADNVR 1070

Query: 445  VEASRAL 451
            +EA R +
Sbjct: 1071 LEAFRCM 1077


>gi|402222601|gb|EJU02667.1| hypothetical protein DACRYDRAFT_21692 [Dacryopinax sp. DJM-731 SS1]
          Length = 1785

 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 93/300 (31%), Positives = 153/300 (51%), Gaps = 37/300 (12%)

Query: 187 SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNT 246
           S   WIR D + E++A I F QP  MW +QL++D DVVAQ +AI AL  +P L  + + T
Sbjct: 689 STYEWIRCDADFEWIATIRFEQPDFMWTSQLQRDRDVVAQLEAIHALARMPSLVVSSILT 748

Query: 247 LNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDE-NIG------- 298
               + +   F+RVR+EAA AL + A+ E    G+ HL K +    + E N G       
Sbjct: 749 KTVLVENY--FFRVRMEAALALQSCANIE--HIGMFHLFKLFTRYCYSEANHGDRDFREK 804

Query: 299 -LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPRE-AVEFVLQLLKYNDNNGNPYSDV 356
            +P PNDF+DF+ YF+ +++  A++  R   N +    A    + +L+YN+N  N YSD 
Sbjct: 805 FIPEPNDFKDFASYFIRKSVLVAISRCRDDSNGNTWSVARRLFIDVLRYNNNAKNQYSDS 864

Query: 357 FWLA--------ALVQSV-----------GELEFGQQSILFLSSLLKRIDRLLQFDRLMP 397
            +L+        ALV  V           G +   +     L   ++ ++R    D+ +P
Sbjct: 865 RYLSTAIVALGGALVPGVPPERGILTDAEGPVPLTESDATILEEAVQEVERYRLRDQYVP 924

Query: 398 SYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFH 457
           S + ++TI+ +    ++ L L G I+ D     ++  R+ N +  VR+ A  ALL ++++
Sbjct: 925 SQHNVITIAALE--FKLMLMLGGLIATDGNF-FLEYTREENFV-PVRLAAFDALLVMKWY 980



 Score = 42.0 bits (97), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 28/113 (24%)

Query: 30  TLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
           TLST  F     KV  ++    K F  +W+   G P   +  S+N++K  VE+ + +DC 
Sbjct: 507 TLSTHSFLRTCRKVSGVDP---KTFADQWIYGSGTPRFGVTASFNRKKMAVEVTIAQDC- 562

Query: 90  VKPDSRTPVLSSNTDSENRDGD-IG---------WPGMMSIRVHELDGM-YDH 131
                  P   ++      +GD +G         + G ++IR+HE DG  Y+H
Sbjct: 563 -------PAYMAH------EGDPVGLAVFNPVQLFEGPITIRIHEADGTPYEH 602


>gi|327305057|ref|XP_003237220.1| transcription initiation factor TFIId 127kD subunit [Trichophyton
           rubrum CBS 118892]
 gi|326460218|gb|EGD85671.1| transcription initiation factor TFIId 127kD subunit [Trichophyton
           rubrum CBS 118892]
          Length = 1251

 Score =  120 bits (300), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 106/404 (26%), Positives = 182/404 (45%), Gaps = 51/404 (12%)

Query: 14  ILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTC-----GCPVLR 68
           IL+  +S+A G + +  + ++ F     ++G   RP L   +      C     G   L 
Sbjct: 514 ILERRLSKASGKATISRIISRIF--LNARMGR--RPRLLVHYHTAGVACLHTLTGEMALS 569

Query: 69  MGFSYNKRKNIVELAVLR---DCTVKPDSRTPVLSSNTDSENRDGDIG-----WPGMMSI 120
               +NK+K +VE+ + +   + T   D        +   E R    G     + G M+I
Sbjct: 570 SANGFNKKKLVVEMMISQVQSEQTTARDLERSTFMRDVKEEIRSVYAGTVQNVFTGSMTI 629

Query: 121 RVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLA--ARRALKPKKGSKPDGCDDN----- 172
           ++HE DG  Y+H +     +A    +I  ++K     R   + ++ +   G D N     
Sbjct: 630 QIHEADGTPYEHIV--EIKEAITKFDIPYNTKYKRLKRNRRQKERAAAMSGIDPNSETQD 687

Query: 173 -------GDAV-AGLDM--------------RSSMESPLSWIRADPEMEYLAEIHFNQPV 210
                  GD + +  DM              R   ES   WIR D + E++ ++    P 
Sbjct: 688 DVLLYCLGDVLQSDEDMEKWRIVEWTKEDESRMGQES-YEWIRMDADFEWICKMSLGMPG 746

Query: 211 QMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALAN 270
            M+++QL++D DVVAQ ++I  + A    +  + +     + D + F  +R  A  AL  
Sbjct: 747 YMYLSQLQQDRDVVAQLESIQHMAAQREHAL-ISSIFVRTILDRRYFHGIRTAATKALVK 805

Query: 271 TASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNK 330
            A +E DW GL HL   ++      N  + R NDF D S Y V  A+  A++ VR  D +
Sbjct: 806 HAKQEVDWIGLFHLENAFQELYCLPNSSMARSNDFSDRSSYAVQLALIEAISEVRDEDGR 865

Query: 331 SPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFGQQ 374
           +P    +F+ + LK+NDN+ N YSD F+++ L++S+     G++
Sbjct: 866 TPMRVKQFLYEKLKFNDNSNNEYSDCFYISTLLKSICNALLGRR 909


>gi|412991384|emb|CCO16229.1| predicted protein [Bathycoccus prasinos]
          Length = 1510

 Score =  119 bits (299), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 118/427 (27%), Positives = 184/427 (43%), Gaps = 59/427 (13%)

Query: 25   ASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVE--L 82
            AS  R +S++ F        NL +  +  F  RWV  CG P   + +S  KRKN +E  L
Sbjct: 807  ASNARWISSQHFLDHCRSAVNLGKGEIASFVERWVYGCGSPTFDVEYSIRKRKNKLEFVL 866

Query: 83   AVLRD-CTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDH---------- 131
             V+    T+  D     ++ N  +            ++IR+ E     DH          
Sbjct: 867  KVIHSPATLAADEAASRVARNHRTS-----------VTIRLQEDATASDHVVALTAGGAT 915

Query: 132  ----PILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKP-----DGCDDNGDAVAGLDMR 182
                 + P   + +  + +Q  +K +     + KK S P      G D N D +   D  
Sbjct: 916  AAPGTVAPSEHEHFAEMPLQVKTKESR---YEKKKFSAPVVESGTGADINNDLLTAKD-- 970

Query: 183  SSMESPLSWIRADPEMEYLAEIHF-NQPVQMWINQ---LEKDGDVVAQAQAIAALEALPH 238
                + +SW+R DP+ E+L  I    + V +   Q   L+K+ DVVAQ+ A+  L     
Sbjct: 971  ----NQVSWVRVDPDGEWLCNIRLPTEQVGLESMQALLLDKEMDVVAQSSAVRFLAERAK 1026

Query: 239  L-SFNVVNTLNNFL-SDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDEN 296
            + S      L   L S+   F RVR EAA AL  +   ++ + GL  LV +Y+  + D N
Sbjct: 1027 IGSQTACAALEKCLRSNKTTFCRVRAEAAMALGKSTGVKSKFLGLSALVSYYRRTQCDPN 1086

Query: 297  IGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADN------KSPREAVEFVLQLLKYNDNNG 350
             G P+PND RD SE  V EA+  A+       N       +P  A + ++  L  NDN G
Sbjct: 1087 TGFPKPNDTRDVSETIVDEAVLLALGASVDDSNSMNGVISTPTSAFDCLVDRLSRNDNTG 1146

Query: 351  NPYSDVFWLAALVQ--SVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
            NP SDV  + A ++  S+      ++  L ++  ++   R   F   + S   ++  SC 
Sbjct: 1147 NPNSDVGLIVAALEALSISMPPDAKRRDLAMAQAVRYAHRDPVF---IKSDRNVVARSCY 1203

Query: 409  RTLTQIA 415
            R L  +A
Sbjct: 1204 RALANLA 1210


>gi|312384150|gb|EFR28947.1| hypothetical protein AND_02477 [Anopheles darlingi]
          Length = 657

 Score =  118 bits (296), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 92/307 (29%), Positives = 148/307 (48%), Gaps = 30/307 (9%)

Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE--ALPHLSFNVVNTLNNFLSDSK 255
           M  L  ++  QP   W  QL  + DV AQ  AI ALE  A P     + +T+ N     +
Sbjct: 1   MTLLRSVNIEQPDFQWQFQLRHERDVTAQLDAIQALEKYATPQARLALTDTIEN----ER 56

Query: 256 AFWRVRIEAAYALANTASEE-TDWAGLLHLVKFYKSRRFDENIGLP---RPNDFRDFSEY 311
            F+ VR +AA  L   A+   T W G   ++  +K  +F  +   P   R N+F +F  Y
Sbjct: 57  IFYEVRCQAALCLTKVANAMVTSWQGPPAMLTIFK--KFFGSYSAPHIIRQNNFTNFQHY 114

Query: 312 FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEF 371
           F+ + IP A+A +R A    P E + F+L L KYNDN+ N YSD ++ AALV+++G    
Sbjct: 115 FLQKTIPVAMAGLRTAHGICPPEVIRFLLDLFKYNDNSKNHYSDNYYRAALVEALGNTIT 174

Query: 372 GQQSILF----LSS---------LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
              S++     LSS         +L+ + R+L  ++ +PSY   ++I+C++ +     KL
Sbjct: 175 PIISVVHQGRALSSENLSADAKLVLEEVTRILNLEKHLPSYKYTVSIACLKVIR----KL 230

Query: 419 SGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSL 478
                L    K+ + +  +     VR+ A   L+D     +G    L   I+ +E +P  
Sbjct: 231 QKCGHLPTNPKIYRSYAAYGQYIDVRLAAMECLVDY-VKIDGRWEDLEHLIELLETDPDP 289

Query: 479 RGQVKLG 485
             + +LG
Sbjct: 290 MARHQLG 296


>gi|303283346|ref|XP_003060964.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457315|gb|EEH54614.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1697

 Score =  116 bits (291), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 120/450 (26%), Positives = 198/450 (44%), Gaps = 67/450 (14%)

Query: 26   SPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNK--RKNIVELA 83
            SP R L    F     +   + +  +  F  RW+  CGCP L +G+++ K  R+ ++   
Sbjct: 755  SPARFLRQSVFLTSCRQNATMTKQEMAAFTARWIEGCGCPRLTVGYTFRKSRRQELLFAI 814

Query: 84   VLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELD-GMYDHPILPMAGD-AW 141
             L  C     +       N               +++RV E D    DH +   A D A+
Sbjct: 815  KLDGCRAAAAADKIAFRKNPKIS-----------VTVRVQETDLPPSDHAVSLSAHDRAY 863

Query: 142  QLLEIQCHSKLAARR--------ALKPKKGSKPDGCDDNGDAVAGLDMRSSM-----ESP 188
             L+ +Q  +K   RR        AL  +  ++  G  +  D    LD  ++      E P
Sbjct: 864  CLMPLQLVTKPKDRRTIARQQQAALDRQARARETGNQNPDD----LDAAAAAEVLQWECP 919

Query: 189  LSWIRADPEMEYLAEIHFNQPVQ------MWINQLEKD--GDVVAQAQAIAALEALPHL- 239
            ++W+R DPE E+LAE+    P++      M   QL K+   DV AQ  AIA L A     
Sbjct: 920  VNWVRIDPEGEWLAEVRV--PIEQIGLEGMITAQLRKERPADVGAQLHAIAYLRARAEAG 977

Query: 240  SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGL--LHLVKFYKSRRFDENI 297
            S + VN L + + D+K F RVR +AA AL   A ++T  + L    +++ Y+ RR D   
Sbjct: 978  STSAVNALLHCVEDAKTFCRVRADAAKALGACACDDTQRSNLAMTSVMRTYRKRRCDPET 1037

Query: 298  GLPRPNDFRDFSEYFVLEAIPHAVAMVRA----------ADNKSPREAVEFVLQLLKYND 347
            GLP P D R++++  V E +  A+ + R            D  +P E V+ ++ +L   +
Sbjct: 1038 GLPAPTDLRNYADALVDEGLVAALGLPRVPAREAAANDMVDYTTPSECVDLLVDMLNNLE 1097

Query: 348  NNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNG------ 401
            ++G+P+     +A  ++++G      +    L S L  I R L  D  +    G      
Sbjct: 1098 SDGDPHDMSSLVATALRALGAASL--EKTDALQSSLDAISRWLTKDVNICGVGGRTHAGG 1155

Query: 402  -ILTISCIRTLTQIALKLSGFISLDQVVKL 430
              +T + +  L  +A +L    SL +VV +
Sbjct: 1156 RRVTTAALEALRAVATRLP---SLARVVAV 1182


>gi|402589753|gb|EJW83684.1| hypothetical protein WUBG_05405 [Wuchereria bancrofti]
          Length = 523

 Score =  115 bits (289), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 113/378 (29%), Positives = 182/378 (48%), Gaps = 49/378 (12%)

Query: 105 SENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGS 164
           S NR G   + G + + V ELDG + H +   +  +    ++QCHSK   RR  K KK  
Sbjct: 11  SANR-GRQSYVGPLIVVVQELDGSFTHTVQIDSNHSKH--DLQCHSK--GRRQ-KKKKIP 64

Query: 165 KPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVV 224
              G +   D +A +D     +SP+ WIR DP++  L ++   QPV  W   L+ + DV+
Sbjct: 65  LSTGEELEID-LANMDP----DSPVLWIRIDPDLLLLRKVSIQQPVYQWEYMLKYERDVL 119

Query: 225 AQAQAIAALEALP--HLSFNVVNTLNNFLSDSKAFWR---VRIEAAYALANTASE--ETD 277
           AQ QA+  L+  P  H    ++ T+   +     F+R   VR  AAY L    ++  ET 
Sbjct: 120 AQLQALDVLQRFPSPHARAILIETIETEV----FFYRQVLVRCRAAYILVEVVNKLPET- 174

Query: 278 WAG----LLHLVKFYKSRRFDENIGLPRPNDFRDFSE----YFVLEAIPHAVAMVRAADN 329
           W G    ++   KF+  +       +P+PN+F   S     YF++ AIP A+A +R++ N
Sbjct: 175 WLGPPALMILFKKFFGCKSAPH---IPKPNNFVATSANLQLYFLMHAIPQALARLRSSSN 231

Query: 330 KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL-------EFGQQSILFLSS- 381
            +  E   F++ L+KYNDN  N YSD  + A+L+ S+          +    S L LS+ 
Sbjct: 232 NALSEVHHFLIGLIKYNDNTVNRYSDDHYRASLINSLAATLVPAENRDITNMSPLSLSTE 291

Query: 382 ---LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFN 438
              LL      L  D L PS+  ++ I+ ++ +    ++ +  I LD   K+   F   N
Sbjct: 292 MKELLSEFTLALNMDTLKPSFGRVIGITALQAIYH--MQRTFHIPLDP--KVFWSFAQPN 347

Query: 439 TIWQVRVEASRALLDLEF 456
               +R+ A   L+D+ +
Sbjct: 348 MYVPMRLAALTFLIDMVY 365


>gi|340905010|gb|EGS17378.1| hypothetical protein CTHT_0067030 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1589

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 129/525 (24%), Positives = 234/525 (44%), Gaps = 84/525 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            + K  GS+   +++  ++S A   + + T  LS  +FR    K     RP   E + +WV
Sbjct: 492  MSKIPGSSGIVRVISQLVSNANINTTIETTSLSEADFRKACEKKSQY-RP--DELWEQWV 548

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG------ 113
               GCP L +   +NK+   V++ + +    K  S+   ++ +        ++       
Sbjct: 549  HGAGCPKLSVKQRFNKKNLNVDITITQ-VQAKDSSQPKAITKDVFQRELFEEVHEVWAGP 607

Query: 114  ----WPGMMSIRVHELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKPKK------ 162
                + G ++IRVHE DG  Y+H  L ++       ++Q       +R  + KK      
Sbjct: 608  VPKYFTGPITIRVHEADGTPYEH-YLSISDKDKGHTQLQIAYNTKYKRMKRTKKAIMAAA 666

Query: 163  ----GSKPDGCDDN-------GDAVA--------GL-DMRSSMESPLS-----WIRADPE 197
                G K D  +D+       GD +         GL D   S++  +      WIR D  
Sbjct: 667  AASTGDKHDIQEDDIVYFNMLGDVLMSPQDAIDWGLQDWSESVQQQMDQESYEWIRFDCN 726

Query: 198  MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLN-NFLSDSKA 256
             E+L  +H + P  M+  QL++D DVVA   A+       H +  V +T++   + D + 
Sbjct: 727  FEWLCALHSDMPGYMYSAQLQQDRDVVAHQDAMLFFSKQRHHA--VTSTIDVRTVMDRRY 784

Query: 257  FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL-------PRPNDFRDFS 309
            F+ +R+ AA+ L   A+ + +  G  HL+  Y+    D+ IG        P PNDF D +
Sbjct: 785  FYGIRVMAAHDLTKQANPDLNCIGKAHLILLYRHFFCDKVIGKNGQVTYPPAPNDFYDKA 844

Query: 310  EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS---- 365
            +Y V  AIP A+A +R  +   PR+A  F+L LL  N      YSD F++  L+Q+    
Sbjct: 845  QYEVQCAIPQAIARIR-ENGFCPRDARRFLLDLLVLNAETEGEYSDHFYICKLLQALTTS 903

Query: 366  -VGELEFGQQSIL---------------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
             + E   G+  +L               F+ ++L  I++  + D    +++ I T + + 
Sbjct: 904  IIPEKYTGETDLLSSLRSDDPVESELKDFIKAVLDEIEKYRRKDEWELTWHNIWTTTTLD 963

Query: 410  TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
               ++ L  +  I ++  ++ ++  +D N +  +R++A   L++L
Sbjct: 964  --CKMRLMKARVIPVEP-IEFVRYLQDEN-LDLIRIKAHECLVEL 1004


>gi|238583594|ref|XP_002390290.1| hypothetical protein MPER_10455 [Moniliophthora perniciosa FA553]
 gi|215453537|gb|EEB91220.1| hypothetical protein MPER_10455 [Moniliophthora perniciosa FA553]
          Length = 672

 Score =  112 bits (281), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 91/321 (28%), Positives = 150/321 (46%), Gaps = 41/321 (12%)

Query: 18  IISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRK 77
            +S   G      LST  F     KV  ++    + F  +W+   GCP   +  ++N++K
Sbjct: 365 FLSAISGEMQNNCLSTHSFLRMCRKVSGVDP---RSFAEQWIYGSGCPSFGVTATFNRKK 421

Query: 78  NIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW-PGMMSIRVHELDGM-YDHPILP 135
             VE+ + ++C   P  ++  L  N  ++     + +  G M+IR+HE DG  Y+H +L 
Sbjct: 422 MAVEITMRQEC---PAYQS--LEHNEFAKVMYKPVQFFEGTMTIRIHEADGTPYEH-VLD 475

Query: 136 MAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSME--------- 186
           +    ++  E+  ++K    R    K+  +P G  D G  +   +     E         
Sbjct: 476 IR-QPFKRFEVPFNTKYKRVRR-NTKRPLRPMGMVDMGFGLEIWEKEQERENWKVADWTE 533

Query: 187 --------SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPH 238
                   +   WIR D + E++A + F QP  MW++QL++D DVVAQ +A+ AL   P 
Sbjct: 534 EDENVMSGATYEWIRMDADFEWIANLKFEQPDFMWVSQLQRDRDVVAQMEAVHALARQP- 592

Query: 239 LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFD---- 294
            +  V +TL   +  S   +R+R+EAA AL N A    D+ GL HL K +    ++    
Sbjct: 593 -TAIVSSTLTKTVLVSNYHYRIRVEAALALVNCAIHRLDFLGLFHLFKLFLRYCYEPEDP 651

Query: 295 -----ENIGLPRPNDFRDFSE 310
                 +  +P+PNDF D SE
Sbjct: 652 KQNLFTHTYVPKPNDFSDLSE 672


>gi|348588273|ref|XP_003479891.1| PREDICTED: transcription initiation factor TFIID subunit 2-like
           [Cavia porcellus]
          Length = 1111

 Score =  112 bits (281), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 120/474 (25%), Positives = 202/474 (42%), Gaps = 94/474 (19%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 407 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 466

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 467 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 504

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G + + V ELDG ++H  L +  ++ +  +I CHSK    +  K           
Sbjct: 505 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLKH-DIPCHSKSRRNKKKKIPLM------- 555

Query: 171 DNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
            NG+ V  +D+ +   +SPL WIR DP+M  L  + F Q   MW  QL  + DVVAQ ++
Sbjct: 556 -NGEEV-DMDLSAMDADSPLLWIRIDPDMAVLRRVEFEQADFMWQYQLRYERDVVAQQES 613

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTA-SEETDWAGLLHLVKFY 288
           I ALE  P  +  +   L + L   + F+RVR+ A + LA  A S  + W G   +   +
Sbjct: 614 ILALEKFPTPASRLA--LTDILEQEQCFYRVRMAACFCLAKIANSMVSTWTGPPAMKSLF 671

Query: 289 KSRRF--DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYN 346
            +R F       + + N+F  F  YF                             L K++
Sbjct: 672 -TRMFCCKTCPNIVKTNNFMSFQSYF-----------------------------LQKFS 701

Query: 347 DNNGNPYSDVFWLAALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLMPSY 399
           DN    Y     + AL  SV         +  L +L       L+ I R L  ++L+PSY
Sbjct: 702 DN----YYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLLPSY 757

Query: 400 NGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
              +T+SC+R +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 758 RHTITVSCLRAIR--VLQKNGHVPSDPT--LFKSYAEYGHFVDIRIAALEAVVD 807


>gi|255078532|ref|XP_002502846.1| predicted protein [Micromonas sp. RCC299]
 gi|226518112|gb|ACO64104.1| predicted protein [Micromonas sp. RCC299]
          Length = 1703

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 107/391 (27%), Positives = 175/391 (44%), Gaps = 47/391 (12%)

Query: 12   RKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGF 71
            R+  ++I   A   SP R L ++ F +   +   + +  +  F  RW+  CGCP L   +
Sbjct: 751  REHQESIAYHALAESPARFLRSRVFLNLCRQNATMTKADMSAFNARWIEGCGCPRLTAAY 810

Query: 72   SYNK--RKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELD-GM 128
            ++ K  R+ ++    L  C     +     S N               +++R+ E D   
Sbjct: 811  TFRKTRRQELLFAIKLDGCLAAAAADRVAWSKNPKVS-----------VTVRIQENDLPP 859

Query: 129  YDHPILPMAGD-AWQLLEIQCHSKLAARR--------ALKPKKGSKPDG--CDDNGDAVA 177
             DH +   A D A+ L+ +Q  +K   RR        A++ ++ +   G    DN DA+A
Sbjct: 860  SDHAVSLSAHDRAYVLMPLQLVTKPKDRRTIARQQEAAMRRQQQAMEAGKANPDNLDAMA 919

Query: 178  GLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQ----MWINQL--EKDGDVVAQAQAIA 231
              ++    E P+SW+R DPE E+LA++      Q    M   QL  E+  D  AQ  AI 
Sbjct: 920  AAEVL-QWECPVSWVRVDPEGEWLAKVVLPSAQQGLEGMITKQLMSERPADPAAQMWAIR 978

Query: 232  AL-EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLL--HLVKFY 288
             L E   + S + VN L     D K F RVR  AA AL   A + T  + L    + + Y
Sbjct: 979  FLRERAENGSTSAVNALLQCAEDPKTFCRVRALAAMALGACACDGTQRSNLALSSVTRTY 1038

Query: 289  KSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRA----------ADNK--SPREAV 336
            + RR D   G P+P D  DF+   V E +  A+ + RA          AD+   +P E V
Sbjct: 1039 RRRRCDPRTGKPKPTDLADFAAAIVDEGLLTALGLPRAPPESTSGWEDADDAWVTPTECV 1098

Query: 337  EFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
            + ++  L + +++G+P+     +A  + ++G
Sbjct: 1099 DLLVDALNHTESDGDPHDSSSLIATAIHALG 1129


>gi|406700376|gb|EKD03547.1| hypothetical protein A1Q2_02130 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 1847

 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 75/241 (31%), Positives = 125/241 (51%), Gaps = 27/241 (11%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR DP+ E+LA     +    WI+QLE + DVVAQ Q +       + S  + + L   
Sbjct: 659 WIRVDPDCEWLATFDIQEKPWFWISQLEGESDVVAQLQRMCM-----YPSPVIASELAKV 713

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS---RRFDE----NI-GLPRP 302
           +     F+RVR+EAA AL    S E D+ GL  L+K + +      DE    N+  +P P
Sbjct: 714 VLVENYFYRVRMEAARALVLYNSAEADYMGLFLLLKLFHTFYCEPTDEEEPSNVKAVPLP 773

Query: 303 NDFRDFSEYFVLEAIPHAVAMVR-AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
           NDF +F+ YF+ +++  A++ +R  A  +  R   E +L LL+ NDN+ N YSD  ++A 
Sbjct: 774 NDFSNFANYFLKKSMVTALSDLRDPATRRVWRHVRELLLDLLRMNDNSQNEYSDSHYVAT 833

Query: 362 LVQSVGEL---------EFGQQSILFLSSLLKRI----DRLLQFDRLMPSYNGILTISCI 408
           ++ ++G              ++  +    L K      +R +  DRL+PSY+ ++T++ +
Sbjct: 834 VISAIGSAFSAGVNVGGNLSEEDQVMDGQLFKEASEANERAITVDRLVPSYHNVVTLAGL 893

Query: 409 R 409
           +
Sbjct: 894 Q 894


>gi|367044074|ref|XP_003652417.1| hypothetical protein THITE_161557 [Thielavia terrestris NRRL 8126]
 gi|346999679|gb|AEO66081.1| hypothetical protein THITE_161557 [Thielavia terrestris NRRL 8126]
          Length = 1577

 Score =  110 bits (274), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 134/544 (24%), Positives = 233/544 (42%), Gaps = 80/544 (14%)

Query: 2    LEKQMGSNFFRKILQNIISRAQGASPV--RTLSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            + K  GS    +++  I+S A     +   +LS  EFR    K     RP   E + +WV
Sbjct: 547  MSKVPGSMGIARVISQIVSNANINVTIDSTSLSDAEFRRACEKKSQY-RP--DELWDQWV 603

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAV----LRDCTVKPDSRTPVLSSNTDS--ENRDGDIG 113
               GCP LR+   +NK+   V++++     +D   KP ++   L    +   E   G + 
Sbjct: 604  HGAGCPRLRIKQRFNKKNLNVDISIAQVQTKDVAPKPITKEDFLKELLEEVHEVWAGPVP 663

Query: 114  --WPGMMSIRVHELDGM-YDHPILPMAGD-AWQLLEIQCHSKL----AARRALKPKKGSK 165
              + G  ++RVHE DG  Y+H +     D     L+I  ++K       ++A      + 
Sbjct: 664  KLFTGPFTVRVHEADGTPYEHYLSITDKDKVSTTLQIAYNTKYKRMKRTKKATAAAATNA 723

Query: 166  PDGCDDNGDAVAGLDM-----------------------RSSME-SPLSWIRADPEMEYL 201
             D  D   D +   +M                       ++SM+     WIR D   E+L
Sbjct: 724  ADKHDIQEDDIVYFNMLGDVLMTPKDIEDWGLQDWDEKVQASMDQESYEWIRLDCNFEWL 783

Query: 202  AEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLS-DSKAFWRV 260
             E+  + P  M++ QL++D DVVA   A+            V +T+    + D + +  +
Sbjct: 784  CEMVTDMPGYMYLAQLQQDRDVVAHQDAMLFFRR--GRRHGVASTIETRTAMDRRYYHGI 841

Query: 261  RIEAAYALANTASEETDWAGLLHLV----KFYKSRRFDEN---IGLPRPNDFRDFSEYFV 313
            R  A   L   A    ++ G   L+     F+  R   +N      P PNDF D ++Y V
Sbjct: 842  RTMAIDDLPKQADPALNYIGKAQLILMFRHFFCERMVGKNGIVTSPPMPNDFHDRAQYAV 901

Query: 314  LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV------- 366
              AIP A+A  R  + +  R+A  F+L+LL +NDN  N YSD F++A L+ ++       
Sbjct: 902  QCAIPAAIARTRE-NGRCSRQARNFILELLLFNDNTENEYSDHFYIAKLLDALTTSIIPD 960

Query: 367  ---------GELEFGQ----QSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQ 413
                      +L   +    +   FL   ++ ID+  + D    +Y  I T + +    +
Sbjct: 961  KKDGGRDLSSDLRMDEDDDTEPKAFLEKAVEEIDKYRRMDEWTSTYQNIWTTTALN--CK 1018

Query: 414  IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVE 473
            + L  +G I++D  ++ ++  +D N  + VR++A   L++L        S + L +  + 
Sbjct: 1019 MRLMKAGVIAVDP-IEFVQYLQDGNADF-VRIKAFDCLVELGLLAKA--SIMKLLLACLS 1074

Query: 474  EEPS 477
             +PS
Sbjct: 1075 TDPS 1078


>gi|392578386|gb|EIW71514.1| hypothetical protein TREMEDRAFT_73329 [Tremella mesenterica DSM 1558]
          Length = 2067

 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 77/241 (31%), Positives = 123/241 (51%), Gaps = 18/241 (7%)

Query: 191  WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
            WIR DPE E++A   F +    WI+QL+ D DV+AQ +AI  L   P  S  V + L   
Sbjct: 1051 WIRFDPECEWIATFDFAEKPWYWISQLQGDRDVIAQVEAIQNLTMYP--SPVVASELART 1108

Query: 251  LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS----RRFDENIGLPRPNDFR 306
            +     F+RVR+EAA AL    + E D+ G   LVK Y++       D+    P+ NDF 
Sbjct: 1109 ILVKNYFYRVRMEAAKALTVYNTSECDYIGHFLLVKLYRALFCLPGDDDAPIRPKRNDFS 1168

Query: 307  DFSEYFVLEAIPHAVAMVR-AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
            D ++Y  L +   ++A +R  A   + R+    +L L++YNDN GN + D    A  + +
Sbjct: 1169 DVADYLSLVS---SLADLRDPATRNTRRDVRRLLLDLIRYNDNTGNLFDDSTMRATYITA 1225

Query: 366  VGEL----EFGQQSI--LFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLS 419
            +G       +  + +  +      + +DR +  DRL PS++  +TI+ +  L Q+ + L 
Sbjct: 1226 LGNAFSLPAWNAEEVDRVMFDEANEAVDRAITMDRLAPSFHNSVTIAGL--LAQMKMILG 1283

Query: 420  G 420
            G
Sbjct: 1284 G 1284


>gi|301100182|ref|XP_002899181.1| transcription initiation factor TFIID subunit 2, putative
           [Phytophthora infestans T30-4]
 gi|262104098|gb|EEY62150.1| transcription initiation factor TFIID subunit 2, putative
           [Phytophthora infestans T30-4]
          Length = 1293

 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 108/452 (23%), Positives = 188/452 (41%), Gaps = 87/452 (19%)

Query: 22  AQGASPVRTLSTKEFRHFANKV-GNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIV 80
           A GA+    L+T  F      + G   +   K F   W+   G P   +G+ YN+++   
Sbjct: 563 ANGATATEPLTTLSFLKTVKTIAGAAGQDLTKSFLTSWIIEPGMPFFTVGYWYNRKQTQA 622

Query: 81  ELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDA 140
           E+ + ++  + P  +                  + G ++I + E    Y +         
Sbjct: 623 EVVLQQE--IPPGGKL-----------------YTGPITITIVEDTSEYTYQ--KRIEHK 661

Query: 141 WQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEY 200
               +  CHSK+  +R  + +  + PD     G    GL+     ++P+ W++ D    +
Sbjct: 662 RHKFDFPCHSKVRKKRRRR-QGLADPDDSVSVGGPGMGLN-----DTPVFWVKIDTGCAW 715

Query: 201 LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL--PHLSFNVVN------TLNNFLS 252
           L  +  +QP   W+ QL  D  V ++  A  AL     PH   NV++       ++   +
Sbjct: 716 LRHVVIHQPDFNWMEQLLSDKKVGSRVHAARALALFHRPHEKPNVMSCRVLTECMSGLTT 775

Query: 253 DSKAFWRVRIEAAYALA----------NTASEETDWAGLLHLVKFYKSRRFDENIGLPRP 302
            S+   R+R EAA +L           N  ++   W  + +LV+ +K   FD    +P P
Sbjct: 776 HSR---RLRAEAAQSLGIWQSIHAPLTNANTQLPIWKAMHNLVRIFKEHFFDRATDMPLP 832

Query: 303 NDF--------------------------RDFS--EYFVLEAIPHAVAMVRAADNKSPRE 334
           N F                          +D++  EY + ++IP A+AMVRA + KSP E
Sbjct: 833 NYFLPSGGKVILEALGAQQASISSKQIQVQDYAEGEYEIKKSIPKALAMVRAQNGKSPAE 892

Query: 335 AVEFVLQLLKYNDNNGNPYS---------DVFWLAALVQSVGELEFGQQSIL-FLSSLLK 384
              F+LQLL  NDN+ N            D ++L  +V S+  L   + +      +++ 
Sbjct: 893 IETFLLQLLTENDNSKNYVEMDDQSQVADDCYYLGNIVLSLSVLTLDRPARSDDGDAVVL 952

Query: 385 RIDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
            I R L +D++ PSYN  +T+ C+  L  + L
Sbjct: 953 EILRYLHYDQVQPSYNKTITVCCLEALCNLVL 984


>gi|452823940|gb|EME30946.1| transcription initiation factor TFIID subunit D2 [Galdieria
           sulphuraria]
          Length = 1202

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/402 (25%), Positives = 182/402 (45%), Gaps = 37/402 (9%)

Query: 53  EFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDI 112
           +F  RW  +C    LR G+ Y+ +K  VELAV +            +  + D +++    
Sbjct: 505 DFISRWTRSCDILKLRCGYRYDSKKKFVELAVKQSIITFRSQDGKRIERDLDGDSKVS-- 562

Query: 113 GWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDN 172
            + G ++++V E++G +DH +L M  D   L+E+   ++   +   K  K S  +     
Sbjct: 563 MFKGGLTVQVMEMEGAHDH-LLEMDSDIL-LVELPTSTRRMKQHHRKRSKSSLKESDTVI 620

Query: 173 GDAVAGLDMRSSM--ESPLSWIRADPEMEYLAE-IHFNQPVQMWINQLEKDGDVVAQAQA 229
               A LD    +  + P+ W+R DP+ E++ + + F +  +  I+ L    DV  Q +A
Sbjct: 621 PPTEANLDEEGVIIEKDPVLWVRIDPDQEWILKFVEFRRSERDSISCLRNCRDVWGQLEA 680

Query: 230 IAALEALP--HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKF 287
           +  L +     +S   +  L   L+DS+ +++VR E A  LA   +      GL  L+K 
Sbjct: 681 LETLMSRTKGRVSVTSIRCLEQILTDSRYYFQVRGECARCLAKCYTTNDRLMGLEALLKL 740

Query: 288 YKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAM--------VRAADNKSPREAVEF- 338
            +++ FD++ G    N F D SEYFV  A+  A+A         +   D++       F 
Sbjct: 741 LRNKYFDKH-GAVLSNHFDDISEYFVKCAVIQAIAQAIHPISFPIYGYDSQFSLNLASFQ 799

Query: 339 ---VLQLLKY------------NDNNGNP-YSDVFWLAALVQSVGELEFGQQSILFLSSL 382
              + + L Y            ND    P + D F+L  L+ +        +  +    +
Sbjct: 800 MTHIQESLPYDVLQLLLSLLEQNDTAYPPLFDDCFYLHNLIIACAICSVYAKDQITRHRV 859

Query: 383 LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISL 424
            K++ R L+ +  MPSY+ I+T + ++ LT   L+L GF+ +
Sbjct: 860 YKQLFRYLKREEAMPSYHNIITGASLKGLT--ILQLGGFLRI 899


>gi|390358718|ref|XP_793421.2| PREDICTED: transcription initiation factor TFIID subunit 2-like
           [Strongylocentrotus purpuratus]
          Length = 1402

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 93/328 (28%), Positives = 148/328 (45%), Gaps = 44/328 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTK------EFRHFANKVGNLERPFLKEF 54
           ++E ++G     K+   ++S A  AS  + +S            F   +  +    LK F
Sbjct: 493 LIEIRVGQELLLKVFNKLLSLATSASHQKFMSCAWSNMLISTSGFLKSISTVSGKDLKLF 552

Query: 55  FPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGW 114
             RWV   G    R  F +N+++NIVEL + +D                    R G I +
Sbjct: 553 INRWVCQGGMAQFRGTFHFNRKRNIVELELRQD------------------PGRPGIIKY 594

Query: 115 PGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALK-PKKGSKPDGCDDNG 173
            G + + + ELDG ++H +     D     EI CHSK    +  K P K         NG
Sbjct: 595 VGPLKVTIQELDGSFEHTL--QVEDNLTKGEITCHSKSRRHKKKKIPLK---------NG 643

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           + V     +   +SP+ WIR D EM  L E+ F+QP   W  QL  + DVVAQ +A+  L
Sbjct: 644 EEVDMDLSKMDPDSPVLWIRVDTEMTLLREVIFDQPDYQWQCQLRYERDVVAQLEAVECL 703

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETD-WAGLLHLVKFYK--- 289
           +     S    + L++ +  S  +++VR +AAY  A  ++E  + ++G   L+  +K   
Sbjct: 704 KKFA--SLETTHALSDIILHSGCYYKVRTQAAYCYAQVSNELAETFSGHTALIAMFKKLF 761

Query: 290 SRRFDENIGLPRPNDFRDFSEYFVLEAI 317
           S    E+I   R N+F DF  Y + +A+
Sbjct: 762 SSFSSEDIV--RRNNFMDFRAYLIQKAV 787


>gi|405121726|gb|AFR96494.1| transcription initiation factor TFIId subunit [Cryptococcus
           neoformans var. grubii H99]
          Length = 1728

 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 83/238 (34%), Positives = 125/238 (52%), Gaps = 26/238 (10%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR DP+ E+LA   F +    WI+QL+ D DV+AQ +AI  +   P  S  + + L   
Sbjct: 693 WIRVDPDCEWLAIFEFAEKPWCWISQLQGDRDVIAQLEAIQNMRMYP--SPVIASELART 750

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG----------LP 300
           +     F+RVR+EAA ALA   S E D+ G   ++K ++      + G          LP
Sbjct: 751 VLVKNYFYRVRMEAARALAMYNSSECDYIGYFLIMKLFQVFYCQPSSGNETEPPEIQCLP 810

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFV-LQLLKYNDNNGNPYSDVFWL 359
            PNDF +F++YFV +++  A+A +R    +     V    L LL+ NDN  N YSD F+L
Sbjct: 811 LPNDFSNFADYFVKKSLIAAMADLRDPATRVVWRHVRITLLDLLRLNDNTSNQYSDSFYL 870

Query: 360 AALVQ------SVGELEFGQQSILFLSS--LLKR-----IDRLLQFDRLMPSYNGILT 404
           AA++       +VG    G  S   + S  LL++     +DR +  DRL+PSY+ ++T
Sbjct: 871 AAVITAISNAFTVGSAAQGMLSETEIESEQLLQKEATDVLDRAMTVDRLVPSYHNVVT 928



 Score = 39.3 bits (90), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 18/113 (15%)

Query: 27  PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR 86
           P  TL+T  F     KV  L+   L+ F  +W+   GCP L +  ++ K+K  VE  V +
Sbjct: 491 PGNTLTTSYFIRQCRKVSGLD---LQTFQDQWIFGSGCPRLSIRTNFIKKKFTVEFMVSQ 547

Query: 87  DCTVKPDSRTPVLSSNTDSENRDGDIGW-------PGMMSIRVHELDGM-YDH 131
                  ++  V + N  SE +     W        G +++R+HE DG  ++H
Sbjct: 548 -------TQPAVEAINEMSEKQRKVAVWRRPTPFFEGSLTVRIHEADGAPFEH 593


>gi|134113885|ref|XP_774190.1| hypothetical protein CNBG1720 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256825|gb|EAL19543.1| hypothetical protein CNBG1720 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 1729

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 82/238 (34%), Positives = 126/238 (52%), Gaps = 26/238 (10%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR DP+ E+LA   F +    WI+QL+ D DV+AQ +AI  +   P  S  + + L   
Sbjct: 693 WIRVDPDCEWLAIFEFAEKPWCWISQLQGDRDVIAQLEAIQNMRMYP--SPVIASELART 750

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG----------LP 300
           +     F+RVR+EAA ALA   S E D+ G   ++K +++     + G           P
Sbjct: 751 VLVKNYFYRVRMEAARALAMYNSSECDYIGYFLIMKLFQAFYCQPSSGNETEPPEIQCRP 810

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFV-LQLLKYNDNNGNPYSDVFWL 359
            PNDF +F++YFV +++  A+A +R    ++    V    L LL+ NDN  N YSD F+L
Sbjct: 811 LPNDFSNFADYFVKKSLIAAMADLRDPATRAVWRHVRITLLDLLRLNDNTSNQYSDSFYL 870

Query: 360 AALVQ------SVGELEFGQQSILFLSS--LLKR-----IDRLLQFDRLMPSYNGILT 404
           AA++       +VG    G  S   + S  LL++     +DR +  DRL+PSY+ ++T
Sbjct: 871 AAVITAISNAFTVGSAAQGMLSETEIESEQLLQKEATDVLDRAMTVDRLVPSYHNVVT 928



 Score = 39.7 bits (91), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 18/113 (15%)

Query: 27  PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR 86
           P  TL+T  F     KV  L+   L+ F  +W+   GCP L +  ++ K+K  VE  V +
Sbjct: 491 PGNTLTTSYFLRQCRKVSGLD---LQTFQDQWIFGSGCPRLSVRTNFIKKKFTVEFMVSQ 547

Query: 87  DCTVKPDSRTPVLSSNTDSENRDGDIGW-------PGMMSIRVHELDGM-YDH 131
                  ++  V + N  SE +     W        G +++R+HE DG  ++H
Sbjct: 548 -------TQPAVEAINEMSEKQRKVAVWRRPTPFFEGSLTVRIHEADGAPFEH 593


>gi|321260951|ref|XP_003195195.1| TFIID subunit (150 kDa); Taf2p [Cryptococcus gattii WM276]
 gi|317461668|gb|ADV23408.1| TFIID subunit (150 kDa); Taf2p [Cryptococcus gattii WM276]
          Length = 1729

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 83/238 (34%), Positives = 127/238 (53%), Gaps = 26/238 (10%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR DP+ E+LA   F +    WI+QL+ D DVVAQ +AI  +   P  S  + + L   
Sbjct: 693 WIRVDPDCEWLAVFEFIEKPWCWISQLQGDRDVVAQLEAIQNMRMYP--SPVIASELART 750

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG----------LP 300
           +     F+RVR+EAA ALA   + E D+ G   ++K ++      + G          LP
Sbjct: 751 ILVKNYFYRVRMEAARALAMYNNSECDYIGYFLIMKIFQVFYCQPSSGNETVPTEIRCLP 810

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFV-LQLLKYNDNNGNPYSDVFWL 359
            PNDF +F++YFV +++  A+A +R    ++  + V    L LL+ NDN  N YSD F+L
Sbjct: 811 LPNDFSNFADYFVKKSLIAAMADLRDPATRAVWKHVRVTLLDLLRLNDNTSNLYSDSFYL 870

Query: 360 AALVQ------SVGELEFGQQSILFLSS--LLKR-----IDRLLQFDRLMPSYNGILT 404
           AA++       +VG    G  S   + S  LL++     +DR +  DRL+PSY+ ++T
Sbjct: 871 AAIITAISNAFTVGSAAQGMLSETEIESEQLLQKEANDVLDRAMTVDRLVPSYHNVVT 928


>gi|325184532|emb|CCA19025.1| transcription initiation factor TFIID subunit putati [Albugo
           laibachii Nc14]
          Length = 1236

 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 107/452 (23%), Positives = 188/452 (41%), Gaps = 98/452 (21%)

Query: 31  LSTKEFRHFANKV-GNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCT 89
           L+T  F      + G   +   K F   W+   G P   +G+ YN+++   E+ V ++  
Sbjct: 525 LTTSSFLKTVKAIAGASGQDLTKSFLSMWIVDSGMPFFTVGYWYNRKQTQAEVVVQQE-- 582

Query: 90  VKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCH 149
           V P  +    +          +  +   +  R H+    +D P               CH
Sbjct: 583 VPPGGKLYTGAITVTIVEDTSEYTYQKRIESRRHK----WDFP---------------CH 623

Query: 150 SKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQP 209
           SK+  +R  +       D    NG  + GL+     ++P+ W++ D    +L  I  +QP
Sbjct: 624 SKVRKKRRRRQGLSDLDDSVSVNGPGM-GLN-----DTPVFWVKIDTGQNWLRHIVLHQP 677

Query: 210 VQMWINQLEKDGDVVAQAQAIAALEAL--PHLSFNVVNT------LNNFLSDSKAFWRVR 261
              W+ QL  D  V ++  A  AL     PH   N+++       ++   + S+   R+R
Sbjct: 678 DFNWMEQLLSDKKVGSRVHAARALALFHRPHEKPNIMSCRVLTECMSGLTTHSR---RLR 734

Query: 262 IEAAYAL------------ANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDF---- 305
            EAA +L            ANT+     W  + +L + +K   FD N  +P PN F    
Sbjct: 735 AEAALSLGVWQSIHAPLTVANTSLP--IWKAMHNLFRIFKEHFFDRNTDMPLPNYFVPSG 792

Query: 306 ----------------------RDFS--EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQ 341
                                 +D++  EY + +++P A+AM+RA + KSP E  +F+LQ
Sbjct: 793 GKVMLESLGAQQASLSSKQIQIQDYAAGEYEIKKSVPRALAMIRAQNEKSPPEIEQFLLQ 852

Query: 342 LLKYNDNNGNPY---------SDVFWLAALVQSVGELEFGQQSILFLS--------SLLK 384
           LL  NDN+ N            D F++ ++V S+  L    ++    S         +++
Sbjct: 853 LLCENDNSKNYVKMDDQSQVADDCFYIGSIVLSLSLLNTPAENRRTSSMGKTPEDDRVVQ 912

Query: 385 RIDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
            I R L++D++ PS+  I+T+ C+  L  + L
Sbjct: 913 EILRYLRYDQVQPSHGKIITVCCLEALCNLIL 944


>gi|449329934|gb|AGE96201.1| transcription initiation factor TFIId150kDa subunit
           [Encephalitozoon cuniculi]
          Length = 1137

 Score =  103 bits (256), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 167/378 (44%), Gaps = 68/378 (17%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           +LE  +   F  KIL   + R        +        F   V ++    L+  F  +V 
Sbjct: 395 LLEGNLSRAFMEKILNYTMKRRNDVKEDYSF------EFVRLVKDVTGKDLRSLFETYVF 448

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
             G   +++GF+ NK+ N V+   ++       + T + S        +G+    G ++I
Sbjct: 449 KPGVVRVKLGFTINKKSNRVDFTAMQ-------APTSIFS--------NGNRTVCGPVTI 493

Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
             +E++G+++H  +    + +       H K   ++                        
Sbjct: 494 CSYEMEGVFEHVFMLDRDNHFYY-----HPKTKKKKKS---------------------- 526

Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
                  PL W+RADP+ E+LA+I   QP  M+I QL  D +VV Q +A+ +L   P  S
Sbjct: 527 EEEEEVMPLLWVRADPKGEHLAKIIVEQPDYMFIEQL-LDKNVVGQMEALESLSTKP--S 583

Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
             V   L   L ++  F+++RI   Y L+ T     ++ G   L++F+  +   +   + 
Sbjct: 584 TQVCEILERVLENTHIFYKIRIHILYILSRTTV--GNYYGFQRLIQFFVKKYCVQTSTVV 641

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAAD-----NKSPREA------VEFVLQLLKYNDNN 349
           +PNDF  F  YF+ +   H V  +  AD     + S RE         F++ +LK+NDN+
Sbjct: 642 KPNDF-SFIPYFIQK---HLVRALSIADPFVFRSYSGREVGSANIISAFIINILKFNDNS 697

Query: 350 GNPYSDVFWLAALVQSVG 367
            N YSD ++++++++S+G
Sbjct: 698 FNSYSDGWYISSVIESIG 715


>gi|301092610|ref|XP_002997159.1| transcription initiation factor TFIID subunit, putative
           [Phytophthora infestans T30-4]
 gi|262111546|gb|EEY69598.1| transcription initiation factor TFIID subunit, putative
           [Phytophthora infestans T30-4]
          Length = 1129

 Score =  102 bits (255), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 108/451 (23%), Positives = 190/451 (42%), Gaps = 68/451 (15%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGA------SPVRTLS-TKEFRHFANKVGNLERPFLKE 53
           M+E+++G       +Q  +S   GA       P+ TLS  K  +  A   G   +   K 
Sbjct: 431 MIEQRVGPKHLGIAIQRAVSGGAGAAATTATEPLTTLSFLKTVKTIAGAAG---QDLTKS 487

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F   W+   G P   +G+ YN+++   E+A+ ++  + P  +                  
Sbjct: 488 FLTSWIIEPGMPFFTVGYWYNRKQTQAEVALQQE--IPPGGKL----------------- 528

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G ++I + E    Y +             +  CHSK+  +R  + +  + PD     G
Sbjct: 529 YTGPITITIVEDTSEYTYQ--KRIEHKRHKFDFPCHSKVRKKRRRR-QGLADPDDSVSVG 585

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
               GL+     ++P+ W++ D    +L  +  +QP   W+ QL  D  V ++  A  AL
Sbjct: 586 GPGMGLN-----DTPVFWVKIDTGCAWLRHVVMHQPDFNWMEQLLSDKKVGSRVHAARAL 640

Query: 234 EAL--PHLSFNVVNT------LNNFLSDSKAFWRVRIEAAYALA----------NTASEE 275
                PH   NV++       ++   + S+   R+R EAA +L           N  ++ 
Sbjct: 641 ALFHRPHEKPNVMSCRVLTECMSGLTTHSR---RLRAEAAQSLGIWQSIHAPLTNANTQL 697

Query: 276 TDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA 335
             W  + +LV+ +K   FD    +P PN F       +LEA+    A + +   +SP E 
Sbjct: 698 PIWKAMHNLVRIFKEHFFDRATDMPLPNYFLPSGGKVILEALGAQQASISSKQIQSPAEI 757

Query: 336 VEFVLQLLKYNDNNGNPYS---------DVFWLAALVQSVGELEFGQQSIL-FLSSLLKR 385
             F+LQLL  NDN+ N            D ++L  +V S+  L   + +      +++  
Sbjct: 758 ETFLLQLLTENDNSKNYVEMDDQSQVADDCYYLGNIVLSLSVLTLDRPARSDDGDAVVLE 817

Query: 386 IDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
           I R L +D++ PSYN  +T+ C+  L  + L
Sbjct: 818 ILRYLHYDQVQPSYNKTITVCCLEALCNLVL 848


>gi|348666883|gb|EGZ06709.1| hypothetical protein PHYSODRAFT_530724 [Phytophthora sojae]
          Length = 1140

 Score =  102 bits (254), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 106/453 (23%), Positives = 189/453 (41%), Gaps = 70/453 (15%)

Query: 1   MLEKQMGSNFFRKILQNI-----ISRAQGAS----PVRTLS-TKEFRHFANKVGNLERPF 50
           M+E+++G       +Q         ++ G S    P+ TLS  K  +  A   G   +  
Sbjct: 431 MIEQRVGPKHLSIAIQRANASASTGKSTGTSASTEPLTTLSFLKTVKTIAGAPG---QDL 487

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
            K F   W+   G P   +G+ YN+++   E+ + ++  + P  +               
Sbjct: 488 TKSFLTSWIIEPGMPFFTVGYWYNRKQTQAEVVLQQE--IPPGGKL-------------- 531

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
              + G ++I + E    Y +             +  CHSK+  +R  + +  + PD   
Sbjct: 532 ---YTGPITITIVEDTSEYTYQ--KRIEHKRHKFDFPCHSKVRKKRRKR-QGLADPDDSV 585

Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
             G    GL+     ++P+ W++ D    +L  +  +QP   W+ QL  D  V ++  A 
Sbjct: 586 SVGGPGMGLN-----DTPVFWVKIDTGCAWLRHVVMHQPDFNWMEQLLSDKKVGSRVHAA 640

Query: 231 AALEAL--PHLSFNVVNT------LNNFLSDSKAFWRVRIEAAYALA----------NTA 272
            AL     PH   NV++       ++   + S+   R+R EAA +L           N  
Sbjct: 641 RALALFHRPHEKPNVMSCRVLTECMSGLTTHSR---RLRAEAAQSLGIWQSIHAPLTNAN 697

Query: 273 SEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSP 332
           ++   W  + +LV+ +K   FD    +P PN F       +LEA+    A + +   +SP
Sbjct: 698 TQLPIWKAMHNLVRIFKEHFFDRATDMPLPNYFLPSGGKVILEALGAQQASISSKQIQSP 757

Query: 333 REAVEFVLQLLKYNDNNGNPYS---------DVFWLAALVQSVGELEFGQQSILFLSSLL 383
            E   F+LQLL  NDN+ N            D +++  LV S+  L   + +      ++
Sbjct: 758 PEIETFLLQLLTENDNSKNYVEMDDQSQVADDCYYIGNLVLSLSVLTVDRPARSEEGDVV 817

Query: 384 KRIDRLLQFDRLMPSYNGILTISCIRTLTQIAL 416
           + I R L +D++ PSYN  +T+ C+  L  + L
Sbjct: 818 REILRYLHYDQVQPSYNKTITVCCLEALCNLVL 850


>gi|320586964|gb|EFW99627.1| transcription initiation factor tfiid subunit tsm1 [Grosmannia
            clavigera kw1407]
          Length = 1826

 Score =  101 bits (252), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 105/438 (23%), Positives = 181/438 (41%), Gaps = 99/438 (22%)

Query: 13   KILQNIISRAQ----GASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLR 68
            +++  IIS+A     G +    +S++ FR    K    +   L+ F+ +WV   GCP L 
Sbjct: 602  RVISKIISKANTSGWGDNNNEIISSETFRRVCEKTSQYQ---LEAFWNQWVLGSGCPWLS 658

Query: 69   MGFSYNKRKNIVELAVLRD---CTVKP------DSRTPVLSSNTDSENRDGDIGWPGMMS 119
            +   +N+++  VE+++L+     + KP      D    ++ +   + +R     + G M+
Sbjct: 659  IKQRFNRKRLCVEISILQTQEAGSAKPQPIRHEDFWRELVETKYGTYSRKAQRLFTGPMT 718

Query: 120  IRVHELDGM-YDHPILPMAGDA----WQLLEIQCHSKLA--------------------- 153
            IR+HE +G  Y+H I+ +  DA        +I  ++K                       
Sbjct: 719  IRIHEANGTPYEH-IVEIREDAAFTGGAKFDIPYNTKYKRLKRNRRHRERAAAAAAADAN 777

Query: 154  ------------ARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYL 201
                          R   P+  +K D  D        +D  S       WIR D + E+L
Sbjct: 778  GEHSEEMIVFSLGDRLDSPEDMAKWDLIDWTEQDEKNMDQES-----YEWIRIDSDFEWL 832

Query: 202  AEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVR 261
             ++H N P  M+++QL++D DV AQ   +  L  +   +      L   L D + F  +R
Sbjct: 833  VKMHTNMPSWMYLSQLQQDRDVAAQQDTLLYLSRIREHAM-AATILTRTLYDERYFHGLR 891

Query: 262  IEAAYALANTASEETD------------------------------------WAGLLHLV 285
              AA  L+  A + +D                                    + G+  L+
Sbjct: 892  TMAASVLSRQAWKASDSHADADGAENGGQETEGSRNGGAGGNAGSSGGDTVVFRGMTQLM 951

Query: 286  KFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKY 345
            + ++   F       RPNDF D   Y V  A+  A++  +  D+K PREA  F+L LL++
Sbjct: 952  RVFR-EFFCYPDATSRPNDFSDHQRYLVQCAMIEAISKAQ-YDDKHPREAQRFLLDLLRF 1009

Query: 346  NDNNGNPYSDVFWLAALV 363
            NDN+ N +SD  ++A L+
Sbjct: 1010 NDNSQNSFSDYIYVAKLL 1027


>gi|85014131|ref|XP_955561.1| hypothetical protein ECU09_0090 [Encephalitozoon cuniculi GB-M1]
 gi|19171255|emb|CAD26980.1| TRANSCRIPTION INITIATION FACTOR TFIID 150kDa SUBUNIT
           [Encephalitozoon cuniculi GB-M1]
          Length = 1137

 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 91/378 (24%), Positives = 166/378 (43%), Gaps = 68/378 (17%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           +LE  +   F  KIL   + R        +        F   V ++    L+  F  +V 
Sbjct: 395 LLEGNLSRAFMEKILNYTMKRRNDVKEDYSF------EFVRLVKDVTGKDLRSLFETYVF 448

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
             G   +++GF+ NK+ N V+   ++       + T + S        +G+    G ++I
Sbjct: 449 KPGVVRVKLGFTINKKSNRVDFTAMQ-------APTSIFS--------NGNRTVCGPVTI 493

Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
             +E++G+++H  +    + +       H K   ++                        
Sbjct: 494 CSYEMEGVFEHVFMLDRDNHFYY-----HPKTKKKKKS---------------------- 526

Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
                  PL W+RADP+ E+LA+I   QP  M+I QL  D +VV Q +A+ +L   P  S
Sbjct: 527 EEEEEVMPLLWVRADPKGEHLAKIIVEQPDYMFIEQL-LDKNVVGQMEALESLSTKP--S 583

Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
             V   L   L ++  F+++RI   Y L+ T     ++ G   L++F+  +   +   + 
Sbjct: 584 TQVCEILERVLENTHIFYKIRIHILYILSRTTV--GNYYGFQRLIQFFVKKYCVQTSTVV 641

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAAD-----NKSPREA------VEFVLQLLKYNDNN 349
           +PNDF  F  YF+ +   H V  +  AD     + S RE         F++ +LK+NDN+
Sbjct: 642 KPNDF-SFIPYFIQK---HLVRALSIADPFVFRSYSGREVGSANIISAFIINILKFNDNS 697

Query: 350 GNPYSDVFWLAALVQSVG 367
            N YSD ++++++++S+ 
Sbjct: 698 FNSYSDGWYISSVIESIS 715


>gi|402468467|gb|EJW03623.1| hypothetical protein EDEG_02054 [Edhazardia aedis USNM 41457]
          Length = 2793

 Score =  100 bits (248), Expect = 4e-18,   Method: Composition-based stats.
 Identities = 84/307 (27%), Positives = 143/307 (46%), Gaps = 33/307 (10%)

Query: 187  SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNT 246
            +PL +IR D   E + +++  QP  M+I QL  D +V++Q +AI  L   P  S NV++ 
Sbjct: 1136 NPLLFIRIDAGNENIVKVNLQQPDFMYIEQLNTDKNVISQIEAIQVLAKNP--SENVLHI 1193

Query: 247  LNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF-DENIGLPRPNDF 305
                L+DS+ F+RVRI     L+    E+     + H +  +  RRF      +P+PNDF
Sbjct: 1194 FEKVLNDSRVFYRVRILLIQTLSKMLFEDK---CIYHQIIHFFIRRFCIHGSTIPKPNDF 1250

Query: 306  RDFSEYFVLEAI--------PHAVAMVRAADNKSPREAVEFVLQLLKYNDNNG------- 350
              FS YF+ +A         P  V  + A D K+      F L L+KYN  +        
Sbjct: 1251 TSFSSYFIQKATVKSLAESDPEIVRNINAKDLKTKNIVSAFFLNLIKYNSVDEFNYMEPN 1310

Query: 351  ----NPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTIS 406
                N   + F++A +++++  ++   +S   + + +K+++     D L PS+  I+TIS
Sbjct: 1311 KFTINSVGNSFYIADIIKAL-SIQLSSKSSSDIQAYIKQLEICRLKDLLFPSFQNIVTIS 1369

Query: 407  CIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALS 466
             +   +++       + +D +VK   P         VR  A  AL+    H   ID+  +
Sbjct: 1370 VLNAYSRLCFYGHLEVDVDLLVKYCSP----KHFVGVREAALEALIVNYIH---IDNVFA 1422

Query: 467  LFIKSVE 473
              +K  E
Sbjct: 1423 FIVKFYE 1429


>gi|401828341|ref|XP_003887884.1| hypothetical protein EHEL_090070 [Encephalitozoon hellem ATCC
           50504]
 gi|392998892|gb|AFM98903.1| hypothetical protein EHEL_090070 [Encephalitozoon hellem ATCC
           50504]
          Length = 1129

 Score = 99.0 bits (245), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 96/378 (25%), Positives = 164/378 (43%), Gaps = 68/378 (17%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           +LE  +   F  KIL   I +    S VR   + EF      V   +   L+  F  +V 
Sbjct: 395 LLEGNLSRAFMEKILNYTIKKK---SDVRENYSFEFVKLVKDVTGKD---LRSLFETYVF 448

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
             G   +R+GF+ NK+ N V+            S+ P       S   +G+    G ++I
Sbjct: 449 RPGVVKVRLGFTINKKNNRVDFTA---------SQIPT------SIFSNGNKTISGPVTI 493

Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
             +E++G+++H  +         L+ + H          PK                   
Sbjct: 494 CSYEMEGIFEHVFM---------LDRENHF------YYHPKT------------KKKKKS 526

Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
                  PL WIRAD + E+LA+I   QP  M+I QL  D +++ Q +A+  L   P  S
Sbjct: 527 EEEEEVMPLLWIRADTKGEHLAKIVIEQPDYMFIEQL-LDKNIIGQMEALENLSIKP--S 583

Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
            +V       L ++  F+++RI   Y L+ T      + G   L++++  +   +   + 
Sbjct: 584 THVCEIFERMLENTHIFYKIRIHILYILSRTIV--GSYYGFQRLIQYFVKKYCVQTSTIV 641

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAAD-----NKSPREA------VEFVLQLLKYNDNN 349
           +PNDF  F  YF+ +   H V  +  AD     N S RE         F++ +LK+NDN+
Sbjct: 642 KPNDF-SFIPYFIQK---HLVKALSYADPFVFKNYSGREVGSASIISAFIINILKFNDNS 697

Query: 350 GNPYSDVFWLAALVQSVG 367
            N YSD ++++++++S+ 
Sbjct: 698 FNSYSDGWYISSVIESLS 715


>gi|58269688|ref|XP_572000.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|57228236|gb|AAW44693.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 1711

 Score = 99.0 bits (245), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 74/237 (31%), Positives = 120/237 (50%), Gaps = 42/237 (17%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR DP+ E+LA   F +    WI+QL+ D DV+AQ +AI  +   P  S  + + L   
Sbjct: 693 WIRVDPDCEWLAIFEFAEKPWCWISQLQGDRDVIAQLEAIQNMRMYP--SPVIASELART 750

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIG----------LP 300
           +     F+RVR+EAA ALA   S E D+ G   ++K +++     + G           P
Sbjct: 751 VLVKNYFYRVRMEAARALAMYNSSECDYIGYFLIMKLFQAFYCQPSSGNETEPPEIQCRP 810

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
            PNDF +F++YFV +++  A+A +R    ++          + ++       YSD F+LA
Sbjct: 811 LPNDFSNFADYFVKKSLIAAMADLRDPATRA----------VWRH-------YSDSFYLA 853

Query: 361 ALVQ------SVGELEFGQQSILFLSS--LLKR-----IDRLLQFDRLMPSYNGILT 404
           A++       +VG    G  S   + S  LL++     +DR +  DRL+PSY+ ++T
Sbjct: 854 AVITAISNAFTVGSAAQGMLSETEIESEQLLQKEATDVLDRAMTVDRLVPSYHNVVT 910



 Score = 39.7 bits (91), Expect = 6.4,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 52/113 (46%), Gaps = 18/113 (15%)

Query: 27  PVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLR 86
           P  TL+T  F     KV  L+   L+ F  +W+   GCP L +  ++ K+K  VE  V +
Sbjct: 491 PGNTLTTSYFLRQCRKVSGLD---LQTFQDQWIFGSGCPRLSVRTNFIKKKFTVEFMVSQ 547

Query: 87  DCTVKPDSRTPVLSSNTDSENRDGDIGW-------PGMMSIRVHELDGM-YDH 131
                  ++  V + N  SE +     W        G +++R+HE DG  ++H
Sbjct: 548 -------TQPAVEAINEMSEKQRKVAVWRRPTPFFEGSLTVRIHEADGAPFEH 593


>gi|336276159|ref|XP_003352833.1| hypothetical protein SMAC_04947 [Sordaria macrospora k-hell]
 gi|380092951|emb|CCC09188.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1837

 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 125/561 (22%), Positives = 233/561 (41%), Gaps = 98/561 (17%)

Query: 2    LEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVGT 61
            + K  GS    ++L  ++S A  ++     ++  F+ F  +     +    E + ++V  
Sbjct: 545  MSKIPGSTGVVRVLSQMVSTANISNTDPKATSLSFQDFRRQCEKRSQYVPDELWDQYVFA 604

Query: 62   CGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSE-----------NRDG 110
             GCP L +   +NK+   V++ +L+        + PV  S +  E              G
Sbjct: 605  AGCPKLDIHQKFNKKNLNVDIQILQ-------YQGPVKKSISKDEFWRELQEEIHNVYAG 657

Query: 111  DIG--WPGMMSIRVHELDGM-YDHPILPMAGDAWQL-LEIQCHSKLAARRALKPKK---- 162
            D+   + G  ++R+HE DG  Y+H  L    D     L+I  ++K   +R  + KK    
Sbjct: 658  DVPKLFTGPFTVRIHEADGTPYEHYQLITEKDKGGTNLQIAYNTKY--KRLKRTKKAMAA 715

Query: 163  --GSKPDGCDDNGDAVAGLDM-------RSSME-----------------SPLSWIRADP 196
               S  D  +   D V   +M       +  ME                     WIR D 
Sbjct: 716  AANSNSDKHEIQEDDVVYFNMLGDVLTSQKDMEDWGLQDWAQEVQTKMDQESYEWIRLDA 775

Query: 197  EMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP-HLSFNVVNTLNNFLSDSK 255
             ME+L  +  +    M++ QL++D DVVA   A+ A +    H   + V T    + D +
Sbjct: 776  NMEWLCFMKTDMQEYMYLAQLQQDRDVVAHQDAMLAFKREKRHPVHSTVETRT--VMDKR 833

Query: 256  AFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP----------RPNDF 305
             +  VR+ A   LA  A  + ++ G++HL+  ++ + F + + +P           PNDF
Sbjct: 834  YYHGVRVMAVEDLAKQAHPDLNYIGMVHLILCFR-KFFCKKVPIPGKKDAYNYPIAPNDF 892

Query: 306  RDFSEYFVLEAIPHAVAMVRAAD----NKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
             D  +Y +  AIP  +A  R  D     +  + A +F++++L +NDN+ N +SD F++A 
Sbjct: 893  TDKGQYAIQCAIPAVLA--RTTDLQEKGRCSKRARQFIMEMLMFNDNSENEFSDQFYIAK 950

Query: 362  LVQSVGELEFGQQSIL--------------------FLSSLLKRIDRLLQFDRLMPSYNG 401
            L+ ++      Q+ I                     F+   ++ ID+  + D    +Y  
Sbjct: 951  LLDALTTSLIPQKEITEGYLIKNLDPFDDDDAEFKSFIEKTIEEIDKFRRMDEWTLTYQN 1010

Query: 402  ILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGI 461
            I T + +    ++   +   +  +  +  ++  +D N I  VR++A   L++L       
Sbjct: 1011 IWTTTALHCKMRL---MKARVIPESALDFVQYLQDDN-IDLVRIKAFECLVELGMISKAP 1066

Query: 462  DSALSLFIKSVEEEPSLRGQV 482
               L L   S +  P +R ++
Sbjct: 1067 ILKLLLSYMSTDSSPFVRDRL 1087


>gi|303390535|ref|XP_003073498.1| hypothetical protein Eint_090080 [Encephalitozoon intestinalis ATCC
           50506]
 gi|303302645|gb|ADM12138.1| hypothetical protein Eint_090080 [Encephalitozoon intestinalis ATCC
           50506]
          Length = 1134

 Score = 97.8 bits (242), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 98/378 (25%), Positives = 162/378 (42%), Gaps = 68/378 (17%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           +LE  +   F  KIL   I R      V+   + EF      V   +  FL   F  +V 
Sbjct: 395 LLEGNLSRAFMEKILNYTIKRK---GDVKEDYSFEFVKLVKDVTGKDLRFL---FETYVF 448

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
             G   +R+GF+ NK+ N V+             +TP       S   +G+    G ++I
Sbjct: 449 KPGVVKVRLGFTINKKSNRVDFTA---------EQTPT------SIFSNGNKTICGPVTI 493

Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
             +E++G+++H  +         L+ + H          PK                   
Sbjct: 494 CSYEMEGIFEHVFM---------LDRENHF------YYHPKT------------KKKKKS 526

Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
                  PL WIRAD + E+LA I   QP  M+I QL  D +VV Q +A+  L   P  S
Sbjct: 527 EEEEEVMPLLWIRADIKGEHLARIVIEQPDYMFIEQL-LDKNVVGQMEALENLSIKP--S 583

Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
             V   L   L ++  F+++R+   Y L+ T      + G   L++++  +   +   + 
Sbjct: 584 VQVCEILERMLENTHIFYKIRVHILYILSRTTI--GSYYGFQRLIQYFVKKYCVQTSTIV 641

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAAD-----NKSPREA------VEFVLQLLKYNDNN 349
           +PNDF  F  YF+ +   H V  +  AD     N S RE         F++ +LK+NDN+
Sbjct: 642 KPNDF-SFIPYFIQK---HLVRALSLADPFIFRNYSGREVGSASIISAFIINILKFNDNS 697

Query: 350 GNPYSDVFWLAALVQSVG 367
            N YSD ++++++++S+ 
Sbjct: 698 FNSYSDGWYISSVIESLS 715


>gi|350297324|gb|EGZ78301.1| hypothetical protein NEUTE2DRAFT_80725 [Neurospora tetrasperma FGSC
            2509]
          Length = 1870

 Score = 96.7 bits (239), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 123/529 (23%), Positives = 226/529 (42%), Gaps = 90/529 (17%)

Query: 2    LEKQMGSNFFRKILQNIISRA--QGASP-VRTLSTKEFRHFANKVGNLERPFLKEFFPRW 58
            + K  GS    ++L  ++S A      P   TLS ++FR +  K          E + ++
Sbjct: 545  MSKIPGSTGVVRVLSQMVSAANISNTDPKATTLSFQDFRRWCEKRSQY---VPDELWDQY 601

Query: 59   VGTCGCPVLRMGFSYNKRKNIVELAVLR--DCTVKPDSRTPVLSSNTDSENR--DGDIG- 113
            V   GCP L +   +NK+   V++ +L+    T K  S+        +  ++   GD+  
Sbjct: 602  VFGAGCPRLDIHQKFNKKNLNVDIQILQYQGPTKKSISKDEFWRELQEEIHQVYAGDVPK 661

Query: 114  -WPGMMSIRVHELDGM-YDHPILPMAGDAWQL-LEIQCHSKLAARRALKPKK------GS 164
             + G  ++R+HE DG  Y+H  L    D     L+I  ++K   +R  + KK       S
Sbjct: 662  LFTGPFTVRIHEADGTPYEHYQLITEKDKGGTNLQIAYNTKY--KRLKRTKKAMAAAANS 719

Query: 165  KPDGCDDNGDAVAGLDM-------RSSME-----------------SPLSWIRADPEMEY 200
              D  +   D V   +M       +  ME                     WIR D  ME+
Sbjct: 720  NSDKHEIQEDDVVYFNMLGDVLTSQKDMEDWGLQDWASEVQTKMDQESYEWIRLDANMEW 779

Query: 201  LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP-HLSFNVVNTLNNFLSDSKAFWR 259
            L  +  +    M++ QL++D DVVA   A+ A +    H   + + T    + D + +  
Sbjct: 780  LCFMKTDMQEYMYLAQLQQDRDVVAHQDAMLAFKREKRHAVHSTIETRT--VMDRRYYHG 837

Query: 260  VRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP----------RPNDFRDFS 309
            VR+ A   LA  A  + ++ G++HL+  ++ + F + + +P           PNDF D  
Sbjct: 838  VRVMALEDLAKQAHPDLNYIGMVHLILCFR-KFFCKKVPIPGREGGYHYPIAPNDFTDKG 896

Query: 310  EYFVLEAIPHAVAMVRAAD----NKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
            +Y +  AIP  +A  R  D     +  + A +F++++L +NDN+ N +SD F++A L+ +
Sbjct: 897  QYAIQCAIPAVLA--RTTDLQEKGRCSKRARQFIMEMLLFNDNSENEFSDQFYIAKLLDA 954

Query: 366  VGELEFGQQSIL--------------------FLSSLLKRIDRLLQFDRLMPSYNGILTI 405
            +      Q+ I                     F+   ++ ID+  + D    +Y  I T 
Sbjct: 955  LTTSVIPQKEITEGYLIKNLDPFDEDDAEFKSFIEKTIEEIDKFRRMDEWTLTYQNIWTT 1014

Query: 406  SCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            + +    ++   +   +  +  +  ++  +D N I  VR++A   L++L
Sbjct: 1015 TALHCKMRL---MKARVIPESALDFVQYLQDDN-IDLVRIKAFECLVEL 1059


>gi|336463459|gb|EGO51699.1| hypothetical protein NEUTE1DRAFT_70654 [Neurospora tetrasperma FGSC
            2508]
          Length = 1870

 Score = 96.3 bits (238), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 123/529 (23%), Positives = 226/529 (42%), Gaps = 90/529 (17%)

Query: 2    LEKQMGSNFFRKILQNIISRA--QGASP-VRTLSTKEFRHFANKVGNLERPFLKEFFPRW 58
            + K  GS    ++L  ++S A      P   TLS ++FR +  K          E + ++
Sbjct: 545  MSKIPGSTGVVRVLSQMVSAANISNTDPKATTLSFQDFRRWCEKRSQY---VPDELWDQY 601

Query: 59   VGTCGCPVLRMGFSYNKRKNIVELAVLR--DCTVKPDSRTPVLSSNTDSENR--DGDIG- 113
            V   GCP L +   +NK+   V++ +L+    T K  S+        +  ++   GD+  
Sbjct: 602  VFGAGCPRLDIHQKFNKKNLNVDIQILQYQGPTKKSISKDEFWRELQEEIHQVYAGDVPK 661

Query: 114  -WPGMMSIRVHELDGM-YDHPILPMAGDAWQL-LEIQCHSKLAARRALKPKK------GS 164
             + G  ++R+HE DG  Y+H  L    D     L+I  ++K   +R  + KK       S
Sbjct: 662  LFTGPFTVRIHEADGTPYEHYQLITEKDKGGTNLQIAYNTKY--KRLKRTKKAMAAAANS 719

Query: 165  KPDGCDDNGDAVAGLDM-------RSSME-----------------SPLSWIRADPEMEY 200
              D  +   D V   +M       +  ME                     WIR D  ME+
Sbjct: 720  NSDKHEIQEDDVVYFNMLGDVLTSQKDMEDWGLQDWASEVQTKMDQESYEWIRLDANMEW 779

Query: 201  LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP-HLSFNVVNTLNNFLSDSKAFWR 259
            L  +  +    M++ QL++D DVVA   A+ A +    H   + + T    + D + +  
Sbjct: 780  LCFMKTDMQEYMYLAQLQQDRDVVAHQDAMLAFKREKRHAVHSTIETRT--VMDRRYYHG 837

Query: 260  VRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP----------RPNDFRDFS 309
            VR+ A   LA  A  + ++ G++HL+  ++ + F + + +P           PNDF D  
Sbjct: 838  VRVMALEDLAKQAHPDLNYIGMVHLILCFR-KFFCKKVPIPGREGGYHYPIAPNDFTDKG 896

Query: 310  EYFVLEAIPHAVAMVRAAD----NKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
            +Y +  AIP  +A  R  D     +  + A +F++++L +NDN+ N +SD F++A L+ +
Sbjct: 897  QYAIQCAIPAVLA--RTTDLQEKGRCSKRARQFIMEMLLFNDNSENEFSDQFYIAKLLDA 954

Query: 366  VGELEFGQQSIL--------------------FLSSLLKRIDRLLQFDRLMPSYNGILTI 405
            +      Q+ I                     F+   ++ ID+  + D    +Y  I T 
Sbjct: 955  LTTSVIPQKEITEGYLIKNLDPFDEDDAEFKSFIEKTIEEIDKFRRMDEWTLTYQNIWTT 1014

Query: 406  SCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            + +    ++   +   +  +  +  ++  +D N I  VR++A   L++L
Sbjct: 1015 TALHCKMRL---MKARVIPESALDFVQYLQDDN-IDLVRIKAFECLVEL 1059


>gi|159471369|ref|XP_001693829.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283332|gb|EDP09083.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 758

 Score = 96.3 bits (238), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 73/235 (31%), Positives = 114/235 (48%), Gaps = 18/235 (7%)

Query: 188 PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
           P+ ++R DP  E+L E+   Q   MW  QL    DV+  + A          S +V+  L
Sbjct: 445 PVQYLRVDPLQEWLCEVVVLQGEAMWAAQLRSSKDVLESSGAAD--------SSSVIAAL 496

Query: 248 NNFLSDSKAFWRVRIEA-AYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFR 306
              +   + + RVR+EA     A +AS  T  A    L  FY   R D    + RPN + 
Sbjct: 497 LEAVRGERLYCRVRVEAARALAAFSASMSTQPAAPSLLSYFYDCCR-DPATAVLRPNYWS 555

Query: 307 DFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV 366
           D  ++ VL+AIP  +A +R     S  EA++ V  +L  NDN+ N Y D  +L+A++++ 
Sbjct: 556 DLGQHLVLQAIPACLASIRDVSFNSTPEALQLVTDVLSNNDNSENLYDDASYLSAVIEAA 615

Query: 367 GEL---EFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
           G L     GQ     L  LL  +D  +  + L PS+N  +  +C+  +T +AL +
Sbjct: 616 GLLRPPHAGQ-----LCRLLGLLDDYMLRECLQPSHNYAVGCACLAAMTSLALSV 665


>gi|85111588|ref|XP_964008.1| hypothetical protein NCU02052 [Neurospora crassa OR74A]
 gi|28925766|gb|EAA34772.1| predicted protein [Neurospora crassa OR74A]
 gi|38566902|emb|CAE76206.1| related to TSM1-component of TFIID complex [Neurospora crassa]
          Length = 1870

 Score = 95.9 bits (237), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 123/529 (23%), Positives = 226/529 (42%), Gaps = 90/529 (17%)

Query: 2    LEKQMGSNFFRKILQNIISRA--QGASP-VRTLSTKEFRHFANKVGNLERPFLKEFFPRW 58
            + K  GS    ++L  ++S A      P   TLS ++FR +  K          E + ++
Sbjct: 545  MSKIPGSTGVVRVLSQMVSAANISNTDPKATTLSFQDFRRWCEKRSQY---VPDELWNQY 601

Query: 59   VGTCGCPVLRMGFSYNKRKNIVELAVLR--DCTVKPDSRTPVLSSNTDSENR--DGDIG- 113
            V   GCP L +   +NK+   V++ +L+    T K  S+        +  ++   GD+  
Sbjct: 602  VFGAGCPRLDIHQKFNKKNLNVDIQILQYQGPTKKSISKDEFWRELQEEIHQVYAGDVPK 661

Query: 114  -WPGMMSIRVHELDGM-YDHPILPMAGDAWQL-LEIQCHSKLAARRALKPKK------GS 164
             + G  ++R+HE DG  Y+H  L    D     L+I  ++K   +R  + KK       S
Sbjct: 662  LFTGPFTVRIHEADGTPYEHYQLITEKDKGGTNLQIAYNTKY--KRLKRTKKAMAAAANS 719

Query: 165  KPDGCDDNGDAVAGLDM-------RSSME-----------------SPLSWIRADPEMEY 200
              D  +   D V   +M       +  ME                     WIR D  ME+
Sbjct: 720  NSDKHEIQEDDVVYFNMLGDVLTSQKDMEDWGLQDWASEVQTKMDQESYEWIRLDANMEW 779

Query: 201  LAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP-HLSFNVVNTLNNFLSDSKAFWR 259
            L  +  +    M++ QL++D DVVA   A+ A +    H   + + T    + D + +  
Sbjct: 780  LCFMKTDMQEYMYLAQLQQDRDVVAHQDAMLAFKREKRHAVHSTIETRT--VMDRRYYHG 837

Query: 260  VRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP----------RPNDFRDFS 309
            VR+ A   LA  A  + ++ G++HL+  ++ + F + + +P           PNDF D  
Sbjct: 838  VRVMALEDLAKQAHPDLNYIGMVHLILCFR-KFFCKKVPIPGREGGYHYPIAPNDFTDKG 896

Query: 310  EYFVLEAIPHAVAMVRAAD----NKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS 365
            +Y +  AIP  +A  R  D     +  + A +F++++L +NDN+ N +SD F++A L+ +
Sbjct: 897  QYAIQCAIPAVLA--RTTDLQEKGRCSKRARQFIMEMLLFNDNSENEFSDQFYIAKLLDA 954

Query: 366  VGELEFGQQSIL--------------------FLSSLLKRIDRLLQFDRLMPSYNGILTI 405
            +      Q+ I                     F+   ++ ID+  + D    +Y  I T 
Sbjct: 955  LTTSVIPQKEITEGYLIKNLDPFDEDDAEFKSFIEKTIEEIDKFRRMDEWTLTYQNIWTT 1014

Query: 406  SCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            + +    ++   +   +  +  +  ++  +D N I  VR++A   L++L
Sbjct: 1015 TALHCKMRL---MKARVIPESALDFVQYLQDDN-IDLVRIKAFECLVEL 1059


>gi|396082012|gb|AFN83625.1| hypothetical protein EROM_090070 [Encephalitozoon romaleae SJ-2008]
          Length = 1131

 Score = 95.1 bits (235), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 92/375 (24%), Positives = 160/375 (42%), Gaps = 68/375 (18%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           +LE  +   F  KIL   I +        +        F   V ++    L+  F  +V 
Sbjct: 395 LLEGNLSRAFMEKILNYTIKKKGDVKENYSF------EFVKLVKDVTGKDLRSLFETYVF 448

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSI 120
             G   +R+GF+ NK+ N V+           D     + SN       G+    G ++I
Sbjct: 449 RPGVVKVRLGFTINKKSNRVDFT--------SDQIPTSIFSN-------GNKVISGPVTI 493

Query: 121 RVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLD 180
             +E++G+++H  +         L+ + H          PK                   
Sbjct: 494 CSYEMEGIFEHVFM---------LDRENHF------YYHPKT------------KKKKKS 526

Query: 181 MRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLS 240
                  PL WIRAD + E+LA+I   QP  M+I QL  D +++ Q +A+  L   P  S
Sbjct: 527 EEEEEVMPLLWIRADTKGEHLAKIVVEQPDYMFIEQL-LDKNIIGQMEALENLSIKP--S 583

Query: 241 FNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLP 300
             V   L   L ++  F+++RI   Y L+ T     ++ G   L++++  +   +   + 
Sbjct: 584 TQVCEILERMLENTHIFYKIRIHILYILSRTIV--GNYYGFQRLIQYFVKKYCVQTSTIV 641

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAAD-----NKSPREA------VEFVLQLLKYNDNN 349
           +PNDF  F  YF+ +   H V  +  AD     + S RE         F++ +LK+NDN+
Sbjct: 642 KPNDF-SFIPYFIQK---HLVKALSYADPFVFKSYSGREVGSASIISAFIINILKFNDNS 697

Query: 350 GNPYSDVFWLAALVQ 364
            NPYSD ++++++++
Sbjct: 698 FNPYSDGWYISSVIE 712


>gi|339248861|ref|XP_003373418.1| transcription initiation factor TFIID subunit 2 [Trichinella
           spiralis]
 gi|316970443|gb|EFV54377.1| transcription initiation factor TFIID subunit 2 [Trichinella
           spiralis]
          Length = 1125

 Score = 94.4 bits (233), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 92/337 (27%), Positives = 154/337 (45%), Gaps = 45/337 (13%)

Query: 145 EIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRS-SMESPLSWIRADPEMEYLAE 203
           E  CHSK  ARR  K K    P    D  D    +D+ +   ESP+ WI  DP +  + +
Sbjct: 474 EFSCHSK--ARRHRKRKI---PIWTGDEID----MDLSAMDSESPVLWILVDPRLNTIRK 524

Query: 204 IHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL-SFNVVNTLNNFLSDSKAFWRVRI 262
           ++ +QP   W  QL  +  +VAQ   + AL+ LP + + +    L+  +  S+ ++RVR 
Sbjct: 525 VNIDQPDYSWHYQLRYERIIVAQ---LDALKGLPKMRTVHTQAALSEIIESSQCYYRVRC 581

Query: 263 EAAYALANTASEETDWAG-LLHLVKFYKSRRFDENI-GLPRPNDFRDFSEYFVLEAIPHA 320
            A +ALA+ +S  T        LV+ ++     +N   L +PN+F +F  Y++ +A+  A
Sbjct: 582 AACFALASMSSANTAVCNPHAVLVQLFRKLFCSQNYPTLAKPNNFTNFQLYYLKKALLLA 641

Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGEL----------- 369
           ++ ++   +    + ++FV  L  YN N+GN YSD  + A +++++G +           
Sbjct: 642 ISQLKTVHDTISTDVLQFVYHLSYYNTNSGNRYSDDAYRATIIKAIGNMVQPFAVLNSDA 701

Query: 370 ----------EFGQQSIL----FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIA 415
                     +F Q         L  L K I   L  D   PSY   +T++ + +L +  
Sbjct: 702 MEFSIRHFVDQFAQHDAYRARHLLMKLFKEIIISLALDAKFPSYRDEITVAALHSLAE-- 759

Query: 416 LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALL 452
            +    I  D   +L   + DFN    VR  A  A L
Sbjct: 760 FQYWKLIDCDN--RLWLSYLDFNRPPIVRRAAFEAFL 794


>gi|407040812|gb|EKE40341.1| hypothetical protein ENU1_093580 [Entamoeba nuttalli P19]
          Length = 946

 Score = 93.2 bits (230), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 116/487 (23%), Positives = 208/487 (42%), Gaps = 91/487 (18%)

Query: 37  RHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRT 96
           R F  ++       +K+F  R V   G   + + F++ +R            T++ D + 
Sbjct: 450 RQFVKELQTQNSSLMKQFEERLVNGTGYSKIAVSFTFTQR------------TIQTDFKI 497

Query: 97  PVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARR 156
                   S    G      ++ +R+HE++   +  I  + G+      I CHS++  +R
Sbjct: 498 ------HQSRKEKG----YNVVPVRIHEVEQTIEDTI-ELTGEETSK-SIPCHSRIRKQR 545

Query: 157 ALKPKKGSKPDGCDDNGDAVAGLDMRSSMES-------PLSWIRADPEMEYLAEIHFN-Q 208
                   +P   D     +  LD R    S       P+ +I  DP+ +Y   I  + Q
Sbjct: 546 -------KRPIKTDTGETIMYDLDSRDCKRSGQGETDVPILYIEIDPDNDYPRRIKRDKQ 598

Query: 209 PVQ--MWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL-SDSKAFWRVRIEAA 265
             Q  MW+ QLE    V  Q   I  LE     S   +N L +FL ++ + F++V+I AA
Sbjct: 599 SFQEYMWLTQLEFVRTVRGQLVTIEGLEKCK--SEKGINALFDFLINNHRCFYKVQIAAA 656

Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
           +A+A  + E+  + G   L +FYK+  ++E++G+  P DF +   Y + + +  A A VR
Sbjct: 657 HAIARMSCEQNGYLGFHLLKQFYKTNYYEESVGVI-PLDFSNPLRYELQKGLIGAFADVR 715

Query: 326 A------------ADNK-----------SPREAVEFVLQLLKYNDNNGNPYSDVFWLAAL 362
                         D+K           S  EAV  +L++ + + N  N Y D  ++ ++
Sbjct: 716 VHSLTDKINWGGLYDDKHKNRIDALSLPSYPEAVSLLLEIAENSTNERNKYQDGVFIKSV 775

Query: 363 VQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
           + ++G++   +        + K + R L  D+++PS   I+T++ I+    +A  L    
Sbjct: 776 IGALGKIVSSKDED--NERITKFLKRELNKDKIIPSDRNIVTLAVIK----MAPGLCDLT 829

Query: 423 SLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQV 482
           SL  V             ++VRV A RALL      +     L++F+++   E    G+V
Sbjct: 830 SLCSV----------GNFYEVRVAAIRALLK-----SDPQQGLTIFVET--HERGDNGRV 872

Query: 483 KLGIHAM 489
           +  I  M
Sbjct: 873 RYAICEM 879


>gi|67466781|ref|XP_649532.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56465995|gb|EAL44144.1| hypothetical protein EHI_108470 [Entamoeba histolytica HM-1:IMSS]
 gi|449704970|gb|EMD45118.1| Hypothetical protein EHI5A_134100 [Entamoeba histolytica KU27]
          Length = 946

 Score = 92.8 bits (229), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 116/487 (23%), Positives = 208/487 (42%), Gaps = 91/487 (18%)

Query: 37  RHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRT 96
           R F  ++       +K+F  R V   G   + + F++ +R            T++ D + 
Sbjct: 450 RQFVKELQTQNSSLMKQFEERLVNGTGYSKIAVSFTFTQR------------TIQTDFKI 497

Query: 97  PVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARR 156
                   S    G      ++ +R+HE++   +  I  + G+      I CHS++  +R
Sbjct: 498 ------HQSRKEKG----YNVVPVRIHEVEQTIEDTI-ELTGEETSK-SIPCHSRIRKQR 545

Query: 157 ALKPKKGSKPDGCDDNGDAVAGLDMRSSMES-------PLSWIRADPEMEYLAEIHFN-Q 208
                   +P   D     +  LD R    S       P+ +I  DP+ +Y   I  + Q
Sbjct: 546 -------KRPIKTDTGETIMYDLDSRDCKRSGQGETDVPILYIEIDPDNDYPRRIKRDKQ 598

Query: 209 PVQ--MWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL-SDSKAFWRVRIEAA 265
             Q  MW+ QLE    V  Q   I  LE     S   +N L +FL ++ + F++V+I AA
Sbjct: 599 SFQEYMWLTQLEFVRTVRGQLVTIEGLEKCK--SEKGINALFDFLINNHRCFYKVQIAAA 656

Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
           +A+A  + E+  + G   L +FYK+  ++E++G+  P DF +   Y + + +  A A VR
Sbjct: 657 HAIARMSCEQNGYLGFHLLKQFYKTNYYEESVGVI-PLDFSNPLRYELQKGLIGAFANVR 715

Query: 326 A------------ADNK-----------SPREAVEFVLQLLKYNDNNGNPYSDVFWLAAL 362
                         D+K           S  EAV  +L++ + + N  N Y D  ++ ++
Sbjct: 716 IHSLTDKINWGGLYDDKHKNRIDALSLPSYPEAVSLLLEIAENSTNERNKYQDGVFIKSV 775

Query: 363 VQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
           + ++G++   +        + K + R L  D+++PS   I+T++ I+    +A  L    
Sbjct: 776 IGALGKIVSSKDED--NERITKFLKRELNKDKIIPSDRNIVTLAVIK----MAPGLCDLT 829

Query: 423 SLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEPSLRGQV 482
           SL  V             ++VRV A RALL      +     L++F+++   E    G+V
Sbjct: 830 SLCSV----------GNFYEVRVAAIRALLK-----SDPQQGLTIFVET--HERGDNGRV 872

Query: 483 KLGIHAM 489
           +  I  M
Sbjct: 873 RYAICEM 879


>gi|401882976|gb|EJT47215.1| hypothetical protein A1Q1_04073 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 1779

 Score = 92.8 bits (229), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 68/241 (28%), Positives = 117/241 (48%), Gaps = 38/241 (15%)

Query: 191 WIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNF 250
           WIR DP+ E+LA     +    WI+QLE + DVVAQ Q +       + S  + + L   
Sbjct: 659 WIRVDPDCEWLATFDIQEKPWFWISQLEGESDVVAQLQRMCM-----YPSPVIASELAKV 713

Query: 251 LSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS---RRFDE----NI-GLPRP 302
           +     F+R             S E D+ GL  L+K + +      DE    N+  +P P
Sbjct: 714 VLVENYFYRYN-----------SAEADYMGLFLLLKLFHTFYCEPTDEEEPSNVKAVPLP 762

Query: 303 NDFRDFSEYFVLEAIPHAVAMVR-AADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAA 361
           NDF +F+ YF+ +++  A++ +R  A  +  R   E +L LL+ NDN+ N YSD  ++A 
Sbjct: 763 NDFSNFANYFLKKSMVTALSDLRDPATRRVWRHVRELLLDLLRMNDNSQNEYSDSHYVAT 822

Query: 362 LVQSVGEL---------EFGQQSILFLSSLLKRI----DRLLQFDRLMPSYNGILTISCI 408
           ++ ++G              ++  +    L K      +R +  DRL+PSY+ ++T++ +
Sbjct: 823 VISAIGSAFSAGVNVGGNLSEEDQVMDGQLFKEASEANERAITVDRLVPSYHNVVTLAGL 882

Query: 409 R 409
           +
Sbjct: 883 Q 883


>gi|167385807|ref|XP_001737495.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165899650|gb|EDR26194.1| hypothetical protein EDI_075900 [Entamoeba dispar SAW760]
          Length = 946

 Score = 92.0 bits (227), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 109/450 (24%), Positives = 194/450 (43%), Gaps = 84/450 (18%)

Query: 37  RHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRT 96
           R F  ++       +K+F  R V   G   + + F++ +R            T++ D + 
Sbjct: 450 RQFVKELQTQNSSLMKQFEERLVNGTGYSKIAVSFTFTQR------------TIQTDFKI 497

Query: 97  PVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARR 156
                   S    G      ++ +R+HE++   +  I  + G+      I CHS++  +R
Sbjct: 498 ------HQSRKEKG----YNVVPVRIHEVEQTIEDTI-ELTGEETSK-SIPCHSRIRKQR 545

Query: 157 ALKPKKGSKPDGCDDNGDAVAGLDMRSSMES-------PLSWIRADPEMEYLAEIHFN-Q 208
                   +P   D     +  LD R    S       P+ +I  DP+ +Y   I  + Q
Sbjct: 546 -------KRPIKTDTGETIMYDLDSRDCKRSGQGETDVPILYIEIDPDNDYPRRIKRDKQ 598

Query: 209 PVQ--MWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL-SDSKAFWRVRIEAA 265
             Q  MW+ QLE    V  Q   I  LE     S   +N L +FL ++ + F++V+I AA
Sbjct: 599 SFQEYMWLTQLEFVRTVRGQLVTIEGLEKCK--SEKGINALFDFLINNHRCFYKVQIAAA 656

Query: 266 YALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVR 325
           +A+A  + E+  + G   L +FYK+  ++E++G+  P DF +   Y + + +  A A VR
Sbjct: 657 HAIARMSCEQNGYLGFHLLKQFYKTNYYEESVGVI-PLDFSNPLRYELQKGLIGAFADVR 715

Query: 326 A------------ADNK-----------SPREAVEFVLQLLKYNDNNGNPYSDVFWLAAL 362
                         D+K           S  EAV  +L++ + + N  N Y D  ++ ++
Sbjct: 716 VHNLTDKINWGGLYDDKHKNRIDALSLPSYPEAVSLLLEIAENSTNERNKYQDGVFIKSV 775

Query: 363 VQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFI 422
           + ++G++   ++       + K + R L  D+++PS   I+T++ I+    +A  L    
Sbjct: 776 IGALGKIVSSKEED--NERITKFLKRELNKDKIIPSDRNIVTLAVIK----MAPGLCDLS 829

Query: 423 SLDQVVKLIKPFRDFNTIWQVRVEASRALL 452
           SL  V             ++VRV A RALL
Sbjct: 830 SLCSV----------GNFYEVRVAAIRALL 849


>gi|440298463|gb|ELP91099.1| hypothetical protein EIN_268550 [Entamoeba invadens IP1]
          Length = 936

 Score = 90.5 bits (223), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 98/423 (23%), Positives = 184/423 (43%), Gaps = 77/423 (18%)

Query: 29  RTLSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDC 88
           R L T   R F  ++       +K+F  R V   G   + + FS+ +R            
Sbjct: 444 RNLKT---RSFVKELQTQNSSLMKQFEERIVDGTGYSKIAVSFSFTQR------------ 488

Query: 89  TVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQC 148
                    ++ ++        + G+  ++ +R+HE++   +  I  + G    L  I C
Sbjct: 489 ---------IVQTDFKIRQSKKEKGY-NVVPVRIHEVEQTIEDSI-ELTG-VETLKSIPC 536

Query: 149 HSKLAA--RRALKPKKGSK------PDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEY 200
           HS++    +R +K   G           C  NG        +S  +  + +I  DP+ +Y
Sbjct: 537 HSRIRKQRKRTIKTDTGETIMYDLDSRDCKRNG--------QSETDVAILYIEIDPDNDY 588

Query: 201 LAEIHFNQPV---QMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL-SDSKA 256
             +I  N+      MW+ QLE    V  Q   I  LE     S   +  L +FL ++ + 
Sbjct: 589 PRKIKRNKQSFLEYMWLTQLEFVRTVRGQLVTIEGLEKCN--SEKGIQALYDFLINNHRC 646

Query: 257 FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEA 316
           F++V+IEAA+A+A  + EE  + G   L  FYK+  ++++IG+  P DF +  +Y + + 
Sbjct: 647 FYKVQIEAAHAIARMSCEENQYLGFNLLKTFYKTNYYEDSIGVV-PLDFSNALKYEIQKG 705

Query: 317 IPHAVAMVR---------------AADNKSPR--------EAVEFVLQLLKYNDNNGNPY 353
           +  A A VR                   + PR        EAV+ +L++ + + N  N Y
Sbjct: 706 LIGAFADVRLHSINEKINWGRVYKKKQTERPRALSKVNHPEAVQLLLEIAENSTNEKNSY 765

Query: 354 SDVFWLAALVQSVGELEF-GQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLT 412
            D  +L  +V+++G++    ++  + + + L+R+   L  D+++ S   I+T + + +  
Sbjct: 766 QDGIFLKEVVKALGKINSPNKEENVRIETFLRRV---LNKDKIIASDRNIVTQAVVLSTP 822

Query: 413 QIA 415
           Q A
Sbjct: 823 QFA 825


>gi|300707253|ref|XP_002995843.1| hypothetical protein NCER_101163 [Nosema ceranae BRL01]
 gi|239605067|gb|EEQ82172.1| hypothetical protein NCER_101163 [Nosema ceranae BRL01]
          Length = 1734

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 73/326 (22%), Positives = 149/326 (45%), Gaps = 55/326 (16%)

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+FF  +V   G   +   +S + +KN V+  + +    +P S+  + + NT +     
Sbjct: 432 MKDFFEFYVFKPGLLKVNFRYSIDTKKNRVDFKIEQ----RPTSK--IFNHNTKA----- 480

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCD 170
                G +SI  +E++G +DH I     D         H     R               
Sbjct: 481 ----IGNISIASYEVEGCFDHVINMQNSD---------HFYYHVRTK------------- 514

Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
                   ++       PL W+R DP+ E+L +    QP  M+I Q+  D +V+ Q +A+
Sbjct: 515 ----KRKKIEEEEDEIMPLLWMRMDPKREHLIDGFIEQPDYMFIEQV-LDKNVIGQLEAL 569

Query: 231 AALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKS 290
             L    ++S  +   L   L  S  F+++RI+  Y L+    +  ++ G   L++F+  
Sbjct: 570 DQLS--KNISIQICEVLERVLDSSHVFYKIRIKVMYILSKINID--NFIGFQRLIQFFTK 625

Query: 291 RRFDENIGLPRPNDFRDFSEYFVLEAIPHAVA-----MVRAADNK---SPREAVEFVLQL 342
           + F ++  + +PN+F  F  YF+ +    A++     +++  + +   S      F++ +
Sbjct: 626 KYFVQSSTIVKPNEFL-FVNYFLQKHCVKALSYTNPSIIKGYNGRNVASFSVICSFIINI 684

Query: 343 LKYNDNNGNPYSDVFWLAALVQSVGE 368
           LK+NDN+ N Y+D +++A++++ + +
Sbjct: 685 LKFNDNSANLYNDSWYIASVIECLSK 710


>gi|193786061|dbj|BAG50950.1| unnamed protein product [Homo sapiens]
          Length = 504

 Score = 87.8 bits (216), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 102/206 (49%), Gaps = 19/206 (9%)

Query: 262 IEAAYALANTA-SEETDWAGLLHLVKFYKSRRF--DENIGLPRPNDFRDFSEYFVLEAIP 318
           + A + LA  A S  + W G   +   + +R F       + + N+F+ F  YF+ +  P
Sbjct: 1   MSACFCLAKIANSMVSTWTGPPAMKSLF-TRMFCCKSCPNIVKTNNFKSFQSYFLQKTTP 59

Query: 319 HAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA----ALVQSVGELEFGQQ 374
            A+A++R   N  P+E + F+L L+KYNDN  N +SD ++ A    AL  SV        
Sbjct: 60  VAMALLRDVHNLCPKEVLTFILDLIKYNDNRKNKFSDNYYRAEMIDALANSVTPAVSVNN 119

Query: 375 SILFLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQV 427
            +  L +L       L+ I R L  ++L+PSY   +T+SC+R +    L+ +G +  D  
Sbjct: 120 EVRTLDNLNPDVRLILEEITRFLNMEKLLPSYRHTITVSCLRAIR--VLQKNGHVPSDPA 177

Query: 428 VKLIKPFRDFNTIWQVRVEASRALLD 453
             L K + ++     +R+ A  A++D
Sbjct: 178 --LFKSYAEYGHFVDIRIAALEAVVD 201


>gi|380496248|emb|CCF31831.1| hypothetical protein CH063_16140, partial [Colletotrichum
           higginsianum]
          Length = 191

 Score = 86.3 bits (212), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 53/168 (31%), Positives = 83/168 (49%), Gaps = 18/168 (10%)

Query: 264 AAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAM 323
           AA  L + A+E+   AG+ HL+K +       +   PRPNDF D  +Y V  A+P A+A 
Sbjct: 2   AADMLYHHATEKVAMAGMTHLLKAFTELFCYPDSATPRPNDFTDKKQYLVQSALPMAIAK 61

Query: 324 VRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG---------------E 368
           +R A+ + P  A   +L  +++N+N  NPYSD F++A L+++V                +
Sbjct: 62  IRDANGRCPEPARRLLLNQVQFNNNANNPYSDHFYVAKLLEAVAHSLIPEKRRDAGDSMD 121

Query: 369 LEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT---ISCIRTLTQ 413
           L+   +   FL   +  IDR  + D    SY  I T   + C R L +
Sbjct: 122 LDTSIEERSFLGEAIVEIDRFRRMDEWANSYQNIWTTTALECRRKLMK 169


>gi|171687459|ref|XP_001908670.1| hypothetical protein [Podospora anserina S mat+]
 gi|170943691|emb|CAP69343.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1618

 Score = 85.9 bits (211), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 131/548 (23%), Positives = 229/548 (41%), Gaps = 88/548 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            + K  GS    +++  I+S A   +   T  LS + FR    K+    RP   E + +W+
Sbjct: 550  MSKMTGSTGIVRVISKIVSTANIDTRNETTWLSAEPFRTACEKMSTY-RP--DELWAQWI 606

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCT-------VKPDSRTPVLSSNTDSENRDGDI 112
             + GCP L +   +NK+   V++ V +  +        K D    ++    D        
Sbjct: 607  HSAGCPKLDIKQKFNKKNLNVDITVEQRQSKDTAKEITKEDFYRELVELRHDVWAGSVPK 666

Query: 113  GWPGMMSIRVHELDGM-YDHPILPMAGDA----WQLLEIQCHSKLAARR----------- 156
             + G  S+R+ E DG  Y+H +     D     WQ+     + ++   +           
Sbjct: 667  HFTGPFSVRIQEADGTPYEHYMAISDKDKVSTNWQIAYNTKYKRMKRTKKATAAAAANSA 726

Query: 157  ------------------ALKPKKGSKPDGCDDNGDAV-AGLDMRSSMESPLSWIRADPE 197
                               L+ +K  +  G  D   A  A +D  S       WIR D  
Sbjct: 727  ADKDLQEDDVVYYNALGDVLQTEKDVQDWGLQDWSQATQAQMDQES-----YEWIRLDCN 781

Query: 198  MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLS-DSKA 256
             E+L  +  + P  M + QL++D DVVAQ  ++  L      S  VV+T+    + D + 
Sbjct: 782  FEWLCSMKTDMPPYMIVAQLQQDRDVVAQQDSMLYLRRASRPS-GVVSTIETRTAMDRRY 840

Query: 257  FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL-------PRPNDFRDFS 309
            ++ +R  A   L   A  E ++ G  HL+  ++    D  +G        P PNDF D +
Sbjct: 841  YYGIRCMAIEDLPKMADPELNYIGKAHLILIFRHFFCDRMVGKNGAVTFPPMPNDFSDKA 900

Query: 310  EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS---- 365
            +Y V  A+  A+A  R  + +  ++A  F+L LL +NDN+ N YSD  ++A L+Q+    
Sbjct: 901  QYHVQCAMVSAIAKTR-ENGRCSKDARNFLLDLLLFNDNSENDYSDHPYVACLLQALTTS 959

Query: 366  -VGELEFGQQSIL---------------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIR 409
             + +   G + +L               F+   ++ ID+  + D    SY  I T + + 
Sbjct: 960  LIPKKVDGDRDLLSSLNVHDEADAEFKRFIEKTIEEIDKYRRMDEWTSSYQNIWTTAALE 1019

Query: 410  TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFI 469
               ++ L  +  I +   ++ I+  +D N I  VR++A   L++L        + + L +
Sbjct: 1020 --CKMRLMKAKVIPV-APLEFIQYLQDDN-IDLVRIKAFECLVELGLLSK--PAIMKLLL 1073

Query: 470  KSVEEEPS 477
             SV  +PS
Sbjct: 1074 SSVSTDPS 1081


>gi|355759483|gb|EHH61623.1| hypothetical protein EGM_19632, partial [Macaca fascicularis]
          Length = 1068

 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 51/164 (31%), Positives = 86/164 (52%), Gaps = 15/164 (9%)

Query: 301 RPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLA 360
           + N+F  F  YF+ + +P A+A++R   N  P+E + F+L L+KYNDN  N +SD ++ A
Sbjct: 606 KTNNFMSFQSYFLQKTMPVAMALLRDVHNLCPKEVLTFILDLIKYNDNRKNKFSDNYYRA 665

Query: 361 ----ALVQSVGELEFGQQSILFLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIR 409
               AL  SV         +  L +L       L+ I R L  ++L+PSY   +T+SC+R
Sbjct: 666 EMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEITRFLNMEKLLPSYRHTITVSCLR 725

Query: 410 TLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLD 453
            +    L+ +G +  D    L K + ++     +R+ A  A++D
Sbjct: 726 AIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAALEAVVD 765



 Score = 40.0 bits (92), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 31/143 (21%), Positives = 61/143 (42%), Gaps = 32/143 (22%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 413 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 472

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T                    G
Sbjct: 473 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------------------SPG 510

Query: 111 DIGWPGMMSIRVHELDGMYDHPI 133
              + G + + V ELDG ++H +
Sbjct: 511 TQKYVGPLKVTVQELDGSFNHTL 533


>gi|432094711|gb|ELK26191.1| Transcription initiation factor TFIID subunit 2 [Myotis davidii]
          Length = 713

 Score = 85.1 bits (209), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 67/249 (26%), Positives = 108/249 (43%), Gaps = 62/249 (24%)

Query: 216 QLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
           QL  + DVVAQ ++I ALE  P  +  +   L + L   + F+RVR+ A + LA T    
Sbjct: 254 QLRFERDVVAQQESILALEKFPTPASRLA--LTDILEQEQCFYRVRMSACFCLAKT---- 307

Query: 276 TDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA 335
                                                    +P A+A++R   N  P+E 
Sbjct: 308 -----------------------------------------MPVAMALLRDVHNLCPKEV 326

Query: 336 VEFVLQLLKYNDNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LK 384
           + F+L L+KYNDN  N +SD ++ A    AL  SV         +  L +L       L+
Sbjct: 327 LTFILDLIKYNDNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILE 386

Query: 385 RIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVR 444
            I R L  ++L+PSY   +T+ C+R +    L+ +G +  D    L K + ++     +R
Sbjct: 387 EITRFLNMEKLLPSYRHTITVGCLRAIR--VLQKNGHVPSDPT--LFKSYAEYAHFVDIR 442

Query: 445 VEASRALLD 453
           + A  A++D
Sbjct: 443 IAALEAVVD 451


>gi|298713843|emb|CBJ27215.1| TRANSCRIPTION INITIATION FACTOR TFIID [Ectocarpus siliculosus]
          Length = 1445

 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 91/384 (23%), Positives = 163/384 (42%), Gaps = 93/384 (24%)

Query: 187  SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP--------- 237
            SP+ W++ DP++E++  +   +P   W+ QL  D  V AQ +A+  L  LP         
Sbjct: 809  SPVLWVKVDPQLEWMMSVRMRKPWHQWVEQLFCDESVEAQCEALRGLVELPLPFELRNSE 868

Query: 238  ---------------HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEET---DWA 279
                           H      + +    + + A W+ +    +A     S+ET   +W 
Sbjct: 869  LPVQAIAEAVRGRVSHTQMEHASCVRKEAAVALALWQNQHAPPHAGGFGESKETRAQNWR 928

Query: 280  GLLHLVKFYKSRRFDE-------------NIG--------------LPRPNDFRDFSEYF 312
            GL HL+K + + RF++              IG              L  PN F D +EY 
Sbjct: 929  GLEHLLKAF-AERFEQRDSKSPASSGMRLGIGHDEAGSGGGNRKSALLLPNWFEDSTEYR 987

Query: 313  VLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGELEFG 372
            + +A+  A A +RA D  +P E ++ ++ LL+ NDN  N  SD +++A ++ S+G++   
Sbjct: 988  LKKALIWAAANMRARDGCTPLEVLDLLVVLLRENDNCENSRSDDYYVAQVLTSLGQVSTT 1047

Query: 373  QQSILFLSSLLKRIDR---LLQFD-----------------RLMPSYNGILTISCIRTLT 412
            Q++ L  SS+ + I +    L  D                 R   S+NG++    ++ L 
Sbjct: 1048 QEAGLPASSVAEAISQARLWLDLDMLGVSSSGHKATAKPATRTTKSHNGVVAACALQCLC 1107

Query: 413  QIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALL------------DLEFHCNG 460
            Q+ +   G   +D      + + +      VR+ A+++++            D E    G
Sbjct: 1108 QLEMLRGGRPEVD-----YEEYTEGRHPTNVRIAAAQSIIHLYLAYDSGLAEDAEDDGEG 1162

Query: 461  IDSALSLFIKSVE-EEPSLRGQVK 483
            + SAL   +  VE EEP+  G+ +
Sbjct: 1163 LSSALLWVLDFVESEEPAGDGRPR 1186


>gi|123479579|ref|XP_001322947.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121905802|gb|EAY10724.1| hypothetical protein TVAG_364490 [Trichomonas vaginalis G3]
          Length = 980

 Score = 83.6 bits (205), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 59/216 (27%), Positives = 114/216 (52%), Gaps = 10/216 (4%)

Query: 176 VAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEA 235
           V G  +  S+   + W+  DP+  +L  +    P  M  N L+   DV +Q + +++LE 
Sbjct: 523 VNGEQVEISIHHSILWVAIDPDHTWLMRVRPRLPQFMVHNMLDLVRDVYSQHETLSSLEE 582

Query: 236 LPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTAS-EETDWAGLLHLVKFYKSRRFD 294
               + +    L +FL ++K F+ VR  AA  LA  A+ ++ +   +  L+ +YK+  F 
Sbjct: 583 WIK-NKDPQQWLKSFLENNKIFYGVRGHAARTLALFATPDDVENTNMSTLISWYKAEFFS 641

Query: 295 ENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYS 354
            +   P+ N+F+D  ++FV   +  +++++R     +PRE  E ++Q+L  NDNNGN YS
Sbjct: 642 ND--KPKQNNFQDLQKHFVQLEVIKSMSVIRNNQGYTPREIAELLIQILDQNDNNGNQYS 699

Query: 355 DVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLL 390
           D  +   +  ++G + F        + ++KR++++L
Sbjct: 700 DDNFRVEIALALGRV-FSDN-----ADIMKRVEQIL 729


>gi|302834603|ref|XP_002948864.1| hypothetical protein VOLCADRAFT_89164 [Volvox carteri f. nagariensis]
 gi|300266055|gb|EFJ50244.1| hypothetical protein VOLCADRAFT_89164 [Volvox carteri f. nagariensis]
          Length = 2124

 Score = 79.0 bits (193), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 78/286 (27%), Positives = 125/286 (43%), Gaps = 57/286 (19%)

Query: 188  PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAA---LEALPHL----- 239
            P+ + R DP  E+L ++   Q   MW+ QL    DVVAQ+ A+ A   +   P L     
Sbjct: 884  PVQYFRVDPGQEWLCDVVVLQSESMWVAQLRGSKDVVAQSLAVQADSHVSPSPPLFKGLM 943

Query: 240  -------------SFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGL----- 281
                         S +V+  L + +     + RVRIEAA ALA  +   +   G+     
Sbjct: 944  RLGLQLESSGAADSSSVIGVLLDCVRSGGVYCRVRIEAARALAAFSPSMSTQPGVGLVGA 1003

Query: 282  -LHLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLE-------AIPHAVAMVRAADNKSPR 333
              +L+ F+  +  D    + RPN + D   + VL+       A+P  +A +R     S  
Sbjct: 1004 APNLLSFFYDQCRDPVSAVLRPNYWADLGTHLVLQARGVQEKAVPACLASIRDVSFNSTP 1063

Query: 334  EAVEFVLQLLKYNDNNG---------------------NPYSDVFWLAALVQSVGELEFG 372
            EA++ V  +L  NDN+                      N Y D  +LAA++++ G L   
Sbjct: 1064 EAMQLVTDILTNNDNSAAYYKQTPPTYASFFPLCTTTENLYDDSSYLAAVIEAAGLLRPT 1123

Query: 373  QQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKL 418
              S+  L+ LL  +D  L  D L PS N ++  +C+  +T +AL +
Sbjct: 1124 HSSV--LARLLGLLDDYLLQDCLHPSPNYVVGSACLAAMTSLALSV 1167


>gi|116180768|ref|XP_001220233.1| hypothetical protein CHGG_01012 [Chaetomium globosum CBS 148.51]
 gi|88185309|gb|EAQ92777.1| hypothetical protein CHGG_01012 [Chaetomium globosum CBS 148.51]
          Length = 1486

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 126/545 (23%), Positives = 235/545 (43%), Gaps = 78/545 (14%)

Query: 2    LEKQMGSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            + K  GS    +++  I+S A   + + +  LS  +FR    K     RP   E + +WV
Sbjct: 546  MSKIPGSMGIVRVISQIVSNANINATITSTSLSDADFRRACEKKSQY-RP--DELWQQWV 602

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVK--PDSRTPVLSSNTDSENRDGDIGWP-- 115
               GCP L +   +NK+   V++++ +  + +  P++ T        +E        P  
Sbjct: 603  HGAGCPKLLIKQRFNKKNLNVDISIAQTQSKENGPNAITKEEFLKELAEEVHEVWAGPVP 662

Query: 116  ----GMMSIRVHELDGM-YDHPILPMAGD-AWQLLEIQCHSKL-------AARRALKPKK 162
                G  ++R+HE DG  Y+H +     D     L+I  ++K         A  A     
Sbjct: 663  KLFTGPFTVRIHEADGTPYEHYLAITDKDKVATTLQIAYNTKYKRMKRTKKATAAAASTA 722

Query: 163  GSKPDGCDDN-------GDAVA--------GL-----DMRSSM-ESPLSWIRADPEMEYL 201
              K D  +D+       GD +         GL     +++++M +    WIR D   E+L
Sbjct: 723  VDKQDIQEDDIVYFNMLGDVLMTQRDLENWGLQEWSDNVQTAMDQESYEWIRLDCNFEWL 782

Query: 202  AEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE-ALPHLSFNVVNTLNNFLSDSKAFWRV 260
             E   + P  M++ QL++D DVVA   A+   +    H   + + T    + D + F  +
Sbjct: 783  CETVTDMPGYMYLAQLQQDRDVVAHQDAMLFFQRGKRHGVASSIETRT--VMDRRYFHGI 840

Query: 261  RIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL-------PRPNDFRDFSEYFV 313
            R  A   L   A  E ++ G   L+  ++    D  +G        P PNDF D ++Y V
Sbjct: 841  RTMAIDDLPKQADPEINYIGTAQLILIFRHFFCDRMVGKNGNVTFPPVPNDFHDKAQYSV 900

Query: 314  LEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQS-----VGE 368
              A+P A+A  R  + +  R+A  F+L LL +NDN+ N YSD F++A L+++     + +
Sbjct: 901  QCALPGAIARTRE-NGRCSRQARNFLLDLLLFNDNSDNEYSDHFYIAKLLEALTTSIIPD 959

Query: 369  LEFGQQSIL---------------FLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQ 413
               G++ +                F+   ++ ID+  + D    +Y  + T + +    +
Sbjct: 960  KVEGERDLYSRLGADDDDDMELKRFIEKTIEEIDKYRRMDEWTSTYQNVWTTTALHCKMR 1019

Query: 414  IALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVE 473
            +   +   +   Q ++ ++  +D N +  +R++A   L++L     G+ +      K+ +
Sbjct: 1020 L---MKANVIRTQPLEFVRYLQDGN-LDLIRIKAFECLVELGLLAKGMAAVALGESKAAQ 1075

Query: 474  EEPSL 478
             EP +
Sbjct: 1076 LEPEV 1080


>gi|367019792|ref|XP_003659181.1| hypothetical protein MYCTH_2295880 [Myceliophthora thermophila ATCC
            42464]
 gi|347006448|gb|AEO53936.1| hypothetical protein MYCTH_2295880 [Myceliophthora thermophila ATCC
            42464]
          Length = 1543

 Score = 78.6 bits (192), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 125/527 (23%), Positives = 227/527 (43%), Gaps = 88/527 (16%)

Query: 2    LEKQMGSNFFRKILQNIISRAQGASPVRT--LSTKEFRHFANKVGNLERPFLKEFFPRWV 59
            + K  GS    +++  I+S A   + + +  L+  +FR    K     RP   E + +WV
Sbjct: 547  MSKIPGSMGIVRVISQIVSNANINATITSTSLADADFRRACEKKSQY-RP--DELWQQWV 603

Query: 60   GTCGCPVLRMGFSYNKRKNIVELAVL----RDCTVKPDSRTPVLSSNTDS--ENRDGDIG 113
               GCP L +   +NK+   V++++     +D   K  ++   L    +   E   G + 
Sbjct: 604  HGAGCPKLLIKQRFNKKNLNVDISIAQTQSKDNGPKAITKDEFLKELLEEVYEVWAGPVP 663

Query: 114  --WPGMMSIRVHELDGM-YDHPILPMAGD-AWQLLEIQCHSKLAARR---------ALKP 160
              + G  ++R+HE DG  Y+H +     D     L+I  ++K    +         A   
Sbjct: 664  KLFTGPFTVRIHEADGTPYEHYLAITDKDKVATTLQIAYNTKYKRMKRTKKATAAAAAAN 723

Query: 161  KKGSKPDGCDDN-------GDAVA--------GL-DMRSSMESPLS-----WIRADPEME 199
              G K D  +D+       GD +         GL D   S+++ +      WIR D   E
Sbjct: 724  SAGDKHDIQEDDIVYFNMLGDVLMSPRDIENWGLQDWSESVQAAMDQESYEWIRLDCNFE 783

Query: 200  YLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALE-ALPHLSFNVVNTLNNFLSDSKAFW 258
            +L E+  + P  M++ QL++D DVVA   A+   +    H   + + T    + D + + 
Sbjct: 784  WLCEMITDMPGYMYLAQLQQDRDVVAHQDAMLFFKRGKRHGVASSIETRT--VMDRRYYH 841

Query: 259  RVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGL-------PRPNDFRDFSEY 311
             +R  A   L   A  E ++ G   L+  Y+       +G        P PNDF D ++Y
Sbjct: 842  GIRTMAIDDLPKQADPEINYIGTAQLILLYRHFFCHRIVGKNGSVTFPPAPNDFYDKAQY 901

Query: 312  FVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSV----- 366
             V  A+P A+A  R  + +  R A EF+L LL +NDN+ N YSD F++A L++++     
Sbjct: 902  AVQCALPAAIARTRE-NGRCSRRAREFLLDLLLFNDNSDNEYSDQFYIAKLLEALTTSII 960

Query: 367  -------------------GELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISC 407
                                +LEF      F+   ++ I++  + D    +Y+ I T + 
Sbjct: 961  PDKWESERDLYSSLRTDDDDDLEFK----TFIEKAIEEIEKYRRMDEWTSTYHNIWTTTA 1016

Query: 408  IRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDL 454
            +    ++   +   +   + ++ ++  +D N +  +R+EA   L++L
Sbjct: 1017 LHCKMRL---MKANVIRTEPLEFVRYLQDEN-LDLIRIEAFDCLVEL 1059


>gi|452819327|gb|EME26388.1| transcription initiation factor TFIID subunit D2, partial
           [Galdieria sulphuraria]
          Length = 580

 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 69/265 (26%), Positives = 122/265 (46%), Gaps = 31/265 (11%)

Query: 188 PLSWIRADPEMEYLAE-IHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP--HLSFNVV 244
           P+ W+R DP+ E++ + + F +  +  I+ L    DV  Q +A+  L +     +S   +
Sbjct: 16  PVLWVRIDPDQEWILKFVEFRRSERDSISCLRNCRDVWGQLEALETLMSRTKGRVSVTSI 75

Query: 245 NTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRFDENIGLPRPND 304
             L   L+DS+ +++VR E A  LA   +      GL  L+K  +++ FD++ G    N 
Sbjct: 76  RCLEQILTDSRYYFQVRGECARCLAKCYTTNDRLMGLEALLKLLRNKYFDKH-GAVLSNH 134

Query: 305 FRDFSEYFVLEAIPHAVAM--------VRAADNKSPREAVEF----VLQLLKY------- 345
           F D SEYFV  A+  A+A         +   D++       F    + + L Y       
Sbjct: 135 FDDISEYFVKCAVIQAIAQAIHPISFPIYGYDSQFSLNLASFQMTHIQESLPYDVLQLLL 194

Query: 346 -----NDNNGNP-YSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSY 399
                ND    P + D F+L  L+ +        +  +    + K++ R L+ +  MPSY
Sbjct: 195 SLLEQNDTAYPPLFDDCFYLHNLIIACAICSVYAKDQITRHRVYKQLFRYLKREEAMPSY 254

Query: 400 NGILTISCIRTLTQIALKLSGFISL 424
           + I+T + ++ LT   L+L GF+ +
Sbjct: 255 HNIITGASLKGLT--ILQLGGFLRI 277


>gi|320166290|gb|EFW43189.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1514

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 70/287 (24%), Positives = 124/287 (43%), Gaps = 65/287 (22%)

Query: 222  DVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGL 281
            D+  Q +AI  L   P L+     TL   L + + F R+R+ A +ALA+ AS + +  G+
Sbjct: 846  DINVQHEAIKKLADRPSLA--TWTTLVKVLHNERVFHRLRLAAVFALASMASLDPNLNGI 903

Query: 282  LHLVKFYKSR-------RFDE--------NIG------------------------LPRP 302
             +L+  ++ R       R  E        N+                         L  P
Sbjct: 904  RYLMDEFRKRFCITSHQRDKETQHAFAYLNVKAPPSPPPPTAAVPNPPVVSAFTDYLVEP 963

Query: 303  NDFRDFSEYFVLEAIPHAVAMVRAA-----DNKSPREAVEFVLQLLKYNDNNGNPYSDVF 357
            NDF +F+ Y + +A   A+  +R       +   P   V F+L +L+ NDN+ NPY+D  
Sbjct: 964  NDFTNFANYHLKKAFVRALGTIRLRRPGLRNPPCPWGVVRFLLDVLEMNDNSVNPYTDAH 1023

Query: 358  WLAALVQSVGE-------------LEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILT 404
            ++AA++  + E             L F  +  + +  +L    RLL  ++L PSY  ++ 
Sbjct: 1024 YIAAIIDGLAETVTDDYDLKPTDTLPFPYEHPIHV--ILAEFTRLLSLEKLNPSYRHVVA 1081

Query: 405  ISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEASRAL 451
            +SC+  ++ I  +  G +++D  + +      +     VR+ A R L
Sbjct: 1082 VSCLNGMSLIQAR--GVLTID--IGVFSSHARYGHFTDVRLAAMRGL 1124


>gi|443920353|gb|ELU40290.1| transcription initiation factor TFIID subunit 2 [Rhizoctonia solani
            AG-1 IA]
          Length = 1574

 Score = 73.6 bits (179), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 59/201 (29%), Positives = 94/201 (46%), Gaps = 55/201 (27%)

Query: 257  FWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF--DENIG----------LPRPND 304
            F+RVR  AA ALA+  +      GL HL+K +    +  D N G          +PRPND
Sbjct: 851  FFRVRCAAARALASQGAP-----GLFHLLKIWTKWCYEPDSNNGNGGDGFAQRFIPRPND 905

Query: 305  FRDFSEYFVLEAIPHAVAMVRAADNKSPREAV-------EFVLQLLKYNDNNGNPYSDVF 357
            F DF EYFV +A+  A++ +R  D  S   A          ++  L YNDNNGN Y+D  
Sbjct: 906  FSDFQEYFVRKAVITALSQLR--DGASGTGAWLGGALVRGTIVDQLVYNDNNGNTYADDM 963

Query: 358  WLAALVQSVG--------------------ELEFGQ---------QSILFLSSLLKRIDR 388
            ++A L+ ++G                    E+E  +         ++   L   + +++R
Sbjct: 964  YIATLITALGTALIPPDVSERNELAHNHTVEMEMEEAMRRQMEEDENAEVLRRAVTQVER 1023

Query: 389  LLQFDRLMPSYNGILTISCIR 409
                DRL+PS + +++I+ I+
Sbjct: 1024 YRARDRLVPSRHNVISIAAIQ 1044



 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 31/96 (32%), Positives = 44/96 (45%), Gaps = 14/96 (14%)

Query: 135 PMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRA 194
           P A  AW + E +           +P   + P   + N  A AG     + E+   WIR 
Sbjct: 693 PAARAAWHVEEFE---------DAEPPPHTGPIPFNPNSGAAAG-----NEETAYEWIRL 738

Query: 195 DPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
           DPEM ++A     Q   MW +QL ++ DV AQ +AI
Sbjct: 739 DPEMMWVARTELKQSGVMWASQLARERDVGAQMEAI 774


>gi|403348202|gb|EJY73534.1| hypothetical protein OXYTRI_05334 [Oxytricha trifallax]
          Length = 1887

 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 43/161 (26%), Positives = 85/161 (52%), Gaps = 8/161 (4%)

Query: 184  SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEAL-PHLSFN 242
            ++ESP+ WIR DP+ME++ ++   Q    W+ QL ++ D++ Q +AI  L      L + 
Sbjct: 1135 NLESPVLWIRVDPDMEFIRKVKVVQEQNNWLFQLLQEKDIIGQIEAINELNKFNSELVYE 1194

Query: 243  VVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLH---LVKFYKSRRFDENIGL 299
            ++ +++   ++   F++VR  A   L    + + +   L H   L+K +  R FD++IG 
Sbjct: 1195 ILKSVS---TNENYFFKVRKIAIKKLQQMQTSQFN-KYLSHEKFLIKVFNQRNFDKSIGF 1250

Query: 300  PRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVL 340
             + NDF    EY++ ++I   ++  +    +   E  E++L
Sbjct: 1251 YKSNDFSKVLEYYIDKSILKTLSKCKEVPLELKEEIKEYLL 1291



 Score = 46.6 bits (109), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 48/91 (52%), Gaps = 8/91 (8%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           M+E  +  +++ +I++ I      A+P  T+S   F +   +V   +   LK+FF  W+ 
Sbjct: 884 MIESMIDKSYYERIIRQIYD---SANP--TISQGLFHNIFKQVCGFK---LKQFFQNWII 935

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVK 91
           +  CP L   F+YNK+ N ++L + +   +K
Sbjct: 936 STSCPKLTASFNYNKKNNSLDLNIAQTSAMK 966


>gi|449020077|dbj|BAM83479.1| TATA-box binding protein-associated factor 2 [Cyanidioschyzon merolae
            strain 10D]
          Length = 1444

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 68/287 (23%), Positives = 117/287 (40%), Gaps = 62/287 (21%)

Query: 188  PLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTL 247
            P+ WIR DP+ME++  + + Q  + W+ QL  + DV +Q     AL  L   +  V    
Sbjct: 857  PIMWIRVDPDMEWVRGVFWRQNERAWVRQLAGERDVSSQMDCCDAL-YLHATAIGVRGLA 915

Query: 248  NNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF-DENIGLPR----- 301
             + L + +  +RVR  AA ++A     ET + GL  LV FY +R    E+  LPR     
Sbjct: 916  EHGLLNEQVHYRVRSAAAASIARWGGSETRFLGLEILVDFYANRYLVGEDQVLPRSDAKE 975

Query: 302  ------------------------------------------------PNDF-RDFSEYF 312
                                                            P DF R+ +EYF
Sbjct: 976  DNHLANVDASDALVGSQADAATTAATAPGRLQEKRARLGREWLQRLPRPLDFSRNLAEYF 1035

Query: 313  VLEAIPHAVAMVRAADN-----KSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
            +  A+  A+  +R A       ++P +    +  +L++++N  N + D ++L   + +  
Sbjct: 1036 LQRAMLRALGGIRVAQTGEGAQRTPTDLFALLDHVLRFHNNEANMFDDAYFLRDALLAFA 1095

Query: 368  ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQI 414
                  + +  +  + + I+R L  DRL PS+   LT  CI    ++
Sbjct: 1096 SASSTNEGLRTM-EVARCIERFLNRDRLFPSHQRALTAGCIEAAAEL 1141


>gi|326437301|gb|EGD82871.1| hypothetical protein PTSG_03502 [Salpingoeca sp. ATCC 50818]
          Length = 723

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 61/247 (24%), Positives = 111/247 (44%), Gaps = 20/247 (8%)

Query: 187 SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL-EALPHLSFNVVN 245
           SP+ W++ D + + LA I   Q    W  Q   D  V+ Q  AI A+ + + H S  VV 
Sbjct: 113 SPILWVKFDHDSKLLASITVEQEHWAWGYQARLDRHVIFQLDAIKAMGKHVGHRS--VVF 170

Query: 246 TLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF--DENIG-LPRP 302
           TL +   + + ++ VR +A  ALA   + + D  G++  ++ +  R +    + G +P  
Sbjct: 171 TLLDAAENEQLYFHVRTQAVLALAQCENVDEDQHGIVAALRAFYERHYCAKSDFGFMPLE 230

Query: 303 NDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW---- 358
           NDF +F  YF+ +A   A++ ++      P E VE +  ++   D   N Y D  +    
Sbjct: 231 NDFSNFPAYFLQQATLTALSRLKNTYGVPPLEIVEMLHTIVLATDQGSNQYDDATFRRHT 290

Query: 359 LAALVQSV--------GELEFGQQSIL--FLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
           LA    ++         ++   QQ+    ++   +      + +D +  SY   +T+ C+
Sbjct: 291 LACFAHALRPHKRRNAQDMPADQQAKRQEWVRRAVSLFAHCISYDHVSASYKKRVTVGCV 350

Query: 409 RTLTQIA 415
             LT  A
Sbjct: 351 DALTSFA 357


>gi|322796793|gb|EFZ19220.1| hypothetical protein SINV_09096 [Solenopsis invicta]
          Length = 1089

 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 47/179 (26%), Positives = 87/179 (48%), Gaps = 17/179 (9%)

Query: 310 EYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE- 368
           E  ++  IP A+A +R      P E + F++ L KYNDN+ N YSD ++ AAL++++G  
Sbjct: 631 EMTIMRTIPVAMAGLRNNHGICPPEVLSFLMDLFKYNDNSKNRYSDNYYRAALIEALGAT 690

Query: 369 ----LEFGQQSIL---FLS----SLLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALK 417
               +   Q + +   FLS    ++L+ + R L  ++L+P Y   ++++C++ +      
Sbjct: 691 VTPVISVQQGTAITAEFLSIDTKAILEEVTRNLNLEKLLPCYKYTVSVACLKVIR----I 746

Query: 418 LSGFISLDQVVKLIKPFRDFNTIWQVRVEASRALLDLEFHCNGIDSALSLFIKSVEEEP 476
           L  F  L     L + +  +     +R+ A  AL+D     +G    L   +   E +P
Sbjct: 747 LQKFGHLPSNPHLFRAYAAYGQFEDIRIAALEALVDFT-RVDGKWEDLEFLLDMAENDP 804



 Score = 50.4 bits (119), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 54/219 (24%), Positives = 93/219 (42%), Gaps = 40/219 (18%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRH-------FANKVGNLERPFLKE 53
           MLE ++G     ++    +S A  A+  + + +  + H       FA     +    +  
Sbjct: 457 MLEHRIGQEVLLQVFNKQLSLAANAAQ-QKIESGLWSHMLISTNVFAKATFTVTGKDMSV 515

Query: 54  FFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIG 113
           F  +WV T G     + F +N+++N VEL + +D T                 ++ G   
Sbjct: 516 FIDQWVRTGGHTKFSLSFIFNRKRNTVELEIRQDTT-----------------HQRGIRK 558

Query: 114 WPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNG 173
           + G + + + ELDG + H  L + G   +  +I CHSK    +  K    +        G
Sbjct: 559 YVGPLVVNIQELDGTFKH-TLQIEGTMAR-ADITCHSKSRRNKKKKIPLCT--------G 608

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQM 212
           + V  +D+ +  +SP+ WIR DPEM  +  I    PV M
Sbjct: 609 EEV-DMDLSAMDDSPVLWIRLDPEMTIMRTI----PVAM 642


>gi|429962342|gb|ELA41886.1| hypothetical protein VICG_01070 [Vittaforma corneae ATCC 50505]
          Length = 1504

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 70/314 (22%), Positives = 133/314 (42%), Gaps = 54/314 (17%)

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVEL-AVLRDCTVKPDSRTPVLSSNTDSENRD 109
           L+ FF  +V   G   +++ F  NK+KN V++ A     ++ P +   +L          
Sbjct: 561 LRSFFDFYVFRPGLMRVKLQFQINKKKNSVKVNASYTSTSLLPGANKRLL---------- 610

Query: 110 GDIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGC 169
                 G + I+  EL+G +DH I     + +       H++   ++             
Sbjct: 611 ------GNVDIKSVELEGGFDHSIAFGNENVFYY-----HTRTKKKKK------------ 647

Query: 170 DDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
                             PL +IR DP+   + E    QP  M I QL+ D  V+ Q +A
Sbjct: 648 -----------DDEEETMPLLYIRVDPKRLNIFEYTVEQPDYMHIEQLQ-DKSVIGQLEA 695

Query: 230 IAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYK 289
           I  L   P LS      L   L +   F+++R + AY L +   E  D+ GL  +++++ 
Sbjct: 696 IYNLGTKPSLS--SCEALERLLDNFHIFYKIRAKIAYILRSIKIE--DYDGLQRIIQYFI 751

Query: 290 SRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADN---KSPREAVEFVLQLLKYN 346
                 N  + + N+F   S YF+ + +  +++ +   +     + +  + F+  +LK+N
Sbjct: 752 RTYCVPNSTILKSNEFGLIS-YFIQKHLVDSISQINFKEECVISNSKIVLAFLENILKFN 810

Query: 347 DNNGNPYSDVFWLA 360
           DN+ + + D +++A
Sbjct: 811 DNSLSQFEDSWYVA 824


>gi|322796774|gb|EFZ19201.1| hypothetical protein SINV_00189 [Solenopsis invicta]
          Length = 451

 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 38/146 (26%), Positives = 71/146 (48%), Gaps = 16/146 (10%)

Query: 321 VAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVGE-----LEFGQQS 375
           +A +R      P E + F++ L KYNDN+ N YSD ++ AAL++++G      +   Q +
Sbjct: 1   MAGLRNNHGICPPEVLSFLMDLFKYNDNSKNRYSDNYYRAALIEALGATVTPVISVQQGT 60

Query: 376 ILFLSSL-------LKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVV 428
            +   SL       L+ + R L  ++L+P Y   ++++C++ +      L  F  L    
Sbjct: 61  AITAESLSIDTKAILEEVTRNLNLEKLLPCYKYTVSVACLKVIRI----LQKFGHLPSNP 116

Query: 429 KLIKPFRDFNTIWQVRVEASRALLDL 454
            L + +  +     +R+ A  AL+D 
Sbjct: 117 HLFRAYAAYGQFIDIRIAALEALVDF 142


>gi|189203565|ref|XP_001938118.1| TATA-binding protein associated factor Taf2 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
 gi|187985217|gb|EDU50705.1| TATA-binding protein associated factor Taf2 [Pyrenophora
           tritici-repentis Pt-1C-BFP]
          Length = 1359

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 72/146 (49%), Gaps = 22/146 (15%)

Query: 2   LEKQMGSNFFRKILQNIISRA-QGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           L KQ G N   + L  ++  A QG  P   ++T +F     KVG+ +   L+ FF +W+ 
Sbjct: 506 LVKQSGKNGVDRCLYRMLFNARQGKYPNGAITTDDFLDICEKVGHQK---LESFFQQWIV 562

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSEN---------RD-- 109
             GCP      ++NK+K +V+L + +   ++ DS    L +   +++         RD  
Sbjct: 563 GSGCPTFHCLPTFNKKKQVVQLTIRQ---LQADSSITTLENGLSADDFMREAKEKSRDLR 619

Query: 110 GDIGWP---GMMSIRVHELDGM-YDH 131
              G+P   G M+IR+HE DG  Y+H
Sbjct: 620 SATGYPTFVGPMTIRIHEADGTPYEH 645


>gi|422293197|gb|EKU20497.1| hypothetical protein NGA_2071100, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 290

 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 34/158 (21%)

Query: 188 PLSWIRADPEMEYLAEIHF-NQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNT 246
           P+ W+R DP+++++AE  + N P  +++ QL +D D  AQ  A+ AL + P  + +    
Sbjct: 131 PVVWLRLDPDLQWIAEWEWENLPEYIFLEQLHRDPDAAAQCLALRALTSYPKATQDTTAL 190

Query: 247 LNNFLSDSK-------------------AFW-----RVRIEAAYALANTASE-------- 274
           + +  S +                      W      VR+EAA ALA   S         
Sbjct: 191 VGSVQSGTTLRPSLAAQAMSDCVRGKQGPGWVEPSLNVRMEAARALAAWQSNHAPGSMIT 250

Query: 275 -ETDWAGLLHLVKFYKSRRFDENIGLPRPNDFRDFSEY 311
            ++ W GL  L   Y+ R FD    +P P DF D +EY
Sbjct: 251 ADSGWRGLDFLFCAYRERYFDATSRVPLPADFTDEAEY 288


>gi|269861284|ref|XP_002650354.1| transcription initiation factor TFIID subunit [Enterocytozoon
           bieneusi H348]
 gi|220066214|gb|EED43707.1| transcription initiation factor TFIID subunit [Enterocytozoon
           bieneusi H348]
          Length = 1331

 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 58/283 (20%), Positives = 124/283 (43%), Gaps = 26/283 (9%)

Query: 182 RSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSF 241
           + S    + ++RAD + E   +I  NQP  ++I QL  D  V+ Q +A+   +   + S 
Sbjct: 545 KKSDNGTVLFVRADCKREIFGKIRINQPEALYIEQL-LDKSVIGQFEAVEFFKN--NTSE 601

Query: 242 NVVNTLNNFLSDSKAFWRVRIEAAYALANTA--------SEETDWAGLLHLVKFYKSRRF 293
             +  L     +   F RV+ E    L N          ++E+++ GL+ L++F+   + 
Sbjct: 602 KAIEALERVAENPHTFHRVKNEIYKVLKNIYFQHYGNDNNKESNYNGLMRLIQFFIKLKC 661

Query: 294 DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREA-----VEFVLQLLKYNDN 348
             N  + +  +    + + + +++   +       N    E      + F+  ++ YND 
Sbjct: 662 VPNSTIVKNTELGILN-FLIQKSLVKNITDFENEQNVKSNEKNINVIIAFLNNIINYNDT 720

Query: 349 NGNPYSDVFWLAALVQSVGELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGILTISCI 408
               ++D F+L+ ++  +   +    S L   +L   I+R    D + PS + +++ +C+
Sbjct: 721 TKEHFNDPFYLSNILNKLCIYKIHLSSTLDDQNLFSEIERFRFLDMVFPSPHNVISKTCL 780

Query: 409 RTLTQIA----LKLSGFISLDQVVKLIKPFRDFNTIWQVRVEA 447
             L ++A    + LS    LD     I  + +F ++ Q+ +E 
Sbjct: 781 ICLLRLAYAKKINLSKKFLLD-----ISKYPNFISLRQIAIEG 818


>gi|167526722|ref|XP_001747694.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773798|gb|EDQ87434.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1159

 Score = 59.3 bits (142), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 54/247 (21%), Positives = 106/247 (42%), Gaps = 36/247 (14%)

Query: 183 SSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFN 242
           +S +S + W++AD  M+++A +   Q    W    E +  V  Q  AI AL        +
Sbjct: 601 NSTKSTIMWVQADVNMKWMARVEMKQEWWAWGYAAEHERHVGWQVDAIHALRQFK--DKD 658

Query: 243 VVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF--------- 293
            +  L+   ++ + + R R +A  +LA  +          +LVK+ + R           
Sbjct: 659 AIKLLDEIAANERYYHRARRQAILSLAQMSDAPVK-ERFQYLVKYLRKRYCAPPSMELIR 717

Query: 294 ---DENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNG 350
               E   +P  NDF+D   Y VL  +           NK+  +A++ ++ +L+Y+DN+ 
Sbjct: 718 AGKTEIPVIPMENDFKDIRNYLVLRLV--------ETPNKTCEDALQVLIDMLQYSDNSD 769

Query: 351 NPYSDVFWLAALVQSVGELEF------GQQSILFLSSLLKRIDRLLQ-------FDRLMP 397
           N + D  + A ++ ++  +            I+    L ++ + ++Q       +D + P
Sbjct: 770 NMFDDGEYRAEIINAMVNVLLRSWRMPSGSRIMTEEQLCEKTETVIQLLVHQLSYDTVDP 829

Query: 398 SYNGILT 404
           SYN  +T
Sbjct: 830 SYNQPVT 836


>gi|355723103|gb|AES07784.1| TAF2 RNA polymerase II, TATA box binding protein -associated
           factor, 150kDa [Mustela putorius furo]
          Length = 424

 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 63/126 (50%), Gaps = 15/126 (11%)

Query: 339 VLQLLKYNDNNGNPYSDVFWLA----ALVQSVGELEFGQQSILFLSSL-------LKRID 387
           +L L+KYNDN  N +SD ++ A    AL  SV         +  L +L       L+ I 
Sbjct: 1   ILDLIKYNDNRKNKFSDNYYRAEMIDALANSVTPAVSVNNEVRTLDNLNPDVRLILEEIT 60

Query: 388 RLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIWQVRVEA 447
           R L  ++L+PSY   +T+SC+R +    L+ +G +  D    L K + ++     +R+ A
Sbjct: 61  RFLNMEKLLPSYRHTITVSCLRAIR--VLQKNGHVPSDPA--LFKSYAEYGHFVDIRIAA 116

Query: 448 SRALLD 453
             A++D
Sbjct: 117 LEAVVD 122


>gi|331214257|ref|XP_003319810.1| hypothetical protein PGTG_01984 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|309298800|gb|EFP75391.1| hypothetical protein PGTG_01984 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 279

 Score = 57.0 bits (136), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 64/124 (51%), Gaps = 12/124 (9%)

Query: 192 IRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL 251
           +R D E+E++ E        MW+ QL++D DVVAQ +A+ AL+ +P  S  V + L    
Sbjct: 115 VRLDVEVEWIFEFSIEMKEFMWLEQLQRDHDVVAQVEAVRALKTIP--SKVVSSHLCRVA 172

Query: 252 SDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSR-----RFDENI-----GLPR 301
              +  + +R EA  AL + A+      GL HL+K ++SR     R + +       +P+
Sbjct: 173 PVLEYIFHMRNEAVLALDSCATCRCGVLGLFHLLKLFQSRYCYPPRVEPDSPWHIRPIPK 232

Query: 302 PNDF 305
           PN F
Sbjct: 233 PNQF 236


>gi|26341738|dbj|BAC34531.1| unnamed protein product [Mus musculus]
          Length = 119

 Score = 53.5 bits (127), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 37/119 (31%), Positives = 58/119 (48%), Gaps = 6/119 (5%)

Query: 198 MEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
           M  L ++ F Q   MW  +L  + DVVAQ ++I ALE  P  +  +   L + L   + F
Sbjct: 1   MSVLRKVEFEQADFMWQYELRYERDVVAQQESILALEKFPTPASRL--ALTDILEQEQCF 58

Query: 258 WRVRIEAAYALANTA-SEETDWAGLLHLVKFYKSRRF--DENIGLPRPNDFRDFSEYFV 313
           +RVR+ A + LA  A S  + W G   +   + +R F       + + N+F  F  YF+
Sbjct: 59  YRVRMSACFCLAKIANSMVSTWTGPPAMKSLF-TRMFCCKTCPNIVKTNNFMSFQSYFL 116


>gi|326435506|gb|EGD81076.1| hypothetical protein PTSG_11021 [Salpingoeca sp. ATCC 50818]
          Length = 498

 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 43/191 (22%), Positives = 85/191 (44%), Gaps = 17/191 (8%)

Query: 242 NVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFYKSRRF--DENIG- 298
           +VV TL +   + + ++ VR +A  ALA   + + D  G++  ++ +  R +    + G 
Sbjct: 9   SVVFTLLDAAENEQLYFHVRTQAVLALAQCENVDEDQHGIVAALRAFYERHYCAKSDFGF 68

Query: 299 LPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFW 358
           +P  NDF +F  YF+ +A   A++ ++      P E VE +  ++   D   N Y D  +
Sbjct: 69  VPLENDFSNFPAYFLQQATLTALSRLKNTYGVPPLEIVEMLHTIVLATDQGSNQYDDATF 128

Query: 359 ----LAALVQSV--------GELEFGQQSIL--FLSSLLKRIDRLLQFDRLMPSYNGILT 404
               LA    ++         ++   QQ+    ++   +      + +D +  SY   +T
Sbjct: 129 RRHTLACFAHALRPHKRRNAQDMPADQQAKRQEWVRRAVSLFAHCISYDHVSASYKKRVT 188

Query: 405 ISCIRTLTQIA 415
           + C+  LT  A
Sbjct: 189 VGCVDALTSFA 199


>gi|298243623|ref|ZP_06967430.1| Peptidase M1 membrane alanine aminopeptidase [Ktedonobacter
           racemifer DSM 44963]
 gi|297556677|gb|EFH90541.1| Peptidase M1 membrane alanine aminopeptidase [Ktedonobacter
           racemifer DSM 44963]
          Length = 889

 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/278 (21%), Positives = 106/278 (38%), Gaps = 62/278 (22%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEF-RHFANKVGNLERPFLKEFFPRWV 59
           ML  Q+G + FR+ +++ + R +     R + T +  R      G      L  FF +W+
Sbjct: 421 MLRNQLGDSAFRRGIKSYLERNR----TREVITADLERALEEATGRS----LAHFFHQWI 472

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
            + G P   + +S++ +  + ++ V +    K D  TP                 P  ++
Sbjct: 473 YSGGYPEFEVSYSWDGQHKLAKVKVRQ--AQKVDDLTPCFKV-------------PTEIA 517

Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
             +   +G  +   +P+            H  L               G D   + +  L
Sbjct: 518 FTIAASEGSDETRTIPL------------HVTL---------------GEDGEVEQIFYL 550

Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL--EALP 237
                +E     +R DPE   L  + F +P  + + QLE D D++ + +A  AL     P
Sbjct: 551 ----PLEREPVMVRFDPEGRLLKSLKFERPTHLLVYQLEHDADILGRVEAAEALADRREP 606

Query: 238 HLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEE 275
                ++N LN     S  FW VR  AA A+A   + E
Sbjct: 607 QALTALINALN-----SDTFWGVRTTAAEAIAKHGTAE 639


>gi|296004883|ref|XP_002808789.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
            3D7]
 gi|225632177|emb|CAX64062.1| conserved Plasmodium protein, unknown function [Plasmodium falciparum
            3D7]
          Length = 3489

 Score = 50.1 bits (118), Expect = 0.005,   Method: Composition-based stats.
 Identities = 29/84 (34%), Positives = 45/84 (53%), Gaps = 4/84 (4%)

Query: 169  CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
            C DN   VA +  ++ +  P+ WIR D +   L  I  +Q + MWI QL  D +V++Q +
Sbjct: 2075 CLDNRKLVAKICSKNKI--PILWIRIDNDFVLLGRIRRSQSISMWIQQLYNDNNVLSQLE 2132

Query: 229  AIAALEALP--HLSFNVVNTLNNF 250
            +  AL  +P   L+ NV +  N F
Sbjct: 2133 SSFALAFIPFFFLNKNVFSNYNEF 2156


>gi|83317854|ref|XP_731343.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23491352|gb|EAA22908.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 1684

 Score = 49.7 bits (117), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 37/136 (27%), Positives = 64/136 (47%), Gaps = 6/136 (4%)

Query: 169  CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
            C DN   VA +  ++ +  PL WIR D +   +  I   Q V MWI QL  D +V+AQ +
Sbjct: 1329 CLDNRKLVAKICSKNKI--PLLWIRVDNDFYIIGRIRRTQSVSMWIQQLLNDNNVLAQLE 1386

Query: 229  AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFY 288
            +  +L  +P    N  N  +NF  +   ++    +++  + N +     +  L++ +   
Sbjct: 1387 SSFSLAFIPFFFLN-KNITSNFNENKNNYFLGPXKSSTNINNNS---VHYGDLVNNITSI 1442

Query: 289  KSRRFDENIGLPRPND 304
             S   DENI    P++
Sbjct: 1443 GSVDSDENINHLDPDN 1458


>gi|385303226|gb|EIF47315.1| tfiid subunit (150 kda) involved in rna polymerase ii transcription
           initiation [Dekkera bruxellensis AWRI1499]
          Length = 394

 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 62/284 (21%), Positives = 108/284 (38%), Gaps = 68/284 (23%)

Query: 31  LSTKEFRHFANKVGNLERPFLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTV 90
           LST   +H   KV + +   L  FF  WV   G P   +   +NK++  +E+++ +  + 
Sbjct: 119 LSTSHXQHVCEKVVHHK---LDGFFENWVHNPGVPRFTISQKFNKKRMFIEMSIRQTQSS 175

Query: 91  K---------------PDSRTPVLSSN------------TDSENRDGDIGWPGMMSIRVH 123
                           P+ R  V                T  E+ +    + G ++IR+H
Sbjct: 176 TKMKRLAXENDEZLTLPEMRXYVQQFKNKYFIDEADQMITGEESFEPLPAFTGPITIRIH 235

Query: 124 ELDGM-YDHPILPMAGDAWQLLEIQCHSKLAARRALKP---------------------- 160
           E DG  Y+H +     + +   +IQ ++K    R  K                       
Sbjct: 236 EADGTPYEHILY--IREPYTKFDIQYNTKYRRTRRRKEGLLDDSEENKSSNSEGYLGDIL 293

Query: 161 -------KKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMW 213
                  K G K D  D++ +     +M+S       W+R D + E++ E H N   +M 
Sbjct: 294 MTSSEAXKWGLKEDAXDESTNIA---EMQSY---AFEWMRFDADNEWICEKHINVSDEMQ 347

Query: 214 INQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFLSDSKAF 257
            + L++D DV AQ +A+     +     +  N L   L D++ F
Sbjct: 348 ESMLQQDRDVDAQMEAVQYFGBIVRPKLHFANILLRTLLDTRYF 391


>gi|70949844|ref|XP_744296.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56524191|emb|CAH78218.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 1353

 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 62/130 (47%), Gaps = 7/130 (5%)

Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
           C DN   VA +  ++ +  PL WIR D +   +  I   Q V MWI QL  D +V+AQ +
Sbjct: 521 CLDNRKLVAKICSKNKI--PLLWIRVDNDFYIIGRIRRTQSVSMWIQQLLNDNNVLAQLE 578

Query: 229 AIAALEALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLLHLVKFY 288
           +  +L  +P    N  N  +NF +D K  + +  +    L N ++    + GL + +   
Sbjct: 579 SSFSLAFIPFFFLN-KNITSNF-NDGKNNYFMGPDKGPNLNNNSAH---YGGLANNITSM 633

Query: 289 KSRRFDENIG 298
            S   DEN+ 
Sbjct: 634 GSDGSDENMN 643


>gi|147858618|emb|CAN81008.1| hypothetical protein VITISV_011583 [Vitis vinifera]
          Length = 307

 Score = 47.0 bits (110), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 28/65 (43%), Positives = 37/65 (56%), Gaps = 9/65 (13%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGAS-PVRTLSTKEFRH--------FANKVGNLERPFL 51
           MLEKQMG   FRKILQ I+ RAQ  +  +RTLSTKE +         FA      ++ +L
Sbjct: 233 MLEKQMGPESFRKILQTIVFRAQDTTRSLRTLSTKEVQRICSQEHIWFAGIYAXEQKSYL 292

Query: 52  KEFFP 56
           ++  P
Sbjct: 293 RKCIP 297


>gi|68067458|ref|XP_675693.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56495025|emb|CAH97801.1| hypothetical protein PB000540.02.0 [Plasmodium berghei]
          Length = 926

 Score = 46.6 bits (109), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 2/74 (2%)

Query: 169 CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
           C DN   V+ +  ++ +  PL WIR D +   +  I   Q V MWI QL  D +++AQ +
Sbjct: 473 CLDNRKLVSKICSKNKI--PLLWIRVDNDFYIIGRIRRTQSVSMWIQQLLNDNNILAQLE 530

Query: 229 AIAALEALPHLSFN 242
           +  +L  +P    N
Sbjct: 531 SSFSLAFIPFFFLN 544


>gi|221052886|ref|XP_002261166.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
 gi|194247170|emb|CAQ38354.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
            knowlesi strain H]
          Length = 3228

 Score = 46.2 bits (108), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 35/69 (50%), Gaps = 2/69 (2%)

Query: 169  CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
            C DN   VA +  +  +  PL WIR D +   L  I   Q   MWI QL  D +V+AQ +
Sbjct: 1913 CLDNRKLVAKICSKGKI--PLLWIRIDNDFNILGRIRRTQSASMWIQQLCNDNNVLAQLE 1970

Query: 229  AIAALEALP 237
            +  AL  +P
Sbjct: 1971 SSFALAFIP 1979


>gi|219115988|ref|XP_002178789.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409556|gb|EEC49487.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1836

 Score = 45.8 bits (107), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 61/248 (24%), Positives = 86/248 (34%), Gaps = 87/248 (35%)

Query: 187  SPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALP--------- 237
            SPL W+R DP   Y   I   QP      QL  DGD  AQ +AI  L   P         
Sbjct: 1103 SPLLWVRVDPMGLYAGRISVCQPDACLAEQLFHDGDAAAQVEAIRTLAERPIRIQPSSKV 1162

Query: 238  HLSFNV------VNTLNNFLSDSKAFWR-------VRIEAAYALA----NTASEETD--- 277
            +  ++V      V  L + L  S A          +R +AA A+A    N A    D   
Sbjct: 1163 NAVYDVSVMELPVRVLGDCLRGSPALHSSLPHTPAIRSQAALAIAQWQNNKAPPSKDSVD 1222

Query: 278  ---WAGLLHLVKFYKSRRFDENIGLP--------RPND-------------------FRD 307
               W G+  L+ +++ R +  +I +P        + +D                   F D
Sbjct: 1223 ADSWIGINLLLHYFRERFYSNSIIMPTKFSRLALKKSDVEMNQAAAANESAGANQPTFDD 1282

Query: 308  FSEYF----------------------------VLEAIPHAVAMVRAADNKSPREAVEFV 339
              EY                             V  A+  A+A  RA D  +P  AVEF+
Sbjct: 1283 AYEYLDILEAGEERAAALEDADQVEIEEDEEYRVRSAVVTAIACTRAKDGLTPSIAVEFL 1342

Query: 340  LQLLKYND 347
              +L+  D
Sbjct: 1343 ETVLEAED 1350


>gi|156093478|ref|XP_001612778.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148801652|gb|EDL43051.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 3314

 Score = 45.4 bits (106), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 34/69 (49%), Gaps = 2/69 (2%)

Query: 169  CDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQ 228
            C DN   VA +  +  +  PL WIR D +   L  I   Q   MWI QL  D +V AQ +
Sbjct: 1980 CLDNRKLVAKICSKGKI--PLLWIRIDNDFSILGRIRRTQSASMWIQQLFNDNNVFAQLE 2037

Query: 229  AIAALEALP 237
            +  AL  +P
Sbjct: 2038 SSFALAFIP 2046


>gi|118375202|ref|XP_001020786.1| hypothetical protein TTHERM_00408940 [Tetrahymena thermophila]
 gi|89302553|gb|EAS00541.1| hypothetical protein TTHERM_00408940 [Tetrahymena thermophila SB210]
          Length = 1956

 Score = 44.3 bits (103), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 59/276 (21%), Positives = 110/276 (39%), Gaps = 36/276 (13%)

Query: 152  LAARRALKPKKGSKPDGCDDNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHF-NQPV 210
            +A R   +   G K +  +D+         +   E    WI+ DP  +YL  + F  Q +
Sbjct: 1226 VAYRSETRKNNGKKKEDDEDDSKKKQQGSKKKQDELEKVWIKIDPSNKYLLSLDFPQQNL 1285

Query: 211  QMWINQLEKDGDVVAQAQAIAALEALPHL--------SFNVVNTLNNFLSDSKAFWRVRI 262
               + QL KD     +  ++  L  L  L        S+++V  L   +        +++
Sbjct: 1286 HSLLTQLMKDKKHRYKYISVT-LYLLRQLRNKNYDQDSYDIVKNLKLIIQKDDLETDLQV 1344

Query: 263  EAAYALANTASE-----ETDWAGLLHLVKFYKSRRFDENIGLPR---PNDFRDFSEYFVL 314
            E    +   +S+     +  +  L  L+K          IG  +   PN+F+ + +Y +L
Sbjct: 1345 EILDVIIRNSSKVKKARQECFNLLEELIK-------KNVIGQDKSLLPNNFQSYKKYMIL 1397

Query: 315  EAIPHAVAM-------VRAADNKSPREAVEFVLQLLKYNDNNGNPYSDVFWLAALVQSVG 367
              I   +         V A     P   + ++L+L+K NDN+ N + D  + A L   +G
Sbjct: 1398 MTIIRGLPYCYDENISVDADKQYPPENTIRYLLKLIKENDNSINRFDDSIYRANL---IG 1454

Query: 368  ELEFGQQSILFLSSLLKRIDRLLQFDRLMPSYNGIL 403
             L        F  ++LK + R L+ + +  S N ++
Sbjct: 1455 SLLLTNHP-YFKKTILKEVIRCLELEYIECSLNDVV 1489


>gi|60688329|gb|AAH91307.1| Taf2 protein, partial [Rattus norvegicus]
          Length = 379

 Score = 43.1 bits (100), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 40/72 (55%), Gaps = 4/72 (5%)

Query: 382 LLKRIDRLLQFDRLMPSYNGILTISCIRTLTQIALKLSGFISLDQVVKLIKPFRDFNTIW 441
           +L+ I R L  ++L+PSY   +T+SC+R +    L+ +G +  D    L K + ++    
Sbjct: 8   ILEEITRFLNMEKLLPSYRHTITVSCLRAIR--VLQKNGHVPSD--ASLFKSYAEYGHFV 63

Query: 442 QVRVEASRALLD 453
            +R+ A  A++D
Sbjct: 64  DIRIAALEAVVD 75


>gi|403224177|dbj|BAM42307.1| uncharacterized protein TOT_040000676 [Theileria orientalis strain
            Shintoku]
          Length = 1773

 Score = 43.1 bits (100), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 45/182 (24%), Positives = 76/182 (41%), Gaps = 24/182 (13%)

Query: 171  DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
            DN   VA +  ++ +  PL WIR D     LA I   Q   MW  QL+ D ++ A  +A 
Sbjct: 1142 DNRSIVAKICSKTRL--PLLWIRVDSAFSMLARIRRCQSGSMWEQQLQSDNNIYALIEAC 1199

Query: 231  AALEALPH--------LSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLL 282
             AL ++          L+ +    L+  +   KA   +R    Y+L    + +    G L
Sbjct: 1200 QALGSIGKSEYVSDNPLTESACKRLDEVVKKQKAHPMIRARCVYSLVCLYNRDPRQHGYL 1259

Query: 283  H-LVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQ 341
            +     Y +R  +  I    P + R  +E++       A+ ++R     +P    +FVL 
Sbjct: 1260 YGFFTKYLTRLANLTI---HPAEARFIAEFY------KAMCLLRNDQGFTP----DFVLD 1306

Query: 342  LL 343
            +L
Sbjct: 1307 IL 1308


>gi|45383890|ref|NP_989441.1| protein disulfide-isomerase A3 precursor [Gallus gallus]
 gi|82243463|sp|Q8JG64.1|PDIA3_CHICK RecName: Full=Protein disulfide-isomerase A3; AltName:
           Full=Endoplasmic reticulum resident protein 57; Short=ER
           protein 57; Short=ERp57; AltName: Full=Glucose-regulated
           thiol oxidoreductase 58 kDa protein; Flags: Precursor
 gi|22651801|gb|AAM82759.1| glucose regulated thiol oxidoreductase protein precursor [Gallus
           gallus]
          Length = 505

 Score = 41.6 bits (96), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 37/111 (33%), Positives = 47/111 (42%), Gaps = 19/111 (17%)

Query: 24  GASPVRTLSTKEFRHFANKVGNLERP--FLKEFFPRWVGTCGCPVLRMGFSYNKRKNIVE 81
           GAS V  LS  +F       G  ERP   L EFF  W G C         +  + K IV 
Sbjct: 23  GASDVVELSDADFES-----GLAERPGLVLVEFFAPWCGHCKRLAPEYEAAATRLKGIVP 77

Query: 82  LAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMSIRVHELDGMYDHP 132
           L V  DCT           +N+++ N+ G  G+P +   R  E  G YD P
Sbjct: 78  L-VKVDCT-----------ANSNTCNKYGVSGYPTLKIFRDGEESGTYDGP 116


>gi|315426927|dbj|BAJ48546.1| aminopeptidase N [Candidatus Caldiarchaeum subterraneum]
 gi|315426988|dbj|BAJ48606.1| aminopeptidase N [Candidatus Caldiarchaeum subterraneum]
 gi|343485662|dbj|BAJ51316.1| aminopeptidase N [Candidatus Caldiarchaeum subterraneum]
          Length = 822

 Score = 41.6 bits (96), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 5/111 (4%)

Query: 184 SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNV 243
           SM++    +  DPE++ L  +   +P++  ++ +EK G VV + + I +L    H    V
Sbjct: 505 SMKNRPEHVCVDPELKVLRVLDIERPLEEALSMVEKCGSVVCRIEMIDSLAK--HGGRRV 562

Query: 244 VNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGL--LHLVKFYKSRR 292
           V  L   +S    FW V   AA AL    +EE   A L  L  VK  K RR
Sbjct: 563 VEVLERVIS-GDPFWGVSYRAAKALGEIKTEEAKAALLRVLQKVKHPKIRR 612


>gi|315428074|dbj|BAJ49661.1| aminopeptidase N [Candidatus Caldiarchaeum subterraneum]
          Length = 822

 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 5/111 (4%)

Query: 184 SMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNV 243
           SM++    +  DPE++ L  +   +P++  ++ +EK G VV + + I +L    H    V
Sbjct: 505 SMKNRPEHVCVDPELKVLRVLDIERPLEEALSMVEKCGSVVCRIEMIDSLAK--HGGRRV 562

Query: 244 VNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGL--LHLVKFYKSRR 292
           V  L   +S    FW V   AA AL    +EE   A L  L  VK  K RR
Sbjct: 563 VEVLERVIS-GDPFWGVSYRAAKALGEIKTEEAKAALLRVLQKVKHPKIRR 612


>gi|344173724|emb|CCA88897.1| D-glycero-D-manno-heptose 1,7-bisphosphate phosphatase [Ralstonia
           syzygii R24]
          Length = 199

 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 67/154 (43%), Gaps = 24/154 (15%)

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           D V  LD    ++SP  WI  D  +E +AE+  NQ     +    + G      +A A  
Sbjct: 13  DGVINLDSDQFIKSPDEWIAIDGSLEAIAEL--NQAGYQVVVATNQSGIGRGLFEAAA-- 68

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDW----AGLLHLVKFYK 289
                     +N ++  +  + A    R++A +   +TA+E  D     AG+ H +    
Sbjct: 69  ----------LNAMHEKMYKALAAQGGRVDAVFFCPHTAAEGCDCRKPKAGMFHEI---- 114

Query: 290 SRRFDENI-GLPRPND-FRDFSEYFVLEAIPHAV 321
           +RRFD ++ G+P   D  RD      + A+PH V
Sbjct: 115 ARRFDVDLTGMPVVGDSLRDLQAGVEVGAVPHLV 148


>gi|84996353|ref|XP_952898.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65303895|emb|CAI76274.1| hypothetical protein, conserved [Theileria annulata]
          Length = 1684

 Score = 40.8 bits (94), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 75/181 (41%), Gaps = 22/181 (12%)

Query: 171  DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDV---VAQA 227
            DN   VA +  ++ +  PL WIR D     LA I   Q   MW  QL  D ++   +  +
Sbjct: 1196 DNRSIVARICSKTRL--PLLWIRVDSSFSLLARIRRCQSGSMWEQQLYSDNNIYSLIEAS 1253

Query: 228  QAIAALEALPHLSFN-----VVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDWAGLL 282
            QA+ ++    ++S N         L+N +   K+   +R    Y+L    + +      L
Sbjct: 1254 QALGSIGKSEYISDNPLIESACKRLDNMIKKHKSHPVIRARCMYSLMCLYNRDPRQHKYL 1313

Query: 283  HLVKFYKSRRFDENIGLPRPNDFRDFSEYFVLEAIPHAVAMVRAADNKSPREAVEFVLQL 342
            H V F K      N+    P + R   EY+       A+ ++R     +P    +FVL +
Sbjct: 1314 HGV-FTKYLTSFSNLA-THPTETRFIMEYY------KALCLLRNEFGFTP----DFVLDI 1361

Query: 343  L 343
            L
Sbjct: 1362 L 1362


>gi|300692466|ref|YP_003753461.1| D-glycero-D-manno-heptose 1,7-bisphosphate phosphatase [Ralstonia
           solanacearum PSI07]
 gi|299079526|emb|CBJ52204.1| D-glycero-D-manno-heptose 1,7-bisphosphate phosphatase [Ralstonia
           solanacearum PSI07]
          Length = 199

 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 67/154 (43%), Gaps = 24/154 (15%)

Query: 174 DAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAAL 233
           D V  LD    ++SP  WI  D  +E +AE+  NQ     +    + G      +A A  
Sbjct: 13  DGVINLDSDQFIKSPDEWIAIDGSLEAIAEL--NQAGYQVVVATNQSGIGRGLFEAAA-- 68

Query: 234 EALPHLSFNVVNTLNNFLSDSKAFWRVRIEAAYALANTASEETDW----AGLLHLVKFYK 289
                     +N ++  +  + A    R++A +   +TA+E  D     AG+ H +    
Sbjct: 69  ----------LNAMHEKMYKALAAQGGRVDAVFFCPHTAAEGCDCRKPKAGMFHEI---- 114

Query: 290 SRRFDENI-GLPRPND-FRDFSEYFVLEAIPHAV 321
           +RRFD ++ G+P   D  RD      + A+PH V
Sbjct: 115 ARRFDVDLAGVPVVGDSLRDLQAGVEVGAVPHLV 148


>gi|154284686|ref|XP_001543138.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150406779|gb|EDN02320.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 927

 Score = 40.8 bits (94), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 3/70 (4%)

Query: 46  LERPFL--KEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNT 103
           L  PFL   E+   WV +   P+LR  + Y+  +N VEL++    ++   S   +L S T
Sbjct: 146 LLEPFLHTSEYLQTWVISTALPLLRFNYEYHW-ENDVELSLDIIQSLDSGSAVNILLSKT 204

Query: 104 DSENRDGDIG 113
           +S NR GD+G
Sbjct: 205 ESRNRGGDVG 214


>gi|429329327|gb|AFZ81086.1| bromodomain-containing protein [Babesia equi]
          Length = 1659

 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 32/63 (50%), Gaps = 2/63 (3%)

Query: 171  DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
            DN   VA +  ++ +  PL WIR DP    ++ +   Q   MW  QL  D ++ AQ +A 
Sbjct: 998  DNRSVVAKVCSKTRL--PLLWIRIDPNFSTISRMRRCQSSGMWEQQLLSDNNIFAQMEAA 1055

Query: 231  AAL 233
             AL
Sbjct: 1056 CAL 1058


>gi|375148747|ref|YP_005011188.1| Peptidase M1 membrane alanine aminopeptidase [Niastella koreensis
           GR20-10]
 gi|361062793|gb|AEW01785.1| Peptidase M1 membrane alanine aminopeptidase [Niastella koreensis
           GR20-10]
          Length = 531

 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 7/86 (8%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLERPFLKEFFPRWVG 60
           ML +Q+G   F K +Q   SR  G + V     K     + K        L+ FF +W+ 
Sbjct: 392 MLRRQLGDELFWKGIQTYYSRFSGKNAVTGDLCKAMEDVSGKS-------LQPFFQQWLF 444

Query: 61  TCGCPVLRMGFSYNKRKNIVELAVLR 86
             G P L + ++YN  K  V + V++
Sbjct: 445 RAGQPKLDLTWTYNATKKEVNITVIQ 470


>gi|408406094|ref|YP_006864078.1| membrane alanyl aminopeptidase [Candidatus Nitrososphaera gargensis
           Ga9.2]
 gi|408366690|gb|AFU60420.1| putative membrane alanyl aminopeptidase [Candidatus Nitrososphaera
           gargensis Ga9.2]
          Length = 843

 Score = 40.4 bits (93), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 64/262 (24%), Positives = 99/262 (37%), Gaps = 75/262 (28%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVRTLSTKEFRHFANKVGNLER-PFLKEFFPRWV 59
           ML   +G  +FR+ L+  + R   +    T  T + R    K+  LE    L++FF +W+
Sbjct: 412 MLRHHVGDKYFRRSLKTYLQRFANS----TAETDDLR----KIFELETGKSLQQFFDQWI 463

Query: 60  GTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDGDIGWPGMMS 119
              G P L++ FS                                   +DGDI       
Sbjct: 464 FREGHPELKVDFS-----------------------------------QDGDIA-----K 483

Query: 120 IRVHELDGMYDHPILPMAGDAWQLLEIQCHSKLAARRALKPKKGSKPDGCDDNGDAVAGL 179
           I+V ++           AGD     E      LA  R  K +   K        ++V  +
Sbjct: 484 IKVEQV----------QAGDP---FEFALDVNLAFARGKKTRYTFKIS----EKESVFQI 526

Query: 180 DMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHL 239
            + S +E    W   DPE + L  I    P +M + QL +DGD V   + + A  AL   
Sbjct: 527 PVDSEVE----WFSVDPEFKILKTISIRAPKEMLVRQL-RDGDTV--IERVEAARALKDK 579

Query: 240 SFN-VVNTLNNFLSDSKAFWRV 260
           S + V++ L + +   K FW V
Sbjct: 580 SSDAVIDALKDAVLHDK-FWGV 600


>gi|442317662|ref|YP_007357683.1| M1 family peptidase [Myxococcus stipitatus DSM 14675]
 gi|441485304|gb|AGC41999.1| M1 family peptidase [Myxococcus stipitatus DSM 14675]
          Length = 873

 Score = 40.0 bits (92), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 23/84 (27%), Positives = 38/84 (45%), Gaps = 2/84 (2%)

Query: 192 IRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAIAALEALPHLSFNVVNTLNNFL 251
           +R DP  + L  +  ++ V +W+ +L    +  A+ +A  AL       F  V  L   L
Sbjct: 530 VRVDPRRDVLGTLDVDKSVGLWMEELRSAPESRARTEAAHALGR--DGGFRAVEALGAAL 587

Query: 252 SDSKAFWRVRIEAAYALANTASEE 275
            D++ FW  R  +A AL    + E
Sbjct: 588 GDARLFWGTRAASARALGRIRTPE 611


>gi|401410606|ref|XP_003884751.1| hypothetical protein NCLIV_051480 [Neospora caninum Liverpool]
 gi|325119169|emb|CBZ54721.1| hypothetical protein NCLIV_051480 [Neospora caninum Liverpool]
          Length = 3109

 Score = 40.0 bits (92), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 46/87 (52%), Gaps = 3/87 (3%)

Query: 171  DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQAI 230
            DN   VA ++ ++ +  P+ W+RADP   ++  I   Q   MW  QL  D ++ AQ +A 
Sbjct: 1814 DNRSLVAKVNSKTKL--PVLWLRADPMQVWIGRIRRCQSGSMWEQQLLNDVNISAQLEAA 1871

Query: 231  AALEALPHLSFNVVNTLNNFLSDSKAF 257
             AL ++     +++ TL++  + S A 
Sbjct: 1872 LALGSI-RSGEDILGTLSDPAAGSGAL 1897


>gi|399217801|emb|CCF74688.1| unnamed protein product [Babesia microti strain RI]
          Length = 1529

 Score = 39.7 bits (91), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 22/59 (37%), Positives = 30/59 (50%), Gaps = 2/59 (3%)

Query: 171 DNGDAVAGLDMRSSMESPLSWIRADPEMEYLAEIHFNQPVQMWINQLEKDGDVVAQAQA 229
           DN   VA +  +S +  P+ W RADP+   +  +   Q   MW  QL  D DVV Q +A
Sbjct: 887 DNRSLVAKVCSKSRL--PVLWTRADPQFALIGRLRRCQSPAMWEQQLLADNDVVGQMEA 943


>gi|28839716|gb|AAH47732.1| TAF2 protein, partial [Homo sapiens]
          Length = 604

 Score = 39.3 bits (90), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 32/143 (22%), Positives = 62/143 (43%), Gaps = 32/143 (22%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL       +K D  +P            G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELE------IKQDYTSP------------G 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPI 133
              + G + + V ELDG ++H +
Sbjct: 557 TQKYVGPLKVTVQELDGSFNHTL 579


>gi|27370696|gb|AAH35673.1| TAF2 protein, partial [Homo sapiens]
          Length = 604

 Score = 39.3 bits (90), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 32/143 (22%), Positives = 62/143 (43%), Gaps = 32/143 (22%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL       +K D  +P            G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELE------IKQDYTSP------------G 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPI 133
              + G + + V ELDG ++H +
Sbjct: 557 TQKYVGPLKVTVQELDGSFNHTL 579


>gi|40674143|gb|AAH64830.1| TAF2 protein, partial [Homo sapiens]
          Length = 603

 Score = 39.3 bits (90), Expect = 7.3,   Method: Compositional matrix adjust.
 Identities = 32/143 (22%), Positives = 62/143 (43%), Gaps = 32/143 (22%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTSGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL       +K D  +P            G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELE------IKQDYTSP------------G 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPI 133
              + G + + V ELDG ++H +
Sbjct: 557 TQKYVGPLKVTVQELDGSFNHTL 579


>gi|76779336|gb|AAI05766.1| Taf2 protein, partial [Rattus norvegicus]
          Length = 600

 Score = 39.3 bits (90), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 37/160 (23%), Positives = 72/160 (45%), Gaps = 34/160 (21%)

Query: 1   MLEKQMGSNFFRKILQNIISRAQGASPVR---------TLSTKEF-RHFANKVGNLERPF 50
           ++E ++   F  ++   ++S A  AS  +          +ST  F +  +N  G   +P 
Sbjct: 459 LIENRISMEFMLQVFNKLLSLASTASSQKFQSHMWSQMLVSTYGFLKSISNVSGKDIQPL 518

Query: 51  LKEFFPRWVGTCGCPVLRMGFSYNKRKNIVELAVLRDCTVKPDSRTPVLSSNTDSENRDG 110
           +K+    WV   G       F++N+++N++EL + +D T      +P            G
Sbjct: 519 IKQ----WVDQSGVVKFYGSFAFNRKRNVLELEIKQDYT------SP------------G 556

Query: 111 DIGWPGMMSIRVHELDGMYDHPILPMAGDAWQLLEIQCHS 150
              + G + + V ELDG ++H  L +  ++ +  +I CHS
Sbjct: 557 TQKYVGPLKVTVQELDGSFNH-TLQIEENSLK-HDIPCHS 594


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.137    0.406 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,756,617,159
Number of Sequences: 23463169
Number of extensions: 441596528
Number of successful extensions: 970388
Number of sequences better than 100.0: 404
Number of HSP's better than 100.0 without gapping: 339
Number of HSP's successfully gapped in prelim test: 65
Number of HSP's that attempted gapping in prelim test: 968614
Number of HSP's gapped (non-prelim): 635
length of query: 707
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 557
effective length of database: 8,839,720,017
effective search space: 4923724049469
effective search space used: 4923724049469
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 81 (35.8 bits)