BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 001003
         (1192 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Vitis vinifera]
          Length = 1442

 Score = 1883 bits (4879), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 902/1172 (76%), Positives = 1008/1172 (86%), Gaps = 42/1172 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI  IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
            TAAN++E+Y+VRVQE+ S+ES+ S ETKR  +M GIS A+LELVC YRLHGNVE++ +L 
Sbjct: 61   TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121  SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
            PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+   SG   SAR+ESS+VI+LRD
Sbjct: 181  PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240

Query: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301  AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360

Query: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
            SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+  SVLTS I  IGN
Sbjct: 361  SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
            SLFFLGSRLGDSLLVQF     TS+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421  SLFFLGSRLGDSLLVQF-----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475

Query: 481  GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
            GEELSLYGSA N+TE++QKTFSF+VRDS +N+GPLKDF+YGLRINAD  ATGI+KQSNYE
Sbjct: 476  GEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYE 535

Query: 541  L--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
            L                          VELPGCKGIWTVYHK++RGHNADS++MA  DDE
Sbjct: 536  LVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDE 595

Query: 575  YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
            YHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+ RGARIL
Sbjct: 596  YHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARIL 655

Query: 635  DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 694
            DG++MTQDL            SE+STVLSVSIADPYVLL MSDG+I+LLVGDPSTCTVS+
Sbjct: 656  DGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSI 705

Query: 695  QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 754
              PA  ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG   DQGDIY 
Sbjct: 706  NIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYC 765

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            VV YESG LEIFDVPNFNCVF+VDKF+SG  H+VDT + E  +D++  ++ +SEE   QG
Sbjct: 766  VVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQG 825

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
            RKEN H++KVVELAMQRWS  HSRPFLF ILTDGTILCY AYL+EGPE+T K+++ VS  
Sbjct: 826  RKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQ 885

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
             SLS+SNVSASRLRNLRF R PLD YTREE   G    R+T+FKNI G QG FLSGSRP 
Sbjct: 886  NSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPL 945

Query: 935  WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
            W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+YDNYWP
Sbjct: 946  WFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWP 1005

Query: 995  VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
            VQK IPLK TPHQ+TYFAEKNLYPLIVSVPVLKPLN VLS L+DQE GHQ++N NLSS +
Sbjct: 1006 VQK-IPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLENDNLSSDE 1064

Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1114
            LHR+Y+V+E+EVR+LEP+++G PWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI
Sbjct: 1065 LHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAI 1124

Query: 1115 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            GTAYVQGEDVAARGRVLLFS G+N DN QNLV
Sbjct: 1125 GTAYVQGEDVAARGRVLLFSVGKNTDNSQNLV 1156


>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
          Length = 1448

 Score = 1878 bits (4865), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 902/1178 (76%), Positives = 1008/1178 (85%), Gaps = 48/1178 (4%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI  IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
            TAAN++E+Y+VRVQE+ S+ES+ S ETKR  +M GIS A+LELVC YRLHGNVE++ +L 
Sbjct: 61   TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121  SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
            PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+   SG   SAR+ESS+VI+LRD
Sbjct: 181  PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240

Query: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301  AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360

Query: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
            SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+  SVLTS I  IGN
Sbjct: 361  SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
            SLFFLGSRLGDSLLVQF     TS+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421  SLFFLGSRLGDSLLVQF-----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475

Query: 481  GEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
            GEELSLYGSA N+TE++Q      KTFSF+VRDS +N+GPLKDF+YGLRINAD  ATGI+
Sbjct: 476  GEELSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIA 535

Query: 535  KQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRM 568
            KQSNYEL                          VELPGCKGIWTVYHK++RGHNADS++M
Sbjct: 536  KQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKM 595

Query: 569  AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
            A  DDEYHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+ 
Sbjct: 596  ATKDDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYA 655

Query: 629  RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
            RGARILDG++MTQDL            SE+STVLSVSIADPYVLL MSDG+I+LLVGDPS
Sbjct: 656  RGARILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPS 705

Query: 689  TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
            TCTVS+  PA  ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG   D
Sbjct: 706  TCTVSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQD 765

Query: 749  QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
            QGDIY VV YESG LEIFDVPNFNCVF+VDKF+SG  H+VDT + E  +D++  ++ +SE
Sbjct: 766  QGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSE 825

Query: 809  EGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSD 868
            E   QGRKEN H++KVVELAMQRWS  HSRPFLF ILTDGTILCY AYL+EGPE+T K++
Sbjct: 826  EEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTE 885

Query: 869  DPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFL 928
            + VS   SLS+SNVSASRLRNLRF R PLD YTREE   G    R+T+FKNI G QG FL
Sbjct: 886  EAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFL 945

Query: 929  SGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
            SGSRP W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+
Sbjct: 946  SGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSS 1005

Query: 989  YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1048
            YDNYWPVQK IPLK TPHQ+TYFAEKNLYPLIVSVPVLKPLN VLS L+DQE GHQ++N 
Sbjct: 1006 YDNYWPVQK-IPLKGTPHQVTYFAEKNLYPLIVSVPVLKPLNHVLSSLVDQEAGHQLEND 1064

Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1108
            NLSS +LHR+Y+V+E+EVR+LEP+++G PWQTRATIPMQSSENALTVRVVTLFNTTTKEN
Sbjct: 1065 NLSSDELHRSYSVDEFEVRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKEN 1124

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            ETLLAIGTAYVQGEDVAARGRVLLFS G+N DN QNLV
Sbjct: 1125 ETLLAIGTAYVQGEDVAARGRVLLFSVGKNTDNSQNLV 1162


>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
 gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
          Length = 1461

 Score = 1870 bits (4845), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 912/1176 (77%), Positives = 1018/1176 (86%), Gaps = 31/1176 (2%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELP-SKRGIGPVPNLV 59
            MS+AAYKM+HWPTGI +C SG+ITHSRAD+VPQIP IQT+ LDSE P SKRGIGP+PNL+
Sbjct: 1    MSYAAYKMLHWPTGIESCASGYITHSRADFVPQIPPIQTDNLDSEWPPSKRGIGPMPNLI 60

Query: 60   VTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119
            VTA +V+E+YVVRVQE+GS+ES++S ETKR  LMDG+S ASLELVCHYRLHGNVES+ +L
Sbjct: 61   VTAGSVLEVYVVRVQEDGSRESRSSRETKRGGLMDGVSGASLELVCHYRLHGNVESMVVL 120

Query: 120  SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179
               G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEWLHLKRGRESFAR
Sbjct: 121  PTEGGDSSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFAR 180

Query: 180  GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
            GPL+KVDPQGRCGG+LVY +QMIIL+A+Q  SGLVGD+D   SGG  SAR++SS+VINLR
Sbjct: 181  GPLLKVDPQGRCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINLR 240

Query: 240  DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
            D+DMKHVKDFIF+H YIEPV+VILHERELTWAGRVSWKHHTCMISALSISTTLKQ  LIW
Sbjct: 241  DMDMKHVKDFIFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLIW 300

Query: 300  SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
            S +NLPHDAYKLLAVP PIGGVLV+ ANTIHYHS+SA+ ALALNNYAVS+DSSQELPR+S
Sbjct: 301  SVVNLPHDAYKLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRAS 360

Query: 360  FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419
            FSVELDA  A WL NDVALLS K G+L+LL++VYDGRVVQRLDLSK+  SVLTSDITTIG
Sbjct: 361  FSVELDAVKAAWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTIG 420

Query: 420  NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
            NSLFFLGSRLGDSLLVQFT G G S++SSGLKEE G+IE D PS KRL+RS+SD LQDMV
Sbjct: 421  NSLFFLGSRLGDSLLVQFTNGLGPSVVSSGLKEEVGEIEGDVPSAKRLKRSASDGLQDMV 480

Query: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
            +GEELSLYGS +NNTESAQK+FSFAVRDSL+N+GPLKDFSYGLR N DASATGI+KQSNY
Sbjct: 481  SGEELSLYGSTANNTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSNY 540

Query: 540  EL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 573
            +L                          V+LPGC+GIWTVYHK++RGHN D S+MAA  D
Sbjct: 541  DLVCCSGHGKNGTLCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAAD 600

Query: 574  EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
            EYHAYLIIS+EARTMVLETADLL+EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI
Sbjct: 601  EYHAYLIISMEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 660

Query: 634  LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
            LDGS+MTQDLS G SNSES  GSE++TV SVSIADPYVL+ M+DGSIRLL+GD STC VS
Sbjct: 661  LDGSFMTQDLSIGSSNSESSPGSESATVSSVSIADPYVLIKMTDGSIRLLIGDSSTCMVS 720

Query: 694  VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG---ADGGPLDQG 750
            + TP+A E+S++ VS+CTLYHDKGPEPWLRK STDAWLSTGV EAIDG   ADGGP DQG
Sbjct: 721  INTPSAFENSERSVSACTLYHDKGPEPWLRKASTDAWLSTGVSEAIDGAESADGGPHDQG 780

Query: 751  DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
            DIY +VCYESGALEIFDVPNFN VF+VDKFVSG+TH+ D Y+RE  KDS+ + N  SEE 
Sbjct: 781  DIYCIVCYESGALEIFDVPNFNRVFSVDKFVSGKTHLADAYVREPPKDSQEKTNRISEEV 840

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
             G GRKEN H+MK VELAMQRWS HHSRPFLF +LTDGTILCY AYLFE P+ TSK++D 
Sbjct: 841  AGLGRKENAHNMKAVELAMQRWSGHHSRPFLFGVLTDGTILCYHAYLFEAPDATSKTEDS 900

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
            VS    + + ++SASRLRNLRF R PLD+Y +EET     CQRITIF NISGHQGFFL G
Sbjct: 901  VSAQNPVGLGSISASRLRNLRFVRVPLDSYIKEETSTENSCQRITIFNNISGHQGFFLLG 960

Query: 931  SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
            SRP W MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG LKICQLPS S YD
Sbjct: 961  SRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSFSNYD 1020

Query: 991  NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
            NYWPVQK IPLK TPHQ+TYF EKNLYPLIVSVPV KP+NQVLS L+DQEVGHQI+NHNL
Sbjct: 1021 NYWPVQK-IPLKGTPHQVTYFPEKNLYPLIVSVPVHKPVNQVLSSLVDQEVGHQIENHNL 1079

Query: 1051 SSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET 1110
            SS +L +TY+VEE+EVRILE +  GGPWQT+ATIPMQSSENALTVRVVTLFN TTKENET
Sbjct: 1080 SSDELLQTYSVEEFEVRILESENGGGPWQTKATIPMQSSENALTVRVVTLFNATTKENET 1139

Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            LLAIGTAYVQGEDVAARGRVLLFS  ++ +N Q LV
Sbjct: 1140 LLAIGTAYVQGEDVAARGRVLLFSVVKSTENSQVLV 1175


>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1447

 Score = 1843 bits (4773), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 886/1174 (75%), Positives = 1013/1174 (86%), Gaps = 40/1174 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
            MSFAAYKMM  PTGI NC +GF+THSR+D+VP    +Q ++LD+E PS+    +G +PNL
Sbjct: 1    MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAEWPSRPRHHVGSLPNL 56

Query: 59   VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAI 118
            VVTAANV+E+Y VR+QE+  +  K + +++R  L+DGI+ ASLELVCHYRLHGNVE++A+
Sbjct: 57   VVTAANVLEVYAVRLQED--QPPKAAADSRRGALLDGIAGASLELVCHYRLHGNVETMAV 114

Query: 119  LSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA 178
            LS GG D SRRRDSI+L F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE FA
Sbjct: 115  LSIGGGDVSRRRDSIMLTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFA 174

Query: 179  RGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
            RGP+VKVDPQGRCGGVL+Y LQMIILKA+Q GSGLVG++D  GS G  +ARIESS++INL
Sbjct: 175  RGPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMINL 234

Query: 239  RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
            RDLDM+HVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI
Sbjct: 235  RDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 294

Query: 299  WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
            WSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALN+YAV+LDSSQE+PRS
Sbjct: 295  WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPRS 354

Query: 359  SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            SF+VELDAA+ATWL +DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+  SVL+S ITTI
Sbjct: 355  SFNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTI 414

Query: 419  GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
            GNSLFFL SRLGDS+LVQF+CGSG SMLSS LKEE GDIEADAPS KRLRRS SDALQDM
Sbjct: 415  GNSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPS-KRLRRSPSDALQDM 473

Query: 479  VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN 538
            V+GEELSLYGSA N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQSN
Sbjct: 474  VSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 533

Query: 539  YEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
            YEL                          VELPGCKGIWTVYHKS+R HNADSS+MA  D
Sbjct: 534  YELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDD 593

Query: 573  DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            DEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGR RVIQV+ERGAR
Sbjct: 594  DEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGAR 653

Query: 633  ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
            ILDGS+MTQD+SFG SN ESGS S+++  LSVSIADP+VLL MSDGSIRLL+GDPSTCT+
Sbjct: 654  ILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCTI 713

Query: 693  SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
            SV +PA+ ESSK  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE IDG DG   D GDI
Sbjct: 714  SVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGETIDGTDGAAQDHGDI 773

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            Y VVC+++G LEIFDVPNFNCVF+V+ F+SG++H+VD  M+E LKDS+       +    
Sbjct: 774  YCVVCFDNGNLEIFDVPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVIN 830

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QGRKENI  MKVVELAMQRWS  HSRPFLF IL+DGTILCY AYL+E P++TSK +D  S
Sbjct: 831  QGRKENIPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDSTSKVEDSAS 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
               S+ +S+ + SRLRNLRF R PLDAY RE+T +G PCQ+ITIFKNI  ++GFFLSGSR
Sbjct: 891  AGGSIGLSSTNVSRLRNLRFVRVPLDAYAREDTSNGPPCQQITIFKNIGSYEGFFLSGSR 950

Query: 933  PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
            P W MV RERLRVHPQLCDGSIVAFTVLHNVNCN G IYVTSQG+LKICQLPSGS YD+Y
Sbjct: 951  PAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSY 1010

Query: 993  WPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1052
            WPVQK IPLKATPHQ+TYFAEKNLYPLIVS PVLKPLNQV+S L+DQ++ HQ ++ N++ 
Sbjct: 1011 WPVQK-IPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVIS-LVDQDINHQNESQNMNP 1068

Query: 1053 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLL 1112
             + +R Y ++E+EVRI+EP+++GGPWQT+ATIPMQSSENALTVR+VTL NTT+KENETLL
Sbjct: 1069 DEQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLVNTTSKENETLL 1128

Query: 1113 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            AIGTAYVQGEDVAARGR+LLFS G+N DNPQ LV
Sbjct: 1129 AIGTAYVQGEDVAARGRILLFSLGKNTDNPQTLV 1162


>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1449

 Score = 1823 bits (4721), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 887/1175 (75%), Positives = 1015/1175 (86%), Gaps = 40/1175 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDS-ELPSK--RGIGPVPN 57
            MSFAAYKMM  PTGI NC +GF+THSR+D+VP    +Q ++LD+ E PS+    +GP+PN
Sbjct: 1    MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAAEWPSRPRHHVGPLPN 56

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            LVVTAANV+E+Y VR+QE+  +    S +++R  L+DGI+ ASLEL CHYRLHGNVE++A
Sbjct: 57   LVVTAANVLEVYAVRLQED-QQPKDASDDSRRGTLLDGIAGASLELECHYRLHGNVETMA 115

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +LS GG D SR+RDSIIL F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE F
Sbjct: 116  VLSIGGGDVSRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQF 175

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
            ARGP+VK+DPQGRCGGVL+Y LQMIILKA+Q GSGLVGD+D FGS G  +ARIESS++IN
Sbjct: 176  ARGPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMIN 235

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDLDM+HVKDF FV+GYIEPVMVILHERELTWAGRVSW HHTCMISALSISTTLKQHPL
Sbjct: 236  LRDLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHPL 295

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+GANTIHYHSQSASCALALNNYAV+LDSSQE+PR
Sbjct: 296  IWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIPR 355

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            SSF+VELDAA+ATWL +DVALLSTKTG+L+LL +VYDGRVVQRLDLSK+  SVL+S ITT
Sbjct: 356  SSFNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGITT 415

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            IGNSLFFL SRLGDS+LVQF+CGSG SM+SS LKEE GDIE DAPS KRLRRS SDALQD
Sbjct: 416  IGNSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPS-KRLRRSPSDALQD 474

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
            MV+GEELSLYGSA+N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQS
Sbjct: 475  MVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 534

Query: 538  NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYEL                          VELPGCKGIWTVYHKS+R HNADSS+MA  
Sbjct: 535  NYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADD 594

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            DDEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGRRRVIQV+ERGA
Sbjct: 595  DDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGA 654

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+MTQD+SFG SNSESGS SE++  LSVSIADP+VLL MSDGSIRLL+GDPSTCT
Sbjct: 655  RILDGSFMTQDVSFGASNSESGSASESAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCT 714

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            +SV +PA+ ESSK  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG DG   D GD
Sbjct: 715  ISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGD 774

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY VVC+++G LEIFD+PNFNCVF+V+ F+SG++H+VD  M+E LKDS+       +   
Sbjct: 775  IYCVVCFDNGNLEIFDIPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVV 831

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
             QGRK+NI +MKVVELAMQRWS  HSRPFLF IL+DGTILCY AYL+E P+ TSK +D  
Sbjct: 832  NQGRKDNIPNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSA 891

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S   S+ +S+ + SRLRNLRF R PLDAY RE+T +G+PCQ+ITIFKNI  +QGFFLSGS
Sbjct: 892  SAGGSIGLSSTNVSRLRNLRFVRVPLDAYPREDTSNGSPCQQITIFKNIGSYQGFFLSGS 951

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP W MV RERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG+LKICQLPSGS YD+
Sbjct: 952  RPAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDS 1011

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
            YWPVQK IPLKATPHQ+TYFAEKNLYPLIVS PVLKPLNQV+S L+DQ+  HQ ++ N++
Sbjct: 1012 YWPVQK-IPLKATPHQVTYFAEKNLYPLIVSFPVLKPLNQVIS-LVDQDFNHQNESQNMN 1069

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
              + +R Y ++E+EVRI+EP+++GGPWQT+ATIPMQSSENALTVR+VTL NTT+KENETL
Sbjct: 1070 PDEQNRFYPIDEFEVRIMEPEKSGGPWQTKATIPMQSSENALTVRMVTLLNTTSKENETL 1129

Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            LAIGTAYVQGEDVAARGR+LLFS G+  DNPQ LV
Sbjct: 1130 LAIGTAYVQGEDVAARGRILLFSLGKITDNPQTLV 1164


>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
 gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
          Length = 1455

 Score = 1818 bits (4710), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 892/1181 (75%), Positives = 996/1181 (84%), Gaps = 47/1181 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKR----GIGPVP 56
            MS+AAYKMMHWPT I  C SGF+THSR++    +P + T++LDS+ PS+R    GIGP P
Sbjct: 1    MSYAAYKMMHWPTTIDTCVSGFVTHSRSESA-HLPQLHTDDLDSDWPSRRRHGGGIGPTP 59

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V + NV+E+YVVRVQEEG++   +SGE KR  +MDG++ ASLELVCHYRLHGNVES+
Sbjct: 60   NLIVASGNVLELYVVRVQEEGAR---SSGELKRGGVMDGVAGASLELVCHYRLHGNVESM 116

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +LS  G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE P+W HLKRGRES
Sbjct: 117  GVLSVEGGDDSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRES 176

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
            FARGPLVKVDPQGRCGGVLVY LQMIILKA+Q GS LV DED FGSG   SA I SS++I
Sbjct: 177  FARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYII 236

Query: 237  NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
            NLRDLDMKHVKDFIFVH YIEPV+V+LHERELTWAGRV WKHHTCMISALSISTTLKQ  
Sbjct: 237  NLRDLDMKHVKDFIFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQPT 296

Query: 297  LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
            LIWS  NLPHDAYKLLAVPSPIGGVLV+G NTIHYHS+SASCALALN+YA S+DSSQELP
Sbjct: 297  LIWSIGNLPHDAYKLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQELP 356

Query: 357  RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
            R++FSVELDAA+ATWL  DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+  SVLTSDIT
Sbjct: 357  RATFSVELDAANATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDIT 416

Query: 417  TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ 476
            T+GNS FFLGSRLGDSLLVQFT G G+SMLS GLKEE GDIE D PS KRL+ SSSDALQ
Sbjct: 417  TLGNSFFFLGSRLGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDLPSAKRLKVSSSDALQ 476

Query: 477  DMVNGEELSLYGSASNNTESAQ-----KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
            DMV+GEELSLY SA NN ES+Q     KTFSF VRDSL+N+GPLKDF+YGLRINADA+AT
Sbjct: 477  DMVSGEELSLYSSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANAT 536

Query: 532  GISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADS 565
            GISKQSNYEL                          VELPGCKGIWTVYHK++R H+ DS
Sbjct: 537  GISKQSNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARIHSVDS 596

Query: 566  SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
             +MA+ DDEYHAYLIIS+EARTMVLETAD LTEVTESVDYFVQGRTIAAGNLFGRRRV+Q
Sbjct: 597  LKMAS-DDEYHAYLIISMEARTMVLETADHLTEVTESVDYFVQGRTIAAGNLFGRRRVVQ 655

Query: 626  VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
            VFERGARILDGS+MTQDLSFG SNSE+G  SE+STV+ VSI DPYVL+ M+DGSI++LVG
Sbjct: 656  VFERGARILDGSFMTQDLSFGGSNSETGR-SESSTVMHVSIVDPYVLVRMADGSIQILVG 714

Query: 686  DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
            DPS CTVSV TP+A +SS K VS+CTLYHDKGPEPWLRKTSTDAWLSTG+ EAIDGAD G
Sbjct: 715  DPSACTVSVNTPSAFQSSTKSVSACTLYHDKGPEPWLRKTSTDAWLSTGISEAIDGADSG 774

Query: 746  PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
              +QGDIY VVCYE+GALEIFDVPNFN VF VDKFVSG+TH++DT   E  KD    +  
Sbjct: 775  AHEQGDIYCVVCYETGALEIFDVPNFNSVFFVDKFVSGKTHLLDTCTGEPAKDM---MKG 831

Query: 806  SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
              EE  G GRKE+  +MKVVEL M RWS  HSRPFLF ILTDGTILCY AYLFEGP+ TS
Sbjct: 832  VKEEVAGAGRKESTQNMKVVELTMLRWSGRHSRPFLFGILTDGTILCYHAYLFEGPDGTS 891

Query: 866  KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
            K +D VS   S+  S +SASRLRNLRF R PLD YTREET     CQRIT FKNISG+QG
Sbjct: 892  KLEDSVSAQNSVGASTISASRLRNLRFVRVPLDTYTREETSSETSCQRITTFKNISGYQG 951

Query: 926  FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
            FFLSGSRP W MVFRERLRVHPQLCDGSIVAFTVLH VNCNHG IYVTSQG LKIC L S
Sbjct: 952  FFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSS 1011

Query: 986  GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI 1045
             S+YDNYWPVQK IPLK TPHQ+TYFAE+NLYPLIVSVPV KP+NQVLS L+DQEVGHQI
Sbjct: 1012 VSSYDNYWPVQK-IPLKGTPHQVTYFAERNLYPLIVSVPVQKPVNQVLSSLVDQEVGHQI 1070

Query: 1046 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1105
            +NHNLSS ++HRTY+V+E+EVRILEP  + GPWQ +ATIPMQ+SENALTVR+V+LFNT+T
Sbjct: 1071 ENHNLSSEEIHRTYSVDEFEVRILEP--SNGPWQVKATIPMQTSENALTVRMVSLFNTST 1128

Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            KENETLLA+GTAYVQGEDVAARGR+LLFS  +N +N Q LV
Sbjct: 1129 KENETLLAVGTAYVQGEDVAARGRILLFSVVKNPENSQILV 1169


>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
 gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
            kDa subunit
 gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
          Length = 1442

 Score = 1759 bits (4556), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 858/1175 (73%), Positives = 994/1175 (84%), Gaps = 39/1175 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
            MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TAAN++E+Y+VR QEEG ++E +N    KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR  +SD  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
             +  EELSL+GS  NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 538  NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYEL                          VELPGCKGIWTVYHKSSRGHNADSS+MAA 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600  EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D  DGGP DQGD
Sbjct: 720  VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D  + E     E E+N +SE+ T
Sbjct: 780  IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
                 + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836  S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S+    ++++  +S+LRNL+F R PLD  TRE T  G   QRIT+FKNISGHQGFFLSGS
Sbjct: 892  SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
            YWPVQK IPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+DNHN+S
Sbjct: 1012 YWPVQK-IPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMS 1070

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
            S DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T ENETL
Sbjct: 1071 SDDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETL 1130

Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            LA+GTAYVQGEDVAARGRVLLFS G+N DN QN+V
Sbjct: 1131 LAVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVV 1165


>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
            subunit [Arabidopsis thaliana]
          Length = 1442

 Score = 1756 bits (4548), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 857/1175 (72%), Positives = 993/1175 (84%), Gaps = 39/1175 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
            MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TAAN++E+Y+VR QEEG ++E +N    KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR  +SD  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
             +  EELSL+GS  +N++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480  TIGNEELSLFGSTPDNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 538  NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYEL                          VELPGCKGIWTVYHKSSRGHNADSS+MAA 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600  EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D  DGGP DQGD
Sbjct: 720  VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D  + E     E E+N +SE+ T
Sbjct: 780  IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
                 + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836  S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S     ++++  +S+LRNL+F R PLD  TRE T  G   QRIT+FKNISGHQGFFLSGS
Sbjct: 892  SPENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
            YWPVQK IPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+DNHN+S
Sbjct: 1012 YWPVQK-IPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQLDNHNMS 1070

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
            S DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T ENETL
Sbjct: 1071 SDDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETL 1130

Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            LA+GTAYVQGEDVAARGRVLLFS G+N DN QN+V
Sbjct: 1131 LAVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVV 1165


>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1444

 Score = 1753 bits (4541), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 860/1175 (73%), Positives = 991/1175 (84%), Gaps = 37/1175 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQ-TEELDSELPS-KRGIGPVPNL 58
            MSFAA+KMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAFKMMHWPTGVENCASGYITHSLSDSTLQIPIVSGDDDMEAEWPNHKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TA N++E+Y+VR QEEG ++E +     KR  +MDG+S  SLELVCHYRLHGNVES+A
Sbjct: 61   VITAGNILEVYIVRAQEEGNTQELRIPKLVKRGGVMDGVSGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILKASQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKASQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI+TTLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINTTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ +DVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISSDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRL R SSD  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRL-RISSDTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
             +  EELSL+GS  NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 538  NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYEL                          VELPGCKGIWTVYHKSSRGHNADSS+MAA 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSVRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +DEYHAYLIIS+EARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600  EDEYHAYLIISVEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+M Q+LSFG  NSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGAPNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            VS+ +P+ +E SKK +S+CTL+HDKGPEPWLRK STDAWLS+GVGEA+D ADGGP DQGD
Sbjct: 720  VSISSPSVLEGSKKKISACTLFHDKGPEPWLRKASTDAWLSSGVGEAVDSADGGPQDQGD 779

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY V+CYESGALEIFDVP FNCVF+VDKF SGR H+ D  + E     E E+N +SE+  
Sbjct: 780  IYCVLCYESGALEIFDVPGFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSED-N 834

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
               R E I + KVVEL+MQRWS  H+RPFLFA+L DGTILCY AYLFEG ++T K+++ V
Sbjct: 835  ASSRNEEIKNTKVVELSMQRWSGPHTRPFLFAVLADGTILCYHAYLFEGVDST-KAENSV 893

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S+    ++++  +S+LRNL+F R P D  TRE T  G   QRIT+FKNISGHQGFFLSGS
Sbjct: 894  SSENPAALNSSGSSKLRNLKFLRIPFDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 953

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVTSQ +LKICQLPS S YDN
Sbjct: 954  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTSQVVLKICQLPSASIYDN 1013

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
            YWPVQK IPLKATPHQ+TY+AEKNLYPLIVS PV KP+NQVLS L+DQE G QIDNHNLS
Sbjct: 1014 YWPVQK-IPLKATPHQVTYYAEKNLYPLIVSYPVSKPINQVLSSLVDQEAGQQIDNHNLS 1072

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
            S DL RTYTVEE+E++ILEP+R+GGPW+T+ATIPMQSSE+ALTVRVVTL N +T ENETL
Sbjct: 1073 SDDLQRTYTVEEFEIQILEPERSGGPWETKATIPMQSSEHALTVRVVTLLNASTGENETL 1132

Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            LA+GTAYVQGEDVAARGRVLLFS G+N DN QN+V
Sbjct: 1133 LAVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVV 1167


>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
            thaliana]
          Length = 1448

 Score = 1753 bits (4540), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 858/1181 (72%), Positives = 994/1181 (84%), Gaps = 45/1181 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
            MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TAAN++E+Y+VR QEEG ++E +N    KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR  +SD  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
             +  EELSL+GS  NN++SAQ      K+FSFAVRDSLVN+GP+KDF+YGLRINADA+AT
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQVTSSVLKSFSFAVRDSLVNVGPVKDFAYGLRINADANAT 539

Query: 532  GISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADS 565
            G+SKQSNYEL                          VELPGCKGIWTVYHKSSRGHNADS
Sbjct: 540  GVSKQSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADS 599

Query: 566  SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
            S+MAA +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQ
Sbjct: 600  SKMAADEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQ 659

Query: 626  VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
            VFE GARILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVG
Sbjct: 660  VFEHGARILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVG 719

Query: 686  DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
            DPSTCTVS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D  DGG
Sbjct: 720  DPSTCTVSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGG 779

Query: 746  PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
            P DQGDIY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D  + E     E E+N 
Sbjct: 780  PQDQGDIYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNK 835

Query: 806  SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
            +SE+ T     + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T 
Sbjct: 836  NSEDNTS---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST- 891

Query: 866  KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
            K+++ +S+    ++++  +S+LRNL+F R PLD  TRE T  G   QRIT+FKNISGHQG
Sbjct: 892  KAENSLSSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQG 951

Query: 926  FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
            FFLSGSRP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS
Sbjct: 952  FFLSGSRPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPS 1011

Query: 986  GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI 1045
             S YDNYWPVQK IPLKATPHQ+TY+AEKNLYPLIVS PV KPLNQVLS L+DQE G Q+
Sbjct: 1012 ASIYDNYWPVQK-IPLKATPHQVTYYAEKNLYPLIVSYPVSKPLNQVLSSLVDQEAGQQL 1070

Query: 1046 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTT 1105
            DNHN+SS DL RTYTVEE+E++ILEP+R+GGPW+T+A IPMQ+SE+ALTVRVVTL N +T
Sbjct: 1071 DNHNMSSDDLQRTYTVEEFEIQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNAST 1130

Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
             ENETLLA+GTAYVQGEDVAARGRVLLFS G+N DN QN+V
Sbjct: 1131 GENETLLAVGTAYVQGEDVAARGRVLLFSFGKNGDNSQNVV 1171


>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Cucumis sativus]
          Length = 1504

 Score = 1713 bits (4436), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 859/1224 (70%), Positives = 977/1224 (79%), Gaps = 83/1224 (6%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MSFAAY+MMHWPTGI NC S +ITHSRAD+VP +    +++LDS+   +R IGPVPNLVV
Sbjct: 1    MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVT-SHSDDLDSDWHPRRDIGPVPNLVV 59

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
            TA NV+E+YVVRV EEG +ESK+SGE +R  +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60   TAGNVLEVYVVRVLEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILS 119

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
              G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120  SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
            P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G  SAR+ESS++INLRD
Sbjct: 180  PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239

Query: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            LD+KHVKDF+FVHGYIEPVMVILHE+ELTWAGRVSWKHHTCM+SALSISTTLKQHPLIWS
Sbjct: 240  LDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIWS 299

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            A NLPHDAYKLLAVPSPIGGVLV+ AN+IHY+SQSASC LALNNYAVS DSSQ++PRS+F
Sbjct: 300  ASNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNF 359

Query: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
            +VELDAA+ATWL NDVALLSTKTG+L+LL +VYDGRVVQRLDLSK+  SVLTS I +IGN
Sbjct: 360  NVELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGN 419

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-------------------------- 454
            SLFFLGSRLGDSLLVQF+CG G+S L+S LK+E                           
Sbjct: 420  SLFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEITYYTQNLQKEMVPPTLPSALVHESKP 479

Query: 455  ----GDIEADAPS----------------------TKRLRRSSSDALQDMVNGEELSLYG 488
                G IE +  +                        R+ R         V G+ELSLYG
Sbjct: 480  TQAKGTIELNNNNLCVENDIVDVVEVDITNMTILGENRIARRDETLTDTQVGGDELSLYG 539

Query: 489  SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------ 542
            SA+NNTESAQK FSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNYELV      
Sbjct: 540  SAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHG 599

Query: 543  --------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
                                ELPGCKGIWTVYHK++RG  ADSSRM   DDEYHAYLIIS
Sbjct: 600  KNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIIS 659

Query: 583  LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
            LEARTMVL T +LLTEVTESVDYFV GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQD
Sbjct: 660  LEARTMVLVTGELLTEVTESVDYFVHGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQD 719

Query: 643  LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
            L+   + +ESG+ SE  TVLS SI+DPYVLL M+DGSIRLLVGD S+C+VSV  PAA  S
Sbjct: 720  LNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVSAPAAFGS 779

Query: 703  SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
            SKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG   DQGDIY V CY++G 
Sbjct: 780  SKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVACYDNGD 839

Query: 763  LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
            LEIFDVPNF  VF VDKFVSG++H+VD  + +  K SE + N  S+E    GR E+  +M
Sbjct: 840  LEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQN--SQELISHGRNESSQNM 897

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
            KV+E+AMQRWS  HSRPFLF ILTDGTILCY AYLFE  ++ SK DD VS   S+S SN+
Sbjct: 898  KVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNSVSSSNM 957

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
            S+SRLRNLRF R PLD   RE+ P+G    R++IFKNISG+QG FL GSRP W MVFRER
Sbjct: 958  SSSRLRNLRFLRVPLDIQGREDMPNGTLSCRLSIFKNISGYQGLFLCGSRPAWFMVFRER 1017

Query: 943  LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLK 1002
            LRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQKV PLK
Sbjct: 1018 LRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKV-PLK 1076

Query: 1003 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE 1062
             TPHQ+TYF EKNLYP+I+S PV KPLNQVLS ++DQ+VGH ++NHNLS+ +L +TY+VE
Sbjct: 1077 GTPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGH-VENHNLSADELQQTYSVE 1135

Query: 1063 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1122
            E+E+RILEP+++GGPWQTRATI M SSENALT+RVVTL NTTTKENETLLA+GTAYVQGE
Sbjct: 1136 EFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGTAYVQGE 1195

Query: 1123 DVAARGRVLLFSTGRNADNPQNLV 1146
            DVAARGRVLLFS G++ADN Q LV
Sbjct: 1196 DVAARGRVLLFSVGKDADNSQTLV 1219


>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
          Length = 1503

 Score = 1402 bits (3628), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 715/1187 (60%), Positives = 870/1187 (73%), Gaps = 66/1187 (5%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
            MS+AAYKMMHWPTG+ +C +GF+THS +D                ++DS   + R   +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 54   PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
            P PNLVV AANV+E+Y VR +        G++ S +SG      ++DGIS A LELVC+Y
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RLHGN+ES+ +LS G A+N  RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116  RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
             HLKRGRESFA GP++K DP GRCG  L YGLQMIILKA+Q G  LVG+++   +    +
Sbjct: 173  QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
             RIESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+  KHHTCMISA S
Sbjct: 233  VRIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++ 
Sbjct: 293  ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
              D S E+ +S+F VELDAA ATW  ND+ + S+K G+++LLTVVYDGRVVQRLDL K+ 
Sbjct: 353  HPDGSPEISKSNFQVELDAAKATWFSNDIVMFSSKAGEMLLLTVVYDGRVVQRLDLMKSK 412

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S +T+IGNS FFLGSRLGDSLLVQF+ G+  S+L     E   DIE D P +KRL
Sbjct: 413  ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYGASKSVLQDLTNERSADIEGDLPFSKRL 472

Query: 468  RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
            +R  SD LQD+ + EELS     A N+ ESAQK  S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473  KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531

Query: 527  DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
            D +A G +KQSNYEL                          VELP C+GIWTVY+KS RG
Sbjct: 532  DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591

Query: 561  HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              A+       D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ  TIAAGNLFGR
Sbjct: 592  QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
            RRVIQV+ +GAR+LDGS+MTQ+L+F  +++   S SE   V   SIADPYVLL M DGS+
Sbjct: 645  RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LL+GD  TCT+SV  P+   SS + +++CTLY D+GPEPWLRKT +DAWLSTG+ EAID
Sbjct: 704  QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLRKTRSDAWLSTGIAEAID 763

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
            G      DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG   +VD + +   +DS 
Sbjct: 764  GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             E    ++      +KE   S+++VELAM RWS   SRPFLF +L DGT+LCY A+ +E 
Sbjct: 824  KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
             E+  K   P+S   S    N S SRLRNLRF R  +D  +RE+ P  G P  RIT F N
Sbjct: 881  SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938  VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997

Query: 980  ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
            ICQLPS   YDNYWPVQKV PL  TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQ
Sbjct: 998  ICQLPSAYNYDNYWPVQKV-PLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQ 1056

Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
            E  H +DN   S+  LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ  ENALTVR+VT
Sbjct: 1057 ESVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVT 1116

Query: 1100 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            L NTTTKENETLLAIGTAYV GEDVAARGRVLLFS  + ++N QNLV
Sbjct: 1117 LHNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFMK-SENSQNLV 1162


>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
          Length = 1441

 Score = 1393 bits (3605), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 713/1187 (60%), Positives = 868/1187 (73%), Gaps = 66/1187 (5%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
            MS+AAYKMMHWPTG+ +C +GF+THS +D                ++DS   + R   +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 54   PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
            P PNLVV AANV+E+Y VR +        G++ S +SG      ++DGIS A LELVC+Y
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RLHGN+ES+ +LS G A+N  RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116  RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
             HLKRGRESFA GP++K DP GRCG  L YGLQMIILKA+Q G  LVG+++   +    +
Sbjct: 173  QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+  KHHTCMISA S
Sbjct: 233  VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++ 
Sbjct: 293  ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
              D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+ 
Sbjct: 353  HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S +T+IGNS FFLGSRLGDSLLVQF+  +  S+L     E   DIE D P +KRL
Sbjct: 413  ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472

Query: 468  RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
            +R  SD LQD+ + EELS     A N+ ESAQK  S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473  KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531

Query: 527  DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
            D +A G +KQSNYEL                          VELP C+GIWTVY+KS RG
Sbjct: 532  DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591

Query: 561  HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              A+       D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ  TIAAGNLFGR
Sbjct: 592  QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
            RRVIQV+ +GAR+LDGS+MTQ+L+F  +++   S SE   V   SIADPYVLL M DGS+
Sbjct: 645  RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LL+GD  TCT+SV  P+   SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704  QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
            G      DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG   +VD + +   +DS 
Sbjct: 764  GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             E    ++      +KE   S+++VELAM RWS   SRPFLF +L DGT+LCY A+ +E 
Sbjct: 824  KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
             E+  K   P+S   S    N S SRLRNLRF R  +D  +RE+ P  G P  RIT F N
Sbjct: 881  SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938  VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997

Query: 980  ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
            ICQLPS   YD+YWPVQKV PL  TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQ
Sbjct: 998  ICQLPSAYNYDSYWPVQKV-PLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQ 1056

Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
            E  H +DN   S+  LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ  ENALTVR+VT
Sbjct: 1057 ESVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVT 1116

Query: 1100 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            L NTTTKENETLLAIGTAYV GEDVAARGRVLLFS  + ++N QNLV
Sbjct: 1117 LHNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SENSQNLV 1162


>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
 gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
          Length = 1441

 Score = 1393 bits (3605), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 713/1187 (60%), Positives = 868/1187 (73%), Gaps = 66/1187 (5%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
            MS+AAYKMMHWPTG+ +C +GF+THS +D                ++DS   + R   +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 54   PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
            P PNLVV AANV+E+Y VR +        G++ S +SG      ++DGIS A LELVC+Y
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RLHGN+ES+ +LS G A+N  RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116  RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
             HLKRGRESFA GP++K DP GRCG  L YGLQMIILKA+Q G  LVG+++   +    +
Sbjct: 173  QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+  KHHTCMISA S
Sbjct: 233  VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++ 
Sbjct: 293  ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
              D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+ 
Sbjct: 353  HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S +T+IGNS FFLGSRLGDSLLVQF+  +  S+L     E   DIE D P +KRL
Sbjct: 413  ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472

Query: 468  RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
            +R  SD LQD+ + EELS     A N+ ESAQK  S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473  KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531

Query: 527  DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
            D +A G +KQSNYEL                          VELP C+GIWTVY+KS RG
Sbjct: 532  DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591

Query: 561  HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              A+       D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ  TIAAGNLFGR
Sbjct: 592  QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
            RRVIQV+ +GAR+LDGS+MTQ+L+F  +++   S SE   V   SIADPYVLL M DGS+
Sbjct: 645  RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LL+GD  TCT+SV  P+   SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704  QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
            G      DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG   +VD + +   +DS 
Sbjct: 764  GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             E    ++      +KE   S+++VELAM RWS   SRPFLF +L DGT+LCY A+ +E 
Sbjct: 824  KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
             E+  K   P+S   S    N S SRLRNLRF R  +D  +RE+ P  G P  RIT F N
Sbjct: 881  SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938  VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997

Query: 980  ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
            ICQLPS   YD+YWPVQKV PL  TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS + DQ
Sbjct: 998  ICQLPSAYNYDSYWPVQKV-PLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSSMADQ 1056

Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
            E  H +DN   S+  LH+TYTV+E+EVRILE ++ GG W+T++TIPMQ  ENALTVR+VT
Sbjct: 1057 ESVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETKSTIPMQLFENALTVRIVT 1116

Query: 1100 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            L NTTTKENETLLAIGTAYV GEDVAARGRVLLFS  + ++N QNLV
Sbjct: 1117 LHNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SENSQNLV 1162


>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
            subunit 1-like [Brachypodium distachyon]
          Length = 1442

 Score = 1391 bits (3601), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 717/1186 (60%), Positives = 872/1186 (73%), Gaps = 63/1186 (5%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--ELDSELPSK----RGIGP 54
            MS+AAYKMMHWPTGI +C +GFITH  +D             E D  L +     + +GP
Sbjct: 1    MSYAAYKMMHWPTGIDHCAAGFITHCPSDAAAFCSAAAASGPEGDVGLVAAARHPKRLGP 60

Query: 55   VPNLVVTAANVIEIYVVRVQEEGS------KESKNSGETKRRVLMDGISAASLELVCHYR 108
             PNLVV AANV+E+Y VR     +      + S +SG      + DGIS A LELVCHYR
Sbjct: 61   TPNLVVAAANVLEVYAVRADAAAADGAGGAQPSSSSG-----AVFDGISGARLELVCHYR 115

Query: 109  LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
            LHGN+ES+AILS G A+N  RRDSI LAF DAKI+ LEFDD+IHGLR +SMHCFE PEW 
Sbjct: 116  LHGNIESMAILSDG-AEN--RRDSIALAFRDAKITCLEFDDAIHGLRTSSMHCFEGPEWQ 172

Query: 169  HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
            HLKRGRESFA GP++K DP GRCG  LVYGLQMIILK++Q G  LVG+++   +    + 
Sbjct: 173  HLKRGRESFAWGPVIKSDPLGRCGAALVYGLQMIILKSAQVGQSLVGEDEPTRALSSAAV 232

Query: 229  RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
            RIESS++I+LR LD  HVKDF FVHGYIEPV+VILHERE TWAGR+S KHHTCMISA SI
Sbjct: 233  RIESSYLIDLRALDTNHVKDFTFVHGYIEPVLVILHEREPTWAGRISSKHHTCMISAFSI 292

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
            S TLKQHP+IWSA N+PHDAY++L+VP PI GVLV+ AN+IHYHSQS SC+LALNN+A  
Sbjct: 293  SMTLKQHPMIWSAANIPHDAYQILSVPPPISGVLVICANSIHYHSQSTSCSLALNNFASQ 352

Query: 349  LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
             D S E+ + +F VELDAA ATWL ND+ + S KTG+++LLTVVYDGR VQ+LDL K+  
Sbjct: 353  PDGSPEIHKVNFHVELDAAKATWLSNDIVMFSAKTGEMLLLTVVYDGRTVQKLDLMKSKA 412

Query: 409  SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
            SV++S +TTIG+S FFLGSR+GDSLLVQF+CG  TS++     E   DIE D P +KRL+
Sbjct: 413  SVISSGVTTIGSSFFFLGSRVGDSLLVQFSCGVPTSVIPDIADERSADIEGDLPFSKRLK 472

Query: 469  RSSSDALQDMVNGEELSLYGSA-SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
            R  SD LQD+ + EELS   +   N+ ESAQK  S+ VRD+LVN+GPLKDFSYGLR+NAD
Sbjct: 473  RVPSDILQDVTSVEELSFQNNMLPNSLESAQK-ISYVVRDALVNVGPLKDFSYGLRVNAD 531

Query: 528  ASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGH 561
             +ATG +KQSNYEL                          VELP C+GIWTVY+KSSRGH
Sbjct: 532  PNATGNAKQSNYELVCCSGHGKNGALSVLQQSIRPDLITEVELPSCRGIWTVYYKSSRGH 591

Query: 562  NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
              +       D+EYHAYLIISLE+RTMVLET D L EVTE+VDY+VQG TI AGNLFGRR
Sbjct: 592  TTE-------DNEYHAYLIISLESRTMVLETGDDLGEVTETVDYYVQGATITAGNLFGRR 644

Query: 622  RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST-VLSVSIADPYVLLGMSDGSI 680
            RVIQV+  GAR+LDGS+MTQ+L+F   +SES S       V S SIADPYVLL M DG+I
Sbjct: 645  RVIQVYATGARVLDGSFMTQELNFTALSSESSSSGSEPLGVASASIADPYVLLKMVDGTI 704

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LLVGD STC +S+  P+ + S  + +S+CTLYHD+GPEPWLRKT  DAWLS+GV  A+D
Sbjct: 705  QLLVGDHSTCALSINAPSTLTSRGERISACTLYHDRGPEPWLRKTRGDAWLSSGVTVAVD 764

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
             +     DQ DIY ++CYESG LEIF+VP+F  VF+V  F SG + +VD + +   +DS 
Sbjct: 765  VSGSSSQDQSDIYCIICYESGKLEIFEVPSFRQVFSVGSFFSGESLLVDAFAQGFTEDSA 824

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
                   +E     +KE  +++++VELAM RWS   SRPFLF +L DGT+LCYQAY +EG
Sbjct: 825  ---EGRQDETKVSLKKEVANNIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYQAYCYEG 881

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
             E+  K    +S   S+ + N S SRL+NLRF R  +D  +RE+    A   RITIF N+
Sbjct: 882  LESNIKGTS-LSPDGSVDLGNASDSRLKNLRFHRVSVDITSREDISSLAR-PRITIFNNV 939

Query: 921  SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
             G++G FLSG+RP W MV R+R RVHPQLCDG I AFTVLHNVNC+HG IYVTSQG LKI
Sbjct: 940  GGYEGLFLSGTRPVWVMVCRQRFRVHPQLCDGPIEAFTVLHNVNCSHGLIYVTSQGFLKI 999

Query: 981  CQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1040
            CQLPS   YDNYWPVQK IPL  TPHQ+TY+AE++LYPLIVSVPV++PLNQVLS++ DQE
Sbjct: 1000 CQLPSAYNYDNYWPVQK-IPLHGTPHQVTYYAEQSLYPLIVSVPVVRPLNQVLSIMADQE 1058

Query: 1041 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100
            + H +DN   S+ DL +TYTVEE+EVR+LE ++ GG W+TR+TIPMQS ENALTVR+VTL
Sbjct: 1059 MIHHMDNDASSADDLQKTYTVEEFEVRVLELEKPGGRWETRSTIPMQSFENALTVRIVTL 1118

Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
             NTTTKENETL+AIGTAYVQGEDVAARGRVLLFS  + ++N QNLV
Sbjct: 1119 HNTTTKENETLMAIGTAYVQGEDVAARGRVLLFSFTK-SENSQNLV 1163


>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1452

 Score = 1126 bits (2913), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 604/1207 (50%), Positives = 786/1207 (65%), Gaps = 101/1207 (8%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADY-VPQIPLIQTEELDSELPSKRGIGPVPNLV 59
            MS+AA+KM+H PTG+ NC + ++THS  +     IPL       ++L +  G G  PNLV
Sbjct: 1    MSYAAFKMVHCPTGVDNCVAAYVTHSAGETDSDSIPLP-----GADLIASGGSGFPPNLV 55

Query: 60   VTAANVIEIYVVRVQE------EGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
            +T ANV+E++ VR+ E       GS    N   T R  LM G+S   LEL CHYRLHGNV
Sbjct: 56   ITKANVLEVFHVRLLEGDDSAANGSNGVGNPETTPRGGLMAGLSYVKLELACHYRLHGNV 115

Query: 114  ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
            ESL +LS   A+  + RD+IIL F DAKISVLEFDDS HGLRI S+H FE PEW +LKRG
Sbjct: 116  ESLGVLSYRHAEGRKGRDAIILTFRDAKISVLEFDDSTHGLRIGSLHYFEGPEWQYLKRG 175

Query: 174  RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
            RE FA GP V+ DP GRC GVL+Y  Q+++LKA+Q G GL  ++++   GG   A + +S
Sbjct: 176  REQFASGPSVRADPVGRCAGVLIYNSQLVLLKAAQVGYGLGDEDESLIMGGKLCAHVATS 235

Query: 234  HVINLRDLDMKHVKDFIFVHG--------------YIEPVMVILHERELTWAGRVSWKHH 279
            ++++LRDLDMKH+KDF+F+HG              YIEPV+V+LHE++ TWAGRV+ + H
Sbjct: 236  YIVSLRDLDMKHIKDFVFLHGKLLFLIQYIFAFSSYIEPVLVVLHEKDPTWAGRVAVRRH 295

Query: 280  TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCA 339
            TC I+ALSI+TTLKQHP IWSA NLP+DAYKLLAVP+PIGGVLV  AN++HYHSQS SCA
Sbjct: 296  TCAITALSINTTLKQHPHIWSATNLPYDAYKLLAVPAPIGGVLVFCANSLHYHSQSGSCA 355

Query: 340  LALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
            L LN +AV+ + S E PRS  SVELD AHATW+ N+VAL+STK G L+ L +VY+GR VQ
Sbjct: 356  LGLNEFAVAPEGSAEYPRSKMSVELDCAHATWVANEVALISTKNGMLLFLNLVYEGRSVQ 415

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
            RL+L+K+  SVLTS + TIG + FFLGSRL DSLLVQ T GS +   SS +    GDIEA
Sbjct: 416  RLELTKSKASVLTSCMCTIGENFFFLGSRLADSLLVQHTLGSASGRTSSLM----GDIEA 471

Query: 460  D--APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVRDSLVNIGPLK 516
            D  AP+ KRL+R  S+  + +   EE+SLY S    ++ S +KTF+F VRDSLVNI PL+
Sbjct: 472  DLSAPAAKRLKREPSEEEEGVSA-EEMSLYYSTPTASDISQKKTFTFTVRDSLVNICPLR 530

Query: 517  DFSYGLRINADASATGISKQSNYEL--------------------------VELPGCKGI 550
            DF+YGLR NAD SATG+ KQSNYEL                          V LPGC GI
Sbjct: 531  DFAYGLRSNADQSATGLGKQSNYELVACSGHGKNGSLSVLHQSIRPDLINKVALPGCSGI 590

Query: 551  WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
            WTVYHK+ R  + +     + DDE+HAYLIISLE+RTMVLET D L EVTE+V+Y+ +G 
Sbjct: 591  WTVYHKTDRDDSNEFDFGTSEDDEFHAYLIISLESRTMVLETGDTLGEVTENVEYYTEGN 650

Query: 611  TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS-ENSTVLSVSIADP 669
            TIAAGNLFGRR V+QV++ G R+LDG+ M Q+L    S  E+ S    N+ V+   IADP
Sbjct: 651  TIAAGNLFGRRFVVQVYQNGLRLLDGAKMLQELLITNSELENNSSEVANNLVIEAVIADP 710

Query: 670  YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD- 728
            Y+LL M+DGS++L+VGD     +S+  P     +   +++ TLY DKGP  WLR+T ++ 
Sbjct: 711  YMLLKMTDGSLQLVVGDVENTKLSIPQPQGFGITTDAITAFTLYQDKGPHQWLRRTCSEM 770

Query: 729  -----AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSG 783
                  W ST              DQG +Y +VC  SG  EI+++P   CV+ VD F  G
Sbjct: 771  NSDRSQWSSTS-------------DQGYVYCIVCRISGRFEIYELPRMVCVYAVDNFNHG 817

Query: 784  RTHIVDTYMREALKDSETEINSSSEEGTGQGR---KENIHSMKVVELAMQRWSAHHSRPF 840
             + + D  + E   +S   +   +EE    G    ++   S+ V ++  + W     RPF
Sbjct: 818  MSVLWDQKVLERRANSNAALKEGAEEDKAPGDALLRDAGLSLHVSQICFESWGEKFGRPF 877

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            L A L+DGT+LCY A+ ++  E++   +      R  + S    SRL +LRF+R P+D  
Sbjct: 878  LLATLSDGTMLCYHAFSYDANESSDALE-----FRETATSLKDLSRLTHLRFARIPIDWV 932

Query: 901  TREETPHGAPC---QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
            + +E   GA      +   FKN+    G F++G RP W MV R RLR HPQ CDG+I+ F
Sbjct: 933  SGQED--GAKVLYETKFCSFKNVGSFPGVFVTGLRPTWLMVCRGRLRPHPQFCDGAILGF 990

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
            T LHNVNC HGFIY+T+QG LKICQLPS   YDN WPVQK IPL+ TPHQITY ++ NLY
Sbjct: 991  TPLHNVNCAHGFIYITAQGQLKICQLPSLLFYDNDWPVQK-IPLRGTPHQITYHSDVNLY 1049

Query: 1018 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV--DLHRTYTVEEYEVRILEPDRAG 1075
             LI+S PV +P +QVL        GH  D    +S+  D  R  T E+YEVRI+EP + G
Sbjct: 1050 ALIISTPVSRPTSQVL-----MGDGHPFDQQQENSIGEDGQRLVTSEDYEVRIIEPAQPG 1104

Query: 1076 GPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
            G W+ +A I M  +ENALTVR+V++ N TT + +TLLAIGT+YVQGEDVAA+GR++L S 
Sbjct: 1105 GNWEAKAAIKMHLTENALTVRIVSIKNITTDQTQTLLAIGTSYVQGEDVAAKGRIILVSV 1164

Query: 1136 GRNADNP 1142
            G++  +P
Sbjct: 1165 GKDPQDP 1171


>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
 gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
          Length = 1413

 Score = 1031 bits (2666), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 566/1197 (47%), Positives = 752/1197 (62%), Gaps = 118/1197 (9%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS+AA K++H PTG++ C S FITHS     P  P        S   S      +PNLV+
Sbjct: 1    MSYAAIKLVHGPTGVSACASAFITHS-----PVNP-----ASSSGWKSGNAKDSLPNLVL 50

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
              ANV+EIY VR QE G ++S   GE              KR   M GI+AA LELVC Y
Sbjct: 51   VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RL G V+S+AIL +G  D  R RD+IILAF  AK SVL FDD+   L+ +SMH FE PEW
Sbjct: 110  RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
            +HLKRGRE F  GPLV+ D QGRC GVL+Y  Q++++KA+Q   GLV ++D   SG   S
Sbjct: 169  IHLKRGREKFPGGPLVRADSQGRCAGVLIYKSQLVMMKAAQEAYGLVEEDDP--SGNIVS 226

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++  TC ++ALS
Sbjct: 227  ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            I+T  K+HP +W    LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N  A 
Sbjct: 287  INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                S E+PRS FS+ELDAAHATWL  D ALLSTKTG LV L +++DGR VQRL+LSK+ 
Sbjct: 347  PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S + TIG+  FF+GSRLGDSLLVQF   S ++ LS       G+ +     +KR+
Sbjct: 407  GSVLSSSLCTIGDMFFFVGSRLGDSLLVQFGSASTSNSLSQSYD---GEDDIMVRPSKRM 463

Query: 468  RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFS-------- 519
            R      L D  N + L  Y SA ++++     F F+VRDSL NIGP++D +        
Sbjct: 464  R------LDDDANEQSLYQYKSAVSDSQK-NMNFLFSVRDSLCNIGPIRDITGRSQNPSE 516

Query: 520  -------------YG----LRINADASATGISKQSNYEL------------VELPGCKGI 550
                         +G    L I + +       Q+N  L            V+LPGC G+
Sbjct: 517  QPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCVGV 576

Query: 551  WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
            WTVYH+S +        + A  DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +G 
Sbjct: 577  WTVYHRSGQ--------IPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTEGP 628

Query: 611  TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
            +I+AGNLFGRRR+ QV+++G RILDG+  TQDL  G    E G+  E     S S ADPY
Sbjct: 629  SISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFADPY 679

Query: 671  VLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
            VLL M DGS +L+VGD  T TVSV TP  +  S  P+S+CTLY+D+GP PWLR+ + D W
Sbjct: 680  VLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGDVW 739

Query: 731  LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT 790
             + GV +A         DQGD+Y +VC  SG +E  ++P+  C++ V++   G   + D 
Sbjct: 740  QTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLADN 794

Query: 791  YMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTI 850
              R A K      ++  EEG  + R E +  +KVV++ +  W   + RPF+F +L+DGT+
Sbjct: 795  --RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDGTL 851

Query: 851  LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG-- 908
            L Y+A+++EG ++ + + D  S               RNLRF R  LD    EE  +   
Sbjct: 852  LSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNADE 897

Query: 909  -APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
                Q+I  FK++ G QG FL+G +P W M+FRE++R+HPQ  DG IVAFT LHNVNC H
Sbjct: 898  VRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNCQH 957

Query: 968  GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLK 1027
            G IYVT++  LKIC+L +   YDN WPVQK IPLK TPHQ+ +  + N+Y L++S  V  
Sbjct: 958  GLIYVTNEASLKICRLSNILNYDNDWPVQK-IPLKGTPHQMAHHPDLNIYVLVLSFSVSV 1016

Query: 1028 PLNQVLSLLIDQEVGHQIDNHNLSS-VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
            P + VL    D   GHQID    S  +D  +   V+++EVR+LEP   G PW+T+ TI  
Sbjct: 1017 PTSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQVDDFEVRLLEPMAQGVPWETKDTIKF 1076

Query: 1087 QSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ 1143
            Q +EN LTVR+V++ N  T++ E LLAIGT Y+QGEDVA+RGR++L S G +  +P+
Sbjct: 1077 QPAENVLTVRIVSIKNAATEQVENLLAIGTGYLQGEDVASRGRIILVSLGEDPSDPK 1133


>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
 gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
          Length = 1413

 Score = 1029 bits (2660), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 566/1199 (47%), Positives = 754/1199 (62%), Gaps = 122/1199 (10%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS+AA K++H PTG++ C S FITHS     P  P        S   S      +PNLV+
Sbjct: 1    MSYAAIKLVHGPTGVSACASAFITHS-----PVNP-----ASSSGWKSGNAKDSLPNLVL 50

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
              ANV+EIY VR QE G ++S   GE              KR   M GI+AA LELVC Y
Sbjct: 51   VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RL G V+S+AIL +G  D  R RD+IILAF  AK SVL FDD+   L+ +SMH FE PEW
Sbjct: 110  RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
            +HLKRGRE F  GPLV+ D QGRC GVL+Y  Q++++KA+Q   GLV ++D   SG   S
Sbjct: 169  IHLKRGREKFPGGPLVRADSQGRCAGVLIYKCQLVMMKAAQEAYGLVEEDDP--SGNIVS 226

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++  TC ++ALS
Sbjct: 227  ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            I+T  K+HP +W    LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N  A 
Sbjct: 287  INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                S E+PRS FS+ELDAAHATWL  D ALLSTKTG LV L +++DGR VQRL+LSK+ 
Sbjct: 347  PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-GDIEADAPSTKR 466
             SVL+S + TIG+  FF+GSRLGDSLLVQF    G++  S+ L+  + G+ +     +KR
Sbjct: 407  GSVLSSSLCTIGDKFFFVGSRLGDSLLVQF----GSASTSNSLEHSYDGEDDIMVRPSKR 462

Query: 467  LRRSSSDALQDMVNGEELSLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSY----- 520
            +R      L D  +  E SLY   S  ++S +   F F+VRDSL NIGP++D +      
Sbjct: 463  MR------LDD--DASEQSLYQYKSGVSDSQKNMNFLFSVRDSLCNIGPIRDITCRSQNP 514

Query: 521  --------------------GLRINADASATGISKQSNYEL------------VELPGCK 548
                                 L I + +       Q+N  L            V+LPGC 
Sbjct: 515  SEQPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCV 574

Query: 549  GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            G+WTVYH+S +        + A  DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +
Sbjct: 575  GVWTVYHRSGQ--------IPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTE 626

Query: 609  GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
            G +I+AGNLFGRRR+ QV+++G RILDG+  TQDL  G    E G+  E     S S AD
Sbjct: 627  GPSISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFAD 677

Query: 669  PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
            PYVLL M DGS +L+VGD  T TVSV TP  +  S  P+S+CTLY+D+GP PWLR+ + D
Sbjct: 678  PYVLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGD 737

Query: 729  AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             W + GV +A         DQGD+Y +VC  SG +E  ++P+  C++ V++   G   + 
Sbjct: 738  VWQTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLA 792

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D+  R A K      ++  EEG  + R E +  +KVV++ +  W   + RPF+F +L+DG
Sbjct: 793  DS--RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDG 849

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
            T+L Y+A+++EG ++ + + D  S               RNLRF R  LD    EE  + 
Sbjct: 850  TLLSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNA 895

Query: 909  ---APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
                  Q+I  FK++ G QG FL+G +P W M+FRE++R+HPQ  DG IVAFT LHNVNC
Sbjct: 896  DEVRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNC 955

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
             HG IYVT++  LKIC+L +   YDN WPVQK IPLK TPHQ+ +  + N+Y L++S  V
Sbjct: 956  QHGLIYVTNEASLKICRLSNILNYDNDWPVQK-IPLKGTPHQMAHHPDLNIYVLVLSFSV 1014

Query: 1026 LKPLNQVLSLLIDQEVGHQIDNHNLSS-VDLHRTYTVEEYEVRILEPDRAGGPWQTRATI 1084
              P + VL    D   GHQID    S  +D  +   V+++EVR+LEP   G PW+T+ TI
Sbjct: 1015 SVPTSLVLPSAADGPPGHQIDQSEASDGLDPQKMVQVDDFEVRLLEPMAQGVPWETKDTI 1074

Query: 1085 PMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ 1143
              Q +EN LTVR+V++ N  T++ E LLAIGT Y+QGEDVA+RGR++L S G +  +P+
Sbjct: 1075 KFQPAENVLTVRIVSIKNAATEQVENLLAIGTGYLQGEDVASRGRIILVSLGEDPSDPK 1133


>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like, partial [Cucumis sativus]
          Length = 741

 Score =  667 bits (1722), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/450 (72%), Positives = 370/450 (82%), Gaps = 4/450 (0%)

Query: 697  PAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVV 756
            PAA  SSKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG   DQGDIY V 
Sbjct: 11   PAAFGSSKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVA 70

Query: 757  CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
            CY++G LEIFDVPNF  VF VDKFVSG++H+VD  + +  K SE + NS  +E    GR 
Sbjct: 71   CYDNGDLEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNS--QELISHGRN 128

Query: 817  ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
            E+  +MKV+E+AMQRWS  HSRPFLF ILTDGTILCY AYLFE  ++ SK DD VS   S
Sbjct: 129  ESSQNMKVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNS 188

Query: 877  LSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC 936
            +S SN+S+SRLRNLRF R PLD   RE+ P+G   +R++IFKNISG+QG FL GSRP W 
Sbjct: 189  VSSSNMSSSRLRNLRFLRVPLDIQGREDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 248

Query: 937  MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
            MVFRERLRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQ
Sbjct: 249  MVFRERLRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 308

Query: 997  KVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLH 1056
            KV PLK TPHQ+TYF EKNLYP+I+S PV KPLNQVLS ++DQ+VGH ++NHNLS+ +L 
Sbjct: 309  KV-PLKGTPHQVTYFHEKNLYPVIISAPVQKPLNQVLSSMVDQDVGH-VENHNLSADELQ 366

Query: 1057 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGT 1116
            +TY+VEE+E+RILEP+++GGPWQTRATI M SSENALT+RVVTL NTTTKENETLLA+GT
Sbjct: 367  QTYSVEEFEIRILEPEKSGGPWQTRATIAMHSSENALTIRVVTLLNTTTKENETLLAVGT 426

Query: 1117 AYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            AYVQGEDVAARGRVLLFS G++ADN Q LV
Sbjct: 427  AYVQGEDVAARGRVLLFSVGKDADNSQTLV 456


>gi|414587801|tpg|DAA38372.1| TPA: hypothetical protein ZEAMMB73_993613 [Zea mays]
          Length = 573

 Score =  656 bits (1693), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/569 (58%), Positives = 411/569 (72%), Gaps = 34/569 (5%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--------ELDSELP-SKRG 51
           MS+AAYKMMH PTGI +C +GFITHS AD                   ++DS    + R 
Sbjct: 1   MSYAAYKMMHLPTGIDHCAAGFITHSPADAAAFSTPAPAPTAAAGPDGDIDSTAARAPRR 60

Query: 52  IGPVPNLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYR-- 108
           +GP PNLVV+AANV+E+Y VR +   G++++ NS  T    ++DGIS A LELVCHYR  
Sbjct: 61  VGPTPNLVVSAANVLEVYAVRAEVATGAEDAGNSSSTG--TILDGISGARLELVCHYRCK 118

Query: 109 ----------------LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
                           LHGN+ES+A+LS G      RRDSI + F DAKI+ LEFDDSI+
Sbjct: 119 QMALASLHSLLAVNFRLHGNIESMAVLSDG---TENRRDSIAVTFNDAKITCLEFDDSIN 175

Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
           GLR +SMHCFE PEW HLKRGRESFA GP++K DPQGRCG VLVYGLQ+IILKA+Q G  
Sbjct: 176 GLRTSSMHCFEGPEWFHLKRGRESFAWGPIIKGDPQGRCGAVLVYGLQIIILKAAQVGQS 235

Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
           LVG+++        + RIESS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAG
Sbjct: 236 LVGEDEPTRVLSSTAVRIESSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAG 295

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           R+S K  TCM+SA SIS  LKQHP+IWSA  LPHDAY+LLAVP PI G+LV+ AN+IHYH
Sbjct: 296 RISSKSQTCMLSAFSISMGLKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYH 355

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
           SQS SC+LALN+++   D S E+ ++SF VELD A ATWL +D+ + S+K G+++LLTVV
Sbjct: 356 SQSTSCSLALNSFSSQPDGSPEILKTSFHVELDVAKATWLSHDIVMFSSKNGEILLLTVV 415

Query: 393 YDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
           YDGR VQRLDL K+  SVL+S  TT+G+S  FLGSRL DSLLVQF+CG  TS+L   L +
Sbjct: 416 YDGRAVQRLDLMKSKASVLSSGATTLGSSFIFLGSRLADSLLVQFSCGMPTSVLPD-LTD 474

Query: 453 EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
           E  DIE+D P +KRL+R  SD LQD+ + EELS +  A  N   + +  SF VRD+L+N+
Sbjct: 475 EPADIESDLPFSKRLKRIPSDVLQDVTSVEELSFHNKAVPNIVDSAEKISFVVRDALINV 534

Query: 513 GPLKDFSYGLRINADASATGISKQSNYEL 541
           GPLKDF+YGLR N+D +A GI+KQSNYEL
Sbjct: 535 GPLKDFAYGLRTNSDPNAAGIAKQSNYEL 563


>gi|414587798|tpg|DAA38369.1| TPA: hypothetical protein ZEAMMB73_163106, partial [Zea mays]
          Length = 483

 Score =  550 bits (1416), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 271/459 (59%), Positives = 328/459 (71%), Gaps = 11/459 (2%)

Query: 685  GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
             DPSTCT+S+  PA   SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID  D 
Sbjct: 33   ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92

Query: 745  GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
               D  DIY ++CYESG LEIF+VP+F  VF+VD FVSG   + D + R + KDS     
Sbjct: 93   SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152

Query: 805  SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
             +S+      +KE   ++K+VELAM RWS   SRPFLF +L DGT+LCY AY FEG E+ 
Sbjct: 153  DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209

Query: 865  SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
             +         S  + N + SRLRNLRF R  +D  +R++      C    RITIF N+ 
Sbjct: 210  VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDIS----CLVRPRITIFNNVG 265

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            G++G FL G RP W  V R+R RVHPQLCDG IVAFTVLHNVNC  G IYVTSQG LKIC
Sbjct: 266  GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
            QLPS   YDNYWPVQKV PL  TPHQ+TY+ E++LYPLIVSVP ++PLNQVLS + DQE+
Sbjct: 326  QLPSAYNYDNYWPVQKV-PLHGTPHQVTYYGEQSLYPLIVSVPQVRPLNQVLSSMADQEL 384

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1101
            G  ++N   S  DL   YTV+E+EVRI+E  ++ G W+TR+TIPMQS ENALTVR+VTL 
Sbjct: 385  GLHMENDVTSGGDLQEVYTVDEFEVRIMELGKSNGRWETRSTIPMQSFENALTVRIVTLQ 444

Query: 1102 NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1140
            NT+TKENETL+AIGTAYVQGEDVAARGRVLLFS  ++ +
Sbjct: 445  NTSTKENETLMAIGTAYVQGEDVAARGRVLLFSFSKSEN 483


>gi|414587799|tpg|DAA38370.1| TPA: hypothetical protein ZEAMMB73_163106 [Zea mays]
          Length = 461

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 241/421 (57%), Positives = 293/421 (69%), Gaps = 11/421 (2%)

Query: 685  GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
             DPSTCT+S+  PA   SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID  D 
Sbjct: 33   ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92

Query: 745  GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
               D  DIY ++CYESG LEIF+VP+F  VF+VD FVSG   + D + R + KDS     
Sbjct: 93   SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152

Query: 805  SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
             +S+      +KE   ++K+VELAM RWS   SRPFLF +L DGT+LCY AY FEG E+ 
Sbjct: 153  DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209

Query: 865  SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
             +         S  + N + SRLRNLRF R  +D  +R++      C    RITIF N+ 
Sbjct: 210  VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDI----SCLVRPRITIFNNVG 265

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            G++G FL G RP W  V R+R RVHPQLCDG IVAFTVLHNVNC  G IYVTSQG LKIC
Sbjct: 266  GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
            QLPS   YDNYWPVQKV PL  TPHQ+TY+ E++LYPLIVSVP ++PLNQVLS + DQE+
Sbjct: 326  QLPSAYNYDNYWPVQKV-PLHGTPHQVTYYGEQSLYPLIVSVPQVRPLNQVLSSMADQEL 384

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF 1101
            G  ++N   S  DL   YTV+E+EVRI+E  ++ G W+TR+TIPMQS ENALTVR+VTL 
Sbjct: 385  GLHMENDVTSGGDLQEVYTVDEFEVRIMELGKSNGRWETRSTIPMQSFENALTVRIVTLQ 444

Query: 1102 N 1102
            N
Sbjct: 445  N 445


>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
 gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
          Length = 1423

 Score =  466 bits (1199), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 373/1212 (30%), Positives = 576/1212 (47%), Gaps = 177/1212 (14%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MSFA +K +H PTG+ +  + + TH   D  P                       PNLVV
Sbjct: 1    MSFAIHKQVHPPTGVDHAVAAYFTHPIGDGGP-----------------------PNLVV 37

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
              AN + ++ +R        +  SG+        G  A SLE+V  + L+G V S+A++ 
Sbjct: 38   MQANHLTVFAIRRD----PSADASGDAAL-----GAKAMSLEVVAEFDLNGTVGSIAVMR 88

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR- 179
            +       +RD++++A  ++K+SV+E+D S   +  +S+H +E+P       G  S  R 
Sbjct: 89   RRSGAPRNQRDALLIAVRESKLSVIEWDPSEMTVVPSSLHSWETPVG---TGGVPSALRV 145

Query: 180  ---GPLVKVDPQGRCGGVLVY--GLQMIIL----KASQGGSGLVGDEDTFGSGGGFSARI 230
                PL   DP+GRC  VL+   G   + L     A     G  G +D    G G +A +
Sbjct: 146  APLPPLAIADPEGRCAAVLLRAEGRSRLALCPAVDADADADGDGGGDDGDRRGQGPAASV 205

Query: 231  ESSHVINLR-DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
              S V++L  DL +  V+D  F+HGY EPV++ILHERE TWA R+   + TC+++A+SI+
Sbjct: 206  RKSFVVDLTADLALSGVRDAAFLHGYGEPVVLILHEREPTWAARMPLVNDTCVLTAVSIN 265

Query: 290  TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
               K+  +IW    LP   Y+L A+P P+GG +V+  N + + SQ +S ALALN  A   
Sbjct: 266  LDTKRCTVIWQREKLPCTCYRLCAMPDPLGGAIVLSNNFLLHESQESSKALALNPLAGGG 325

Query: 350  DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ---RLDLSKT 406
              S        SVELD+AHA  L     L++TK G L+LL++  +GR +     + L + 
Sbjct: 326  TESA----LGVSVELDSAHAAVLSERQVLVTTKQGALMLLSLRVEGRRLAAHGAMHLRRA 381

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--KEEFGDIEADA 461
              +VL+S +  I   L FLGSR+GDSLLV            ML +    K + G+ E   
Sbjct: 382  GGAVLSSGMCLITKRLLFLGSRVGDSLLVSLKKKEAAGAAQMLPAAAPKKRKAGEAEPPK 441

Query: 462  PSTKRLRRSSS----DALQDMVNGE-ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLK 516
            P     +  +S    D L+ M+ GE E +   + +   E     ++F VRDS++ I P+ 
Sbjct: 442  PPPPPQKVGTSQDDEDELEAMLYGEGEAAAKAANAGRKE--DPGYTFTVRDSVLGISPII 499

Query: 517  DFSYGLRINADA---------SATGISKQSNYELVE---------------LPGCKGIWT 552
            D + G   +            +A G  K     +++               LPG  G WT
Sbjct: 500  DLTAGASASVQGDTEERAELVAACGHGKNGALAILQRGIQPELVTEVEAGTLPGLMGTWT 559

Query: 553  VYHKS---SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
            VYH+S    R   + ++  AA  D +H+YL+ISLE+ TMVLET + L EV+E+V+     
Sbjct: 560  VYHESRDNERLRESGAAAAAANVDPFHSYLVISLESTTMVLETGEELREVSEAVELVTDA 619

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
             T+AAGN+ GR+R+ QV + G RI +G    QDLS       +G  S +  +++  + DP
Sbjct: 620  ATLAAGNMHGRKRIAQVHKGGVRICEGPVKIQDLSAA-EMPAAGDVSPDLEIIAAQVLDP 678

Query: 670  YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES--SKKPVSSCTLYHDKGP--------- 718
            YVL  MSDGS+R+L GD    +V   +P++  +  + + ++S  L  D  P         
Sbjct: 679  YVLCRMSDGSLRVLKGDEEKGSVEAMSPSSYANLPTGESIASAALVDDSVPAAERPGLTT 738

Query: 719  -EP-WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT 776
             EP +LR+T+T    STGV          P D+      V    G LE++ +P+   +++
Sbjct: 739  REPGFLRRTAT----STGV---------LPEDEEGTVLAVTRVGGTLELYALPSCERIWS 785

Query: 777  VDKFVSGRTHIVDTYMREALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
             D    G   +      + +  D + E+  +          ++  + ++VE  +  +   
Sbjct: 786  ADGLSEGLNVLAPGGAGDDVNVDGDGEVEPT----------DDYPAPEIVEFRLDAFPRA 835

Query: 836  HSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
            H RP L A+  DG++L Y+A+L   G  N      P                   LRF R
Sbjct: 836  HERPMLTALRGDGSVLVYRAFLCPPGAGNVGHEAKP------------------QLRFCR 877

Query: 895  TPLD------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
             P++           +   G+   R     +  G +G F+SG RP W +V R R+   P 
Sbjct: 878  VPIELEGGGGGMVDTKALSGSRLTRFERVGDRGGIRGVFVSGPRPLWLLVRRSRVLALPI 937

Query: 949  LCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
              +    V+FT  HNVNC +GF+  T+ G ++ICQ+P    Y+  WPV+K + L+ TPH 
Sbjct: 938  RGEAQRTVSFTPFHNVNCLNGFMLGTAAGGVRICQIPGRMHYEAAWPVRK-LALRCTPHH 996

Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQV------LSLLIDQEVGHQIDNHNLSSVDLHRTYTV 1061
            + Y  +  LY L  S PV    ++V      LS LI     + +    +  V        
Sbjct: 997  VQYLPDFRLYALSTSAPVKWKDHEVNEDDIHLSTLIKVRKANAMAKGGVEQV-------- 1048

Query: 1062 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG 1121
              + +R+L P      WQ      +   E+  ++R V L NT T   +++L +GTA   G
Sbjct: 1049 --FSLRLLVPGTLECAWQ----YTVDPGEHVQSIRNVQLRNTMTGALQSMLVVGTALPGG 1102

Query: 1122 EDVAARGRVLLF 1133
            ED   RGRVL+F
Sbjct: 1103 EDAPCRGRVLIF 1114


>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1386

 Score =  454 bits (1168), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 353/1213 (29%), Positives = 561/1213 (46%), Gaps = 213/1213 (17%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS A ++ +H PTG+ +  + + T    D                       G  PNL+V
Sbjct: 1    MSHAVHREVHPPTGVDHAVTAYFTRPVGD-----------------------GGDPNLIV 37

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
             +AN I +Y V     G +ES                   L++   +   G + S+++L 
Sbjct: 38   ASANRITVYAV--NRRGDEES-------------------LDVCAEFDAQGAIGSMSVLR 76

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES-----PEWLHLKRGRE 175
            +       +RD++++A  + K+SV+E+D +   +  +SMH FES     P    L+  RE
Sbjct: 77   RRFGAPRNQRDALLIAIRERKLSVVEYDAATGDVCCSSMHSFESALGCNPLGTTLRMSRE 136

Query: 176  SFARGPLVKVDPQGRCGGVLV----YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
            +    PLV  DP+GRC  V++       ++ +L +  GG GLV ++D  G   G +A + 
Sbjct: 137  A----PLVVSDPEGRCAAVVLREDGVAGKVRVLPSVDGGLGLVANDDE-GRVRGPAASVR 191

Query: 232  SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
             S  ++L  + +  ++D  F+HGY EP + +L+E+  TWAGR +    TC I ALS+   
Sbjct: 192  ESFPLHLPGVRL--IRDACFLHGYGEPALAVLYEKTPTWAGRYNLSKDTCEIVALSVDVD 249

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             ++  +IW   NLP  +YKL A+  P+GG LV   + + + SQ +S  L LN +      
Sbjct: 250  KQKGTVIWRRQNLPSSSYKLTALLPPLGGALVFSQDFLLHESQESSSVLGLNTFGHG--G 307

Query: 352  SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
             QE   +   + LD A A+ +  D  L++TKTG L+LL +  DGR ++R+ L +   +VL
Sbjct: 308  PQE--GNDAEITLDGAQASVVSEDRVLVTTKTGALLLLALHTDGRSLRRMMLQRAGGAVL 365

Query: 412  TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---ML-----------SSGLKEEFGDI 457
            +S +  +   L FLGSR+GDSLLV+FT     +   ML           +   K++  ++
Sbjct: 366  SSGMCLLSRDLLFLGSRIGDSLLVKFTPKEEPTAPLMLPDAEDESEDEATEKSKDDDDEL 425

Query: 458  EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
            EA    T +     +DA+Q      E    G A          +   V+DSL+ + P+ D
Sbjct: 426  EALLYGTTKTETVQTDAVQ-----TEKKREGLAGIIPGLKVAGYDLKVKDSLLGVAPVVD 480

Query: 518  FSYGLRINADASATGISKQSNYELVE-----------------------------LPGCK 548
             + G      ++  G +K    EL+                              LP  +
Sbjct: 481  IAVGA-----SAPMGSNKNERTELITACGQGKNGALAILTRGVQPELVTEVESGTLPNLQ 535

Query: 549  GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            G+WT+++   R   +   R     + +H +L++S+++ TM++ET + L EV+ S+++   
Sbjct: 536  GLWTLHY---RKEGSKEER-----EPFHHHLLLSMKSSTMIMETGEELQEVSASLEFITN 587

Query: 609  GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
              T+AA N+FG    +QV   G R+L G    QD+     ++  G+      + S  I D
Sbjct: 588  QATLAASNIFGHYCSVQVTGTGIRVLKGGVKVQDVGLQDMDAPKGA-----AIASAQILD 642

Query: 669  PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK--------GPE- 719
            PY+++ +SDGSIRLL GD    +VS+    AI +S   V++  L  D         G E 
Sbjct: 643  PYIIVRLSDGSIRLLSGDEKQMSVSLMETGAIPTSS--VTAFALVDDSVEAADAAGGGER 700

Query: 720  --PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV 777
               W+ + +T+  ++   G    GA     +  +    +  E G+LE+F +P+   ++  
Sbjct: 701  KSGWIHRAATNGTITGLEGNKKSGA----CNNSEAIVALTREGGSLELFSLPSCTRIWCA 756

Query: 778  DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
            D    G        MR  +   +T +N+ S               ++V++ +  +   H 
Sbjct: 757  DGLSEG--------MR--VLSPQTPVNAESS------------VPEIVDIRIDSFQDAHE 794

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
            RP L A+  DGT+L Y+ ++          D+P+  +               LRFSR  +
Sbjct: 795  RPLLTAVRGDGTLLLYKGFIVPAGTTYEGQDEPLEKN--------------ELRFSRVNV 840

Query: 898  D-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLR 944
            D             A    ++  GA   RI       G QG F++G  P W +V R R+ 
Sbjct: 841  DVEGSGLNVAGIGAAGQLRDSLAGARLTRIGNVGEGQGVQGIFVAGPNPLWLIVRRSRVL 900

Query: 945  VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKAT 1004
              P   +G +VAFTV HNVNC HGFI  T+ G ++ICQ+PS   Y+  WPV+KV  LK T
Sbjct: 901  ALPTRGEGEVVAFTVFHNVNCPHGFILGTALGGVRICQMPSKMHYEAAWPVRKV-ALKCT 959

Query: 1005 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE-- 1062
            PH ITY  +  LY L+ S PV       +   I+Q+  H I    L+ V   R    +  
Sbjct: 960  PHTITYLPDFKLYALVTSAPV-----PWVEREIEQDNVHGI---ALAKVRRERAKANDDM 1011

Query: 1063 --EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQ 1120
              +Y VR+L P      WQ      ++  E+   VR V L +  T    +LLA+GTA   
Sbjct: 1012 ELQYSVRLLVPGSLDSAWQH----ALEPGEHVQCVRNVQLRDINTGALLSLLAVGTAMPG 1067

Query: 1121 GEDVAARGRVLLF 1133
            GED   RGRV+LF
Sbjct: 1068 GEDTPCRGRVILF 1080


>gi|242075248|ref|XP_002447560.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
 gi|241938743|gb|EES11888.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
          Length = 374

 Score =  444 bits (1143), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 220/359 (61%), Positives = 267/359 (74%), Gaps = 14/359 (3%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-------ELDSELPSK-RGI 52
           MS+AAYKMMHWPT I +C +GFITHS AD                  ++DS   S  R +
Sbjct: 1   MSYAAYKMMHWPTSIDHCAAGFITHSPADAAAFSSAAPAAAASGPDGDIDSAAASAPRRV 60

Query: 53  GPVPNLVVTAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHG 111
           GP PNLVV+AANV+E+Y VR     G+++  NS  T    ++DGIS A LELVCHYRLHG
Sbjct: 61  GPTPNLVVSAANVLEVYAVRADSATGAEDVGNSSSTG--AILDGISGARLELVCHYRLHG 118

Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
           N+ES+A+LS G      RRDSI + F+DAKI+ +EFDDS +GLR +SMHCFE PEW HLK
Sbjct: 119 NIESMAVLSDG---TENRRDSIAVTFKDAKIACMEFDDSTNGLRTSSMHCFEGPEWFHLK 175

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
           RGRESFA GP++K DPQGRCG VLVYGLQMIILKA++ G  LVG+++        + RIE
Sbjct: 176 RGRESFAWGPIIKADPQGRCGAVLVYGLQMIILKAAEVGQSLVGEDEPTRMLSSTAVRIE 235

Query: 232 SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
           SS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAGR+S K  TCM+SA SIS  
Sbjct: 236 SSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAGRISSKSQTCMLSAFSISMG 295

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
           LKQHP+IWSA  LPHDAY+LLAVP PI G+LV+ AN+IHYHSQS SC+LALN+++   D
Sbjct: 296 LKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYHSQSTSCSLALNSFSSQPD 354


>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1469

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 362/1239 (29%), Positives = 586/1239 (47%), Gaps = 186/1239 (15%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MSFA +K +H PTG+ +  + + TH         P+              G G  PNLVV
Sbjct: 1    MSFAIHKQVHPPTGVDHACAAYFTH---------PI--------------GSGAPPNLVV 37

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRR---------VLMDGISAA------------ 99
              AN + IY +R   +G      SG   +          ++ D IS A            
Sbjct: 38   LQANRLTIYAIR--RDGDARDNPSGNATKEADDAAIAASLVADAISGAGATASATIDADD 95

Query: 100  ---SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
               SLE+V  + L+G V S+A L +       +RD+++LA  ++K+SV+EFD S   L  
Sbjct: 96   AEVSLEVVAEFDLNGTVGSIATLRRRFGAPREQRDALLLAVRESKLSVVEFDPSTLSLVC 155

Query: 157  TSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY---GLQMIILKASQG 209
            +S+H +E+P       G  S  R     P+V  DP+GRC  VL+    G ++ +L     
Sbjct: 156  SSLHSWETPPG---AGGVPSALRLAPTPPVVVADPEGRCAAVLLRAEGGTRLALLPTDND 212

Query: 210  GSGLVGDEDTFGSGG----GFSARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILH 264
               + G + + G G     G +A ++ S+V++L R++ +++V+D  F+HGY EPV+++LH
Sbjct: 213  AMDVDGGDGSEGKGRRTLRGTAAAVKKSYVVDLVREMGVRYVRDVCFLHGYGEPVLLVLH 272

Query: 265  ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
            E  LTWA R +    T  +SA+S++   ++H +IW    LPH  Y+L A+P+P+GG +V+
Sbjct: 273  EERLTWAARATLVKDTMRLSAISLNVDARKHTVIWRRSALPHSCYRLTAMPAPLGGAIVL 332

Query: 325  GANTIHYHSQSASCALALNNYA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
              N + + SQ +S ALALN  A      D + +   ++ +  LD A+A  +    AL++T
Sbjct: 333  SQNFLLHESQESSAALALNPLAGGGRGDDPAAKAAAAASAAALDGAYAAVISEKQALVTT 392

Query: 382  KTGDLVLLTVVYDGRVVQR---LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            K G L LL++  +GR +     + L +   +VL+S +  +   L FLGSR+GDSLLV   
Sbjct: 393  KAGALYLLSLRIEGRRLATRGGMHLKRAGGAVLSSGMCLVTRRLLFLGSRVGDSLLVS-R 451

Query: 439  CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG-EELSLYGSASNNTESA 497
            C +  +  ++  +       A   +   +R        D V G    SL  +A+ +    
Sbjct: 452  CSTARASTAAPGRRPRAAAAAATTAAAEVRLLPIRPQIDGVGGVSAASLRAAAAAHRAPD 511

Query: 498  QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATG----------------------ISK 535
               ++F VRDS++ I P+ D + G    A AS +G                      + +
Sbjct: 512  HPGYTFTVRDSVLGISPVIDLTVG----ASASVSGDTIERTELIAACGHGKNGALAVLQR 567

Query: 536  QSNYELVE------LPGCKGIWTVYHKSS---RGHNADSSRMAAYDDEYHAYLIISLEAR 586
                ELV       LPG KG WTV+H S+   R   + ++  A   D YHAYL+ISL + 
Sbjct: 568  GIQPELVTEVESGTLPGLKGTWTVHHDSADNERLRGSAAAAAAQAVDPYHAYLVISLASS 627

Query: 587  TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
            TM+LET + L EV+E V+      T+ AGN FGR R++QV+++G R+  G    QD++  
Sbjct: 628  TMILETGEELKEVSEHVELVTDAATLCAGNAFGRERIVQVYDKGVRVAAGPVKVQDIAST 687

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-------VQTP-- 697
               +++G G E   +++  I+ PYVL  +SDGS+ +L GD  + T+         + P  
Sbjct: 688  ELVADAGDG-EGIEIVAAEISFPYVLCRLSDGSLAVLKGDEESKTLVKLDVDALARLPPG 746

Query: 698  -----AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGADGGPLDQG 750
                 A +     P ++    HD+ P  +L++ +T    +T    +   +  D     + 
Sbjct: 747  GGIACATLVDDSTPAAAHGGLHDRSPG-FLKRATTATATTTTTTASASREDGDDDDDSRR 805

Query: 751  DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
             ++  V    GALE++ +P+ +  +T +    G   +                     + 
Sbjct: 806  PMFLAVTRTGGALELYSLPSCDKAWTANGLSEGVAVLSPA--------GSASAALVDRDA 857

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                      + ++VEL +  ++  H RP L A+  DG +L Y+A+              
Sbjct: 858  AAAADAGADRAPEIVELRVDAFARAHERPLLTALRADGAVLVYRAF-------------- 903

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA------PCQRITIFKNI---S 921
                 + +V+      L  LRF+R P++    E    GA      P  R+T F+ +    
Sbjct: 904  -----TCAVAGPGGRALTQLRFARVPVEL---EGGGGGAVDLSALPGSRLTRFERVGDRG 955

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKI 980
            G +G F+SG +P W +  R R+   P   +   +V+FT  HNVNC+ GFI  T+ G ++I
Sbjct: 956  GIRGVFVSGPQPLWLLARRSRVLALPVRGEAQRVVSFTAFHNVNCHAGFILGTAAGGVRI 1015

Query: 981  CQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1040
            CQ+P    Y+  WPV+K + L+ TPH + Y  +  LY L  S P         +  ++ E
Sbjct: 1016 CQIPGRMHYEAAWPVRK-LALRCTPHHVQYLPDFKLYALSTSAP---------AKWVEPE 1065

Query: 1041 VGHQIDNHNLSSVDLHRTYTV------EEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
            V  + D H  + V   R   +      E++ V++L P    G  +T  +  M   E+   
Sbjct: 1066 VAEE-DIHAATVVKTRRAKAMARGGVEEQFAVKLLVP----GSLETAWSRTMDPGEHVQA 1120

Query: 1095 VRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            V+ V + N  T    ++LA+GTA   GED   RGRV+LF
Sbjct: 1121 VKNVQVRNLRTGALHSMLAVGTAMPGGEDTPCRGRVILF 1159


>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Takifugu rubripes]
          Length = 1444

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 344/1214 (28%), Positives = 568/1214 (46%), Gaps = 184/1214 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E + ++  S ++K R          LE V  + L GNV S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +   GA+    RD+++L+F+DAK+SV+E+D   H L+  S+H FE    L L+ G   
Sbjct: 82   ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEE---LELRDGFVQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                P+V+VDP+ RC  +L+YG ++++L   +     + DE   G G G  +    +++I
Sbjct: 135  NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDT---LTDEQEVGVGEGPKSSFLPTYII 191

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 252  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 311

Query: 354  ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
               +    + LD + A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312  LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413  SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD-----APSTKRL 467
            + + T+     FLGSRLG+SLL+++T       L  G  ++  + E D      P +K+ 
Sbjct: 372  TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQDMPLEEGKDQQDKEKEKDMDKQEEPPSKKK 431

Query: 468  RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
            R  SS    D V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + S G     
Sbjct: 432  RVESSSNWTDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAFL 487

Query: 522  ---LRINADAS-----ATGISKQSNYELV------------ELPGCKGIWTVY------- 554
                + N +        +G  K     ++            ELPGC  +WTV        
Sbjct: 488  SEEFQSNPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEPVQK 547

Query: 555  --HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
               ++ R     +   A  D + H +LI+S E  TM+L+T   + E+  S  +  QG T+
Sbjct: 548  EQEETEREGKEKTEPPAEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 606

Query: 613  AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
             AGN+   + +IQV   G R+L+G  +TQ L F P +         S ++  S+ADPYV+
Sbjct: 607  FAGNIGDNKYIIQVSPMGIRLLEG--VTQ-LHFIPVDL-------GSPIVHCSLADPYVV 656

Query: 673  LGMSDGSIRLLVGD-----PSTCTVSVQTPAAIESSK-------KPVS---------SCT 711
            +  ++G + + V         T  +++Q P     S+       + VS         SC+
Sbjct: 657  IMTAEGVVTMFVLKIDSYMGKTHRLALQKPQISTQSRVIALCAYRDVSGMFTTENKVSCS 716

Query: 712  LYHDKGPEPWLRKTSTDAWLSTGV---------GEAIDGADGGPLDQGDI---------- 752
            +  D          +    LST +         G++  G     +++             
Sbjct: 717  ITEDISIRSQSEAETIIQDLSTNIVDDEEEMLYGDSNTGPSKEEMNRSSFAGPSEGSYSK 776

Query: 753  -----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
                 + ++  +SG +EI+ +P++  VF V  F  G+  +VD+   ++    E E     
Sbjct: 777  AEPSHWCLITRDSGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSATQGEKE--GKK 834

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF--EGPENTS 865
            EE T QG    +  + +V L       +HSRP+L  +  D  +L Y+A+ +  + P+N  
Sbjct: 835  EEVTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVDQELLIYEAFPYDQQQPQNNL 888

Query: 866  KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--------ETPHGAPCQ----- 912
            K                       +RF + P +   RE        +   G   +     
Sbjct: 889  K-----------------------VRFKKVPHNINFREKKSKLRKDKKAEGTAAEDSVAA 925

Query: 913  -----RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCN 966
                 R   F++ISG+ G F+ G  P W +V  R  LR+HP   DG I +F+  HN+NC 
Sbjct: 926  RGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMSIDGPIESFSPFHNINCP 985

Query: 967  HGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVL 1026
             GF+Y   QG L+I  LP+  +YD  WPV+K IPL+ T H ++Y  E  +Y +  S+   
Sbjct: 986  KGFLYFNKQGELRISVLPTYLSYDAPWPVRK-IPLRCTVHYVSYHVESKVYAVCTSL--- 1041

Query: 1027 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
                + L   I +  G + +   +   + +     +++ ++++ P        TR  I +
Sbjct: 1042 ----KELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLISPVSWEAIPNTR--IDL 1095

Query: 1087 QSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1145
            +  E    ++ V L +  T    +  +A GT  +QGE+V  RGR+L+         P   
Sbjct: 1096 EEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRGRILILDVIEVVPEPGQP 1155

Query: 1146 VLSGSYGPLFSSVQ 1159
            +    +  L+   Q
Sbjct: 1156 LTKNKFKVLYEKEQ 1169


>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oryzias latipes]
          Length = 1456

 Score =  420 bits (1080), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 356/1238 (28%), Positives = 571/1238 (46%), Gaps = 220/1238 (17%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E +  S  S + K R          LE V  + L GNV S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSSSDKSSDAKTR-------KEKLEQVASFSLFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            A +   GA     +D+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82   ASVQLTGAS----KDALLLSFKDAKLSVIEYDPGTHDLKTLSLHYFEEPE---LRDGFFQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                P+V+VDP+ RC  +L+YG ++++L   +     + DE   G G G  +    S++I
Sbjct: 135  NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKD---TLSDEQEGGVGEGPKSSFLPSYII 191

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS         Y VSL+S   
Sbjct: 252  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305

Query: 352  -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
             +   P   +    + LD   + ++  D  ++S K G++ +LT++ DG R V+     K 
Sbjct: 306  GTTSFPLRVQEEVKITLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
              SVLT+ + T+     FLGSRLG+SLL+++T     +    G  ++  + E   P  K+
Sbjct: 366  AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPAEDGNDKQ--EKEKQEPPNKK 423

Query: 467  LRRSSSD-----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGP 514
             R  SS             L D V+  E+ +YGS A + T+ A  TFSF V DS++NIGP
Sbjct: 424  KRVESSSNWTGCSASYFFVLSDEVD--EIEVYGSEAQSGTQLA--TFSFEVCDSILNIGP 479

Query: 515  LKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGCKG 549
              + S G         + N +        +G  K     ++            ELPGC  
Sbjct: 480  CANASMGEPAFLSEEFQSNPEPDLEIVVCSGYGKNGALSVLQRSIRPQVVTTFELPGCHD 539

Query: 550  IWTVY----HKSSRG--HNADSSRMAAYDD---------EYHAYLIISLEARTMVLETAD 594
            +WTV      K S G    AD+ +    D          + H +LI+S E  TM+L+T  
Sbjct: 540  MWTVISGEDKKESEGGEKEADAEKKEEQDKTEPPLEDDAKKHGFLILSREDSTMILQTGQ 599

Query: 595  LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
             + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G    + L F P +     
Sbjct: 600  EIMELDTS-GFATQGPTVFAGNIGDNQYIIQVSPMGLRLLEG---VKQLHFIPVDL---- 651

Query: 655  GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT-----VSVQTPAAIESSK----- 704
                S ++  S+ADPYV++  ++G + + V    T       +++Q P     S+     
Sbjct: 652  ---GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDTYMGKTHRLALQKPQISTLSRVIALC 708

Query: 705  -----------KPVSSC---------------TLYHDKG----PEPWLRKTSTDAWLSTG 734
                       +  SSC               T+Y D       E  +    + A ++ G
Sbjct: 709  AYRDVSGMFTTENKSSCSSKEDLILRSNSETETVYQDLSNTVDDEEEMLYGESGASMAAG 768

Query: 735  V-----GEAIDGADGGPLDQGDI----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
                  G A     GG    G      + V+  E+G +EI+ +P++  VF V  F  G+ 
Sbjct: 769  KEEMSRGSAATAPPGGEGSAGKAEPSHWCVLIRENGVMEIYQLPDWRLVFLVKNFPVGQR 828

Query: 786  HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
             +VD+    +   S T+ +   EE T QG    +  + +V L   R     SRP+L  + 
Sbjct: 829  VLVDS----SSGQSATQGDGKKEEVTRQGEIPLVKEVALVALGNNR-----SRPYLL-VH 878

Query: 846  TDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
             +  +L Y+A+ +  + P+N  K                       +RF + P     RE
Sbjct: 879  VENELLVYEAFPYDQQQPQNNLK-----------------------VRFKKVPHSINFRE 915

Query: 904  ETPH---------GAPCQRITI---------FKNISGHQGFFLSGSRPCWCMVF-RERLR 944
            + P          G P + + +         F++ISG+ G F+ G  P W ++  R  LR
Sbjct: 916  KKPKLKKDKKAEGGGPEENVAVKSRISRFRYFEDISGYSGVFICGPSPHWMLITSRGGLR 975

Query: 945  VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKAT 1004
            +HP   DG I +F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+K IPL+ T
Sbjct: 976  LHPMTIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRK-IPLRCT 1034

Query: 1005 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY 1064
             H ++Y  E  +Y +  SV       + L   I +  G + +   +   + +     E++
Sbjct: 1035 VHFVSYHVESKVYAVCTSV-------KELCTRIPRMTGEEKEFETIERDERYINPLQEKF 1087

Query: 1065 EVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQG 1121
             ++++ P      W+T     I ++  E+   ++ V L +  T    +  +A GT  +QG
Sbjct: 1088 SIQLISPVS----WETIPNTRIDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCVLQG 1143

Query: 1122 EDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            E+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1144 EEVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQ 1181


>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oreochromis niloticus]
          Length = 1456

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 351/1242 (28%), Positives = 573/1242 (46%), Gaps = 228/1242 (18%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E + ++  S ++K R          LE V  + L GN+ S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSKADKSSDSKSR-------KEKLEQVASFSLFGNIMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            A +   GA     RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82   ASVQLVGAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                P+V+VDP+ RC  +LVYG ++++L   +     + DE   G G G  +    S++I
Sbjct: 135  NVHIPVVRVDPENRCAVMLVYGTKLVVLPFRKDT---LTDEQESGVGEGPKSSFLPSYII 191

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS         Y VSL+S   
Sbjct: 252  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305

Query: 352  -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
             +   P   +    + LD   + ++  D  ++S K G++ +LT++ DG R V+     K 
Sbjct: 306  GTTAFPLRVQDEVKLTLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA---PS 463
              SVLT+ + T+     FLGSRLG+SLL+++T     +    G + +  + + D    PS
Sbjct: 366  AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQETPAEEGKERQDKEKDKDKQEPPS 425

Query: 464  TKRLRRSSSD----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
             K+   SS++           L D V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 426  KKKRVESSTNWTVCVILDFFVLSDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 481

Query: 513  GPLKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGC 547
            GP  + S G         + N +        +G  K     ++            ELPGC
Sbjct: 482  GPCANASMGEPAFLSEEFQSNPEPDLEVVVCSGYGKNGALSVLQRSIRPQVVTTFELPGC 541

Query: 548  KGIWTVYHKSSRGHNADSSRMAAY------------DDEYHAYLIISLEARTMVLETADL 595
              +WTV     +    D   +               D + H +LI+S E  TM+L+T   
Sbjct: 542  HDMWTVISSDVKEDKTDKEEVEKEEEEKKTEPPLEDDAKKHGFLILSREDSTMILQTGQE 601

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G    + L F P +      
Sbjct: 602  IMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGLRLLEG---VRQLHFIPVDL----- 652

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDP-----STCTVSVQTPAAIESSKKPVSSC 710
               S ++  S+ADPYV++  ++G + + V         T  +++Q P  I S  + ++ C
Sbjct: 653  --GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDSYMGKTHRLALQKP-QIPSQSRVITLC 709

Query: 711  -------------------------------TLYHDKG-----PEPWLRKTSTDAWLSTG 734
                                           T+ HD        E  L   S  +  +T 
Sbjct: 710  AYRDVSGMFTTENKVSCSIKEDTIRSQSEAETIIHDMSNTVDDEEEMLYGDSNAS--ATP 767

Query: 735  VGEAIDGADGGPLDQGDI----------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGR 784
              E I+ +   P   G            + ++  E+G +EI+ +P++  VF V  F  G+
Sbjct: 768  AKEDINRSFVAPTTSGSEATSSKAEPTHWCMIIRENGVMEIYQLPDWRLVFLVKNFPVGQ 827

Query: 785  THIVDTYMREALKDSETEINSSSEEGT-GQGRKENIHSMK----VVELAMQRWSAHHSRP 839
              +VD+              SS +  T G+G+KE +        V E+A+     +HS+P
Sbjct: 828  RVLVDS--------------SSGQSATQGEGKKEEVTRQGEIPLVKEVALVSLGNNHSKP 873

Query: 840  FLFAILTDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
            +L  +  +  +L Y+A+ +  + P+N  K                       +RF + P 
Sbjct: 874  YLL-VHVEQELLIYEAFQYDQQQPQNNLK-----------------------VRFKKVPH 909

Query: 898  DAYTRE----------------ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCMVF 939
            +   RE                E   G   +  R   F++ISG+ G F+ G  P W +V 
Sbjct: 910  NINFREKKSKLKKDKKAESSATEESSGVKGRIARFRFFEDISGYSGVFICGPSPHWMLVT 969

Query: 940  -RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             R  LR+HP   DGSI +F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+K 
Sbjct: 970  SRGALRLHPMTIDGSIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRK- 1028

Query: 999  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1058
            IPL+ T H ++Y  E  +Y +  SV   +P  +     I +  G + +   +   + +  
Sbjct: 1029 IPLRCTVHYVSYHVESKVYAVCTSVK--EPCTR-----IPRMTGEEKEYEVIERDERYIH 1081

Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTA 1117
               E++ ++++ P        TR  I ++  E+   ++ V L +  T    +  +A GT 
Sbjct: 1082 PQQEKFSIQLISPVSWEAIPNTR--IDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTC 1139

Query: 1118 YVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
             +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1140 LMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQ 1181


>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
            chinensis]
          Length = 1469

 Score =  417 bits (1071), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 348/1233 (28%), Positives = 558/1233 (45%), Gaps = 197/1233 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV  A   ++YV R+  +    +KN   T+ +   +      LELV  +   GNV S+
Sbjct: 29   NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82   ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
                P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 135  NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232  SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
             S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187  PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290  TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
             T + HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247  ITQRVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTAG 306

Query: 349  LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
              +     +    + LD AHA ++  D  ++S K G++ +LT+V DG R V+     K  
Sbjct: 307  TTAFPLRTQDGVRLTLDCAHAAFISYDKMVISLKGGEIYVLTLVTDGMRSVRAFHFDKAA 366

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367  ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424

Query: 468  RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFS-- 519
              +     SS   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + +  
Sbjct: 425  DPTGGWSGSSTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480

Query: 520  ---------------YGLRINAD--ASATGI---------SKQSNYELV----------- 542
                           YGL   A+     TG+         S + + E+V           
Sbjct: 481  EPAFLSEEVGTGVAEYGLIGQAEGWGRRTGLTPAPVQFQNSPEPDLEIVVCSGYGKNGAL 540

Query: 543  ---------------ELPGCKGIWTVY-------HKSSRGHNADSSRMAAYDD--EYHAY 578
                           ELPGC  +WTV         ++ +    +  R A  +D    H +
Sbjct: 541  SVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKDEEETPKAEGTEQPRAAEAEDGVRRHGF 600

Query: 579  LIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY 638
            LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G  
Sbjct: 601  LILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG-- 657

Query: 639  MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-----TVS 693
                L F P +         + ++  ++ADPYV++  ++G + + +    T       ++
Sbjct: 658  -VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFLLKSDTYGGRHHRLA 709

Query: 694  VQTPAAIESSKKPVSSCTLYHD-------------KGPEPWLRKTSTDAWLSTGVGEAID 740
            +  P     SK  V +  LY D                EP  R +     L       +D
Sbjct: 710  LHKPPLHHQSK--VITLCLYRDVSGMFTTESRLGGARDEPGARGSCEVEGLGAETSPTVD 767

Query: 741  G------ADGG----------------PLDQGDI--------YSVVCYESGALEIFDVPN 770
                    D G                P D+           + ++  E+G +E++ +P+
Sbjct: 768  DEEEMLYGDSGSLFSPSKEETRRSSQPPADRDPAPFRAEPTHWCLLVRENGTMEMYQLPD 827

Query: 771  FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ 830
            +  VF V  F  G+  +VD+    +     T+  +  EE T QG    +  + +V L   
Sbjct: 828  WRLVFLVKNFPVGQRVLVDS----SFGQPATQAEARKEEATRQGELPLVKEVLLVALG-- 881

Query: 831  RWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL 890
               +  SRP+L  +  D  +L Y+A+    P ++      +         N++    +  
Sbjct: 882  ---SRQSRPYLL-VHVDQELLLYEAF----PHDSQLGQGNLKVRFKKVPHNINFREKKLK 933

Query: 891  RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQL 949
               +      T E         R   F++I G+ G F+ G  P W +V  R  LR+HP  
Sbjct: 934  PSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMG 993

Query: 950  CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
             DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD  WPV+K IPL+ T H + 
Sbjct: 994  IDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVA 1052

Query: 1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
            Y  E  +Y +  S     P  +     I +  G + +   +   D +     E + ++++
Sbjct: 1053 YHVESKVYAVATSTNA--PCTR-----IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLI 1105

Query: 1070 EPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAA 1126
             P      W+    A I ++  E+   ++ V+L +  T    +  +A GT  +QGE+V  
Sbjct: 1106 SPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTC 1161

Query: 1127 RGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            RGR+L+         P   +    +  L+   Q
Sbjct: 1162 RGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1194


>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
            [Crassostrea gigas]
          Length = 1369

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 333/1132 (29%), Positives = 530/1132 (46%), Gaps = 168/1132 (14%)

Query: 100  SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
             +E +  + L GN+ S+  +   GA     RDS++L+F +AK+SV+E+D   H L+ TS+
Sbjct: 4    KMECLATFTLFGNIMSMKYVKLPGA----LRDSLLLSFSEAKLSVVEYDPGTHDLQTTSL 59

Query: 160  HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
            H FE P    +K G  +    P V+VDP GRC  +LVYG  M+IL   +      GD   
Sbjct: 60   HFFEEPS---MKGGFFTNYCIPEVRVDPDGRCAAMLVYGTHMVILPFRRDVMVEEGD--- 113

Query: 220  FGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
                G   + I SS++I+LR+ D K  +VKDF F+HGY EP + IL E   TWAGR + +
Sbjct: 114  -NLAGTSKSPILSSYIIDLRNFDEKIINVKDFQFLHGYYEPTVFILFEPLQTWAGRTAVR 172

Query: 278  HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS 337
              TC I A+S++   K HP+IWS  +LP D  ++LAVP PIGGV+++  N++ Y +QS  
Sbjct: 173  ADTCSIVAISLNLQEKVHPVIWSLGSLPFDCCQVLAVPRPIGGVIIIAVNSLLYLNQSVP 232

Query: 338  CALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
                   Y VSL+S    S   P   +    + LD   A ++  D  +LS K G+L +LT
Sbjct: 233  ------PYGVSLNSISAQSTLFPLRVQEGVRIALDCCQAAFMSYDKIVLSLKGGELYVLT 286

Query: 391  VVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
            +V DG R V+  +  K+  SVLTS +    +   FLGSRLG+SLL+++T  +   + +  
Sbjct: 287  LVVDGMRSVRSFNFDKSAASVLTSCMCICEDGFLFLGSRLGNSLLLKYTEKASECLENGD 346

Query: 450  LKEEFGDIEADAPSTKRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAV 505
            L ++    + D P+ K+ +   S  +   V    N  +L +YGSA N T +   +++F V
Sbjct: 347  LDKK----KEDEPAAKKKKVEGSTEIASDVSQIENLYDLEVYGSAENPTSTTITSYTFEV 402

Query: 506  RDSLVNIGPL------------KDFSYGLRINADASAT-GISKQSNYELV---------- 542
             D++ NIGP             ++FS     + +   T G  K     ++          
Sbjct: 403  CDNIWNIGPCGNIVMGEPAFLSEEFSSCEDPDIEMVMTSGYGKNGALSVLQRSIRPQVVT 462

Query: 543  --ELPGCKGIWTVYH--KSSRGHNADSSRMAAYDDEY---HAYLIISLEARTMVLETADL 595
              ELPGC  +WTV       +  + ++S     DD     H++LI+S    +M+LET   
Sbjct: 463  TFELPGCLDMWTVKSLVPKEKSEDKENSMEDDSDDNIEGGHSFLILSRSDSSMILETGQE 522

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+  S  +  Q  TI AGN+ G R ++QV +   R+L+G    Q +       ++G  
Sbjct: 523  MNELDHS-GFSTQTTTIFAGNIGGDRYIVQVSDTSLRLLEGVRQIQHIPL-----DTG-- 574

Query: 656  SENSTVLSVSIADPYVLLGMSDGSI--------------RLLVGDPSTCTVSVQTPAAIE 701
               S V+  S+ADPY++L   +G I              RL+VG PS   +S  +   + 
Sbjct: 575  ---SPVVQCSLADPYIVLLTQEGQILMFTLRTESVGLGVRLVVGKPS---ISQHSKVEVI 628

Query: 702  SSKKPVSSCTLYHDK------GPEPWLRKTSTDAWLSTGV-----------GEAIDGADG 744
            S+ K VS      ++       P+    KT T+   S              GE       
Sbjct: 629  SAYKDVSGLFTCMNQMEDVQVTPDTKATKTVTERSFSIDAKTADEEDELLYGETESNVFN 688

Query: 745  GPLDQGDI-------------------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
               + G                     + ++C E+G LEI+ +P++  V+ V  F  G+ 
Sbjct: 689  SSFNMGQTAEMESPTKEKKQTEAKPTYWLLLCRENGVLEIYSIPDYKKVYYVKNFPMGQK 748

Query: 786  HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
             +VD+           ++      G  Q  K N     + EL M       SRP L A +
Sbjct: 749  LLVDS----------VQVTDKLSSGERQ-EKVNAECPALKELLMVGLGYKDSRPHLLARV 797

Query: 846  TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET 905
             D        Y++E       S D     R   + +    R +  +  +   + + +EE 
Sbjct: 798  ED------DLYIYEAFSYPQSSIDNHLKLRFKKIQHDLILREKRSKSKKKDPEEFQKEEK 851

Query: 906  PHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVN 964
              G    ++  FK+++G+ G F+ G+ P W  V  R  LR+HP   DG +  F+  HN+N
Sbjct: 852  KVG----KMRYFKDVAGYSGVFVCGAYPHWIFVTSRGSLRIHPMGIDGPVWCFSEFHNIN 907

Query: 965  CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP 1024
            C HGF+Y    G L+I  LP+  TYD  WPV+KV PL+ TPH + Y  E  +Y ++ S P
Sbjct: 908  CPHGFLYFNKMGELRISVLPTHLTYDAPWPVRKV-PLRCTPHFVAYHFENKIYAVVTSTP 966

Query: 1025 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQT--R 1081
             +   N++     +      I+       D    Y T+  + +++  P      W+    
Sbjct: 967  EI--CNKLPKTTTEDREWDTIEK------DERFIYPTIPRFTLQLYSPTS----WEVVPN 1014

Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLL 1132
              I  +  E+ ++++ + L +  T    ++ + +GT    GE+V +RGRV++
Sbjct: 1015 TKIECEEWEHVVSMKTIRLRSEETLSGFKSYIVMGTNLSLGEEVTSRGRVII 1066


>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
            rerio]
          Length = 1449

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 333/1186 (28%), Positives = 558/1186 (47%), Gaps = 189/1186 (15%)

Query: 94   DGIS-AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
            DG S    LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H
Sbjct: 58   DGKSRKEKLEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTH 113

Query: 153  GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
             L+  S+H FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L   +    
Sbjct: 114  DLKTLSLHYFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT-- 168

Query: 213  LVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTW 270
             + DE     G G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW
Sbjct: 169  -LADEQEGIVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTW 227

Query: 271  AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
             GRV+ +  TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ 
Sbjct: 228  PGRVAVRQDTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLL 287

Query: 331  YHSQSAS-CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
            Y +QS     ++LN+      +    P+    + LD + A+++ +D  ++S K G++ +L
Sbjct: 288  YLNQSVPPFGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVL 347

Query: 390  TVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
            T++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T     + +  
Sbjct: 348  TLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEE 407

Query: 449  GLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEELSLYGS-ASNNTESAQKT 500
            G + E  + +   P  K+ R  S+ A       L D ++  E+ +YGS A + T+ A  T
Sbjct: 408  GKENEEKEKQ---PPNKKKRVDSNWAGCPGKGNLPDELD--EIEVYGSEAQSGTQLA--T 460

Query: 501  FSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV----- 542
            +SF V DS++NIGP    S G         + N +        +G  K     ++     
Sbjct: 461  YSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSIR 520

Query: 543  -------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLIISL 583
                   ELPGC  +WTV +   +         G + +  +       D + H +LI+S 
Sbjct: 521  PQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSR 580

Query: 584  EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
            E  TM+L+T   + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G      L
Sbjct: 581  EDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQL 636

Query: 644  SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------------RLLVGDPS 688
             F P +         S ++  S+ADPYV++  ++G +               RL +  P 
Sbjct: 637  HFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQKPQ 689

Query: 689  TCT------------------------------VSVQTPAAIESSKKPVSSCT------L 712
              T                              ++++T +  E+  + +S+        L
Sbjct: 690  IHTQSRVITLCAYRDVSGMFTTENKVSFLAKEEIAIRTNSETETIIQDISNTVDDEEEML 749

Query: 713  YHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN 772
            Y +  P     K  +    +           G    +   + ++  E+G +EI+ +P++ 
Sbjct: 750  YGESNPLTSPNKEESSRGSAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLPDWR 809

Query: 773  CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
             VF V  F  G+  +VD+    +   S T+     EE T QG   +I  +K  E+A+   
Sbjct: 810  LVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTRQG---DIPLVK--EVALVSL 860

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              +HSRP+L A + +  +L Y+A+ ++  +                    + S L+ +RF
Sbjct: 861  GYNHSRPYLLAHV-EQELLIYEAFPYDQQQ--------------------AQSNLK-VRF 898

Query: 893  SRTPLDAYTREET--------PHG---------APCQRITIFKNISGHQGFFLSGSRPCW 935
             + P +   RE+         P G             R   F++ISG+ G F+ G  P W
Sbjct: 899  KKMPHNINYREKKVKVRKDKKPEGQGEDTLGVKGRVARFRYFQDISGYSGVFICGPSPHW 958

Query: 936  CMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
             +V  R  +R+HP   DG+I +F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WP
Sbjct: 959  MLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWP 1018

Query: 995  VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
            V+K IPL+ T H ++Y  E  +Y +  SV   +P  +     I +  G + +   +   +
Sbjct: 1019 VRK-IPLRCTVHYVSYHVESKVYAVCTSVK--EPCTR-----IPRMTGEEKEFETIERDE 1070

Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLA 1113
             +     +++ ++++ P        TR  + ++  E+   ++ V L +  T    +  +A
Sbjct: 1071 RYIHPQQDKFSIQLISPVSWEAIPNTR--VDLEEWEHVTCMKTVALKSQETVSGLKGYVA 1128

Query: 1114 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1129 LGTCLMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQ 1174


>gi|449477808|ref|XP_004155129.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Cucumis sativus]
          Length = 643

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 188/255 (73%), Positives = 220/255 (86%), Gaps = 1/255 (0%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
           MSFAAY+MMHWPTGI NC S +ITHSRAD+VP +    +++LDS+   +R IGPVPNLVV
Sbjct: 1   MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTS-HSDDLDSDWHPRRDIGPVPNLVV 59

Query: 61  TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
           TA NV+E+YVVRV EEG +ESK+SGE KR  +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60  TAGNVLEVYVVRVLEEGGRESKSSGEVKRGGIMDGVSWASLELVCHYRLHGNVESMAILS 119

Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120 SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179

Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
           P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G  SAR+ESS++INLRD
Sbjct: 180 PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239

Query: 241 LDMKHVKDFIFVHGY 255
           LD+KHVKDF+FVH Y
Sbjct: 240 LDVKHVKDFVFVHVY 254



 Score =  367 bits (943), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 189/258 (73%), Positives = 208/258 (80%), Gaps = 26/258 (10%)

Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
           GDIE DA + KR+RRSSSDALQDMV G+ELSLYGSA+NNTESAQK FSFAVRDSL+NIGP
Sbjct: 342 GDIEVDAHTAKRMRRSSSDALQDMVGGDELSLYGSAANNTESAQKIFSFAVRDSLINIGP 401

Query: 515 LKDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCK 548
           LKDFSYGLRINAD +ATGI+KQSNYELV                          ELPGCK
Sbjct: 402 LKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEMITEVELPGCK 461

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
           GIWTVYHK++RG  ADSSRM   DDEYHAYLIISLEARTMVL T +LLTEVTESVDYFV 
Sbjct: 462 GIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIISLEARTMVLVTGELLTEVTESVDYFVH 521

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
           GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQDL+   + +ESG+ SE  TVLS SI+D
Sbjct: 522 GRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLNLVVNGNESGNASEGCTVLSASISD 581

Query: 669 PYVLLGMSDGSIRLLVGD 686
           PYVLL M+DGSIRLLVG+
Sbjct: 582 PYVLLTMTDGSIRLLVGE 599


>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
 gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
          Length = 1420

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 348/1270 (27%), Positives = 568/1270 (44%), Gaps = 237/1270 (18%)

Query: 3    FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
            +A YK  H PTG+  C +     +R                             NLVV  
Sbjct: 2    YAIYKETHPPTGVEFCVNCHFYSARES---------------------------NLVVAG 34

Query: 63   ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
               + ++ +  Q+EGS  +++ G + +R          LELV  + L GN+ESL  +   
Sbjct: 35   TTEVRVFRLCYQQEGSSSAESGGSSLKR---------KLELVGQHSLFGNIESLHAIRLA 85

Query: 123  GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
            G      RDS++++F+DAK+S++++D   H ++  S+H FE  +   +K    +  R P+
Sbjct: 86   G----NTRDSLLMSFKDAKLSIVDYDPGKHDIKTRSLHFFEDEK---IKSHCLAQDRAPV 138

Query: 183  VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
            V++DP+ RC  +L YG  +++L   Q G      +D+  S       +  S++I+++++D
Sbjct: 139  VRIDPERRCAVMLAYGTHLVVLPFRQEGGIDDTAQDSIISSSD-RPPVLPSYIIDVKEID 197

Query: 243  MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
             K  ++ D  F+HGY EP ++IL+E   TWAGR++ ++ TC + A+S++ + K HP++W 
Sbjct: 198  EKTCNILDIQFLHGYYEPTLLILYEPLKTWAGRLAMRNDTCALVAVSLNMSQKAHPVVWQ 257

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE------ 354
               LP D   ++ VP PIGGVLV   N + Y +QS         Y VS++S  E      
Sbjct: 258  LSCLPFDCIYVMPVPKPIGGVLVCCMNALLYLNQSVP------PYGVSVNSIGENSTVFP 311

Query: 355  -LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              P+   ++ L+ ++A ++ ND  + S K G++ ++T++ DG R V+     KT  SVLT
Sbjct: 312  LKPQKGVTITLEGSNAIFIANDKLVFSLKGGEIYVVTLIADGVRSVRNFVFDKTAASVLT 371

Query: 413  SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
            S +   G+   FLGSRLG+SLLV++T       +  G +     ++ D  + +R +  + 
Sbjct: 372  SCVCECGDGYLFLGSRLGNSLLVKYT--EKPQDIVYGTENNAQSMQCD--NIERWQILNG 427

Query: 473  DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----LRIN--- 525
              L  + + +EL +YG A         +++F V DSL+NIGP      G    L ++   
Sbjct: 428  SLLLIVDDLDELEVYG-AQQEAGVELTSYTFEVCDSLLNIGPCSCMDIGEPAFLSVSSYF 486

Query: 526  ADA--------SATGISKQSNYELV------------ELPGCKGIWTVYHKSSRG----- 560
            ADA        S +G  K     ++            ELPGC  +WTV+ K  +      
Sbjct: 487  ADAQELDLEVVSCSGYGKNGALTVLQRSIRPQVVTTFELPGCTDMWTVFSKDQKKGAQTN 546

Query: 561  --HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
              H   S      +++YH++LI+S E  +M+L+T   + EV +S  +  Q  TI AGN  
Sbjct: 547  AIHRYPSQPCTQGNEKYHSFLILSREDSSMILKTEQEIMEVDQS-GFSTQCATIYAGNFG 605

Query: 619  GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
                ++QV   G R+L+G    Q +       +SG     S ++  S+ DPY +L M+DG
Sbjct: 606  NGSYILQVTPLGVRLLEGVNQLQHIPM-----DSGL----SNIVWCSVCDPYAVLLMADG 656

Query: 679  SIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------------KGPEP 720
            S+ L+  +   S   ++V  P+  +SSK  V +C  Y D                K P P
Sbjct: 657  SVILIEFIKSASGPKLTVSRPSLSQSSK--VCACCTYKDMSGLFTTENSNLEEVSKVPSP 714

Query: 721  WLRKTS----------------------TDAWLSTGVGEAIDGADGGPLD------QGDI 752
                T+                      T   L+    E        P++      Q   
Sbjct: 715  KPEMTAPPRQEKESLTIDEEDELLYGGDTSLTLTFEPPEPSKAESAAPVEVFEEPLQPSY 774

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSS-SEEGT 811
            + +VC E+G +EI+ +P F  VF V  F      IVD+       DS     SS SEE  
Sbjct: 775  WCLVCRENGVMEIYSLPGFTRVFFVKNFSKAPRVIVDS------GDSGASTQSSVSEE-- 826

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
                     S+ V E+ +      + R  L A++ D  +L Y+A+ +   E         
Sbjct: 827  --------ESLNVREVLLTGLGYKNRRATLVAVM-DQDLLIYEAFSYPTVEGH------- 870

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-------------CQRITIFK 918
                             NLRF +   +   RE+ P   P                + +F 
Sbjct: 871  ----------------LNLRFKKLQHNIQIREKKPKQEPKNDSETKSGLDPKVAMLRVFN 914

Query: 919  NISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
            +IS + G F+ GS P W  V  R     HP   DG +  F   HNVNC  GF+Y  ++G 
Sbjct: 915  DISSYSGIFVCGSYPFWIFVTNRGAFHWHPMSIDGPVTCFAAFHNVNCPKGFLYFNTRGE 974

Query: 978  LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
            L+I  LP+  +YD+ WPV+KV PL+ TPH ++Y  E   Y ++ S    +P  ++  +  
Sbjct: 975  LRISVLPTHLSYDSPWPVRKV-PLRYTPHMVSYNRESKTYAIVTSEQ--EPCKKIPRVTA 1031

Query: 1038 DQEVGHQIDNHNLSSV-DLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1095
            +       D   + ++ D    Y + E + ++++ P      W+    IP    +     
Sbjct: 1032 E-------DKEFVDTIRDARFIYPSTERFVLQLISPIS----WE---VIPNTRHDLDEWE 1077

Query: 1096 RVVTLFNTTTKENET------LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSG 1149
             V T+ N      ET       + +GT  + GE++A RGR+L+F        P   +   
Sbjct: 1078 HVTTMKNLLLHSEETHTGRKGFICVGTTQLYGEEIAVRGRILIFDIIEVVPEPGQPLTKN 1137

Query: 1150 SYGPLFSSVQ 1159
             +  L+   Q
Sbjct: 1138 KFKLLYEKEQ 1147


>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1 [Camponotus
            floridanus]
          Length = 1418

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 339/1201 (28%), Positives = 549/1201 (45%), Gaps = 188/1201 (15%)

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            LVV  ANVI ++ +    + ++  K +     ++         LE +  Y LHGN+ S+ 
Sbjct: 30   LVVAGANVIRVFRLIPDIDMTRREKYTENRPPKM--------KLECLAQYTLHGNIMSMQ 81

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
             +   G+    +RDS++L+F DAK+SV+E+D  IH LR  S+H FE  E   +K G  + 
Sbjct: 82   AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
               P+V+VDP+GRC  +L+YG ++++L   +  S  + D D   S    S+    I SS+
Sbjct: 135  HHIPIVRVDPEGRCAIMLIYGRKLVVLPFRKDPS--LDDGDLLDSAKLTSSNKTPILSSY 192

Query: 235  VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
            +I L+ L+  M +V D  F++GY EP ++IL+E   T++GR++ +  TC + A+S++   
Sbjct: 193  MIVLKTLEEKMDNVIDLQFLYGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252

Query: 293  KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
            + HP+IWS  NLP D Y+++ V  P+GG L++  N++ Y +QS     ++LN+ A +  +
Sbjct: 253  RVHPIIWSVSNLPFDCYQVVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTN 312

Query: 352  SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
                P+    + L+ +   ++  D  ++S K+G+L +L++  D  R V+     K   SV
Sbjct: 313  FPLKPQEGVKMSLEGSQVAFISGDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372

Query: 411  LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRR 469
            LTS +    ++  FLGSRLG+SLL++FT     ++ +    E    + E++    K+ ++
Sbjct: 373  LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENESEETPAKKAKQ 432

Query: 470  S------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
                   +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S G  
Sbjct: 433  DFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEP 490

Query: 522  ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
                     N D      + +G  K     ++            ELPGC+ +WTV     
Sbjct: 491  AFLSEEFLHNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI---- 546

Query: 559  RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
             G   +  ++ A  +  HA+LI+S E  TM+L+T   + EV +S  +  QG T+ AGNL 
Sbjct: 547  -GTLNNDEQVKAEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFAGNLG 604

Query: 619  GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
              R ++QV + G R+L G    Q +                 ++  S ADPYV L   DG
Sbjct: 605  ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLSEDG 654

Query: 679  SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP--EPWLRKTSTDAWLST--G 734
             + LL       T  +    A    +  + +   Y D        L +T+ D  +     
Sbjct: 655  QVVLLTLREVRGTARLHAQPANLLFRPQIEALCTYRDVSGIFTTQLSETTDDEQVEEEHN 714

Query: 735  VGEA-----IDGADG-----------------GPLD----------------QGDIYSVV 756
            V E      ID  D                   PLD                +   + +V
Sbjct: 715  VEEPSLLSNIDNEDDLLYGDAPAFQMPAPSYQKPLDGVSKKAPWWQRHLQEIKPTYWLLV 774

Query: 757  CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
              +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E        
Sbjct: 775  YRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQSAPVNEIPNPE-------- 826

Query: 817  ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
                 M+V E+ M     H +RP L   L D  +  YQAY +  P+   K          
Sbjct: 827  -----MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGYLK---------- 868

Query: 877  LSVSNVSASRLRNLRFSRTP--LDAYTREE-TPHGAPCQRITI---FKNISGHQGFFLSG 930
                     R + L     P  L    +EE  P  A   RI +   F NI+G+ G F+  
Sbjct: 869  --------LRFKKLEHGIIPGRLSPKPKEEDMPMNASETRICMMRYFSNIAGYNGVFICC 920

Query: 931  SRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
              P W  +  R  LR HP   DG I +F   +NVNC  GF+Y   +  L+IC LP+  +Y
Sbjct: 921  DYPHWIFLTGRGELRTHPMGIDGPITSFAAFNNVNCPQGFLYFNRKEELRICVLPTHLSY 980

Query: 990  DNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
            D  WPV+KV PL+ TPH +TY  E   Y +I S+   +PL                  + 
Sbjct: 981  DAPWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------YR 1022

Query: 1050 LSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVT 1099
             +  D  + +T EE   R L P +        +   W+T     I ++  E+   ++ V+
Sbjct: 1023 FNGED--KEFTEEERPERFLYPSQEQFSIVLFSPVSWETIPNTKIELEQWEHVTCLKNVS 1080

Query: 1100 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSV 1158
            L    T+   +  + +GT Y  GED+ +RGR+L+F        P   +    +  +++  
Sbjct: 1081 LAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKE 1140

Query: 1159 Q 1159
            Q
Sbjct: 1141 Q 1141


>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Bombus terrestris]
          Length = 1417

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 331/1202 (27%), Positives = 542/1202 (45%), Gaps = 190/1202 (15%)

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            L V  AN+I I+ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30   LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
             ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82   AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
               P+V+VDP+GRC  +L+YG ++++L   +  S  + D D   +    S +  I SS++
Sbjct: 135  HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192

Query: 236  INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
            I L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   +
Sbjct: 193  IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 294  QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  
Sbjct: 253  VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306

Query: 354  EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
            E        P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K
Sbjct: 307  ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366

Query: 406  TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPST 464
               SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   +    + E +    
Sbjct: 367  AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426

Query: 465  KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
            K++++       +SD L D+ + EEL +YGS      S Q T + F V DSL+NIGP  +
Sbjct: 427  KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485

Query: 518  FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
             S G         +  S+  + ELV                          ELPGC+ +W
Sbjct: 486  ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544

Query: 552  TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
            TV      G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG T
Sbjct: 545  TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598

Query: 612  IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
            I AGNL   R ++QV + G R+L G    Q +                 ++  S ADPYV
Sbjct: 599  IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648

Query: 672  LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
             L   DG + LL       T  +   AA    +  + +   Y                D+
Sbjct: 649  TLLSEDGQVMLLTLREGRGTAKLHAQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708

Query: 717  GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
             PE                           +   T +    S G+ +          +  
Sbjct: 709  APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGISKRTPWWQKHLQEIK 768

Query: 751  DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
              Y ++ Y +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E 
Sbjct: 769  PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                        M+V E+ M     H +RP L   L D  +  YQAY +  P+   K   
Sbjct: 828  ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLK--- 869

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
                 + L    +       LR    P+   TR        C  +  F NI+G+ G F+ 
Sbjct: 870  --LRFKKLDHGIIPGQLKPKLRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919

Query: 930  GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
               P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 920  SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979

Query: 989  YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1048
            YD  WPV+KV PL+ TPH +TY  E   Y +I S+   +PL                  +
Sbjct: 980  YDAPWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------Y 1021

Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVV 1098
              +  D  + +T EE   R + P +        +   W+T     I +   E+   ++ V
Sbjct: 1022 RFNGED--KEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNV 1079

Query: 1099 TLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSS 1157
            +L    T+   +  + +GT Y  GED+ +RGR+L+F        P   +    +  +++ 
Sbjct: 1080 SLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAK 1139

Query: 1158 VQ 1159
             Q
Sbjct: 1140 EQ 1141


>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Bombus impatiens]
          Length = 1417

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 331/1202 (27%), Positives = 541/1202 (45%), Gaps = 190/1202 (15%)

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            L V  AN+I I+ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30   LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
             ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82   AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
               P+V+VDP+GRC  +L+YG ++++L   +  S  + D D   +    S +  I SS++
Sbjct: 135  HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192

Query: 236  INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
            I L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   +
Sbjct: 193  IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 294  QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  
Sbjct: 253  VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306

Query: 354  EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
            E        P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K
Sbjct: 307  ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366

Query: 406  TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPST 464
               SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   +    + E +    
Sbjct: 367  AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426

Query: 465  KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
            K++++       +SD L D+ + EEL +YGS      S Q T + F V DSL+NIGP  +
Sbjct: 427  KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485

Query: 518  FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
             S G         +  S+  + ELV                          ELPGC+ +W
Sbjct: 486  ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544

Query: 552  TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
            TV      G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG T
Sbjct: 545  TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598

Query: 612  IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
            I AGNL   R ++QV + G R+L G    Q +                 ++  S ADPYV
Sbjct: 599  IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648

Query: 672  LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
             L   DG + LL       T  +   AA    +  + +   Y                D+
Sbjct: 649  TLLSEDGQVMLLTLREGRGTAKLHVQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708

Query: 717  GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
             PE                           +   T +    S GV +          +  
Sbjct: 709  APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGVSKRTPWWQKHLQEIK 768

Query: 751  DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
              Y ++ Y +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E 
Sbjct: 769  PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                        M+V E+ M     H +RP L   L D  +  YQAY +  P+   K   
Sbjct: 828  ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLK--- 869

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
                 + L    +        R    P+   TR        C  +  F NI+G+ G F+ 
Sbjct: 870  --LRFKKLDHGIIPGQLRPKPRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919

Query: 930  GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
               P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 920  SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979

Query: 989  YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1048
            YD  WPV+KV PL+ TPH +TY  E   Y +I S+   +PL                  +
Sbjct: 980  YDAPWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------Y 1021

Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVV 1098
              +  D  + +T EE   R + P +        +   W+T     I +   E+   ++ V
Sbjct: 1022 RFNGED--KEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNV 1079

Query: 1099 TLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSS 1157
            +L    T+   +  + +GT Y  GED+ +RGR+L+F        P   +    +  +++ 
Sbjct: 1080 SLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAK 1139

Query: 1158 VQ 1159
             Q
Sbjct: 1140 EQ 1141


>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
            putative [Pediculus humanus corporis]
 gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
            putative [Pediculus humanus corporis]
          Length = 1409

 Score =  391 bits (1004), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 336/1189 (28%), Positives = 540/1189 (45%), Gaps = 170/1189 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LVV   N++ ++ +    +    +K    T+RR          LE +  + L  NV S+
Sbjct: 29   SLVVAGKNILRVFQLIPDID---PTKRDAYTERRP-----PKMKLECLSSFSLFANVMSM 80

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  G+     RD+++L+F +AK+ V+E+D   H LR  S+H FE  +   +K G  +
Sbjct: 81   QAVSLAGSS----RDALLLSFREAKLCVVEYDPDSHDLRTLSLHYFEEED---MKGGWTN 133

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSG-GGFSARIES 232
                P V+VDP+GRC  +LVYG +++IL   + S+     +   D   S      A + S
Sbjct: 134  HYDIPYVRVDPEGRCAAMLVYGRKLVILPFRRESKLDDPDIALLDPHSSSVATAKAPVLS 193

Query: 233  SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
            S+ I LR++D  +++V D  F++GY EP ++IL+E   T+AGR++ +  TC + A+S++ 
Sbjct: 194  SYTITLREIDEKLENVIDIQFLYGYYEPTLLILYEPLKTFAGRIAVRSDTCAMIAVSLNI 253

Query: 291  TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSL 349
              + HP IWS  NLP +  + + VP P+GG L+   N + Y +QS     +++N+ A + 
Sbjct: 254  QQRVHPAIWSVGNLPFNCTQAIPVPKPLGGTLIFSVNALIYLNQSIPPFGVSVNSIAENS 313

Query: 350  DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNP 408
             + Q   +    + L+ + AT++ +D  +LS KTG+L +L+++ D  R V+     K   
Sbjct: 314  TNFQLKIQEGVKITLEGSQATFISHDRLVLSLKTGELYVLSLLADNIRSVRGFHFDKAAA 373

Query: 409  SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
            SVLT+ +    +   FLGSRLG+SLL++FT    +      L E     E   PS +R +
Sbjct: 374  SVLTTCLCVCEDKYLFLGSRLGNSLLLRFTEKESSEAPIITLDESIR--EVPVPSKRRRQ 431

Query: 469  RSSSDAL----QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI 524
             +  D +     D+ + +EL +YG+   ++     +F F V DSL+NIGP  + S G   
Sbjct: 432  DALGDWMASDVADIRDLDELEVYGTQEASSSVQITSFMFEVCDSLLNIGPCGNVSMGEPA 491

Query: 525  NADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSS 558
                  +  ++  + ELV                          ELPGC  +WTV     
Sbjct: 492  FLSEEFSN-NRDPDLELVTTSGHGKNGAICVLQRTIRPQVVTTFELPGCLDMWTVI---- 546

Query: 559  RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
             G  +DS    A DD  HA+LI+S +  TM+L+T   + EV  S  +  QG TI AGNL 
Sbjct: 547  -GPQSDSGPTQAEDDISHAFLILSQKDSTMILQTGQEINEVDHS-GFNTQGPTIFAGNLA 604

Query: 619  GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
              + ++QV + G R+L G    Q +               S+V+  S ADPYV L   DG
Sbjct: 605  SNKYIVQVSKAGVRLLRGLEQIQHIPL----------DLGSSVVHASTADPYVALLTEDG 654

Query: 679  SIRLLVGDPS--TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL----- 731
             + LL    S     +SV  P  I ++ +    CT     G    L   +T+  L     
Sbjct: 655  QVVLLTLRESRGQGRLSVFKP-TIPTNPRVSKICTYRDVSG----LFTLTTEEELQNATF 709

Query: 732  ---STGVGEAIDGAD----GG--------------------PLDQGDIYS---------V 755
               S  + +  D  D    GG                    P  +   YS          
Sbjct: 710  KSDSKNMKKEADDEDEMLYGGSEVKFQLLPITNTNEPSPPRPFVRWKKYSQEIKPNYWMF 769

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
            V  E+G L+I+ +P+F   F + +   G   + D             ++ +   G     
Sbjct: 770  VLRETGTLDIYSLPDFRPSFQIRRIGQGHRVLYDV------------LDMAQTSGMDGSD 817

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTS 874
               IH + VV L       H  R  +  + T+  ++ YQA+ F +GP    +        
Sbjct: 818  DPEIHELLVVSL------GHLGRRPILLLRTENDLMIYQAFKFAKGPNLKIR-------F 864

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
            R L  + +   R    +        Y  E     A   R+  F NISG+ G F+ G  P 
Sbjct: 865  RRLPQTLILKERKAKFKVK------YENEVESERA--TRLRYFSNISGYNGVFVCGPNPH 916

Query: 935  WC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
            W  +  R  LR HP L DG + +F   HNVNC  GF+Y TS+  L+IC LP+  +YD  W
Sbjct: 917  WLFLTARGELRSHPMLIDGRVTSFASFHNVNCPLGFLYFTSKCELRICILPTHLSYDAPW 976

Query: 994  PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
            PV+KV PL+ TPH +TY  E   Y LI S    +P N+       ++  H +++ +    
Sbjct: 977  PVRKV-PLRCTPHMVTYHLESKTYCLITSSS--EPSNEYFR-FNGEDKEHSVEDRD---- 1028

Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS--ENALTVRVVTL-FNTTTKENET 1110
            D       +++ + +  P      W+      M+    E+   V+ V L +  T    + 
Sbjct: 1029 DRFPLPLQDKFSIVLFSP----VSWEVIPNTKMELDEWEHVTCVKTVNLSYEGTRSGLKG 1084

Query: 1111 LLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
             +A+GT Y   ED+ ++GR+L++        P   +    +  +++  Q
Sbjct: 1085 YVAVGTNYNYSEDITSKGRILIYDIIEVVPEPGQPLTKNRFKTVYAKEQ 1133


>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Megachile rotundata]
          Length = 1415

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 332/1192 (27%), Positives = 538/1192 (45%), Gaps = 172/1192 (14%)

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            LVV   N+I ++ +    + +K  K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30   LVVAGGNIIRVFRLIPDVDITKREKYTESRPPKM--------KLECLAQYTLHGNVMSMQ 81

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
             ++  G+    +RDS++L+F DAK+SV+E+D  IH LR  S+H FE  E   ++ G  + 
Sbjct: 82   AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
               P+V+VDP+GRC  +L+YG ++++L   +  S   GD             I SS++I 
Sbjct: 135  HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194

Query: 238  LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
            L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + H
Sbjct: 195  LKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254

Query: 296  PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
            P+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  E 
Sbjct: 255  PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308

Query: 356  -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
                   P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K  
Sbjct: 309  STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG-SGTSMLSSGLKEEFGDIEADAPSTKR 466
             SVLTS +    ++  FLGSRLG+SLL++F    S  S   +  +    + E +    K+
Sbjct: 369  ASVLTSCVCMCDDNYLFLGSRLGNSLLLRFIEKESENSQNMNENEITIEENETEETPAKK 428

Query: 467  LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
            +++       +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S 
Sbjct: 429  VKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486

Query: 521  GLRINADASATGISKQSNYELV--------------------------ELPGCKGIWTVY 554
            G         +  S+  + ELV                          ELPGC+ +WTV 
Sbjct: 487  GEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI 545

Query: 555  HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAA 614
                 G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG T+ A
Sbjct: 546  -----GALNNDEQVRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFA 599

Query: 615  GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
            GNL   R ++QV + G R+L G    Q +                 ++  S ADPYV L 
Sbjct: 600  GNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVSLL 649

Query: 675  MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-------KGPE-------- 719
              DG + LL       T  +    A    +  + +   Y D       + PE        
Sbjct: 650  SEDGQVMLLTLREGRGTAKLHAQTANLLFRPQIEALCAYRDVSGIFTTQLPENVEDEVPE 709

Query: 720  --------PWLRKTSTDAWLSTGVGEAID--------GADG----GPLDQGDI------Y 753
                    P +     +  L  G G A           ++G     P  Q  +      Y
Sbjct: 710  EEHNTEEPPIVGNIDNEDDLLYGDGPAFQMPAPSQTKSSEGTSKRAPWWQKHLQEIKPTY 769

Query: 754  SVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
             ++ Y +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E    
Sbjct: 770  WLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTAPVNEIPNPE---- 825

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                     M+V E+ M     H +RP L   L D  +  YQ Y +  P+   K      
Sbjct: 826  ---------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQTYRY--PKGHLK------ 867

Query: 873  TSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
                L    +    +  NLR      D     ET H   C  +  F NI+G+ G F+   
Sbjct: 868  ----LRFKKLDHGIIPGNLRPKPKEEDMSAMNETRH---CM-MRYFSNIAGYNGVFICSD 919

Query: 932  RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             P W  +  R  LR HP   DG I +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 920  YPHWIFLTGRGELRTHPMGIDGPITSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979

Query: 991  NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
              WPV+KV PL+ TPH +TY  E   Y +I S+   +PL            G   +    
Sbjct: 980  APWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSYYRF-----NGEDKEFTEE 1031

Query: 1051 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1108
               D     + E++ + +  P      W+T     I +   E+   ++ V+L    T+  
Sbjct: 1032 DRPDRFIFPSQEQFSIVLFSPVS----WETIPNTKIELDQWEHVTCLKNVSLAYEGTRSG 1087

Query: 1109 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
             +  + +GT Y  GED+ +RGR+L+F        P   +    +  +++  Q
Sbjct: 1088 LKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQ 1139


>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
            cpsf [Tribolium castaneum]
          Length = 1413

 Score =  384 bits (987), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 331/1206 (27%), Positives = 540/1206 (44%), Gaps = 200/1206 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV + ANVI+++ +    +         ET           + LE V  Y L GN+ S+
Sbjct: 29   SLVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              ++   +     RD+++LAF+DAK+SV+E+D   H L+  S+H FE  +   +K G   
Sbjct: 82   QSVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTH 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESS 233
                P+V+ DP+ RC  + V+G ++++L   +  +    D D     G   G  A I +S
Sbjct: 135  HYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILAS 194

Query: 234  HVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++I L+D    + ++ D  F+HGY EP ++IL E   T+AGRV+ +  TC ++A+S++  
Sbjct: 195  YMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQ 254

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             K HP+IWS  NLP D  K + +  P+GG L+   N + Y +QS      +  Y VSL+S
Sbjct: 255  QKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNS 308

Query: 352  SQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
              E        P+    + LD A AT+L++D  +LS K G+L +LT++ D  R V+    
Sbjct: 309  IAENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHF 368

Query: 404  SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADA 461
             K   SVLT+ I+   N+  FLGSRLG+SLL++FT  C    ++            E   
Sbjct: 369  EKAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIE 417

Query: 462  PSTKRLRRSSS----------DALQDMV--------NGEELSLYGSASNNTESAQKTFSF 503
            PS KRL+ S+S          D L D +        + EEL +YG+    +     ++ F
Sbjct: 418  PSAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVF 476

Query: 504  AVRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYEL 541
             V DSL+NIGP             ++FS  L ++ +   T           + K    ++
Sbjct: 477  EVCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQI 536

Query: 542  V---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
            V    LPGC  +WTV+    +                HA+LI+S E  TM+L+T D + E
Sbjct: 537  VTTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINE 580

Query: 599  VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
            + ++  +     T+ AGNL   + ++QV     R+L G    Q +       E G     
Sbjct: 581  I-DNTGFATHIPTVYAGNLGNLKYIVQVTSSAVRLLQGINQLQHIPL-----ELG----- 629

Query: 659  STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG- 717
            S ++ V+  DPY+ L  +DG +  L+   +     +    +  S+  PV++  +Y D   
Sbjct: 630  SPIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSG 689

Query: 718  -------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS-- 754
                         PE ++ ++ T   +     + + G D          P  +  +Y   
Sbjct: 690  LFTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNW 748

Query: 755  --------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
                           V  E+  LEI+ +P+F   + +     G   +VD    E++  S 
Sbjct: 749  WKKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISA 806

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
            +   S++ E   Q R+ ++  + VV L       H SRP L   L +  +  Y+ + F  
Sbjct: 807  STPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF-- 857

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
            P    K          +   NVS       R      D +  +E        ++  F NI
Sbjct: 858  PRGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNI 906

Query: 921  SGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            +G+ G F+ G+ P W  M  R  LR HP   DG +++F   +NVNC  GF+Y   +  L+
Sbjct: 907  AGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELR 966

Query: 980  ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
            I  LP+  +YD  WPV+KV PL+ TPH +TY  E   Y L+ S+   +P N+        
Sbjct: 967  IGVLPTHLSYDAAWPVRKV-PLRCTPHFVTYHLESKTYCLVTSIA--EPSNKYYKF---- 1019

Query: 1040 EVGHQIDNHNLSSVDLHRTYTV---EEYEVRILEPDRAGGPWQT--RATIPMQSSENALT 1094
                  ++  LS  D    +     E++ + +  P      W       I +   E+   
Sbjct: 1020 ----NGEDKELSVEDRGDRFPYPLQEKFSLMLFSP----VSWDVIPNTKIDLDEWEHVNC 1071

Query: 1095 VRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGP 1153
            ++ V+L    T+   +  +A+GT Y  GEDV +RGR+L+F        P   +    +  
Sbjct: 1072 LKNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFDIIEVVPEPGQPLTKNRFKE 1131

Query: 1154 LFSSVQ 1159
            +++  Q
Sbjct: 1132 IYAKDQ 1137


>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
          Length = 1410

 Score =  372 bits (955), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 335/1178 (28%), Positives = 527/1178 (44%), Gaps = 198/1178 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLV    N I +Y +  + +  ++  ++ ETK        +   LE V  Y L GNV S+
Sbjct: 29   NLVTAGVNQIRVYRLVAESKPVEKESHTTETKS-------AKQKLECVADYELCGNVSSI 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  GA     RD+++L FE+AK+S+ ++D     L+  S+H FE  +   L+ G   
Sbjct: 82   ESISLVGA----ARDALLLCFEEAKLSLCDYDPDTDDLKTISLHYFEDAD---LENG--C 132

Query: 177  FARG---PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
              RG     V+VDP+GRC  +L+YG  +I+L   +       D  +  S     + I S+
Sbjct: 133  CQRGLHHSEVRVDPEGRCAVMLIYGTHLIVLPFRKESPSDEIDATSCAS----KSPIMST 188

Query: 234  HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++I+LR LD +  +V D  F+HGY EP ++IL+E   TW  RV+ +  TC I A+S++  
Sbjct: 189  YIIDLRTLDERVTNVVDIQFLHGYYEPTVLILYEPLPTWTCRVAVRKDTCSIVAISLNLQ 248

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             K HP+IWS  NLP+D  +   VP PIGGV+V   N++ Y +QS         Y VSL+S
Sbjct: 249  DKTHPIIWSHSNLPYDCLRTFPVPKPIGGVIVFAVNSLLYLNQS------FPPYGVSLNS 302

Query: 352  SQEL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
                       P+    + LD A A ++ ND  ++S K G+L +LT+V D  R V+   L
Sbjct: 303  LTSFNTEFLLKPQEGVRMSLDCAQAEFIDNDKLVISLKGGELYVLTLVIDSMRAVRSFHL 362

Query: 404  SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
             K   SVLT+ +   G++  FLGSRLG+SLL+++         SS      G+ +     
Sbjct: 363  DKAAASVLTTCMCMCGDNYLFLGSRLGNSLLLRYQ--EKKPEASSSSDASPGEEQRKEKM 420

Query: 464  TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPL------- 515
            T  +    S  +  + + +EL +YG  S   ES   T F F V DS++NIGP        
Sbjct: 421  TLAIGLVGSSDVSKLDDLDELEVYGRDSQAVESEDITQFMFEVCDSIINIGPCGQVEMGE 480

Query: 516  -----KDFSYGLRINADASATG----------ISKQSNYELV---ELPGCKGIWTVYHKS 557
                 ++FS+    + +   T           + +Q   ++V   ELPGC  +WTV    
Sbjct: 481  PAFLSEEFSHQEDPDLELVTTSGYGKNGAISILQRQIRPQVVTTFELPGCTDVWTVLGSP 540

Query: 558  SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
                 +D     +     HA+L++S    +MVLET   + E+  S  +     T+ A N+
Sbjct: 541  DEQQGSDEKLAGS-----HAFLLLSRADSSMVLETGQEIMELDHS-GFCTDAPTVHAANI 594

Query: 618  FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
               R ++QV      +L G    Q L+   S          S V+S S+ADP+VLL   D
Sbjct: 595  GNGRYIVQVGPNAIWLLKGVERIQHLALDVS----------SPVVSCSLADPHVLLLCED 644

Query: 678  GSIRLLV-----GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG---------PEP--- 720
            G +  LV      DP   T+S+ T    + SK  V +  LY D            EP   
Sbjct: 645  GQLLHLVLSVQGDDP---TLSLLTTKLHQKSK--VIAINLYRDTSGLFVVASSESEPSAT 699

Query: 721  --------------------------------------WLRKTSTDAWLSTGVGEAIDGA 742
                                                  W ++ S          E  +GA
Sbjct: 700  TTTEATETTTPQQQTEEGVDDEDDLLYGDSDISAITSTWQKQESEKEEKKEEEEEEAEGA 759

Query: 743  DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
            D  P      ++V+   +G LE++ +P++   F V  F +G   ++D+     L  S   
Sbjct: 760  DIQP----TYWAVIIRATGNLELYSLPDWQLCFLVKNFATGNKLLIDSMQAADLSASFVA 815

Query: 803  INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
               S++E              V E+ +  +  + S+P L A + D  +  Y+ +   G +
Sbjct: 816  PERSTQEVPF-----------VHEVMLHGFGVNGSQPLLMARVHD-ELYIYKVFSHVGSK 863

Query: 863  NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTR-----EETPHGAPCQRITIF 917
                                +  RL+ +RF R       R     E+ P      R   F
Sbjct: 864  --------------------AKGRLQ-VRFKRRSHGLIIRPRDREEKIPENKKWLR--PF 900

Query: 918  KNISGHQGFFLSGSRPCW-CMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
             +ISG+ G F+ GS P W  M  R  LR HP   DG+I  FT  HNVNC  GF+Y +S  
Sbjct: 901  TDISGYSGVFICGSYPHWLIMTQRGTLRGHPMAIDGTIPCFTAFHNVNCPKGFLYFSSNE 960

Query: 977  ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
             L+IC LP+  +YD  WPV+KV PL+ TPH + Y  +   Y ++ S  V  P  Q++ + 
Sbjct: 961  ELRICVLPTHLSYDAPWPVRKV-PLRCTPHFVVYHPDSKTYSVVSSQQV--PCTQLVRVA 1017

Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1096
             D E   +I+   +   D      + ++ +++  P        TR    ++  E+ + ++
Sbjct: 1018 GDGE--KEIE--AVQKDDRFVFPIMNKFNIQLFSPVSWEPIPNTR--FDLEEWEHVMCIK 1071

Query: 1097 VVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLF 1133
             + L +  T    +  + +GT     EDV++RG++ ++
Sbjct: 1072 TINLKSEGTLSGLKGYVVVGTNLNYNEDVSSRGKLTIY 1109


>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1446

 Score =  371 bits (953), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 332/1231 (26%), Positives = 541/1231 (43%), Gaps = 241/1231 (19%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E + ++  S ++K R          LE V  + L GNV S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +   GA+    RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE       R++
Sbjct: 82   ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPEL------RDT 131

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                                                   DE   G G G  +    +++I
Sbjct: 132  LT-------------------------------------DEQELGVGEGPKSSFLPTYII 154

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +   C I A+S++   K 
Sbjct: 155  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQAQCSIVAISLNIMQKV 214

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 215  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 274

Query: 354  ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
               +    + LD + A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 275  LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 334

Query: 413  SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL----KEEFGDIEADAPSTKRLR 468
            + + T+     FLGSRLG+SLL+++T       L  G     KE+  D++        L 
Sbjct: 335  TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEMPLEEGKDKQEKEKDNDMDKQV-YVHTLN 393

Query: 469  RSSSDALQDMVNGE--ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
              S+ +  D    E  E+ +YGS A + T+ A  T+SF V DS++NIGP  + S G    
Sbjct: 394  SFSAHSQHDFFVDEVDEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAF 451

Query: 526  ADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSR 559
                  G + + + E+V                          ELPGC  +WTV     +
Sbjct: 452  LSEEFQG-NPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEVK 510

Query: 560  GHNADSSRMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
                      ++         D + H +LI+S E  TM+L+T   + E+  S  +  QG 
Sbjct: 511  EDKKVPQSPGSFTATHYSLEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGP 569

Query: 611  TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
            T+ AGN+   + +IQV   G R+L+G    + L F P +         S ++  S+ADPY
Sbjct: 570  TVFAGNIGDNKYIIQVSPMGIRLLEG---VKQLHFIPVDL-------GSPIVHCSVADPY 619

Query: 671  VLLGMSDGSI---------------RLLVGDPSTCTVSVQTPAAIESSKKPVS------- 708
            V++  ++G +               RL +  P    +S Q+      + + VS       
Sbjct: 620  VVIMTAEGVVTMFVLKVDSYMGKTHRLALQKPQ---ISTQSRVIALCAYRDVSGMFTTEN 676

Query: 709  --SC---------------TLYHD------KGPEPWLRKTST-------DAWLSTGVGEA 738
              SC               T+ HD         E  L   S+       +  + + V   
Sbjct: 677  KVSCAIAEDFNIRSQSETETVIHDLSSNIVDDEEEMLYGDSSSNAGPSKEEMIRSFVAPG 736

Query: 739  IDGADGGPLD-QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALK 797
               ++GGP   +   + +V  ESG +EI+ +P++  VF V  F  G+  +VD+   ++  
Sbjct: 737  PSVSEGGPSKAEPSHWCLVTRESGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSAT 796

Query: 798  DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYL 857
              + E     EE T QG    +  + +V L       +HSRP+L  +  +  +L Y+A+ 
Sbjct: 797  QGDKE--GKKEEMTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVEQELLVYEAFP 848

Query: 858  F--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--------PH 907
            +  + P+N  K                       +RF + P +   RE+           
Sbjct: 849  YDQQQPQNNLK-----------------------VRFKKVPHNINFREKKSKLRKDKKAE 885

Query: 908  GAPCQ----------RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVA 956
            GA  +          R   F++ISG+ G F+ G  P W +V  R  LR+HP   DG I +
Sbjct: 886  GAAAEDGVAARGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMTIDGPIES 945

Query: 957  FTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN- 1015
            F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+K IPL+ T H ++Y  E   
Sbjct: 946  FSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRK-IPLRCTVHYVSYHVESKA 1004

Query: 1016 ------LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
                  +Y +  SV       + L   I +  G + +   +   + +     +++ ++++
Sbjct: 1005 SLSHCCVYAVCTSV-------KELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLI 1057

Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARG 1128
             P        TR  I ++  E    ++ V L +  T    +  +A GT  +QGE+V  RG
Sbjct: 1058 SPVSWEAIPNTR--IDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVTCRG 1115

Query: 1129 RVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            R+L+         P   +    +  L+   Q
Sbjct: 1116 RILILDVIEVVPEPGQPLTKNKFKVLYEKEQ 1146


>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
 gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
          Length = 1450

 Score =  368 bits (944), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 330/1193 (27%), Positives = 542/1193 (45%), Gaps = 182/1193 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E   +E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430  RNLQDEDQSLE-EILDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCLN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 649  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+   +G T + 
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGATVLT 828

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D    E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 829  DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R L   N+   +  ++       D     E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKLDXXNLLDQQPTHIELDEN--DEQEEIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y  E  +Y LI     
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSIWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE- 1054

Query: 1026 LKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--R 1081
             +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    
Sbjct: 1055 -EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPD 1102

Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++
Sbjct: 1103 ASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 1155


>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
 gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
          Length = 1455

 Score =  365 bits (938), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 327/1193 (27%), Positives = 543/1193 (45%), Gaps = 182/1193 (15%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E G ++  N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E  ++E +     +L  + + A    +  EEL +YG+ +  +    + F F V D
Sbjct: 430  RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGTGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPINYMCAGERVEFEEDGATLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 649  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGNAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             +V  +SG LEI+ +P+   V+ V+   +G   + 
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLIVARQSGTLEIYSMPDMKLVYLVNDVGNGAMVLT 828

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D    E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 829  DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R L   N+   +  ++       DA    E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DAQEEIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y  E  +Y LI     
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSTWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE- 1054

Query: 1026 LKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--R 1081
             +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    
Sbjct: 1055 -EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPD 1102

Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++
Sbjct: 1103 ASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 1155


>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
            1, partial [Ciona intestinalis]
          Length = 1370

 Score =  364 bits (935), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 333/1259 (26%), Positives = 548/1259 (43%), Gaps = 207/1259 (16%)

Query: 3    FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
            +A Y+ +H PTG+  C        +                             NL+VTA
Sbjct: 2    YAWYRQIHAPTGVEQCVYCNFASEKEK---------------------------NLLVTA 34

Query: 63   ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
            A+ + +Y +    E + +++N  E       + +    L+ +  ++L GNV  +  +   
Sbjct: 35   ASQLTVYRLERNYEVTTKTENGEE-------NTVVKEKLQQIGSWQLFGNVVRMRSVRLA 87

Query: 123  GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
            GA    + DS++L+F +AK+S++EFD + H ++ TS+H FE   +   K G       P 
Sbjct: 88   GA----KLDSVLLSFAEAKLSIIEFDQATHDIKTTSLHYFEDALY---KDGSYQRITLPK 140

Query: 183  VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF--SARIESSHVINLRD 240
            + VDP+ RC  + +    + ++      + L  D+           + R  +S+ I+L  
Sbjct: 141  IAVDPESRCVALQLTTKSVAVVPLRANTAALATDDGAAPQDNVSLQNKRSTTSYTIDLHA 200

Query: 241  LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
            +D  ++ + D  F+HGY EP +++L E   TWAGRV+ +  TC I A+S++   + HP++
Sbjct: 201  VDARLQRIIDIQFLHGYNEPTLLVLFESLRTWAGRVAMRQDTCNIVAISLNMAEQLHPVV 260

Query: 299  WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE---- 354
            WS   LP D      VP PIGGVL+   N+I Y +QS         Y  SL+S+ E    
Sbjct: 261  WSLNGLPFDCKYAYPVPKPIGGVLIFAVNSILYLNQSVP------PYGTSLNSTTENSTS 314

Query: 355  ---LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
                P+    + LD +HA ++  +  ++S K G+L +LT++ D  R V+     K+  SV
Sbjct: 315  FPLKPQEDVCMTLDCSHAMFISPESLVISLKNGELYVLTLLVDSMRSVRNFHFDKSASSV 374

Query: 411  LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
            LTS +T + +   FLGSRLG+SLL+++T        +  +       E  A   KRL  +
Sbjct: 375  LTSCLTVLDDGFLFLGSRLGNSLLLKYT-------EARPVFRNCYHTEEPAAKRKRLNTA 427

Query: 471  SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
            +  A  D  N  +L +YG  +  +E    ++ F V DSLVNIGP      G    A  S 
Sbjct: 428  ADWAASD-TNDIDLQMYGKDTVTSEPL-SSYKFEVCDSLVNIGPCGAAELGEP--AFLSE 483

Query: 531  TGIS-KQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNA 563
              +S ++S+ EL                           ELPGC  +WTV     +    
Sbjct: 484  EFVSQRESDLELAILSGHGKNGAISVLQRSVKPQVVTTFELPGCIDMWTVKSVCEKTELP 543

Query: 564  DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
              ++      + H+YLI+S E  T++LET   + EV E+  +  + +++  GN+ G + +
Sbjct: 544  TKTQ-----QQQHSYLILSREESTLILETGKEIMEV-ENSGFNTREQSVFVGNIGGDKEL 597

Query: 624  I-QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
            I QV   G  +L G  + Q +       E G     S +   SI DPY LL  SDG + +
Sbjct: 598  ILQVCASGVWLLAGVKLLQHIPL-----ELG-----SPITQCSICDPYALLLTSDGDLIM 647

Query: 683  LV----------GDPSTCTVSVQTPAAIE--------------------------SSKK- 705
            L                C  S+     IE                          SS K 
Sbjct: 648  LTLTNDLDSENGVKLECCNPSINQVPQIEHVCLYKDTSGLFKTASGPSDVFLPEDSSNKG 707

Query: 706  -------------PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGA--DGGPLDQG 750
                         P+SS T   D+  E    ++  D   +      +  +     PLD G
Sbjct: 708  VSDSEIPSSLPRTPLSSKTFTVDEEDELLYGESDPDVIFAPQFAPNVPKSPTQNEPLD-G 766

Query: 751  D--------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
            D         ++++  E+  LEI+ +P+ + V+T+  F  G+  + ++    +   S+ +
Sbjct: 767  DKEGNEEFTFWAIIARENRNLEIYSMPSLDLVYTIKNFSFGQKLLTNSGPVHSYSVSKDD 826

Query: 803  INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
             ++S    T    K  I  + +V L  +     +S P L A + +  IL Y+ + F  PE
Sbjct: 827  KSTS----TRYSDKPRIFEILLVGLGYK-----NSSPHLIARIEE-EILIYEVFKFSAPE 876

Query: 863  NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG 922
               K +     S  +    V+ S    +   R P+   T+ +      C R   F NI G
Sbjct: 877  KFKKYN-----SLQIRFKKVNHS----MMIRRAPVTHETKTDQLEHRNCLR--TFSNIGG 925

Query: 923  HQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G FL G  P W  V  R  L  HP   DGS+  F   HNVNC +GF+Y  SQG L+IC
Sbjct: 926  YSGVFLCGPYPYWIFVTIRGALCCHPMSVDGSVSCFVPFHNVNCPNGFLYFNSQGELRIC 985

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
             LP    YD  WP++K I L+ + H + Y  E  +Y L+ SV   +P  ++  L  + E 
Sbjct: 986  MLPPHMKYDTAWPMRK-ITLRCSVHFLAYSIEHKVYALVTSVS--EPCTRLPYLTFENER 1042

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1100
              +    +L   D      ++++ V+++ P  A       A + M   E+   ++ V L 
Sbjct: 1043 EFE----DLEKGDRFIYPHIDKFSVQLISP--ASWDLVPNARLDMGEFEHITCMKNVWLS 1096

Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
                +   +  L +GT  V GE++++RG++++         P   +       ++S  Q
Sbjct: 1097 CGQDSSARQNFLVLGTVNVFGEEMSSRGKIIILEVIEVVPEPGQPLTKNKLKQIYSEEQ 1155


>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
 gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
          Length = 1455

 Score =  364 bits (935), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 332/1193 (27%), Positives = 542/1193 (45%), Gaps = 182/1193 (15%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E G ++  N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E   I  +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430  RNLQDE-EQIMEEIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPVNYMCAGERVEFEEDGATLRPHAESLQDVKIELVAATGHSKNGALSVFVNCIN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++       E G  
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-----EVG-- 651

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 652  ---SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+    G   IV
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV--GNGAIV 826

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
             T   E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 827  LTDAMEFVPISLTTQENSKAGIVQACMPQHANSPLPLELSLTGLGLNGERPLLM-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R L   N+   +  ++       D     E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEDIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y  E  +Y LI     
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSTWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE- 1054

Query: 1026 LKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--R 1081
             +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    
Sbjct: 1055 -EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPD 1102

Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++
Sbjct: 1103 ASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 1155


>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
 gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
            Short=dCPSF 160
 gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
          Length = 1455

 Score =  363 bits (933), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 328/1193 (27%), Positives = 542/1193 (45%), Gaps = 182/1193 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E  ++E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430  RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 649  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+   +G   + 
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 828

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D    E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 829  DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R +   N+   +  ++       D     E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y  E  +Y LI     
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE- 1054

Query: 1026 LKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--R 1081
             +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    
Sbjct: 1055 -EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPD 1102

Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++
Sbjct: 1103 ASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 1155


>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
 gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
          Length = 1455

 Score =  363 bits (932), Expect = 3e-97,   Method: Compositional matrix adjust.
 Identities = 328/1198 (27%), Positives = 539/1198 (44%), Gaps = 192/1198 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++        D+E 
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVIT------LDDVEQ 423

Query: 460  DAPSTKRLRRSSSDALQDM-----------------VNGEELSLYGSASNNTESAQKTFS 502
             +   +R  +     L+++                 +  EEL +YGS +  +    + F 
Sbjct: 424  QSEQQQRNLQDEDQNLEEIFDVDQVEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFI 483

Query: 503  FAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS----- 537
            F V DSL+N+ P+     G R+  +                     +ATG SK       
Sbjct: 484  FEVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVF 543

Query: 538  ----NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
                N +++   EL GC  +WTV+         D+++ ++ +D+ H ++ +S    T+VL
Sbjct: 544  VNCLNPQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMFLSQRNSTLVL 594

Query: 591  ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
            +T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++       
Sbjct: 595  QTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----- 648

Query: 651  ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSC 710
                    S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + 
Sbjct: 649  -----DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAI 703

Query: 711  TLYHD-------KG----------------------PEPWLRKTSTDAWLSTGVGEAI-- 739
            + Y D       KG                       EP ++    +  L    G A   
Sbjct: 704  SAYKDLSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKM 763

Query: 740  ----DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
                D A        D +             VV  +SG LEI+ +P+   V+ V+   +G
Sbjct: 764  NSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNG 823

Query: 784  RTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
               + D    E +  S T   +S          ++ +S   +EL++     +  RP L  
Sbjct: 824  AMVLTDAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL- 880

Query: 844  ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
            + T   +L YQ  +F  P+   K        R L   N+   +  ++       D     
Sbjct: 881  VRTRVELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEEI 931

Query: 904  ETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVL 960
            E+    P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   
Sbjct: 932  ESYQMQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAF 991

Query: 961  HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI 1020
            +NVN  +GF+Y  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y  E  +Y LI
Sbjct: 992  NNVNIPNGFLYFDTTYELKISVLPSYLSYDSIWPVRKV-PLRCTPRQLVYHRENRVYCLI 1050

Query: 1021 VSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPW 1078
                  +P+ +       D+E+  +              Y +  ++E+ ++ P+     W
Sbjct: 1051 TQTE--EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----W 1097

Query: 1079 QT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            +    A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++
Sbjct: 1098 EIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 1155


>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
          Length = 1392

 Score =  361 bits (927), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 323/1206 (26%), Positives = 530/1206 (43%), Gaps = 221/1206 (18%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV + ANVI+++ +    +         ET           + LE V  Y L GN+ S+
Sbjct: 29   SLVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              ++   +     RD+++LAF+DAK+SV+E+D   H L+  S+H FE  +   +K G   
Sbjct: 82   QSVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTH 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESS 233
                P+V+ DP+ RC  + V+G ++++L   +  +    D D     G   G  A I +S
Sbjct: 135  HYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILAS 194

Query: 234  HVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++I L+D    + ++ D  F+HGY EP ++IL E   T+AGRV+ +  TC ++A+S++  
Sbjct: 195  YMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQ 254

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             K HP+IWS  NLP D  K + +  P+GG L+   N + Y +QS      +  Y VSL+S
Sbjct: 255  QKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNS 308

Query: 352  SQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
              E        P+    + LD A AT+L++D  +LS K G+L +LT++ D  R V+    
Sbjct: 309  IAENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHF 368

Query: 404  SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADA 461
             K   SVLT+ I+   N+  FLGSRLG+SLL++FT  C    ++            E   
Sbjct: 369  EKAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIE 417

Query: 462  PSTKRLRRSSS----------DALQDMV--------NGEELSLYGSASNNTESAQKTFSF 503
            PS KRL+ S+S          D L D +        + EEL +YG+    +     ++ F
Sbjct: 418  PSAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVF 476

Query: 504  AVRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYEL 541
             V DSL+NIGP             ++FS  L ++ +   T           + K    ++
Sbjct: 477  EVCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQI 536

Query: 542  V---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
            V    LPGC  +WTV+    +                HA+LI+S E  TM+L+T D + E
Sbjct: 537  VTTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINE 580

Query: 599  VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
            + ++  +     T+ AG                 I    ++  +L               
Sbjct: 581  I-DNTGFATHIPTVYAG-----------------INQLQHIPLELG-------------- 608

Query: 659  STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG- 717
            S ++ V+  DPY+ L  +DG +  L+   +     +    +  S+  PV++  +Y D   
Sbjct: 609  SPIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSG 668

Query: 718  -------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS-- 754
                         PE ++ ++ T   +     + + G D          P  +  +Y   
Sbjct: 669  LFTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNW 727

Query: 755  --------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
                           V  E+  LEI+ +P+F   + +     G   +VD    E++  S 
Sbjct: 728  WKKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISA 785

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
            +   S++ E   Q R+ ++  + VV L       H SRP L   L +  +  Y+ + F  
Sbjct: 786  STPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF-- 836

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
            P    K          +   NVS       R      D +  +E        ++  F NI
Sbjct: 837  PRGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNI 885

Query: 921  SGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            +G+ G F+ G+ P W  M  R  LR HP   DG +++F   +NVNC  GF+Y   +  L+
Sbjct: 886  AGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELR 945

Query: 980  ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
            I  LP+  +YD  WPV+KV PL+ TPH +TY  E   Y L+ S+   +P N+        
Sbjct: 946  IGVLPTHLSYDAAWPVRKV-PLRCTPHFVTYHLESKTYCLVTSIA--EPSNKYYKF---- 998

Query: 1040 EVGHQIDNHNLSSVDLHRTYTV---EEYEVRILEPDRAGGPWQT--RATIPMQSSENALT 1094
                  ++  LS  D    +     E++ + +  P      W       I +   E+   
Sbjct: 999  ----NGEDKELSVEDRGDRFPYPLQEKFSLMLFSP----VSWDVIPNTKIDLDEWEHVNC 1050

Query: 1095 VRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGP 1153
            ++ V+L    T+   +  +A+GT Y  GEDV +RGR+L+F        P   +    +  
Sbjct: 1051 LKNVSLAYEGTRSGLKGYIAVGTNYNYGEDVTSRGRILIFDIIEVVPEPGQPLTKNRFKE 1110

Query: 1154 LFSSVQ 1159
            +++  Q
Sbjct: 1111 IYAKDQ 1116


>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
 gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
          Length = 1463

 Score =  354 bits (909), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 325/1205 (26%), Positives = 531/1205 (44%), Gaps = 198/1205 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSE------MRVAPKMRLECLATYSLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA--GAMRDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 137

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +L+YG ++++L   +  S     L   +    +      R  
Sbjct: 138  GRYYVPEVRVDPDARCAVMLIYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVTRTP 197

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T AGR+  +  TC++ A+S
Sbjct: 198  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDTCVLVAIS 257

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  NLP D  +LL +  PIGG LV+  N + Y +QS         Y V
Sbjct: 258  LNIQQRVHPIIWTVNNLPFDCLRLLPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 311

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 312  SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVVSLRTGDLYVLTLCVDSMRTVR 371

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++            
Sbjct: 372  NFHFHKAASSVLTSCICVCHMEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQQQQQA 431

Query: 448  ----SGLKEEFGDIEAD-------APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES 496
                S   E  G ++ D       APS  + RR         +  EEL +YG+ +  +  
Sbjct: 432  AEEPSEEAEIEGILDMDQLEAATSAPSQAKSRR---------IEDEELEVYGTGAKASVL 482

Query: 497  AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQ 536
              + F F V DSL+N+ P+     G R+  +                     +ATG SK 
Sbjct: 483  QLRKFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLQDVKIELVAATGHSKN 542

Query: 537  S---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
                      N +++   EL GC  +WTV+         D+++  +  D+ H ++++S +
Sbjct: 543  GALSVFVNCINPQIITSFELEGCLDVWTVFD--------DATKKTSRQDQ-HDFMLLSQK 593

Query: 585  ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
              T+VL+T   + E+ E+  + V   TI  GNL   R ++QV  R  R+L G+ + Q++ 
Sbjct: 594  NSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQNRFIVQVTTRHVRLLQGTRLVQNVP 652

Query: 645  FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
                          S V+ V+IADPYV L + +G +  L    S  T  +       SS 
Sbjct: 653  I----------DVGSPVVQVAIADPYVCLRVFNGQVITLALRESRGTPRLAINKHTISSS 702

Query: 705  KPVSSCTLYHD-------------------------------KGPEPWLRKTSTDAWLST 733
              V +   Y D                                  EP ++    +  L  
Sbjct: 703  PAVVAIAAYKDLSGLFTVKSDDILNLTGSGSNSAFGSTFGGYMKSEPHMKVEDEEDLLYG 762

Query: 734  GVGEAI------DGADGGPLDQGDIYS-------------VVCYESGALEIFDVPNFNCV 774
              G A       D A        D +              VV  +SG LEI+ +P+   V
Sbjct: 763  DAGNAFKMNTMADLAKQSKQKNSDWWRRMLVQAAKPTYWLVVARQSGTLEIYSMPDMKLV 822

Query: 775  FTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSA 834
            + V+   +G   + D    E +  S T   +S          ++ +S   +EL++     
Sbjct: 823  YLVNDVGNGAMVLTDAM--EFVPISLTSQENSKAGIVQSCMPQHANSPLPLELSLVGLGL 880

Query: 835  HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
            +  RP L  + T   +L YQ  +F  P+   K        R +   N+   +  ++    
Sbjct: 881  NGERPLLL-VRTRLELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHVNLDD 932

Query: 895  TPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
               +             Q++  F N+ G  G  + G  PC+  +  R  LR+H  L +G 
Sbjct: 933  NEENEELESYNMQPKYVQKLRPFNNVGGMSGVMICGVNPCFLFLTSRGELRIHRLLGNGE 992

Query: 954  IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAE 1013
            + +F   +N+N  +GF++  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y  E
Sbjct: 993  VRSFAAFNNINIPNGFLFFDTTFELKISVLPSYLSYDSTWPVRKV-PLRCTPRQLVYHRE 1051

Query: 1014 KNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEP 1071
              +Y LI      +P+ +       D+E+  +              Y +  ++++ ++ P
Sbjct: 1052 NRVYCLITQTE--EPMTKFYRFNGEDKELSEESRGERF-------IYPIGSQFDMVLISP 1102

Query: 1072 DRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARG 1128
            +     W+    A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG
Sbjct: 1103 E----TWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRG 1158

Query: 1129 RVLLF 1133
             + ++
Sbjct: 1159 NIHIY 1163


>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
 gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
          Length = 1455

 Score =  354 bits (909), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 324/1201 (26%), Positives = 534/1201 (44%), Gaps = 198/1201 (16%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E G ++  N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEAGQRQKLNPTE------MRVAPKMRLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  +     L   +    +      R  
Sbjct: 136  GRYFVPVVRVDPDSRCAVMLVYGKRLVVLPFRKDNTLDEIELADVKPIKKAPTAMVTRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCQQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK-------- 451
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVDQQADQQL 429

Query: 452  ----------EEFGDIEA--DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                      +E  D++    AP+  + RR         +  EEL +YGS +  +    +
Sbjct: 430  QRQQSEDQTLDEILDVDQLELAPTQAKSRR---------IEDEELEVYGSGAKASVLQLR 480

Query: 500  TFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS-- 537
             F F V DSL+N+ P+     G R+  +                     +ATG SK    
Sbjct: 481  KFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLNDLKIELVAATGHSKNGAL 540

Query: 538  -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                   N +++   EL GC  +WTV+         D+++  +  D+ H ++++S    T
Sbjct: 541  SVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLLSQRNST 591

Query: 588  MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
            +VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++    
Sbjct: 592  LVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 648

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                       S V+ VSIADPYV L + +G +  L    +  T  +       SS   V
Sbjct: 649  --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 700

Query: 708  SSCTLYHD-----------------------------KGPEPWLRKTSTDAWLSTGVGEA 738
             + + Y D                                EP ++    +  L    G A
Sbjct: 701  VAISAYKDLSGLFTVKADDVNLTGSSSSAFGHSFGGYMKAEPHMKVEDEEDLLYGDAGNA 760

Query: 739  I------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKF 780
                   D A        D +             VV  +SG LEI+ +P+   V+ V+  
Sbjct: 761  FKMNSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV 820

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
             +G   + D    E +  S T   +S          ++ +S   +EL +     +  RP 
Sbjct: 821  GNGAMVLTDAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELTVLGLGLNGERPL 878

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            L  + T   +L YQ  +F  P+   K        R L   N+   +  ++       D  
Sbjct: 879  LL-VRTRVELLIYQ--VFRYPKGHLK-----IRFRKLEQLNLMDHQPSHIELDEN--DER 928

Query: 901  TREETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAF 957
               E+    P   Q++  F N+ G  G  + G  PC+  +  R  LR+H  L +G + +F
Sbjct: 929  EEMESYQMQPKYVQKLRPFANVGGLSGIMVCGVNPCFVFLTSRGELRIHRLLGNGDVRSF 988

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
               +NVN  +GF+Y  +   LKI  LPS  +YD+ WP++KV PL+ TP Q+ Y  E  +Y
Sbjct: 989  AAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSTWPIRKV-PLRCTPRQLVYHRENRVY 1047

Query: 1018 PLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAG 1075
             LI      +P+ +       D+E+  +              Y +  ++E+ ++ P+   
Sbjct: 1048 CLITQNE--EPMTKFYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET-- 1096

Query: 1076 GPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
              W+    A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + +
Sbjct: 1097 --WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHI 1154

Query: 1133 F 1133
            +
Sbjct: 1155 Y 1155


>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
 gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
          Length = 1373

 Score =  348 bits (892), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 320/1258 (25%), Positives = 535/1258 (42%), Gaps = 251/1258 (19%)

Query: 3    FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
            FA YK +H PT ++ C     T +  +                           NL++  
Sbjct: 2    FACYKQLHPPTAVSFCLKARFTSANDE---------------------------NLIIVK 34

Query: 63   ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL-AILSQ 121
             N++E+Y+++                        + +++ LV  + L G ++S+ A+  Q
Sbjct: 35   NNIMEVYLIKP-----------------------NTSNIVLVKVFELFGVIDSIIAVCLQ 71

Query: 122  GGADNSRRRDSIILAFED-AKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
            G      +++ +++ FED AK+SV+EFD+    L+  S+H  E      L+ G+  F   
Sbjct: 72   G-----MKKEMLLINFEDEAKVSVVEFDEKRSDLKTLSLHYLEDD---FLREGKARFFHN 123

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------GLVGDEDTFGSGGGFSA 228
              + +DPQ R   V++   +++IL   Q G              L GD++      G   
Sbjct: 124  QPIILDPQNRFATVIICDSKLVILPFRQSGEDVSLSTEDNFLFALSGDQEEANENVGDQK 183

Query: 229  R-----IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
            +     ++   +I+L DL +K+VKD+ F++GY EP ++ LHE E TW+GR++ K +T  +
Sbjct: 184  KHHQPEVQRQVIIDLNDLGIKNVKDYCFLNGYNEPTILFLHENEQTWSGRLAAKSNTSTV 243

Query: 284  SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-GGVLVVGANTIHYHSQSASCALAL 342
            +A+S     K +P IWS  +LPHD  KL+ +   + GG LV+G N+I + +Q A+  L+ 
Sbjct: 244  TAVSFDLFRKYYPKIWSVGSLPHDCNKLIPLQEDVAGGALVIGMNSIIHINQCATYGLSF 303

Query: 343  NNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
            N++AVS + +  +  ++F   ++  D    T++  D  L+S K G+L  + +   G  + 
Sbjct: 304  NDFAVS-NPNLSINFNTFDGPALFFDTVAYTFIARDKLLVSLKDGELYTMYLESGGSRIN 362

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
             +++ KT+ +   S + T+  +L FLGS++GDS+L ++         S    EE   + A
Sbjct: 363  NINIKKTSNTTPASCMCTLKGNLIFLGSKIGDSVLYEYQEKVEVETSSLDTDEEMSSVFA 422

Query: 460  -----DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
                 +    KR      D    +   EE ++  S S  ++         ++    NIGP
Sbjct: 423  AGENFEPEKKKRKLADDDDFFAALEKDEEPTVIESFSKVSKKETTKVELKIKHVFTNIGP 482

Query: 515  LKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTVYHKSSRGHNADSSRMA 569
            +   +  +  + D S  G   ++N   +    C GI      TV ++S +      + + 
Sbjct: 483  ISHLTAAVTSSFDMS--GFKSKTNDNQLSAIACSGIGRHGCLTVLNRSLQPDIQSEATLP 540

Query: 570  ---------AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
                     +   E+  YLI+SLE +T V E+   L EVT    +     T+  G +  R
Sbjct: 541  FLVKQVWTISQKTEHDLYLILSLEDKTKVFESKATLAEVTSKSMFVTNETTLNIGKI--R 598

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
              ++QV  R + +L GS         P           S++    I DPYVLL   DGS+
Sbjct: 599  ESIVQV-TRKSVMLIGS--------EPKQVHHSKKEIRSSI----ILDPYVLLHFYDGSL 645

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
             LL  D    T        IES+   +++  LY    PE          +   G+ E   
Sbjct: 646  VLLTHDNGRVT---SKQLDIESNHGKITAVCLYK-TNPE----------FEFFGINEK-- 689

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
                    +G     V +  GA EI  VP+  CVF+  +F    T + D           
Sbjct: 690  --------EGKYLCCVYWTDGAFEILSVPDMTCVFSFSQFYQFHTTLFD----------- 730

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
                   E  +    +  +    V E+A++   +    P+L ++L+D T+  Y+++L   
Sbjct: 731  -------EGQSSNTTQSEVKYPYVTEMALRGIGSDSEMPYLVSVLSDNTVHIYRSFL--- 780

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR------TPLDAYTREETPHGAPCQ-- 912
             + T+KS D               +RL  LRFS+       P+    ++        +  
Sbjct: 781  -DRTTKSKD---------------NRLTRLRFSKFQHDDLLPISEIDKKSQTFTLNLKSK 824

Query: 913  -----------RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 961
                       ++  FKNI G+ G F +G +P W       LRVHP      +  FT  H
Sbjct: 825  YLFPKSDLGRSQLIPFKNIGGYGGLFKTGEKPFWLFTEHSNLRVHPTQSRDPVTTFTPYH 884

Query: 962  NVNCNHGFIYVT-------SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEK 1014
            + NC HGFIY+T        Q  L I  L +   ++ YWP +K++ LK+TP+ IT+  + 
Sbjct: 885  HENCPHGFIYLTDKEQDNKKQSKLHISSLNANVKFNAYWPQRKIL-LKSTPNVITFHQDT 943

Query: 1015 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1074
            N      SVPV      +L   I    G              + +TV+ +         +
Sbjct: 944  NTCLAFTSVPV----KAILPDSIPFPEGK-------CPPPAEQKHTVKLF---------S 983

Query: 1075 GGPWQTRATIPMQSSENALTVRVVTL----------------FNTTTKENETLLAIGTAY 1118
            G  WQ          E+A+  +VV L                 N+  ++  +++A+GTAY
Sbjct: 984  GHNWQEMDKFEFDLHESAVAAKVVYLSKEEYNDDTDISFEEPLNSRKQDLVSVVAVGTAY 1043

Query: 1119 VQGEDVAARGRVLLFS----TGRNADNPQNLVLSGSY-GPLFSSVQIDFASHFFAICS 1171
            VQ E    RGR+LLF      GR  +   NL+ S S  GP+ +  Q+D     + ICS
Sbjct: 1044 VQSERELCRGRLLLFDLDPILGRENEYKLNLISSTSVKGPITTLEQVD----RYIICS 1097


>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
          Length = 1477

 Score =  347 bits (891), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 256/898 (28%), Positives = 429/898 (47%), Gaps = 169/898 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGI----------------SAAS 100
           NL+V   NV+E+Y +   E+      +   T+     DG+                +  S
Sbjct: 30  NLIVAKTNVLEVYALHRHEDSKARPIDRQSTRP---TDGVISLRGEEPKDAPPYAGTQHS 86

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           + LV    L GN+ES+A +   G      +D+++L+F DAKISVLEFD + + LR  S+H
Sbjct: 87  MRLVLSSSLFGNIESMAAVRFPGTS----KDALLLSFRDAKISVLEFDIATNDLRTISLH 142

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE      +K G + +   P ++VDPQ RC  +L +  ++++L   Q  S +       
Sbjct: 143 YFED---YKVKEGHDHYIHVPELRVDPQQRCAAMLAFDRKLVVLPFRQHASLM-----EI 194

Query: 221 GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
            +GG     ++ S +++LR + + +VKDF+F+ GY EP ++IL+E   TW+GRV+   +T
Sbjct: 195 ENGGQEDQPVKPSFLLDLRAMGIINVKDFVFLQGYYEPTLLILYEPTQTWSGRVAVNRNT 254

Query: 281 CMISALSIST-----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI------ 329
           C+ +A+S++          HP++WSA  LP+D  +L+AVP PIGG L +  N++      
Sbjct: 255 CVAAAVSLNLWQHRGQTSAHPVVWSAEFLPYDTQRLIAVPGPIGGALALSTNSLLYLNQV 314

Query: 330 -------------------HYHSQSASCALALNNYA-VSLDSSQELP---RSSFSVELDA 366
                              H+ +Q+++  L LN +A + L      P   ++   + LDA
Sbjct: 315 SFPYRLILPAHGADVSITSHHDTQASASCLPLNVFADLYLSPQTPFPSAGKNRVGIALDA 374

Query: 367 AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNS----- 421
           A   +L +D  L+S K G+L +  ++ DGR V  + L+K   SV+TS + T+        
Sbjct: 375 ARDVFLADDQLLVSLKGGELYIFHLLSDGRTVNDIQLTKAGSSVITSCMATLSGEGADER 434

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE--FGDIEADAPSTKRLRRSSSDA----- 474
             FLGSR+GDSLL+Q+T    ++   +G  +   F DI+ +  +         +A     
Sbjct: 435 FLFLGSRVGDSLLLQYTTADASAPKQNGATKGSLFDDIKKEEDNDDDDEDEEEEASGEGE 494

Query: 475 LQDMVNGE-ELSLYGSASNNTESAQK-----TFSFAVRDSLVNIGPLKDFSYGLRIN-AD 527
           +++  +GE E+  +G      +  +K     T+ F V DSLVN+GP+ DF+ G   + A 
Sbjct: 495 VKEEPDGEGEVDEFGRRIREEDRRKKKGLLTTYKFKVCDSLVNVGPITDFAIGESFDPAS 554

Query: 528 ASATGISKQSNYELV---------------------------ELPGCKGIWTVYHKSSRG 560
            S      Q + E+V                           +L GCK  WT+YH+S   
Sbjct: 555 VSMAEQEGQRSVEIVTCSGQGKNGSLCVLQHGVRPELVHASADLAGCKAFWTLYHRSEER 614

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT-ESVDYFVQGRTIAAGNLFG 619
              ++        EYHAYL++S E +T V+   D L E++ E  D+ V   T+ AGNLF 
Sbjct: 615 QGEEA--------EYHAYLLLSEEEQTRVI-AGDGLDELSNEETDFNVAAPTVDAGNLFE 665

Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
           + R++QV + G  +LDG   TQ +            S    + + SIADPYVL+ M+DG+
Sbjct: 666 QTRIVQVHQHGLILLDGVKATQRI------------STPGQIAAASIADPYVLVLMADGA 713

Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
           +RL   DP++  + VQT        + + +  L++                     G A+
Sbjct: 714 LRLYFADPTSSKL-VQTSLQNIHEVRDIMAMHLFY---------------------GGAM 751

Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
            G      D+  I++ +  ++G L+I+ VP F+ VF+ ++  +G   I +  MR   + +
Sbjct: 752 RGKKARTNDE--IFAAIAKDNGRLDIYSVPEFDLVFSAERAANGPRLINNVLMRPPPQSA 809

Query: 800 ETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYL 857
             + ++ +             S ++ E+A+       S P LF  L +G +L Y+ +L
Sbjct: 810 AAQQSADTT------------SARIAEIALHSIGNIPSLPHLFLYLDNGELLLYRGFL 855



 Score = 87.0 bits (214), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 42/108 (38%), Positives = 55/108 (50%), Gaps = 2/108 (1%)

Query: 912  QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
            +RI  F  +    G F+SGS P W    R   R++P   D  + AF   HN NC HGFIY
Sbjct: 967  RRIHYFGTVGKSNGVFISGSAPAWVFAQRGYARLYPMKLDTFVRAFAEFHNANCPHGFIY 1026

Query: 972  VTSQGILKICQLPS--GSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
               +G LKICQLP+  G+ +     V + +PL  TP +I Y      Y
Sbjct: 1027 FNHEGTLKICQLPAAEGAIHWELPGVVRKVPLGRTPREIAYHPPSRTY 1074


>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
 gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
          Length = 1459

 Score =  344 bits (883), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 323/1211 (26%), Positives = 536/1211 (44%), Gaps = 214/1211 (17%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  AN++++Y +    E G ++  N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     +D+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
                 P+V+VDP  RC  +LVYG ++++L   +  S    DE        F         
Sbjct: 136  GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192

Query: 230  ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
               I +S++I L++LD K  +V D  F+HGY EP ++IL+E   T +GR+  +  TC++ 
Sbjct: 193  RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCSGRIKVRSDTCVLV 252

Query: 285  ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
            A+S++   + HP+IW+  +LP D +++  +  PIGG LV+  N + Y +QS         
Sbjct: 253  AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306

Query: 345  YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
            Y VSL+SS +        P+    + LD A+  ++  D  ++S +TG+L +LT+  D  R
Sbjct: 307  YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366

Query: 397  VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
             V+     K   SVLTS I    +   FLGSRLG+SLL+ FT    +++++  +      
Sbjct: 367  TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426

Query: 452  ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
                            EE  D++    AP   + RR         +  EEL +YGS +  
Sbjct: 427  EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477

Query: 494  TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
            +    + F F V DSL+N+ P+     G R+  +                     +ATG 
Sbjct: 478  SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537

Query: 534  SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
            SK           N +++   EL GC  +WTV+         D+++  +  D+ H ++++
Sbjct: 538  SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588

Query: 582  SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
            S    T+VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q
Sbjct: 589  SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
            ++               S V+ V+IADPYV L M +G     ++R   G P         
Sbjct: 648  NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697

Query: 694  VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
              +PA  AI + K      T+  D                       EP ++    +  L
Sbjct: 698  TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGTGSGFGHSFGGYMKAEPNMKVEDEEDLL 757

Query: 732  STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
                G A        L Q                     + VV  +SG LEI+ +P+   
Sbjct: 758  YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
            V+ ++   +G   + D     +L  S  E   +S+ G  Q    ++ +S   +EL++   
Sbjct: 818  VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              +  RP L  + T   +L YQ  +F  P+   K        R L   N+   +  ++  
Sbjct: 875  GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
                 +             Q++  F N+ G  G  + G  PC+  +  R  LR+H    +
Sbjct: 927  EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
            G + +F   +NVN  +GF+Y  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y 
Sbjct: 987  GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYH 1045

Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSL------LIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1065
             E  +Y LI      +P+ +          L ++  G +    N S           ++E
Sbjct: 1046 RENRVYCLITQTE--EPMTKYYRFNGEDKELSEESRGERFIYPNGS-----------QFE 1092

Query: 1066 VRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGE 1122
            + ++ P+     W+    A+I  +  E+    ++V L +  T    +  L IGT +   E
Sbjct: 1093 MVLISPET----WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1148

Query: 1123 DVAARGRVLLF 1133
            D+ +RG + ++
Sbjct: 1149 DITSRGNIHIY 1159


>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
 gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
          Length = 1459

 Score =  343 bits (881), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 323/1211 (26%), Positives = 535/1211 (44%), Gaps = 214/1211 (17%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  AN++++Y +    E G ++  N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     +D+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
                 P+V+VDP  RC  +LVYG ++++L   +  S    DE        F         
Sbjct: 136  GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192

Query: 230  ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
               I +S++I L++LD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ 
Sbjct: 193  RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLV 252

Query: 285  ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
            A+S++   + HP+IW+  +LP D +++  +  PIGG LV+  N + Y +QS         
Sbjct: 253  AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306

Query: 345  YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
            Y VSL+SS +        P+    + LD A+  ++  D  ++S +TG+L +LT+  D  R
Sbjct: 307  YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366

Query: 397  VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
             V+     K   SVLTS I    +   FLGSRLG+SLL+ FT    +++++  +      
Sbjct: 367  TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426

Query: 452  ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
                            EE  D++    AP   + RR         +  EEL +YGS +  
Sbjct: 427  EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477

Query: 494  TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
            +    + F F V DSL+N+ P+     G R+  +                     +ATG 
Sbjct: 478  SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537

Query: 534  SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
            SK           N +++   EL GC  +WTV+         D+++  +  D+ H ++++
Sbjct: 538  SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588

Query: 582  SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
            S    T+VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q
Sbjct: 589  SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
            ++               S V+ V+IADPYV L M +G     ++R   G P         
Sbjct: 648  NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697

Query: 694  VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
              +PA  AI + K      T+  D                       EP ++    +  L
Sbjct: 698  TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGSGSGFGHSFGGYMKAEPNMKVEDEEDLL 757

Query: 732  STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
                G A        L Q                     + VV  +SG LEI+ +P+   
Sbjct: 758  YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
            V+ ++   +G   + D     +L  S  E   +S+ G  Q    ++ +S   +EL++   
Sbjct: 818  VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              +  RP L  + T   +L YQ  +F  P+   K        R L   N+   +  ++  
Sbjct: 875  GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
                 +             Q++  F N+ G  G  + G  PC+  +  R  LR+H    +
Sbjct: 927  EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
            G + +F   +NVN  +GF+Y  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y 
Sbjct: 987  GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYH 1045

Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSL------LIDQEVGHQIDNHNLSSVDLHRTYTVEEYE 1065
             E  +Y LI      +P+ +          L ++  G +    N S           ++E
Sbjct: 1046 RENRVYCLITQTE--EPMTKYYRFNGEDKELSEESRGERFIYPNGS-----------QFE 1092

Query: 1066 VRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGE 1122
            + ++ P+     W+    A+I  +  E+    ++V L +  T    +  L IGT +   E
Sbjct: 1093 MVLISPET----WEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSE 1148

Query: 1123 DVAARGRVLLF 1133
            D+ +RG + ++
Sbjct: 1149 DITSRGNIHIY 1159


>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit [Rhipicephalus pulchellus]
          Length = 1237

 Score =  342 bits (876), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 292/996 (29%), Positives = 449/996 (45%), Gaps = 185/996 (18%)

Query: 251  FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
            F+HGY EP ++IL+E   TW GR++ +  TC I ALS++   + HP+IWS  NLP D  +
Sbjct: 3    FLHGYYEPTLLILYEPLRTWPGRIAIRQDTCCILALSLNLQQRVHPVIWSYTNLPFDCLR 62

Query: 311  LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL-------PRSSFSVE 363
            LLAVP P+GGVL++  +++ Y +QS         Y VSL+S  +        P+    + 
Sbjct: 63   LLAVPRPLGGVLIMAVDSLLYLNQSVP------PYGVSLNSFTDFSTSFPLKPQEGLKIS 116

Query: 364  LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSL 422
            LD A A +L  D  +LS K G+L +LT+  DG R V+     K   SVLT+ +T   +  
Sbjct: 117  LDCAQACFLSYDRLVLSLKGGELYVLTLFNDGMRSVRNFYFDKAAASVLTTSMTLCEDGY 176

Query: 423  FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----- 477
             FLGSRLG+SLL+ +T      +     ++E  + +A+ P +K+ R    DA+ D     
Sbjct: 177  LFLGSRLGNSLLLHYT-EKAAEVDDIAKRDEKTESDANDPPSKKKRM---DAIGDWMASD 232

Query: 478  --MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINAD 527
              +++ +EL +YGS +  T+    +++F V DSL+NIGP      G           N D
Sbjct: 233  VALIDPDELEVYGSETMATKQL-TSYTFEVCDSLINIGPCGKICMGEPAFLSEEFVQNTD 291

Query: 528  -----ASATGISKQSNYELV------------ELPGCKGIWTVY---------HKSSRGH 561
                  +  G  K     ++            ELPGC  +WTV           K+    
Sbjct: 292  PDLELVTTAGYGKNGALCVLQRSVRPQVVTTFELPGCVHMWTVMGPPAEKKPPEKTEESD 351

Query: 562  NADSSRMAAYDD--EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFG 619
            +  S   AA       HA+LI+S    +M+L+T   + E+  S  +  Q  T+ AGNL  
Sbjct: 352  DPASEDKAAEQPLTNTHAFLILSRADSSMILQTDQEINELDHS-GFSTQNPTVFAGNLGD 410

Query: 620  RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
             R V+QV   G R+LDG+   Q +               S++++ S+ADP+V++  ++G 
Sbjct: 411  GRYVLQVCPMGVRLLDGTRQLQHIPL----------DVGSSIVAGSLADPHVIIRSAEGL 460

Query: 680  I--RLLVGDPST-CTVSVQTPAAIESSKKPVSSCT------LYHDKGPEPWLRKTSTDAW 730
            +    L GDP+  C ++V  P       K +S C       L+  +  EP   + +    
Sbjct: 461  VIHLTLRGDPAAGCRLAVLRPQLTAVKAKILSICVYKDVSGLFTTQYREP--DEPAKPEK 518

Query: 731  LSTGVGEAIDGADGGPLDQGD--------------------------------------- 751
                  E+ID +  G LD  D                                       
Sbjct: 519  PLPPPKESIDMSSNGLLDDEDELLYGESEENPIQKEPVRMTSEEAPSVAESMFEIKEVAP 578

Query: 752  -IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEE 809
              +  V  E+G LEI+ +P +   F V  F  G+  +VD+    A   +++E ++  S E
Sbjct: 579  TYWLFVARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHE 638

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                     +H + VV L ++     HSRP L A + D  +L Y+A+ F           
Sbjct: 639  SM-----PVVHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF----------- 676

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIF 917
               T R   +          LRF +   D + RE     + P     ++       +  F
Sbjct: 677  -YETQREGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPF 726

Query: 918  KNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
             +ISG+ G FL G RP W  M  R  LR HP   DG I  F   HNVNC  GF++   QG
Sbjct: 727  SDISGYSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQG 786

Query: 977  ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
             L+I  LP+  TYD  WPV+KV PL+ TPH + Y  +   Y ++ S P   P N ++   
Sbjct: 787  ELRISTLPTHLTYDAPWPVRKV-PLRCTPHFVNYHVDSKTYCVVTSQP--DPCNHLVRF- 842

Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALT 1094
                 G + +   L     +   T++++ +++L P      W+T     + +   E+   
Sbjct: 843  ----TGEEKEYELLERDSRYIFPTMDKFSLQLLSPVS----WETIPNTRVDLDEWEHLTC 894

Query: 1095 VRVVTLFNT-TTKENETLLAIGTAYVQGEDVAARGR 1129
            ++ V L +  TT   +  LA+GT Y  GEDV +RGR
Sbjct: 895  LKNVMLSSEGTTTGMKGYLALGTNYCYGEDVTSRGR 930


>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
            tauri]
 gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
            [Ostreococcus tauri]
          Length = 1473

 Score =  334 bits (857), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 271/987 (27%), Positives = 427/987 (43%), Gaps = 195/987 (19%)

Query: 260  MVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
            + IL+E+  TWAGR +    TC I ALS+    ++  +IW   NLP  +YKL A+  P+G
Sbjct: 6    LAILYEKTPTWAGRYNLAKDTCEIVALSVDVDKQKSTVIWRRQNLPSSSYKLTALLPPLG 65

Query: 320  GVLVVGANTIHYHSQSASCALALNNYA------------------VSLDSSQELPRS--- 358
            GVLV   + + + SQ +S AL LN +                   +  D+    P +   
Sbjct: 66   GVLVFSQDFLLHESQESSSALCLNTFGRGGPQEGNDAETVARLAGMGEDAVANPPPACAA 125

Query: 359  -----SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
                    + LD A A  +  D  L++TK G L LL +  DGR ++R+ L +   +VL+S
Sbjct: 126  RAVDCGLEITLDGAQAAVVSEDRVLVTTKMGALFLLALHTDGRSLRRMMLQRAGGAVLSS 185

Query: 414  DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---MLSSGLKEEFGDIEADAPSTKRLRRS 470
             +  +   L FLGSR+GDSLLV+FT  S  +   ML  G  +E    E +  S KR +  
Sbjct: 186  GMCLLSRDLLFLGSRIGDSLLVKFTPKSEPAAPLMLPKGEDDEETVDEVEKGSGKRSKSG 245

Query: 471  SSDALQDMV-----------------NGEELS--LYGS-------ASNNTESAQKT---- 500
               A++                    + +EL   LYG+           T++A+K     
Sbjct: 246  DGAAIRKRAKSTEDPPPAPSTPSPEDDDDELEALLYGTTKAESVIGDETTQTAEKKREGL 305

Query: 501  -----------FSFAVRDSLVNIGPLKDFSYGLR--INADA-------SATGISKQ---- 536
                       + F V+DSL+ + P+ D + G    +  D        +A G  K     
Sbjct: 306  AGVVPGLKVAGYDFKVKDSLLGVAPVVDITVGASAPVGTDTAERTELVTACGQGKNGALA 365

Query: 537  -----------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
                       +  E   LP  +G+W ++ +       + +R     + +H +L++ L+ 
Sbjct: 366  ILTRGVQPELVTEVEAGTLPTLQGLWALHDRK------EGTR--EVREPFHNHLLLKLQ- 416

Query: 586  RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
                        EV+ S+++     T+AA N FG    +Q+ E   RIL      QD++ 
Sbjct: 417  ------------EVSASLEFITDQATLAAANFFGHFCSLQITETSIRILKSGMKVQDVTL 464

Query: 646  GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
                +  GS      + S  I DPY+++ +SDG++RLL GD    TVS+    A+ +S +
Sbjct: 465  ADIKAPKGS-----VIASAEILDPYIMIRLSDGTLRLLAGDEKKMTVSLMESGAMPTSSR 519

Query: 706  PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
                       G   W+ +++T+  ++   G    GA     +Q +    +  E G+LE+
Sbjct: 520  RTRLVEALKKSG---WIHRSATNGTITGLEGSKKSGAS----NQKEAIVAIAREGGSLEL 572

Query: 766  FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
            F +P+   ++  D    G   +  T        SE  I                   ++V
Sbjct: 573  FSLPSCTRIWNADGLSEGSRVLSPTRPVH----SELRIP------------------EIV 610

Query: 826  ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
            ++ +  +   H RP L A+  DGT+L Y+ ++         S++P++             
Sbjct: 611  DIRIDSFEEAHERPLLTAVRGDGTLLLYRGFIVPAGTTCEGSEEPLARG----------- 659

Query: 886  RLRNLRFSRTPLD-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                LRFSR  +D             A    ++  G    RI+      G QG F++G  
Sbjct: 660  ---ELRFSRVNIDVEGSGLNVAGVGVAGQVRDSLAGTRLTRISNVGEGQGLQGIFVAGPN 716

Query: 933  PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
            P W +V R R+   P   +G IVAFT  HNVNC +GFI  T+ G ++ICQ+PS   Y+  
Sbjct: 717  PLWLIVRRSRVLALPTRGEGEIVAFTDFHNVNCPYGFILGTAVGGVRICQMPSKMHYEAA 776

Query: 993  WPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV-GHQIDNHNLS 1051
            WPV+K I LK TPH + Y  +  LY L+ S  V           +D+E+ G  +   +LS
Sbjct: 777  WPVRK-IALKCTPHAVAYLPDFKLYALVTSANV---------PWVDREIDGENVHGLSLS 826

Query: 1052 SVDLHRTYTVE----EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE 1107
                 R    +    +Y VR+L P      WQ      ++  E+   VR V L +  T  
Sbjct: 827  KARRERAKAHDDMELQYSVRLLVPGSLDCVWQHT----LEPGEHVQCVRNVQLKDINTGH 882

Query: 1108 NETLLAIGTAYVQGEDVAARGRVLLFS 1134
            + + LA+GTA   GED   RGRV LF+
Sbjct: 883  SLSYLAVGTAMPGGEDTPCRGRVYLFN 909


>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes aegypti]
 gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
          Length = 1417

 Score =  334 bits (856), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 315/1165 (27%), Positives = 526/1165 (45%), Gaps = 164/1165 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV   ANV+++Y  R+  +    S++   T R   M       LE +  Y L GN+ S+
Sbjct: 29   SLVTGGANVLKVY--RLIPDADATSRDKFTTTRPPNM------KLECMATYTLFGNIMSM 80

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  G+    +RD+++++F+DAK+SV++FD     L+  S+H FE  +   +K G   
Sbjct: 81   QSVSLAGS----QRDALLISFQDAKLSVVQFDPDNFELKTLSLHYFEEED---IKGGWTG 133

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGG---FSARI 230
                P+V+VDP  RC  +LVYG ++++L   K S      V D                I
Sbjct: 134  HYHTPIVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEVQDVKPMKKAPTQLIAKTPI 193

Query: 231  ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             +S+VI L++ +  + +V D  F+HGY EP ++IL+E   T+ GR++ +  TCM+ ALS+
Sbjct: 194  LASYVIELKESEERIDNVIDIQFLHGYYEPTLLILYEPVKTFPGRIAVRSDTCMMVALSL 253

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAV 347
            +   + HP+IW+   LP D  + +A+  PIGG L++  N + Y +QS     ++LN+ A 
Sbjct: 254  NIQQRVHPVIWTVNCLPFDCLQAIAISKPIGGCLILSVNALIYLNQSVPPYGVSLNSIAD 313

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
               +    P+    + LDAA   +++ +  +LS K G+L +LT+  D  R V+    SK 
Sbjct: 314  HCTNFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRSFHFSKA 373

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
              SVLT  I  +     FLGSRLG+SLL++F     + +++    EE  + E      KR
Sbjct: 374  ASSVLTCCICVVEEEYLFLGSRLGNSLLLRFKEKDESMVITIDDTEEVVEKEP-----KR 428

Query: 467  LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI-- 524
            LR          +  EEL +YGS    T     ++ F V DS++NIGP+   + G RI  
Sbjct: 429  LR----------LEQEELEVYGSG-QKTSVQLTSYIFEVCDSILNIGPIGHMAVGERISE 477

Query: 525  ---NADASATGISKQSNYELVELP--GCKGIWTVYHKSSRGHNADSSRMAAY--DDEYHA 577
               + +     +  + + E+V     G  G   V   S +     S  ++     D+ H+
Sbjct: 478  EEQDENKDVQFVPNKLDLEIVTSSGHGKNGALCVLQNSIKPQVITSFGLSGCLDVDDMHS 537

Query: 578  YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
            ++I+S EA TMVL+T D + E+ E+  +     TI  GN+ G R ++QV  +  R+L G+
Sbjct: 538  FMILSQEAGTMVLQTGDEINEI-ENTGFATNVPTIHVGNIGGNRFIVQVTTKSIRLLQGT 596

Query: 638  YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV-----GDPSTC-- 690
             + Q++                 + +VSIADPYV +  S+G +  L      G P     
Sbjct: 597  RLLQNIPI----------DLGCPLAAVSIADPYVCVRSSEGRVITLALREGKGTPRLAVN 646

Query: 691  --TVSVQTPAAIE-SSKKPVSS--CTLYHD------------------KGPEPWLRKTST 727
              T+S  TPA +  S  K VS    T Y D                    PEP ++    
Sbjct: 647  KNTIST-TPAVVAISVYKDVSGMFTTKYEDFYDGSKAGSSAYSSGFGYMKPEPHMKIEDE 705

Query: 728  DAWLSTGVGEAI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVP 769
            +  L    G +       D A        D +                 ++G LEI+ +P
Sbjct: 706  EDLLYGESGRSFKMTSMADMAIETKKKNTDFWRKFMQPVKPTFWLYAVRDNGNLEIYSMP 765

Query: 770  NFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN---SSSEEGTGQGRKENIHSMKVVE 826
            +   V+ +    +G   + D+     L+  +T  +   +S+   +  G   N+   +++ 
Sbjct: 766  DLKLVYLITNIGNGNKVLQDSMEFVPLQVGQTAADADVTSNAFTSPFGFNPNLLPKEILM 825

Query: 827  LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
            +A+     H +RP LF  L +  +L Y+ Y +      SK    +   R    S V+   
Sbjct: 826  VAL---GHHGTRPMLFVRL-ENDLLVYRVYRY------SKGHLKLRFRR--VPSGVTGPI 873

Query: 887  LRNLRFSRTPLDAYTREETPHGAPCQ-----RITIFKNISGHQGFFLSGSRPCWCMVFRE 941
             +       P D    +   H           I  F N++G+ G  + G +P + M+   
Sbjct: 874  FKIAPRQSAPTDQEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKP-YIMLLTS 932

Query: 942  R--LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
            R  LR H       +  F   +NVNC +GF+Y   Q  LKI   P   +YD+ WPV+K I
Sbjct: 933  RGELRAHRLYAKTIMKGFAPFNNVNCPNGFLYFDEQYELKIAVFPGYLSYDSIWPVRK-I 991

Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1059
            PL+++P QI Y  E  +Y +++                 +EV ++    N    +L    
Sbjct: 992  PLRSSPKQIVYHKENKVYCVVMDA---------------EEVCNKYYRFNGEDKELTEEN 1036

Query: 1060 TVE--------EYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKEN 1108
              E        ++ V ++ P      W+     +I +   E+ + ++ V+L +       
Sbjct: 1037 KGERFLYPMAHKFSVVLVTP----SAWEIIPETSINLDEWEHVIALKNVSLSYEGARSGF 1092

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLF 1133
            +  +A+GT +   ED+ +RGR+LL+
Sbjct: 1093 KEYIAVGTNFNYSEDITSRGRLLLY 1117


>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
          Length = 1260

 Score =  333 bits (854), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 314/1154 (27%), Positives = 509/1154 (44%), Gaps = 172/1154 (14%)

Query: 57   NLVVTAANVIEIY--VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
            NLVV  ANV+ ++  +    E+  ++    G+  +           LE +  Y L G V 
Sbjct: 29   NLVVAGANVLRVFRLIPNTDEKMLRKESADGQPPK---------MKLECLASYNLFGKVM 79

Query: 115  SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            S+A +S  G+     +D+I+++F  AK+S++E+D     L+  S+H FE    L    G 
Sbjct: 80   SIAAVSLPGSS----QDTILMSFAHAKLSLIEYDPVSDNLKTLSLHNFEVVSIL--DEGI 133

Query: 175  ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE-SS 233
             S  + P ++VDP+GRC  +L++   + IL               F       + +  SS
Sbjct: 134  GSNHKIPEIRVDPEGRCAALLIFRNTLAILP--------------FRKDSAHDSNVTLSS 179

Query: 234  HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++I L DL+ +  +V D  F+HGY EP ++IL+E   T+ GR++ +  TC + A+S++T 
Sbjct: 180  YIIKLTDLEERVDNVIDVQFLHGYYEPTLIILYEPVGTFPGRIAVRQDTCNMVAVSLNTQ 239

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLD 350
             + HP+IWS  +LP D  +LL VP P+GG L++  N++ Y +QS     +++N+ A    
Sbjct: 240  QRVHPIIWSLNSLPFDCSQLLPVPKPLGGALIMAVNSVIYVNQSVPPYGVSVNSIADHCT 299

Query: 351  SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPS 409
            S    P     + LD A A +LQ D  +LS K G+L +LT+  D  R V++  L K   S
Sbjct: 300  SFPLKPYEGSRIGLDCARAAFLQYDRVVLSLKGGELYVLTLFADSMRSVRKFHLEKAAAS 359

Query: 410  VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
            VLT+ +    N LF LGSRLG+SLL+ F         +    +      A  P  ++   
Sbjct: 360  VLTTCLCICDNYLF-LGSRLGNSLLLAFQ--------TKDYNQYATPFAAKKPKMEQFSL 410

Query: 470  SSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
                 L D ++ EE+   LYG    +T+S   ++ F V DSL+NIGP    + G      
Sbjct: 411  LFDQEL-DHLDEEEIDNYLYGEDHESTDSKAISYQFEVCDSLLNIGPCGQMAVG---EPA 466

Query: 528  ASATGISKQSNYELVELP-----GCKGIWTVYHKSSRGHNADSSRMAAYDDEY------- 575
            ++ T   K+S    VE+      G  G   V  ++ +     +  +    D +       
Sbjct: 467  STCTDFDKKSPDPDVEIVTTSGYGKNGAICVLQRTMKPQVVTTFELPEVSDMFTVFASRN 526

Query: 576  ------HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
                  H YL++S    TMVL+T   + E+ +S  + V   TI A NL   R ++QV   
Sbjct: 527  NEDAIMHTYLLLSRADSTMVLQTGQEINEMDQS-GFSVTSPTILAANLGNNRFIVQVCPT 585

Query: 630  GARILDGS-YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV----------LLGMSDG 678
              R+LD +  + Q+L              +  + S S +DPYV          LL   +G
Sbjct: 586  SVRLLDATATVIQELVM----------DSDFLITSASASDPYVAVLTENGRIGLLTFVEG 635

Query: 679  SIRLLV------GDPSTCTVSVQTPAAIESSKKPVS---SCTLYHDKGPEPWLRKTSTDA 729
            S   ++        P  C    +  + + ++  P +     T  H        +K   D 
Sbjct: 636  SQLEMIFPVLSKNSPVVCVCLYRDISGLFNTTIPETDSPETTKLHTANKSLNAKKEMDDE 695

Query: 730  ----WLSTGVGEAIDGADGGPLDQGDIYSVVCY--------------ESGALEIFDVPNF 771
                +  T   E+    D           V+ Y              ++G LEI+ +   
Sbjct: 696  EDYLYGDTNTEESRPTEDKTHTKFTPQQKVIDYFREIKPTFWLSIIRQNGTLEIYSLAGQ 755

Query: 772  NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
            +    V+ F +   H+    +   +K  ET + SS+                +VE+ +  
Sbjct: 756  S---VVETFQTVHVHLGHRLIFN-MKADETSLPSSTH-------------CNIVEMGIFG 798

Query: 832  WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRN-- 889
                H RP L    +D  +L Y+A              PV  S+  +   +   +L +  
Sbjct: 799  LGHLHRRPLLMIRTSDFGVLLYEAI----------PALPVYDSKQKNELKIRFRKLNHSL 848

Query: 890  -LRFSRTPLDAYTREE------TPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRE 941
             LR ++T    Y R+        P+     +   F NI+G+ G F+ G  P W  M  R 
Sbjct: 849  LLRETKT----YVRKGGQSVVLEPYAWKTNQFKYFSNIAGYTGVFIGGPYPHWLFMTSRG 904

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPL 1001
             LR+HP   DGSI  F   HNVNC  GFIY+  +  L+IC LP+   YD  WPV+KV PL
Sbjct: 905  ELRLHPMSIDGSIKCFACFHNVNCAQGFIYLNRKDELRICLLPTLFNYDAPWPVRKV-PL 963

Query: 1002 KATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT- 1060
            + TPH + Y  E   Y  I++  + +P N++     D +        +L   D    Y  
Sbjct: 964  RCTPHYLIYHVETKTY--ILATSLAEPTNRIYRFNGDDK------ELSLEERDDRFPYPH 1015

Query: 1061 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYV 1119
            VE++ ++++ P        TR  + +   E+   ++ V+L +       +  LA+ T Y 
Sbjct: 1016 VEKFAIQLISPVTWEAVPNTR--MDLDDWEHVTCLKTVSLEYEGHASGLKDYLAVSTNYN 1073

Query: 1120 QGEDVAARGRVLLF 1133
             GED+ +RGR+ + 
Sbjct: 1074 YGEDIISRGRIFIL 1087


>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Acyrthosiphon pisum]
          Length = 1335

 Score =  330 bits (847), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 313/1141 (27%), Positives = 507/1141 (44%), Gaps = 203/1141 (17%)

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            LVV   N++ +Y +   +   +  K                   E +  Y L GN+  L 
Sbjct: 30   LVVAGVNILRVYRLVPTDTTCQPPK----------------TKFECLAQYTLFGNIMCL- 72

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
               Q         D+++L+F +AK S++E+D  +H LR  S+H FE  ++   K G    
Sbjct: 73   ---QSVTLCPSSPDALLLSFSEAKFSLVEYDRDMHSLRTLSLHYFEDDKF---KNGHTQH 126

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
               PL++VDP GRC   LVYG   ++L       G   D++        SA++  S+ I 
Sbjct: 127  WSPPLIRVDPDGRCVVGLVYGSYFVVLPF-----GRTIDDN------AKSAQVMPSYTIP 175

Query: 238  LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
            +  +D  M ++ DF F+HGY EP ++IL+E   T+AGR++ +  TC + A+S++     H
Sbjct: 176  ISKIDPKMNNIMDFDFLHGYYEPTLLILYEPVKTFAGRIAVRKDTCAMVAISLNIQQHVH 235

Query: 296  PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQE 354
            P+IWS  +LP+D  K++AV  PIGGVL++  N++ Y +QS     +ALN+ A +L +   
Sbjct: 236  PVIWSLDSLPYDCQKVIAVSRPIGGVLIMAVNSLIYLNQSVPPFGVALNSIAKTLTNFPL 295

Query: 355  LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTS 413
              +   ++ LD A AT++ +D  + S   GDL ++T+  D  R V+     K   SVLT+
Sbjct: 296  GQQEDINLVLDRATATFISSDKLVTSLCNGDLYVITLYADSMRAVRSFHFEKCASSVLTT 355

Query: 414  DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
             IT   +S  FLGSRLG+SLL+++   S ++               D PS KR +   +D
Sbjct: 356  CITVCLDSYLFLGSRLGNSLLLRYYARSQSN--------------DDEPSIKRKKTDETD 401

Query: 474  ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
              +D+V   EL +YGS    T    +++SF V DS++NIGP    S G    A  S    
Sbjct: 402  --EDLV---ELEVYGSEV-QTSICLESYSFEVCDSIINIGPCSQASIGE--PAYISDEFS 453

Query: 534  SKQSNYELVELP--GCKGIWTVYHKSSRGHNADSSRMAAYDD--------EYHAYLIISL 583
            S + + EL+     G  G  +V H+S +     +  +  Y D        ++H ++I++ 
Sbjct: 454  SDEHDVELLCTSGHGKNGALSVLHRSIKPQLVTTFHLDGYKDMWTVHGENDFHTFMILTN 513

Query: 584  EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
               T++L+T   + E+ +S  Y  +  T+   N+   + VIQV     R+L+GS   Q +
Sbjct: 514  VDSTLILQTGQEINEL-DSSGYATREHTVFVCNM--NKFVIQVLRYSVRLLNGSEQLQSV 570

Query: 644  S--FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------RLLVGDPSTCTV 692
            S  FG            S ++  S  +PY +L   DG +         R+L+  P+    
Sbjct: 571  SLDFG------------SPIIHGSSCNPYAVLLTEDGQVIVLTVKSTGRILLMRPTNFEQ 618

Query: 693  SVQTP--------AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
              QT         + + SS  P +   L    GP     K   D ++S  V +  +   G
Sbjct: 619  IPQTKTLAVYRDVSGLFSSTMPQAEIPLV---GP-----KLQHDHFVSDSVEDEEEMLYG 670

Query: 745  GPLDQGD--------------------------IYSVVCYESGALEIFDVPNFNCVFTVD 778
               D                              + V+  ++G +EI+ +P+F       
Sbjct: 671  DARDPSSRETPHNSVSNKNTMWWLKFLEVPTPTYWVVLTRDNGYMEIYTLPDFKI----- 725

Query: 779  KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK-ENIHSMKVVELAMQRWSAHHS 837
                        Y    + +S   +  S EEG    +K E I  + +V L  Q       
Sbjct: 726  -----------KYRAANIDESPMILKDSLEEGCYFPKKTEIIKEILIVPLGYQ-----DK 769

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
            RP +F  L D  ++ Y  +    PE T K                       +RF +  +
Sbjct: 770  RPIMFVRL-DNEVVIYGIH--RHPEGTLK-----------------------MRFHK--M 801

Query: 898  DAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSRPCWCMV-FRERLRVHPQLCDGS 953
             +    ++  G P +  ++   F  ++GH G F+ G  P   ++  R  LR HP   DG 
Sbjct: 802  TSLLTFQSRSGNPLEGTSLLRYFSKVAGHNGVFICGQNPHLILLTVRGELRCHPLHIDGP 861

Query: 954  IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAE 1013
            I+ F   HNVNC+ GF+Y  S   L+I  LP+  +YD  WP++KV PL+ TPH I Y  E
Sbjct: 862  IMCFAPFHNVNCSQGFLYFNSDHKLRISILPTHLSYDEPWPLRKV-PLRKTPHFIAYHLE 920

Query: 1014 KNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDR 1073
               Y ++ S   L       +   D+E+  + +   L  +  H  +T+E +     EP  
Sbjct: 921  TKTYCVVTSSSELSASYYRFNGE-DKELTTE-ERDPLFPLPSHEVFTLELFSPASWEP-- 976

Query: 1074 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLL 1132
                     +I  +  E+   ++ V L     +   +  +A+GT Y   ED+ +RGR+ L
Sbjct: 977  -----IPDTSIETEDWEHITCLKNVALAYEGARSGLKGYIAMGTNYSYSEDITSRGRIFL 1031

Query: 1133 F 1133
            F
Sbjct: 1032 F 1032


>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
 gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
          Length = 1434

 Score =  323 bits (828), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 314/1189 (26%), Positives = 540/1189 (45%), Gaps = 196/1189 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV   ANV+++Y  RV  +    +++     R   M       LE V  YRL+GN++S+
Sbjct: 29   SLVTGGANVLKVY--RVIPDADPATRDKYTAARPPNM------KLECVASYRLNGNIKSM 80

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  G+     RD+++++F DAK+SV++FD     L+  S+H FE  +   ++ G   
Sbjct: 81   QSVSLAGS----LRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 133

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR--I 230
                P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +     A+  I
Sbjct: 134  HYHIPMVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIELQDVKPIKKAPMQLVAKTPI 193

Query: 231  ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             +S++I L+DLD K  +V D  F+HGY EP ++IL+E   T+ GR++ +  TC + ALS+
Sbjct: 194  LASYIIELKDLDEKIDNVIDIQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 253

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
            +   + HP+IW+  +LP D  + + +  PIGG LV+  N++ Y +QS         Y VS
Sbjct: 254  NIQQRVHPVIWTVNSLPFDCIQAIPINKPIGGCLVMCVNSLIYLNQSVP------PYGVS 307

Query: 349  LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQR 400
            L+SS +        P+    + LDAA   +++ +  +LS K G+L +LT+  D  R V+ 
Sbjct: 308  LNSSADHSTSFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRN 367

Query: 401  LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
               +K   SVLTS I    +   FLGSRLG+SLL++F     + +++    ++ G +E +
Sbjct: 368  FHFNKAAASVLTSCICVCEDEYLFLGSRLGNSLLLRFKEKDESLVITI---DDSGAVEKE 424

Query: 461  APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                KR R    +            +YGS    T     ++ F V D+++NIGP+   + 
Sbjct: 425  P---KRPRLEEEEL----------EVYGSGY-KTSVQLTSYIFEVCDNVLNIGPIAHMAV 470

Query: 521  GLRINADASATG-----ISKQSNYELVE--------------------------LPGCKG 549
            G R+  + +        +  + + E+V                           L GC  
Sbjct: 471  GERVAEEDAENQPDVQIVQNKLDIEVVTSSGHGKNGALCVLQSSIKPQVITSFGLSGCVD 530

Query: 550  IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
            +WTV+ ++        +R A      HA++I+S E  TMVL+T + + E+ E+  +    
Sbjct: 531  VWTVFDEA-------VARRAEDGPSTHAFMILSQEGGTMVLQTGEEINEI-ENTGFATTV 582

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
             TI  GN+   R ++QV  +  R+L G+ + Q++                 + SV+I DP
Sbjct: 583  PTIHVGNIGTNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVAIVDP 632

Query: 670  YVLLGMSDGSIRLLV-----GDPSTC----TVSVQTPAAIE-SSKKPVSSCTLYHDK--- 716
            YV +  S+G +  L      G P       T+S  TPA +  S+ + VS   L+  K   
Sbjct: 633  YVCVRSSEGRVITLALREGKGTPRLAVNKNTIS-PTPAVVAISAYRDVSG--LFTKKIED 689

Query: 717  --------------------GPEPWLRKTSTDAWLSTGVGE----------AIDGADGGP 746
                                 PEP ++    +  L    G           AI G  GG 
Sbjct: 690  VYDLSRGGAASAYSSGFGSMKPEPHMKIEDEEDLLYGESGRSFKMTSMADMAIAGKSGGS 749

Query: 747  LD---------QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT--YMREA 795
             D         +   +     ++G LEI+ +P+   V+ +    +G   + D+  ++   
Sbjct: 750  ADFWMKYMQQVKPTYWLFAARDNGTLEIYSMPDLKLVYLITNVGNGNKVLSDSMEFVPLP 809

Query: 796  LKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQA 855
            +  S ++ ++SS  G   G   ++   +++ +A+    ++ SRP LF I  +  +L Y+ 
Sbjct: 810  MGKSASQEDASSAFGASFGVSASLLPKEILMVAL---GSYGSRPLLF-IRLEHDLLIYRV 865

Query: 856  YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP----- 910
            + +      SK    +   R LS S V+    R    S         E+    A      
Sbjct: 866  FRY------SKGHLKLRFKR-LSTS-VTCPVFRTPEPSGAGATEAANEQQQARATKVLYE 917

Query: 911  -CQRITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHG 968
                I  F N+SG+ G  + G +P +  +     LR H       + AF   +NVNC +G
Sbjct: 918  NISMIRYFANVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFNNVNCPNG 977

Query: 969  FIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1028
            F+Y   Q  LKI   P+  +YD+ WPV+K IPL+++P QI Y  E  +Y +++    +  
Sbjct: 978  FLYFDEQYELKISIFPTYLSYDSVWPVRK-IPLRSSPKQIVYHRENKVYCVVMDAEEI-- 1034

Query: 1029 LNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIP 1085
             N+       D+E+  +           HR      + V ++ P      W+     +I 
Sbjct: 1035 CNKYYRFNGEDKELTEENKGERFLYPMGHR------FSVVLVTP----AAWEVVPETSIN 1084

Query: 1086 MQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLF 1133
            ++  E+ + ++ V+L     +   +  +A+GT +   ED+ +RGR+LL+
Sbjct: 1085 LEEWEHVIALKNVSLTYEGARSGLKEYIAVGTNFNYSEDITSRGRLLLY 1133


>gi|215701517|dbj|BAG92941.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 265

 Score =  323 bits (828), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 161/246 (65%), Positives = 191/246 (77%), Gaps = 1/246 (0%)

Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
           GYIEPV+VILHE+E TWAGR+  KHHTCMISA SIS TLKQHP+IWSA NLPHDAY+LLA
Sbjct: 19  GYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLKQHPVIWSAANLPHDAYQLLA 78

Query: 314 VPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           VP PI GVLV+ AN+IHYHSQS SC+L LNN++   D S E+ +S+F VELDAA ATWL 
Sbjct: 79  VPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQVELDAAKATWLS 138

Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
           ND+ + STK G+++LLTVVYDGRVVQRLDL K+  SVL+S +T+IGNS FFLGSRLGDSL
Sbjct: 139 NDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNSFFFLGSRLGDSL 198

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASN 492
           LVQF+  +  S+L     E   DIE D P +KRL+R  SD LQD+ + EELS     A N
Sbjct: 199 LVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSVEELSFQNIIAPN 258

Query: 493 NTESAQ 498
           + ESAQ
Sbjct: 259 SLESAQ 264


>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
            [Drosophila melanogaster]
 gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
 gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
            [Drosophila melanogaster]
          Length = 1420

 Score =  320 bits (820), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 318/1193 (26%), Positives = 521/1193 (43%), Gaps = 217/1193 (18%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+             
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRI------------- 242

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                  K+  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 243  ----------------------KVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 275  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 335  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 394

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E  ++E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 395  RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 453

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 454  SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 513

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 514  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 564

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 565  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 613

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 614  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 673

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 674  LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 733

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+   +G   + 
Sbjct: 734  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 793

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D    E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 794  DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 850

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R +   N+   +  ++       D     E+   
Sbjct: 851  ELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQM 901

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 902  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 961

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
             +GF+Y  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y  E  +Y LI     
Sbjct: 962  PNGFLYFDTTYELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE- 1019

Query: 1026 LKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--R 1081
             +P+ +       D+E+  +              Y +  ++E+ ++ P+     W+    
Sbjct: 1020 -EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPD 1067

Query: 1082 ATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++
Sbjct: 1068 ASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 1120


>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
 gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
          Length = 1426

 Score =  317 bits (813), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 322/1205 (26%), Positives = 511/1205 (42%), Gaps = 235/1205 (19%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    +  +  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVDAVQRQKLNPSEMRLAPKM------RLECLASYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQSVSLAGA----MRDALLISFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +      R  
Sbjct: 136  GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I L DLD K  +V D  F+HGY EP ++IL+E   T AGR+  +  T       
Sbjct: 196  IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDT------- 248

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                      +  PIGG LV+  N + Y +QS         Y V
Sbjct: 249  -----------------------FFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 279

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+ +  + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 280  SLNSSADNSTAFPLKPQDNVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 339

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++            
Sbjct: 340  NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEATVEQQT 399

Query: 448  -----SGLKEE--FGDIEA-DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                   L EE    D+E  +AP   + RR         +  EEL +YGS +  +    +
Sbjct: 400  IEQSPEELAEESPVYDVEQHEAPPQSKSRR---------IEDEELEVYGSGAKASVLQLR 450

Query: 500  TFSFAVRDSLVNIGPLKDFSYG-----------LRINAD---------ASATGISKQS-- 537
             F F V DSL+N+ P+     G           LR +AD          +ATG SK    
Sbjct: 451  KFIFEVCDSLINVAPINYMCAGERVEFEEDGATLRPHADNLNDLKIELVAATGHSKNGAL 510

Query: 538  -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                   N +++   EL GC  +WTV+  ++R   A ++R      + H ++++S  + T
Sbjct: 511  SVFVNCINPQIITSFELEGCLDVWTVFDDATR--KATTAR-----QDQHDFMLLSQRSST 563

Query: 588  MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
            +VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++    
Sbjct: 564  LVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 620

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                       S V+ VSIADPYV L + +G +  L    +  T  +       SS   V
Sbjct: 621  --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 672

Query: 708  SSCTLYHD------------------------------KGPEPWLRKTSTDAWLSTGVGE 737
             +   Y D                                 EP ++    +  L    G 
Sbjct: 673  VAIAAYKDLSGLFTCKADDVLNLTGSSGAGFANSFGGYMKAEPHMKVEDEEDLLYGDAGS 732

Query: 738  AI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDK 779
            A       D A        D +             VV  +SG LEI+ +P+   V+ V+ 
Sbjct: 733  AFKLNSMADLAKQSKQKNSDWWRRQLIQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVND 792

Query: 780  FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSR 838
              +G   + D    E +  S T+ NS +  G       ++ +S   +EL +     H  R
Sbjct: 793  IGNGALVLSDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGER 848

Query: 839  PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
            P L  + T   +L YQ + +                  +    +    L   + +   LD
Sbjct: 849  PLLL-VRTRLELLIYQVFRY------------AKGHLKIRFRKLEQLHLLEQQPTHIELD 895

Query: 899  AYTREETP----HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
                EE           Q++  F N+ G  G  + G  PC+  +  R  LR+H  L +G 
Sbjct: 896  GEDVEEAESYNMQAKYVQKLRYFANVGGLAGIMVCGVNPCFVFLTSRGELRIHRLLGNGD 955

Query: 954  IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAE 1013
            + +F   +NVN  HGF+Y  +   LKI  LPS  +YD  WPV+KV PL+ TP Q+ Y  E
Sbjct: 956  VRSFAAFNNVNIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKV-PLRCTPRQLVYHRE 1014

Query: 1014 KNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEP 1071
              +Y LI      +P+ +       D+E+  +              Y +   +E+ ++ P
Sbjct: 1015 NRVYCLITQKE--EPMTKYYRFNGEDKELSEECRGERF-------IYPIGSLFEMVLISP 1065

Query: 1072 DRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARG 1128
            +     W+    A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG
Sbjct: 1066 E----TWEIVPDASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRG 1121

Query: 1129 RVLLF 1133
             + ++
Sbjct: 1122 NIHIY 1126


>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
          Length = 1503

 Score =  311 bits (796), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 307/1203 (25%), Positives = 520/1203 (43%), Gaps = 216/1203 (17%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV   ANV+++Y  R+  +    ++      R   M       LE +  YRL GN+ SL
Sbjct: 42   SLVTGGANVLKVY--RIIPDADPATREKYSATRPPNM------KLECMASYRLFGNIMSL 93

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  G+    +RD+++++F DAK+SV++FD     L+  S+H FE  +   ++ G   
Sbjct: 94   QSVSLAGS----QRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 146

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGGF---SARI 230
                PLV+VDP  RC  +LVYG ++++L   K S      + D                I
Sbjct: 147  HYHIPLVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEMQDVKPIKKTPTLLIAKTPI 206

Query: 231  ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             +S++I L+DLD K  +V D  F+HGY EP ++IL+E   T+ GR++ +  TC + ALS+
Sbjct: 207  LASYIIELKDLDEKIDNVIDVQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 266

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
            +   + HP+IW+  +LP D  + + +  PIGG LV+  N++ Y +QS         Y VS
Sbjct: 267  NIQQRVHPVIWTVNSLPFDCLQAVPISKPIGGCLVMCVNSLIYLNQSVP------PYGVS 320

Query: 349  LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
            L+SS +        P+    + LDAA   +++++  +LS K G+L +LT+  D       
Sbjct: 321  LNSSADHSTNFPLKPQDGVRISLDAAQVCFIESEKLVLSLKGGELYVLTLCADS------ 374

Query: 402  DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
                         I        FLGSRLG+SLL++F     + +++    ++ G +E + 
Sbjct: 375  ----------MRSICVCETEYLFLGSRLGNSLLLRFREKDESLVITI---DDSGTVEKE- 420

Query: 462  PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
               KR R    +            +YGS    T     ++ F V DS++NIGP+   + G
Sbjct: 421  --QKRQRLEEEEL----------EVYGSGY-KTSVQLTSYIFEVCDSVLNIGPIAHMAVG 467

Query: 522  LRINAD-------------------ASATGISKQSNYELVE------------LPGCKGI 550
             RI  +                    +A+G  K     +++            L GC  +
Sbjct: 468  ERICEEEMEEGAEVQFVPNKLDVEVVTASGHGKNGALCVLQSSIKPQVITSFGLSGCLDV 527

Query: 551  WTVYHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
            WTV+ +++       +R    DD     HA++I+S E  TMVL+T + + E+ E+  +  
Sbjct: 528  WTVFDEAAGPGGVTGTRKP--DDAPPPNHAFMILSQEGATMVLQTGEEINEI-ENTGFAT 584

Query: 608  QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
               TI  GN+   R ++QV  +  R+L G+ + Q++                 + SVSI 
Sbjct: 585  DVPTIHVGNIGSNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVSIV 634

Query: 668  DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD------------ 715
            DPYV +  S+G +  L       T  +       S+  PV + + Y D            
Sbjct: 635  DPYVCVRSSEGRVITLALREGKGTPRLAVNKNTISASPPVIAISAYRDVSGMFTRKLEDS 694

Query: 716  ----KG---------------PEPWLRKTSTDAWLSTGVGEAID---GADGGPLDQG--- 750
                KG               PEP ++    +  L    G +      AD    D+G   
Sbjct: 695  FDVSKGGGATSAYSSGFGSMKPEPNMKIEDEEDLLYGESGRSFKVTSMADMALADKGGGN 754

Query: 751  -------------DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL- 796
                           + +   ++G LEI+ +P+    + +    +G   + D+     L 
Sbjct: 755  ADFWLKYMQQIKPTYWLLAARDNGNLEIYSMPDLKLAYLISNVGNGNKVLSDSMEFVPLP 814

Query: 797  --KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
              K   ++  ++S  G   G      S+   E+ M    ++ SRP LF I  +  +L Y+
Sbjct: 815  MAKPGTSQEEATSAFGASFGSGGVPVSLLPKEILMVALGSYGSRPILF-IRLEQDLLIYR 873

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR- 913
             + +       +     S+    +   V A RL NL   +    A T    P+G   Q  
Sbjct: 874  VFRYAKGHLKLRFKRLTSSVTCPAFRTVPA-RLANLP-DKPATGATTDATEPNGKDTQEH 931

Query: 914  -----------ITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLH 961
                       I  F N+SG+ G  + G +P +  +     LR H       + AF   +
Sbjct: 932  ATKVQYENISMIRYFGNVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFN 991

Query: 962  NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
            NVNC +GF+Y   Q  LKI  LP+  +YD+ WPV+K IPL+++P QI Y  E  +Y +++
Sbjct: 992  NVNCPNGFLYFDEQYQLKISILPTYLSYDSVWPVRK-IPLRSSPKQIVYHRENRVYCVVM 1050

Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE--------EYEVRILEPDR 1073
                             +E+ ++    N    +L      E        ++ V ++ P  
Sbjct: 1051 DA---------------EEICNKYYRFNGEDKELTEENKGERFLYPMGHQFSVVLVNP-- 1093

Query: 1074 AGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRV 1130
                W+      I ++  E+ ++++ V+L     +   +  +A+GT +   ED+ +RGR+
Sbjct: 1094 --AAWEIVPDTAIALEEWEHVVSLKNVSLAYEGARSGLKEYIAVGTNFNYSEDITSRGRL 1151

Query: 1131 LLF 1133
            LL+
Sbjct: 1152 LLY 1154


>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
 gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
          Length = 1420

 Score =  311 bits (796), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 314/1195 (26%), Positives = 505/1195 (42%), Gaps = 221/1195 (18%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    + ++  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVDAAQRQKLNPTEMRLAPKM------RLECLASYSLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  G      RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQSVSLAGG----MRDALLISFKDAKLSVLQLDADTQALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +      R  
Sbjct: 136  GRYHVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I L DLD K  +V D  F+HGY EP ++IL+E   T AGR+             
Sbjct: 196  IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                  K+  +  PIGG LV+  N I Y +QS         Y V
Sbjct: 243  ----------------------KVFPIQKPIGGCLVMTVNAIIYLNQSVP------PYGV 274

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+ +  + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 275  SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++    E   + +A
Sbjct: 335  NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDMENAVEQQA 394

Query: 460  DAPSTKRL----------RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSL 509
               +  +L            + S A    +  EEL +YGS +  +    + F F V DSL
Sbjct: 395  VEQAPPQLDEEQVYDVDQHEAPSQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSL 454

Query: 510  VNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------NYE 540
            +N+ P+     G R+  +                     +ATG SK           N +
Sbjct: 455  INVAPINYMCAGERVEFEEDGSTLRPHAESLNEVKIELVAATGHSKNGALSVFVNCINPQ 514

Query: 541  LV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
            ++   EL GC  +WTV+  ++R       +      E H ++++S  + T+VL+T   + 
Sbjct: 515  IITSFELDGCLDVWTVFDDATR-------KPTTARQEQHDFMLLSQRSSTLVLQTGQEIN 567

Query: 598  EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
            E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++              
Sbjct: 568  EI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----------DV 616

Query: 658  NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-- 715
             S V+ VSIADPYV L + +G +  L    +  T  +       SS   V +   Y D  
Sbjct: 617  GSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAIAAYKDLS 676

Query: 716  ----------------KGP------------EPWLRKTSTDAWLSTGVGEAI------DG 741
                             GP            EP ++    +  L    G A       D 
Sbjct: 677  GLFTCKADDVLNLTGSSGPGFVNSFGGYMKAEPHMKVEDEEDLLYGDAGNAFKLNSMADL 736

Query: 742  ADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
            A        D +             VV  +SG LEI+ +P+   V+ V+   +G   + D
Sbjct: 737  AKQSKQKNSDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGALVLND 796

Query: 790  TYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
                E +  S T+ NS +  G       ++ +S   +EL +     H  RP L  + T  
Sbjct: 797  AM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGERPLLL-VRTRL 851

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP-- 906
             +L YQ + +                  +    +    L + + +   LD    EE    
Sbjct: 852  ELLIYQVFRY------------AKGHLKIRFRKLEQLHLLDQQPTHIELDGDEAEEAESY 899

Query: 907  --HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNV 963
                   Q++  F N+ G  G  + G  P +  +  R  LR+H  L +  + +F   +NV
Sbjct: 900  NMQPKYVQKLRYFSNVGGLAGIMVCGMNPVFVFLTARGELRIHRLLGNADVRSFAAFNNV 959

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
            N  HGF+Y  +   LKI  LPS  +YD  WPV+KV PL+ TP Q+ Y  E  +Y LI   
Sbjct: 960  NIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKV-PLRCTPRQLVYHRENRVYCLITQK 1018

Query: 1024 PVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRAGGPWQT- 1080
               +P+ +       D+E+  +              Y +   +E+ ++ P+     W+  
Sbjct: 1019 E--EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSLFEMVLISPE----TWEIV 1065

Query: 1081 -RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
              A+I  +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++
Sbjct: 1066 PDASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 1120


>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
 gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
          Length = 1431

 Score =  309 bits (791), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 311/1208 (25%), Positives = 506/1208 (41%), Gaps = 236/1208 (19%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    + ++  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVDATQRQKLNPSEMRLAPKM------RLECLASYSLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  G      RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQSVSLAGG----MRDALLVSFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I L DLD K  +V D  F+HGY EP ++IL+E   T AGR+             
Sbjct: 196  IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                  K+  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 243  ----------------------KVFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+ +  + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 275  SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++            
Sbjct: 335  NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVESAATAAA 394

Query: 448  --------SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                      + +    ++ D         + S A    +  EEL +YGS +  +    +
Sbjct: 395  TGAGEQQQQAIDQSPPQMDEDQVYDVEQHEAPSQAKSRRIEDEELEVYGSGAKASVLQLR 454

Query: 500  TFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS-- 537
             F F V DSL+N+ P+     G R+  +                     +ATG SK    
Sbjct: 455  KFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLTDLKIELVAATGHSKNGAL 514

Query: 538  -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                   N +++   EL GC  +WTV+  ++R       + +    E H ++++S  + T
Sbjct: 515  SVFVNCINPQIITSFELDGCLDVWTVFDDATR-------KPSTARQEQHDFMLLSQRSST 567

Query: 588  MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
            +VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++    
Sbjct: 568  LVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 624

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                       S V+ VSIADPYV L + +G +  L    +  T  +       SS   V
Sbjct: 625  --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSAPAV 676

Query: 708  SSCTLYHD------------------------------KGPEPWLRKTSTDAWLSTGVGE 737
             +   Y D                                 EP ++    +  L    G 
Sbjct: 677  VAIAAYKDLSGLFTCKADDVLNLTGSTGAGFANSFGGYMKAEPHMKVEDEEDLLYGDAGN 736

Query: 738  AI------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDK 779
            A       D A        D +             VV  +SG LEI+ +P+   V+ V+ 
Sbjct: 737  AFKLNSMADLAKQSKQKNTDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVND 796

Query: 780  FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSR 838
              +G   + D    E +  S T+ NS +  G       ++ +S   +EL++     H  R
Sbjct: 797  VGNGALVLTDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELSLVGLGQHGDR 852

Query: 839  PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
            P L  + T   +L YQ + +                  L +      +L  L    T ++
Sbjct: 853  PLLL-VRTRLELLIYQVFRY--------------AKGHLKIRFRKLEQLHLLDQQPTHIE 897

Query: 899  AYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLC 950
                EET             Q++  F N+ G  G  + G  PC+  +  R  LR+H  L 
Sbjct: 898  LINEEETDEAESYNMQPKYVQKLRYFNNVGGLAGIMVCGVNPCFIFLTARGELRIHRLLG 957

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
            +  + +F   +NVN  HGF+Y  +   LKI  LP+  +YD  WPV+KV PL+ TP Q+ Y
Sbjct: 958  NAEVRSFAAFNNVNIPHGFLYFDTTYELKISVLPTYLSYDAAWPVRKV-PLRCTPRQLVY 1016

Query: 1011 FAEKNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRI 1068
              E  +Y LI      +P+ +       D+E+  +              Y +   +E+ +
Sbjct: 1017 HRENRVYCLITQKE--EPMTKYYRFNGEDKELSEESRGERF-------IYPIGSLFEMVL 1067

Query: 1069 LEPDRAGGPWQT--RATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVA 1125
            + P+     W+    A+I  +  E+    ++V L +  T    +  L IGT +   ED+ 
Sbjct: 1068 ISPE----TWEIVPDASIQFEPWEHVTAFKLVKLSYEGTRSGLKEYLCIGTNFNYSEDIT 1123

Query: 1126 ARGRVLLF 1133
            +RG + ++
Sbjct: 1124 SRGNIHIY 1131


>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
          Length = 1439

 Score =  308 bits (789), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 219/671 (32%), Positives = 345/671 (51%), Gaps = 77/671 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429

Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
            +    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G       
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485

Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
                    L I        + + + + K    ++V   ELPGC  +WTV          
Sbjct: 486 EFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545

Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
             K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ 
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654

Query: 674 GMSDGSIRLLV 684
             ++G + + +
Sbjct: 655 MSAEGHVTMFL 665



 Score =  140 bits (353), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 181/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 780  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 836  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 886  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 946  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1006 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1057

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1058 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1113

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1114 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1164


>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
           norvegicus]
 gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
           isoform CRA_a [Rattus norvegicus]
          Length = 1386

 Score =  308 bits (789), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 219/669 (32%), Positives = 345/669 (51%), Gaps = 75/669 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429

Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
            +    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G       
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485

Query: 522 -------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH---------- 555
                  L I        + + + + K    ++V   ELPGC  +WTV            
Sbjct: 486 ENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 545

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
           K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ AG
Sbjct: 546 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAG 604

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           N+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++  
Sbjct: 605 NIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMS 654

Query: 676 SDGSIRLLV 684
           ++G + + +
Sbjct: 655 AEGHVTMFL 663



 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 176/386 (45%), Gaps = 30/386 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 778  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 833

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 834  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 883

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 884  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 943

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 944  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1003

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1004 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1055

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1056 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1111

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFS 1134
            +  +A GT  +QGE+V  RGR+ L+S
Sbjct: 1112 KGYVAAGTCLMQGEEVTCRGRIFLWS 1137


>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
           musculus]
          Length = 1388

 Score =  307 bits (787), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 219/671 (32%), Positives = 344/671 (51%), Gaps = 77/671 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
                    L I        + + + + K    ++V   ELPGC  +WTV          
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545

Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
             K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ 
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654

Query: 674 GMSDGSIRLLV 684
             ++G + + +
Sbjct: 655 MSAEGHVTMFL 665



 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 176/386 (45%), Gaps = 30/386 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 780  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 836  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 886  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 945

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 946  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1006 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1057

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1058 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1113

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFS 1134
            +  +A GT  +QGE+V  RGR+ L+S
Sbjct: 1114 KGYVAAGTCLMQGEEVTCRGRIFLWS 1139


>gi|74212803|dbj|BAE33365.1| unnamed protein product [Mus musculus]
          Length = 741

 Score =  307 bits (787), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667


>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
           [Mus musculus]
          Length = 1450

 Score =  307 bits (787), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667



 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 181/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1008 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1059

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1060 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1115

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1116 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1166


>gi|307107849|gb|EFN56091.1| hypothetical protein CHLNCDRAFT_145620 [Chlorella variabilis]
          Length = 1626

 Score =  307 bits (786), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 352/1386 (25%), Positives = 535/1386 (38%), Gaps = 346/1386 (24%)

Query: 4    AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
            A    +H PT + +C + ++TH++          Q               P+P+LVV  +
Sbjct: 7    AVCTQVHPPTAVTHCTAAWLTHAQRQ--------QGSGSADGDDGGGSGDPLPDLVVVRS 58

Query: 64   NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
              +E+Y VR  E G   + ++             A SL+ +   RL G  ES+A+L +G 
Sbjct: 59   TQLELYSVRGSEAGGPATTHT-------------AQSLDQLASCRLFGVAESVAVL-RGR 104

Query: 124  ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
            A    +RD ++L F DAK+SVL +D   H L  +S+H FE      LK GR  F   PL 
Sbjct: 105  APG--QRDVLLLTFRDAKLSVLHWDAGRHELAPSSLHYFEGDA--SLKLGRTVFPYPPLA 160

Query: 184  KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG-------------------- 223
              DP GRCG V+++  Q+ +L A         D + FG G                    
Sbjct: 161  VTDPLGRCGAVIIFRHQLAVLPAV--------DSELFGLGLSAAEEDEEEAAATAALGLA 212

Query: 224  --------------------------GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIE 257
                                         +A + +S+V NL    +K V+D  F+HGY E
Sbjct: 213  PPDGGGAADGEAGAPRGGAAAAAAGLPAAAAAVGNSYVDNLGKAGIKEVRDACFLHGYSE 272

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            PV+++LHE E TWAG +  K  TC+++ALS++ T K HP IW A  LP DAY+L A  +P
Sbjct: 273  PVLMVLHEAEPTWAGNLRQKKDTCVLTALSLNLTRKHHPKIWGAQELPSDAYRLSA--AP 330

Query: 318  IGGVLVVGANTIHYHSQSASCALALN---------------------------------- 343
             GGVLV+  + + ++ Q     + L+                                  
Sbjct: 331  CGGVLVLCQHLVLHYRQGQQSGVVLHPSALPPAAAPPPLLFDPQAMAEAGGPGPASAAYA 390

Query: 344  -NYAVSL------------DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
              +AV +            D+SQ    ++  V  D A   WL  + ALL  ++G L+ L 
Sbjct: 391  RQHAVDVHPETVPAAVRFCDASQA---AALKVTADGASVCWLSPESALLCLRSGQLLQLA 447

Query: 391  VVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS----------------------LFFLG 426
            ++    G   + L +++   +   S   ++  +                      L FLG
Sbjct: 448  LLPQQAGGSARHLAVARAGAAPHPSCCCSLSGAHRAPHMPGSAAAAAAGQAPQPALVFLG 507

Query: 427  SRLGDSLLVQFT----CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--- 479
            S  GDSLLV+ T     G+     ++       D  AD P++KRLR    +         
Sbjct: 508  SAAGDSLLVRATPAAAAGTKRPAEAATGAAGEEDGTADEPASKRLRLEGIEVGSAAAAVE 567

Query: 480  --------------------------------NGEELSLYGSA--------------SNN 493
                                              EE  +YG+A              +  
Sbjct: 568  ATAAAAAAAQGAAAAAAEARAAAGGGPAGSDSEDEEALIYGTALYSSAAGVAPAAAAAVP 627

Query: 494  TESAQ-KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWT 552
            T S Q + +   V DSL NIGPL+DF+        A A G +           G  G  T
Sbjct: 628  TPSWQLQRYQLKVLDSLANIGPLRDFAVA---EPAAGAGGEAVPPALVGCSGEGKGGTLT 684

Query: 553  VYHKS----------------SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
            V  +S                  G    +   A  +  +HAYL++S +  T VL T + L
Sbjct: 685  VLRRSVVPDVITEHRGAASASGGGSGQAAGEAAGQEGGHHAYLLLSFQGATKVLATGEEL 744

Query: 597  TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD-----LSFGPSNSE 651
             EVTESV++ V   T+AAG++   RR+ Q F +G R+LDG    QD     L+   + + 
Sbjct: 745  REVTESVEFAVDTPTLAAGSVCCGRRIAQAFPQGLRLLDGEESVQDVWASELAAPAAAAA 804

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQT--------------- 696
            +G       ++S  + DPYVLL ++DG+ R L  DP  C +S  +               
Sbjct: 805  AGGAPGGGAIVSADMCDPYVLLYLADGTARFLTADPVACRLSAASAAGAGPEAAAAAEAA 864

Query: 697  -----PAAIESSKKPVSSCTLYHD-------KGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
                 P A E     +++C+L+ D       + P+   +            G     A  
Sbjct: 865  EAALRPVAAEER---ITACSLFADSCGWLAARLPQTQQQTQQQQQQQGQQDGGTTAQAAA 921

Query: 745  GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
                 G +Y+VVC  SGA +++ +P +  VF+    ++G   ++ T    A   +     
Sbjct: 922  SGGGCGAVYAVVCRASGACQLYALPAWQPVFSSSTSLAGGPALL-TGSGGAGGVAAAAAA 980

Query: 805  SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-----------------PFLFAILTD 847
            +++         E     +VVE+ +  +    +                  P L A+  D
Sbjct: 981  AAAAAAAAGVEDEMDGPGEVVEVRLVSFGPAAAGRRDAAAARASPAPACEPPLLLALTAD 1040

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-----NLRFSRTPLDAYTR 902
              +L YQA+        ++      T R   +       L       LR  R        
Sbjct: 1041 HQLLAYQAFSASPGSGGTRGSSGSGTPRFRRLRLDLPPLLPPAGGPQLRLRRLHCFEGLG 1100

Query: 903  EETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD---------GS 953
            EE P                + G F++G  P W +  R  L  HP               
Sbjct: 1101 EEAP----------------YSGVFVAGQHPHWLVASRGGLLPHPHFLPQPAGPGAAAVG 1144

Query: 954  IVAFTVLHNVNCNHGFIYVTS--QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
               FT  HNVNC HGFI  TS  +  ++I QLP  +  D  WP Q+V  +K TP ++ ++
Sbjct: 1145 AAGFTPFHNVNCPHGFIVATSGARSGIQISQLPPRTRLDAPWPRQRV-SIKGTPLKVAHY 1203

Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP 1071
            AE +++              VLS    +  G  +               +E +EVR + P
Sbjct: 1204 AEADMF-------------AVLSSRQGRARGRGV---------------MEGHEVRWVWP 1235

Query: 1072 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVL 1131
               GG WQ       +  E AL+V  V L +  T     LLA+G A   GED    GR+L
Sbjct: 1236 ---GGGWQGVGRHQRRPGERALSVGAVRLKDHATGATVPLLAVGAALPAGEDYPCGGRLL 1292

Query: 1132 LFSTGR 1137
            LF   R
Sbjct: 1293 LFEVTR 1298


>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
           [Mus musculus]
 gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 1; AltName: Full=Cleavage and polyadenylation
           specificity factor 160 kDa subunit; Short=CPSF 160 kDa
           subunit
 gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
 gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
          Length = 1441

 Score =  307 bits (786), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667



 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 181/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1008 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1059

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1060 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1115

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1116 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1166


>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
           musculus]
          Length = 1417

 Score =  306 bits (784), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 56  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 108

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 109 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 161

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 162 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 218

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 219 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 278

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 279 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 338

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 339 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 398

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 399 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 456

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 457 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 512

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 513 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 572

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 573 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 631

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 632 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 681

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 682 VIMSAEGHVTMFL 694



 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 176/386 (45%), Gaps = 30/386 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 809  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 864

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 865  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 914

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 915  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 974

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 975  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1034

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1035 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1086

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1087 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1142

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFS 1134
            +  +A GT  +QGE+V  RGR+ L+S
Sbjct: 1143 KGYVAAGTCLMQGEEVTCRGRIFLWS 1168


>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Monodelphis domestica]
          Length = 1449

 Score =  305 bits (782), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 224/687 (32%), Positives = 354/687 (51%), Gaps = 99/687 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E S +S  + E K    +       LELV  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
              K HP+IWS  NLP D  + LAVP PIGGV++   N++ Y +QS         Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303

Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
           +S    +   P   +    + LD A A ++  D  ++S K G++ +LT++ DG R V+  
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
              K   SVLT+ + T+     FLGSRLG+SLL+++T        S+  +    ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAAREAPSREVSDKD 423

Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
            P  K+ R  S+        A QD V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479

Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
           GP  + + G                L I        + + + + K    ++V   ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539

Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
             +WTV         ++++G  A+   SS     D + H +LI+S E  TM+L+T   + 
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGAEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
           E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +        
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
            S ++  ++ADPYV++  ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675



 Score =  143 bits (360), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 113/428 (26%), Positives = 190/428 (44%), Gaps = 64/428 (14%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V+  ESGA+EI+ +P++  VF V  F  G+  +VD+    +     T+ ++  EE T 
Sbjct: 790  WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L  ++     +RP+L  +  D  +L Y+A+  +             
Sbjct: 846  QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
                   S +  S L+ +RF + P +   RE+ P  +                    R  
Sbjct: 887  -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGTEEGAGARGRVARFR 938

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 939  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
            QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S          L 
Sbjct: 999  QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TNALC 1050

Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
              I +  G + +   +   + +     E + ++++ P      W+    A I ++  E+ 
Sbjct: 1051 TRIPRMTGEEKEFETIERDERYIHPLQEAFSIQLISPVS----WEAIPNARIELEEWEHV 1106

Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSY 1151
              ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +    +
Sbjct: 1107 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1166

Query: 1152 GPLFSSVQ 1159
              L+   Q
Sbjct: 1167 KVLYEKEQ 1174


>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Pan paniscus]
 gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
           troglodytes]
          Length = 1442

 Score =  305 bits (781), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                 +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668



 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 105/413 (25%), Positives = 181/413 (43%), Gaps = 34/413 (8%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 783  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 839  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 889  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 949  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVS--VPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
             WPV+K IPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   
Sbjct: 1009 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 1058

Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
            +   + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T  
Sbjct: 1059 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1114

Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
              +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1115 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1167


>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Pan troglodytes]
          Length = 1296

 Score =  305 bits (781), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNEALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                 +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668



 Score =  109 bits (272), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 85/307 (27%), Positives = 133/307 (43%), Gaps = 24/307 (7%)

Query: 860  GPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA-PCQRITIFK 918
            G E +   DD        S S  S S+    R S+ P D   R+  P  A P     + +
Sbjct: 732  GSETSPTVDDEEEMLYGDSGSLFSPSKEEARRSSQPPAD---RDPAPFRAEPTHWCLLVR 788

Query: 919  NISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
                    F+ G  P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG 
Sbjct: 789  ENGTMXXXFICGPSPPWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGE 848

Query: 978  LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS--VPVLKPLNQVLSL 1035
            L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S   P  +        
Sbjct: 849  LRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR-------- 899

Query: 1036 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENAL 1093
             I +  G + +   +   + +     E + ++++ P      W+    A I +Q  E+  
Sbjct: 900  -IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVT 954

Query: 1094 TVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYG 1152
             ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +    + 
Sbjct: 955  CMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFK 1014

Query: 1153 PLFSSVQ 1159
             L+   Q
Sbjct: 1015 VLYEKEQ 1021


>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Sarcophilus harrisii]
          Length = 1449

 Score =  305 bits (780), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 224/687 (32%), Positives = 353/687 (51%), Gaps = 99/687 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E S +S  + E K    +       LELV  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
              K HP+IWS  NLP D  + LAVP PIGGV++   N++ Y +QS         Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303

Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
           +S    +   P   +    + LD A A ++  D  ++S K G++ +LT++ DG R V+  
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
              K   SVLT+ + T+     FLGSRLG+SLL+++T        SS  +    ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSAREAPSREVSDKD 423

Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
            P  K+ R  S+        A QD V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479

Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
           GP  + + G                L I        + + + + K    ++V   ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539

Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
             +WTV         ++++G   +   SS     D + H +LI+S E  TM+L+T   + 
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGPEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
           E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +        
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
            S ++  ++ADPYV++  ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675



 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 114/428 (26%), Positives = 190/428 (44%), Gaps = 64/428 (14%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V+  ESGA+EI+ +P++  VF V  F  G+  +VD+    +     T+ ++  EE T 
Sbjct: 790  WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L  ++     +RP+L  +  D  +L Y+A+  +             
Sbjct: 846  QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
                   S +  S L+ +RF + P +   RE+ P  +                    R  
Sbjct: 887  -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGAEEGAGARGRVARFR 938

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 939  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
            QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S          L 
Sbjct: 999  QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TNALC 1050

Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
              I +  G + +   +   D +     E + ++++ P      W+    A I ++  E+ 
Sbjct: 1051 TRIPRMTGEEKEFETIERDDRYIHPLQEAFSIQLISPVS----WEAIPNARIELEEWEHV 1106

Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSY 1151
              ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +    +
Sbjct: 1107 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1166

Query: 1152 GPLFSSVQ 1159
              L+   Q
Sbjct: 1167 KVLYEKEQ 1174


>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
           mulatta]
          Length = 1428

 Score =  305 bits (780), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 219/677 (32%), Positives = 347/677 (51%), Gaps = 86/677 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T           QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                 +    G   ++    A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 543 RKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668



 Score =  138 bits (348), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 105/411 (25%), Positives = 180/411 (43%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 783  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 839  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 889  VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 949  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S              I +  G + +   + 
Sbjct: 1009 PWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TNTPCARIPRMTGEEKEFETIE 1060

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T    
Sbjct: 1061 RDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGL 1116

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1117 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1167


>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like isoform 1 [Equus caballus]
          Length = 1444

 Score =  304 bits (779), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 219/675 (32%), Positives = 350/675 (51%), Gaps = 80/675 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           + + T+     FLGSRLG+SLL+++T        +S ++E     E + P +K+ R  S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430

Query: 473 -------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
                     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G   
Sbjct: 431 VGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486

Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
                        L I        + + + + K    ++V   ELPGC  +WTV      
Sbjct: 487 FLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 546

Query: 555 --HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
              ++ +G   +   S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG
Sbjct: 547 EQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655

Query: 670 YVLLGMSDGSIRLLV 684
           YV++  ++G + + +
Sbjct: 656 YVVIMSAEGHVTMFL 670



 Score =  139 bits (350), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 181/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 785  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 841  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 891  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 950

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 951  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1010

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1011 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIE 1062

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1063 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1118

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1119 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1169


>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
           sapiens]
 gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 1; AltName: Full=Cleavage and polyadenylation
           specificity factor 160 kDa subunit; Short=CPSF 160 kDa
           subunit
 gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo
           sapiens]
 gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_c [Homo sapiens]
 gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
           construct]
 gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
           construct]
 gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
           construct]
          Length = 1443

 Score =  304 bits (779), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                 +    G   + S     DD+   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669



 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 105/413 (25%), Positives = 181/413 (43%), Gaps = 34/413 (8%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 784  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 840  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 890  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 950  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVS--VPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
             WPV+K IPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   
Sbjct: 1010 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 1059

Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
            +   + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T  
Sbjct: 1060 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1115

Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
              +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1116 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1168


>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
          Length = 1442

 Score =  304 bits (779), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                 +    G   + S     DD+   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669



 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 105/413 (25%), Positives = 181/413 (43%), Gaps = 34/413 (8%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 784  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 840  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 890  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 950  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVS--VPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
             WPV+K IPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   
Sbjct: 1010 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 1059

Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
            +   + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T  
Sbjct: 1060 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1115

Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
              +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1116 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1168


>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
           taurus]
 gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 1; AltName: Full=Cleavage and polyadenylation
           specificity factor 160 kDa subunit; Short=CPSF 160 kDa
           subunit
 gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit
           [Bos taurus]
 gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
           taurus]
          Length = 1444

 Score =  304 bits (778), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 221/678 (32%), Positives = 345/678 (50%), Gaps = 86/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T  +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 309

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 369

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  D E      KR+
Sbjct: 370 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRV 427

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 428 DATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 484 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 543

Query: 557 SSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
             +         G   +     A DD   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 544 VRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFA 602

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++
Sbjct: 603 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAV 652

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 653 ADPYVVIMSAEGHVTMFL 670



 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 183/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 785  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 841  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 891  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 950

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 951  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1010

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +V  +      G + +   + 
Sbjct: 1011 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIE 1062

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              + +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1063 RDERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1118

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1119 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1169


>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
           mulatta]
 gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
           mulatta]
 gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
           mulatta]
          Length = 1442

 Score =  303 bits (777), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 219/677 (32%), Positives = 347/677 (51%), Gaps = 86/677 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T           QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                 +    G   ++    A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 543 RKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668



 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 106/413 (25%), Positives = 182/413 (44%), Gaps = 34/413 (8%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 783  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 839  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 889  VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 949  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVS--VPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
             WPV+K IPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   
Sbjct: 1009 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 1058

Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
            +   + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T  
Sbjct: 1059 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1114

Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
              +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1115 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1167


>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
           [Heterocephalus glaber]
          Length = 1440

 Score =  303 bits (776), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 220/678 (32%), Positives = 347/678 (51%), Gaps = 89/678 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEGLTKNDKTTEGKSHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 424

Query: 468 RRSSSDA-----LQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             ++  A      QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 425 DSAAGWAGNKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 480

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH- 555
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 481 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 540

Query: 556 --------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                     + G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 541 VRKEEEETPKAEGSEQEPSAPEAQDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 599

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 600 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 649

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 650 ADPYVVIMSAEGHVTMFL 667



 Score =  139 bits (350), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 174/381 (45%), Gaps = 29/381 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SSGQPTTQGEARKEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
            P W +V    LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD  
Sbjct: 948  PHWLLVTGRGLRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAP 1007

Query: 993  WPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1052
            WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +  
Sbjct: 1008 WPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIER 1059

Query: 1053 VDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-E 1109
             D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +
Sbjct: 1060 DDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLK 1115

Query: 1110 TLLAIGTAYVQGEDVAARGRV 1130
              +A GT  +QGE+V  RGRV
Sbjct: 1116 GYVAAGTCLMQGEEVTCRGRV 1136


>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 2 [Cricetulus griseus]
          Length = 1388

 Score =  303 bits (775), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 77/671 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  ++ 
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429

Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------HK 556
                    L I        + + + + K    ++V   ELPGC  +WTV         +
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545

Query: 557 SSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
           + R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ 
Sbjct: 546 APRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654

Query: 674 GMSDGSIRLLV 684
             ++G + + +
Sbjct: 655 MSAEGHVTMFL 665



 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 176/386 (45%), Gaps = 30/386 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 780  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 836  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 886  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 946  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1006 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1057

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1058 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1113

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFS 1134
            +  +A GT  +QGE+V  RGR+ L+S
Sbjct: 1114 KGYVAAGTCLMQGEEVTCRGRIFLWS 1139


>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 3 [Cricetulus griseus]
          Length = 1449

 Score =  302 bits (774), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  ++ 
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429

Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
            ++ R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667



 Score =  140 bits (352), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 181/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1008 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1059

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1060 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1115

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1116 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1166


>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Equus caballus]
          Length = 1450

 Score =  302 bits (774), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 219/681 (32%), Positives = 350/681 (51%), Gaps = 86/681 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           + + T+     FLGSRLG+SLL+++T        +S ++E     E + P +K+ R  S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430

Query: 473 -------------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
                           QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + 
Sbjct: 431 VGWSGSPRAAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANA 486

Query: 519 SYG----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
           + G                L I        + + + + K    ++V   ELPGC  +WTV
Sbjct: 487 AMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 546

Query: 554 Y-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
                    ++ +G   +   S+  A  D   H +LI+S E  TM+L+T   + E+  S 
Sbjct: 547 IAPVRKEQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS- 605

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
            +  QG T+ AGN+   R ++QV   G R+L+G      L F P +         S ++ 
Sbjct: 606 GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQ 655

Query: 664 VSIADPYVLLGMSDGSIRLLV 684
            ++ADPYV++  ++G + + +
Sbjct: 656 CAVADPYVVIMSAEGHVTMFL 676



 Score =  139 bits (350), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 181/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 791  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 846

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 847  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 896

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 897  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 956

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 957  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1016

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1017 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIE 1068

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1069 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1124

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1125 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1175


>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 1 [Cricetulus griseus]
          Length = 1441

 Score =  302 bits (773), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  ++ 
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429

Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
            ++ R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667



 Score =  140 bits (353), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 181/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1008 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1059

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1060 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1115

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1116 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1166


>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Canis lupus familiaris]
          Length = 1460

 Score =  301 bits (770), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 212/630 (33%), Positives = 331/630 (52%), Gaps = 72/630 (11%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 84  KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 139

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L   +     + +E  
Sbjct: 140 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHE 193

Query: 220 FGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
              G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +
Sbjct: 194 GLMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVR 253

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
             TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS  
Sbjct: 254 QDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP 313

Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG- 395
              +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ DG 
Sbjct: 314 PYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGM 373

Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
           R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  
Sbjct: 374 RSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAA--REAA 431

Query: 456 DIEADAPSTKRLRRSS-----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSL 509
           D E      KR+  ++         QD V+  E+ +YGS A + T+ A  T+SF V DS+
Sbjct: 432 DKEEPPSKKKRVDCAAGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSI 487

Query: 510 VNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---EL 544
           +NIGP  + + G                L I        + + + + K    ++V   EL
Sbjct: 488 LNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFEL 547

Query: 545 PGCKGIWTVY-------HKSSRGHNA--DSSRMAAYDD-EYHAYLIISLEARTMVLETAD 594
           PGC  +WTV         ++S+G  A  +SS + A DD   H +LI+S E  TM+L+T  
Sbjct: 548 PGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQTGQ 607

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
            + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +     
Sbjct: 608 EIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL---- 659

Query: 655 GSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
               S ++  ++ADPYV++  ++G + + +
Sbjct: 660 ---GSPIVQCAVADPYVVIMSAEGHVTMFL 686



 Score =  141 bits (356), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 182/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 801  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 856

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 857  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 906

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 907  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 966

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 967  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1026

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S  +  P  +     I +  G + +   + 
Sbjct: 1027 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----IPRMTGEEKEFETIE 1078

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1079 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1134

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1135 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1185


>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Felis catus]
          Length = 1432

 Score =  300 bits (767), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 211/632 (33%), Positives = 334/632 (52%), Gaps = 76/632 (12%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 56  KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 111

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L   +     + +E  
Sbjct: 112 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHE 165

Query: 220 FGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
              G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +
Sbjct: 166 GLMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVR 225

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
             TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS  
Sbjct: 226 QDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP 285

Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG- 395
              +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ DG 
Sbjct: 286 PYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGM 345

Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
           R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        +S ++E   
Sbjct: 346 RSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA-- 402

Query: 456 DIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
             + + P +K+ R  S+          QD V+  E+ +YGS A + T+ A  T+SF V D
Sbjct: 403 -ADKEEPPSKKKRVDSTVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCD 457

Query: 508 SLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV--- 542
           S++NIGP  + + G                L I        + + + + K    ++V   
Sbjct: 458 SILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTF 517

Query: 543 ELPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLET 592
           ELPGC  +WTV         ++S+G  A+   S + A DD   H +LI+S E  TM+L+T
Sbjct: 518 ELPGCYDMWTVIAPVRKEQEETSKGEGAEQEPSTLEAEDDGRRHGFLILSREDSTMILQT 577

Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
              + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +   
Sbjct: 578 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-- 631

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                 S ++  ++ADPYV++  ++G + + +
Sbjct: 632 -----GSPIVQCAVADPYVVIMSAEGHVTMFL 658



 Score =  143 bits (361), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 115/428 (26%), Positives = 190/428 (44%), Gaps = 64/428 (14%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G LEI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 773  WCLLVRENGTLEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 828

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 829  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 871

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
                L   N+       +RF + P +   RE               E   GA  +  R  
Sbjct: 872  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 921

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 922  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 981

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
            QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S  +  P  +   
Sbjct: 982  QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNM--PCTR--- 1035

Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
              I +  G + +   +   D +     E + ++++ P      W+    A I ++  E+ 
Sbjct: 1036 --IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHV 1089

Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSY 1151
              ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +    +
Sbjct: 1090 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1149

Query: 1152 GPLFSSVQ 1159
              L+   Q
Sbjct: 1150 KVLYEKEQ 1157


>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex
           subunit cft1 cpsf subunit [Desmodus rotundus]
          Length = 1444

 Score =  299 bits (765), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 216/675 (32%), Positives = 346/675 (51%), Gaps = 80/675 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRSTEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTSGTTAFP 314

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +      LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 315 LRTQEGVRTTLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAAASVLT 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR---- 468
           S + T+     FLGSRLG+SLL+++T        +S ++E     + + PS+K+ R    
Sbjct: 375 SSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEAPASAVREA---ADKEEPSSKKKRVDPT 430

Query: 469 ---RSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
                     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G   
Sbjct: 431 VGWSGGQSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486

Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
                        L I        + + + + K    ++V   ELPGC  +WTV      
Sbjct: 487 FLSEEFQNSPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVVAPVRK 546

Query: 555 --HKSSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
              ++ +G   +   +      D   H +LI+S E  TM+L+T   + E+  S  +  QG
Sbjct: 547 EQEETPKGEGTEQEPITPETEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655

Query: 670 YVLLGMSDGSIRLLV 684
            V++  ++G + + +
Sbjct: 656 CVVIMSAEGHVAMFL 670



 Score =  138 bits (348), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 111/428 (25%), Positives = 187/428 (43%), Gaps = 64/428 (14%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 785  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+  +             
Sbjct: 841  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAFAHD------------- 881

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
                   S +    L+ +RF + P +   RE+ P  +                    R  
Sbjct: 882  -------SQLGQGNLK-VRFKKVPHNINFREKKPKPSKKKADGGGAEEGAGARGRVARFR 933

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 934  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 993

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
            QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S     P  +   
Sbjct: 994  QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR--- 1047

Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
              I +  G + +   +   D +     E + ++++ P      W+    A I ++  E+ 
Sbjct: 1048 --IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHV 1101

Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSY 1151
              ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +    +
Sbjct: 1102 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1161

Query: 1152 GPLFSSVQ 1159
              L+   Q
Sbjct: 1162 KVLYEKEQ 1169


>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Apis florea]
          Length = 1583

 Score =  299 bits (765), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 210/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV  AN+I ++ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30  LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82  AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
              P+V+VDP+GRC  +L+YG ++++L   +  S   GD             I SS++I 
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194

Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           P+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  E 
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308

Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
                  P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K  
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKR 466
            SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   E    + E +    K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIVLEENETEETPAKK 428

Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +++       +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S 
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486

Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
           G           N D      + +G  K     ++            ELPGC+ +WTV  
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
               G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           NL   R ++QV + G R+L G    Q +                 ++  S ADPYV L  
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650

Query: 676 SDGSIRLL 683
            DG + LL
Sbjct: 651 EDGQVMLL 658



 Score =  122 bits (305), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 108/422 (25%), Positives = 175/422 (41%), Gaps = 71/422 (16%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            +V  +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E      
Sbjct: 772  LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                   M+V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 826  -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868

Query: 875  RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
                       R + L     P  L    R+E        R   +  F NI+G+ G F+ 
Sbjct: 869  -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917

Query: 930  GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
               P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 918  SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977

Query: 989  YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1048
            YD  WPV+KV PL+ TPH +TY  E   Y +I S+   +PL                  +
Sbjct: 978  YDAPWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------Y 1019

Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVV 1098
              +  D  + +T EE   R + P +        +   W+T     I +   E+   ++ V
Sbjct: 1020 RFNGED--KEFTEEERPDRFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNV 1077

Query: 1099 TLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSS 1157
            +L    T+   +  + +GT Y  GED+ +RGR+L+F        P   +    +  +++ 
Sbjct: 1078 SLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAK 1137

Query: 1158 VQ 1159
             Q
Sbjct: 1138 EQ 1139


>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Otolemur garnettii]
          Length = 1441

 Score =  299 bits (765), Expect = 8e-78,   Method: Compositional matrix adjust.
 Identities = 217/678 (32%), Positives = 343/678 (50%), Gaps = 89/678 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEVLTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASATRESADKEEPPSKKKRV 424

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             S          QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 425 DPSVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-- 554
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 481 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAS 540

Query: 555 -----HKSSRGHNADSSR---MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                 ++ +G   +          D   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 541 VRKEEEETPKGEGTEQESGVPEGEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 599

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 600 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 649

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 650 ADPYVVIMSAEGHVTMFL 667



 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 114/428 (26%), Positives = 189/428 (44%), Gaps = 64/428 (14%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 880

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
                L   N+       +RF + P +   RE              T  GA  +    R  
Sbjct: 881  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGVRGRVARFR 930

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 931  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 990

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
            QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S     P  +   
Sbjct: 991  QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR--- 1044

Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
              I +  G + +   +   D +     E + ++++ P      W+    A I ++  E+ 
Sbjct: 1045 --IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHV 1098

Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSY 1151
              ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +    +
Sbjct: 1099 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1158

Query: 1152 GPLFSSVQ 1159
              L+   Q
Sbjct: 1159 KVLYEKEQ 1166


>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1
           [Cricetulus griseus]
          Length = 1419

 Score =  298 bits (764), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 82/671 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA--DAPSTKRLRRS 470
           + + T+     FLGSRLG+SLL+++T        SS ++E      A  + P +K+ R  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS-VREAADKASAHNEEPPSKKKRVD 430

Query: 471 SSDAL-------QDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
            +          QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 431 PTAGWTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGE 486

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 487 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 546

Query: 555 ----HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                ++ R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 547 RKEEEEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 605

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 606 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 655

Query: 668 DPYVLLGMSDG 678
           DPYV++  ++G
Sbjct: 656 DPYVVIMSAEG 666



 Score =  140 bits (353), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 181/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 760  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 815

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 816  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 865

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 866  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 925

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 926  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 985

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 986  PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 1037

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1038 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1093

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1094 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1144


>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1-like [Ailuropoda
           melanoleuca]
          Length = 1469

 Score =  297 bits (761), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 210/632 (33%), Positives = 333/632 (52%), Gaps = 76/632 (12%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 103 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 158

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L   +     + +E  
Sbjct: 159 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHE 212

Query: 220 FGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
              G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +
Sbjct: 213 GLMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVR 272

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
             TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS  
Sbjct: 273 QDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP 332

Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG- 395
              +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ DG 
Sbjct: 333 PYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGM 392

Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
           R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        +S ++E   
Sbjct: 393 RSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA-- 449

Query: 456 DIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
             + + P +K+ R  S+          QD V+  E+ +YGS A + T+ A  T+SF V D
Sbjct: 450 -ADKEEPPSKKKRVDSTVGWSGGKSMPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCD 504

Query: 508 SLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV--- 542
           S++NIGP  + + G                L I        + + + + K    ++V   
Sbjct: 505 SILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTF 564

Query: 543 ELPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLET 592
           ELPGC  +WTV         ++ +G  A+   S + A DD   H +LI+S E  TM+L+T
Sbjct: 565 ELPGCYDMWTVIAPVRKEQEETPKGEGAEQEPSALEADDDGRRHGFLILSREDSTMILQT 624

Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
              + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +   
Sbjct: 625 GQEIMELDTS-GFATQGPTVFAGNIGDSRYIVQVSPLGIRLLEG---VNQLHFIPVDL-- 678

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                 S ++  ++ADPYV++  ++G + + +
Sbjct: 679 -----GSPIVQCAVADPYVVIMSAEGHVTMFL 705



 Score =  134 bits (338), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 114/435 (26%), Positives = 190/435 (43%), Gaps = 71/435 (16%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 820  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 875

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 876  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 918

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
                L   N+       +RF + P +   RE               E   GA  +  R  
Sbjct: 919  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 968

Query: 916  IFKNISGHQG-------FFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNH 967
             F++I G+ G        F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  
Sbjct: 969  YFEDIYGYSGGGGACPQVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPR 1028

Query: 968  GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLK 1027
            GF+Y   QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S  +  
Sbjct: 1029 GFLYFNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNM-- 1085

Query: 1028 PLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIP 1085
            P  +     I +  G + +   +   D +     E + ++++ P      W+    A I 
Sbjct: 1086 PCTR-----IPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIE 1136

Query: 1086 MQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
            ++  E+   ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P  
Sbjct: 1137 LEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQ 1196

Query: 1145 LVLSGSYGPLFSSVQ 1159
             +    +  L+   Q
Sbjct: 1197 PLTKNKFKVLYEKEQ 1211


>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
           putorius furo]
          Length = 1399

 Score =  297 bits (761), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 210/636 (33%), Positives = 329/636 (51%), Gaps = 81/636 (12%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 21  KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 76

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVL----VYGLQMIILKASQGGSGLVG 215
           H FE PE   L+ G       P V+VDP GRC  +L    +YG ++++L   +     + 
Sbjct: 77  HYFEEPE---LRDGFVQNVHAPRVRVDPDGRCAAMLTAMLIYGSRLVVLPFRRES---LA 130

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
           +E     G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GR
Sbjct: 131 EEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGR 190

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           V+ +  TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +
Sbjct: 191 VAVRQDTCCIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLN 250

Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
           QS     +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++
Sbjct: 251 QSVPPYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLI 310

Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++     T  L     
Sbjct: 311 TDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY-----TEKLQEAPA 365

Query: 452 EEFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSF 503
               + + D P +K+ R  S+        A QD V+  E+ +YGS A + T+ A  T+SF
Sbjct: 366 GAVRETDKDEPPSKKKRVESAVGWSGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSF 421

Query: 504 AVRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYEL 541
            V DS++NIGP  + + G                L I        + + + + K    ++
Sbjct: 422 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 481

Query: 542 V---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTM 588
           V   ELPGC  +WTV   + +         G   + S + A DD   H +LI+S E  TM
Sbjct: 482 VTTFELPGCYDMWTVIAPARKEQEETPKGDGAEQEPSALEADDDGRRHGFLILSREDSTM 541

Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPS 648
           +L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P 
Sbjct: 542 ILQTGQEIMELDTS-GFATQGPTVFAGNIGDGRYIVQVSPLGIRLLEG---VSQLHFIPV 597

Query: 649 NSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
           +         S ++  ++ADPYV++  ++G + + +
Sbjct: 598 DL-------GSPIVQCAVADPYVVIMSAEGHVTMFL 626



 Score =  140 bits (353), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 182/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 741  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 796

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 797  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 846

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 847  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAAARGRVARFRYFEDIYGYSGVFICGPS 906

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 907  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 966

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S  +  P  +     I +  G + +   + 
Sbjct: 967  PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----IPRMTGEEKEFEAIE 1018

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1019 RDDRYVHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1074

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1075 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1125


>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Bos taurus]
          Length = 1490

 Score =  295 bits (754), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 209/635 (32%), Positives = 327/635 (51%), Gaps = 82/635 (12%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 114 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 169

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLV 214
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L       ++   GLV
Sbjct: 170 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLV 226

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAG 272
           G+        G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW G
Sbjct: 227 GE--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPG 278

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           +V+ +  TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y 
Sbjct: 279 KVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYL 338

Query: 333 SQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
           +QS     +ALN+      +     +    + LD A A ++  D  ++S K G++ +LT+
Sbjct: 339 NQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTL 398

Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
           + DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        S+  
Sbjct: 399 ITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA- 457

Query: 451 KEEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
             E  D E      KR+  +     S    QD V+  E+ +YGS A + T+ A  T+SF 
Sbjct: 458 -REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 512

Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
           V DS++NIGP  + + G                L I        + + + + K    ++V
Sbjct: 513 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 572

Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
              ELPGC  +WTV     +         G   +     A DD   H +LI+S E  TM+
Sbjct: 573 TTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMI 632

Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
           L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +
Sbjct: 633 LQTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVD 688

Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                    S ++  ++ADPYV++  ++G + + +
Sbjct: 689 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 716



 Score =  142 bits (359), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 183/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 831  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 886

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 887  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 936

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 937  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 996

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 997  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1056

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +V  +      G + +   + 
Sbjct: 1057 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIE 1108

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              + +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1109 RDERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1164

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1165 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1215


>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Nasonia vitripennis]
          Length = 1415

 Score =  294 bits (752), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 210/667 (31%), Positives = 334/667 (50%), Gaps = 78/667 (11%)

Query: 58  LVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           LVV  AN+I ++ ++   + G KE        +           LE +  Y LHGNV S+
Sbjct: 30  LVVAGANIIRVFRLIPDVDPGKKEKFTESRPPK---------MRLECLAQYTLHGNVMSM 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G+     RDS++L+F +AK+SV+E+D  IH LR  S+H FE  E   +K G  +
Sbjct: 81  QAVQLIGSP----RDSLLLSFREAKLSVVEYDPEIHSLRTVSLHYFEEEE---IKDGWTN 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P+V+VDP+GRC  +L+YG ++++L   +      GD             I SS++I
Sbjct: 134 HHHVPIVRVDPEGRCAVMLIYGRKLVVLPFRKDPILDEGDLIENPKSSSHKTPILSSYMI 193

Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            L+ L+  M ++ D  F+HGY EP ++IL+E   T+AGR++ +  TC + A+S++   K 
Sbjct: 194 VLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFAGRIAVRQDTCAMVAISLNIQQKV 253

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQ 353
           HP+IWS  NLP D Y+ +AV  P+GG L++  N++ Y +QS     ++LN+   +  +  
Sbjct: 254 HPIIWSVSNLPFDCYQAVAVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLTDNCTNFP 313

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
             P+    + L+++   ++  D  ++S KTG+L +L++  D  R V+     K   SVLT
Sbjct: 314 LKPQEGVKISLESSQVAFISPDRLVISLKTGELYVLSLFADSMRSVRGFHFDKAAASVLT 373

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEEFGDIEADAPSTKRLRRS- 470
           S +    ++  FLGSRLG+SLL++FT      +   S L+       +    TK+++   
Sbjct: 374 SCVCLCDDNYLFLGSRLGNSLLLRFTEKESEKINDISMLEMSLNSSNSQEQPTKKIKLDY 433

Query: 471 -----SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG---- 521
                +SD L D+ + EEL +YGS +  T     ++ F V DSL+NIGP  + S G    
Sbjct: 434 LEDWMASDVL-DIKDPEELEVYGSET-QTSIQITSYIFEVCDSLLNIGPCGNISMGEPAF 491

Query: 522 ----LRINAD-----ASATGISKQSNYELVE------------LPGCKGIWTVYHKSSRG 560
                  N++      + +G  K     +++            LPG + IWTV   +   
Sbjct: 492 LSEEFSNNSEPDVELVTTSGYGKNGALCVLQRSIRPQVITTFDLPGYENIWTVIDSTVSD 551

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
           + A +          H +LI++ +  TMVL+T   + EV +   +  QG TI AGNL   
Sbjct: 552 NRAKTETEGT-----HGFLILTQDDSTMVLQTGQEINEVVDQSGFSTQGTTIFAGNLGSN 606

Query: 621 RRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
           R +IQV + G R+L G     +M  DL                 ++  S ADPYV L   
Sbjct: 607 RYIIQVTQMGVRLLQGLEQIQHMPMDLG--------------CPIVHASCADPYVSLLSE 652

Query: 677 DGSIRLL 683
           DG + LL
Sbjct: 653 DGQVVLL 659



 Score =  119 bits (298), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 178/411 (43%), Gaps = 51/411 (12%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
            V  ++G LE++ +P     + +  F  G+  + D+            +  ++ +G+ Q  
Sbjct: 773  VYRDNGTLEVYSLPELRLSYLIKNFGFGQNILHDS------------MEFTTIQGSQQNE 820

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
              N   ++V E+A+     H +RP L   L D  +  YQ Y +  P+   K         
Sbjct: 821  PVN-PEVQVREIAVVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGHLK--------- 867

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETP--HGAPCQRITIFKNISGHQGFFLSGSRP 933
             L    +  + +  + FSR        E+ P  +      +  F NI+G+ G F+ G  P
Sbjct: 868  -LRFKKIDHNFI--VGFSRI---GPKEEDMPSMNDTRLCMMRYFSNIAGYNGVFIGGDYP 921

Query: 934  CWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
             W  +  R  LR HP   DG + +F   +NVNC  GF+Y   +  L+IC LP+  +YD  
Sbjct: 922  HWIFLTGRGELRAHPMNIDGPVKSFAPFNNVNCPQGFLYFNRKDELRICVLPTHLSYDAP 981

Query: 993  WPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL-IDQEVGHQIDNHNLS 1051
            WPV+KV PL+ TPH +TY  E   Y ++ S    +PL         D+E   +  N    
Sbjct: 982  WPVRKV-PLRCTPHFVTYHLESKTYCVVTSTA--EPLKSYYRFNGEDKEFTEEERNERF- 1037

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
               L+ T   E++ + +  P      W T     I +   E+   ++ V+L    T+   
Sbjct: 1038 ---LYPTQ--EQFSIVLFSPVS----WDTIPNTKIDLDQWEHVTCLKNVSLAYEGTRSGL 1088

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  + IGT Y  GED+ +RGR+ +F        P   +    +  +++  Q
Sbjct: 1089 KGYIVIGTNYNYGEDITSRGRIFIFDIIEVVPEPGQPLTKNRFKQIYAKEQ 1139


>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Apis mellifera]
          Length = 1415

 Score =  293 bits (751), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 209/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV  AN+I ++ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30  LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82  AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
              P+V+VDP+GRC  +L+YG ++++L   +  S   GD             I SS++I 
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194

Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           P+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  E 
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308

Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
                  P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K  
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPSTKR 466
            SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   +    + E +    K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPAKK 428

Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +++       +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S 
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486

Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
           G           N D      + +G  K     ++            ELPGC+ +WTV  
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
               G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           NL   R ++QV + G R+L G    Q +                 ++  S ADPYV L  
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650

Query: 676 SDGSIRLL 683
            DG + LL
Sbjct: 651 EDGQVMLL 658



 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 108/422 (25%), Positives = 175/422 (41%), Gaps = 71/422 (16%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            +V  +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E      
Sbjct: 772  LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                   M+V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 826  -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868

Query: 875  RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
                       R + L     P  L    R+E        R   +  F NI+G+ G F+ 
Sbjct: 869  -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917

Query: 930  GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
               P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 918  SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977

Query: 989  YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNH 1048
            YD  WPV+KV PL+ TPH +TY  E   Y +I S+   +PL                  +
Sbjct: 978  YDAPWPVRKV-PLRCTPHFVTYHLESKTYCVITSIA--EPLKSY---------------Y 1019

Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVV 1098
              +  D  + +T EE   R + P +        +   W+T     I +   E+   ++ V
Sbjct: 1020 RFNGED--KEFTEEERPDRFIFPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNV 1077

Query: 1099 TLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSS 1157
            +L    T+   +  + +GT Y  GED+ +RGR+L+F        P   +    +  +++ 
Sbjct: 1078 SLAYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAK 1137

Query: 1158 VQ 1159
             Q
Sbjct: 1138 EQ 1139


>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
          Length = 1532

 Score =  293 bits (749), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 206/621 (33%), Positives = 328/621 (52%), Gaps = 63/621 (10%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LE +  Y LHGN+ S+  +   G+    +RDS++L+F DAK+SV+E+D  IH LR  S+
Sbjct: 32  KLECLAQYTLHGNIMSMQAVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSL 87

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-D 218
           H FE  E   +K G  +    P+V+VDP+GRC  +L++G ++++L   +  S   GD  D
Sbjct: 88  HYFEEEE---IKDGWTNHHHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPSLDDGDLLD 144

Query: 219 TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
           T        A I SS++I L+ L+  M +V D  F+HGY EP ++IL+E   T+AGR++ 
Sbjct: 145 TAKLTSSNKAPILSSYMIVLKSLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFAGRIAV 204

Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS- 335
           +  TC + A+S++   + HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS 
Sbjct: 205 RQDTCAMVAISLNIQQRVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAFNSLIYLNQSI 264

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
               ++LN+ A +  +    P+    + L+ A   ++  D  ++S K+G+L +L++  D 
Sbjct: 265 PPYGVSLNSLADTSTNFPLKPQEGVKMSLEGAQVAFISADRLVISLKSGELYVLSLFADS 324

Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEE 453
            R V+     K   SVLTS +    ++  FLGSRLG+SLL++FT     ++ + +G +  
Sbjct: 325 MRSVRGFHFDKAAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNGGEIT 384

Query: 454 FGDIEADAPSTKRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
             + E++    K+ ++       +SD L D+ + EEL +YGS + +T     ++ F V D
Sbjct: 385 IEENESEETPAKKAKQDFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCD 442

Query: 508 SLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV------------ 542
           SL+NIGP  + S G           N D      + +G  K     ++            
Sbjct: 443 SLLNIGPCGNISMGEPAFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTF 502

Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
           ELPGC+ +WTV        N D  +  A  +  HA+LI+S E  TM+L+T   + EV +S
Sbjct: 503 ELPGCEDMWTVIGTL----NNDEIKTEA--EGSHAFLILSQEDSTMILQTGQEINEVDQS 556

Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
             +  QG T+ AGNL   R ++QV + G R+L G    Q +                 ++
Sbjct: 557 -GFSTQGSTVFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIV 605

Query: 663 SVSIADPYVLLGMSDGSIRLL 683
             S ADPYV L   DG + LL
Sbjct: 606 HASCADPYVTLLSEDGQVMLL 626



 Score =  127 bits (319), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 112/420 (26%), Positives = 176/420 (41%), Gaps = 66/420 (15%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            +V  +SG LEI+ +P+    + +  F  G+  + D+     L+   T IN          
Sbjct: 740  LVYRDSGTLEIYSLPDLRLSYLIRNFGFGQYVLHDSMESTTLQS--TPINEIPHP----- 792

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                   M+V E+ M     H +RP L   L D  +  YQ Y +  P+   K        
Sbjct: 793  ------DMQVREILMVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGYLK-------- 835

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
              L    +    +   R S  P      E+ P      RI +   F NI+G+ G F+   
Sbjct: 836  --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNTSDTRICVMRYFSNIAGYNGVFICSD 888

Query: 932  RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             P W  +  R  LR HP   DGS+ +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 889  YPHWIFLTGRGELRTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYD 948

Query: 991  NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
              WPV+KV PL+ TPH +TY  E   Y +I S    +PL                  +  
Sbjct: 949  APWPVRKV-PLRCTPHFVTYHLESKTYCVITSTA--EPLKSY---------------YRF 990

Query: 1051 SSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVTL 1100
            +  D  + +T EE   R L P +        +   W+T     I +   E+   ++ V+L
Sbjct: 991  NGED--KEFTEEERPDRFLYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKNVSL 1048

Query: 1101 FNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
                T+   +  + +GT Y  GED+ +RGR+L+F        P   +    +  +++  Q
Sbjct: 1049 AYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQ 1108


>gi|301628217|ref|XP_002943254.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Xenopus (Silurana) tropicalis]
          Length = 628

 Score =  292 bits (747), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 204/626 (32%), Positives = 318/626 (50%), Gaps = 74/626 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E S + +   E K            LEL+  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNPNCESSSKGEKGSEVKGH-------KEKLELMASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F++AK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG Q+++L       ++   GLVG+        G  +   
Sbjct: 135 NVHNPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHDGLVGE--------GQKSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
              K HP+IWS  NLP+D  + LAVP PIGGV++   N++ Y +QS     ++LN+    
Sbjct: 247 IMQKVHPVIWSLTNLPYDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVSLNSLTNG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             S    P+    V LD + AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTSFPLKPQEGLRVTLDCSQATFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ +T +     FLGSRLG+SLL+++T     S        +    + D P  K+ 
Sbjct: 367 ASVLTTSMTPMEPGYLFLGSRLGNSLLLRYTEKVQDSPAGPSKDPD----KQDEPPNKKK 422

Query: 468 RRSSSDALQ-----DMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           R  SS A       +MV+  +E+ +YGS    + +   T+SF V DS++NIGP    S G
Sbjct: 423 RVDSSLARPGGSKGNMVDEIDEIEVYGSEM-QSGTQLSTYSFEVCDSILNIGPCATASMG 481

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 482 EPAFLSEEFQESPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVISN 541

Query: 557 SSRGHNADSSRM------AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
             +               A  D   H +LI+S +  TM+L+T   + E+  S  +  Q  
Sbjct: 542 HKKEEQEGEKEGETPPVEAEEDTNRHGFLILSRDDSTMILQTGQEIMELDTS-GFATQDP 600

Query: 611 TIAAGNLFGRRRVIQVFERGARILDG 636
           T+ AGN+   + ++QV  RG R+L+G
Sbjct: 601 TVYAGNIGDNKYIVQVSPRGIRLLEG 626


>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1
           [Acromyrmex echinatior]
          Length = 1412

 Score =  289 bits (739), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 214/665 (32%), Positives = 342/665 (51%), Gaps = 76/665 (11%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV  ANVI ++ +    + ++  K + ET+            LE +  Y LHGN+ S+ 
Sbjct: 30  LVVAGANVIRVFRLIPDVDMTRREKYT-ETRP-------PKMKLECLTQYTLHGNIMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +   G+    +RDS++L+F DAK+SV+E+D  IH LR  S+H FE  E   +K G  + 
Sbjct: 82  AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
              P+V+VDP+GRC  +L++G ++++L   +  S  + D D   S    S     I SS+
Sbjct: 135 HHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPS--LDDGDLLDSAKLTSTNKTPILSSY 192

Query: 235 VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
           +I L+ L+  M +V D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
           + HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS     ++LN+ A S  +
Sbjct: 253 RVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADSSTN 312

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
               P+    + L+ +   ++  D  ++S K+G+L +L++  D  R V+     K   SV
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM--LSSGLKEEFGDIEADAPSTKRLR 468
           LTS +    ++  FLGSRLG+SLL++FT     ++  L+        +   + P+ K  +
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENENEETPAKKTKQ 432

Query: 469 R-----SSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
                  +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S G  
Sbjct: 433 DFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEP 490

Query: 522 ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
                    N D      + +G  K     ++            +LPGC+ +WTV     
Sbjct: 491 AFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFQLPGCEDMWTVIGIV- 549

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
              N D  R    ++  HA+LI+S E  TMVL+T   + EV +S  +  QG T+ AGNL 
Sbjct: 550 ---NNDEIRT---EEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLG 602

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
             R ++QV + G R+L G    Q +                 ++  S ADPYV L   DG
Sbjct: 603 ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVALLSEDG 652

Query: 679 SIRLL 683
            + LL
Sbjct: 653 QVMLL 657



 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 108/412 (26%), Positives = 174/412 (42%), Gaps = 54/412 (13%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            +V  +SG LEI+ +P+    + +  F  G+  + D+     L+   T IN          
Sbjct: 771  LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQ--STPINEIPHP----- 823

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                   M+V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 824  ------DMQVREILMVALGHHGNRPMLLVRL-DSDLQIYQAYRY--PKGYLK-------- 866

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
              L    +    +   R S  P      E+ P      RI +   F NI+G+ G F+   
Sbjct: 867  --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNRNITRICVMRYFSNIAGYNGVFICSD 919

Query: 932  RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 920  YPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979

Query: 991  NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
              WPV+KV PL+ TPH +TY  E   Y +I S    +PL        + +V         
Sbjct: 980  APWPVRKV-PLRCTPHFVTYHLESKTYCVITSTA--EPLKSYYRFNGEDKV--------- 1027

Query: 1051 SSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN 1108
                L + Y + ++    +    +   W+T     I +   E+   ++ V+L    T+  
Sbjct: 1028 ----LTKLYYLFQFSRIFMNLLFSPVSWETIPNTKIELDQWEHVTCLKNVSLAYEGTRSG 1083

Query: 1109 -ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
             +  + +GT Y  GED+ +RGR+L+F        P   +    +  +++  Q
Sbjct: 1084 LKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQ 1135


>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_a [Homo sapiens]
 gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_a [Homo sapiens]
 gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_a [Homo sapiens]
          Length = 1365

 Score =  287 bits (734), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 202/620 (32%), Positives = 322/620 (51%), Gaps = 80/620 (12%)

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
           S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G 
Sbjct: 2   SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
                 P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  + 
Sbjct: 55  VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYA 346
           ++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+  
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLT 226

Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
               +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K
Sbjct: 227 TGTTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDK 286

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADA 461
              SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +   
Sbjct: 287 AAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRV 346

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
            +T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + 
Sbjct: 347 DATAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAV 402

Query: 521 G----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY- 554
           G                L I        + + + + K    ++V   ELPGC  +WTV  
Sbjct: 403 GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA 462

Query: 555 --------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVD 604
                   +    G   + S     DD+   H +LI+S E  TM+L+T   + E+  S  
Sbjct: 463 PVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-G 521

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
           +  QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  
Sbjct: 522 FATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQC 571

Query: 665 SIADPYVLLGMSDGSIRLLV 684
           ++ADPYV++  ++G + + +
Sbjct: 572 AVADPYVVIMSAEGHVTMFL 591



 Score =  139 bits (350), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 105/413 (25%), Positives = 181/413 (43%), Gaps = 34/413 (8%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 706  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 761

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 762  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 811

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 812  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 871

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 872  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 931

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVS--VPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
             WPV+K IPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   
Sbjct: 932  PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 981

Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
            +   + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T  
Sbjct: 982  IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1037

Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
              +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1038 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1090


>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Gorilla gorilla gorilla]
          Length = 1440

 Score =  287 bits (734), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 210/675 (31%), Positives = 336/675 (49%), Gaps = 84/675 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP G C  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGTCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T        +S ++E     + + P +K+ 
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---ADKEEPPSKKK 422

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR---DSLVNIGPLKDFSYG--- 521
           R  ++               G  +     A    + AV    DS++NIGP  + + G   
Sbjct: 423 RVDATAGWSGEGRSRAGQERGQVTQGWSGAGAPLTVAVPQVCDSILNIGPCANAAMGEPA 482

Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
                        L I        + + + + K    ++V   ELPGC  +WTV      
Sbjct: 483 FLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 542

Query: 555 ----HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
               +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  QG
Sbjct: 543 EEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 601

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADP
Sbjct: 602 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADP 651

Query: 670 YVLLGMSDGSIRLLV 684
           YV++  ++G + + +
Sbjct: 652 YVVIMSAEGHVTMFL 666



 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 105/413 (25%), Positives = 181/413 (43%), Gaps = 34/413 (8%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 781  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 836

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 837  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 887  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 946

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 947  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVS--VPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
             WPV+K IPL+ T H + Y  E  +Y +  S   P  +         I +  G + +   
Sbjct: 1007 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR---------IPRMTGEEKEFET 1056

Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE 1107
            +   + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T  
Sbjct: 1057 IERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVS 1112

Query: 1108 N-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
              +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1113 GLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1165


>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Strongylocentrotus purpuratus]
          Length = 1439

 Score =  286 bits (733), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 219/726 (30%), Positives = 349/726 (48%), Gaps = 105/726 (14%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +A Y+ +H PTG+ +C      H  +                  P ++      NLVV  
Sbjct: 2   YAFYREIHPPTGVEHC---VYCHFFS------------------PDQQ------NLVVAK 34

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
            + + +Y + +  + +K +    + K +          LE    + + G V S+    Q 
Sbjct: 35  GSELTVYSM-ITVDSNKPTDKESKPKNK----------LEEAATFHIFGKVMSM----QS 79

Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
                  RD+++L+F +AK+S++E+D ++H L+  SMH FE  E    K G       P+
Sbjct: 80  AQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDE---TKEGVYRNIFHPV 136

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           VKVDP  RC  +L YG ++++L   +   GLV D D   S       +  S+VI L ++D
Sbjct: 137 VKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEMD 194

Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
             + +V D  F+HGY EP ++IL+E   TWAGRV+ +  TC I ALS++   K HP+IWS
Sbjct: 195 DPICNVLDIQFLHGYYEPTLLILYEPLRTWAGRVAVRQDTCSIVALSLNMAQKVHPIIWS 254

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS----SQELP 356
             +LP+D  ++ AVP PIGGVL++  N++ Y +QS      +  Y VSL+S    S   P
Sbjct: 255 QSSLPYDCMQVQAVPKPIGGVLILAVNSLLYLNQS------IPPYGVSLNSLTDWSTAFP 308

Query: 357 ---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + +D   AT++  D   LS K G++ +LT++ DG R V+   L K   SVLT
Sbjct: 309 LKTQEGVKLSMDCTQATFISYDRLALSLKDGEIYVLTLLVDGMRSVRGFHLDKAAASVLT 368

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           + I  +G+   FLGSRLG+SLL+++T     +  S   K E      + PS K     +S
Sbjct: 369 TCICPMGDGFLFLGSRLGNSLLLKYTEKVSETSPSDASKTEEPKPGEEPPSKKMRSDDAS 428

Query: 473 DALQD----MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
           D +      + + +EL +YG     T +   ++SF + DSL+NIGP  +   G       
Sbjct: 429 DWMASDTKFLDDPDELEVYGKQVQKTGTQLTSYSFEICDSLLNIGPCGNMIMGEPAFLSE 488

Query: 522 -LRINAD-----ASATGISKQSNYELVE------------LPGCKGIWTV--YHKSSRGH 561
             + N D      + +G  K     +++            LPGC  +WTV    K+    
Sbjct: 489 EFQGNVDPDLELVTTSGYGKNGALSVLQRTIRPQVVTTFNLPGCLDMWTVKSLKKAKADE 548

Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
            ++ S  +  D + HA+LI+S +  +MVL+T   +TEV     +  Q  TI A N+   R
Sbjct: 549 KSEESETSPEDKDRHAFLILSKQDSSMVLQTGQEITEVAAG-GFSTQAPTIFASNMGDDR 607

Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
            ++QV  +   +++G    Q +               S +   S+ADPY+LL   +G   
Sbjct: 608 YIVQVMNKSICLMEGVEQIQHMVL----------DVGSPIKQCSLADPYLLLLTENGDPI 657

Query: 682 LLVGDP 687
           L+   P
Sbjct: 658 LMTLKP 663



 Score =  134 bits (336), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 108/419 (25%), Positives = 174/419 (41%), Gaps = 43/419 (10%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V C E+G LE++ +P+    F V  F  G   +VD               S S   TG
Sbjct: 776  WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 820

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
               +++    +V E+ +        + ++ A++ D  I+ Y+A+    P NT   +  + 
Sbjct: 821  DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 875

Query: 873  TSRSLSVSNVSASRLRNLRFSRTP----------LDAYTREETPHGAPCQRITIFKNISG 922
              R   + +    + +  R S+ P                +         R+  F N+  
Sbjct: 876  V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKTETETEAESDTKTQTRRVNRLREFHNVQT 934

Query: 923  HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G F+SGS P W  V  R  LR HP   DG+I  F   HNVNC +GF+Y   +  L+IC
Sbjct: 935  YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 994

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
             LPS  +YD  WPV+KV PL+ TPH + Y  E   Y ++ SV       Q     + +  
Sbjct: 995  VLPSHLSYDAPWPVRKV-PLRCTPHFVAYHVETKTYAVVTSV-------QETKTHVWKVT 1046

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1100
            G +I    +   D     T   + +++  P        TR  I  +++EN   ++VV L 
Sbjct: 1047 GEEIGEEPVERDDRFVPTTKVVFSIQLFSPVSWDAIPNTR--IEYEAAENVTCLKVVNLS 1104

Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
               T    +  + + T +V  ED+  RG V ++        P   +      PL+   Q
Sbjct: 1105 CEGTMTGKKGYVVVATTHVYSEDLQTRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQ 1163


>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
           [Macaca mulatta]
          Length = 1436

 Score =  285 bits (729), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 214/702 (30%), Positives = 343/702 (48%), Gaps = 112/702 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNN--------------------TESAQKTFSF 503
           T           QD V+  E+ +YGS + +                    ++  Q+  +F
Sbjct: 427 TASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLATYSFEVRLRQQGPHPSQCPQRPLTF 484

Query: 504 AVR---DSLVNIGPLKDFSYG--LRINADASATGISKQSNYELV---------------- 542
           AV    DS++NIGP  + + G    ++ +      S + + E+V                
Sbjct: 485 AVPQVCDSILNIGPCANAAMGEPAFLSEEVPRVVNSPEPDLEIVVCSGHGKNGALSVLQK 544

Query: 543 ----------ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYLIIS 582
                     ELPGC  +WTV          +    G   ++    A DD   H +LI+S
Sbjct: 545 SIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILS 604

Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
            E  TM   T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      
Sbjct: 605 REDSTM---TGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQ 657

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
           L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 658 LHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 692



 Score =  115 bits (289), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 74/253 (29%), Positives = 119/253 (47%), Gaps = 20/253 (7%)

Query: 913  RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
            R   F++I G+ G F+ G  P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y
Sbjct: 923  RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLY 982

Query: 972  VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS--VPVLKPL 1029
               QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S   P  +  
Sbjct: 983  FNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNTPCAR-- 1039

Query: 1030 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQ 1087
                   I +  G + +   +   + +     E + ++++ P      W+    A I +Q
Sbjct: 1040 -------IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQ 1088

Query: 1088 SSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
              E+   ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +
Sbjct: 1089 EWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPL 1148

Query: 1147 LSGSYGPLFSSVQ 1159
                +  L+   Q
Sbjct: 1149 TKNKFKVLYEKEQ 1161


>gi|431908147|gb|ELK11750.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
           alecto]
          Length = 671

 Score =  285 bits (728), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 212/661 (32%), Positives = 328/661 (49%), Gaps = 114/661 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRSAEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 309

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT+V DG R V+     K  
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLVTDGLRSVRAFHFDKAA 369

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC----GSGTSMLSSGLKEEFGDIEADAPS 463
            SVLTS + T+     FLGSRLG+SLL+++T        +++  +  KEE        P 
Sbjct: 370 ASVLTSSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPASTVREAADKEE--------PP 421

Query: 464 TKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPL 515
           +K+ R  S+          QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP 
Sbjct: 422 SKKKRVDSTVGWSGGKSVAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPC 477

Query: 516 K-------------------------DFSYGLRINADASATGISKQSNYELV-------- 542
                                     + + GL  +   +    S + + E+V        
Sbjct: 478 ANAAMGEPAFLSEEVPVWEVQGGGGVECTVGLWPHPSLAQFQNSPEPDLEIVMCSGYGKN 537

Query: 543 ------------------ELPGCKGIWTVY-------HKSSRGHNAD---SSRMAAYDDE 574
                             ELPGC  +WTV         ++ +G   +   S+  A  D  
Sbjct: 538 GALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEQEETPKGEAVEPEPSAPDADDDGR 597

Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
            H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L
Sbjct: 598 RHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLL 656

Query: 635 D 635
           +
Sbjct: 657 E 657


>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
           musculus]
          Length = 1311

 Score =  283 bits (725), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 196/601 (32%), Positives = 310/601 (51%), Gaps = 68/601 (11%)

Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
           +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G       P V+VDP 
Sbjct: 11  KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQNVHTPRVRVDPD 67

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HV 246
           GRC  +L+YG ++++L   +     + +E     G G  +    S++I++R LD K  ++
Sbjct: 68  GRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNI 124

Query: 247 KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
            D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K HP+IWS  +LP 
Sbjct: 125 IDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPF 184

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELD 365
           D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +     +    + LD
Sbjct: 185 DCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLD 244

Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFF 424
            A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT+ + T+     F
Sbjct: 245 CAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLF 304

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-----SSDALQDMV 479
           LGSRLG+SLL+++T        SS    E  D E      KR+  +          QD V
Sbjct: 305 LGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVGWTGGKTVPQDEV 362

Query: 480 NGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------L 522
           +  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G                L
Sbjct: 363 D--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDL 418

Query: 523 RI------NADASATGISKQSNYELV---ELPGCKGIWTVYH----------KSSRGHNA 563
            I        + + + + K    ++V   ELPGC  +WTV            K+      
Sbjct: 419 EIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAESTEQE 478

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
            S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R +
Sbjct: 479 PSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYI 537

Query: 624 IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
           +QV   G R+L+G      L F P +         + ++  ++ADPYV++  ++G + + 
Sbjct: 538 VQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMF 587

Query: 684 V 684
           +
Sbjct: 588 L 588



 Score =  141 bits (356), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 176/386 (45%), Gaps = 30/386 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 703  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 758

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 759  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 808

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 809  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 868

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 869  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 928

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 929  PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIE 980

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 981  RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1036

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFS 1134
            +  +A GT  +QGE+V  RGR+ L+S
Sbjct: 1037 KGYVAAGTCLMQGEEVTCRGRIFLWS 1062


>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Anolis carolinensis]
          Length = 1294

 Score =  283 bits (724), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 213/680 (31%), Positives = 336/680 (49%), Gaps = 89/680 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E + +S  S E K            LELV  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNHDSESTTKSDRSSEGKSH-------KEKLELVAAFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + DE     G G  +    S++I
Sbjct: 135 NVHIPKVRVDPDGRCAVMLIYGTRLVVLPFRRD---TLTDEHEGVVGEGQKSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R+LD K  ++ D  F++GY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192 DIRELDEKLLNIIDMQFLYGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
           HP+IWS  NLP D  + LAVP PIGGV++   N++ Y +QS         Y VSL+S   
Sbjct: 252 HPVIWSLSNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSLNSLTN 305

Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
            +   P   +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K 
Sbjct: 306 GTTVFPLRIQEGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKA 365

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
             SVLT+ + T+     FLGSRLG+SLL+++T       +++  K+     E      KR
Sbjct: 366 AASVLTTCMITMDPGYLFLGSRLGNSLLLRYTEKLQEPPVNAA-KDATEKTEEPPVKKKR 424

Query: 467 LRRSSS-----DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           + + ++      A QD V+  E+ +YGS + +  +   T+SF V DS++NIGP  + + G
Sbjct: 425 VEQQANWAGGKSAPQDEVD--EIEVYGSEAQSG-TQLSTYSFEVCDSILNIGPCANAAMG 481

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 482 EPAFLSEEFQNSLEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 541

Query: 557 SSRGHNADSSRMAAY----------DDEYHAYLIISLEARTMVLETADLLTEVTESVDY- 605
                  D+   +A           D + H +LI+S E  TMV        +    +D  
Sbjct: 542 QKAEQEEDAQGESAEKEPSPPEPPDDGKRHGFLILSREDSTMVNPANGPTGQEIMELDTS 601

Query: 606 -FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
                 T  AGN+   R ++QV   G R+L+G      L F P +         S ++  
Sbjct: 602 GLAPRSTQDAGNIGENRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQC 651

Query: 665 SIADPYVLLGMSDGSIRLLV 684
           ++ADPYV++  S+G + + V
Sbjct: 652 AVADPYVVIMSSEGQVTMFV 671



 Score = 87.8 bits (216), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 66/227 (29%), Positives = 104/227 (45%), Gaps = 19/227 (8%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + V+  E+G +EI+ +P +  VF V  F  G+  +VD+   +    +E    +  EE   
Sbjct: 786 WCVLVRENGTMEIYQLPEWRLVFLVKNFPMGQRVLVDSSFGQPASQAE----AKKEEVIR 841

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           Q     +  + +V L  ++     SRP+L  +  D  +L Y+A+      + S+      
Sbjct: 842 QTEMPLVKEVLLVALGNRQ-----SRPYLL-VHVDQELLIYEAF-----NHDSQLGQTNL 890

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREE--TPHGAPCQRITIFKNISGHQGFFLSG 930
             R   V +    R +  R S+   ++   EE   P G    R   F++I G+ G F+ G
Sbjct: 891 KVRFKKVPHNINFREKKPRPSKKKTESAGGEEASVPRGR-VARFRYFEDIYGYSGVFICG 949

Query: 931 SRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
             P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG
Sbjct: 950 PSPHWLLVTSRGALRLHPMTIDGPIESFAPFHNVNCPKGFLYFNRQG 996


>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
           [Bos grunniens mutus]
          Length = 1417

 Score =  280 bits (717), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 206/635 (32%), Positives = 320/635 (50%), Gaps = 89/635 (14%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L       SV+E+D   H L+  S+
Sbjct: 65  KLELVASFSFFGNVMSMASVQLAGA----KRDALLL-------SVVEYDPGTHDLKTLSL 113

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLV 214
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L       ++   GLV
Sbjct: 114 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLV 170

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAG 272
           G+        G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW G
Sbjct: 171 GE--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPG 222

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           RV+ +  TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y 
Sbjct: 223 RVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYL 282

Query: 333 SQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
           +QS     +ALN+      +     +    + LD A A ++  D  ++S K G++ +LT+
Sbjct: 283 NQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTL 342

Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
           + DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        S+  
Sbjct: 343 ITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA- 401

Query: 451 KEEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
             E  D E      KR+  +     S    QD V+  E+ +YGS A + T+ A  T+SF 
Sbjct: 402 -REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 456

Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
           V DS++NIGP  + + G                L I        + + + + K    ++V
Sbjct: 457 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 516

Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
              ELPGC  +WTV     +         G   +     A DD   H +LI+S E  TM+
Sbjct: 517 TTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMI 576

Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
           L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +
Sbjct: 577 LQTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVD 632

Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                    S ++  ++ADPYV++  ++G + + +
Sbjct: 633 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 660



 Score =  141 bits (356), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 183/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 775  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 830

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 831  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 880

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 881  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 940

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 941  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1000

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +V  +      G + +   + 
Sbjct: 1001 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRM-----TGEEKEFETIE 1052

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              + +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1053 RDERYVHPQQEAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1108

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1109 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1159


>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Nomascus leucogenys]
          Length = 1177

 Score =  280 bits (715), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 202/651 (31%), Positives = 321/651 (49%), Gaps = 115/651 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++                          T++L
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKY--------------------------TEKL 400

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           +   + A+++  + EE            S +K                       R++A 
Sbjct: 401 QEPPASAVREAADKEE----------PPSKKK-----------------------RVDAT 427

Query: 528 ASATGISKQSNYELV----ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD- 573
              +G  ++S    V    ELPGC  +WTV          +    G   + S   A DD 
Sbjct: 428 VGWSGEGQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTPEADDDC 487

Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
             H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+
Sbjct: 488 RRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRL 546

Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
           L+G      L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 547 LEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 587



 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 86/290 (29%), Positives = 136/290 (46%), Gaps = 21/290 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 702  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 757

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 758  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 807

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 808  VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 867

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 868  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 927

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYP-LIVSVPVLKPLNQVLSLLIDQE 1040
             WPV K IPL+ T H + Y  E  + P  I++  V+K     +SLL  QE
Sbjct: 928  PWPVXK-IPLRCTAHYVAYHVESKVCPNFILAADVMKS----ISLLRYQE 972


>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Saimiri boliviensis boliviensis]
          Length = 1390

 Score =  273 bits (699), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 203/645 (31%), Positives = 322/645 (49%), Gaps = 105/645 (16%)

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
           S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G 
Sbjct: 2   SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
                 P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  + 
Sbjct: 55  VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--------------------------GGV 321
           ++ T K HP+IWS  +LP D  + LAVP PI                          GGV
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGENPGGAEGSAGRGAVSLPTSLCPPPGGV 226

Query: 322 LVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
           ++   N++ Y +QS     +ALN+      +     +    + LD A AT++  D  ++S
Sbjct: 227 VIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQATFISYDKMVIS 286

Query: 381 TKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
            K G++ +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T 
Sbjct: 287 LKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTE 346

Query: 440 G----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNT 494
                  +++  +G KEE    +    +T           QD V+  E+ +YGS A + T
Sbjct: 347 KLQEPPASAVREAGDKEEPPSKKKRVDATAGWSAGGKSVPQDEVD--EIEVYGSEAQSGT 404

Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASATG 532
           + A  T+SF V DS++NIGP  + + G                L I        + + + 
Sbjct: 405 QLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSV 462

Query: 533 ISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYL 579
           + K    ++V   ELPGC  +WTV          +    G   + S   A DD   H +L
Sbjct: 463 LQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEENPKGEGTEQEPSTPEADDDSRRHGFL 522

Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
           I+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G   
Sbjct: 523 ILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNVGDDRYIVQVSPLGIRLLEG--- 578

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
              L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 579 VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616



 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 113/428 (26%), Positives = 190/428 (44%), Gaps = 64/428 (14%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 731  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 787  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
                LS  N+       +RF + P +   RE               E   GA  +  R  
Sbjct: 830  ----LSQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSAEEGAGARGRVARFR 879

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG + +F   HN+NC  GF+Y   
Sbjct: 880  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNR 939

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
            QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S     P  +   
Sbjct: 940  QGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR--- 993

Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENA 1092
              I +  G + +   +   + +     E + ++++ P      W+    A I +Q  E+ 
Sbjct: 994  --IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHV 1047

Query: 1093 LTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSY 1151
              ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +    +
Sbjct: 1048 TCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKF 1107

Query: 1152 GPLFSSVQ 1159
              L+   Q
Sbjct: 1108 KVLYEKEQ 1115


>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Papio anubis]
          Length = 1389

 Score =  271 bits (692), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 202/644 (31%), Positives = 321/644 (49%), Gaps = 104/644 (16%)

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
           S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G 
Sbjct: 2   SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSAR 229
                 P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  + 
Sbjct: 55  VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-------------------------GGVL 322
           ++ T K HP+IWS  +LP D  + LAVP PI                         GGV+
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGEYPGSGWGCVEGALSLPTSLCPPPGGVV 226

Query: 323 VVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
           V   N++ Y +QS     +ALN+      +     +    + LD A AT++  D  ++S 
Sbjct: 227 VFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRIQEGVRITLDCAQATFISYDKMVISL 286

Query: 382 KTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
           K G++ +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T  
Sbjct: 287 KGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEK 346

Query: 441 ----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNTE 495
                 +++  +  KEE    +    +T           QD V+  E+ +YGS A + T+
Sbjct: 347 LQEPPASAVREAADKEEPPSKKKRVDATASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQ 404

Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASATGI 533
            A  T+SF V DS++NIGP  + + G                L I        + + + +
Sbjct: 405 LA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVL 462

Query: 534 SKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYLI 580
            K    ++V   ELPGC  +WTV          +    G   ++    A DD   H +LI
Sbjct: 463 QKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLI 522

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
           +S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G    
Sbjct: 523 LSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---V 578

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
             L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 579 NQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 615



 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 105/411 (25%), Positives = 180/411 (43%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 730  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 785

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 786  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 835

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 836  VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGXRGRVARFRYFEDIYGYSGVFICGPS 895

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 896  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 955

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S              I +  G + +   + 
Sbjct: 956  PWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TNTPCARIPRMTGEEKEFETIE 1007

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T    
Sbjct: 1008 RDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGL 1063

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1064 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1114


>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 2 [Cavia porcellus]
          Length = 1387

 Score =  271 bits (692), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 209/676 (30%), Positives = 332/676 (49%), Gaps = 88/676 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   + + A     +      GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +              I      ++SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 83  ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+  + 
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 423

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479

Query: 522 --------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH--- 555
                         L I        + + + + K    ++V   ELPGC  +WTV     
Sbjct: 480 EPAFLSEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVR 539

Query: 556 ------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
                   + G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  Q
Sbjct: 540 KEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQ 598

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
           G T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++AD
Sbjct: 599 GPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVAD 648

Query: 669 PYVLLGMSDGSIRLLV 684
           PYV++  ++G + + +
Sbjct: 649 PYVVIMSAEGHVTMFL 664



 Score =  139 bits (351), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 176/386 (45%), Gaps = 30/386 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 779  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 834

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 835  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 884

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 885  VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 944

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 945  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1004

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1005 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIE 1056

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1057 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1112

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFS 1134
            +  +A GT  +QGE+V  RGR+ L+S
Sbjct: 1113 KGYVAAGTCLMQGEEVTCRGRIFLWS 1138


>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 1 [Cavia porcellus]
          Length = 1440

 Score =  270 bits (690), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 209/678 (30%), Positives = 332/678 (48%), Gaps = 90/678 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   + + A     +      GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +              I      ++SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 83  ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+  + 
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 423

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH- 555
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 480 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 539

Query: 556 --------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                     + G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 540 VRKEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 598

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 599 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 648

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 649 ADPYVVIMSAEGHVTMFL 666



 Score =  139 bits (350), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 181/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 781  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 836

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 837  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 887  VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 946

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 947  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 1007 PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTR-----IPRMTGEEKEFEAIE 1058

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 1059 RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 1114

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 1115 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1165


>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Callithrix jacchus]
          Length = 1394

 Score =  266 bits (681), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 201/672 (29%), Positives = 322/672 (47%), Gaps = 124/672 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSAEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ ++                    F C +G   +                     
Sbjct: 367 ASVLTTSVSGTEG----------------FLCAAGGKSVP-------------------- 390

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------ 521
                   QD    +E+ +YGS + +  +   T+SF V DS++NIGP  + + G      
Sbjct: 391 --------QD--EXDEIEVYGSETQSG-TQLATYSFEVCDSILNIGPCANAAMGEPAFLS 439

Query: 522 ----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------- 554
                     L I        + + + + K    ++V   ELPGC  +WTV         
Sbjct: 440 EEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEE 499

Query: 555 -HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
            +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  QG T+
Sbjct: 500 ENPKGEGTEQEPSTPEADDDSRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 558

Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
            AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV+
Sbjct: 559 FAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFVPVDL-------GAPIVQCAVADPYVV 608

Query: 673 LGMSDGSIRLLV 684
           +  ++G + + +
Sbjct: 609 IMSAEGHVTMFL 620



 Score =  143 bits (360), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 184/411 (44%), Gaps = 30/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 735  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 790

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++  S   + 
Sbjct: 791  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLSQGNLK 840

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 841  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 900

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 901  PHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 960

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   + 
Sbjct: 961  PWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFETIE 1012

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              + +     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T    
Sbjct: 1013 RDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGL 1068

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P+  +    +  L+   Q
Sbjct: 1069 KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVTEPRQTLTXXKFKVLYEKEQ 1119


>gi|55725165|emb|CAH89449.1| hypothetical protein [Pongo abelii]
          Length = 565

 Score =  263 bits (673), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 182/536 (33%), Positives = 282/536 (52%), Gaps = 65/536 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
                          L I        + + + + K    ++V   ELPGC  +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 538


>gi|281205270|gb|EFA79463.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
          Length = 1395

 Score =  256 bits (653), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 209/734 (28%), Positives = 340/734 (46%), Gaps = 109/734 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLV+   +++++Y +R      ++ +     +++   D +    LEL    +L   +ESL
Sbjct: 31  NLVIAKTSLLQVYTIRYDRIEQQQQQQQQTNEQQSQQDTLKPW-LELNLELQLFSIIESL 89

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G D     DS+IL+F DAK+S+++++ +   L I S+H FE      LK GR++
Sbjct: 90  NCVRLPGDD----IDSLILSFRDAKVSIVKYNKATEKLDIRSLHYFEGNS--ELKGGRKT 143

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG-------------------DE 217
           F   PL++VD Q RC  +L+Y   + +L   +  S L                     DE
Sbjct: 144 FRTPPLIRVDYQQRCAVMLLYDRHLAVLPFPRSFSILDDEEEEEEEEAAVVADQQQQHDE 203

Query: 218 D-----------TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
           +              S      +   S+VI+L  L +++VKDF F+H Y EP ++ LHE 
Sbjct: 204 NEQQQPQDDQQQQQTSEKNKKKKQSESYVISLNSLGIENVKDFCFLHTYYEPTLLFLHEP 263

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
             TW  R+S K  T +++A+S++   +Q P+IWS  +LP++  +L+ VP P+GG +V+  
Sbjct: 264 SQTWTSRISSKKFTNVLTAVSLNIAQRQQPVIWSIEHLPYNCERLVPVPDPLGGAMVLTP 323

Query: 327 NTIHYHSQSASCALALNNYA-VSLDSSQELPRSSFSVE----LDAAHATWLQNDVALLST 381
           N + Y +QS+   L  N YA +      + P  S S      LD A+  +L  D  L S 
Sbjct: 324 NILFYFNQSSRYGLECNEYAQIDTGDQFQFPIDSSSTNLVFTLDCANFIFL-GDRLLGSL 382

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--- 438
           K G+L++  ++ DGR VQR+ ++K   SVL+S    + ++L FLGSRLGDSLL+Q+T   
Sbjct: 383 KGGELLIFHLISDGRNVQRISITKAGASVLSSTSCVLTDNLLFLGSRLGDSLLLQYTEKI 442

Query: 439 ----CGSGTSMLSSGLKE----EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
                      LS+  K+    E  D+  D     +   S +D     +  +E  ++   
Sbjct: 443 IDVDSSDNVENLSNPYKKKKTSEVFDLFDDEERNSKTGASDADGNGQSLFDDEDDIF--- 499

Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN-ADASATGISKQSNYELV------- 542
            N+ ++  K++   + D + NIGP+ D   G+  + A  S     +Q + ELV       
Sbjct: 500 -NDKKNQLKSYRLNICDHITNIGPVSDLITGVSYDHASVSNDESFEQRSLELVACSGHGK 558

Query: 543 -------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
                              ELPG +  WT+Y+      +   S  +       +      
Sbjct: 559 NGALTILQYGVRPELNTSFELPGVRQSWTLYYDDPLAASQSGSSASNAAASAASKKRQHE 618

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
           E  T+V +T   L EV +         TI   N+FGRRR+  V + G ++L G S +TQ+
Sbjct: 619 EDSTLVFQTGGQLKEVAK-----FDHATITVANMFGRRRIALVHQNGIKLLSGHSNITQE 673

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS-TCTVSVQTPAAIE 701
           +                +V    I DPYVL+   DG+I L  G+   T  +  + P    
Sbjct: 674 IKL-------------KSVKMAYIVDPYVLILHKDGTISLYQGNTGITQLLEYELPQP-- 718

Query: 702 SSKKPVSSCTLYHD 715
             K  V SC+++HD
Sbjct: 719 --KDGVMSCSMFHD 730



 Score =  133 bits (335), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 89/322 (27%), Positives = 148/322 (45%), Gaps = 49/322 (15%)

Query: 823  KVVELAMQRW-SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
            K+VE+ +    ++ HS P+L  +   G IL Y+A          K  D +  ++ L    
Sbjct: 872  KIVEIVIHYLHNSPHSSPYLMILNEFGDILIYKAI---------KYKDSMDNTKEL---- 918

Query: 882  VSASRLRNLRFSRTPLDAYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPC 934
                 +R ++ +   L +  RE +    P        ++I  F NI GH+G F+ G R  
Sbjct: 919  -----IRFIKHTDQNLHSKQREYSYGIDPSSESSFYIRKIVAFDNIGGHKGVFMCGKRSL 973

Query: 935  WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
            W    +  LR HP      + +FT  HN+NC++GFIY T +G+L+I QL +   ++N W 
Sbjct: 974  WFFCEKNYLRAHPMNFKDPVTSFTCFHNINCSYGFIYFTEKGVLRINQLSNMMNFENEWA 1033

Query: 995  VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
            ++K IPL+ T H+I++  E   Y L++S P                   Q D        
Sbjct: 1034 IRK-IPLRMTCHKISFHQEFKCYVLVISYP----------------QAPQSDEEEEEKEK 1076

Query: 1055 LHRTYTVEE-YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL--FNTTTKENETL 1111
              +   +EE ++V++++P      W    +  M   E  L  ++V L   +    + +  
Sbjct: 1077 SKKPLILEEKFQVKLIDPSMN---WSIVDSFSMSEKETVLCAKIVHLKYADVDGIKLKPY 1133

Query: 1112 LAIGTAYVQGEDVAARGRVLLF 1133
            L +GTAY  GED   +GR+L+F
Sbjct: 1134 LCVGTAYTHGEDTVCKGRILVF 1155


>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
 gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
          Length = 1453

 Score =  248 bits (632), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 198/738 (26%), Positives = 331/738 (44%), Gaps = 119/738 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS-AASLELVCHYRLHGNVES 115
           NLV++  N +++Y +       K  KN   T ++  +  +    SLEL+   +L G +ES
Sbjct: 31  NLVLSKNNTLQVYKI-------KYVKNENTTTQQKQIKKVEIKPSLELLIELKLFGTIES 83

Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
           +A +   G +    +DS++L F DAKISVL+++  I    I S+H +E+ E+   K GR 
Sbjct: 84  MASVRYPGEN----KDSLLLTFRDAKISVLDYNIDIMDFEIRSLHFYENDEF---KNGRI 136

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
            F   P++K+D Q RC  +L+Y   +++L   Q  S L  +++             ++  
Sbjct: 137 HFKHPPILKIDTQQRCATMLLYDRNIVVLPFKQISSILDDEDEEEKDEEDEKENDNANQD 196

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
                 D     +F F++GY EP ++ LHE   TW  R++ K  T  ++A+SI+ + K  
Sbjct: 197 YTEEFDDDDDDNNFCFLYGYYEPTILFLHEPSQTWTSRIAVKRLTSQLTAISINFSTKLA 256

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA-VSLDSSQE 354
            +IW   N+P++  +L++VP P+ G LV+  N + + +Q++   LA+N YA + +    E
Sbjct: 257 SIIWHTSNMPYNCDQLVSVPEPLSGALVITPNIMFHVNQTSKYGLAVNEYANIDIGDKFE 316

Query: 355 LPRS---SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
            P     +    LD ++  +L+ D  + S K G+L++  ++ DGR VQR+ +SK   SVL
Sbjct: 317 FPLDETLNLVFTLDRSNFVFLEADKFIGSLKGGELLIFHLISDGRTVQRIHVSKAGGSVL 376

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
            + +  + ++L FLGSRLGDSLL+Q+T  S T        +E  + E  +   K+ + S 
Sbjct: 377 ATCMCVVSDNLLFLGSRLGDSLLLQYTEKSIT--------DESLEHENFSNPYKKQKTSE 428

Query: 472 SDAL-----------QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
            + L            D V  EE  L+    N  +S Q      + D ++N+GP+ D   
Sbjct: 429 QEKLLNQQQQQQKDEMDEVLDEEDELFKEKKNQLKSYQ----LGICDQILNVGPVGDMVI 484

Query: 521 GLRINADASATGISKQSNY--ELVELPGCKG----------------------------- 549
           G  +N       +     Y    +EL  C G                             
Sbjct: 485 GQALNPTYDLNTLPSDPAYMPRFLELVTCSGYGKNGSISILQNSVKPEIVGAFDSEGVVN 544

Query: 550 -IWTVYHKSSRGHNADSSR------------------------MAAYDDEYHAYLIISLE 584
             WTVY+K+S     D                               +++Y  YL IS+ 
Sbjct: 545 SFWTVYNKASSSIKEDEEEKLIGKKRTINEIIKEEQQYEQQQQKQPIEEDYLDYLYISMS 604

Query: 585 ARTMVLETADLLTEVTESVDYFVQG----RTIAAGNLFGRRRVIQVFERGARIL-DGSYM 639
             T    T  L T  +E      +G    RT+  GNLF +RR++ + E   ++L D + +
Sbjct: 605 NGT----TNILDTTSSEEGKLTFKGEFEYRTLDMGNLFNKRRIVLINENSIKLLNDYNNI 660

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
            Q++                 + S  I DPYVL+  SD SI+L   D     ++    + 
Sbjct: 661 VQEIKLS------------KPIKSTFIQDPYVLVHYSDNSIQLFKCDYKLLKLNQFNFSL 708

Query: 700 IESSKKPVSSCTLYHDKG 717
               +  V + +L+ DK 
Sbjct: 709 NHGDEGKVLTSSLFFDKN 726



 Score = 98.2 bits (243), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 152/318 (47%), Gaps = 48/318 (15%)

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
             ++++VE++++    ++S+P+L      G ++ Y+++  E  +   K  +     R LS 
Sbjct: 876  ENLEIVEISLE--ILNNSQPYLLLKNRIGDLIVYKSFKKENGDLRFKKYNHNFILRDLSN 933

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
            ++ S +            D Y +         + I   K  S + G F+ G +P W    
Sbjct: 934  NSKSINS-----------DGYRK---------KSIVNIKLSSKNNGVFIGGQKPVWIFNE 973

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQKV 998
            +  +R+H    DG+IV+    HN +C +GF+Y T  +  +KI  L     ++N + +++V
Sbjct: 974  KGYIRLHSMDFDGAIVSLKPFHNADCPNGFLYYTEDKQHIKIGYLNGLMNFENEYAIRRV 1033

Query: 999  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1058
             P+K + H+I Y  E   Y ++VS P               +V  +++  +   +     
Sbjct: 1034 -PIKLSAHKIAYHNELKCYVVVVSFP---------------QVTQELEEDSKKPI----- 1072

Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTT--TKENETLLAIG 1115
             T E+++++I++P      W+   +  +Q  E  L +++V+L F  +  T +++  L IG
Sbjct: 1073 LTDEKFQIKIIDP-TIDWSWRFIDSFSLQDRETVLAMKIVSLKFKESDETIKSKPFLVIG 1131

Query: 1116 TAYVQGEDVAARGRVLLF 1133
            TA+  GED   +GRVL+F
Sbjct: 1132 TAFTFGEDTQCKGRVLVF 1149


>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
          Length = 2290

 Score =  238 bits (607), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 142/412 (34%), Positives = 222/412 (53%), Gaps = 38/412 (9%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---GISAASLELVCHYRLHGNV 113
           NL V     +E+YV++ +E+   ++ N  +  ++   D   G   A+L+ V  Y L+GNV
Sbjct: 31  NLAVVKGTQLELYVLKEEEKKHSKTCNGKQNGQKAAGDSGHGHGGATLQCVGRYDLNGNV 90

Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
           ES+A +   G    R RD + L F DAK+S+LE+D+SI  +   S+H FE  E   +++G
Sbjct: 91  ESMAFVRLPG----RNRDHLFLVFRDAKLSILEYDNSIDDIVNVSLHLFEDDE---IRKG 143

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF------------- 220
           R SF R PL++VDP  RC  +LVY  +M+++     GS L  D++               
Sbjct: 144 RVSFGRAPLLRVDPLQRCAALLVYESKMVVIPFKHKGSDLEEDDEILTQPNKKFKSESAS 203

Query: 221 -------GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                  G+       I  ++V++L +  +KHV DF F+ GY EP +  LHE   TWAGR
Sbjct: 204 SNTVTRLGAPSDNKLGILPTYVVDLDEAGIKHVVDFTFLDGYYEPTISFLHENSRTWAGR 263

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           ++  + T MI+ +S++ + ++ P+IWSA  LPH++  ++A+P+P GGV+VV +N + Y +
Sbjct: 264 LAVSNFTGMITTVSLNISQRRQPIIWSASKLPHNSRHIVALPAPAGGVVVVSSNALIYRN 323

Query: 334 QSASCALALNNYAVSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
               CAL LN YA++  D       +   +  D  H   L+    L S  TG+  ++ V 
Sbjct: 324 HEQKCALKLNEYAIAAGDGGNRFDTAGDIICFDTVHPVRLEGYQMLFSLVTGESYIMGVQ 383

Query: 393 Y--DGRVVQRLDLS----KTNPS-VLTSDITTIGNSLFFLGSRLGDSLLVQF 437
              DG  ++ L L     K +PS    S +  +G+S  FLGSRLGDS LV+ 
Sbjct: 384 LDTDGNTIKALTLDLVDVKLSPSGGFASIMCRVGDSYLFLGSRLGDSSLVKM 435



 Score = 76.6 bits (187), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 135/314 (42%), Gaps = 77/314 (24%)

Query: 501 FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------------ 542
           + F + D+L NIGP+     G R++A     G  K+ + ELV                  
Sbjct: 587 YRFELCDTLTNIGPI-----GSRLDA-----GAVKKDSVELVTASGGLQYGKLGVLQRSL 636

Query: 543 --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE----YHAYLIISL--EARTM 588
                    LP  + +WTV+  +++  + D       ++E     HAY++IS   +  T+
Sbjct: 637 NPVVMTAVPLPDAQAVWTVFGPTAKAADEDMEEDGNEEEEQSAGMHAYMVISQGNDKGTI 696

Query: 589 VLETADLLT-EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
           VL+  +L   +  E VD+ V  +T+  GN+FG +R++QV      +L+G    Q+L    
Sbjct: 697 VLKGRELEEFDEDEQVDFEVDAKTVCVGNIFGNQRIVQVTPWNVYVLNGPRKEQELPV-- 754

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
               +G+G +   +++  I DPY+ L + DG + LLVGD S+  V+      +      +
Sbjct: 755 ---VAGNGLQ---IVAAYIRDPYIALILQDGRLNLLVGDASSMQVNY-----VSHEIHNI 803

Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
           ++   + D  P+                GEA D        Q D+       +G  +++ 
Sbjct: 804 TAACFFLDPIPD----------------GEANDDP-----QQRDVMLAAAPRNGHFQLYT 842

Query: 768 VPNFNCVFTVDKFV 781
           +P+   V+    FV
Sbjct: 843 LPSLELVYDAADFV 856



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/235 (22%), Positives = 96/235 (40%), Gaps = 35/235 (14%)

Query: 913  RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD--GSIVAFTVLHNVNCNHGFI 970
            R+       G +G  ++  +P   +  R   R+HP   D    + +    +N+ C  G +
Sbjct: 1068 RLMPLGGAGGLEGVLIAARQPAVVLFGRGLPRIHPWKLDRGEGVRSAARFNNLQCKDGIV 1127

Query: 971  YVT------SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP 1024
             +       ++G+LKIC +P G + D  WP+ +   +  T H + + A    + L+VS  
Sbjct: 1128 CIADKGRDRAKGVLKICNIPEGISGDTPWPL-RTKHVGMTVHHVAFHAATGCHVLVVSSQ 1186

Query: 1025 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT---YTVEEYEVRILEPDRAGGPWQTR 1081
                               +I++       L       T E+YEV++  P          
Sbjct: 1187 ------------------QEIEDERKPEGTLEGAIPPLTEEKYEVQLRAP--YSMELLDS 1226

Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAAR--GRVLLF 1133
                  + E AL ++VV L NT  K++    +A+GT +  GE   +R  GR+ +F
Sbjct: 1227 YEFDFANGEKALCLQVVHLKNTRVKDSLLPFVAVGTGFQNGESETSRATGRIYVF 1281


>gi|9794904|gb|AAF98386.1| cleavage and polyadenylation specificity factor [Drosophila
           melanogaster]
          Length = 507

 Score =  237 bits (604), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 162/497 (32%), Positives = 258/497 (51%), Gaps = 49/497 (9%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29  NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
           L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83  LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
           I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
           ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
           SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
             L++E  ++E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508 SLVNIGPLKDFSYGLRI 524
           SL+N+ P+     G R+
Sbjct: 489 SLMNVAPINYMCAGERV 505


>gi|58702050|gb|AAH90169.1| LOC564406 protein, partial [Danio rerio]
          Length = 416

 Score =  229 bits (583), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 132/350 (37%), Positives = 206/350 (58%), Gaps = 15/350 (4%)

Query: 94  DGIS-AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
           DG S    LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H
Sbjct: 58  DGKSRKEKLEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTH 113

Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
            L+  S+H FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L   +    
Sbjct: 114 DLKTLSLHYFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT-- 168

Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTW 270
            + DE     G G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW
Sbjct: 169 -LADEQEGIVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTW 227

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
            GRV+ +  TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ 
Sbjct: 228 PGRVAVRQDTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLL 287

Query: 331 YHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
           Y +QS     ++LN+      +    P+    + LD + A+++ +D  ++S K G++ +L
Sbjct: 288 YLNQSVPPFGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVL 347

Query: 390 TVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           T++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T
Sbjct: 348 TLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT 397


>gi|147799623|emb|CAN68460.1| hypothetical protein VITISV_027523 [Vitis vinifera]
          Length = 558

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 119/175 (68%), Positives = 142/175 (81%), Gaps = 5/175 (2%)

Query: 424 FLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
           F GS+LGDSLLVQFT     S+ SS +++  GDIE B PS KR RRSSSDALQDMVNG++
Sbjct: 331 FEGSQLGDSLLVQFT-----SIPSSSVEKRVGDIEGBVPSAKRSRRSSSDALQDMVNGDK 385

Query: 484 LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE 543
           L LYGSA N+TE++QKTFSF+V DSL+++GPLKDF+YGLRINAD  ATGI KQ     VE
Sbjct: 386 LPLYGSAPNSTETSQKTFSFSVNDSLIDVGPLKDFAYGLRINADLKATGIVKQKMITEVE 445

Query: 544 LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
           LPGC+ IWTVYHK++RGHNADS++M   DDEY AYLIIS E+RTMVLET +LL E
Sbjct: 446 LPGCERIWTVYHKNTRGHNADSTKMITKDDEYCAYLIISPESRTMVLETVELLGE 500


>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Hydra magnipapillata]
          Length = 1122

 Score =  227 bits (579), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 193/664 (29%), Positives = 305/664 (45%), Gaps = 124/664 (18%)

Query: 57  NLVVTAANVIEIYVV----RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           NLV      + +Y +     V  +G + SK         ++D +    LEL+  + L GN
Sbjct: 29  NLVTAGGQRLNVYRLCDADMVVSDGDQSSK---------IVDSVGKRRLELLASFTLFGN 79

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           + ++ ++  G    S  RDS++LAF+ AK+S++EFD   H L+  SMH FE+ E+   K 
Sbjct: 80  IINMQVVRLG----SNVRDSLLLAFKHAKLSIVEFDPLSHDLKTDSMHYFENDEF---KG 132

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
           G       PLV+VDP+ RC  +L+Y   +++L        +  DE    S G     +  
Sbjct: 133 GLSHNIYLPLVRVDPEQRCACMLIYNRHLVVLPFKHD---IKLDESEELSDGEHIKSVLP 189

Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S++I+L  L+  + ++ +  F+HGY +P ++ L E   T  GRV+ +  T  +SA+S++ 
Sbjct: 190 SYMIDLHSLEQPLLNITELQFLHGYHQPTLMFLFEPVQTSTGRVAVRQDTFCVSAISLNM 249

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
           T K HP+IWS  NLP D + L  +  PIGGVLV  +N++ Y +QS      +  Y VSL+
Sbjct: 250 TEKVHPVIWSVTNLPFDCHMLRPIEKPIGGVLVFASNSLIYLNQS------IPPYGVSLN 303

Query: 351 SSQE----LP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
           S  E     P   +    + L  +    +  D  +LS K G++ +L+++ DG R V+   
Sbjct: 304 SITEGSTMFPLKIQEDVVITLAESSCDAIATDQFILSLKGGEIYVLSLLSDGLRTVRSFH 363

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
             K   SVL S +  I +   FLGSRLG+SLL+++T                   E D+ 
Sbjct: 364 FEKAAGSVLASCVCWIEHGFVFLGSRLGNSLLLRYT-------------------EKDSA 404

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
           S     +S    ++ M  G                         DSL+NIGP+   + G 
Sbjct: 405 SIA--EKSKEAKVEKMYGGGVGGGIIVC----------------DSLLNIGPITKAALGE 446

Query: 523 RINADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHK 556
                    G S+Q + E+V                          ELPGC  +WTV  K
Sbjct: 447 PAFLSEEFFG-SRQIDLEMVCCSGYGKNGTLTVLQRSIRPQVVTTFELPGCVNMWTVCGK 505

Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
           SS+             + YH+YLI+S +  TMVL+T   +TE+  S  + VQ  TI A N
Sbjct: 506 SSKESV----------ENYHSYLILSRDDSTMVLKTGAEITELDNS-GFNVQQPTIFACN 554

Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
               + ++QV  +   +L+ +     +S            +   +   SI+DPYV++  S
Sbjct: 555 HLSNKYILQVCPQSIHLLEDTVQINSISL----------QDTIKITQCSISDPYVVMVDS 604

Query: 677 DGSI 680
            G +
Sbjct: 605 TGQL 608


>gi|149512998|ref|XP_001514888.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Ornithorhynchus anatinus]
          Length = 831

 Score =  224 bits (572), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 215/738 (29%), Positives = 342/738 (46%), Gaps = 141/738 (19%)

Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S++I++R LD K  ++ D  F+HGY EP ++IL+E   TW GRV+ +  TC I A+S++ 
Sbjct: 8   SYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILYEPNQTWPGRVAVRQDTCSIVAISLNI 67

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
             K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS      +  Y VSL+
Sbjct: 68  LQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQS------VPPYGVSLN 121

Query: 351 S----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
           S    +   P   R    + LD A A ++  D  ++S K G++ +LT++ DG R V+   
Sbjct: 122 SLTAGTTAFPLRLREGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFH 181

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA- 461
             K   SVLT+ + T+     FLGSRLG+SLL+++T         S  +E   D  AD  
Sbjct: 182 FDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPAGSA-REPARDSGADKQ 240

Query: 462 -PSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
            P  K+ R   +        A QD V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 241 EPPVKKKRVEQALSWAGGKSAAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 296

Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
           GP  + + G                L I        + + + + K    ++V   ELPGC
Sbjct: 297 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 356

Query: 548 KGIWTVYHK-------SSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLT 597
             +WTV          S +G  A+S         D + H +LI+S E  TM+L+T   + 
Sbjct: 357 YDMWTVIAPVRKEEGDSPKGEGAESEPTPPEPEDDGKRHGFLILSREDSTMILQTGQEIM 416

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
           E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +        
Sbjct: 417 ELDTS-GFATQGPTVYAGNIGDDRYIVQVSPLGLRLLEG---VNQLHFIPVDL------- 465

Query: 658 NSTVLSVSIADPYVLLGMSDGSIR--LLVGDP---STCTVSVQTPAAIESSKKPVSSCTL 712
            S ++  ++ADPYV++  ++G +   LL  D     T  +++  P  + S  K ++ C +
Sbjct: 466 GSPIVQCAVADPYVVIMSAEGHVTMFLLKSDSYGGRTHRLALHKP-PLHSQSKVIALC-V 523

Query: 713 YHD-----------KGP--EPWLRKTSTDAWLSTGVGEAIDGADG--------------- 744
           Y D            GP  +P LR  S    L   +   +D  +                
Sbjct: 524 YRDVSGMFTTESRASGPRDDPSLRGQSEAEPLLQELSHTVDDEEEMLYGDSSSLFSPSRD 583

Query: 745 -------GPLDQGDI--------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
                   P D+           + V+  ++GA+EI+ +P +  VF V  F  G+  +VD
Sbjct: 584 EPRRSSLPPADRDAPQYRAEPTHWCVLVRDNGAMEIYQLPEWRLVFLVKNFPMGQRVLVD 643

Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAI----- 844
           +   +    S  +  +  EE   QG    +  + +V L  ++     +RP+L  +     
Sbjct: 644 SSFGQPAA-SAAQAEAKKEEPARQGELPLVKEVLLVALGNRQ-----TRPYLLRLKWAIR 697

Query: 845 ---LTDGTILCYQAYLFE 859
              LT  T +  Q Y+ +
Sbjct: 698 DSELTSITFIDMQLYIHQ 715


>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Amphimedon queenslandica]
          Length = 1408

 Score =  224 bits (571), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 235/880 (26%), Positives = 376/880 (42%), Gaps = 207/880 (23%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +A Y+ +H PTG+ +C S    HS  + V                            V +
Sbjct: 2   YAVYREVHPPTGVEHCTSCHFVHSEKEQV---------------------------AVAS 34

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
            +++ I+ V      ++  +N G+ K            L     +  HGN++SL  +   
Sbjct: 35  TSLLRIFDV------AQLQRNDGKAK------------LVQCLEFSFHGNIQSLDKVRLR 76

Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
            +D    RD ++L+F DAK+S++E++   +GL+  SMH FE  E   ++ G       P+
Sbjct: 77  HSD----RDCLLLSFNDAKLSIVEYNPETNGLKTVSMHQFEDEE---IRGGILHNDSRPV 129

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           VKVDP+GRC  +L++G  + +    Q    L  D     S    +  I  ++ I+LRDL 
Sbjct: 130 VKVDPEGRCAVMLLFGSHLAVCPFQQD---LSIDTPLSPSPSLDTHDILPTYTISLRDLP 186

Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
             +  +KD  F+ GY  P ++ L E   TWAGR+S +  + M+  LS++T+ K H +IW+
Sbjct: 187 EPLPVIKDMTFIEGYTSPTLLFLSEVSPTWAGRISLRQDSMMLLGLSLNTSDKSHTVIWT 246

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALN---NYAVSLDSSQELP 356
             NLP D+  L  VP P+GGVLV GANT+ Y +QS+    L+LN   +Y        E  
Sbjct: 247 LKNLPFDSSYLHPVPKPLGGVLVFGANTLIYLNQSSPPYGLSLNSITDYTTRFLLKNE-- 304

Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSD 414
             S  + LD + + ++ N+  L+S ++GD+ ++T+  D   R V+R+   K   S+L+S 
Sbjct: 305 -GSLGIRLDCSQSVFISNEQLLVSLQSGDIYIVTLFPDSGMRGVKRITFDKAASSILSSC 363

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           I +I     FLGSRL +SLL+++     ++ +   + E  G                  A
Sbjct: 364 ICSIKPHFLFLGSRLANSLLLRY-----STTVKQNIVEPIG-----------------GA 401

Query: 475 LQDMVNGEELSLYGSASNNTESAQK----TFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
           + D+   +++ +YG ++ +  ++       +S  V DSL+ IGP+   + G    A  S 
Sbjct: 402 ILDL---DDIEVYGESAVSQSTSSSSLLTNYSLEVCDSLLCIGPVVKATIGE--PAFLSE 456

Query: 531 TGISKQS-NYELV--------------------------ELPGCKGIWTV---------- 553
             + K   + ELV                          ELPGC  +WTV          
Sbjct: 457 EFVDKSDLDLELVLCSGHGKNGALSVLQRTIRPQVVTTFELPGCIDMWTVKSEGEEEEKG 516

Query: 554 ----YHKSSRGHNADSSRMAAYDDE-YHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
                   + G   D SR         H YLI+S    TMVL+T   +TE+ +S  +  Q
Sbjct: 517 EETKEEGQNEGGEKDQSREKEEKGSGQHDYLILSRSDSTMVLQTGQEITELDQS-GFATQ 575

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSV 664
             T+ AGN+     ++Q      R+L G     Y+  D+  G              V  V
Sbjct: 576 SATVFAGNV--GSFIVQATRTDIRLLKGIKQLCYVALDMGGG--------------VKCV 619

Query: 665 SIADPYVLLGMSDGSIRLL--------VGDPST--------------------CTVSVQ- 695
            +  PYV++ + +G I LL        +  PS                      T S+Q 
Sbjct: 620 DVCSPYVIVLLMEGEIGLLKLVDESLVLSWPSLGNNTPVNHISAYTDTSGLFDVTSSLQF 679

Query: 696 ----------TPAAIESSKKPVSSCTLYHDK-----GPEPWLRKTSTDAWLSTGVGEAID 740
                      P A    K+P  S +L +D+     GP     K    + +   +     
Sbjct: 680 EGDGSEKEEEVPIAPPPVKRPHLSSSLLYDEDELLYGPVKTEVKEENASPMEASLAAE-- 737

Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
             +  P      + ++C E GALEI+ VP F  VF V  F
Sbjct: 738 -PEAPPPITPTHWCLLCKEDGALEIYSVPEFQFVFAVRNF 776



 Score =  117 bits (294), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 77/225 (34%), Positives = 113/225 (50%), Gaps = 23/225 (10%)

Query: 917  FKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
            F NI+G+ G F+ G  P W  M  R  L +HP   DG + +F    NVNC  GF+Y   +
Sbjct: 894  FSNIAGYSGVFVCGPYPHWIFMAARGHLSIHPMYIDGPVQSFAPFDNVNCPSGFLYFNKE 953

Query: 976  GILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1035
              L+I  LP+  +YD+YWPV+KV PLKATPH + Y  E  ++ +I S P      Q +++
Sbjct: 954  SELRISVLPTQLSYDSYWPVRKV-PLKATPHFVGYHMESKVHVIIASTP------QPVTV 1006

Query: 1036 LIDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
            + D   G   D       D    Y+ EE Y +++L P      W+   TIP    E    
Sbjct: 1007 IPDPN-GETEDALETVERDGRFVYSQEETYYLQLLSPTS----WE---TIPHSKYEMEAH 1058

Query: 1095 VRVVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLF 1133
              V  +     +  ETL      + +GT    GE+++A+G+VL+F
Sbjct: 1059 YHVTDMKVMRLRSQETLSGRKEYIVVGTMATFGEELSAKGKVLIF 1103


>gi|33411762|emb|CAD58786.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
          Length = 880

 Score =  221 bits (563), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 161/497 (32%), Positives = 252/497 (50%), Gaps = 62/497 (12%)

Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ 
Sbjct: 8   SYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNI 67

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSL 349
           T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+     
Sbjct: 68  TQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGT 127

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNP 408
            +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   
Sbjct: 128 TAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAA 187

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
           SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  D E      KR+ 
Sbjct: 188 SVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRVD 245

Query: 469 RS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
            +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 246 ATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 301

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHKS 557
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 302 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 361

Query: 558 SR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
            +         G   +     A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 362 RKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFAT 420

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++A
Sbjct: 421 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVA 470

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 471 DPYVVIMSAEGHVTMFL 487



 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/271 (29%), Positives = 128/271 (47%), Gaps = 16/271 (5%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 602  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 657

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 658  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 707

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 708  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 767

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 768  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 827

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
             WPV+K IPL+ T H + Y  E  +Y +  S
Sbjct: 828  PWPVRK-IPLRCTAHYVAYHVESKVYAVATS 857


>gi|402591342|gb|EJW85272.1| hypothetical protein WUBG_03818, partial [Wuchereria bancrofti]
          Length = 1025

 Score =  218 bits (556), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 253/999 (25%), Positives = 418/999 (41%), Gaps = 150/999 (15%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            LE +   RL   V+S AI        +   DS +L F+DAK+S++  + +   L+  S+H
Sbjct: 62   LECLLAVRLLAPVQSFAI---ARISQNPDCDSFLLGFDDAKLSIVAVNPADRCLKTISLH 118

Query: 161  CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            CFE      LK G       P+++VDP  RC  +LV+G  + +L  +   + L       
Sbjct: 119  CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSAQL------- 168

Query: 221  GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                        S+ + L  +D +  +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169  -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279  HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
             T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218  DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337  S-CALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
              C ++LN+    +D   + P   F   ++ LD A  T +  +  LL  + G L  L +V
Sbjct: 276  PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332

Query: 393  YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
             D    V+ L+L     +V+   +T+      F+GSRL DS+ +   C    S L     
Sbjct: 333  TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388

Query: 452  EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS---ASNNTESAQKTFSFAVRDS 508
                       +TK+++ S+     +    E+  LYG         +  ++  +  V D 
Sbjct: 389  -----------ATKKIKLSTEPNANE--EDEDFELYGEMLPKVAKPDITEELLNIRVLDK 435

Query: 509  LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
            L+N+GP K  + G    +        K   ++LV   G    G   +  +S R     SS
Sbjct: 436  LLNVGPCKKITGGCPSISAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495

Query: 567  RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
             +            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L
Sbjct: 496  SIEGVVQYWAIGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554

Query: 618  FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSES-----GSGSENSTVLSVS 665
                  +QV          G +I    Y+   L+F   N+          ++N  +L   
Sbjct: 555  ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTFPVRNASIVDPYIAICTQNGRLLMYE 611

Query: 666  IAD-PYVLLGMSDGSIRL------------------LVGDPSTCTVSVQTPAAIESSKKP 706
            + + P+V L   D S RL                  ++   S   +S Q  A   +   P
Sbjct: 612  LTNHPHVHLKEIDISKRLRHETSPITSLSVYRDMSGIIRFCSAANMSQQQQATGANMHIP 671

Query: 707  -------VSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGPLDQGDI----Y 753
                   V    LY D       RK +       G+   E     D   +D   I    +
Sbjct: 672  EQEDFEDVDDLLLYGDSKKS---RKETLSKRRIVGMKLTEQNTHFDTDVIDPNTIVPSHW 728

Query: 754  SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG- 812
              +  E+G + I+ +P  + V+ V K     +H+ D    +   D E     ++ EGT  
Sbjct: 729  IAIARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDE----PATGEGTDA 780

Query: 813  -QGRKENIHSMK----VVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
              G   +  ++K    ++EL +     +  RP LF +L D T+  Y+ + +    N    
Sbjct: 781  MSGTMTDTFAVKPEEVIMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQ 835

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQ 924
                   + L  + V+    R+ RF  T  D     E+   A   +  +  F+ I     
Sbjct: 836  GHLAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLN 889

Query: 925  GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQL 983
            G F+  S PC   +     R+HP   DG I++FT  +N  C +GFIY+T +  ++++ +L
Sbjct: 890  GVFICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERDRLMRVAKL 949

Query: 984  PSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
            PS    D  +PV++ I + AT H + Y    N Y ++ S
Sbjct: 950  PSDMILDASYPVKR-INVGATVHSVVYLLHSNTYAVLTS 987


>gi|119602515|gb|EAW82109.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_b [Homo sapiens]
          Length = 377

 Score =  218 bits (555), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 133/369 (36%), Positives = 205/369 (55%), Gaps = 31/369 (8%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDIT 416
            SVLT+ ++
Sbjct: 367 ASVLTTSVS 375


>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
           [Harpegnathos saltator]
          Length = 1214

 Score =  215 bits (548), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 152/475 (32%), Positives = 243/475 (51%), Gaps = 52/475 (10%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
           M +V D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + HP+IWS  
Sbjct: 1   MDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVHPIIWSVS 60

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQELPRSSFS 361
           NLP D Y+ + V  P+GG L++  N++ Y +QS     ++LN+ A +  +    P+    
Sbjct: 61  NLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTNFPLRPQDGVK 120

Query: 362 VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGN 420
           + L+ A   +L  D  ++S K+G+L +L++  D  R V+     K   SVLTS +    +
Sbjct: 121 ISLEGAQVAFLSADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASVLTSCVCMCED 180

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRRS------SSD 473
           +  FLGSRLG+SLL++FT     ++ S    E    D + + P  K+ ++       +SD
Sbjct: 181 NYLFLGSRLGNSLLLRFTEKEPETIKSLDDGEINIEDNDNEEPPAKKAKQDFLGDWMASD 240

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----------R 523
            L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S G            
Sbjct: 241 VL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEPAFLSEEFAHN 298

Query: 524 INAD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
            N D    + +G  K     ++            ELPGC+ +WTV      G   +  ++
Sbjct: 299 QNPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI-----GSLNNDEQV 353

Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
            +  +  HA+LI+S E  TMVL+T   + EV +S  +  QG T+ AGNL   R ++QV +
Sbjct: 354 KSETEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLGANRYIVQVTQ 412

Query: 629 RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
            G R+L G    Q +                 ++  S ADPYV+L   DG + LL
Sbjct: 413 MGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVILLSEDGQVMLL 457



 Score =  127 bits (319), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 110/420 (26%), Positives = 178/420 (42%), Gaps = 66/420 (15%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            +V  +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E      
Sbjct: 570  LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYMLHDSMESTTLQSAPINETLNPE------ 623

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                   ++V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 624  -------LQVREVLMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGHLK-------- 665

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
              L    +    +     SR P      E+ P  A   RI +   F NI+G+ G F+   
Sbjct: 666  --LRFKKLDHGIIPG-HLSRKP----KEEDVPVNANETRICMMRYFSNIAGYNGVFICSD 718

Query: 932  RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             P W  +  R  LR HP   DGS+ +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 719  YPHWIFLTGRGELRTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYD 778

Query: 991  NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
              WPV+KV PL+ TPH +TY  E   Y +I S    +PL                  +  
Sbjct: 779  APWPVRKV-PLRCTPHFVTYHLESKTYCVITSTS--EPLKSY---------------YRF 820

Query: 1051 SSVDLHRTYTVEEYEVRILEPDR--------AGGPWQT--RATIPMQSSENALTVRVVTL 1100
            +  D  + +T E+   R L P +        +   W+T     I +   E+   ++ V+L
Sbjct: 821  NGED--KEFTEEDRPERFLYPSQEQFCIVLFSPVSWETIPNTKIELDQWEHVTCLKNVSL 878

Query: 1101 FNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
                T+   +  + +GT Y  GED+ +RGR+L+F        P   +    +  +++  Q
Sbjct: 879  AYEGTRSGLKGYIVLGTNYNYGEDITSRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQ 938


>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
          Length = 1105

 Score =  213 bits (542), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 221/887 (24%), Positives = 382/887 (43%), Gaps = 173/887 (19%)

Query: 388  LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
            +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T      + 
Sbjct: 2    VLTLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT----EKLQ 57

Query: 447  SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG------EELSLYGS-ASNNTESAQK 499
             + ++E   + E +     + +R  S+       G      +E+ +YGS A + T+ A  
Sbjct: 58   ETPMEEGKENEEKEKEPPNKKKRVDSNWAGCPKKGNLPDELDEIEVYGSEAQSGTQLA-- 115

Query: 500  TFSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV---- 542
            T+SF V DS++NIGP    S G         + N +        +G  K     ++    
Sbjct: 116  TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSI 175

Query: 543  --------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLIIS 582
                    ELPGC  +WTV +   +         G + +  +       D + H +LI+S
Sbjct: 176  RPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILS 235

Query: 583  LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
             E  TM+L+T   + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G      
Sbjct: 236  REDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQ 291

Query: 643  LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------------RLLVGDP 687
            L F P +         S ++  S+ADPYV++  ++G +               RL +  P
Sbjct: 292  LHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQKP 344

Query: 688  STCT------------------------------VSVQTPAAIESSKKPVSSCT------ 711
               T                              ++++T +  E+  + +S+        
Sbjct: 345  QIHTQSRVITLCAYRDVSGMFTTENKVSFLAKEEIAIRTNSETETIIQDISNTVDDEEEM 404

Query: 712  LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
            LY +  P     K  +    +           G    +   + ++  E+G +EI+ +P++
Sbjct: 405  LYGESNPLTSPNKEESSRGSAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLPDW 464

Query: 772  NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
              VF V  F  G+  +VD+    +   S T+     EE T QG   +I  +K V L    
Sbjct: 465  RLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTRQG---DIPLVKEVALVSLG 517

Query: 832  WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
            +S  HSRP+L A + +  +L Y+A+ ++               +  + SN+       +R
Sbjct: 518  YS--HSRPYLLAHV-EQELLIYEAFPYD---------------QQQAQSNL------KVR 553

Query: 892  FSRTPLDAYTREET--------PHG---------APCQRITIFKNISGHQGFFLSGSRPC 934
            F + P +   RE+         P G             R   F++ISG+ G F+ G  P 
Sbjct: 554  FKKMPHNINYREKKVKVRKDKKPEGQGEDSLGVKGRVARFRYFQDISGYSGVFICGPSPH 613

Query: 935  WCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
            W +V  R  +R+HP   DG+I +F+  HN+NC  GF+Y   QG L+I  LP+  +YD  W
Sbjct: 614  WMLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPW 673

Query: 994  PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
            PV+K IPL+ T H ++Y  E  +Y +  SV   +P  +     I +  G + +   +   
Sbjct: 674  PVRK-IPLRCTVHYVSYHVESKVYAVCTSVK--EPCTR-----IPRMTGEEKEFETIERD 725

Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLL 1112
            + +     +++ ++++ P        TR  + ++  E+   ++ V L +  T    +  +
Sbjct: 726  ERYIHPQQDKFSIQLISPVSWEAIPNTR--VDLEEWEHVTCMKTVALKSQETVSGLKGYV 783

Query: 1113 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            A+GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 784  ALGTCLMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQ 830


>gi|301093545|ref|XP_002997618.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110008|gb|EEY68060.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1744

 Score =  206 bits (525), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 270/1184 (22%), Positives = 460/1184 (38%), Gaps = 321/1184 (27%)

Query: 235  VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
            ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 278  LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 337

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A    +
Sbjct: 338  TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 397

Query: 352  SQELPRS------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
                P S              +V L      +LQ    LL+  +G + +L++ Y+     
Sbjct: 398  QSVFPLSEAVYETPEHETVQLNVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSR 457

Query: 395  ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
                      GR    L L     S+  S +         F+GSR GDS+L         
Sbjct: 458  GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 516

Query: 444  SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
            +      K+E   I      +  AP  K     S  A ++  + ++L LYG+A    E A
Sbjct: 517  TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 571

Query: 498  QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
              +                           + + +R  D L +IG +     G+  NAD 
Sbjct: 572  ATSSTECTNGVGVSSVKTEENGAPEQDTGSYDYELRQIDVLPSIGQITSIELGVENNAD- 630

Query: 529  SATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHN 562
                 S +   ELV                          EL GC+ +WTV         
Sbjct: 631  -----SNEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATR 685

Query: 563  ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
            +   R       Y+AYLI+S+  RTMVL T + +  + +   ++  G T+AA NLF ++R
Sbjct: 686  SSDGR------SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGSTLAAANLFNKQR 739

Query: 623  VIQVFERGARIL------------------DGSYM----------------TQDLSFGPS 648
            ++Q+F++GAR++                  +G+                  TQ+++    
Sbjct: 740  IVQIFKQGARVMMEVPEEETSNGQEKSAKTEGAEDEEEDDEDDGPRVKLVCTQEITLEGD 799

Query: 649  NSESGSGSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP--------- 697
                G   + S+V  +SV + DPY+LL ++DGS+RLL+GD     +SV  P         
Sbjct: 800  VECGGMNVDTSSVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDLELSVIDPEIDYAEGIS 859

Query: 698  ---AAIESSKKPVSSCTLYHD--------------------------------------- 715
                + + SK   SS  L++D                                       
Sbjct: 860  EANGSADMSKHGSSSACLFYDWAGMFVENAWVEEEQEERHEATQSRAKRAEDDDDMDALY 919

Query: 716  -KGPEPWLRKT-STDAWLSTGVGEAIDGADGGPLDQ---GDIYSVVCYESGALEIFDVPN 770
               P P +  T +T +  ST      DG+   PL Q     +   +C+  G+L +F +P+
Sbjct: 920  SSKPSPKVATTNATKSTPSTATPRNEDGSVSIPLLQQKDAKMMCGMCFGDGSLHVFSLPD 979

Query: 771  F--------------NCVFTVDKFVSGRTHIVDTYMREALKDSETEIN-SSSEEGTGQGR 815
            F              + V T++ +  GR   V       L      +N S+S    G+ +
Sbjct: 980  FKKRGVFPYLTFAPQSLVNTLEHYQVGRNKTVK------LSAPVLGLNASTSSANDGRIK 1033

Query: 816  KENIHSMKVVELAMQRW--------SAHHSRPFLFAILTDGTILCYQAY-LFEGPENTSK 866
            K +  +  V ++ + R         + + SR  +   L +G +L Y A   FE  +  + 
Sbjct: 1034 KSHTINSPVADIVIHRVGPSEGQHNAQYLSRMVMLVFLANGDLLMYSAAPKFESLKPRAN 1093

Query: 867  SD-DPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR---------ITI 916
             +  PV     +    ++   L     +    +A    E    A   +         +T 
Sbjct: 1094 GEIAPVFHFVRVGTELITRPFLPPKARTNAHNEAGNNPEVNTSAVLAKLRAGFRYPMLTC 1153

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNHGFI 970
            F N++   G F  G+ P W +  R      P +C+ +      +++FT  H+ NC +GFI
Sbjct: 1154 FHNVNNMSGAFFRGAHPMWILGDRGHASFVP-MCNAAPRVSVPVLSFTSFHHWNCPNGFI 1212

Query: 971  YVTSQGILKICQLPSGSTYDNYWP-----VQKVIPLKATPHQITYFA-----------EK 1014
            Y  S+G L++C+LPS  T     P     V +     AT H + Y             E 
Sbjct: 1213 YFHSRGALRVCELPSSKT-STILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVAEALEA 1271

Query: 1015 NLYPLIVSV--------------------------PVLKPL-NQVLSLLIDQEVGHQIDN 1047
              Y ++ S                           P   PL + V++   +    ++ D 
Sbjct: 1272 PTYAVVCSARLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFADYETD- 1330

Query: 1048 HNLSSVDLHRTYTVEE-YEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT 1104
                    H  +T E+ YE+R+++ D   G W  R    +  +  E  L+V+++ L++++
Sbjct: 1331 --------HMAHTEEDVYELRLVQTDEF-GEWGRRGVFRVHFERYEVVLSVKLMYLYDSS 1381

Query: 1105 -------------TKENETLLAIGTAYV--QGEDVAARGRVLLF 1133
                          K+    L +GT +V   GED + RGR+LL+
Sbjct: 1382 LMKEEVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLY 1425


>gi|301103686|ref|XP_002900929.1| cleavage and polyadenylation specificity factor subunit, putative
            [Phytophthora infestans T30-4]
 gi|262101684|gb|EEY59736.1| cleavage and polyadenylation specificity factor subunit, putative
            [Phytophthora infestans T30-4]
          Length = 1561

 Score =  206 bits (523), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 268/1170 (22%), Positives = 457/1170 (39%), Gaps = 306/1170 (26%)

Query: 235  VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
            ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 108  LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYYLTVISINMK 167

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
             + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A     
Sbjct: 168  TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 227

Query: 350  -------DSSQELPR---SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
                   D+  E P    +  +V L      +LQ+   LL+   G + +L++ Y+     
Sbjct: 228  QSVFPLSDAVYETPEHETAQLNVVLYDCQFEYLQDKELLLTMPCGQVYVLSLPYEDTSSR 287

Query: 395  ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
                      GR    L L     S+  S +         F+GSR GDS+L         
Sbjct: 288  GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 346

Query: 444  SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
            +      K+E   I      +  AP  K     S  A ++  + ++L LYG+A    E A
Sbjct: 347  TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 401

Query: 498  QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
              +                           + + +R  D L +IG +     G+  NAD 
Sbjct: 402  ATSSTECTNGVGVSSVKTEENGAPEQDTGPYDYELRQIDVLPSIGQITSIELGVENNAD- 460

Query: 529  SATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHN 562
                 S +   ELV                          EL GC+ +WTV         
Sbjct: 461  -----SNEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATR 515

Query: 563  ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
            +   R       Y+AYLI+S+  RTMVL T + +  + +   ++  G T+AA NLF ++R
Sbjct: 516  SSDGR------SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQR 569

Query: 623  VIQVFERGARIL------------------DGSYM----------------TQDLSFGPS 648
            ++Q+F++GAR++                  +G+                  TQ+++    
Sbjct: 570  IVQIFKQGARVMMEVPEEETSNGQEKSGKAEGAEDEEEDDEDDGPRVKLVCTQEITLEGD 629

Query: 649  NSESGSGSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP--------- 697
                G   + S+V  +SV + DPY+LL ++D S+RLL+GD     +SV  P         
Sbjct: 630  VECGGMNVDTSSVGIVSVDVVDPYILLLLTDVSVRLLMGDEEDLELSVIDPEIDYAEGIS 689

Query: 698  ---AAIESSKKPVSSCTLYHD-------------KGPEPWLRK-TSTDAWLSTGVGEAID 740
                + + SK   SS  L++D               P P +    +T +  ST      D
Sbjct: 690  EANGSADMSKHGSSSACLFYDWAEDDDDMDALYSSKPSPKVATMNATKSMPSTATPRNED 749

Query: 741  GADGGPLDQ---GDIYSVVCYESGALEIFDVPNF--------------NCVFTVDKFVSG 783
            G+   PL Q     +   +C+  G+L +F +P+F              + V T++ +  G
Sbjct: 750  GSVSIPLLQQKDAKMMCSMCFGDGSLHVFSLPDFKKRGVFPYLTFAPQSLVNTLEHYQVG 809

Query: 784  RTHIVDTYMREALKDSETEIN-SSSEEGTGQGRKENIHSMKVVELAMQRW--------SA 834
            R   V       L      +N S+S    G+ +K +  +  V ++ + R         + 
Sbjct: 810  RNKTVK------LSAPALGLNASTSSANDGRIKKSHTINSPVADIVIHRVGPSEGQHNAQ 863

Query: 835  HHSRPFLFAILTDGTILCYQAY-LFEGPENTSKSD-DPVSTSRSLSVSNVSASRLRNLRF 892
            + SR  +   L +G +L Y A   FE  +  +  +  PV     +    ++   L     
Sbjct: 864  YLSRMVMLVFLANGDLLMYSAAPKFESLKPRANGEIAPVFHFVRVGTELITRPFLPPKAR 923

Query: 893  SRTPLDAYTREETPHGAPCQR---------ITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
            +    +A    E    A   +         +T F N++   G F  G+ P W +  R   
Sbjct: 924  TNAHNEAGNNPEVNTSAVLAKLRAGFRYPMLTCFYNVNNMSGAFFRGAHPMWILGDRGHA 983

Query: 944  RVHPQLCDGS-------------------IVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
               P     S                   +++FT  H+ +C +GFIY  S+G L++C+LP
Sbjct: 984  SFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSCPNGFIYFHSRGALRVCELP 1043

Query: 985  SGSTYDNYWP-----VQKVIPLKATPHQITYFA-----------EKNLYPLIVSV----- 1023
            S  T     P     V +     AT H + Y             E   Y ++ S      
Sbjct: 1044 SSKT-STILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVAEALEAPTYAVVCSARLKPA 1102

Query: 1024 ---------------------PVLKPL-NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTV 1061
                                 P   PL + V++   +    ++ D         H  +T 
Sbjct: 1103 DADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFADYETD---------HMAHTE 1153

Query: 1062 EE-YEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT-------------T 1105
            E+ YE+R+++ D   G W  R    +  +  E  L+V+++ L++++              
Sbjct: 1154 EDVYELRLVQTDEF-GEWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKEEVASTSPEWN 1212

Query: 1106 KENETLLAIGTAYV--QGEDVAARGRVLLF 1133
            K+    L +GT +V   GED + RGR+LL+
Sbjct: 1213 KKKRPYLVVGTGWVGPHGEDESGRGRLLLY 1242


>gi|296414526|ref|XP_002836950.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295632796|emb|CAZ81141.1| unnamed protein product [Tuber melanosporum]
          Length = 1468

 Score =  203 bits (517), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 268/1213 (22%), Positives = 472/1213 (38%), Gaps = 247/1213 (20%)

Query: 57   NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVL----------------MDGI 96
            N++V   ++++I+     E        ++K  G+  RR+L                    
Sbjct: 29   NVLVAKTSLLQIFTTTTYETELNSALADAKQPGDIDRRILDADEEQTFAADIALQRSQVE 88

Query: 97   SAASLELVCHYRLHGNV---ESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
            S   L LV  Y L G+V   + + +LS   GG       ++++ +F+DAK S++E+D   
Sbjct: 89   SVTKLVLVAEYPLSGSVTGLQRIKLLSTRSGG-------EAVLASFKDAKCSLMEWDPET 141

Query: 152  HGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGVLVYGLQMII 203
            + +   S+H +E          RE F   P+V          DP  RC  +   G  + I
Sbjct: 142  NSITTISLHYYE----------REEFC-SPVVSDGLPTELVADPGSRCAALRFSGDMLAI 190

Query: 204  LKASQGG-----------------------------SGLVGDEDTFGSGG---------G 225
            +   Q                               +G    EDT    G         G
Sbjct: 191  IPFRQREDEELSLGRGDADEVMGDEDGDNDDWDPEMAGTARGEDTIMGEGDVKTTDATEG 250

Query: 226  FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
                   S V+++  LD  + HV    F+H Y EP   IL+    TW G ++ +     I
Sbjct: 251  KDRPYHPSFVLSVSQLDDAISHVISLTFLHEYREPTFGILYSPRRTWTGLLAAEGRKDTI 310

Query: 284  SALSISTTLKQHP--LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
            S + I+  L+Q     I S   LP+D +K++ +  P GG L+VG N  IH      +  +
Sbjct: 311  SYIVITLDLEQKASTPILSVSGLPYDIFKVVPLAPPTGGSLLVGGNELIHVDQAGKTTGV 370

Query: 341  ALNNYAVSLDSSQELP-RSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
            A+N +         L  +S   +EL+ +    L+++    LL TK G+ V++    DGR 
Sbjct: 371  AVNPFCRRSTGFAGLADQSDLCLELEGSQVVELESEGGDMLLFTKRGEGVIVGFRMDGRN 430

Query: 398  VQRLDLSKTNP---SVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            V  + ++K N    S++   ++T   +G    F+G   GD+ ++++           G+K
Sbjct: 431  VSGVKITKLNNHPGSIVGGRVSTAVGLGGRRLFVGCIEGDARVLKWRRKGERKKAGEGIK 490

Query: 452  EE----------FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
            EE          +G +E            SS     + NG          N+   +Q  +
Sbjct: 491  EEVLENEDEDDVYGALEDMDDDLYGGGGDSSFRKDSLTNGRR--------NSEAKSQGEY 542

Query: 502  SFAVRDSLVNIGPLKDFSYG------------------LRINADASATGISKQSNYELV- 542
             F   D L N+GP +D + G                  L +   +  +  S+ S   ++ 
Sbjct: 543  IFQTHDRLTNLGPFRDITLGKPTFPEESRERQKGVSPELELVTTSGPSNTSEDSGISIIR 602

Query: 543  -----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE-----YHAYLIISLEAR 586
                       + P C+ +WTV  +S+   NA        DD      +  +L ++    
Sbjct: 603  KSISPTIVGRFDFPQCQALWTVRARSANTSNAAVGLGGEEDDRSVEESFDRFLFVTKNDE 662

Query: 587  TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
            + V    D   EV    D+  +G TI  G +    R++QV     R+ D       +   
Sbjct: 663  SQVFRVGDTFEEV-RGTDFESEGETIEVGVVGNGMRIVQVVSEQVRVYDCDLQLSQII-- 719

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
            P   E  +G E   V    + DPY+LL   DGS  +   D +   ++ +   AI+  K  
Sbjct: 720  PMFDEE-TGEEGPNVHRARVCDPYILLIKVDGSPAVYKMDSTNLELAEERADAIKFDKYQ 778

Query: 707  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ-GDIYSVVCYESGALEI 765
             S C     K                 G+   +D     P++   D    +    G L+I
Sbjct: 779  -SGCIYASTK-----------------GIFIPLD----APVENVKDYLLFLLTVEGGLQI 816

Query: 766  FDVPN-FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKV 824
            +D+ N    +F+ + F        +T       D+ T   ++ E+      K+ I  + V
Sbjct: 817  YDLSNPVTPLFSAESF--------NTLYPLLRTDNPTSPTANREK---HRSKQLIIEILV 865

Query: 825  VELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS--KSDDPVSTSRSLSVSNV 882
             ++      +    P+L A  ++  +  Y+ ++   P      KS +P   S  LS+S  
Sbjct: 866  ADMG----DSIFKEPYLIARSSNNDLTFYKPFISSSPSTLRFIKSPNPHIASNELSLSAG 921

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRE 941
            + +  R L                        T   N++G+   FL G+ P + +   + 
Sbjct: 922  TKNIFRPL------------------------TAVYNLAGYSAVFLPGADPSFVIKTAKS 957

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPL 1001
              R+H +L    + + +  H+   + GF+YV S GI+++  +P+  T+D  W  +KV P 
Sbjct: 958  SPRIH-KLAGTGVRSLSSFHSAGADRGFVYVDSLGIVRVALMPAEFTFDGNWGYKKVTPG 1016

Query: 1002 KATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTV 1061
            +     + YF   N+Y  ++S    +P +      + +E G      N++  D      +
Sbjct: 1017 EHV-QSLAYFPPMNVY--VISTSKRQPFD------LAEEDG------NIAKDDTTLQPEI 1061

Query: 1062 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQ 1120
            +   +++L P      W          +E AL V+ ++L  +  TKE + L+++GTA  +
Sbjct: 1062 DSGTLKLLSPQT----WTAVDEYKFAHNEIALVVKTISLEVSEHTKERKQLVSVGTAIFR 1117

Query: 1121 GEDVAARGRVLLF 1133
            GED +ARG + +F
Sbjct: 1118 GEDHSARGGIYVF 1130


>gi|213407244|ref|XP_002174393.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
 gi|212002440|gb|EEB08100.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
          Length = 1431

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 289/1205 (23%), Positives = 483/1205 (40%), Gaps = 230/1205 (19%)

Query: 57   NLVVTAANVIEIY-VVRVQEEGS--------------KESKNSGETKRRVL-MDGISAAS 100
            NL+V   + ++++ +VRVQ +                +E+    ET  +++     +  +
Sbjct: 29   NLIVAKDDFLQVFDIVRVQRDSDDVEDAFGSSMNLRMEENDAFMETNMQLIRTHEHTVYT 88

Query: 101  LELVCHYRLHGNVESLAILSQ--GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
            L LV   R+ G ++ LA++    GG       D ++L    AK+S+L +D     L   S
Sbjct: 89   LRLVYQTRVFGTIKDLAVVKPKLGGFTT----DLLVLLTNYAKVSILVWDSLTQQLSTVS 144

Query: 159  MHCFES---PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
            MH +ES   P+ +      E+ A+   + VDP+  C  +  YG  M I+   +     + 
Sbjct: 145  MHYYESVVPPKPI----AEETPAQ---LIVDPESTCCVLRFYGDMMAIIPFRKPEDLEME 197

Query: 216  DEDTFGSGG-GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
            D +               S V+    LD  +  V D  F+ GY E  + +L+  E T   
Sbjct: 198  DANAQSEKPVDVQCVYLPSFVLTASQLDYSIARVLDSKFLEGYREATLALLYCPEQTSTV 257

Query: 273  RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY- 331
             +  +  T  ++ +++    +   +I S  NLP+D Y +L VP+P+GG L++G N I Y 
Sbjct: 258  FLPVRKDTVSLAVITLDIEQRASAVITSIHNLPYDIYCILPVPAPLGGSLLLGGNEIIYV 317

Query: 332  HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDL 386
             S  ++  +A+N +  +  + Q   RSSF +EL+      L     ++   LL   TG L
Sbjct: 318  DSAGSTVGIAVNPFYRNATNFQLEDRSSFQLELEGTIGVPLSSPRTESVSVLLIHPTGQL 377

Query: 387  VLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTC 439
              L  + DG+ V+ LDL     + N ++L S +T    + +   FLGS+ GDS LVQ++ 
Sbjct: 378  FYLDFLMDGKNVKNLDLHPASDELNNALLQSGVTCALPVADHELFLGSQTGDSYLVQWSR 437

Query: 440  GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
             S  +         + D E DA           D L D        +Y + S       K
Sbjct: 438  RSINNQTQEEGTLTYKDEENDA-------DEEVDELDD--------IYDTGSKEKAKRNK 482

Query: 500  -----TFSFAVRDSLVNIGPLKDFSYG---------------LRINADASATGIS----- 534
                      V D L N+GP+ +F  G               L +    + TG S     
Sbjct: 483  FVELGPLRLEVHDVLSNVGPIIEFCTGKAGSLAYFPQDNHGPLEVTC-VTGTGKSGSLVV 541

Query: 535  -KQSNYELVE----LPGCKGIWTVYHKSSRGHNADSSRMAAY-DDEYHAYLIISLEARTM 588
             ++S   +VE      GC+ +WT+ H + R  N  S     Y DDEY  YL++S E  + 
Sbjct: 542  FRRSISPVVEGKFNFEGCQSLWTI-HVTGRLKNPRSHGSERYLDDEYDTYLVVSKEKESF 600

Query: 589  VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGP 647
            V    +   EV +S D+  +G TI  G L G  R++Q+     R+ D + ++ Q ++ G 
Sbjct: 601  VFTAGETFDEVEDS-DFNTKGSTINVGGLLGGMRIVQICTTSLRVYDPNIHLVQRINLG- 658

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                     +   V++ S+ DPYV+L +  G I L   D  T  +       +    K V
Sbjct: 659  ---------KKQNVVAASVCDPYVVLVLLGGRILLYSMDAETQRL---IKMDLHKQLKNV 706

Query: 708  SSCTLYHDKGPEPWLRKTSTDAWL-----STGVGE-AIDGADGGP--------------- 746
             + +LY     +P +++  ++  L     S G  +  +DG D  P               
Sbjct: 707  KAASLYSTN--DPVMQELFSELDLGRNNSSPGKSDIQMDGVDTQPDRPSMPAGNQVTETN 764

Query: 747  ---LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
               LD+    +     V ++ G L++  +P ++CV   D F       + T + + L   
Sbjct: 765  VSTLDEQSFAAHFVLFVLHDDGRLKVLHLPTYSCVLECDVF------DLPTVLYDGLSSE 818

Query: 800  E-TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF 858
              TE++ SS+E              +VE+             L        I  Y+ ++ 
Sbjct: 819  RVTEMHESSQE--------------LVEVLATDLGDEAKEAHLLIRSRMNEITVYKPFVC 864

Query: 859  EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET------------- 905
              P  T K++                     LRFS+ P +  TRE T             
Sbjct: 865  SNPV-THKTE---------------------LRFSKIPQEGMTRESTECSLQDLVAETEQ 902

Query: 906  ---PHGAPCQ------------RITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQL 949
               P  A  Q            R+   + I  H   F++G++P + +     + + HP L
Sbjct: 903  ENAPKDASEQKPQKSSSTVDKPRMVALQRIGNHSAVFITGAKPFFLLKTAHSVAKFHPLL 962

Query: 950  CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
             +  I++    H  +   G+I+V     + IC+      YD+ W  +KV  +  + H I 
Sbjct: 963  SECRILSLASFHTEHAPKGYIFVDENYDINICRFQDDINYDHRWGYKKV-NVGRSVHGIA 1021

Query: 1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
            Y   K +Y +  S   L P        +  E G+ +         L RT +     + ++
Sbjct: 1022 YHPTKMVYAIATS--TLTPYE------VTDEEGNVVYPLKNEGEYLPRTNS---GMLELV 1070

Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARG 1128
             P      W           E  L VR+V L  +  TK  +  +A+GT+  +GED+A RG
Sbjct: 1071 SP----LTWTVIDRYKFLDYEIPLCVRLVNLEISDVTKLRKPFIAVGTSITKGEDIAVRG 1126

Query: 1129 RVLLF 1133
               LF
Sbjct: 1127 STYLF 1131


>gi|148886831|sp|Q7SEY2.2|CFT1_NEUCR RecName: Full=Protein cft-1; AltName: Full=Cleavage factor two
            protein 1
          Length = 1456

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 256/1117 (22%), Positives = 450/1117 (40%), Gaps = 160/1117 (14%)

Query: 94   DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
            D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96   DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154  LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
            L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156  LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214  VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
              D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216  DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255  YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
            Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276  YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315  PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
            P+P+GG L+VGAN  IH      S  +A+N       S   + ++   + L+      L 
Sbjct: 334  PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393

Query: 374  NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
             ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394  AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453

Query: 425  LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
            +GS  GDS+L+ +T   G +      ++    ++              D   D + GEE 
Sbjct: 454  VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485  SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
            +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508  ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567

Query: 530  ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
               A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568  LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574  ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
                +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628  DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624  IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
            +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+ +
Sbjct: 688  LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSVFI 742

Query: 683  LVGDPSTCTVSVQTPA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG 741
                P    +        I +S K ++ C LY D          ++  +    VG+    
Sbjct: 743  AEMSPKLLELEEVEKEDQILTSTKWLAGC-LYTD----------TSGVFADETVGKGT-- 789

Query: 742  ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
                   + +I   +   SG L I+ +P+      V + +S        Y+   L     
Sbjct: 790  -------KDNILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPPGLS---- 830

Query: 802  EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
              + ++ +GT    KE++  + V +L        H  P+L     +  +  YQ Y  +  
Sbjct: 831  -ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-- 880

Query: 862  ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIF 917
               + +  P S S       +   ++ N  F++ P +    ++ PH A    P +R +  
Sbjct: 881  ---ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS-- 928

Query: 918  KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
             NISG+   FL GS P + +   +       L    + A +  H   C HGFIY  + GI
Sbjct: 929  -NISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGI 987

Query: 978  LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
             ++ Q+P+ S+Y       K IP+      + Y      Y  +V    ++P      L  
Sbjct: 988  ARVTQIPTDSSYAELGLSVKKIPIGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPK 1041

Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
            D +   +    N++   +     V+   +++L    +G  W    T+ M+  E  L V  
Sbjct: 1042 DDDYHKEWARENITFKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVET 1092

Query: 1098 VTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            + L  + +T E + L+A+GTA ++GED+  RGRV +F
Sbjct: 1093 LNLEVSESTNERKQLIAVGTALIKGEDLPTRGRVYVF 1129


>gi|147827332|emb|CAN62175.1| hypothetical protein VITISV_001516 [Vitis vinifera]
          Length = 1989

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 121/228 (53%), Positives = 148/228 (64%), Gaps = 49/228 (21%)

Query: 383  TGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
            +G+L+LLT+V DGRVV +L LSK+  SV TS I  IG+SL F GS+LGDSLLVQF     
Sbjct: 1657 SGELLLLTLVCDGRVVYKLGLSKSRASVFTSGIAAIGSSLSFPGSQLGDSLLVQF----- 1711

Query: 443  TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
            T++ SS ++++ GD E D PSTKR RRSSSDALQDM NG++L LY               
Sbjct: 1712 TAIPSSSVEKKVGDSEGDVPSTKRSRRSSSDALQDMDNGDKLPLY--------------- 1756

Query: 503  FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
              V DSL+N+GPLKDF+YGLRIN D  ATGI KQSNYEL                     
Sbjct: 1757 --VSDSLINVGPLKDFAYGLRINTDLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRP 1814

Query: 542  -----VELPGCKGIWTVYHKSSRGHNADSSRMA-AYDDEYHAYLIISL 583
                 VELPGCKGIWTVYHK++RGHNADS +M+  +D E+ A++  SL
Sbjct: 1815 ERITEVELPGCKGIWTVYHKNTRGHNADSIKMSHVFDLEFRAFIFFSL 1862


>gi|407929511|gb|EKG22329.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
            phaseolina MS6]
          Length = 1418

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 258/1108 (23%), Positives = 447/1108 (40%), Gaps = 169/1108 (15%)

Query: 99   ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
            + L LV  Y L G V SLA +     D     D++++AF DAK+S++E+D + H L   S
Sbjct: 81   SKLVLVAEYPLEGTVLSLARIK--ALDTKSGGDALLIAFRDAKMSLVEWDPANHALSTIS 138

Query: 159  MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-GDE 217
            +H +E  E        +       +  DP  RC  +      + IL   Q G  LV GD+
Sbjct: 139  IHYYEGEELHGAPWDADLGHYHNFLAADPSSRCAALKFGARHLAILPFRQLGDDLVEGDD 198

Query: 218  ---------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
                           +   +G       +SS  ++L  +D  + H     F+H Y EP  
Sbjct: 199  YDPDFDEPMDAPAAKEKATNGDVAQTPYKSSFALSLPQIDPALTHPVHLDFLHEYREPTF 258

Query: 261  VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
             I+   +   A  +  +      +  ++    K    + S   LP+D +K++ +P P+GG
Sbjct: 259  GIISANKAAAASLLYERRDLLTYTVFTLDLEEKASTALLSVAGLPYDTHKVIPLPLPVGG 318

Query: 321  VLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVA 377
             L++GAN  IH      + A+A+N++A    S     +S  ++ L+ A    L  +N   
Sbjct: 319  ALLLGANQFIHVDQAGKTSAVAVNDFAKQCSSFPMSDQSELAMRLEGASIELLSPENGDL 378

Query: 378  LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLG 430
            L+  K G L +++   DGR V  L + K +            S  T++G +  F+GS  G
Sbjct: 379  LVVLKDGSLAVISFKLDGRSVSGLSIRKISEEKGGHVVPTAASCTTSLGRNRMFIGSEDG 438

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
            DS+L+ +T     + LS   K    ++ AD            D      +G   +   SA
Sbjct: 439  DSVLLGWT--KKAAQLSR--KRSHAEMLADDAELSFDEEDLEDDDDLYGDGPSTAKTASA 494

Query: 491  SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN-YELV------- 542
            S+   S    ++F + D ++++ P+KD +       D +   + + ++  +LV       
Sbjct: 495  SSEA-SDPSNYTFRIHDIMLSLAPIKDVALASHKVTDTAIGTLERAADQLDLVVSTGRGA 553

Query: 543  -------------------ELPGCKGIWTVYHK--SSRGHNADSSRMA----AYDDEYHA 577
                               E    + +W+V+ K  + +G  A  S+ A    A D +Y  
Sbjct: 554  AGGLALMRREIDPVILRKGEFSNARAVWSVHAKKPAPKGMVAAGSQDAEAKLAADVDYDQ 613

Query: 578  YLIISL------EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +LI+S       E   +   TA    E  +         TI  G + G  R++QV +   
Sbjct: 614  FLIVSRSNGDGGEESAIFNITATGFEETNKGDFEREDAATINVGTIAGGTRIVQVLKAEI 673

Query: 632  RILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC 690
            R  D    + Q L   P   E+GS      ++S S ADPY+L+   D S+ +L  D +  
Sbjct: 674  RSYDSELGLDQIL---PMEDENGS---ELRIISASFADPYILVIRDDSSVIVLQADANGE 727

Query: 691  TVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
               +     + S+K                         WLS  + ++    +       
Sbjct: 728  MEEIDRGDTLLSTK-------------------------WLSGCIHQSQSTGEKA----- 757

Query: 751  DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
               + +    G L IF++P+ +    V   +         ++   L    T   SSS   
Sbjct: 758  --LAYLLSAEGGLHIFELPDLSKPVYVAASLG--------FLPPTLTADFTPRRSSS--- 804

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                 K  +  + V EL      + +  P+L    +   ++ YQ Y F   E        
Sbjct: 805  -----KAALTEVIVAELG----DSTYKTPYLIVRTSSNDLVIYQPYHFPAHEVVK----- 850

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
                 +L    +   RL    FS  P  A   E+T  G      TI  N+ G+   F++G
Sbjct: 851  -PFFENLRWLKIPQPRLPE--FSEEP--ALESEDTGIGKESILTTI-ANVGGYSAVFMAG 904

Query: 931  SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY- 989
            + P + +     L    ++   S+   +  H   C+ GF Y+ + G L++CQLP G  Y 
Sbjct: 905  TSPSFILKESSSLPRVIKMRTKSVKNLSSFHRAECDRGFAYINADGNLRVCQLPRGYRYG 964

Query: 990  DNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
            D  W V+K I +      + Y   K++  L++ V   KP       L + E  H+    N
Sbjct: 965  DAGWAVKK-ISINQDVQAMCYHPPKDV--LVLGVGDKKPFT-----LPEDEHHHEWLEEN 1016

Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATI---PMQSSENALTVRVVTL-FNTTT 1105
            ++   +     VE+  +++L+        Q+ A I    +++ E  LT++V+ L  +  T
Sbjct: 1017 ITFKPM-----VEQGMIKVLDT-------QSLAVIDTYELEAFEVVLTIKVLNLEVSENT 1064

Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLF 1133
             E + L+A+GT +++GED+ +RG + +F
Sbjct: 1065 HERKQLVAVGTGFIRGEDLPSRGCIYVF 1092


>gi|38014465|gb|AAH60475.1| LOC398931 protein, partial [Xenopus laevis]
          Length = 363

 Score =  200 bits (508), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 109/303 (35%), Positives = 178/303 (58%), Gaps = 23/303 (7%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LEL+  +   GN+ S+A +   GA    +RD+++L+F++AK+SV+E+D   H L+  S+H
Sbjct: 66  LELMASFSFFGNIMSMASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLH 121

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVG 215
            FE PE   L+ G       P V+VDP GRC  +L+YG Q+++L       ++   GLVG
Sbjct: 122 YFEEPE---LRDGFVQNVHIPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHEGLVG 178

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
           +        G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW GR
Sbjct: 179 E--------GQKSSFLPSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGR 230

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           V+ +  TC I A+S++   K HP+IWS  +LP+D  + LAVP P+GGV++   N++ Y +
Sbjct: 231 VAVRQDTCSIVAISLNIMQKVHPIIWSLNSLPYDCTQALAVPKPVGGVVIFAVNSLLYLN 290

Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
           QS     ++LN+      S    P+    + LD + AT++  D  ++S K G++ ++T++
Sbjct: 291 QSVPPYGVSLNSLTNGTTSFPLKPQEEVRITLDCSQATFISYDKMVISLKGGEIYVVTLI 350

Query: 393 YDG 395
            DG
Sbjct: 351 TDG 353


>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1628

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 156/585 (26%), Positives = 266/585 (45%), Gaps = 128/585 (21%)

Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
           +++++++VKDF F+HGY EP ++ LHE   TW  R++ K  TC ++A+S++   K    I
Sbjct: 281 KNIEIENVKDFCFLHGYYEPTILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFI 340

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W+  N P++   L++VP P+GG LV+ AN + Y +Q++   LA+N YA S+D+S  +   
Sbjct: 341 WNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYA-SIDTSTIIGSQ 399

Query: 359 SFS----------VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
            F             LD ++  +L++D  + S K G+L++  ++ DGR VQR+ +SK   
Sbjct: 400 PFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQRIHVSKAGG 459

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT------SMLSSGLKEE-------FG 455
           SVLTS I  + N+L FLGSRLGDSLL+Q+T  S T         S+  K++         
Sbjct: 460 SVLTSCICVLSNNLIFLGSRLGDSLLLQYTEKSITDDQLEHENFSNPYKKQKTSEVFDLF 519

Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELS-------------LYGSASNNTESAQKTFS 502
           D  ++  +      +++   Q+  +   ++             L+    N  +S Q    
Sbjct: 520 DENSETNNNNNSNNNNNKENQEKSSSSSIASKLLEEIEDEEDQLFKEKKNQLKSYQ---- 575

Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY-----ELV--------------- 542
             + D ++NIGP+ D   G  I+     T    Q  Y     ELV               
Sbjct: 576 LGICDQIINIGPIGDIVVGQSIDPTYDETIQPNQPEYVPKTLELVTCSGYGKNGSISVLQ 635

Query: 543 -----------ELPGCKGIWTVY------------------HKSSRGHNADSSRMAAY-- 571
                      ELPG   +WTVY                   K SR  N ++ +      
Sbjct: 636 NNIKPELVMAFELPGILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQE 695

Query: 572 ----------------DDEYHAYLIISL-EARTMVLETADLLTEVTESVDYFVQGRTIAA 614
                           D  +H YL +SL +  T++ ET   L EV +        +++  
Sbjct: 696 DNEDNEEEEEEEKMQKDKNWHDYLYLSLKDGTTLIFETGRDLKEVGK-----FNFKSLDI 750

Query: 615 GNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           GNLFGR+R++ +++ G ++++G   + Q++              N  + S  I DP++LL
Sbjct: 751 GNLFGRKRIVVIYQGGIKLINGFDRVIQEIQI------------NEPIKSSYICDPFILL 798

Query: 674 GMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
              +G+I++  G D     +     +   +  + + S +L+ D+ 
Sbjct: 799 QFHNGTIQIFKGIDEENQLIQFSINSISNNLNQSIFSSSLFFDRN 843



 Score = 93.2 bits (230), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 62/242 (25%), Positives = 112/242 (46%), Gaps = 42/242 (17%)

Query: 912  QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ----------------LCDGSIV 955
            +RI  F +ISG +G F+ G +P W    +  LR+H                      ++ 
Sbjct: 1122 KRIFEFSSISGKRGLFIGGKKPIWAFCEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVE 1181

Query: 956  AFTVLHNVNCNHGFIYVTSQG-ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEK 1014
             FT  +N++C  GFIY + +  ++KIC L +   ++N   +++ IP K + H+I Y +E 
Sbjct: 1182 TFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIRR-IPTKNSCHKIAYHSEA 1240

Query: 1015 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1074
              Y +IVS P               +V  ++   +   +      T +++++++++P   
Sbjct: 1241 KCYVVIVSFP---------------QVTQELQEDSKKPI-----LTDDKFQIKLIDP-TI 1279

Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTT---TKENETLLAIGTAYVQGEDVAARGRVL 1131
               W+   +  +Q  E  L +++V+L  T           L IGTA+  GED   +GRVL
Sbjct: 1280 DWNWKFIDSFSLQDRETVLAMKIVSLKFTEPDGITRARPFLVIGTAFTFGEDTQCKGRVL 1339

Query: 1132 LF 1133
            +F
Sbjct: 1340 VF 1341



 Score = 91.3 bits (225), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 58/168 (34%), Positives = 89/168 (52%), Gaps = 18/168 (10%)

Query: 57  NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGISAA-------SLELVC 105
           NLV+   NV++IY +R ++    E   +S+   + ++      I+         SLEL+ 
Sbjct: 32  NLVLAKTNVLQIYKIRYEKIEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELII 91

Query: 106 HYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
             +L GN+ES+A +    ++    RDS+IL F DAKISVL++D  +    I S+H FE  
Sbjct: 92  EKKLFGNIESMASVRYPNSE----RDSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKD 147

Query: 166 EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           E+   K GR  F   PL+KVD Q RC  +L+Y   + +L   +  S L
Sbjct: 148 EF---KGGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSIL 192


>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 1673

 Score =  197 bits (501), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 181/744 (24%), Positives = 330/744 (44%), Gaps = 152/744 (20%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           AA LEL   +R+HGN+ SL ++     + S + D+++L+F++AK+S++E+      L   
Sbjct: 87  AACLELAAQFRVHGNITSLGVVPM---NYSGKADALLLSFKEAKMSLVEYSQFTQKLVTV 143

Query: 158 SMHCFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
           SMH FE  E+  L     S  R P  +KVDPQG C  + +YG ++ IL   Q G+ L+ D
Sbjct: 144 SMHYFEREEFKKLG----SIDRPPPEIKVDPQGYCAAMRIYGDRLAILPFKQDGADLLND 199

Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
            +   S   F   I    V+   DLD  ++++ DF F+ GY  P + I+++ E TW  R+
Sbjct: 200 LNDANSKYPFRPSI----VLPFLDLDKSIRNIIDFTFLFGYAVPTIAIMYQTEQTWTARL 255

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--- 331
             +  T  I+ +S+ T  + +P+++    LP++   L++VP+PIGG++V+  N I +   
Sbjct: 256 GIRKDTVSIAVISLDTAEESYPVLYKIEKLPYNCTMLVSVPTPIGGLIVLSHNAIIFTDQ 315

Query: 332 -HSQSASCA----------LALNNYAVSLDSSQELP---------------RSSFSVELD 365
            H+   +C           + L  Y + LD  Q  P                   ++ LD
Sbjct: 316 IHAPGIACIVNAYFDSETNIMLTPYELQLDMVQPRPPRPPSVFFAQNKYTDYKELAISLD 375

Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRV----------VQRLDLSK---------- 405
            +   ++  D+ LL  + G+++ + ++ +  V          V+   L++          
Sbjct: 376 GSRGMFISPDIFLLVLRDGEMIQVDLIGEEGVGRSWKRRKGGVKSFQLTRLGIRMTAPVH 435

Query: 406 -------TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE 458
                  +NP  L+   +++     FLGSR G  L   +   S  +  +  L  +F ++E
Sbjct: 436 LFPLADASNPLSLSGRNSSVPLGGSFLGSR-GSKLRYNYLFASSRTTDACLL--QFVEVE 492

Query: 459 ADAPSTKRLRRSSSDALQDMVNGE----ELSLYGSASNNTES------------------ 496
             A S+  +  +++  + +  NGE    +  LYG ++   ++                  
Sbjct: 493 EFAKSSVSMNGAAN--MNNTDNGEDDELDKDLYGDSTTAKQTDTDMSALLSSDEHGHGEI 550

Query: 497 -AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD---------------ASATG-------- 532
            +++T  F + DS+  + PL+DF+ GL                     +ATG        
Sbjct: 551 VSEQTLRFRLCDSVTVVSPLRDFAVGLPAETSEHRFSPKIGGCDLEIVAATGHGPHGHLA 610

Query: 533 -ISKQSNYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
            +++    ++V   ELP  + +WT+        + D   ++   D +H Y+I+S  + T 
Sbjct: 611 ILNRSVRPQIVTTFELPQIEEMWTI---RCAKFDKDYRLVSEPTDAFHKYVILSHSSGTS 667

Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF--- 645
           +L+  +  TE+ ++  ++  G T+  G L     ++QV   G  + D  +   D +    
Sbjct: 668 ILKAGEAFTEMDDTT-FYQAGPTVGVGALLDETIIVQVHPNGVILFD--FSKYDFTIIDR 724

Query: 646 --------------GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
                         G    E   G ++  V+S S  DPY +L M+ G I LL  D +T  
Sbjct: 725 LNTNRMHALYIFVEGTKLQEMRVGDDDIWVISCSFMDPYAMLLMNTGHIVLLSLDETTHQ 784

Query: 692 VSVQTPAAIESSKKPVSSCTLYHD 715
           ++  +    E  K+ VS+ +LY D
Sbjct: 785 ITQIS----EYKKRLVSTFSLYCD 804



 Score =  117 bits (294), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 124/502 (24%), Positives = 207/502 (41%), Gaps = 111/502 (22%)

Query: 753  YSVVCYESGALEIFDVPNFN--CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
            +  V  ++G L ++ +P+F   C F +  F +     +D  +  +     T  N++ +E 
Sbjct: 930  WCFVYTDTGHLLVYTLPDFKECCAFPL--FSTLPVLAMDVPLWRSRSIDSTFANTTGDE- 986

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                       + VV L     S     P+L  +  +G +  Y+  +F  P  TS +DD 
Sbjct: 987  --------FEEILVVNLGN---SKDRQTPYLVCLAANGDLAVYK--IFVCP--TSSNDDD 1031

Query: 871  VS--------TSRSLSVSNVSASRLRN---LRFSRTPLDAYTRE---------------E 904
             S         SR+ +   + A  L+    +R  R P D  TR+               +
Sbjct: 1032 TSFVNSGTFKQSRTPAELELDAQNLKKRLAIRLVRIPHDQITRDLQFYTDNEGDKIDLVQ 1091

Query: 905  TPHGAPC----QRITIFKNI--SG---HQGFFLSGSRPCWCMVFRER------------- 942
             P   P     Q +  F  I  SG   + G  ++GSRPCW MV  +              
Sbjct: 1092 EPQHQPTFLKRQHLKPFDAIGWSGGNMYSGVVVTGSRPCWIMVALQSRQQDLDVISFDNS 1151

Query: 943  -----------------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
                             LR HP   DG +  F  LHNVN  HGF+Y+  +G+ +ICQLP 
Sbjct: 1152 VACSTKLPPVPLLGTNMLRFHPMPVDGPMKCFAPLHNVNVAHGFLYINWKGLFRICQLPP 1211

Query: 986  GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS------VPVLKPLNQVLSLLIDQ 1039
               +D+ WPV KV P+  T H++ Y      Y +  S      +P  +  + V + +ID+
Sbjct: 1212 QFNFDHDWPVCKV-PIHKTVHKVAYHYSSQTYAIATSTPERFDIPHAQYASAVAAAVIDE 1270

Query: 1040 EVGHQIDNHNLSSVDLHR---------TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1090
              G ++ +       +             TV+ Y++ ++    +   W+T  +I +  +E
Sbjct: 1271 --GDEMPDAERKVTGIRELSEIKPGMYEATVDRYKIELV----SSVTWETVDSIELSEAE 1324

Query: 1091 NALTVRVVTLFNTTTKENETL-LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSG 1149
              + +  V L +  T   + L LAIGT Y +GED+++RG++ L+       +P N   + 
Sbjct: 1325 TVMALEAVDLSSKETISGKKLYLAIGTGYSRGEDLSSRGKLHLYDVIEVVPDPNNPQTNR 1384

Query: 1150 SYGPLFSSVQIDFASHFFAICS 1171
             +  + S    D  S F AIC+
Sbjct: 1385 KFKHVDSE---DDRSPFSAICT 1403


>gi|171695066|ref|XP_001912457.1| hypothetical protein [Podospora anserina S mat+]
 gi|170947775|emb|CAP59938.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1441

 Score =  197 bits (501), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 273/1193 (22%), Positives = 463/1193 (38%), Gaps = 209/1193 (17%)

Query: 57   NLVVTAANVIEIYVVRV--------QEEGSKESKNSGETKRRVLMD--GISAA------- 99
            NLVV  +++++I+  ++        Q+     ++N+G  + R+  D  G+ A+       
Sbjct: 28   NLVVAKSSLLQIFRTKIVSTEIDASQQGSGARTRNAGRYESRLANDDDGLEASFLGGDSL 87

Query: 100  ----------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFD 148
                       L LV    L G +  LA +    + N R   D++++AF+DA++S++E+D
Sbjct: 88   AFKTDRTNNTKLVLVSEISLSGTITGLAKIK---SQNLRSGGDALLVAFKDARLSLVEWD 144

Query: 149  DSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII 203
               H L   S+H +E  E     W        +F     +  DP GRC  +   G+ + I
Sbjct: 145  AERHDLSTVSIHYYEQDELQGSPWAPPLSNFTNF-----LAADPGGRCAALKFGGMNLAI 199

Query: 204  LKASQG---------------GSGLVGDEDTFGSGGGF--SARIESSHVINLRDLD--MK 244
            L   Q                G   V  E    +GG          S V+ L +LD  + 
Sbjct: 200  LPFKQADEDIDMDDDWDEDLDGPRPVKQEAAVVNGGSSIKETPYSPSFVLRLSNLDPSLL 259

Query: 245  HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
            H     F+H Y EP   IL    +  +  +  K H   +   ++    +    I S   L
Sbjct: 260  HPVHLAFLHEYREPTFGILAST-VNASNSLGRKDHLAYM-VFTLDLQQRASTTILSVPGL 317

Query: 305  PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
            P D +++  +P+P+GG L+VGAN  IH         +A+N       S     +S  ++ 
Sbjct: 318  PQDLFRVQPLPAPVGGALLVGANELIHIDQSGKPNGVAVNPLTKQCTSFGLSDQSDLNLR 377

Query: 364  LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT-- 417
            L+      L  +  L+    G + L+T   DGR V  LD+    S+T  S++   ++T  
Sbjct: 378  LEECTIDVLSAEELLVILSDGRMALVTFRIDGRTVSGLDVKLLPSETGGSLIPGRVSTLS 437

Query: 418  -IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDAL 475
             IG S+ F GS  GDSL+  +T     S       ++ G DI+              D  
Sbjct: 438  RIGKSVMFAGSEEGDSLVFGWTKKQNQSGRKKSRLQDVGLDIDMADEEDLDEDEDEDDLY 497

Query: 476  QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ 529
             +    ++ ++  +ASN  E      +F + D L++I P++  +YG  ++A  S      
Sbjct: 498  AEEPTPKQQAV-ATASNVKEG---DLTFRIHDRLLSIAPIQSMTYGQPVDAPGSEEEQNS 553

Query: 530  -----------ATGISKQSNYELV------------ELPGCKGIWTVYHKS--SRGHNAD 564
                         G +K S   ++            E P  +G WTV  K    +    D
Sbjct: 554  AGVRSELQLVCGVGRNKSSAMAIMNLAIPPKVIGRFEFPEARGFWTVCAKKPVPKSLQGD 613

Query: 565  SSRMAAYDD-----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIA 613
                A  +D     +Y  ++I++       E   +   TA     +T +      G TI 
Sbjct: 614  KGPGAIGNDYGTSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLTGTEFDPAAGFTIE 673

Query: 614  AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
            AG +    R+IQV +   R  DG      +   P   E       +T  S SIADPY+L+
Sbjct: 674  AGTMGKDNRIIQVLKSEVRCYDGDLGLSQIV--PMMDEETGAEPRAT--SASIADPYLLI 729

Query: 674  GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
              +D S+ +          S+     +E  +K         DK         +T  WL+ 
Sbjct: 730  IRNDQSVFI---------ASIHDDNELEEVEK--------EDK-------TLATTKWLTG 765

Query: 734  GVGEAIDGADGGPLDQGD--------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
             +    +G  G   + GD        I   +   SGAL I+ +P+               
Sbjct: 766  CLYTDTNGVFGE--ESGDKKAKLPESILMFLLSASGALYIYRLPDL-------------- 809

Query: 786  HIVDTYMREALKDSETEINS--SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
                 Y+ E L    T +++  ++ +GT    KE +  + V +L           P+L  
Sbjct: 810  -CKPVYVAEGLSYIPTGLSADYAARKGTA---KETVSEILVADLG----DTTAKSPYLIL 861

Query: 844  ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
               +  +  Y+ Y ++           +   ++L    +  S L     +++P +    E
Sbjct: 862  RHANDDLTMYEPYRYQLGAG-------LEFPKTLFFQKIPNSVL-----AKSPAEETDDE 909

Query: 904  ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
            E  H A C  +    NI G+   FL G  P + +   + +     L   ++ A +  H  
Sbjct: 910  EVTHQAKCLALRRCNNIGGYSTVFLPGPSPSFIIKSSKSMPKVLPLQGAAVTAISSFHTE 969

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
             C HGFIY  S  I+++ QLP   ++       K IP+      + Y      Y +  + 
Sbjct: 970  GCEHGFIYADSHNIVRVSQLPKDWSFAETGLAVKKIPIGEDIVAVAYHPPSQSYVVACNT 1029

Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD-LHRTYTVEEYEVRILEPDRAGGPWQTRA 1082
            P  +P           E+    D H   + + L    T+E   ++++ P      W    
Sbjct: 1030 P--EPF----------ELPRDDDYHKEWAREVLPFKPTLERGTLKLIGPIT----WTVVD 1073

Query: 1083 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS 1134
            TI M+  EN L V  + L  +  T E + L+ +GTA  +GED+  RG V +++
Sbjct: 1074 TIVMEPCENVLCVETLNLEVSEATNERKLLIGVGTAITKGEDLPTRGAVYVYN 1126


>gi|336276223|ref|XP_003352865.1| hypothetical protein SMAC_04980 [Sordaria macrospora k-hell]
 gi|380092984|emb|CCC09221.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1486

 Score =  196 bits (497), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 266/1124 (23%), Positives = 449/1124 (39%), Gaps = 175/1124 (15%)

Query: 94   DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
            D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 95   DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVDRNT 154

Query: 154  LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
            L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 155  LETISIHYYEKEELVGSPWVAPLHHYPTLLLADPASRCAALKFSERNLAILPFKQPDEDM 214

Query: 214  VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
              D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 215  DMDNWDEELDGPRPKKDLSGAIANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 274

Query: 255  YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
            Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 275  YRDPTIGVLSSTKTASNSLGHRDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 332

Query: 315  PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
            P+P+GG L+VGAN  IH      S  +A+N       S   + +S   + L+      L 
Sbjct: 333  PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 392

Query: 374  NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---IGNSLFF 424
             ++   LL    G L L+T   DGR V  L +    P    SV+ S +T+   +G S  F
Sbjct: 393  AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRVGRSTVF 452

Query: 425  LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
            +GS  GDS+L+ +T   G +      K    DI+ D                        
Sbjct: 453  VGSEEGDSVLLGWTRRQGQTQKR---KSRIQDIDLDLDLDDEDLEDDD-----------D 498

Query: 485  SLYGSASNNTESA--------QKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------- 529
             LYG  S + E A            +F + D L++I P++  +YG  +    S       
Sbjct: 499  DLYGEESTSPEQAISAAKAVKSGELNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSE 558

Query: 530  ----------ATGISKQSNYELV------------ELPGCKGIWTVYHKS--SRGHNADS 565
                      A G  K S   ++            E P  +G WTV  K    +    D 
Sbjct: 559  GVRSDLQLVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDK 618

Query: 566  SRMAA-YD--DEYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
              M+  YD   ++H ++I++       E   +   TA     +T +      G T+ AG 
Sbjct: 619  GPMSNDYDTSGQHHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGT 678

Query: 617  LFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
            +    R++QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL  
Sbjct: 679  MGKDCRILQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIR 733

Query: 676  SDGSIRLLVGDPSTCTV-SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
             D S+ +    P    +  V+    + +  K ++ C LY D                +TG
Sbjct: 734  DDFSVFVAEMSPKLLELDEVEKEDQMLTGTKWLAGC-LYTD----------------TTG 776

Query: 735  VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMRE 794
            V  A + A  G  D  +I   +   SG L I+ +P+      V + +S        Y+  
Sbjct: 777  V-FADEAAGKGTKD--NILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPP 825

Query: 795  ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
             L       + ++ +GT    KE++  + V +L        H  P+L     +  +  YQ
Sbjct: 826  GLS-----ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQ 873

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----P 910
             Y  +     + +  P S S       +   ++ N  F++ P +    ++  H A    P
Sbjct: 874  PYRVK-----ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPVEDDELHNAQRFLP 921

Query: 911  CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFI 970
             +R T   NISG+   FL GS P + +   +       L    + A +  H   C HGFI
Sbjct: 922  MRRCT---NISGYSTVFLPGSSPSFILKTAKSSPRVLGLQGSGVQAMSSFHTEGCEHGFI 978

Query: 971  YVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1030
            Y  + GI ++ Q+P+ S++       K IP+      + Y      Y  +V     +P  
Sbjct: 979  YADTNGIARVTQIPTDSSFAELGLSVKKIPVGVDTQSVVYHPPTQAY--VVGCNNAEPFE 1036

Query: 1031 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1090
                L  D +   +    N++   +     V+   +++L    +G  W    T+ M+  E
Sbjct: 1037 ----LPKDDDYHKEWARENITFKPM-----VDRGMLKLL----SGITWTVIDTVEMEPCE 1083

Query: 1091 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
              L V  + L  + +T E + L+A+GTA  +GED+  RGRV +F
Sbjct: 1084 TVLCVETLNLEVSESTNERKQLIAVGTALTKGEDLPTRGRVYVF 1127


>gi|393245434|gb|EJD52944.1| hypothetical protein AURDEDRAFT_81080 [Auricularia delicata TFB-10046
            SS5]
          Length = 1422

 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 258/1179 (21%), Positives = 490/1179 (41%), Gaps = 184/1179 (15%)

Query: 52   IGPVPNLVVTAANVIEIYVVRVQ------------EEGSKESKNSGETKRRVLMD----- 94
            +G   NLVV   N++ ++ VR++            +E  +  +     +  V MD     
Sbjct: 40   LGVATNLVVARQNLLRVFEVRIEAAPLPSQEKLLADEQGRGRRGMEAVEGEVEMDVGGEG 99

Query: 95   ----GI-------------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAF 137
                GI             +   L LV  +RLHG V  L  + Q  A    + D ++++F
Sbjct: 100  FVSAGIVKSAGQHARQRQRTVTRLYLVRQHRLHGIVTGLGRV-QTMASLEDKLDRLLVSF 158

Query: 138  EDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLV 196
            +DAKI++LE+ +  H L   S+H +E +P+ L     R        +++DP  RC  + +
Sbjct: 159  KDAKIALLEWSEVSHDLSTISIHTYERAPQMLAFDSARALTE----LRIDPNSRCAALTL 214

Query: 197  YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFV 252
             G  + IL   +  + L  D D     GG S  +    S +++L ++D  ++++ D  F+
Sbjct: 215  PGDAVAILPFYESQAELDMDVDQ----GGVSRDVPYSPSFILSLPEVDNDIRNIIDIAFL 270

Query: 253  HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
             G+  P + +L E + TW GR++    T  +  L++    + +P+I S   LP+D+ +L+
Sbjct: 271  PGFNNPTLAVLFETQRTWTGRLAEFKDTVRLRILTLDVVTRTYPIIGSVDGLPYDSMRLV 330

Query: 313  AVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
            A P+ +GGV+V+ AN + +  QS  + A+A N +A  + S    P       L+ + A +
Sbjct: 331  ACPAALGGVIVLTANAVLHIDQSGKNVAVAANGWAARV-SEFPTPAPERDETLEGSRAVF 389

Query: 372  LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLG 430
            + +   LL  + G +V + ++ +GR+V ++D+  +   + + + +  + + L  +GS  G
Sbjct: 390  VSDKTFLLVYRDGSIVPVELILEGRMVTKIDMGQRLAQTTIPTVVCAVQDDLVLVGSTAG 449

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-----DMVNGEELS 485
             S+L++ T              E  DI  DA S +    ++ ++       D ++ E+  
Sbjct: 450  PSILLKVT-------------HEEEDITPDAGSARENGAANGNSTNGATYDDPMDSEDED 496

Query: 486  LYGSASNNTESAQKTFS--------------FAVRDSLVNIGPLKDFSYGLRINAD---- 527
            LYG       S   T +               A+ DSL   GP+ D ++ L  N +    
Sbjct: 497  LYGGTDMMVTSTSGTLTVGGTAALEKRRILRLALADSLCGHGPISDMAFILGRNGERHVP 556

Query: 528  -------ASATG--------ISKQSNYELVELPGCKGIWTVYHKSS---RGHNADSSRMA 569
                      TG        +  +   +L  + G +G+W+   + +    G N +    A
Sbjct: 557  ELLAGVGVGHTGGLARFQRDLPARVKRKLHRISGNRGVWSFPVRRAVKVAGMNIERPTGA 616

Query: 570  AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ--GRTIAAGNLFGRRRVIQVF 627
            A  D     +I+S +A      +   + + +  VD   +    TI AG  F R  ++QV 
Sbjct: 617  ADWD----TVIVSTDATPSPGLSRVAVKDSSTDVDILTRLPAITIGAGPFFQRTAILQVV 672

Query: 628  ERGARIL--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
                R+L  DGS   + +DL            +  + + + SI+DP+V++   D ++ L 
Sbjct: 673  NNAIRVLEADGSERQVIKDLD---------GTTPRAKIRACSISDPFVVVVREDDTLGLF 723

Query: 684  VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK------GPEPWLR-KTSTDAWLSTGVG 736
            VG+     +  +  + +        + + Y D       G    L+ K   +A  ST + 
Sbjct: 724  VGETGKGKLRRKDMSMLGDKASRYLAASFYQDHSGLFQVGTARSLKGKEKANAPASTTIE 783

Query: 737  EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL 796
             A+D        +G  + V+C   G +EI+ +P    VF+          + DT+     
Sbjct: 784  AAMDEG------RGSQWLVLCRPQGVVEIWALPKLTLVFSCGGVSDIPPVMADTF----- 832

Query: 797  KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
               +    S  ++   Q    ++      E+ +        RP L  +L  GT+  Y   
Sbjct: 833  ---DLATPSPVQDPPRQAEDHDVE-----EILISPIGETTPRPHLLVLLRSGTVAVYDTA 884

Query: 857  LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
              E P        PV+T R   +  ++  R+ +      P++   +   P  AP   I  
Sbjct: 885  PVELP--------PVATGREAGL-QLAFVRIMSRAVDTAPIERAEKRGAP--APRHLIPF 933

Query: 917  FKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
              ++S   G FL+G +P W +   +  +R+ P + +  + +FT          F+  T +
Sbjct: 934  STSVS---GVFLTGGKPGWILGTDKTAVRLVPAV-NQVVHSFTACSLWGNRGEFLMNTDE 989

Query: 976  GILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1035
            G   +  +P     D   P    +P +  P+  T  A +    ++++   L+        
Sbjct: 990  GPCLVEWMPD-LRLDEELP-SFFMP-RGRPY--TSIAYEATTGMVIAASSLRS-----RF 1039

Query: 1036 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1095
            ++  E G+ +   +   +      T +   + +++P+     W T        +E   TV
Sbjct: 1040 VLFDEDGNTVWKPDAEFIS---DPTTDTSSLELIDPET----WTTVDGFEFAFNEMINTV 1092

Query: 1096 RVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLF 1133
            R V L   +T+  ++  +A+GT   +GED+A +G   +F
Sbjct: 1093 RTVNLETVSTEAGSKDFIAVGTTVFRGEDLAVKGATYIF 1131


>gi|194374339|dbj|BAG57065.1| unnamed protein product [Homo sapiens]
          Length = 330

 Score =  191 bits (485), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 114/302 (37%), Positives = 170/302 (56%), Gaps = 35/302 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS         Y V+L
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVAL 300

Query: 350 DS 351
           +S
Sbjct: 301 NS 302


>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
          Length = 1440

 Score =  190 bits (483), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 178/623 (28%), Positives = 300/623 (48%), Gaps = 93/623 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++AF+DAK+S++  ++    ++  S+H FE+    +L+ G   +   P+V+ DP+ 
Sbjct: 91  QDSILMAFDDAKLSIITINEKERNMQTISLHAFENE---YLRDGFVKYFHPPIVRTDPEN 147

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +                  S RI S ++I L+ +D  + +V 
Sbjct: 148 RCAASLVYGKHIAILPFHEN-----------------SKRIHS-YIIPLKQIDPRLDNVA 189

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 190 DIVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 249

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
              LL +P P+GG +V G+NTI Y +Q+   C + LN+     D   + P     S  + 
Sbjct: 250 CATLLPIPKPLGGAIVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKSMKMT 306

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + T+ G+L LL +V    G  V+ L+ SK   + +   +T     
Sbjct: 307 LDCSTSVYMEDGRIAVGTRDGELFLLRLVTSSGGATVKSLEFSKVWDTSIAYTLTVCAPG 366

Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
             FLGSRLGDS L++++    T  S+    +++E   +EA+                  +
Sbjct: 367 HLFLGSRLGDSQLLEYSLIKTTRESVKRHKMEQEQNHVEAE------------------L 408

Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + ++L LYG A     +++ E   ++  F+  D L NIGP+K    G R N  ++    +
Sbjct: 409 DEDDLELYGGAIEEQQNDDEEQITESLQFSELDRLRNIGPVKSMCVG-RPNYMSNDLVDA 467

Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
           K+ +  ++++     G  G   V+ +S R     SS +            ++E H YLI+
Sbjct: 468 KRRDPVFDVITASGHGKNGSLCVHQRSLRPEIVTSSLLEGAEQLWAVGRKENESHKYLIV 527

Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQVFERG-ARILDGSYM 639
           S    T+VLE  + L E+ E +  FV G+ T+AAG L      +QV     A + DG   
Sbjct: 528 SRIRSTLVLELGEELIELEEPL--FVTGQPTVAAGELSQGAFAVQVTSTSIALVTDG--- 582

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPSTCTVSV--- 694
            Q L+    +S       N  V+  SI DPYV +   +G + L  LV +P      +   
Sbjct: 583 -QQLAEVKIDS-------NFPVVQASIVDPYVAVLTQNGRLLLYTLVSNPYMQLQEIDLA 634

Query: 695 QTPAA--IESSKKPVSSCTLYHD 715
           QTP +  I  S   ++S ++Y D
Sbjct: 635 QTPFSTFIAQSASQITSISMYAD 657



 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 105/430 (24%), Positives = 179/430 (41%), Gaps = 51/430 (11%)

Query: 723  RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS 782
            ++   DA  S   GE  D  D         + V+ +E+G L +  +P    V+ + +F +
Sbjct: 736  KRLGHDAIQSGRGGEQSDAIDPSSYTSISHWLVLAHENGRLSVHSLPEMELVYQIGRFPN 795

Query: 783  GRTHIVD-----TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
                +VD           +K       +S EE      K+N    +++E  +     + S
Sbjct: 796  VPELLVDLTPEEEEKERRIKAQLAAKEASDEEQLNAEMKKNCE--RIMEAQIVGMGINQS 853

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
             P L AI+ D  ++ Y+ +            +P S    L ++         LR S    
Sbjct: 854  HPILMAIV-DEQVIMYEMFA-----------NPNSQPGHLGIAFRKLPHFICLRSSPYLK 901

Query: 898  DAYTR------EETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQ 948
                R      EE     P   I  F+ +S  + G  + G+ P   +V+     ++ HP 
Sbjct: 902  SDGKRAAFQIVEEDGKRYPL--IHSFERVSTVNNGVIIGGAVPT-LLVYGAWGGMQTHPM 958

Query: 949  LCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
              DGSI AFT  +  N  +GF+Y+T  +  L+I ++ +   Y+  +PV+K I +  T H 
Sbjct: 959  TIDGSIKAFTPFNIDNVPYGFVYMTQKKSELRIAKMHADFDYEMPYPVKK-IEVGRTIHS 1017

Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYE 1065
            + Y    ++Y ++ SVP  KP N++  ++ D  QE  H+ D + +  +     YT+  + 
Sbjct: 1018 VRYLMNSDVYVVVSSVP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLF- 1072

Query: 1066 VRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGE 1122
                    +   W+      I  +  E       V L + +T    ET LAIGT    GE
Sbjct: 1073 --------SSQDWKAVPNTEISFEDMEAVTACEDVALKSESTHTGFETYLAIGTVNNYGE 1124

Query: 1123 DVAARGRVLL 1132
            +V  RGR++L
Sbjct: 1125 EVLVRGRIIL 1134


>gi|134025022|gb|AAI35011.1| LOC564406 protein [Danio rerio]
          Length = 348

 Score =  189 bits (479), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 107/291 (36%), Positives = 168/291 (57%), Gaps = 14/291 (4%)

Query: 94  DGIS-AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
           DG S    LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H
Sbjct: 58  DGKSRKEKLEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTH 113

Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
            L+  S+H FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L   +    
Sbjct: 114 DLKTLSLHYFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT-- 168

Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTW 270
            + DE     G G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW
Sbjct: 169 -LADEQEGIVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTW 227

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
            GRV+ +  TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ 
Sbjct: 228 PGRVAVRQDTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLL 287

Query: 331 YHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
           Y +QS     ++LN+      +    P+    + LD + A+++ +D  ++S
Sbjct: 288 YLNQSVPPFGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVIS 338


>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
 gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
           factor subunit 1; AltName: Full=Cleavage and
           polyadenylation specificity factor 160 kDa subunit;
           Short=CPSF 160 kDa subunit
          Length = 1454

 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 150/576 (26%), Positives = 266/576 (46%), Gaps = 81/576 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++ F+DAK+S++  ++    ++  S+H FE+    +L+ G  ++   P+V+ DP  
Sbjct: 92  QDSILMTFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFTTYFNPPIVRTDPAN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +    ++                  S++I L+ +D  + +V 
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLEGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
              LL++P P+GG +V G+NTI Y +Q+   C + LN+     D   + P        + 
Sbjct: 251 CNSLLSIPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + ++ GDL LL +V    G  V+ L+ SK   + +   +T     
Sbjct: 308 LDCSTSVYMEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG 481
             F+GSRLGDS L+++T                  ++    S K+ R    +  +  ++ 
Sbjct: 368 HLFVGSRLGDSQLLEYTL-----------------LKVTKESAKKQRLEQQNPSEIELDE 410

Query: 482 EELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
           +++ LYG A     +++ E   ++  F   D L+N+GP+K   +G R N  ++    +K+
Sbjct: 411 DDIELYGGAIEMQQNDDDEQISESLQFRELDRLLNVGPVKSMCFG-RPNYMSNDLIDAKR 469

Query: 537 SN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLIISL 583
            +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+S 
Sbjct: 470 KDPVFDLVTASGHGKNGALCVHQRSMRPEIITSSLLEGAEQLWAVGRKENESHKYLIVS- 528

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMTQD 642
             R+ ++          E   +     T+AAG L      +QV     A + DG  M Q+
Sbjct: 529 RVRSTLILELGEELVELEEQLFVTNEPTVAAGELLQGALAVQVTSTCIALVTDGQQM-QE 587

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           +              N  V+  SI DPYV +   +G
Sbjct: 588 VHI----------DSNFPVVQASIVDPYVAVLTQNG 613



 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 107/442 (24%), Positives = 190/442 (42%), Gaps = 63/442 (14%)

Query: 723  RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
            ++   DA +S+  GE  D      +D    YS      VV +++G + I  +P+   V+ 
Sbjct: 737  KRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLVVAHDNGRITIHSLPDLELVYQ 791

Query: 777  VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT--------GQGRKENIHSM------ 822
            + +F +    +VD  + E  K+ + +  ++ E+           +  ++ ++S       
Sbjct: 792  IGRFSNVPELLVDMTVEEEEKEKKAKQTAAQEKEKETEKKKDDAKNEEDQVNSEMKKLCE 851

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
            KVVE  +     + + P L AI+ D  ++ Y+ +    P+        V+  +   +  +
Sbjct: 852  KVVEAQIVGMGINQAHPVLIAII-DEEVVLYEMFASYNPQPGHLG---VAFRKLPHLIGL 907

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE 941
              S   N+   R P +     E  HG     I  F+ IS  + G  + G+ P   +V+  
Sbjct: 908  RTSPYVNIDGKRAPFEM----EMEHGKRYTLIHPFERISSINNGVMIGGAVPT-LLVYGA 962

Query: 942  --RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKV 998
               ++ H    DGSI AFT  +N N  HGF+Y+T Q   L+I ++     YD  +PV+K 
Sbjct: 963  WGGMQTHQMTIDGSIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYDMPYPVKK- 1021

Query: 999  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLH 1056
            I +  T H + Y    ++Y ++ SVP  KP N++  ++ D  QE  H+ D + +  +   
Sbjct: 1022 IEVGKTVHNVRYLMNSDIYAVVSSVP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAP 1077

Query: 1057 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN------ET 1110
              YT+  +              Q  A +P    E      V  + +   K        +T
Sbjct: 1078 PKYTLNLFSS------------QDWAAVPNTEFEFEDMEAVTAMEDVPLKSESRYGGLDT 1125

Query: 1111 LLAIGTAYVQGEDVAARGRVLL 1132
             LA+ T    GE+V  RGR++L
Sbjct: 1126 YLALATVNNYGEEVLVRGRIIL 1147


>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
          Length = 1451

 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 182/695 (26%), Positives = 313/695 (45%), Gaps = 77/695 (11%)

Query: 57  NLVVTAANVIEIYVVR--VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
           NL V A N++ +Y +R  V E G+   +                   EL   + L G V 
Sbjct: 45  NLAVAAGNMLSVYRIRSSVDEAGNHFDR------------------FELCDEFELWGIVV 86

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            +  L   G+     RDS++L+ E++K  ++E++     L   SMH F+  +   L+RG 
Sbjct: 87  CMTRLRLAGS----VRDSLLLSIEESKCVIVEYEPDTGSLSTISMHFFQDED---LRRGF 139

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS-GLVGD--EDTFGSGGGFSARIE 231
              +   L +VD   RC  VLVYG  + +L   +     L G   +  F    GF A   
Sbjct: 140 RKLSSMALARVDGFNRCAAVLVYGSYLAVLPFRRSTERDLSGQRHQAVFYENSGFIA--- 196

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
             ++I+L+ L +K   V DF F+ GY +P +++L+E   TW GRV+ +  TC + ALSI+
Sbjct: 197 --NMIDLQSLPVKIASVLDFQFLEGYNDPTILLLYEALPTWTGRVTERQDTCGMVALSIN 254

Query: 290 TTLKQHPLIWSAMNLPH-DAYK--LLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNY 345
              + HP+IW    LP  + Y   L  +P P+GG L+   N++ Y  QS     +ALN+ 
Sbjct: 255 LIDETHPVIWQMAGLPFPNPYSSALFPIPKPLGGSLLFATNSLIYLDQSVPPYGVALNSL 314

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
            +   +     +    + L    A  L +D   +S ++GD+ ++T+  D    V+R  L 
Sbjct: 315 PLGCTNFALKTQDVAPLNLQNCKACMLSDDSICVSLESGDVYIITLKKDSLNNVRRFYLD 374

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
           +   SV+ + ++ + ++L FLGSRLG+SLL+++ C   +   S+ L+    D        
Sbjct: 375 QVASSVIPTTLSKLSDNLIFLGSRLGNSLLLRYKCKENSKKSSTSLENGEKDGVEIENKE 434

Query: 465 KRLRRSSSDALQDMVNG------EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
           +     + +  +   NG      +++  YG    N +    ++ F   D+L NIGP    
Sbjct: 435 EEKNELNFEIEKSSENGSPENKRKKMRYYGDEIFNLD-VNTSYDFETMDNLSNIGPCGPV 493

Query: 519 SYGLRINADASATGI---SKQSNYELVELPGCK----GIWTVYHKSSRGHNA-------- 563
                 N + +   +   ++  N ++  L G      G  TV HKS R   A        
Sbjct: 494 ELIHTANHNDNYDHVGSDARDRNIDVCVLSGKDKTGFGSITVLHKSVRPSIASQFPFPMN 553

Query: 564 --DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV-TESVDYFVQGRTIAAGNLFGR 620
             D   +   ++E H+ L+++ + +TMV +T  +L E+  E        +TI    +   
Sbjct: 554 FSDMWTLRRSEEETHSLLVMTKKDQTMVFQTGAILEELKKEECGLATNAKTIFCATIGNG 613

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS- 679
           + ++QV  R   ++D    TQ+         SG       ++ V+  DPYV++  S G+ 
Sbjct: 614 KYIVQVLPRAVVLVDMD--TQETIQNKPFDLSGQ------IIQVA-CDPYVVILASKGTI 664

Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
           I L++ + S  T  ++T  A E   +      + H
Sbjct: 665 ISLVLFENSDGTAMLKTSTAPECKNQDDPEKKIMH 699



 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 99/408 (24%), Positives = 177/408 (43%), Gaps = 53/408 (12%)

Query: 760  SGALEIFDVPNFNCVFTV-DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
            +G+LEI+ +P+  C+    D+  +    I++T               S  EG+ +GR+ +
Sbjct: 814  NGSLEIYSLPD--CLLRFGDRNFANAPRILET---------------SRFEGS-EGRRVD 855

Query: 819  IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
            +  + V E+ +       S P++  ++ D  ++    Y F    N  +++ PV + R + 
Sbjct: 856  V--LDVQEMNVFNMGPS-SLPYIVVMIGDQLMI----YRFRATLNRFQTESPVLSGRFIK 908

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
            + +    + + LR      D  ++ +  +    ++   F NIS H G FL G+ P W   
Sbjct: 909  LQD----KTKLLRRIPGVHDESSKTKNRNNKIMRQ---FMNISDHNGIFLGGAYPTWIFC 961

Query: 939  FRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQ 996
             +  RL +H    +G + AFT   N  C  GF+Y   S   L +  L     YD  WP +
Sbjct: 962  GQNGRLNIHSMWQEGFVNAFTPFDNEKCADGFLYFRHSTKTLTVANLQPFLKYDADWPFK 1021

Query: 997  KVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL--IDQEVGHQIDNHNLSSVD 1054
            K I L  TP   +Y  E+ +      + V    ++ + +L  I+ E GH+ +  +L  V 
Sbjct: 1022 K-IKLNYTPCFSSYDLEQKV------LTVCGSRSEKIEMLPKINAE-GHK-EYEDLPEVQ 1072

Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKE-NETL 1111
               T    ++ V +  P      W+    + I M + E+ L  R V L +  +    +  
Sbjct: 1073 NVETQLFPQFFVEMFSP----ASWEVIPNSRIEMDAHEHILCCRSVYLKSEASMSGRKQY 1128

Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +AIGT+ + GED  +RGR++L         P   +    Y  +F + Q
Sbjct: 1129 IAIGTSNICGEDFQSRGRLILLEVIDVVPEPGKPLTRYKYKTVFDASQ 1176


>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
           [Trichinella spiralis]
 gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
           [Trichinella spiralis]
          Length = 1376

 Score =  187 bits (476), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 165/616 (26%), Positives = 286/616 (46%), Gaps = 74/616 (12%)

Query: 99  ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
           AS ELV   +++G + S+AI    G     + D I+LA +DAK+SV+ +D   H L   S
Sbjct: 65  ASFELVLSEQVYGRLASVAIARLTGF----QLDVILLAIDDAKLSVVGYDIETHSLVTLS 120

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVG 215
           MH +E   +   K G   F   P++++DP+ RC  + +YG  +++L   + S   S  + 
Sbjct: 121 MHYYEDDLF---KLGFTRFEIPPMLRMDPERRCAAMTIYGAHLVVLPLVRESLYESMNIV 177

Query: 216 DEDTFGSGGGFSARIESSHV-INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
           D      G  FS R+ S  V  N  D  M +V D  F+HG+ EP +++L+E   T AGRV
Sbjct: 178 DPSQ-RPGWPFSLRLTSYTVAFNAIDAKMHNVTDMCFLHGFYEPTVLLLYEPTQTTAGRV 236

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
             +  T  I A+S++   K H +IW+  NLP DA+ LLA+P P+GGVL+   N+I Y +Q
Sbjct: 237 VVRQDTYQILAVSLNPKDKTHAVIWTLGNLPFDAFALLALPKPLGGVLLFSVNSIIYLNQ 296

Query: 335 SA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
           S   C + +N+      +     RS   V LD +HA  + +  A L  ++G + ++++++
Sbjct: 297 SVPCCGILINDNGRGFTNYPLRDRSELMVTLDGSHAALIDSANAALVLRSGLVFVVSLLF 356

Query: 394 DG-RVVQRLDLSKTN-----PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
           D   +V+ + L+ ++     PS +++    + ++  F+GS +G+S L  +       +++
Sbjct: 357 DRLNMVKEILLTASSVRGAAPSTVSA---CVSSNCLFVGSAIGNSALYAYEAIEQVDVVA 413

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN--NTES-AQKTFSFA 504
             L                 R +  + L DM       LYG       TE+  Q  F F 
Sbjct: 414 VTLPA---------------RDTGLNLLDDM------QLYGELIRPCTTETLVQTKFEFR 452

Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
             D L ++GP +  + G    A  +       S++ +   PG  G +TV  +S R     
Sbjct: 453 RLDQLASLGPCRAITVGESSVAMVNNFYEDYVSDWLVAGGPGTDGSFTVMQRSVRPRLLT 512

Query: 565 SSRMAAYDDEYHA-----------------YLIISLEARTMVLETADLLTEVTESVDYFV 607
            +R+    + +                   Y++++ + RT+V   +  +TE+ ++  + +
Sbjct: 513 QTRVEDVLNAWSVGAQLIGSVDRSASPRPQYMLLTTKQRTVVFTLSSGITEIFDT-GFEI 571

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           +  TIA G++     V+QV +    +L      Q ++                V   S+ 
Sbjct: 572 RFETIACGDMMNGAYVVQVTKENLVLLHRGQQVQCINL----------RVFEEVCQASVI 621

Query: 668 DPYVLLGMSDGSIRLL 683
           DPYV L +  G + L 
Sbjct: 622 DPYVALIVRHGHVLLF 637



 Score = 87.0 bits (214), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 86/345 (24%), Positives = 149/345 (43%), Gaps = 54/345 (15%)

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSK----------------------------SDD 869
            RPFLFA++ +  +L Y+A+ +  P+   +                            +DD
Sbjct: 755  RPFLFAVVEE-QLLIYEAFHYPYPQQRYRLSVRFKKVRHTAILQRFRRIGRDDFKLLADD 813

Query: 870  -----PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISG 922
                      R  S  + + SR R  R S    +A+  E     + AP ++++ F+N++G
Sbjct: 814  FQFSEQYRRRRKRSKHDSNRSR-RGDRHSGRRQEAHEHEPYRLTYEAPARQLSPFENVAG 872

Query: 923  HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G F+ G  P +C + ++  LR+HP   DG +VAF    +      F Y T+ G++++ 
Sbjct: 873  YAGLFIGGGYPYFCFLSKQGDLRLHPMHIDGPVVAFAPYCSPKQLRAFAYFTADGMMRVS 932

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
             LPS   +D   P  KV  L    H + Y  E + Y L  S  +  P ++V++L+ D + 
Sbjct: 933  SLPSKFDFDRSIPSMKV-ELGRAAHFVVYLMESHTYALTTSEQM--PCHKVVTLIGDDKQ 989

Query: 1042 GHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTR--ATIPMQSSENALTVRVV 1098
                D         H  Y T+E++++++   D     W     A +     E+    + V
Sbjct: 990  FETFDREA-----PHFIYPTMEQFKLQLYSAD----TWLPVPGAELDFDEFEHVTACQEV 1040

Query: 1099 TLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1142
             L +  +    ++ LAIGT    GE+V  RGR+L+         P
Sbjct: 1041 QLKSEGSASGLQSYLAIGTVLNYGEEVLIRGRLLIIDVVEVVPEP 1085


>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
 gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
            [Aspergillus oryzae 3.042]
          Length = 1389

 Score =  186 bits (473), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 251/1093 (22%), Positives = 434/1093 (39%), Gaps = 170/1093 (15%)

Query: 131  DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
            ++I+LAF +AK++++E+D   +G+   S+H +E  +        +  + G ++ VDP  R
Sbjct: 88   EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147

Query: 191  CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
            C  V  +G++ + IL   Q G  LV D+      +  GS G                 A 
Sbjct: 148  CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206

Query: 230  IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              SS V+ L  LD  + H     F++ Y EP   IL+ +  T    +  +      +  +
Sbjct: 207  YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
            +    +    + S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++
Sbjct: 267  LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326

Query: 347  VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
              + S     +S  ++ L+      L   N   LL   TG++VL+    DGR V  + + 
Sbjct: 327  RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386

Query: 405  KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
               P           S    +G+   FLGS   DS+L+      G S+ SSG K+   + 
Sbjct: 387  PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440

Query: 455  GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
               E D+       +S  D  +D +     E+ + G   +        ++F   D L+NI
Sbjct: 441  RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500

Query: 513  GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
            GPLKD ++G    +          S  ELV                            +L
Sbjct: 501  GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560

Query: 545  PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
              C  +WT    +S  H  ++   +   A + E H Y+++S +A +   E +++     +
Sbjct: 561  ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613

Query: 602  SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
             +  F        +  TI  G L G+ RV+Q+     R  DG     DL      P   E
Sbjct: 614  ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
                SE    +S S+ DPYV +   D ++ LL  D S     V+    I +SK   +SC 
Sbjct: 669  --DTSEERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSK--WTSCC 724

Query: 712  LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
            LY DK                TG+  +I  A    L Q  +   +  +   L I+ +P+ 
Sbjct: 725  LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767

Query: 772  NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
              +      + G   +      E  K S T              +E +  + V +L    
Sbjct: 768  KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808

Query: 832  WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
            WS   S P+L        +  Y+ ++      T    +P +    L  +N+   R+    
Sbjct: 809  WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857

Query: 892  FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
             +    D  + EE     P   + I  NISG    F  G  P + +           L  
Sbjct: 858  -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
            G   + +      C  GFI + S+G++ +CQ+P G   D  W +Q+ IP+      + Y 
Sbjct: 914  GYAQSLSKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQ-IPIGEQVDHLAYS 972

Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP 1071
            +   +Y +  S            L  D E+  +  N   S         V+   ++++ P
Sbjct: 973  SSSGMYVIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRSSLKVVSP 1021

Query: 1072 DRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRV 1130
                  W    +  +  +E+ + V+ ++L  +  T E + ++ +GTA+ +GED+A+RG V
Sbjct: 1022 KT----WTVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCV 1077

Query: 1131 LLFSTGRNADNPQ 1143
             +F   +   +P+
Sbjct: 1078 YVFEVIKVVPDPK 1090


>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
          Length = 1506

 Score =  186 bits (473), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 180/690 (26%), Positives = 297/690 (43%), Gaps = 128/690 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLV    NV+ +Y + VQ +G+ + +   E      +DG+     + V   R  GN    
Sbjct: 29  NLVTVQGNVLSVYNL-VQAQGAADKRCHLEADISFTLDGVP----QDVATVRPRGN---- 79

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLK 171
                        RD +I  F+DA+++++ FD  +  L   S+H FE  +     W   +
Sbjct: 80  ------------SRDLLIFTFKDARVAIVRFDPKMRDLETVSLHAFEDTDTKLGGWHSEQ 127

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF-GSGGGFSARI 230
           R R        V VDP  RC  ++VYG ++I++  S G +    + DT   +   F++R 
Sbjct: 128 RLR--------VCVDPLHRCAALMVYGCKLIVISFSSGTATAAPEADTQEDTEQSFTSR- 178

Query: 231 ESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
               VI+L  L   +  V D  F+ GY  P + ILH+    W G ++    T  ++ALS+
Sbjct: 179 ----VIDLLSLPSTIGRVDDMAFLDGYDVPCLAILHQPRPAWVGHMAKTKDTAHVTALSL 234

Query: 289 S------------TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
           +                  P++W   NLP D + L  VP+P+GGV+V+G N + Y +QS 
Sbjct: 235 ALDEMTARRAPTAPPPPPPPVVWHQENLPSDTFALQPVPAPLGGVVVIGVNVLFYVTQSL 294

Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
             +LALN Y+ +  ++    ++  S++LD AH   L     L +  +GD+ LLT+V    
Sbjct: 295 VRSLALNGYSRASTNAPIQEQTGISLDLDGAHHALLTPTQILFALPSGDIHLLTIVCTDV 354

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--K 451
            V  L + K   SV+ SDI T+G    F+ SR   SLL+++      + T +  SG+  +
Sbjct: 355 TVDGLRMDKLATSVIGSDICTLGRRHIFIASRHATSLLLEWAPIPLSATTHIDVSGVSGR 414

Query: 452 EEFGDIEADAPSTKRLRRSSS------------DALQDMVNGEELSLYGSASNNTESAQK 499
           ++ G     + ST  L  S+S            D   D+V+G     +G  S        
Sbjct: 415 DDAGLYGTSSDSTAALNTSASRDGSSTGGDDLDDVYGDVVDGGTTGAHGIGSGGR---VM 471

Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE---------------- 543
           T     RD+L  + P+K  + G   +A         +S YELV                 
Sbjct: 472 TVKLMARDALPTVAPIKSTAVG--TSAQGVVPHADPRSQYELVSCIGHDKNGALANISYS 529

Query: 544 -----------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
                      L   K  W V+  +S+               +H +++ S   +TMV   
Sbjct: 530 LKPQVLLTEDALSSVKDCWAVHSNNSK---------------HHTHVVFSKPKKTMVFRV 574

Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
           A    ++     +  +  T+ AGN+ GR+ V+QV  +   +LD     +D  F       
Sbjct: 575 AGDFEQLRHPRGFDTEASTVFAGNVMGRQLVLQVTAKHVMLLDD----RDCVFDERM--- 627

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
               +   +  VS+ADPY+ L ++D + ++
Sbjct: 628 ---KKGVRITKVSVADPYIALLLNDATTKV 654



 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 63/294 (21%), Positives = 108/294 (36%), Gaps = 45/294 (15%)

Query: 757  CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN------------ 804
            C ++G L IF VP+   VF    F        D+  R+ +   E E              
Sbjct: 807  CDKNGVLSIFQVPDMREVFCCTVFSVLPNVAWDSVYRKEIGPVELEPEMPLKRAKTMDEK 866

Query: 805  -----------SSSEEGTGQGRKENIHSMKVVELAMQRWSA----HHSRPFLFAILTDGT 849
                       +  E G+ Q  ++    ++  E+ +    A      SRP LF       
Sbjct: 867  GQSVFVEADEEADDESGSAQAEEDEQDRLQRKEMTIVELLAIGLGRGSRPHLFLRNETQH 926

Query: 850  ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
            ++ Y+ +       TS      S  R          RLR      T +D    + +    
Sbjct: 927  VIVYEIF-------TS------SYKRHEKYEGRLQIRLRKRHQHPTWIDERLAQSS--SI 971

Query: 910  PCQRITIFKNISGHQGFFLSGSRPCW--CMVFRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
            P      F +ISG  G F+   RP W  C    + +R H    DG++  FT L +     
Sbjct: 972  PPAAFRPFADISGCDGVFVCARRPSWFMCDHTHKVVRHHAMRFDGAVQCFTQLKHAMHTS 1031

Query: 968  GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
             F+Y T +G++++    +G       P ++  P+KA+   + +  E  +Y +++
Sbjct: 1032 CFLYFTGKGVMRMATTAAGQVLSTPLPSRRT-PIKASACYVDFDPESGVYVVVL 1084


>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
 gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
           factor subunit 1; AltName: Full=Cleavage and
           polyadenylation specificity factor 160 kDa subunit;
           Short=CPSF 160 kDa subunit
 gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
          Length = 1454

 Score =  186 bits (471), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 162/589 (27%), Positives = 273/589 (46%), Gaps = 84/589 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++ F+DAK+S++  ++    ++  S+H FE+    +L+ G  +  + PLV+ DP  
Sbjct: 92  QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +                  S RI S +VI L+ +D  + ++ 
Sbjct: 149 RCAACLVYGKHIAILPFHEN-----------------SKRIHS-YVIPLKQIDPRLDNIA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSS---FSVE 363
             +LL +P P+GG LV G+NT+ Y +Q+   C L LN+     D   + P        + 
Sbjct: 251 CSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS---CYDGFTKFPLKDLKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + ++ GDL LL ++    G  V+ L+ SK   + +   +T     
Sbjct: 308 LDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD--ALQDMV 479
             F+GSRLGDS L+++T    T                   + KRL+  + D  A +  +
Sbjct: 368 HLFVGSRLGDSQLLEYTLLKTTRDC----------------AVKRLKIDNKDPAAAEIEL 411

Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + +++ LYG A     +++ E   ++  F   D L N+GP+K    G R N  ++    +
Sbjct: 412 DEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVGPVKSMCVG-RPNYMSNDLVDA 470

Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
           K+ +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+
Sbjct: 471 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLEGAEQLWAVGRKENESHKYLIV 530

Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
           S   R+ ++          E   +     T+AAG L      +QV     A + DG  M 
Sbjct: 531 S-RVRSTLILELGEELVELEEQLFVTGEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 588

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDP 687
           Q++              N  V+  SI DPYV L   +G + L  LV +P
Sbjct: 589 QEVHI----------DSNFPVIQASIVDPYVALLTQNGRLLLYELVMEP 627



 Score = 70.5 bits (171), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 96/399 (24%), Positives = 174/399 (43%), Gaps = 49/399 (12%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD-TYMREALKDSETEINSSSEEGTGQ 813
            +V +E+G L I  +P    V+ + +F +    +VD T   E  +       ++ E     
Sbjct: 777  IVSHENGRLSIHSLPEMEVVYQIGRFSNVPELLVDLTVEEEEKERKAKAQQAAKEASVPT 836

Query: 814  GRKENIHSM------KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
               E +++       +V+E  +     + + P L AI+ D  ++ Y+ +          S
Sbjct: 837  DEAEQLNTEMKQLCERVLEAQIVGMGINQAHPILMAIV-DEQVVLYEMF---------SS 886

Query: 868  DDPVSTSRSLSVSNV-------SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
             +P+     +S   +       ++S L N    R P +     +  +G     I  F+ +
Sbjct: 887  SNPIPGHLGISFRKLPHFICLRTSSHL-NSDGKRAPFEM----KINNGKRFSLIHPFERV 941

Query: 921  SG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QG 976
            S  + G  + G+ P   +V+     ++ H    DG I AFT  +N N  HG +Y+T  + 
Sbjct: 942  SSVNNGVMIVGAVPTL-LVYGAWGGMQTHQMTVDGPIKAFTPFNNENVLHGIVYMTQHKS 1000

Query: 977  ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
             L+I ++     Y+  +PV+K I +  T H + Y    ++Y ++ S+P  KP N++  ++
Sbjct: 1001 ELRIARMHPDFDYEMPYPVKK-IEVGRTIHHVRYLMNSDVYAVVSSIP--KPSNKIWVVM 1057

Query: 1037 ID--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
             D  QE  H+ D + +  +     YT+  +  +    D A  P      I  +  E    
Sbjct: 1058 NDDKQEEIHEKDENFV--LPAPPKYTLNLFSSQ----DWAAVP---NTEISFEDMEAVTA 1108

Query: 1095 VRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLL 1132
               V L + +T    ETLLA+GT    GE+V  RGR++L
Sbjct: 1109 CEDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIIL 1147


>gi|260835073|ref|XP_002612534.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
 gi|229297911|gb|EEN68543.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
          Length = 318

 Score =  184 bits (468), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 107/297 (36%), Positives = 168/297 (56%), Gaps = 38/297 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           +LVV     + +Y ++   E S++ K                  +ELV  + ++GN+ S+
Sbjct: 29  SLVVAGTTQLHVYRLKGDMEKSRKQK------------------MELVASFSMYGNIMSV 70

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G+D    RD+++L+F DAK+S++E+D   H L+  SMH FE  E   +K G  S
Sbjct: 71  ESVQLAGSD----RDALLLSFMDAKLSIVEYDPGTHDLKTASMHYFEEEE---VKDGYVS 123

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P+V+VDP+GRC  +L+YG ++++L   + G+    DE    +G   S  I  +++I
Sbjct: 124 NYHAPMVRVDPEGRCAVMLIYGKRLVVLPFRKEGAV---DEAEMSAGSKSS--ILPTYMI 178

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            L+DLD +  +V D  F+HGY +P ++IL+E   TW GRV+ +  TC I A+S++   + 
Sbjct: 179 KLQDLDERLINVVDLQFLHGYFDPTLLILYEPLQTWPGRVAVRQDTCCIVAVSLNIAQRV 238

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
           HP+IWS  NLP D  + +AVP PIGGVLV   N++ Y +QS         Y VSL+S
Sbjct: 239 HPIIWSVGNLPFDCKQAVAVPKPIGGVLVFAVNSLLYLNQSVP------PYGVSLNS 289


>gi|367052335|ref|XP_003656546.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
 gi|347003811|gb|AEO70210.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
          Length = 1460

 Score =  183 bits (465), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 277/1202 (23%), Positives = 468/1202 (38%), Gaps = 227/1202 (18%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
            NLVV  +++++++  +V     + S +SG   R             DG+ A+        
Sbjct: 28   NLVVAKSSLLQVFRTKVVSTELEASPDSGHRSRNAARYESRLANDDDGLEASFLGGDSLA 87

Query: 100  ---------SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDS 150
                      L LV    L G V  LA +    A +    DS+++A +DA++S++E+D  
Sbjct: 88   LRTDRANVTKLVLVAETPLAGTVTGLARIKTPHARHGC--DSLLIALKDARLSLVEWDAE 145

Query: 151  IHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMIIL 204
             H L   S+H +E  E       + S    PL      ++ DP  RC  +      + IL
Sbjct: 146  RHALATVSIHYYEQEEL------QGSPWAAPLSHYVNFLEADPGSRCAALKFGARNLAIL 199

Query: 205  KASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN-----------------LRDLD--MK 244
               Q    + +GD D     G   A+ +SS VI+                 L +LD  + 
Sbjct: 200  PFRQADEDIDMGDWDG-ELDGPRPAKDQSSAVIDGASNIEDTPYSPSFVLRLSNLDPSLL 258

Query: 245  HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
            H     F+H Y EP   IL              H T M+  L +    K    I S   L
Sbjct: 259  HPVHLAFLHEYREPTFGILASTASASNSLGRKDHFTYMVFTLDLQQ--KASTTILSVGGL 316

Query: 305  PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
            P D ++++ +P+P+GG L+VG+N  IH      +  +A+N       S   + +S  ++ 
Sbjct: 317  PQDLFRVVPLPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTRQCTSFGLVDQSELNLR 376

Query: 364  LDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT 417
            L+      L  D+   L+    G + L+T   DGR V  L+L    + +  S++   +TT
Sbjct: 377  LEGCVVDVLTADLGELLVILNDGRMALVTFRIDGRTVSGLELRMLPASSGGSIIPGRVTT 436

Query: 418  ---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
               +G +  F G   GDS+L                   FG  +  + + +R  R+    
Sbjct: 437  LSRVGRNAMFAGLEEGDSVL-------------------FGWAKKQSQAGRRRPRAKDAV 477

Query: 475  LQ------------DMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGPLK 516
            LQ            +  + ++L     A+    S+  +       +F + D LV+I P++
Sbjct: 478  LQMDEEAGEEEEEEEDEDEDDLYGEEPAARQQPSSTASSLMTGDLTFRIHDRLVSIAPIQ 537

Query: 517  DFSYGLRINADAS-----------------ATGISKQSNYELV------------ELPGC 547
              +YG  +    S                 A G  K ++   +            E P  
Sbjct: 538  AMTYGQPVWLPGSEEERNSAGVHSDLQLVCAVGRDKSASLATINLAIAPKVIGRFEFPEA 597

Query: 548  KGIWTV-----YHKSSRGHNADSSRMAAYD--DEYHAYLII------SLEARTMVLETAD 594
            +G WT+       KS +G  A +S    YD   +Y  ++I+        E   +   TA 
Sbjct: 598  RGFWTMCAKKPIPKSLQGDKAGASLGNGYDTSGQYDKFMIVGKVDLDGYEKSDVYALTAA 657

Query: 595  LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
                +  +      G TI AG +    R+IQV +   R  DG + ++Q L   P   E  
Sbjct: 658  GFESLGGTEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIL---PMQDEE- 713

Query: 654  SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
            +G+E   V S S+ADP++L+   D S+ +   D S     +     + S+ K ++ C LY
Sbjct: 714  TGAEPRAV-SASVADPFLLIIRDDSSVFIARIDSSNELEELDKDDPVLSTTKWLTGC-LY 771

Query: 714  HDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
             D          S   +    +G+    A         +   +   SGAL I+ +P+   
Sbjct: 772  AD----------SAGVFAEESMGKPASTAQC-------VLMFLLSASGALYIYRLPDLAR 814

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
               V + +S        Y+   L       + +  +GT    KE +  + V +L      
Sbjct: 815  PIYVAEGLS--------YIPPGLS-----ADYAGRKGTA---KETLAEILVADLG----D 854

Query: 834  AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
            + H  P+L     +  +  YQ +        S+     + S +L    V      N   +
Sbjct: 855  STHKSPYLILRHANDDLTLYQPF-------RSRKATEQAFSETLFFQKVP-----NTALA 902

Query: 894  RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDG 952
            ++P +A   +E  H      +    N+ G+   F+ G+ P + +   + + RV P L   
Sbjct: 903  KSPQEA-DEDEASHQPRFLSMRRCDNVGGYSTVFVPGASPSFIIASSKSMPRVMP-LQGS 960

Query: 953  SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFA 1012
             ++A +  H   C HGFIY  S+ I ++CQ P G  Y       + IP+      + Y  
Sbjct: 961  GVIAMSPFHTEGCEHGFIYADSRRIARVCQFPDGCIYAETGVAVRKIPIGEDIAAVAY-- 1018

Query: 1013 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1072
                +P + S  V    ++   L  D +   +    NLS        TV+   +++L P 
Sbjct: 1019 ----HPPMQSYVVGCNTSEPFELPKDDDYHKEWARENLSF-----KPTVDRGILKLLSPI 1069

Query: 1073 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1131
                 W     + M+  E  L V  + L  +  T E + L+A+GTA  +GED+  RGRV 
Sbjct: 1070 T----WTVVDAVQMEPCETILCVETLNLEVSEFTNERKQLIAVGTALTKGEDLPTRGRVY 1125

Query: 1132 LF 1133
            ++
Sbjct: 1126 VY 1127


>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Metaseiulus occidentalis]
          Length = 1500

 Score =  183 bits (464), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 122/404 (30%), Positives = 199/404 (49%), Gaps = 61/404 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS----AASLELVCHYRLHGN 112
           NLVV    VI++Y                    R++ DG++     A LE    +   GN
Sbjct: 29  NLVVAGGTVIKVY--------------------RLVCDGLNETDDKAKLEHQQTFNCFGN 68

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           +  +  +    +     RDS++  F++ KIS++E+D + H L+  ++   E  E+   K 
Sbjct: 69  ISGMEKIRLNAS-----RDSLLFVFKETKISLVEYDPATHELQTLAIRSLEKEEY---KE 120

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFS 227
           G  +F    L+KVDP  RC  VL+YG  + I+      A+     +   + T  +  GF 
Sbjct: 121 GFYNFVGNTLIKVDPLNRCAAVLIYGKHLAIIPFVKKDATDLSDPIASSKSTQTNTSGFL 180

Query: 228 ARIESSHVINLRDLD----MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
                 + I L DLD    + ++ D  F++GY EP +++L+E   TW GRV+ +  TC I
Sbjct: 181 ----EYYTIRLIDLDEEKGVNNIHDMTFLNGYYEPTLLLLYEPIRTWTGRVAIRQDTCSI 236

Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALN 343
            ALS++   + HP +WS   LP +++K+L VP PIGGVL++  N + Y +QS        
Sbjct: 237 MALSLNVYQRVHPPVWSFSGLPFNSFKVLPVPKPIGGVLILSVNALLYLNQSVPA----- 291

Query: 344 NYAVSLDSSQELPRSSFSVE--------LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
            Y VSL+   E   +SF ++        LD     +L     LLS   GDL +L++  DG
Sbjct: 292 -YGVSLNCFTEC-STSFPLKDQAGPPLTLDCCRCEFLSETKILLSVANGDLYVLSLFTDG 349

Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            R + + +  K   + + + I+       F+GSR+G+SLL+++T
Sbjct: 350 MRSINQFEFKKIATTTVATCISLCEPGYLFVGSRIGNSLLLRYT 393



 Score =  179 bits (455), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 178/661 (26%), Positives = 277/661 (41%), Gaps = 138/661 (20%)

Query: 543  ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
            ELPGC  +WTV   S+R  + D        ++ H +LI+S    TM+L+T   + E+  S
Sbjct: 603  ELPGCTDLWTVRSSSTRSPDVD--------EDSHQFLILSRPDSTMILQTGQEINELDHS 654

Query: 603  VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
              +  Q  TI AGNL   R +IQV     R+L+G    Q +               S ++
Sbjct: 655  -GFCTQSPTIFAGNLADGRYIIQVCPNSVRLLEGVKQLQQVPI----------DVGSPLV 703

Query: 663  SVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTPAAIESSKKPVSSCTLYHD---- 715
            S SIAD +VL+   DG +  L   GD +T   +SV  P     +K  +++  +Y D    
Sbjct: 704  SASIADLHVLVMSQDGLVIQLTLRGDDTTGYKLSVLKPQ-FPGAKSKITALCIYKDVSGL 762

Query: 716  -----KGPEPWLR-KTSTDAWLSTGV------------------GEAIDGAD--GGPLDQ 749
                 + PE   + KT     + T V                  G ++D  D   G L+ 
Sbjct: 763  FVTKIQKPEDIAKPKTEAKTKVKTEVAKKVLRSADFDDEDELLYGSSVDIKDLVAGGLNA 822

Query: 750  GDI-----------------------------YSVVCYESGALEIFDVPNFNCVFTVDKF 780
             +I                             +  +  E+GALEI+  P++   + V  F
Sbjct: 823  ANIVPTTQTKDTAEEEDYEENVRKIAPVEPTFWVFLARENGALEIYSFPDYKLRYFVKNF 882

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                  + +  ++ A    +T   S+SE              KV+E+ +     H SRP 
Sbjct: 883  -----PLCNKILQNAAATGQTTSASTSEAQLP----------KVMEIFVCALGMHQSRPL 927

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            LFA + D  +  Y+AY F               ++      +   RL++   +  P   Y
Sbjct: 928  LFARV-DSELHIYEAYPF--------------VNQKEGHLKLQFRRLQH-AVTMEPRRVY 971

Query: 901  TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTV 959
             ++E       + I  F+++ G+ G F+ G RP W  +  R  LR HP L DG I +F  
Sbjct: 972  KQKEGDPTLSLRWIRAFQDVCGYNGVFVCGRRPHWIFLTARGELRAHPMLNDGRIYSFAT 1031

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPL 1019
             HNVNC  GF++    G L+IC LPS   YD  WP++K IP+  TPH + Y  +   Y +
Sbjct: 1032 FHNVNCEKGFLFFNKYGELRICALPSYLNYDAPWPMRK-IPIYETPHSVNYHVDSRTYCV 1090

Query: 1020 IVS-------VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1072
              S       VP L   ++     I++E    I              TV+++ + +  P 
Sbjct: 1091 ATSKEETATCVPKLANEDKEFE-PIERESSRFIPP------------TVDKFALELWSPV 1137

Query: 1073 RAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET-LLAIGTAYVQGEDVAARGRVL 1131
                   TR  +PM+  E    V+ V + +  T   E  L+A+GT +  GED+ A+GR+L
Sbjct: 1138 SWEAIPNTR--MPMEDWEKITCVKNVMIASEGTTSGEKGLIAVGTIHNFGEDITAKGRIL 1195

Query: 1132 L 1132
            L
Sbjct: 1196 L 1196


>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Strongylocentrotus purpuratus]
          Length = 1223

 Score =  182 bits (462), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 141/456 (30%), Positives = 221/456 (48%), Gaps = 60/456 (13%)

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           RV+ +  TC I ALS++   K HP+IWS  +LP+D  ++ AVP PIGGVL++  N++ Y 
Sbjct: 11  RVAVRQDTCSIVALSLNMAQKVHPIIWSQSSLPYDCMQVQAVPKPIGGVLILAVNSLLYL 70

Query: 333 SQSASCALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGD 385
           +QS      +  Y VSL+S    S   P   +    + +D   AT++  D   LS K G+
Sbjct: 71  NQS------IPPYGVSLNSLTDWSTAFPLKTQEGVKLSMDCTQATFISYDRLALSLKDGE 124

Query: 386 LVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
           + +LT++ DG R V+   L K   SVLT+ I  +G+   FLGSRLG+SLL+++T     +
Sbjct: 125 IYVLTLLVDGMRSVRGFHLDKAAASVLTTCICPMGDGFLFLGSRLGNSLLLKYTEKVSET 184

Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----MVNGEELSLYGSASNNTESAQKT 500
             +   K E      + P+ K     +SD +      + + +EL +YG     T +   +
Sbjct: 185 SPTDASKTEEPKPGEEPPTKKMRSDDASDWMASDTKFLDDPDELEVYGKQVQKTGTQLTS 244

Query: 501 FSFAVRDSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELVE---- 543
           +SF + DSL+NIGP  +   G         + N D      + +G  K     +++    
Sbjct: 245 YSFEICDSLLNIGPCGNMIMGEPAFLSEEFQGNVDPDLELVTTSGYGKNGALSVLQRTIR 304

Query: 544 --------LPGCKGIWTVYHKSSRGHNAD----SSRMAAYDDEYHAYLIISLEARTMVLE 591
                   LPGC  +WTV  KS +   AD     S  +  D + HA+LI+S +  +MVL+
Sbjct: 305 PQVVTTFNLPGCLDMWTV--KSLKEAKADEKSEESEASPEDKDRHAFLILSKQDSSMVLQ 362

Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
           T   +TEV     +  Q  TI A N+   R ++QV  +   +++G    Q +        
Sbjct: 363 TGQEITEVAAG-GFSTQAPTIFASNMGDDRYIVQVMNKSICLMEGVEQIQHMVL------ 415

Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
                  S +   S+ADPY+LL   +G   L+   P
Sbjct: 416 ----DVGSPIKQCSLADPYLLLLTENGDPILMTLKP 447



 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 117/446 (26%), Positives = 187/446 (41%), Gaps = 49/446 (10%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V C E+G LE++ +P+    F V  F  G   +VD               S S   TG
Sbjct: 560  WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 604

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
               +++    +V E+ +        + ++ A++ D  I+ Y+A+    P NT   +  + 
Sbjct: 605  DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 659

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPL-DAYTREETPHGAP---------CQRITIFKNISG 922
              R   + +    + +  R S+ P  +  T+ ET   A            R+  F N+  
Sbjct: 660  V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKPETETEAESDTKTTSRRVNRLREFHNVQT 718

Query: 923  HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G F+SGS P W  V  R  LR HP   DG+I  F   HNVNC +GF+Y   +  L+IC
Sbjct: 719  YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 778

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
             LPS  +YD  WPV+KV PL+ TPH + Y  E   Y ++ SV       Q     + +  
Sbjct: 779  VLPSHLSYDAPWPVRKV-PLRCTPHFVAYHVETKTYAVVTSV-------QETKTHVWKVT 830

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1100
            G +I    +   D     T   + +++  P        TR  I  +++EN   ++VV L 
Sbjct: 831  GEEIGEEPVERDDRFVPTTKVVFSIQLFSPVSWDAIPNTR--IEYEAAENVTCLKVVNLS 888

Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQI 1160
               T    +  + + T +V  ED+  RG V ++        P   +      PL+   Q 
Sbjct: 889  CEGTMTGKKGYVVVATTHVYSEDLQTRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQK 948

Query: 1161 DFASH------FFAICSNSFVFVFLF 1180
               S       F   C    V+++ F
Sbjct: 949  GPVSALCEVMGFLLTCIGQKVYMWQF 974


>gi|393907593|gb|EJD74705.1| CPSF A subunit region family protein [Loa loa]
          Length = 990

 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 167/640 (26%), Positives = 276/640 (43%), Gaps = 83/640 (12%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE +   RL   V+S AI        +   DS++L F+DAK+S++  + +   L+  S+H
Sbjct: 62  LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G       P+++VDP  RC  +LV+G  + +L  +  G+ L       
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+ + L  +D +  +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
            T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337 -SCALALNNYAVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
             C ++LN+    +D   + P   F    + LD    T +  +  LL  + G L  L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLV 332

Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            D    V+ L+L     +V+   +T+      F+GSRL DS+ +       T        
Sbjct: 333 TDATNSVKSLELKFQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------- 384

Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT---ESAQKTFSFAVRDS 508
                ++  AP   +L  +  +A +D    E+  LYG         +SA++  +  V D 
Sbjct: 385 -----LDESAPKKIKL-NTELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDK 434

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
           L+N+GP K  + G    +        K   ++LV   G    G   ++ +S R     SS
Sbjct: 435 LLNVGPCKKITGGCPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSS 494

Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
            +            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L
Sbjct: 495 SIEGVVQYWAVGRREDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGEL 553

Query: 618 FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
                 +QV      ++      Q +                 V S SI DPY+ +   +
Sbjct: 554 ADGGLAVQVTTSSLVMVAEGQQIQHIPL----------QLTFPVRSASIVDPYIAICTQN 603

Query: 678 GSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
           G + +  L   P      +     +     P++S ++Y D
Sbjct: 604 GRLLMYELTSHPHVHLKEIDISKRLRHETSPITSLSIYRD 643



 Score = 80.1 bits (196), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 70/274 (25%), Positives = 123/274 (44%), Gaps = 30/274 (10%)

Query: 759  ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEEGTG 812
            E+G + I+ +P  + V+ V K     +H+ D    +   D E       +  S++   T 
Sbjct: 733  ENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMTDTF 788

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
              + E +    ++EL M     +  RP LF +L D T+  Y+ + +    N         
Sbjct: 789  AAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGHLAV 839

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGFFLS 929
              + L  + V+    R+ RF    LD     E+   A   +  +  F+ I     G F+ 
Sbjct: 840  RFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFIC 893

Query: 930  GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGST 988
             S PC   +     R+HP   DG I++FT  +N  C +GFIY+T  + ++++ +LP+   
Sbjct: 894  SSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLPNDMI 953

Query: 989  YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
             D  +PV++ I + A+ H +TY    N Y ++ S
Sbjct: 954  LDTSYPVKR-IDVGASVHSVTYLLHSNTYAVLTS 986


>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1407

 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 266/1216 (21%), Positives = 474/1216 (38%), Gaps = 282/1216 (23%)

Query: 57   NLVVTAANVIEIYVVRVQEEGS-----KESKNSGETKRRVLM-----DGISAAS------ 100
            NLVV   ++++I+ +   E+        E+K+ G + RRV       D  +  S      
Sbjct: 31   NLVVAKTSLLQIFRLVEYEDAEGEFALDEAKDEGGSDRRVFEGRDHEDSFTVESGMHLQR 90

Query: 101  --------LELVCHYRLHGNVESLAIL----SQGGADNSRRRDSIILAFEDAKISVLEFD 148
                    L+LV  Y L+G+V S+  +    S+ G D       ++++F+ AKIS+LE+D
Sbjct: 91   ETIEKTTKLDLVAQYHLYGSVTSMVKIRIPTSKSGGD------CLLVSFDSAKISLLEWD 144

Query: 149  DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGV-LVYGL 199
             + H +   S+H +E  E+           R PL           DP+ RC      + L
Sbjct: 145  PAAHSISTISLHYYEGDEF-----------RSPLTPEFPINYLISDPKSRCAAFKFNHDL 193

Query: 200  QMII-LKASQGGSGLVGDEDTF-----------------------GSGGGFSARIESSHV 235
              I+  + ++     + D D+F                       G G         S V
Sbjct: 194  VAILPFRQTEDEDLEIPDNDSFTYDLEDDDDAEKPKKDVEMKDNTGEGKPSDTPYHPSFV 253

Query: 236  INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
            ++   LD  ++ + D +F+H Y EP   I+++ +    G +  +        +++    +
Sbjct: 254  LSASQLDESVERIIDIVFLHEYREPTFGIVYQPQQGSVGMLERRKDPTHFIVVTLDLDQR 313

Query: 294  QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSS 352
                I SA NLP D +K +A+P PIGG L++G + I H        A+A+N+YA    + 
Sbjct: 314  ASTSIMSAKNLPFDIWKAVALPPPIGGTLLLGEHEIVHVDQAGKMSAVAVNSYAQQYSAF 373

Query: 353  QELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP-- 408
                +S   + L++  A  L N+    L+ T  GD  +L+   +GR +  L + +     
Sbjct: 374  NMTDQSDLELNLESCSAISLPNENGDVLIVTIAGDFAILSFKAEGRSISSLSVRRIQSKD 433

Query: 409  -----SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
                 S     +  +GN  FFLGS   D++L  +      + LS               S
Sbjct: 434  GYPFTSAPCETLVEVGNRRFFLGSLDSDAMLWGYKRKGEKTSLSQK-------------S 480

Query: 464  TKRLRRSSSDALQDMVNGEELSLYG------------SASNNTESAQKTFSFAVRDSLVN 511
              +L R  +       + +E  LYG            S+ N    +   + F   D L N
Sbjct: 481  EVKLERDDA-EDNVEDDDDEDDLYGESTVTPITPRKASSGNIGRGSSGEYVFRRHDRLQN 539

Query: 512  IGPLKDFSYG--------LRINADA-------SATGISKQSNYEL------------VEL 544
            +GP +  ++G        L+++          + TG   +    +             + 
Sbjct: 540  VGPCRQMAFGRPAMLPEKLKLHQGVLPELELMATTGRGVEGAVTVFNTSICPRVSATFDF 599

Query: 545  PGCKGIWTVYHKSSRGHNA--DSSRMAAYDDE------YHAYLIISLEARTMV------- 589
              C+ +W V+ K  +   +   SS    Y+++      Y  YL  S  + T+V       
Sbjct: 600  KDCQRLWAVHSKQVKKGQSMIPSSVSKGYEEQIGATEDYSTYLFASNTSETLVYKVGTKF 659

Query: 590  --LETADL-LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
              LE  D+  TEV  ++++         G      R+ QV E   ++ D     Q +   
Sbjct: 660  EPLEGTDIETTEVCPTLEF---------GTFQDGLRIAQVCETNVKVYDSEL--QLIQII 708

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKK 705
             +N E   G  +  ++S S ADPY+LL   D SI        T  +  ++ PA I+ +K 
Sbjct: 709  STNDEDPDGGPH--IVSASFADPYMLLICGDSSILACQCHERTLELDRIELPATIKDTK- 765

Query: 706  PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY---ESGA 762
                                 T+  L T   E            G    V+C+   E G 
Sbjct: 766  --------------------YTNGCLYTSSSEV--------FGLGTKSQVLCFLLTEEGT 797

Query: 763  LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG-QGRKENIHS 821
            L++F +PNF    T++ F                 D   ++ S  E        ++ I  
Sbjct: 798  LQVFTLPNFELKATLEHF-----------------DMSLQLVSPDETALRFHTARDEIEE 840

Query: 822  MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
            + V +L      A    P+L        I+ Y+ ++  G                     
Sbjct: 841  IIVADLGDNISKA----PYLIVKTKRDDIIIYEPFISNG--------------------- 875

Query: 882  VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
            +   ++ N   +  P  + + +++P G P  +I    ++ G+   F++G  P +     +
Sbjct: 876  ICFKKIYN---TVLPTVSLSEQKSPSG-PLVKI---DDLGGYSVAFMAGDTPTFITKSSK 928

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPL 1001
             L    +L  G + + +  +      GF+Y+ S+G  ++C  P  S  ++ W  Q+ IPL
Sbjct: 929  TLPKLYKLQGGMVRSLSPFNTKETERGFLYIDSKGTARVCHFPEVSM-EHTWLSQR-IPL 986

Query: 1002 KATPHQITYFAEKNLYPLIV---SVPVLKPLN-QVLSLLIDQEVGHQIDNHNLSSVDLHR 1057
            + TP  +TY+  KN+Y + V   S P +   + Q+   L+D+ +  +++  +L  +    
Sbjct: 987  ERTPTSLTYYDPKNVYVVSVLSTSKPEVDDEDFQMEEGLVDETLLPELETGHLVMISPVT 1046

Query: 1058 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA 1117
              T + YE  + E      P+  +A + ++ SE              TKE + L+A+GT 
Sbjct: 1047 WTTTDRYEFPVHEV-----PFVVKA-VELEISE-------------VTKERKVLIAVGTG 1087

Query: 1118 YVQGEDVAARGRVLLF 1133
             ++GE+  ARG V +F
Sbjct: 1088 LLRGENSPARGAVYVF 1103


>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
          Length = 1595

 Score =  181 bits (459), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 183/700 (26%), Positives = 296/700 (42%), Gaps = 139/700 (19%)

Query: 501  FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQSNY---------ELV 542
            + F+V+DSL+ I P+ D + G    +  D        +A G  K             ELV
Sbjct: 668  YKFSVKDSLLCISPVVDLTVGASAPVGTDLDPRTELVAACGHGKNGALAILTRGITPELV 727

Query: 543  ------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA--RTMVLETAD 594
                   LPG +  W      +   N  + R    D+ +  +LI+SL +   TMVLET +
Sbjct: 728  TEVESGALPGLRACWAT---RTEDDNDGTVRPKRKDELFDEHLILSLSSTKTTMVLETGE 784

Query: 595  LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG- 653
             L EV++ VD+ V   T+A   +F  R + QV +   R     +  +   F   + +   
Sbjct: 785  ELREVSKEVDFIVDEETLACERIFNGRAIAQVTKTKIR-----FTRKGKKFAVDDIDLAF 839

Query: 654  -SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-------ESSKK 705
              G E + +    I +  + L +SDGSIR+++GD  T T ++              ++  
Sbjct: 840  LKGGEGAQITLAIIQNDAIALRLSDGSIRIILGDSKTNTFTLLEKVGELFASDNHSNTGS 899

Query: 706  PVSSCTLYHD----------------KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ 749
             V++ TLY D                + P  WL +T      + G  E  D +     + 
Sbjct: 900  DVTAFTLYDDSVACTDSFGGGGGGLNRAP-GWLERT------ACGDREEKDESK----EN 948

Query: 750  GDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
             ++        G L ++ +P+   +++      G         RE L  + T I+S    
Sbjct: 949  NNVVFATISRDGTLALYSLPSLKKLWSSGGVSDG---------REILAPNSTGIDSIDFN 999

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
               +  K  +  +++   A    +A + RP L     DG++L YQA  F+ P +      
Sbjct: 1000 DECEVEKYTVSDIRLDAFA----NAAYERPLLTCFRADGSVLAYQA--FKSPSSN----- 1048

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT--REETPHGAPCQ---RITIFKNIS--- 921
                                LRF+R P++  T   E T +    Q   R+T  +NI    
Sbjct: 1049 -------------------ELRFARVPIEIETAGSELTNNDVSVQGGSRLTRIENIGDGR 1089

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI-VAFTVLHNVNCNHGFIYVTSQGILKI 980
            G  G F+SG  P W +V R R+   P   +G   +AF   HNVNC  GFI  T++G +++
Sbjct: 1090 GIAGVFVSGLNPIWLIVRRGRVLALPTRGEGGARIAFAPFHNVNCPKGFILATNEGGIRV 1149

Query: 981  CQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1040
            C+LP     +  WPV+K + L+ TP  ITY  +  LY L+ S  V  P         D E
Sbjct: 1150 CRLPGKMHIEAQWPVRK-LALRCTPRAITYMNDFKLYALVTSASV--PWK-------DFE 1199

Query: 1041 VGHQIDNHNLSSVDLHRTYT------VEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
            +  + D+H  +     +         V+++ +R+L P      WQ      ++  E+ L 
Sbjct: 1200 I-DETDSHARALYRFRKEKAKSEGNVVQQFAIRLLVPGTLETAWQK----AVEPGEHILC 1254

Query: 1095 VRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS 1134
            V+ V + + +T    ++LAIGTA   GED   RGR+LLF+
Sbjct: 1255 VKNVQIRDQSTGALLSMLAIGTAMPGGEDTPCRGRILLFA 1294



 Score =  141 bits (356), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/325 (30%), Positives = 163/325 (50%), Gaps = 72/325 (22%)

Query: 181 PLV-KVDPQGRCGGVLVYGLQMIILK----------------ASQGGSGLVGDEDTFGSG 223
           P++ + DP+GRC  VL+   +   +K                +S G   +   ++  G G
Sbjct: 208 PIIGRADPEGRCAAVLLRNEEKAKVKIMPASETSTSSNYIKESSNGSKKMTTKKE--GEG 265

Query: 224 GGF-SARIESSHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKH 278
             +  A I SS  +++R +       V+D  F+HGY EPV++IL+E    TW+GR+S + 
Sbjct: 266 TVYVPATIGSSFDLDVRKILGPSAAFVRDCCFLHGYGEPVLMILYESNPPTWSGRLSLRM 325

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
            TC + A+SI  T K++ ++W+   LP  AY L  VP+P+GGVLV+ +  I Y SQS+S 
Sbjct: 326 DTCKLVAVSIDCTKKKYTIVWTREKLPSAAYSLFPVPNPLGGVLVLSSGHILYESQSSSA 385

Query: 339 ALALN----------NYAVSLD------------------------SSQELPRSSFSVEL 364
               +          N+A  +                         SS E  ++ F V+L
Sbjct: 386 TYISDFLGKGGPQEGNFAEEIARNNGVEGQAAHANPVPHVNSNKNVSSYETTQNEFQVQL 445

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTVVYD------------GRVVQRLDLSKTNPSVLT 412
           DAA    ++ +VA++S+KTG L+  TV+ +            GR  +R+ + K+  +VL+
Sbjct: 446 DAAKIEMIRENVAIISSKTGQLI--TVILETVGGAASVGSKVGRRCRRIRVLKSGNAVLS 503

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
           S +  +G  L F+GSR+GDSLL+ +
Sbjct: 504 SGLAAVGKDLLFIGSRVGDSLLIGY 528


>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
 gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
          Length = 1448

 Score =  181 bits (458), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 155/578 (26%), Positives = 263/578 (45%), Gaps = 86/578 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++AF+DAK+S++  ++    ++  S+H FE+    +L+ G  ++   P+V+ DP  
Sbjct: 92  QDSILMAFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFINYFHPPIVRTDPSN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +    ++                  S++I L+ +D  + +V 
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
              LL +P P+GG LV G+NTI Y +Q+   C + LN+     D   + P        + 
Sbjct: 251 CTSLLPIPKPLGGALVFGSNTIVYLNQAVPPCGVVLNS---CYDGFTKFPLKDMKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD A + ++++    +  + G L LL +V    G  V+ ++ S+   + +   +T     
Sbjct: 308 LDCATSVYMEDGRIAVGGRDGVLYLLRLVTSSGGATVKSMEFSRVWETSIAYCLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
             F+GSRLGDS LV++T    T  S     ++++ G+IE D                   
Sbjct: 368 HLFIGSRLGDSQLVEYTLLKMTKESAKRQKIEKDPGEIELDE------------------ 409

Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
             +++ LYG A     +++ E   ++  F   D L N+GP+K   +G R N  +S     
Sbjct: 410 --DDMELYGGAIEMQLNDDEEQILESLEFRELDRLRNVGPVKSMCFG-RPNYMSSDLAEM 466

Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
           K+ +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+
Sbjct: 467 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSILEGAEQLWAVGRKENESHKYLIV 526

Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
           S   R+ ++          E   +     T+AAG L      +QV     A + DG  M 
Sbjct: 527 S-RVRSTLVLELGEELVELEEQLFVTNEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 584

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           Q++              N  V+  SI DPYV +   +G
Sbjct: 585 QEVHI----------DSNFPVVQASIQDPYVAVLTQNG 612



 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 100/428 (23%), Positives = 183/428 (42%), Gaps = 45/428 (10%)

Query: 723  RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
            R+   DA +S+  GE  D      +D    YS      +V +++G L I  +P+   V+ 
Sbjct: 741  RRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLMVAHDNGRLSIHSLPDMELVYQ 795

Query: 777  VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR----W 832
            + +F +    ++D    E  K+ + +   ++++      +      K+ E  M+      
Sbjct: 796  IGRFSNVPELLMDMTTDEEEKERKAKAQQAAKDTAADEDQLTTEMKKLCERVMEAQIVGM 855

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              + S P L AI+ D  ++ Y+ +    P+        ++  +      +  S   N   
Sbjct: 856  GINQSHPVLMAIV-DEQVVMYEMFSHYNPQAGHLG---IAFRKLPHFICLRTSSHLNSDG 911

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR-ERLRVHPQLC 950
             R P +     E  +G     I  F+ IS  + G  + G+ P   +      ++ H    
Sbjct: 912  KRAPFEM----EVENGKRYTLIHPFERISSINNGVMIGGAVPTLVVYGAWGGMQTHQMTI 967

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
            DG I AFT  +N N  HGF+Y+T Q   L+I ++     Y+  +P++K I +  T H + 
Sbjct: 968  DGPIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYEMPYPMKK-IEVGRTIHNVR 1026

Query: 1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYEVR 1067
            Y    ++Y ++ S+P  KP N++  ++ D  QE  H+ D + +  +     YT+  +   
Sbjct: 1027 YLMNSDVYVVVSSIP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLF--- 1079

Query: 1068 ILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDV 1124
                  +   W+      I  +  E       V+L + +T    ET LA+GT    GE+V
Sbjct: 1080 ------SSQDWKAVPNTEIEFEDMEAVTACEDVSLKSESTISGVETYLAVGTVNNYGEEV 1133

Query: 1125 AARGRVLL 1132
              RGR++L
Sbjct: 1134 LVRGRIIL 1141


>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
          Length = 1468

 Score =  179 bits (453), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 163/650 (25%), Positives = 277/650 (42%), Gaps = 97/650 (14%)

Query: 88  KRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEF 147
           K+  ++   +   LELV  ++++G + ++  +           DS++L F DAK+S+LE+
Sbjct: 87  KKGGMISDTTLGRLELVAQFKMNGIITTMGTVRTNSPRGREGCDSLLLGFSDAKMSLLEW 146

Query: 148 DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL---VKVDPQGRCGGVLVYGLQMIIL 204
             S + +   S+H +E  E+      ++ F   P    + +DPQ RC     Y  ++ +L
Sbjct: 147 SSSTNSIITVSIHYYERDEF------KKEFLTNPYPSAIHIDPQQRCAVFNFYDNKLAVL 200

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
              Q  S  + +    G           S +I+L  LD  +K+V D  F+  Y EP + I
Sbjct: 201 PFRQ--SDKLDERQGEGEEDEEKWPYYPSFIIDLATLDSRIKNVIDMTFLSDYYEPTLAI 258

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
           L + E TW GR+     T  +  +S+  T K +P+I+S   LP+D +KL+A+P P+ G+L
Sbjct: 259 LFQPEQTWTGRLGNNKDTVSLVVISLDITAKIYPIIYSIDKLPYDCFKLVAMPKPVTGML 318

Query: 323 VVGANTIHYHSQ-SASCALALNNYAVSLDSSQELPRSSFS-------VELDAAHATWLQN 374
           V+ AN+I + SQ S    +A+N Y      + + P   +        + L+ A A     
Sbjct: 319 VIAANSILHVSQGSPGMGVAVNGYT---KKTTDFPGMIYEPSLIELGLSLEGAKALAFGG 375

Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN---------------PSVLTSDITTIG 419
           D  L+  + G   L+ V  DG  V  + +S+                 P +L S  + + 
Sbjct: 376 DRCLIFMQNGHWALVEVRRDGNKVVGMAISEIKHDLPVMEKKPPRFDTPPLLASVPSCVT 435

Query: 420 N----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
           N      FFLGSR+GDSLL+++          +       D   +      +     D +
Sbjct: 436 NVKAGEYFFLGSRVGDSLLIKYDANRVNHQSVAPPVFRVCDTMLNTGPIVDMAVGDVDTV 495

Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
           +   +  +L L  S+ +    A   F         +I P   F++      D+ A     
Sbjct: 496 EQQEDWPQLELVSSSGHGKNGALCVFQ-------RHIYPQTSFAFH---QFDSQA----- 540

Query: 536 QSNYELVELPGCKGIWTVY-HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                         IW++   K+ +  N         DD++   L IS    T+VL   D
Sbjct: 541 --------------IWSIKCRKNDQQQNE--------DDDFDKLLFISKSKSTLVLSAGD 578

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSES 652
            L EV     ++ +G TIA   LF   R++QV+  G  +L  +G  +           ++
Sbjct: 579 ELQEV--KTGFYTRGSTIAVSTLFDATRIVQVYATGVMVLTPEGKRI-----------QT 625

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-TVSVQTPAAIE 701
                 + ++  SI DPY+LL + +  I  L GD ST   + +Q P  I+
Sbjct: 626 VPIPRGAKIVEASIHDPYILLTLDNNKILALQGDASTKDIIHIQLPNHIK 675



 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/433 (22%), Positives = 173/433 (39%), Gaps = 56/433 (12%)

Query: 760  SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
            +G L I+ +P+F   F   +F      IVD        DS              G K  I
Sbjct: 811  TGILRIYSLPDFKEHFACPQFSIAPDLIVD--------DS--------------GVKSRI 848

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
             +  + E+ M         P L        I+ Y+A+ +    +  +     S  +   V
Sbjct: 849  PTNNIQEILMTHIGKERKDPHLVVRTDTNDIIIYKAFTYLDESSPDRLALRFSRVQHEYV 908

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI----------------FKNISGH 923
            S  S+S     +  R  +D +   +T      + + +                F +++G+
Sbjct: 909  SRKSSSHESKPKKKRGIIDEFEIPDTDLNEEEEDLKLSTKKMDKKIQRKLLIPFTDVAGY 968

Query: 924  QGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQ 982
             G F++G++P W M   +  +RVHP   +  IV FT  HNVNC HGFI V S+  +++ +
Sbjct: 969  AGVFVAGAQPAWLMCSCKSFVRVHPMKTEHEIVGFTQFHNVNCQHGFITVDSKSTIQLSR 1028

Query: 983  LPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
            L + G  YD  W +QKV+ L  T H+I Y     +Y ++VS  V   +    +  ID   
Sbjct: 1029 LRTEGINYDLDWVIQKVL-LGQTVHKIQYHPVMRVYAVLVSSSVPTRMKNDDNQYID--- 1084

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1100
            G + D              +E++ + ++ P      W+    +  +  E   ++    L 
Sbjct: 1085 GKETDERGPGEF----LPEMEQFSMILVSP----VTWEIVDKVEFEEFEQCFSLECALLD 1136

Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQI 1160
               T+   +  + IGT  ++GED   +G + ++        P N   +  + P+ +    
Sbjct: 1137 SKQTSTGRKYYMIIGTGTLKGEDTTMKGSIRMYDIIEVVPEPDNPQTNHKFKPVLTE--- 1193

Query: 1161 DFASHFFAICSNS 1173
            D      A+C+ S
Sbjct: 1194 DVKGAVTAMCTVS 1206


>gi|49619061|gb|AAT68115.1| cleavage and polyadenylation specificity factor 1 [Danio rerio]
          Length = 312

 Score =  178 bits (452), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 99/254 (38%), Positives = 149/254 (58%), Gaps = 18/254 (7%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 65  KLEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSL 120

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
           H FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L      +  + DE  
Sbjct: 121 HYFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFR---NDTLADEQE 174

Query: 220 FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
              G G       S++I++R+LD  + ++ D  F+HGY EP ++IL E   TW GRV+ +
Sbjct: 175 GIVGEGQKFSFLPSYIIDVRELDETLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVR 234

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS 337
             TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS  
Sbjct: 235 QDTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVP 294

Query: 338 CALALNNYAVSLDS 351
                  + VSL+S
Sbjct: 295 ------PFGVSLNS 302


>gi|302831157|ref|XP_002947144.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
 gi|300267551|gb|EFJ51734.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
          Length = 2830

 Score =  178 bits (452), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 158/600 (26%), Positives = 260/600 (43%), Gaps = 88/600 (14%)

Query: 575  YHAYLIISL-EARTMVLETADLLTEVTES--VDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +HAYL+I++   RTMVL   D L +VT S   ++ V   T+AAGNLF    ++Q    G 
Sbjct: 1889 FHAYLLITMGRVRTMVLRCTDGLDDVTNSPECEFLVNQPTLAAGNLFHNAVIVQACPMGL 1948

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS-------------IADPYVLLGMSDG 678
            R+L+G  + Q+L      +     ++ S                    ADPYVL+G+SDG
Sbjct: 1949 RVLEGMTLVQELRVSDFQASRPKTAQYSFCCRTKHPIAHRAMGPIPQAADPYVLVGLSDG 2008

Query: 679  SIRLLVGDPSTCTVSVQTPAA-------IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
            +  LL GDP + T+ V T AA         S ++ +++  L+ D+            +W+
Sbjct: 2009 TAVLLEGDPLSLTLGVATAAAEQLMAVPARSRQQRLAAACLHRDE-----------TSWM 2057

Query: 732  STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS-------GR 784
            ++        +         I+  +C  SG LE + +P+   VF      +       G 
Sbjct: 2058 ASATAAEAASS----GSSFSIFLWICRLSGRLECYSLPSMRLVFHSSGLAAAEEVLRMGP 2113

Query: 785  THIVDTY--MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS----- 837
              + D Y         +E E++     G G G  E+     VVEL ++ +    S     
Sbjct: 2114 AVMYDVYDLFGGGGGGAEAELDG----GGGSGIMED----PVVELRVESFLGGGSPAVPD 2165

Query: 838  --RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA----------- 884
              RP L  +   G ++ YQ  L   P ++   + P +   +   S               
Sbjct: 2166 CERPVLLVMAASGNLVAYQIALRRLPLDSLSHEAPAAMGAAAGSSGGGGGIGGGAALGPR 2225

Query: 885  -SRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
             +R  +L ++     +++R +       ++  +    + + G F++GSRP W +  R  L
Sbjct: 2226 MARFDHLAYTDPSSKSHSRTDI------RKYPVASQGTSYSGVFVAGSRPLWLVASRGGL 2279

Query: 944  RVHPQLCDGSIVAFTVLHNVNCNHGFI-YVTSQGILKICQLPSGSTYDNYWPVQKVIPLK 1002
              HP   +G++ A T  HN NC  GFI   +S+G+LK+CQLP  +  D  W V + +PL+
Sbjct: 2280 VPHPMFAEGAVAAMTPFHNANCPLGFISACSSRGLLKVCQLPPHTRLDTPW-VTRRVPLR 2338

Query: 1003 ATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVE 1062
             TPH++ +F +  L   I S  V+           D            ++    R    E
Sbjct: 2339 VTPHKLAWFRDAGLMAAITSRVVVSRPRPPEEPGGDAHAAAAYAAAAAAAAGRGRE---E 2395

Query: 1063 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGE 1122
             +E+R+LEP+  G  W +    P    E AL ++V+ L N TT + + LLA+GT    G+
Sbjct: 2396 AWELRLLEPNGCGRLWLSPLLPP---GEQALCLKVIYLQNATTGDTDALLAVGTGSPMGQ 2452



 Score =  104 bits (260), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 120/215 (55%), Gaps = 25/215 (11%)

Query: 228  ARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
            A + + +++NL + + ++ V+D +F+HGY EPV+++LHE + TW G +  +  TC ++A+
Sbjct: 1305 ATLGNGYLLNLNKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWVGMLRERKDTCCLAAI 1364

Query: 287  SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA 346
            SIS  LK+H ++W   +LP+D +KLLAVP     VLV+  N +   SQ++  A ALN+ A
Sbjct: 1365 SISLRLKRHTILWKLASLPYDCFKLLAVPY-RPAVLVISPNLLLLCSQASQHAAALNSNA 1423

Query: 347  VS--------LDSSQELP---------RSSFSVELDAA-----HATWLQN-DVALLSTKT 383
            +         LD S+E P         + + +V  D A     +AT + + +V     ++
Sbjct: 1424 LPGEVPPPLILDPSREPPAATAARLAAQYALNVHPDCAPAAGRNATLMADLEVVAAGLQS 1483

Query: 384  GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            G L+ + + ++G   QR+ + +T    + S +  I
Sbjct: 1484 GTLLAVHLQFEGPADQRITVVRTGGGPIASAMVGI 1518



 Score = 95.5 bits (236), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 8/188 (4%)

Query: 56   PNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD-GISAASLELVCHYRLHGNVE 114
            PNL+V   N +E++ +R     +  +  +           G   A LELV  Y LHG VE
Sbjct: 1078 PNLIVVRTNRLEVHSLRSSAVATNAAAATATAAATASAAVGSGGARLELVVSYHLHGVVE 1137

Query: 115  SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            SLA+LS G   +S RRD+++LAF + K+SV+E++   H LR +S+H FE    +  + GR
Sbjct: 1138 SLAVLSGG---SSSRRDALLLAFREGKLSVVEWNPRTHSLRTSSLHYFEGDPGVQ-REGR 1193

Query: 175  ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG---LVGDEDTFGSGGGFSARIE 231
             +    P V  DP GRC  +     Q+ +L A +  +G    V D    G G G   RI 
Sbjct: 1194 IAVPLPPRVVTDPAGRCAAMSFCFSQLALLPALEVKAGAWQCVDDGGVMGVGRGERERIG 1253

Query: 232  SSHVINLR 239
              H+   R
Sbjct: 1254 GVHINERR 1261


>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
 gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
          Length = 1323

 Score =  178 bits (452), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 169/647 (26%), Positives = 278/647 (42%), Gaps = 96/647 (14%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE +   RL   V+S AI       +    DS++L F+DAK+S++  + +   L+  S+H
Sbjct: 62  LECLLAVRLLAPVQSFAIARISQNPDC---DSLLLGFDDAKLSIVAVNPADRCLKTISLH 118

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G       P+++VDP  RC  +LV+G  + +L  +   + L       
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSTQL------- 168

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+ + L  +D +  +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
            T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337 -SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
             C ++LN+    +D   + P   F   ++ LD A  T +  +  LL  + G L  L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332

Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            D    V+ L+L     +V+   +T+      F+GSRL DS+ +   C    S L     
Sbjct: 333 TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388

Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT---ESAQKTFSFAVRDS 508
                      +TK+++ S+     +    E+  LYG         +  ++  +  V D 
Sbjct: 389 -----------ATKKMKLSTEPNANE--EDEDFELYGEVLPKVAKPDVTEELLNIRVLDK 435

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
           L+N+GP K  + G    +        K   ++LV   G    G   +  +S R     SS
Sbjct: 436 LLNVGPCKKITGGCPSVSAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495

Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
            +            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L
Sbjct: 496 SIEGVVQYWAVGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554

Query: 618 FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
                 +QV          G +I    Y+   L+F               V S SI DPY
Sbjct: 555 ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTF--------------PVRSASIVDPY 597

Query: 671 VLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
           + +   +G + +  L   P      +     +     P++S ++Y D
Sbjct: 598 IAICTQNGRLLMYELTNQPHVSLKEIDISKRLRHETSPITSLSIYRD 644



 Score = 77.0 bits (188), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 70/277 (25%), Positives = 121/277 (43%), Gaps = 30/277 (10%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
            +  E+G + I+ +P  + V+ V K     +H+ D    +   D E       +  S +  
Sbjct: 731  IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPVTGEGIDAMSGTMT 786

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
             T   + E +    ++EL +     +  RP LF +L D T+  Y+ + +    N      
Sbjct: 787  DTFAVKPEEV----IMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQGH 837

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
                 + L  + V+    R+ RF  T  D     E+   A   +  +  F+ I     G 
Sbjct: 838  LAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 891

Query: 927  FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQLPS 985
            F+  S PC   +     R+HP   DG I++FT  +N  C +GFIY+T +   +++ +LPS
Sbjct: 892  FICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAVCPNGFIYLTERDRFMRVAKLPS 951

Query: 986  GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
                D  +PV++ I + AT H + Y    N Y ++ S
Sbjct: 952  DMILDASYPVKR-INVGATVHSVVYLLHSNTYAVLTS 987


>gi|116182170|ref|XP_001220934.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
 gi|88186010|gb|EAQ93478.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
          Length = 1394

 Score =  178 bits (451), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 279/1195 (23%), Positives = 462/1195 (38%), Gaps = 220/1195 (18%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
            NL V  +++++I+  +V       S+N+G   R             DG+ A+        
Sbjct: 41   NLAVAKSSLLQIFRTKVIATELDTSQNNGHRTRNANRYESRLANDDDGLEASFLGGDSLA 100

Query: 100  ---------SLELVCHYRLHGNVESLAILSQGGADNSRR-RDSIILAFEDAKISVLEFDD 149
                      L LV  + L G V  L  +      N+R   DS++LAF+DAK+S++E+D 
Sbjct: 101  QRTDRANYTKLVLVAEFPLAGTVTGLVRIK---TPNARLGLDSLLLAFKDAKLSLVEWDT 157

Query: 150  SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
              H L   S+H +E  E                +  DP  RC  +      + IL   Q 
Sbjct: 158  EHHTLSTVSIHYYEQEELQGSPWAAPLSHYANFLAADPGSRCAALKFGARNLAILPFKQA 217

Query: 210  GSGL-VGDEDTFGSGGGFSARIES----------------SHVINLRDLD--MKHVKDFI 250
               + +GD D    G   +  + S                S V+ L +LD  + H     
Sbjct: 218  DEDIDMGDWDEELDGPRPAKDLSSAVINGASNIEDTPYSPSFVLRLSNLDPSLLHPVHLA 277

Query: 251  FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
            F+H Y EP   IL              H   M+  L +    K    I S   LP D ++
Sbjct: 278  FLHEYREPTFGILASTAAASNSLGRKDHFVYMVFTLDLQQ--KASTTILSVTGLPQDLFR 335

Query: 311  LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
            ++ +P+P+GG L+VG+N  IH         +A+N       S   + +S  ++ L+    
Sbjct: 336  VVPLPAPVGGALLVGSNELIHIDQSGKPNGVAVNPMTKHCTSFGLVDQSDLNLRLEGCVI 395

Query: 370  TWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGN 420
              L  D+   L+    G + ++T+  DGR V  L+L    + +  S++   ++T   IG 
Sbjct: 396  DVLAADLGELLIILNDGQMAVMTLRIDGRTVSGLELKILPASSGGSIVPGRVSTLSRIGR 455

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA------ 474
            +  F G   GDS+L                   FG  +      +R  R+  +A      
Sbjct: 456  NAMFAGLEEGDSVL-------------------FGWAKKQTQVGRRKPRTKDNAGDVDVE 496

Query: 475  ---LQDMVNGEELSLYGSASNNTESAQKTF--------SFAVRDSLVNIGPLKDFSYGLR 523
                 +    +E  LYG AS                  S  V D L+N+GP++  +Y   
Sbjct: 497  EDEDIEEEEEDEDDLYGEASAPQHQPVSAVSGLLSGEASLRVHDRLINLGPIQAMTYSQP 556

Query: 524  INADAS-----------------ATGISKQS-----NYEL-------VELPGCKGIWTVY 554
            +    S                 A G  K +     N E+        E P  +G WT+ 
Sbjct: 557  VWLPGSEEERNSAGVHSDLQLVCAVGREKSASLVTMNLEIQPKVIGRFEFPEARGFWTMC 616

Query: 555  HKSSRGHNADSSRMAAY-------DDEYHAYLIIS------LEARTMVLETADLLTEVTE 601
             K        S +   +         +Y  ++I++       E   +   TA     +  
Sbjct: 617  AKKPIPKTLQSDKGGNFLGKDYDVSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLGG 676

Query: 602  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
            +      G TI AG +    R+IQV +   R  DG + ++Q +     + E+G+      
Sbjct: 677  TEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIVPM--LDEETGA---EPR 731

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
             +S SIADP +L+   D S+ +   D S     ++      ++ K ++ C LY D     
Sbjct: 732  AISASIADPLLLIIRDDSSVFVAQMDSSNELEELEKEDQTLATTKWLTGC-LYAD----- 785

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
                 +T A+      E + G  G P     I   +   SG+L I+ +P+ +    V + 
Sbjct: 786  -----TTGAF-----AEEVAGKGGKPAQA--ILVFLLSASGSLYIYRLPDLSKPVYVAEG 833

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
            +S        Y+   L       + S+ +GT    KE +  + V +LA  R    H+   
Sbjct: 834  LS--------YIPPGLS-----ADYSARKGTA---KETVAEILVADLA-NRSQLRHAN-- 874

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
                  D TI  YQ + +    +TS   D    S++L        +L N  F+++P +A 
Sbjct: 875  -----DDLTI--YQPFRY----STSAGAD---FSKTLFF-----QKLPNAAFAKSPEEAD 915

Query: 901  TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTV 959
              E T H      +    NI+G+   FL G+ P + +   +   RV P L    ++A + 
Sbjct: 916  EDEAT-HQPRMLSMRRCSNIAGYSTVFLPGASPSFIIKSSKSAPRVLP-LQGAGVIAMSP 973

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPL 1019
             H   C +GFIY  SQ + ++ QLP    Y       + IP+      + Y      Y  
Sbjct: 974  FHTEGCENGFIYADSQHMARVTQLPQDWNYAETGLAVRKIPIGEDIAAVAYHPPMQSY-- 1031

Query: 1020 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1079
            +V    L+P      L  D +   +    NLS        TV+   ++++ P      W 
Sbjct: 1032 VVGCNTLEPFE----LPKDDDYHKEWARENLSF-----KPTVDRGILKLVSPIT----WT 1078

Query: 1080 TRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
               ++ M+  E  L V  ++L  +  T E + L+A+GTA ++GED+  RGRV ++
Sbjct: 1079 VVDSVQMEPCETVLCVATLSLEVSEFTNERKQLIAVGTALIKGEDLPTRGRVYVY 1133


>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
           suum]
          Length = 1434

 Score =  178 bits (451), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 172/656 (26%), Positives = 281/656 (42%), Gaps = 114/656 (17%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE + H RL   V+SLA+        +    S++L F+ AK+SV+ F  +   L+  S+H
Sbjct: 107 LECIIHVRLLAPVKSLAV---ARIPQNPSCSSLLLGFDTAKLSVVGFSAAERSLKTISLH 163

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G  +    P+++VDP  RC  +L+YG  + +L        L       
Sbjct: 164 CFEEE---MLKDGYVTDLPSPVIRVDPAQRCAVMLIYGRYLAVLPFDDTSPHL------- 213

Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       ++ + L  +D  + ++ D  F+ GY EP ++ L+E   T AGR   ++
Sbjct: 214 -----------HTYTVALSSIDPRLVNIIDIAFLDGYYEPTLLFLYEPAQTTAGRACVRY 262

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS- 337
            T  +  +S++T  + H  +W   NLP D  ++L +P PIGG L++GAN + Y +QS   
Sbjct: 263 DTVCMLGVSLNTKEQVHASVWQLNNLPMDCNQVLMIPRPIGGALIIGANELIYLNQSVPP 322

Query: 338 CALALNNYAVSLDSSQELPRSS---FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD 394
           C   LN+    +D   + P  S    ++ LD   A  +  +  ++  ++G L +LT+V D
Sbjct: 323 CGSLLNS---CMDGFTKFPLKSEKEMALTLDGCAACVISTNKVVVCARSGALFILTLVVD 379

Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
               V+ ++        +   +T       F+GSR+GDSL +++          S L   
Sbjct: 380 STNSVKSIEFKHEFDVSIPHTVTACSPGYLFVGSRVGDSLFIEYV---------SEL--- 427

Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ---KTFSFAVRDSLV 510
              +  D P  K+L+    +  QD +  E+L LYG A  +  S     +   F V D ++
Sbjct: 428 ---VPVDDPIEKKLK---VEVPQDDLEDEDLELYGKALPSVISQDVSVEKMRFRVLDRML 481

Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS---- 566
           N+ P K  +           +G S+  N  L E P    ++      + GH  DSS    
Sbjct: 482 NVAPCKKMT-----------SGCSEGLNSYLQEQPRLDPVFD--RVCACGHGKDSSICIF 528

Query: 567 ---------------------RMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
                                 +   +D+ H Y+I S E  ++ LET + L E+   V +
Sbjct: 529 QQSIRPDIITSSSIEGVIQYWAVGRREDDTHMYIIASKELGSLALETDNDLVELEAPV-F 587

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ----DLSFGPSNSESGSGSENSTV 661
                TIAAG L      +QV      ++      Q     L+F               V
Sbjct: 588 ITSESTIAAGELADGGLSVQVTTSSIVVVAEGQQIQLIPLQLTF--------------PV 633

Query: 662 LSVSIADPYVLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
           LS SI DP+V +   +G + L  L   P     +V  P  I  +K P+++  +Y D
Sbjct: 634 LSASIVDPFVAICTQNGRLLLYELDNTPHVHLKAVDLPGNIIHNKSPITALCIYRD 689



 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 108/391 (27%), Positives = 177/391 (45%), Gaps = 51/391 (13%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREA--LKDSE---TEINSSSEE 809
            V+  E+G L I+ +P    V+ V K     +H+ D  + E   L D     ++I S++  
Sbjct: 773  VMARENGNLYIYSIPEMQLVYMVKKL----SHLPDVAIDEMNYLGDESVVASDIASNTLN 828

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                 + E I    +VE+ +     +  RP LF ++ D  +  Y+ ++++   N      
Sbjct: 829  EALVAKPEEI----IVEVLLTGMGMNQGRPMLFVVV-DDMVSVYEMFMYD---NGVVEHL 880

Query: 870  PVSTSRSLSVSNVSASRLRNLRFS----RTPLDAYTREETPHGA---PCQRITIFKNISG 922
             V   R L  + V+    R+ RF     R P++A  R+   +     P +RI    N   
Sbjct: 881  AVRFKR-LPYTTVT----RSCRFQGNDGRAPVEA-ARDTVRYRTALHPFERIGNILN--- 931

Query: 923  HQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKIC 981
              G F+  S PC  ++    LR+HP   +G I++FT  +NV C +GFIY+T  +  ++I 
Sbjct: 932  --GVFICSSYPCVFLMDSGILRMHPLNLEGPILSFTAFNNVLCPNGFIYLTEREWAMRIA 989

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
            +LP+    D+  PV+K I    T H I Y  + N Y ++ S    K  N  L +L++++ 
Sbjct: 990  KLPTDVELDSSLPVRK-IRTGRTIHNIVYLLQSNTYAVVGSE---KKPNNRLCVLVNED- 1044

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ--TRATIPMQSSENALTVRVVT 1099
                D H     D      +E Y+V++  P+     W+    A I M+  E       V 
Sbjct: 1045 -KSFDEH--EKADSFVLPELEVYDVKLYSPED----WKPVPNAEIKMEDFEVLTCCEEVV 1097

Query: 1100 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGR 1129
            L +  T    +  LA+GTA   GE+V  RGR
Sbjct: 1098 LRSEGTVSGVQNYLAVGTACNYGEEVLVRGR 1128


>gi|440637976|gb|ELR07895.1| hypothetical protein GMDG_02777 [Geomyces destructans 20631-21]
          Length = 1495

 Score =  176 bits (447), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 262/1132 (23%), Positives = 440/1132 (38%), Gaps = 190/1132 (16%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
            S   L LV  Y L G V SLA +    +++    +S++L+F+DAK+S++E+D   HGL  
Sbjct: 147  STTKLVLVGEYALAGTVTSLARIKI--SESKSGGESLLLSFKDAKLSLVEWDPERHGLST 204

Query: 157  TSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS 211
             S+H +E  E     W        ++     +  DP+GRC  +      + IL   QG  
Sbjct: 205  VSIHYYEQEEIGGSPWDPYLSNCFNY-----LTADPRGRCAALKFGARNLAILPFRQGDE 259

Query: 212  GLVGDE--------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                D+               T  + G        S V+ L  LD  + H     F++ Y
Sbjct: 260  DTTMDDWDEELDGPRPTTAIITSENKGHEDTPYAPSFVLRLSSLDPTLIHTVHLAFLYEY 319

Query: 256  IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
             EP   IL       +  +  +         ++    K    I     LP+D +K++ +P
Sbjct: 320  REPTFGILSSTLSPSSSLLDERKDQLSYMVFTLDLNQKASTTILVVTGLPYDLFKVIPLP 379

Query: 316  SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
            SPIGG L+VG N  IH      +  +A+N  A S  S   + +SS  + L+      L  
Sbjct: 380  SPIGGALLVGGNELIHIDQSGKANGVAVNALAKSCTSFGLVDQSSLQMRLEGCAVEQLSA 439

Query: 375  DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFL 425
            D    L+   TG+L +L+   DGR V  L+L +  PS          S  + I ++  F+
Sbjct: 440  DNGEMLIILNTGELAVLSFRMDGRSVSGLNLRRV-PSESGICMGAQASCTSLINHNSMFI 498

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE-- 483
            GS   DS+++ ++  S  +      +      +          +   D  +D + GE   
Sbjct: 499  GSEDTDSIVLGWSRKSKQAGRRRS-QPTIDAGDDADVDGTDEDQEDEDEDEDDLYGESTA 557

Query: 484  -LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----RINADASATGISKQSN 538
             + L G  + +  S    ++F + DSLVNI PL+D +       R + D  AT IS +SN
Sbjct: 558  AIPLKGEVAADANSKAGDYAFRIHDSLVNIAPLRDVTLSKPETPREDEDEEAT-ISTRSN 616

Query: 539  YELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
            +ELV                          E P  +GIWT+  K       +  +  A  
Sbjct: 617  FELVGVTGRNTSGSLAFLRREIEPNVIGRFEFPEARGIWTLCAKRPLIKGLEPEKSEAIL 676

Query: 573  D-------EYHAYLIISL-------EARTMVLETADLLTEVTESVDYF-VQGRTIAAGNL 617
            D       ++   +I+S        E+   VL +A    E     ++    G TI  G +
Sbjct: 677  DPESELGAQFDRLMIVSKSTEDTPEESSVYVLTSAGF--EALADTEFEPAAGATIKCGTV 734

Query: 618  FGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
                RV+Q+ +   R  DG   + Q L     + E+G+      +++ SI DPYVLL   
Sbjct: 735  GNGMRVVQILKSEVRSYDGDLGLAQILPM--FDDETGA---EPKIVAASIVDPYVLLIRD 789

Query: 677  DGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG 736
            D SI +   D       ++       + K +S C LY+D                STG+ 
Sbjct: 790  DASIFVASCDSDNDLEEIERGDDSLLTNKWLSGC-LYND----------------STGMF 832

Query: 737  EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--R 793
                 ++G    +  I S++  E GAL ++ +P+ +  ++  +      T I   Y   R
Sbjct: 833  AETALSNGTVSKKSVIMSLLNSE-GALFMYALPDLSKPIYQANGVSFIPTTISPDYATRR 891

Query: 794  EALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCY 853
              + ++ TE+                       L      A    P+L    ++  +  Y
Sbjct: 892  STVAETLTEV-----------------------LLADLGDATSKSPYLIFRASNDDLTIY 928

Query: 854  QAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR 913
            +   F+ P     S+ P   S+SL    +    +       T + A   E    G+P + 
Sbjct: 929  EP--FQVP-----SEAPRPLSKSLHFQKIHNPHVAKTANPETEV-AADAESAKRGSPMRA 980

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT 973
            I    N+ G    FL G  P + +   +       L    + + +  H   C+ GFIYV 
Sbjct: 981  IA---NVGGLSSVFLPGDSPSFVVKSSKSTPRVVGLRGHGVRSLSGFHTEGCDRGFIYVD 1037

Query: 974  SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPL--IVSVPVLKPLNQ 1031
            S+GI ++ QL   +   +     + + +      +TY   K++Y +  +V  P   P + 
Sbjct: 1038 SKGIARVSQLEPETNVTDIGLTLRKVKIGEEVQAVTYHPPKDVYVIGTVVKEPFELPKDD 1097

Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE---------VRILEPDRAGGPWQTRA 1082
                                  D HR +  E+           +++L P      W    
Sbjct: 1098 ----------------------DYHREWAKEDITFKPLTGRGFLKLLNPSN----WSVID 1131

Query: 1083 TIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
             + + S E  + ++ + L  +  T E + L+ +GTA  +GED+A RGRV ++
Sbjct: 1132 KVELDSHEIIMCIKTLNLEVSENTHERKQLITVGTAISKGEDLAIRGRVYVY 1183


>gi|291232724|ref|XP_002736306.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
           [Saccoglossus kowalevskii]
          Length = 304

 Score =  173 bits (439), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 118/354 (33%), Positives = 175/354 (49%), Gaps = 62/354 (17%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +A Y+ +H PTGI +C  G                  EE               NL++  
Sbjct: 2   YALYRQIHPPTGIEHCVYGH-------------FFSKEE--------------KNLIIAG 34

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
           A  + +Y + + +  SK+ K+  E  R                 + L GN+ SL      
Sbjct: 35  ATDLHVYRL-LSDVDSKQKKSKLEHLRS----------------FSLFGNIMSLQTTRLA 77

Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
           GA     RD+++L+F+DAK+SV+E+D   H L+  S+H FE      LK G  S    P 
Sbjct: 78  GAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEEA---LKEGYVSNYYIPQ 130

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           V VDP  RC  +L+YG ++++L   + G+    D+D    G   S+ +  S++INL+D+D
Sbjct: 131 VVVDPDNRCAVMLMYGSKLVVLPFRREGAA--EDQDGVLPGSSKSSFL-PSYIINLQDID 187

Query: 243 MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            K  ++ D  F+HGY EP + IL E   TW GRV+ +  TC I A+S++   + HP+IWS
Sbjct: 188 QKLINIIDIKFLHGYYEPTLFILFEPLRTWPGRVAVRKDTCCIVAISLNIEQRVHPVIWS 247

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
             NLP D  K + VP PIGGVLV   +++ Y +QS         Y VSL+   E
Sbjct: 248 LNNLPFDCIKAIPVPKPIGGVLVFAVDSLLYLNQSVP------PYGVSLNGLTE 295


>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Pongo abelii]
          Length = 1388

 Score =  172 bits (437), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 102/269 (37%), Positives = 152/269 (56%), Gaps = 29/269 (10%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            T K HP+IWS  +LP D  + LAVP PI
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPI 275



 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 101/347 (29%), Positives = 167/347 (48%), Gaps = 56/347 (16%)

Query: 379 LSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           +S K G++ +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++
Sbjct: 285 ISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY 344

Query: 438 TCG----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASN 492
           T        +++  +  KEE    +    +T     +     QD V+  E+ +YGS A +
Sbjct: 345 TEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVD--EIEVYGSEAQS 402

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASA 530
            T+ A  T+SF V DS++NIGP  + + G                L I        + + 
Sbjct: 403 GTQLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGAL 460

Query: 531 TGISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHA 577
           + + K    ++V   ELPGC  +WTV          +    G   + S   A DD   H 
Sbjct: 461 SVLQKSIRPQVVTTFELPGCYDMWTVIAPLRKEEEDNPKGEGTEQEPSTPEADDDGRRHG 520

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
           +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G 
Sbjct: 521 FLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG- 578

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 579 --VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616



 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 101/431 (23%), Positives = 174/431 (40%), Gaps = 72/431 (16%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 731  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 787  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
                L   N+       +RF + P +   RE              T  GA  +    R  
Sbjct: 830  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFR 879

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   
Sbjct: 880  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNR 939

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATP-HQITYFAEKNLYPLIVS--VPVLKPLNQ 1031
            Q   ++   PS +      P      L   P H         +Y +  S   P  +    
Sbjct: 940  QEPQRLSGSPSRTXXXXPTPPG----LLGLPGHWCVTPTNPQVYAVATSTNTPCAR---- 991

Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSS 1089
                 I +  G + +   +   + +     E + ++++ P      W+    A I +Q  
Sbjct: 992  -----IPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVS----WEAIPNARIELQEW 1042

Query: 1090 ENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLS 1148
            E+   ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +  
Sbjct: 1043 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTK 1102

Query: 1149 GSYGPLFSSVQ 1159
              +  L+   Q
Sbjct: 1103 NKFKVLYEKEQ 1113


>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
            scapularis]
 gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
            scapularis]
          Length = 1241

 Score =  172 bits (437), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 206/815 (25%), Positives = 329/815 (40%), Gaps = 143/815 (17%)

Query: 388  LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
            +LT+  DG R V+  +  K   SVLT+ +T       FLGSRLG+SLL+ +T      M 
Sbjct: 198  VLTLFNDGMRSVRNFNFDKAAASVLTTSMTLCEEGYLFLGSRLGNSLLLHYT-EKAAEME 256

Query: 447  SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
             +G KE+               ++  D    +++ +EL +YGS +  T+    +++F V 
Sbjct: 257  EAGKKED---------------KAEGDVNVALIDPDELEVYGSETLATKQL-TSYTFEVC 300

Query: 507  DSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV----------- 542
            DSL+NIGP      G           N+D      +  G  K     ++           
Sbjct: 301  DSLINIGPCGKICMGEPAFLSEEFTQNSDPDLELVTTAGYGKNGALCVLQRSVRPQVVTT 360

Query: 543  -ELPGCKGIWTVYHKSSRGHNA------DSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             ELPGC  +WTV    +           D     A     HA+LI+S    +M+L+T   
Sbjct: 361  FELPGCVHMWTVMGPPTEKKKKEASEESDEQAADATLTNTHAFLILSRADSSMILQTDQE 420

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+  S  +  Q  T+ AGNL   R V+QV   G R+LDG+   Q +            
Sbjct: 421  INELDHS-GFSTQNPTVFAGNLGDGRYVLQVCPMGVRLLDGTRQLQHIPL---------- 469

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTP--AAIESSKKPVSSC 710
               S ++  S+ADP+VL+    G +  L   GDP++ C ++V  P   A+ S +    +C
Sbjct: 470  DVGSPIVGGSLADPHVLIRSEGGLVVHLTLRGDPASGCRLAVLRPQLTAVVSHRANALTC 529

Query: 711  --------------TLYHD----KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
                           LY D    +  +  +R T+ +    +      +  +  P      
Sbjct: 530  HCIAVSGVLDDEDELLYGDSEDTRATKEPVRVTAMET--ESETANVFELKEVKP----TF 583

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGT 811
            +  V  E+G LEI+ +P++   F V  F  G+  +VD+    A   +++E ++  S E  
Sbjct: 584  WVFVARENGVLEIYSLPDYKLCFLVKNFPMGQRVLVDSVQMTAPSGTKSEKLSDMSHE-- 641

Query: 812  GQGRKENIHSMKVV-ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                      M VV E+ M       SRP L A + D  +L Y+A+ F   +        
Sbjct: 642  ---------CMPVVHEILMVGLGVRQSRPLLLARV-DEDLLIYEAFPFYETQREGH---- 687

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
                  L    ++   +   R  +T       EE    +    +  F +ISG+ G FL G
Sbjct: 688  ----LKLRFKKLNHDIILRSRKYKTQKPENEEEEKAFQSRLW-LQPFSDISGYSGVFLCG 742

Query: 931  SRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-----ILK---IC 981
             RP W  M  R  LR HP   DG +  F   HNVNC  GF++   Q      +L    + 
Sbjct: 743  HRPHWLFMSSRGELRYHPMFVDGPVYCFAPFHNVNCPKGFLHFNKQSDSYALLLHSYWLS 802

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
            QLPS   +            K   H+  +FA +      +  P        L  + D   
Sbjct: 803  QLPSPKRHGERLLFNCPSHKKICIHRCHFFALQQKAADFLWPPPFVTTVSPLPFVADSR- 861

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVT 1099
                          +   T++++ +++L P      W+T     + +   E+   ++ V 
Sbjct: 862  --------------YIFPTMDKFSLQLLSPVS----WETIPNTRVDLDEWEHLTCIKNVM 903

Query: 1100 LFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLF 1133
            L +  T    +  LA+GT Y  GEDV +RGR+ + 
Sbjct: 904  LSSEGTSTGMKGYLALGTNYCYGEDVTSRGRITIL 938



 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 5/95 (5%)

Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
           P+++VDP  RC  +LV+   + ++   +  +    +E   G   G    +   + + L +
Sbjct: 58  PMIRVDPCNRCAAMLVFSRTIAVVPFRKDTAA---EEQETGPTFGNKPPLLDWYPVALTE 114

Query: 241 LDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
           LD K  +V D  F+HGY EP ++IL+E   TW G+
Sbjct: 115 LDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGK 149



 Score = 40.0 bits (92), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 9/103 (8%)

Query: 236 INLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           + L +LD K  +V D  F+HGY EP ++IL+E   TW G V    +  M S  + +    
Sbjct: 158 VALTELDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGYVLTLFNDGMRSVRNFNFDKA 217

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
              ++ ++M L  + Y  L   S +G  L+     +HY  ++A
Sbjct: 218 AASVLTTSMTLCEEGYLFLG--SRLGNSLL-----LHYTEKAA 253


>gi|340924328|gb|EGS19231.1| hypothetical protein CTHT_0058560 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1460

 Score =  172 bits (436), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 269/1188 (22%), Positives = 459/1188 (38%), Gaps = 201/1188 (16%)

Query: 57   NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD--GISAA------- 99
            NLVV  +++++++  +        +Q  G+ + +++   + R+  D  G+ A+       
Sbjct: 28   NLVVAKSSLLQVFRTKTVTTEIDTLQTNGASKGRSAARYENRLANDDDGLEASFLGGDSL 87

Query: 100  ----------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFD 148
                       L LV    L G V  L+ +       SR   +S++LAF DAK+S++E+D
Sbjct: 88   GFRADRTTNTKLVLVYETPLAGTVIGLSKIK---TSTSRSGCESLLLAFRDAKLSLVEWD 144

Query: 149  DSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMI 202
               + L   S+H +E  E       + S    PL      +  DP  RC  +      + 
Sbjct: 145  AERNALGTVSIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALKFAARNLA 198

Query: 203  ILKASQGGSGL-VGDEDTFGSG------------GGFSARIES-----SHVINLRDLD-- 242
            IL   Q    + +GD D    G               ++ IE      S V+ L +LD  
Sbjct: 199  ILPFRQVDEDIDMGDWDEELDGPRPQKDVSNAAVSNGASNIEDTPYSPSFVLRLSNLDPS 258

Query: 243  MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
            + H     F+H Y EP   IL              H+T M+  L +    K    I S  
Sbjct: 259  LLHPVHLAFLHEYREPTFGILASTSSASNALGRKDHYTYMVFTLDLQQ--KASTTILSVS 316

Query: 303  NLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS 361
             LP D Y+++ +P+P+GG L+VG N  IH         +A+N       S     +S  +
Sbjct: 317  GLPQDLYRVVPLPAPVGGALLVGCNELIHIDQSGKPNGVAVNPMTKQCTSFGLADQSDLN 376

Query: 362  VELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDI 415
            + L+      L  D+   L+    G +VL+T   DGR V  L+L    P    +++   I
Sbjct: 377  IRLEGCIIDVLTPDLGEFLMILNDGRMVLITFRIDGRTVSGLELRLVPPASGGTIIPGRI 436

Query: 416  TT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
            +T   IG ++ F GS  GDSL+  +     T   +   + +    + D            
Sbjct: 437  STLSRIGKNVMFAGSEEGDSLVFGW-----TKKQTQAGRRKSKPRDDDFYMDDYEEEEEE 491

Query: 473  DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADAS 529
                D+   E  S +   S  +       SF + D L++I P++  +YG  +    ++  
Sbjct: 492  VDEDDLYGEETTSHHQPVSAASSLLSGDLSFRIHDRLISIAPIQSMTYGQPVWMPGSEEE 551

Query: 530  ATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNA 563
               I   ++ +LV                          E    +G WT+  K     + 
Sbjct: 552  RNSIGVHADLQLVCAVGRDKSSCLATMNLAIQPKVIGQFEFSEARGFWTMCAKKPIPKSL 611

Query: 564  DSSRMAA------YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQG 609
             S +  +      YD   +Y  ++I++       E   +   TA     +  +      G
Sbjct: 612  QSDKGVSVLGGNDYDTGGQYDRFMIVAKVDLDGYEKSDVYALTAAGFEGLCGTEFDPAAG 671

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
             TI AG +    R++Q+ +   R  DG +    +   P   E  +G+E   V + SIADP
Sbjct: 672  ITIEAGTMGKGSRIVQILKSEVRSYDGDFGLSQIV--PMMDEE-TGAEPRAV-TASIADP 727

Query: 670  YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDA 729
            Y+L+   D S  +   D S     ++    +  S K +S C LY+D              
Sbjct: 728  YLLIIRDDSSAFIAGIDSSNELEELRKEDKVLVSSKWLSGC-LYND-------------- 772

Query: 730  WLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIV 788
              ST +          P     I   +   SGAL I+ +P+ +  ++  D          
Sbjct: 773  --STAIFAEETAKSSKPTQS--ILLFLLSSSGALYIYRLPDLSKPIYVTDGLA------- 821

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
              Y+  AL    T       +GT    KE I  + V +L        H  P+L    ++ 
Sbjct: 822  --YIPPALSSDFT-----VRKGT---PKEAITEIMVADLG----DTTHKSPYLILRHSND 867

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +  YQ Y ++           + T +  S   +   +L N  F+R P +   +++ P  
Sbjct: 868  DLTIYQPYRYK-----------LGTGQVFS-KTLFFQKLPNPSFARAP-EETEQDDVPPQ 914

Query: 909  APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNVNCNH 967
                 +    NI+G+   FL G  P + +   + + RV P L    ++A +  H   C+H
Sbjct: 915  PRLLSMRRCNNIAGYSTVFLPGHSPSFILKSAKSMPRVVP-LQGAGVIAMSPFHTEGCDH 973

Query: 968  GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLK 1027
            GFIY  S  I ++ Q+P   +Y       K +P+      + Y   +  Y  +V     +
Sbjct: 974  GFIYADSHNIARVTQIPEDWSYAELGLAVKKVPIGEDIAAVAYHPPQQCY--VVGCNASE 1031

Query: 1028 PLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
            P           E+    D H   +  +L    T++   ++++ P      W    T+ +
Sbjct: 1032 PF----------ELPKDDDYHKEWARENLVFKPTLDRGLLKLISPIT----WTVIDTVQL 1077

Query: 1087 QSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            +  E  L V  + L  + +T E   L+A+GTA  +GED+  RGRV ++
Sbjct: 1078 EPCETVLCVETLNLEVSESTNERRQLIAVGTALTKGEDLPTRGRVHVY 1125


>gi|367018592|ref|XP_003658581.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
            42464]
 gi|347005848|gb|AEO53336.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
            42464]
          Length = 1547

 Score =  171 bits (434), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 266/1148 (23%), Positives = 439/1148 (38%), Gaps = 190/1148 (16%)

Query: 94   DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR------------DSIILAFEDAK 141
            D  +   L LV  + L G V  LA +    A+ +               DS+++AF DA+
Sbjct: 93   DRANTTKLVLVAEFPLAGTVTGLARIRTPKANRNHDGGAGHAGHAGHGCDSLLIAFRDAR 152

Query: 142  ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVL 195
            +S++E+D   H L   S+H +E  E       + S    PL      +  DP  RC  + 
Sbjct: 153  LSLVEWDAEQHTLSTISIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALK 206

Query: 196  VYGLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN----------------- 237
                 + IL   Q    + +GD D    G   +    S+ V+N                 
Sbjct: 207  FGARNLAILPFRQADEDIDMGDWDEELDGPRPAKDPSSNAVVNGASNIEDTPYSPSFVLR 266

Query: 238  LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
            L +LD  + H     F+H Y EP   IL              H   M+  L +    K  
Sbjct: 267  LSNLDPSLLHPVHLAFLHEYREPTFGILASATAPSNALGRKDHLVYMVFTLDLQQ--KAS 324

Query: 296  PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQE 354
              I S   LP D ++++ +P+P+GG L+VG+N  IH         +A+N       +   
Sbjct: 325  TTILSVSGLPQDLFRVVPLPAPVGGALLVGSNELIHVDQSGKPNGVAVNPMTRQCTNFGL 384

Query: 355  LPRSSFSVELDAAHATWLQNDVALLST--KTGDLVLLTVVYDGRVVQRLDLSKTNPSV-- 410
            + +S  ++ L+      L  D+  L      G   ++T   DGR V  L++     S   
Sbjct: 385  VDQSDLNLRLEGCAIDVLTPDLGELFVVLNDGRAAVVTFRIDGRTVSGLEIKMLPESAGG 444

Query: 411  -----LTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGLKEEFGDIEADAP 462
                   S ++ IG +  F G   GDSLL+ +      +G   L +      GD++A+  
Sbjct: 445  SLIPGRVSTLSRIGRNAVFAGREEGDSLLLGWAKRQAQTGRRRLRARDAAGSGDVDAEG- 503

Query: 463  STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-------------FSFAVRDSL 509
                L     D + +  + +E           ESA +               SF V D L
Sbjct: 504  --AELAEGDEDVVAEGEDEDEDEEDEDDLYGEESAPRQQPVSAASSFLSGDVSFRVHDRL 561

Query: 510  VNIGPLKDFSY----------------GLRINADASAT-GISKQSNYELV---------- 542
            +++ P++  +Y                G+R + +   T G  K +    V          
Sbjct: 562  LSVAPIQALTYSQPVYLAGSEEERNSAGVRSDLNLVCTVGRDKSAALATVNLAIQPRVIG 621

Query: 543  --ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYD--DEYHAYLIISLEARTMVLETA 593
              E P  +G WTV       KS +G  A +S    YD   +Y  ++I++ +      E +
Sbjct: 622  RFEFPEARGFWTVCAKKPVPKSLQGDKAGNSLSKDYDTAGQYDRFMIVA-KVDLDGYEKS 680

Query: 594  DLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
            D+        +           G TI AG +    R+IQ+ +   R  DG +    +   
Sbjct: 681  DVYALTAAGFEGLGGTEFDPAAGITIEAGTMGKGSRIIQILKSEVRCYDGDFGLSQIV-- 738

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
            P   E  +G+E   V S SI DP++L+   D S  +   D S     +       +S K 
Sbjct: 739  PMLDEE-TGAEPRAV-SASIVDPFLLIIRDDSSAFIAQVDSSNELEELDKEDPTLASTKW 796

Query: 707  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIF 766
            ++ C LY D          +T A+     G+      GG L Q  +   +   SGAL I+
Sbjct: 797  LTGC-LYAD----------TTGAFAEEAPGK------GGKLSQ-SVLMFLLSASGALHIY 838

Query: 767  DVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVE 826
             +P+ +    V + +S        Y+   L       + S+ +GT    KE I  + V +
Sbjct: 839  RLPDLSKPVYVAEGLS--------YIPPGLS-----ADYSARKGTA---KETIAEILVAD 882

Query: 827  LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
            L        H  P L    T+  +  YQ + +    NT      +  S++L        +
Sbjct: 883  LG----DMTHKSPHLILRHTNDDLTLYQPFRY----NTGAG---LEFSKTLFF-----QK 926

Query: 887  LRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
            L N  F+++P +A   E T H      +    N+ G+   FL G+ P + +   + +   
Sbjct: 927  LPNTVFAKSPEEADDDEAT-HQPRFLSMRRCANVGGYSTVFLPGASPSFIIKSSKSVPKV 985

Query: 947  PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPH 1006
              L    ++A +  H   C HGFIY  S+ + ++ QLP   +Y       + IP+     
Sbjct: 986  LPLQGTGVIAMSPFHTEGCEHGFIYADSRDMARVAQLPQDWSYAELGLAVRKIPIGEDIA 1045

Query: 1007 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEV 1066
               Y      Y +  + P      +   L  D +   +    NL+        TV+   +
Sbjct: 1046 AAAYHPPMQSYVVGCNTP------EPFELPKDDDYHKEWARENLAF-----KPTVDRGNL 1094

Query: 1067 RILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVA 1125
            +++ P      W    +I M+  E  L V  + L  +  T E + L+A+GTA  +GED+ 
Sbjct: 1095 KLVSPIT----WTVVDSIQMEPCETVLCVECLGLEVSEFTNERKQLIAVGTAITKGEDLP 1150

Query: 1126 ARGRVLLF 1133
             RGRV ++
Sbjct: 1151 TRGRVYVY 1158


>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
           C-169]
          Length = 1137

 Score =  171 bits (434), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 148/477 (31%), Positives = 227/477 (47%), Gaps = 68/477 (14%)

Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
             S  + +S+++ L  L +  V+D +F+H Y EPV+++LHE + +W G++     T  ++
Sbjct: 18  ALSTTVGNSYMLKLAKLGISEVRDAVFLHRYSEPVLLVLHETKPSWGGQLRNSKDTMEVT 77

Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
           A S++   K+H  +WS  NLP DA+KL+ VP   GG LV+  N + Y SQ A+ A A   
Sbjct: 78  AFSLNVAHKRHTRLWSIGNLPSDAFKLIEVPG--GGGLVICQNLLIYVSQEAAAAAASGA 135

Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
                          F ++L      WL ++  LL   +G L+L+ V  +G   +RL +S
Sbjct: 136 PRA----------EGFELDLTDCSGAWLADNSLLLGLASGQLILVNVQLEGS--KRLKVS 183

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
           K   +   S +  +G  L FLGS + +SLL++     G ++L  G +E+    EADA   
Sbjct: 184 KAQGAPPPSCMCRLGPELLFLGSWVANSLLIR-AVPEGQTLLLGGPEEQAS--EADATHA 240

Query: 465 KRLRRSSSDALQDMVN--GEELSLY-----GSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
            +  R   DA  D+ N   +E+SL        A  +T  A K +S  V DSLV+IG ++D
Sbjct: 241 SKRPRLDPDA-ADLGNEDEDEVSLIYRTDAQPALPSTTGASK-YSLQVVDSLVSIGIVQD 298

Query: 518 FSYGLRINADASATGISKQSN-------------------------YEL---VELPGCKG 549
              G   +  A    ++K                             EL   V LPG   
Sbjct: 299 LVTG-EASTSAPQEWVAKTERGPPKLLAAVGSDKFGAVAVLRSSLVPELVTEVPLPGVDQ 357

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES-VDYFVQ 608
           +W V H    G   D S +      YHA+L ++ ++ T VL T + L E   S VD+ + 
Sbjct: 358 MWAV-HFQPEGLPVDDSLL------YHAFLFLNEKSGTKVLRTGEELDETDSSQVDFILS 410

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
            RT+ AGNL G  R++QV  RG  +L GS   QDL       +   G  N+T+++ S
Sbjct: 411 SRTVFAGNLLGNSRIVQVHARGVVLLSGSSRVQDLPV-----QDLIGVSNTTIVAAS 462



 Score =  120 bits (301), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 103/315 (32%), Positives = 133/315 (42%), Gaps = 49/315 (15%)

Query: 839  PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
            PFL  +L DGT L Y+A  F  P                    V   RL     +  P  
Sbjct: 553  PFLLLLLADGTFLAYRA--FHTPRG-----------------RVCFKRLSLPAHAHCPPQ 593

Query: 899  AYTREETPHGAPCQRITIFKNISGHQ-----GFFLSGSRPCWCMVFRERLRVHPQLCDGS 953
                + T   AP   +T F  +   +     G F+SG RP W +  R  L  H    +G 
Sbjct: 594  DRRSKTT---APSSSMTRFDGLGESKEHVNSGMFVSGERPLWLVASRGTLVAHAMDVEGR 650

Query: 954  IVAFTVLHNVNCNHGFIYV----TSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
            +   T  HN+NC  GFI           LKICQLP  +  D  WP+QK I ++ATPH++ 
Sbjct: 651  VSGMTPFHNINCPLGFITACMAENDGETLKICQLPMRTRLDTPWPLQK-IAVRATPHRLA 709

Query: 1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS---VDLHRTYTVEEY-- 1064
            Y+AE  LY L+VS PV  P  +       QE     D H   S    D     +  E   
Sbjct: 710  YYAEARLYVLLVSRPV--PYRE------HQEEASDGDPHASYSYICADAAAKASGTELGG 761

Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDV 1124
            EVR+LEP R    +QT A   +   E   +V    L N  T   E  + +GTA   GED 
Sbjct: 762  EVRLLEPGR----YQTVARHALDPGEEPCSVAADWLRNAQTGALEPYITVGTALNYGEDY 817

Query: 1125 AARGRVLLFSTGRNA 1139
               GR+LLF   R +
Sbjct: 818  PCSGRILLFKATRTS 832


>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1
           [Clonorchis sinensis]
          Length = 1741

 Score =  171 bits (434), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 141/482 (29%), Positives = 225/482 (46%), Gaps = 67/482 (13%)

Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
           R DS++L+F +AK++V+ FD   + L+  S+H +E   + +LK GR  F+  P+++VDP 
Sbjct: 27  RLDSLLLSFTEAKVAVMGFDPVQYELKTLSLHNYE---FENLKSGRTHFSHLPILRVDPL 83

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDE--------DTFGSGGGFS------ARIESSH 234
            RC  VLVY   + +L   +  +   GD+        +T    G  S      A + ++ 
Sbjct: 84  QRCAVVLVYDRHLAVLPFRRSEALAAGDKYLAKPVTNNTARGAGSLSWERRATAPLLATF 143

Query: 235 VINLRD---LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
              L       + +V D  F++G+ EP +++L+E   TWAGRVS +  TC I ALS +  
Sbjct: 144 TTCLSSSTGEKINNVLDMQFLNGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQ 203

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---V 347
            + +P+IW   +LP+D   + +VP PIGGVL++  N+I Y  Q+  SC L LN YA    
Sbjct: 204 KRTNPVIWFQESLPYDCTYVHSVPEPIGGVLILATNSIIYMKQTLPSCGLPLNCYAQVTT 263

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT--VVYDGRVVQRLDLSK 405
           +    Q++P+    + LD      + +   L+ T+TG + LL+  V +  + V  L L +
Sbjct: 264 NFPMRQDVPQCG-PLTLDGCRIVTMTDSQFLIVTRTGKMCLLSLWVEHTTQTVSSLLLHE 322

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS------------GTSMLSSGLKEE 453
              SV    +  +     F+GSRL DS+L+  T  +              +  +   + +
Sbjct: 323 IGCSVPPYSVALLDKGYVFVGSRLCDSVLLHLTASTMFVNTLGRIVDLDETTTADNFRTD 382

Query: 454 FGDIEADA---------PSTKRLRRSSSDALQD----MVNGE------ELSLYGSASNNT 494
              IE DA         P+ K     SS         +V+G       ++ LYG    N 
Sbjct: 383 IPMIERDAESIPVDKNNPTEKEAENVSSGTPSKPSGSIVHGPYVFDEVDVELYGDTILNP 442

Query: 495 ESAQK---TFSFAVRDSLVNIGPL-----KDFSYGLRINADASATG-ISKQSNYELVELP 545
            S  +   T+ F V D LVN GP+      +  Y    N D +    I+ Q+    VEL 
Sbjct: 443 PSDVRELNTYKFEVADRLVNFGPMGLLTSGEVPYLAPGNTDPTDEALIAAQAEMHHVELL 502

Query: 546 GC 547
            C
Sbjct: 503 AC 504



 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 95/406 (23%), Positives = 168/406 (41%), Gaps = 46/406 (11%)

Query: 748  DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
            D+   ++ + + +G LEI+ +P+F  ++ V  F      +VD     A + ++ E+N  +
Sbjct: 1007 DKSRYFAFIVFTNGVLEIYSLPDFTLLYEVHHFSDLPAMLVDC---RAGQGNKVEVNLEN 1063

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
                    ++NI    V+E+ +     +  RP L  + T   I  ++A         S +
Sbjct: 1064 IPNCPAAEEDNIPP-TVLEITVFPIGRNRDRPVLL-VRTSQEIAFFEALC------PSHN 1115

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET-PHGAPCQRITI--------FK 918
            +     S S S   +   R R L     PL A  R  T P  A  Q   +        F+
Sbjct: 1116 EAHPFASESWSQEGL---RWRRLPIP-CPLVAPRRVRTDPKIADVQSTMLTRKNLLRPFE 1171

Query: 919  NISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
            +I GH G F+ G+ P W        +RV     DG + +F  L+   C  GF+Y T    
Sbjct: 1172 DIDGHCGVFVCGATPIWLFSSDTGHIRVFNHSIDGIMGSFAPLNTDICPSGFVYFTYSNE 1231

Query: 978  LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
            +++  L  G ++  +  + + +PL+ TP+ + Y  E   Y L+ +   +K  + V  L  
Sbjct: 1232 MRLATLLPGYSFKEHLGM-RWVPLELTPYFLQYHIESKTYALVGT--RVKSCSSVYHLNA 1288

Query: 1038 ----DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA---GGPWQT--RATIPMQS 1088
                ++EV  +     L S+D         Y +++  P  +     PWQ    A I  + 
Sbjct: 1289 EGNKEEEVLLRPPTCVLPSLDY--------YVLQMYAPSTSLAEATPWQAIPHACIDFEP 1340

Query: 1089 SENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLF 1133
             E    +    L +  T    +  LA+G     GE++  RGR+++ 
Sbjct: 1341 WEVVTCMITAQLSSEQTFHGTKDYLALGANLSYGEEIPVRGRIIIL 1386


>gi|406865186|gb|EKD18229.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
            MB_m1]
          Length = 1443

 Score =  171 bits (433), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 265/1193 (22%), Positives = 477/1193 (39%), Gaps = 207/1193 (17%)

Query: 57   NLVVTAANVIEIYVVRVQ------EEGSKESKNSGETKRRVLMDGISAA----------- 99
            NL+V   ++++++  ++       EEG+  SK + +    +  DG+ A+           
Sbjct: 28   NLIVAKTSLLQVFTTKITSIELGIEEGA--SKQNDKWDPSLDNDGLDASFIGADSLLRPD 85

Query: 100  -----SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
                  L LV  Y L G + SLA +    + +    +++++ F DAK+S++E+D +  G+
Sbjct: 86   RARRTKLVLVAEYTLSGTITSLARIKTLSSKSGG--EALLVGFRDAKLSLVEWDPARPGI 143

Query: 155  RITSMHCFESPEWLHLKRG------RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
               S+H +E  E   L+R       +ES      +  DP  RC  +   G  + I+   Q
Sbjct: 144  STISIHYYEQDE---LQRSPWAPNLKESVN---YLIADPGSRCAALKFGGRNLGIIPFKQ 197

Query: 209  GGSGL-VGDEDTFGSGGGFSARIES---------------SHVINLRDLDMKHVKD--FI 250
                + + D D    G   + ++ +               S V+ L  LD   +      
Sbjct: 198  DDEDVNMDDWDEEIDGPRPADKVITKATNSSNDKETPYGPSFVLRLATLDPNLINPIHLA 257

Query: 251  FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
            F++ Y EP   IL   ++  +  +  +  H T M+  L +    +    I S   LP+D 
Sbjct: 258  FLYEYREPTFGILSSSQMPASSLLFERRDHLTYMVFTLDLQQ--RASTTIMSVTGLPYDL 315

Query: 309  YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
            ++++ + +P+GG L++G N  IH      +  +A+N +A    S   + +S   + L+ +
Sbjct: 316  FEVVPLDAPVGGALLIGTNELIHIDQAGKANGVAVNVFAKQCTSFGLVDQSGLDMRLEGS 375

Query: 368  HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---I 418
                L  Q+   ++  +TG++ +L+   DGR V  L + + +     SV+ + ++T   I
Sbjct: 376  KIEQLSIQSGEMIIFLQTGEIAILSFHMDGRSVSSLSVRRVSAEAGGSVIPARVSTLSHI 435

Query: 419  GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
            G +  F+GS   DS+++ +   S  S   S  K     IE    ++        D   D 
Sbjct: 436  GQNTLFVGSACADSMVLGW---SRKSNQVSRRKPRVEVIEDADDASLDELDDEDDDADDD 492

Query: 479  VNGEELSLYGSASN------NTESAQKTFSFAVRDSLVNIGPLKDFSYG---LRINADAS 529
            + GE  S+   A+N         S    + F V DSLVNI P+ + ++G   L  N D  
Sbjct: 493  LYGEGPSIIQDATNGVAKSDTVNSKAGDYVFQVHDSLVNIAPIVNITFGNASLSQNEDEK 552

Query: 530  ATGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGH 561
               +  +   ELV                          E P  +GIWT+  K  + +G 
Sbjct: 553  LDSVGVRGYLELVASVGKQRAGALAVIHQNIQPKVIGRFEFPEARGIWTMSAKRPAEKGL 612

Query: 562  NADSSRMA-----AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQG 609
             A   + +     A D +Y   +I+S +A +   ET+D+    + + +           G
Sbjct: 613  EAKKEKSSTSGDYAIDAQYDRLMIVS-KALSDGTETSDVYALTSANFEALTGTEFEPAAG 671

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
             TI AG L    RVIQV +   R  DG+  + Q L       +  +G+E   ++S S AD
Sbjct: 672  STIEAGTLGNGNRVIQVLKSEVRSYDGNLGLAQILPM----YDDDTGAE-PKIVSASFAD 726

Query: 669  PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
            PY+LL   D SI +   D +     ++       + K ++ C LY D             
Sbjct: 727  PYLLLFRDDSSIFVAQSDENNELEEIEREDDALLATKWLTGC-LYAD------------- 772

Query: 729  AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHI 787
               S GV   +    G  +++ ++   +    GAL I+ +P+  N V+  +         
Sbjct: 773  ---SRGVFAPVQSDKGQKVEE-NVMMFLLSAGGALHIYALPDLSNAVYVAEGLC------ 822

Query: 788  VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-PFLFAILT 846
               ++   L  +     S++ E              + EL +       +R P+L    +
Sbjct: 823  ---FVPPVLSAAYAARRSAARE-------------TITELVVADLGDETARSPYLILRPS 866

Query: 847  DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
               +  Y+      P +TS      S S     S +   ++ N   +R P    +  ET 
Sbjct: 867  TDDLTIYE------PFHTS------SESSGGLASTLQFLKIHNPHLARNP--DVSAAETA 912

Query: 907  HGAPCQR---ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
             G    R   + +  N+ G+   FL G  P + M   +       L    +   +  H  
Sbjct: 913  DGIQETRDEPMRVISNLGGYCTVFLPGGSPSFIMKSAKSTPKVISLQGLGVRGMSSFHTE 972

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
             C+ GFIY    G+ ++ QLP  +T+       + I L    H + Y      Y    S 
Sbjct: 973  GCDRGFIYTDVDGLARVSQLPKDTTFAELGVSLQKIELGQEIHGVAYHPPTECYVAATST 1032

Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY--TVEEYEVRILEPDRAGGPWQTR 1081
                          + E+  + DNH+        T+  T+E+  +R++ P      W   
Sbjct: 1033 EA------------EFELPKEDDNHHPQWAKEQITFKPTMEQGRLRLINPVN----WTVV 1076

Query: 1082 ATIPMQSSENALTVRVVTLFNT-TTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
              + +   E  + ++ + L  +  T E + L+A+GT   +GED+A +GR+ ++
Sbjct: 1077 DEVELDPFEVIMCIKTLILETSEITNERKQLIAVGTGISKGEDLAIKGRIHVY 1129


>gi|389740693|gb|EIM81883.1| hypothetical protein STEHIDRAFT_65512 [Stereum hirsutum FP-91666 SS1]
          Length = 1438

 Score =  171 bits (432), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 274/1205 (22%), Positives = 484/1205 (40%), Gaps = 190/1205 (15%)

Query: 47   PSKRGIGPVPNLVVTAANVIEIYVVRV------------QEEGSKESKNSGETKRRVLMD 94
            P  +   P+ NLVV  +N++ I  VR             +E      K +   +  V MD
Sbjct: 31   PDSQKALPLFNLVVARSNLLRILEVREVPTLRPIHLDDERERRGNVRKGTEPVEGEVEMD 90

Query: 95   ---------GISAAS----------LELVCHYRLHGNV---ESLAILSQGGADNSRRRDS 132
                     G S AS             V  YRLHG V   E++ I+S          D 
Sbjct: 91   EQGEGYVNMGASTASNGAPRPTVLRFYFVRDYRLHGTVTGLETVRIMSS----LEDEMDR 146

Query: 133  IILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRC 191
            ++++F+DAKI++LE+    H L   S+H +E +P+ L L    +S      ++ DP  +C
Sbjct: 147  LLVSFKDAKIALLEWSTDTHSLSTVSIHTYERAPQLLSL----DSNMFTAQLRTDPLSQC 202

Query: 192  GGVLVYGLQMIILKASQGGSGL-VGDED-TFGSGGGFSARIESSHVINLR---DLDMKHV 246
              + +      IL   Q    L V D+D T      +S     S +++L    D  +++V
Sbjct: 203  AALSLPKDAFAILPFYQTQVDLDVMDQDQTRARDVPYSP----SFILDLAAEVDERIRNV 258

Query: 247  KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
             DF+F+ G+  P + +L + + TW GR+     T  +   +++   + +P+I S   LP+
Sbjct: 259  VDFVFLPGFSHPTVAVLFQAQQTWTGRLKEYKDTMRLFIFTLNVVTRSYPIITSVEGLPY 318

Query: 307  DAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS---- 361
            D   ++  P+ +GGV+V+ +N+ IH    S   ALA+N +   +    ++P ++ +    
Sbjct: 319  DCLSVVPCPAALGGVVVLTSNSVIHIDQASRRVALAVNGW---MPRVSDMPVTALAQGDQ 375

Query: 362  --VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSD 414
              +EL+ +  T++ +    +  K G +  +    DG+VV +L +S      T PSV    
Sbjct: 376  GRLELEGSRMTFVDDKTLFIVLKDGTIHPVEFFVDGKVVSKLSISPPLAQTTTPSV---- 431

Query: 415  ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
            I  I N  FF+GS  G S L++           SG++E+  D + +    K    +  D+
Sbjct: 432  IRKITNEHFFVGSTAGPSALLKV----------SGVEEDIQD-DVEEIDGKTAPAAVVDS 480

Query: 475  LQDMVNGEELSLYGSA--------------SNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
            +  M   ++  LYGS+              + +T   +     ++ DSL   GP+ D ++
Sbjct: 481  VDGMDIDDDDDLYGSSKADPTPTANGNAVETTSTTRKRTVIHLSLCDSLPAHGPISDMTF 540

Query: 521  GLRINAD------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGH 561
             +  N D       +ATG      + L +  LP           G +G+W++  + +   
Sbjct: 541  SMTKNGDRAVPELVAATGSGLLGGFTLFQRDLPIRTKRKLHAIGGARGVWSLPVRQAVRV 600

Query: 562  NADSSRMAAYD-DEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQG-RTIAAGNL 617
            N  S +         +  +IIS +A     +   A   ++   ++   + G  T+ A   
Sbjct: 601  NGVSYQTPQNPLRSDNDTIIISTDATPSPGISRIATRSSKTDLNITTRIPGVTTVGAAPF 660

Query: 618  FGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
            F    ++ V     R+L  DGS         P     G+ +  + + + SI DPYV +  
Sbjct: 661  FQGTAILHVLSNAIRVLEPDGSERQ------PIKDMDGN-NYRAKIKNCSICDPYVFVLR 713

Query: 676  SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKGP-----EPWLRKTSTD 728
             D +I L +G+     +  +  +P   ++S+  ++ C      G         L  ++T 
Sbjct: 714  EDETIGLFIGETERGKIRRKDMSPMGDKTSRY-IAGCFFSDTTGTFQAHVNSSLNGSNTT 772

Query: 729  AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
               +T   +++  A      Q   + ++    G +EI+ +P    VF+     + +  +V
Sbjct: 773  KQNATSTLQSVMNA-----GQKTQWLLLVRPQGVMEIWTLPKLTLVFSTTALATLQPLLV 827

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D+    AL          S     Q RK     + + ++ +        RP LF +L  G
Sbjct: 828  DSLDPPAL----------SSLPQDQPRKP--QELDIDQILVAPLGETSPRPHLFVLLRSG 875

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--P 906
             +  Y+A  FE P     + DP   SR  S+  V   ++ +  F     D   +E++   
Sbjct: 876  QLAIYEAVSFELP-----TGDPEPASRP-SILPVKLVKVLSRAFDIQHPDEQPQEKSVLA 929

Query: 907  HGAPCQRITI-FKNISGHQ----GFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
                 QR+ I F      +    G F +G RPCW +   +  +RVH       + +FT  
Sbjct: 930  ELKKIQRLFIPFVTSPAPEKTFTGVFFTGDRPCWILGTDKGGIRVHSS-GHAVVHSFTPC 988

Query: 961  HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY--FAEKNLYP 1018
               +    F+  T +G   +  +P    +         +P +  P    Y          
Sbjct: 989  SLWDSKGDFLLYTDEGPCLLEWMPDVQLHTE-------LPSRFMPRSRAYTNVVFDPFTC 1041

Query: 1019 LIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPW 1078
            LIV    LK   Q  S   D     + D  N+S        T +   + ++ PD     W
Sbjct: 1042 LIVGAASLK--AQFTSFDEDGNQTWEPDAPNISYP------TTDCSTLELITPDA----W 1089

Query: 1079 QTRATIPMQSSENALTVRVVTLFNTTTKENE-TLLAIGTAYVQGEDVAARGRVLLFSTGR 1137
             T       S+E    V  V L   +T   + + +A+GT   +GED+A +G   +F    
Sbjct: 1090 LTMDGYEFASNEIVNAVECVMLETQSTDSGQKSFIAVGTTINRGEDLAVKGATYIFEIVE 1149

Query: 1138 NADNP 1142
               +P
Sbjct: 1150 VVPDP 1154


>gi|452001482|gb|EMD93941.1| hypothetical protein COCHEDRAFT_1129958 [Cochliobolus heterostrophus
            C5]
          Length = 1385

 Score =  170 bits (431), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 255/1160 (21%), Positives = 459/1160 (39%), Gaps = 196/1160 (16%)

Query: 57   NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
            NLVV   ++++++ ++     V   G  E++N+      E     L    S A L LV  
Sbjct: 28   NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87

Query: 107  YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
            + L G V SLA +         R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88   FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYSLHTISIHYYENPD 145

Query: 167  ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ------------ 208
                  W    +   +F     +  DP  RC  +      + IL   Q            
Sbjct: 146  LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200

Query: 209  -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
             G      ++ T  + G  +    SS V+ L +LD  + H     F+H Y EP   I+  
Sbjct: 201  DGPKESKPEQQT--ASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 258

Query: 266  RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
               T    ++ +      S  ++    K    + S   LP+D  +++ +PSPIGG L+VG
Sbjct: 259  SRDTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVG 318

Query: 326  AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
            +N  IH      +  +A+N +A +  S     +S  ++ L+      L ++    L+   
Sbjct: 319  SNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALRLEGCSVELLSHEAGDVLVVLN 378

Query: 383  TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
             G L++LT   DGR V  + +                S  + +G    F+GS  GDS+++
Sbjct: 379  NGRLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVML 438

Query: 436  QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
             +T        +S L+ +  +   D            D   D+ N    ++   +A+ + 
Sbjct: 439  GWTS------TASHLRRKQSNANIDTDEDMSDEEDMEDMEDDLYNDTAPAVQKITAAASE 492

Query: 495  ESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATGISKQ 536
             +A  T++F + D L +I P+K+                   S G    A AS T ++++
Sbjct: 493  PTAPGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEVMLSTGR--GAAASITALNRE 550

Query: 537  SNYELV---ELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR- 586
             +   V   +LP  +G W V+ +       +     D     A + +Y  YL++S     
Sbjct: 551  LHPVTVATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGED 610

Query: 587  ----TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
                T+V E   + LTE  +      +G T+  G L    +V+QV     R  D S +  
Sbjct: 611  GTESTVVYEVNGNELTETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEIRTYD-SELNM 669

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
            D      + ESG+      V++ S ADPY+L+   D S+++               A  +
Sbjct: 670  DQILPMEDEESGN---EVNVINASFADPYLLVLREDSSVKIFR-------------ATGD 713

Query: 702  SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
               + V +  L             S   WLS  + ++            ++++ +    G
Sbjct: 714  GELEDVEATGL-------------SNSQWLSASLFKSASFT--------EVFAFLLTPEG 752

Query: 762  ALEIFDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKEN 818
             L +F V +      V + +S    ++   Y+  R A+K + TEI               
Sbjct: 753  GLRVFAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI--------------- 797

Query: 819  IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
                    LA     A    P L    +   ++ Y+A+                 S S S
Sbjct: 798  --------LAADLGDATTKSPHLIVRTSSDNLVIYKAF----------------HSPSRS 833

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCM 937
             +++    LR ++ S+  +  YT +     +  +  +    +I G+   F  G+ P +  
Sbjct: 834  AADLWTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLTLSDIGGYSTVFQRGTTPAF-- 891

Query: 938  VFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
            +F+E     P+   L    + + T  H  +C  GF Y+ S   L+I QLP  + Y +   
Sbjct: 892  IFKESSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGW 950

Query: 995  VQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD 1054
              + +P+ A  H + Y +   LY +    P    L+         E  H    + L   D
Sbjct: 951  ATRRMPMDAEIHALAYHS-SGLYIVGTGQPEEYQLDP-------SETYH----YELPKED 998

Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLA 1113
            +    T+E   +++L+       W    T  +   E  L+++ + L  +  T + + L+A
Sbjct: 999  MSFKPTIERGIIKLLDEKT----WTIIDTHVLDPQEVVLSIKTLNLEVSENTHQRKDLVA 1054

Query: 1114 IGTAYVQGEDVAARGRVLLF 1133
            +GTA + GED+A +G + +F
Sbjct: 1055 VGTAILHGEDLATKGCIRIF 1074


>gi|451849663|gb|EMD62966.1| hypothetical protein COCSADRAFT_92785 [Cochliobolus sativus ND90Pr]
          Length = 1405

 Score =  169 bits (429), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 254/1156 (21%), Positives = 457/1156 (39%), Gaps = 188/1156 (16%)

Query: 57   NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
            NLVV   ++++++ ++     V   G  E++N+      E     L    S A L LV  
Sbjct: 28   NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87

Query: 107  YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
            + L G V SLA +         R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88   FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYNLHTISIHYYENPD 145

Query: 167  ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ-------GGSGL 213
                  W    +   +F     +  DP  RC  +      + IL   Q         S  
Sbjct: 146  LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200

Query: 214  VGDEDTF----GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
             G +++      + G  +    SS V+ L +LD  + H     F+H Y EP   I+    
Sbjct: 201  DGPKESKLEQQAASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAASR 260

Query: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
             T    ++ +      S  ++    K    + S   LP+D  +++ +PSPIGG L+VG+N
Sbjct: 261  DTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVGSN 320

Query: 328  -TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTG 384
              IH      +  +A+N +A +  S     +S  ++ L+      L ++    L+    G
Sbjct: 321  EIIHVDQGGKTSGVAVNEFAKTCTSFPLSDQSDMALRLEGCSVELLSHEAGDVLIVLNNG 380

Query: 385  DLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
             L++LT   DGR V  + +                S  + +G    F+GS  GDS+++ +
Sbjct: 381  RLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVMLGW 440

Query: 438  TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTES 496
            T        +S L+ +  +   D            D   D+ N    ++   +A+ +  +
Sbjct: 441  TS------TASHLRRKQSNANIDTDEDMSDEEDMDDMEDDLYNDTAPAVQKITAAASEPT 494

Query: 497  AQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL--- 541
            A  T++F + D L +I P+K+            + G + ++    A       N EL   
Sbjct: 495  APGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEIMLSTGRGAAAAITALNRELHPV 554

Query: 542  ----VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR----- 586
                 +LP  +G W V+ +       +     D     A + +Y  YL++S         
Sbjct: 555  TAATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGEDGTES 614

Query: 587  TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
            T+V E   + LTE  +      +G T+  G L    +V+QV     R  D S +  D   
Sbjct: 615  TVVYEVNGNELTETDKGDFEREEGSTLFVGILAAGTKVVQVMRTEIRTYD-SELNMDQIL 673

Query: 646  GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
               + ESG+      V++ S ADPY+L+   D S+++               A  +   +
Sbjct: 674  PMEDEESGN---ELNVINASFADPYLLVLREDSSVKIFR-------------ATGDGELE 717

Query: 706  PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
             V +  L             S   WLS  + ++            ++++ +    G L +
Sbjct: 718  DVEATGL-------------SNSQWLSASLFKSASFT--------EVFAFLLTPEGGLRV 756

Query: 766  FDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKENIHSM 822
            F V +      V + +S    ++   Y+  R A+K + TEI                   
Sbjct: 757  FAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI------------------- 797

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
                LA     A    P L    +   I+ Y+A+                 S S S +++
Sbjct: 798  ----LAADLGDATTKSPHLIIRTSSDNIVIYKAF----------------HSPSRSAADL 837

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRE 941
                LR ++ S+  +  YT +     +  +   +   +I G+   F  G+ P +  +F+E
Sbjct: 838  WTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLALSDIGGYSTVFQRGTTPAF--IFKE 895

Query: 942  RLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
                 P+   L    + + T  H  +C  GF Y+ S   L+I QLP  + Y +     + 
Sbjct: 896  SSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGWATRR 954

Query: 999  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRT 1058
            +P+ A  H + Y +   LY +    P    L+         E  H    + L   D+   
Sbjct: 955  MPMDAEIHALAYHS-SGLYIIGAGQPEEYQLDP-------SETYH----YELPKEDMSFK 1002

Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1117
             T+E   +++L+       W    T  +   E  L+++ + L  +  T + + L+A+GTA
Sbjct: 1003 PTIERGIIQLLDEKT----WAIIDTHVLDPQEVVLSIKTLNLEVSENTHQRKDLIAVGTA 1058

Query: 1118 YVQGEDVAARGRVLLF 1133
             + GED+A +G + +F
Sbjct: 1059 ILHGEDLATKGCIRIF 1074


>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
            [Saccoglossus kowalevskii]
          Length = 984

 Score =  169 bits (428), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 197/808 (24%), Positives = 324/808 (40%), Gaps = 188/808 (23%)

Query: 423  FFLGSRLGDSLLVQFT--CGSGTSMLSSGLKE---EFGDIEADAPSTKRLRRSSSDALQD 477
             FLGSRLG+SLL+++       T  +++G K+   +    + + P+ K+    +SD +  
Sbjct: 6    LFLGSRLGNSLLLKYVEKAQESTDSVTNGAKKTEEDEETNKEEPPNKKKRTDDASDWIAS 65

Query: 478  MV-----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRI 524
             V     + +EL +YGS +    +   +++F V DS++NIGP      G         + 
Sbjct: 66   DVALLAEDVDELEVYGSQTQ-AGTQLTSYTFEVCDSIMNIGPCTKAVMGEPVFLSEEFQT 124

Query: 525  NAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSR 567
            N D      + +G SK     ++            ELPGC  +WTV     +  N D  +
Sbjct: 125  NPDPDMELVALSGYSKNGALSVLQRSIRPQVVTTFELPGCIDMWTVVGPPEK-ENKDQPK 183

Query: 568  MAAYDD---------EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
                ++           HA+LI+S +  +M+L T   + E+  S  +  QG T+ AGNL 
Sbjct: 184  EKTEEEGDKKPDALTNGHAFLILSRDDSSMILSTGQEIMELDHS-GFSTQGPTVYAGNLG 242

Query: 619  GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
                ++QV   G R+L+G    Q +               S ++  S++DPY LL    G
Sbjct: 243  NNAYILQVSPMGVRLLEGVNQLQHIPL----------DLGSPIVLCSVSDPYALLMSEKG 292

Query: 679  SI--------------RLLVGDP---------STCTVSVQTPAAIESSKKPVSSCTLYHD 715
             +              RL +  P         + C     +     SSK   +S      
Sbjct: 293  ELVLLTLKPDGFAGGHRLAISRPQIPQISRILTLCAYKDTSGMFTTSSKMESTSDETEEK 352

Query: 716  KGPEPWLRKTSTDAWLSTG-------VGEAIDGADGGPLDQGD----------------- 751
            K  +P +   S  + +S          GE+ D +   P  + +                 
Sbjct: 353  KITKPSVADISMTSEISNVDDEDEMLYGES-DASLFSPTKKEEKSSFLQTREVLSETKPT 411

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
             +  +  E+G LEI+ +P+F   F V  F  G   +VD+Y          ++ +S+  G+
Sbjct: 412  YWCAMSRENGVLEIYSLPDFKLAFLVKNFPMGFKVMVDSY----------QMTASAPGGS 461

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
             +  +++     V EL +      + +  L A + D  +  Y+A+               
Sbjct: 462  SKSDQQHDMMPIVKELLLIGLGHKNKKTHLLARV-DEDLYIYEAF--------------- 505

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
             T    S+ N     LR LRF +                                F+ G 
Sbjct: 506  -THDQSSLDN----HLR-LRFRKV-------------------------------FVCGP 528

Query: 932  RPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             P W  M  R  LR HP   DGS+  F   HN+NC  GF+Y    G L+IC LP+  +YD
Sbjct: 529  YPHWLFMTSRGALRSHPMHIDGSVTCFAPFHNINCPKGFLYFNKHGELRICVLPTHLSYD 588

Query: 991  NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV--PVLKPLNQVLSLLIDQEVGHQIDNH 1048
              WPV+KV PL+ TPH I+Y  E   Y ++ SV  P L+         I +  G   +  
Sbjct: 589  ALWPVRKV-PLRCTPHFISYHIESKTYAVVTSVSEPCLR---------ICKMTGDDKEFE 638

Query: 1049 NLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTK 1106
            ++   D     T+E++ +++  P      W+      I  +  E+   ++ V L +  T 
Sbjct: 639  DVERDDRFIFPTIEKFSLQLFSP----LSWEAIPNTKIDTEDWEHITGLKTVFLKSEGTV 694

Query: 1107 EN-ETLLAIGTAYVQGEDVAARGRVLLF 1133
               +  +A+ T  V GE+V  RGR+L+F
Sbjct: 695  SGLKGFIAVSTTIVYGEEVTCRGRILIF 722


>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
 gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
          Length = 1403

 Score =  167 bits (424), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 258/1168 (22%), Positives = 468/1168 (40%), Gaps = 193/1168 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V  + +++++ +     GS   ++  +T+ +          L LV  Y L G +  L
Sbjct: 28   NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++   +AK+S++E+D   H +  TS+H +E  + +H+  
Sbjct: 81   GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133

Query: 173  GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
               + A  P  + VDP  RC  VL +G + + IL   Q G  LV                
Sbjct: 134  WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192

Query: 215  -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
                  DE    +G  F     SS V+ +  L+  M H     F++ Y EP   IL+ + 
Sbjct: 193  HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252

Query: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
             T +  +  +      S  ++    +    + S   LP+D +K++A+P P+GG L++G N
Sbjct: 253  ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312

Query: 328  T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
              +H      + A+ +N +A    S     +S   + L+ +    L  +N   LL    G
Sbjct: 313  ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372

Query: 385  DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
             + +L+   DGR V  + L            K  PS        +G    F GS   DS+
Sbjct: 373  KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428

Query: 434  LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
            L+      G S  S+  K+       D         SS +  +D  +  E  LY      
Sbjct: 429  LI------GWSRPSTRPKDPPVQGAGD---DNIAELSSDEEEEDDEDIYEDDLYATPVPT 479

Query: 488  GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
            G+ +  + S + T    ++F + D L N+GP++D + G      + D      S  +N E
Sbjct: 480  GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 539

Query: 541  LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
            LV   G                            G W+V+ K  +  +   S        
Sbjct: 540  LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 599

Query: 575  YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
            Y  YL++S      + +++V   +    E T++ ++   + RTI  G L G  RV+QV +
Sbjct: 600  YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 659

Query: 629  RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
               R  D G  + Q       +      SE   V+  S ADPYVL+   D S+ LL  D 
Sbjct: 660  GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 714

Query: 688  STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
            S     ++    I S+     S +LY DK       +T+    LS  V            
Sbjct: 715  SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 760

Query: 748  DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
                +  ++  ES  L IF +PN    VFT +                   D   +I S+
Sbjct: 761  ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 798

Query: 807  SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
                     +E++  + V ++      +    P+L    ++  ++ Y+ Y      +T+ 
Sbjct: 799  EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 848

Query: 867  SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
            S +  S       S++   +  N  F +    +   + +  GA  + + +  ++ G++  
Sbjct: 849  STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 901

Query: 927  FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
            F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++++C+ P  
Sbjct: 902  FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 961

Query: 987  STYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1046
            + +D  W  +K I L      + Y +    Y +  S  V        +L  D E+  +  
Sbjct: 962  THFDGSWATRK-IGLGEQVDIVEYSSSSETYVIGTSQKV------DFNLPEDDEIHPEWR 1014

Query: 1047 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTT 1105
            N  +S +       +++  V++L P      W    +  ++++E  + V+ + L  +  T
Sbjct: 1015 NEVISFLP-----QIDQGSVKLLSPRT----WSIIDSHTLRTAERIMCVKCLDLEVSEIT 1065

Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLF 1133
             E   ++A+GTA  +GED+AARG + +F
Sbjct: 1066 HERRDMIAVGTAVTRGEDIAARGCIYIF 1093


>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces stipitatus ATCC 10500]
 gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces stipitatus ATCC 10500]
          Length = 1382

 Score =  167 bits (423), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 272/1163 (23%), Positives = 457/1163 (39%), Gaps = 203/1163 (17%)

Query: 57   NLVVTAANVIEIYVV-------RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
            NLVV   ++++IY +        V E G + + N    KR           L+L   Y L
Sbjct: 28   NLVVIKTSLLQIYNLVTETVTPSVLENGQRANDNE---KRN------ETTKLQLFAEYDL 78

Query: 110  HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
            HG V  +   S+    NSR   D+++L+F +AK+S++E++  I  +   S+H +E  +  
Sbjct: 79   HGTVTDI---SRINILNSRSGGDALLLSFRNAKLSLIEWNPEIQNISTVSIHYYEKEDIT 135

Query: 169  HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-------DTF 220
                  +       + VDP  RC  VL +G++ + IL   Q G  LV DE       D F
Sbjct: 136  LSPWAPDLSQCDSHLTVDPSSRCA-VLNFGVRNLAILPFHQAGDDLVMDEYDPDLDMDDF 194

Query: 221  GSGGGFSARIES----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
                  ++  +S                S V+ L  LD  + H     F+H Y EP   I
Sbjct: 195  TGQDKNTSHTDSKKGTEKDHTHQTPYAASFVLPLTALDPTLIHPIGLTFLHEYREPTFGI 254

Query: 263  LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
            L+    T A  +  +    + S  ++    +    + S   LP D   ++A+P+P+GG L
Sbjct: 255  LYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALPAPVGGAL 314

Query: 323  VVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
            ++G+N  IH      + A+A+N +A  + S   + +S   + L+ +    +  +    LL
Sbjct: 315  LIGSNELIHVDQSGKASAVAVNEFAKQVSSFPMIDQSDLGLRLENSVVEVINKECGDILL 374

Query: 380  STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT--------TIGNSLFFLGSRLGD 431
            +  TG+LVL+    DGR V    +    P+    D+         ++G+   F+GS   D
Sbjct: 375  TLSTGELVLVHFKIDGRSVSGPVVCPV-PTNSGGDVVGATASCSISLGSGKVFIGSEDTD 433

Query: 432  SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGEELSLYGS 489
            SLL+     S  S  S    E+  D + +      +      S A ++ VN        +
Sbjct: 434  SLLLDCYVSSAVSKKSKDHGEDQFDEDMNDEDDDDMYEDDLYSSAPKEAVNK-------A 486

Query: 490  ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
             SN   SA + +SF V D L ++  L+  + G   + D+ A  +S QS +EL EL    G
Sbjct: 487  VSNG--SASEDYSFRVLDKLPSLASLRSVTVGKPASRDSDAGNVS-QSVHEL-ELAAAYG 542

Query: 550  ---------IWTVYH----KSSRGHNADS------SRMAAYDDEYHAYLIISLEARTMVL 590
                     +    H     +  G  ADS      S  +  +D          E+ + V 
Sbjct: 543  SGRNGGVALLQRALHLDGISTMNGETADSVWNINTSTKSGRNDPSEG------ESPSYVF 596

Query: 591  ETADLLTEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARIL 634
             T    T+  E++ Y V G                 T+  G L G  RV+QV     R+ 
Sbjct: 597  LTKSNSTDNEETLVYAVNGSNLEPFSAPDVNPNGDPTVDIGTLAGNSRVVQVLTGEVRVY 656

Query: 635  DGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
            D +  M Q     P   E   G E   V S S ADPY+L+   D S+ LL  D S     
Sbjct: 657  DTNLGMAQ---IYPVWDED-EGDERFAV-STSFADPYLLIIRDDSSVLLLHSDESGDLDE 711

Query: 694  VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 753
            +  P  I SS+  +  C LY DK                  V E  D A       G+ Y
Sbjct: 712  LSKPETI-SSQSWLCGC-LYTDK----------------HNVFE--DNA------TGNTY 745

Query: 754  SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 813
              +  +   L +F +P    V   +                   D  + I SS +     
Sbjct: 746  MFLLNQECKLFMFRLPTRELVSVTEGV-----------------DYVSSILSSDQPAKRL 788

Query: 814  GRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVST 873
              +E I  + V +L         + P+L        ++ Y+               PV  
Sbjct: 789  NSRETIAELLVADLG----EISTASPYLIIRSATDDLIIYK---------------PVRE 829

Query: 874  SRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            +     + V+   ++  N    + P++A   +        +R+    +I G+    +SG+
Sbjct: 830  NSKDEKTGVTLKYIKESNHFLPKVPIEAAATDTQQRMPGLRRLA---DIGGYAAVLMSGA 886

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
             P   +   + L     +   SI   +   +  C  G IYV ++ +++ C+L   +  D 
Sbjct: 887  SPSLVVRTSKSLPRVFSIQSDSIRGISGFDSAGCEKGLIYVDNEHVVRTCRLHDNTQLDF 946

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WP++K IPL     ++ Y A    Y  +    V+   ++    L D +  H       +
Sbjct: 947  SWPIRK-IPLN---EEVDYLA----YSTVSGTYVVGTTHEQDFKLPDNDELHP----EWA 994

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1110
            + D+     V +  +++L P      W+   +    ++E    +  + L  +  T E + 
Sbjct: 995  NEDISLRPKVAQGSIKLLNPKT----WKVIDSYTFNAAERITAIENINLEISEKTSERKD 1050

Query: 1111 LLAIGTAYVQGEDVAARGRVLLF 1133
            ++ +GT + +GED+AARG V +F
Sbjct: 1051 MIVVGTTFAKGEDIAARGNVYVF 1073


>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
 gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
          Length = 1396

 Score =  167 bits (423), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 249/1174 (21%), Positives = 459/1174 (39%), Gaps = 204/1174 (17%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS------------------- 97
            N+V    N + IY VR +   SK    +   K+   M+G+                    
Sbjct: 38   NVVTARGNTLSIYEVREETATSKSPTEAKSQKKDDAMEGVKEERQTPVVQVRSLSKKTYP 97

Query: 98   ---------AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFD 148
                     +    LV  +RLHG V  L  +    +      D ++++F+DAKI++LE+ 
Sbjct: 98   DSDSHSQPLSTKFHLVREHRLHGVVTGLQAVKIISSLEDHL-DRLLVSFKDAKIALLEWS 156

Query: 149  DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
             +   L   S+H +E    + +     +F     ++VDPQ RC  + +      IL   Q
Sbjct: 157  TATQDLLTVSIHTYERAIQM-VATDISAFTSE--LRVDPQSRCAALSLPKDAFAILPPCQ 213

Query: 209  GGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHE 265
                +  D                S ++NL    +  +++V DF F+ G+  P + +L+E
Sbjct: 214  VSDSVCRD-----------VPYSPSFILNLPSEVESGIRNVIDFTFLPGFSNPTVAVLYE 262

Query: 266  RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
               TW GR++ +  T  ++  ++    +++P+I  A  LP D   +LA PS  GGV+VV 
Sbjct: 263  TYQTWTGRLNEQKDTVKMAFFTLDIVNRRYPVIGLATGLPCDCLSVLACPS-TGGVMVVA 321

Query: 326  ANTIHYHSQSASCALALNNYAVSLDSSQELP------RSSFSVELDAAHATWLQNDVALL 379
            +N+I Y  QS    +   N  +   S   LP        + ++EL+ + + ++ +  A +
Sbjct: 322  SNSIIYVDQSGRKVVLPVNAWIPRMSDIALPTNLTPEEQARTLELEGSRSIFVDDKTAFI 381

Query: 380  STKTGDLVLLTVVYDGRVVQRLDL-----SKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
              K G +  + +V  GRVV +L L       T PS+L      I N    +GS  GDS  
Sbjct: 382  ILKDGTIYPVELVTAGRVVSKLALGTPLAKTTIPSILRR----INNDYLLVGSASGDS-- 435

Query: 435  VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL--QDM-VNGEELSLYGSAS 491
                     ++LS+   EE  D + D  +      +S  AL  QD+ ++ ++  +YG + 
Sbjct: 436  ---------ALLSTSWVEEVIDDDVDMEAN-----TSVAALEQQDIEMDDDDDDIYGPSI 481

Query: 492  NNTESAQK------------TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGI 533
              T ++QK                +  D+L   GP+ D ++ +  N D       +ATG 
Sbjct: 482  IKTGTSQKESAAPMSKKTRSVLRLSFCDALPAYGPIADLTFTVGKNGDRPVAELVTATGS 541

Query: 534  SKQSNYELVE--LP-----------GCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYL 579
                 + L +  LP           G +G+W++   +SS    A+           H  L
Sbjct: 542  GHLGGFTLFQKDLPLRKKKKLPIISGARGVWSLPIRRSSSAAVAE-----------HDTL 590

Query: 580  IISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            IIS +A       R  V  T   L+ V+      V G TI AG  F R  ++ V     R
Sbjct: 591  IISTDANPSPGFSRLAVRATKGDLSVVSR-----VNGMTIGAGPFFQRTAILHVMTNAIR 645

Query: 633  IL--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
            +L  DG+   + +D+               + + S SI DPYVL+   D +I L +G+ +
Sbjct: 646  VLEPDGNERQIIKDME---------GNVPRAKIKSCSICDPYVLIFREDDTIGLFIGETT 696

Query: 689  TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGP 746
               +  +  + +       ++   + D      +   + DA   T +     +D +    
Sbjct: 697  RGKIRRKDMSPMGEKSSRYTAGGFFTDTASVFRVYHQNADANTETTIPMHSVVDASSKSQ 756

Query: 747  LDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
                  + V+    G +EI+ +P    VF+     + +  + D+    AL   +      
Sbjct: 757  ------WLVLVRPQGVVEIWTLPKLTLVFSTTLLATLQNVLTDSQEPPALSPPQDPPRKP 810

Query: 807  SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
             E             + + ++ +        +P L  +L  G +  Y+A+          
Sbjct: 811  QE-------------LDIEQILLTNLGQSDPKPHLLVLLRSGHLAIYEAFATNQAPIVEP 857

Query: 867  SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
               P ++S  +    +++      R   T       ++       +    F       G 
Sbjct: 858  PLKPRASSLQIQFVKIASKAFEMQRTDETEKGILAEQK----KALRTFVPFACAGAPAGV 913

Query: 927  FLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
            F +G RP W +   +  ++++P     ++ AF+       +  F+  + +G   +C+   
Sbjct: 914  FFTGDRPHWIVATDKGGVQMYPS-GHAAVYAFSACTLWERSTEFLIYSEEG-QTLCE--- 968

Query: 986  GSTYDNYWPVQKVIPLKATPHQITY--FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGH 1043
               +   + + + +P++  P    Y     +    +IV+   L+            E G+
Sbjct: 969  ---WITEYEIGRPLPMRHIPRGRAYSNIVYEPASSMIVAAASLRARFASF-----DEDGN 1020

Query: 1044 QI---DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100
            QI   D   ++        TVE   + ++ P+     W T       ++E   T+  V L
Sbjct: 1021 QIWAPDGPGITEP------TVECSTLELISPEV----WATVDGYEFATNEFVNTMECVPL 1070

Query: 1101 FNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLF 1133
               +T+   +  +A+GT+ V+GED+A +G   +F
Sbjct: 1071 ETVSTEAGVKHFIAVGTSIVRGEDLAVKGATYIF 1104


>gi|147864212|emb|CAN80950.1| hypothetical protein VITISV_016701 [Vitis vinifera]
          Length = 262

 Score =  167 bits (422), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 89/148 (60%), Positives = 105/148 (70%), Gaps = 26/148 (17%)

Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLV 510
           +++ GDIE D PS KR RRSSSDALQDM N ++L LYG A N+TE++QKTFSF+V DSL+
Sbjct: 53  RKKVGDIEGDVPSAKRSRRSSSDALQDMFNSDKLPLYGLAPNSTETSQKTFSFSVSDSLI 112

Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYEL--------------------------VEL 544
           N+GPLKDF+YGLRINAD  ATGI KQSNYEL                          VEL
Sbjct: 113 NVGPLKDFAYGLRINADLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRPERITEVEL 172

Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYD 572
           PGCKGIWTVYHK++RGHNADS +M + D
Sbjct: 173 PGCKGIWTVYHKNTRGHNADSIKMVSAD 200


>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
          Length = 1393

 Score =  166 bits (419), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 242/1167 (20%), Positives = 469/1167 (40%), Gaps = 178/1167 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +L+V   ++++IY +  +     E  ++ +   ++L++            Y L G V  L
Sbjct: 28   DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++AF +AK+S++E+D    G+   S+H +E  +      
Sbjct: 77   CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130

Query: 173  GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
              +    G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS         
Sbjct: 131  VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187

Query: 223  ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
               GGG           +      S V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 188  HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247

Query: 267  ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
              T +  +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+
Sbjct: 248  VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307

Query: 327  NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
            N  +H      + A+ +N ++  + S     +S  ++ L+      L +     LL   T
Sbjct: 308  NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367

Query: 384  GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
            G++ ++    DGR V  + +         + I       T IG+   FLGS  GDS+L+ 
Sbjct: 368  GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427

Query: 437  FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
            ++  S ++       ++  D  AD        +S  D  +D +     + +L G   +  
Sbjct: 428  YSYSSSSTKKHRLQAKQVIDDSADMSEED---QSDDDVYEDDLYSTSPDTTLTGRRPSGE 484

Query: 495  ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
             SA   + F + D L+NIGPL+D + G R++ +   TG    S    +++   +G     
Sbjct: 485  SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544

Query: 551  -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
               V  +    H   S  + + D  + A L    EA             R  V+ T    
Sbjct: 545  GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604

Query: 597  TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
            ++  ES+ + V G                 TI+ G    R+RV+QV +   R  D    +
Sbjct: 605  SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664

Query: 640  TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
            TQ       ++     ++    +S S+AD  + +   D ++  L  D S     V     
Sbjct: 665  TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719

Query: 700  IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
            + S K    SC LY DK                TG+  +ID     P+ + D++  +   
Sbjct: 720  VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760

Query: 760  SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
               L ++ V +   + ++ +   G + ++                 SSE     G +EN+
Sbjct: 761  DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
                V +L  + WSA    P+L        ++ Y+ ++             VST     +
Sbjct: 803  IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
             ++  S+  N    R P    + + +      + + I  +ISG    F+ G+   + +  
Sbjct: 846  HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRT 905

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
                    +L   +  + + L    C+ GFIY+ SQ  ++ C+LP  + +D  W +++V 
Sbjct: 906  SASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLPPMTRFDYQWTLKRVH 965

Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRT 1058
              +   H + Y     +Y       VL   +     L  D E+  +  N  +S     R 
Sbjct: 966  LGEQVDH-LAYSTSSGMY-------VLGTCHATDFKLPEDDELHPEWRNEAISFFPSARG 1017

Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTA 1117
                   ++++ P+     W    +  + + E  + ++ ++L  +  T E + ++ +GTA
Sbjct: 1018 SF-----IKLVSPNT----WSIIDSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTA 1068

Query: 1118 YVQGEDVAARGRVLLFSTGRNADNPQN 1144
            + +GED+ +RG + +F   +   +P +
Sbjct: 1069 FARGEDIPSRGCIYVFEVVQVVPDPDH 1095


>gi|325189779|emb|CCA24259.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1911

 Score =  166 bits (419), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 150/568 (26%), Positives = 251/568 (44%), Gaps = 123/568 (21%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+LD+K  + DF F+ GY+EP ++ILHE    +  +GR +  + T  ++ LSI+  
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
            + HP+IW   NLP D ++++    PIGG L++  N I Y +Q+    + LN +A     
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501

Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
                    + +  + LP +S               S+ L   H  +L +   LLS    
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561

Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
            L +L +     RV    + + T   +L    S  +T         N   F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621

Query: 434 LVQF--------TCGSGTSMLSSG-----LKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
           L              SGT ++S G           D + +A   ++L+   S    D  +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681

Query: 481 GEELSLYG-----SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGLR 523
            ++  LYG      ++     +   FSF+              D +  IG +     G++
Sbjct: 682 EDDAFLYGPTLSQESTGGAMPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGVQ 741

Query: 524 INADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNA 563
            N+D++        + G SK  +  ++            EL GC+ +WTV   SS    +
Sbjct: 742 SNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPES 801

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
             +R       Y +YLI+S+  RTM+L T + +  + +   ++  G T+ A NLF +RR+
Sbjct: 802 QVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRRI 855

Query: 624 IQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSEN 658
           +QVF++G R++  + +                      TQ++ F   + ESG     + N
Sbjct: 856 VQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTAN 914

Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGD 686
             ++SV   DPY+LL ++DGSIRLL GD
Sbjct: 915 VGIVSVDTIDPYILLLLTDGSIRLLEGD 942



 Score = 53.5 bits (127), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 69/317 (21%), Positives = 127/317 (40%), Gaps = 65/317 (20%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
            +CY  G+L ++ VP+F  +            +++T   E+ + S    +T  +  S+ G 
Sbjct: 1085 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1144

Query: 811  ----------TGQGRKENIHSMK--VVELAMQRWS----AHHSRPF----LFAILTDGTI 850
                      T +GR    H++   V ++A+ R       H+S+ F    L   L +G +
Sbjct: 1145 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1204

Query: 851  LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
            + Y+      P   S  D    +   + V+    +R    ++  +   +A T +E   G+
Sbjct: 1205 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1260

Query: 910  P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
                                +T F N++ + G F  G+ P W +  + +     L +   
Sbjct: 1261 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1320

Query: 949  LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
              D +        +++FT  H+ NC +GF+Y  S G L++C+LPS          N + +
Sbjct: 1321 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1380

Query: 996  QKVIPLKATPHQITYFA 1012
            QKV    AT H + Y  
Sbjct: 1381 QKV-RFGATIHHLLYLG 1396


>gi|297722899|ref|NP_001173813.1| Os04g0252200 [Oryza sativa Japonica Group]
 gi|255675253|dbj|BAH92541.1| Os04g0252200, partial [Oryza sativa Japonica Group]
          Length = 432

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 85/125 (68%), Positives = 101/125 (80%), Gaps = 1/125 (0%)

Query: 1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1081
            SV   +PLNQVLS + DQE  H +DN   S+  LH+TYTV+E+EVRILE ++ GG W+T+
Sbjct: 30   SVCSFRPLNQVLSSMADQESVHHMDNDVTSTDALHKTYTVDEFEVRILELEKPGGHWETK 89

Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1141
            +TIPMQ  ENALTVR+VTL NTTTKENETLLAIGTAYV GEDVAARGRVLLFS  + ++N
Sbjct: 90   STIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVLGEDVAARGRVLLFSFTK-SEN 148

Query: 1142 PQNLV 1146
             QNLV
Sbjct: 149  SQNLV 153


>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
 gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
          Length = 1402

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 259/1168 (22%), Positives = 468/1168 (40%), Gaps = 194/1168 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V  + +++++ +     GS   ++  +T+ +          L LV  Y L G +  L
Sbjct: 28   NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++   +AK+S++E+D   H +  TS+H +E  + +H+  
Sbjct: 81   GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133

Query: 173  GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
               + A  P  + VDP  RC  VL +G + + IL   Q G  LV                
Sbjct: 134  WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192

Query: 215  -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
                  DE    +G  F     SS V+ +  L+  M H     F++ Y EP   IL+ + 
Sbjct: 193  HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252

Query: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
             T +  +  +      S  ++    +    + S   LP+D +K++A+P P+GG L++G N
Sbjct: 253  ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312

Query: 328  T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
              +H      + A+ +N +A    S     +S   + L+ +    L  +N   LL    G
Sbjct: 313  ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372

Query: 385  DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
             + +L+   DGR V  + L            K  PS        +G    F GS   DS+
Sbjct: 373  KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428

Query: 434  LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
            L+      G S  S+  K    D          +   SSD  +D  +  E  LY      
Sbjct: 429  LI------GWSRPSTRPK----DPPVQGAGDDNIAELSSDEEEDDEDIYEDDLYATPVPT 478

Query: 488  GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
            G+ +  + S + T    ++F + D L N+GP++D + G      + D      S  +N E
Sbjct: 479  GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 538

Query: 541  LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
            LV   G                            G W+V+ K  +  +   S        
Sbjct: 539  LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 598

Query: 575  YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
            Y  YL++S      + +++V   +    E T++ ++   + RTI  G L G  RV+QV +
Sbjct: 599  YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 658

Query: 629  RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
               R  D G  + Q       +      SE   V+  S ADPYVL+   D S+ LL  D 
Sbjct: 659  GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 713

Query: 688  STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
            S     ++    I S+     S +LY DK       +T+    LS  V            
Sbjct: 714  SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 759

Query: 748  DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
                +  ++  ES  L IF +PN    VFT +                   D   +I S+
Sbjct: 760  ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 797

Query: 807  SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
                     +E++  + V ++      +    P+L    ++  ++ Y+ Y      +T+ 
Sbjct: 798  EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 847

Query: 867  SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
            S +  S       S++   +  N  F +    +   + +  GA  + + +  ++ G++  
Sbjct: 848  STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 900

Query: 927  FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
            F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++++C+ P  
Sbjct: 901  FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 960

Query: 987  STYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID 1046
            + +D  W  +K I L      + Y +    Y +  S  V        +L  D E+  +  
Sbjct: 961  THFDGSWATRK-IGLGEQVDIVEYSSSSETYVIGTSQKV------DFNLPEDDEIHPEWR 1013

Query: 1047 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTT 1105
            N  +S +       +++  V++L P      W    +  ++++E  + V+ + L  +  T
Sbjct: 1014 NEVISFLP-----QIDQGSVKLLSPRT----WSIIDSHTLRTAERIMCVKCLDLEVSEIT 1064

Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLF 1133
             E   ++A+GTA  +GED+AARG + +F
Sbjct: 1065 HERRDMIAVGTAVTRGEDIAARGCIYIF 1092


>gi|396471273|ref|XP_003838832.1| similar to cleavage and polyadenylation specificity factor subunit A
            [Leptosphaeria maculans JN3]
 gi|312215401|emb|CBX95353.1| similar to cleavage and polyadenylation specificity factor subunit A
            [Leptosphaeria maculans JN3]
          Length = 1402

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 260/1167 (22%), Positives = 454/1167 (38%), Gaps = 205/1167 (17%)

Query: 57   NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
            NLVV   ++++I+ ++     V  EG  E  N+      E          + A L LV  
Sbjct: 28   NLVVAKNSLLQIFEIKSTTTEVTPEGGDEVDNAAANLDTEAADVQFQRTENTAKLVLVAE 87

Query: 107  YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
            + L G V SLA +    A N++ R +++++AF DAK+S++E+D   + L   S+H +E+P
Sbjct: 88   FPLAGTVISLARIK---ALNTKSRGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144

Query: 166  E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGG-------SG 212
            +      W    +   +F     +  DP  RC  +      + IL   Q         S 
Sbjct: 145  DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQSDLVEDDYDSD 199

Query: 213  LVGDEDT------FGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
            L G  DT        SGG  + +    SS V+ L +LD  + H     F+H Y EP   I
Sbjct: 200  LDGPRDTKPDQAEAPSGGETTHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHQYREPTFGI 259

Query: 263  LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
            +          ++ +      S  ++    K    + S   LP+D  +++ +P PIGG L
Sbjct: 260  IAASRAAAPSLLANRKDILTYSVFTLDLEQKASTTLLSVTGLPYDISRVVPLPHPIGGAL 319

Query: 323  VVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
            ++G N  IH      +  +A+N +A +  S     +S  ++ L+  +   L  D    +L
Sbjct: 320  LLGNNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALHLEGCNVELLSQDTGDVVL 379

Query: 380  STKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLTSDITTIGNSL---FFLGSRLGDS 432
                G L+++T   +GR V  + +          VL +  +   N +    F+GS  G+S
Sbjct: 380  VLNNGRLLIMTFTLEGRTVSGMTIQTVAADHGGHVLKAGSSCTSNLVRGRLFIGSEDGES 439

Query: 433  LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG--SA 490
            +L+      G S  ++ L+    ++  D   T        D L D +  +        +A
Sbjct: 440  VLL------GWSSATASLRRRHSNVGLDGDGTSEEEEEDIDDLDDDLYNDTAPAVQKITA 493

Query: 491  SNNTESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATG 532
            + +  +   T+SF + D+L +I P++D                   S G    A  + TG
Sbjct: 494  AASEPTPPGTYSFRIHDTLPSIAPIRDAVLHPGKVTDSLNRGEIMLSTGR--GAAGAITG 551

Query: 533  ISKQ---SNYELVELPGCKGIWTVYHKSS------RGHNADSSRMAAYDDEYHAYLIISL 583
            + ++    +    ELP   GIW V+ +             D+    + D +Y  YL++S 
Sbjct: 552  LDRELHPVSLAASELPSTHGIWAVHARKQAPGGVVTAFGEDTEANMSTDVDYDQYLVVSK 611

Query: 584  EAR-----TMVLET-ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
             +      T+V E   + L+E  +      +G T+  G L    +V+QV     R  D S
Sbjct: 612  TSEDGSESTVVYEVHGNELSETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEVRTYD-S 670

Query: 638  YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP 697
             +  D      + E+G+      V++ S ADPY+L+         L  D S   +     
Sbjct: 671  ELNMDQILPMEDEETGN---ELRVINASFADPYLLV---------LREDSSVKILKASGD 718

Query: 698  AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVC 757
              +E  +                  R  S+  WLS  + ++    +        I++ + 
Sbjct: 719  GELEDLEA-----------------RGLSSTKWLSASLFKSATFTE--------IFAFLL 753

Query: 758  YESGALEIFDV--PNFNC-VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
               G L IF +  P   C V     F+     +     R A K + TEI           
Sbjct: 754  TPEGGLHIFAMSEPEKPCYVAEALGFLPPLLTVDFVPRRSAAKATITEI----------- 802

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                        LA          P L    +   ++ Y+A+ F                
Sbjct: 803  ------------LAADLGDVTTRSPHLIIRTSSDDLVIYKAFHFP--------------- 835

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRP 933
             S S +++    LR ++ ++  +  Y  +     A  +   +   ++ G+   F  G+ P
Sbjct: 836  -SRSAADLWTKNLRWIKLAQQHVPRYVEDAGSEDAGVESTLLALDDVCGYSTVFQRGASP 894

Query: 934  CWCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
             +  +F+E     P+   L    +   T  H  +C  GF YV S   L+I QLPS + + 
Sbjct: 895  SF--IFKEA-SSSPRVIGLSGKPVKGLTTFHTSSCERGFAYVDSTDTLRISQLPSRTHFG 951

Query: 991  NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVLSLLIDQEVGHQIDN 1047
            +     + +P+ A  + + Y     LY +    P   VL P           E  H    
Sbjct: 952  HLGWATRRLPMDAEVYALAYHP-AGLYVVGTGQPEDFVLDP----------SETYH---- 996

Query: 1048 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTK 1106
            + L   D+    +VE   +++++     G W    T      E  L ++ + L  + TT 
Sbjct: 997  YELPKEDISFKPSVERGVIKLIDE----GTWSIIDTHVFDPQEVVLCIKALNLEVSETTH 1052

Query: 1107 ENETLLAIGTAYVQGEDVAARGRVLLF 1133
            + + L+A+GT+ V GED+A +G + +F
Sbjct: 1053 QRKDLIAVGTSIVHGEDLATKGCIRIF 1079


>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
 gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
          Length = 1003

 Score =  165 bits (417), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 152/553 (27%), Positives = 228/553 (41%), Gaps = 105/553 (18%)

Query: 543  ELPGCKGIWTVY------HKSSRGHNADSSRMAAYDDEY-----------------HAYL 579
            +LPGC  +WTV            G  A+S+      +                   H +L
Sbjct: 107  DLPGCLDMWTVIGIPPESKPQEEGEKAESAGSEEKPEGEKEETKEEGPPDVDLTNSHGFL 166

Query: 580  IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
            I+S E  TMVL+T   + E+  S  +  QG T+ AGN+   + +IQV   G R+L G   
Sbjct: 167  ILSREDSTMVLQTGKEIMELDHS-GFSTQGPTVYAGNIGNNKYIIQVSPYGIRLLQGVKQ 225

Query: 640  TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTP 697
             Q L F          S+    +  S+ADPY L+   DG I LL  V DP      +   
Sbjct: 226  LQHLPFD---------SKGPAFVLASVADPYALVMSEDGQILLLTLVNDPYGSGHRLSAK 276

Query: 698  AAIESSKKPVSSCTLYHDKG------------PEPWLRKTSTDAW------LSTGV---- 735
                + K    +   Y D              P P + K + +        + TGV    
Sbjct: 277  KIDMAGKSQAITVCAYRDTSGLFTVSSPSTTTPAPEVEKDAAEPAAEDAVAMETGVDDED 336

Query: 736  ----GEAIDGADGGPLDQGDI-------------------YSVVCYESGALEIFDVPNFN 772
                GE      G  + + ++                   + V+C E+G+LEI+++P+F+
Sbjct: 337  EMLYGEPSAKPSGPAVVREEVKPSTSTVQEPVVKEVEPTHWCVICRENGSLEIYNLPDFS 396

Query: 773  CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
             V+ V  F +G   +VD++   +   +                K+      V E+ M   
Sbjct: 397  LVYLVKNFPTGMKLLVDSFQSTSSASTSQS------------DKQGDQLASVKEILMVGL 444

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
                SRP L A + D  +L Y+A+    P + S    P  T   +    V  + +   R 
Sbjct: 445  GHKGSRPHLLARV-DEDLLIYEAF----PYHLS----PSYTMLKIRFKKVQHNLILRERK 495

Query: 893  SRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQL 949
                  A  +EE+    G+  Q    F +ISG+ G F+ GS P W  M  R  LR+HP  
Sbjct: 496  GGKTKKAGDQEESDGQTGSRIQHFRTFTDISGYSGLFICGSSPHWLFMTSRGALRIHPMS 555

Query: 950  CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
             DG++  F+  HNVNC  GF+Y    G L+I  LP+  +YD  WPV+KV PL+ TPH + 
Sbjct: 556  IDGAVTCFSPFHNVNCPKGFLYFNRGGELRISVLPTHLSYDAPWPVRKV-PLRCTPHFVA 614

Query: 1010 YFAEKNLYPLIVS 1022
            Y  E  +Y +  S
Sbjct: 615  YHMECKVYAVAAS 627


>gi|325187036|emb|CCA21579.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1912

 Score =  165 bits (417), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 151/569 (26%), Positives = 250/569 (43%), Gaps = 124/569 (21%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+LD+K  + DF F+ GY+EP ++ILHE    +  +GR +  + T  ++ LSI+  
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
            + HP+IW   NLP D ++++    PIGG L++  N I Y +Q+    + LN +A     
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501

Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
                    + +  + LP +S               S+ L   H  +L +   LLS    
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561

Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
            L +L +     RV    + + T   +L    S  +T         N   F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621

Query: 434 LVQF--------TCGSGTSMLSSG-----LKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
           L              SGT ++S G           D + +A   ++L+   S    D  +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681

Query: 481 GEELSLYG------SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGL 522
            ++  LYG      S       +   FSF+              D +  IG +     G+
Sbjct: 682 EDDAFLYGPTLSQESTGGGKLPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGV 741

Query: 523 RINADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHN 562
           + N+D++        + G SK  +  ++            EL GC+ +WTV   SS    
Sbjct: 742 QSNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPE 801

Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
           +  +R       Y +YLI+S+  RTM+L T + +  + +   ++  G T+ A NLF +RR
Sbjct: 802 SQVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRR 855

Query: 623 VIQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSE 657
           ++QVF++G R++  + +                      TQ++ F   + ESG     + 
Sbjct: 856 IVQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTA 914

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
           N  ++SV   DPY+LL ++DGSIRLL GD
Sbjct: 915 NVGIVSVDTIDPYILLLLTDGSIRLLEGD 943



 Score = 53.5 bits (127), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 69/317 (21%), Positives = 127/317 (40%), Gaps = 65/317 (20%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
            +CY  G+L ++ VP+F  +            +++T   E+ + S    +T  +  S+ G 
Sbjct: 1086 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1145

Query: 811  ----------TGQGRKENIHSMK--VVELAMQRWS----AHHSRPF----LFAILTDGTI 850
                      T +GR    H++   V ++A+ R       H+S+ F    L   L +G +
Sbjct: 1146 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1205

Query: 851  LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
            + Y+      P   S  D    +   + V+    +R    ++  +   +A T +E   G+
Sbjct: 1206 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1261

Query: 910  P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
                                +T F N++ + G F  G+ P W +  + +     L +   
Sbjct: 1262 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1321

Query: 949  LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
              D +        +++FT  H+ NC +GF+Y  S G L++C+LPS          N + +
Sbjct: 1322 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1381

Query: 996  QKVIPLKATPHQITYFA 1012
            QKV    AT H + Y  
Sbjct: 1382 QKV-RFGATIHHLLYLG 1397


>gi|322694449|gb|EFY86278.1| Cleavage factor two protein 1 [Metarhizium acridum CQMa 102]
          Length = 1431

 Score =  164 bits (416), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 250/1078 (23%), Positives = 418/1078 (38%), Gaps = 160/1078 (14%)

Query: 131  DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDP 187
            ++++LA++ AK+ + E+D   H L  TS+H +E  E   L+        G  V   + DP
Sbjct: 127  EALLLAYKAAKMCLTEWDPQRHTLETTSIHYYEKDE---LQGAPWEMPFGDYVNYLEADP 183

Query: 188  QGRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FS 227
              RC         + IL  +Q    L  D   ED             T G G G      
Sbjct: 184  GSRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPCPVKEEPPTTNGDGPGDHDLVK 243

Query: 228  ARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
            +R   S V+ L  LD  + H     F+H Y EP   IL   +          H T  +  
Sbjct: 244  SRYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSPALGIKDHLTYKVFT 303

Query: 286  LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNN 344
            L +    +    I S   LP D ++++A+P+P+GG L+VG N  IH         +A+N+
Sbjct: 304  LDLQQ--RASTTILSVTGLPQDLFRVIALPAPMGGALLVGENELIHIDQSGKPNGVAVND 361

Query: 345  YAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLD 402
             A  + S   + +S   + L+      L ND+   LL    G L ++    DGR V ++ 
Sbjct: 362  MAKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIICFHIDGRTVSKIS 421

Query: 403  L----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
            +    ++   +++ S ++ I   G++  FLGS   DS+++ ++   G        K +  
Sbjct: 422  IRLVSAECGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKS 475

Query: 456  DIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGP 514
             +     +         D   D + G + SL   S + N  S     SF V+D+L++I P
Sbjct: 476  RLLDPDLALDVDDLDLDDDEDDDLYGNDSSLAKPSQTINGSSKPGEVSFRVQDTLLSIAP 535

Query: 515  LKDFSYGL-RINADASATGISKQSNYEL----------------------------VELP 545
            ++D + G      D+    +SK    EL                             + P
Sbjct: 536  IRDVACGAPAFVPDSEEATLSKGVTAELELACAVGRGFSGSVAILNREIQPKVIGRFDFP 595

Query: 546  GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEV 599
              +G WT+  K      A  +       +Y  Y+I++       E   +   TA     +
Sbjct: 596  EARGFWTMCVKKPLSKGAAVASDYDTTAQYDKYMIVAKVDLDGYETSDVYALTAAGFETL 655

Query: 600  TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSEN 658
             ++      G T+ AG +  + R+IQV +   R  DG   ++Q L   P   E       
Sbjct: 656  KDTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPR 712

Query: 659  STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP 718
            +T  S SI DPY+LL   D SI +     +     V  P     S K  S C LY+D   
Sbjct: 713  AT--SASIVDPYLLLNRDDSSIFIAQIHSNNELEEVFKPDGTLKSTKWASGC-LYND--- 766

Query: 719  EPWLRKTSTDAWLSTGVG-EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV 777
                    T     + V  +  D AD        I   +   +GAL ++ +P+      V
Sbjct: 767  --------TQGIFQSNVNKQKADAAD-------RIMMFLLSSAGALHVYALPD------V 805

Query: 778  DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
             K +         ++ EAL      ++++     G   KE+I  + V +L      A   
Sbjct: 806  SKPI---------FVAEALTSIPPFLSAAFVARKG-ASKESITEILVADLG----DAISQ 851

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
             P+L        +  Y+      P    +  D   ++  L    V+ S  +       P 
Sbjct: 852  TPYLIVRHASDDLTIYE------PVRCQEEGDAELSASLLFKKCVNTSLAKT-----APE 900

Query: 898  DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
             +    E P   P +R     N++G+   FL G+ P + +           L    +   
Sbjct: 901  VSEDDAEPPRFVPLRRCA---NVNGYGAVFLPGASPSFVLKSSHSEPRVIGLQGLGVRGM 957

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
            +  H   C+ GFIYV  +GI ++ QLPS +++ +     K I L      I+Y      Y
Sbjct: 958  STFHTEGCDRGFIYVDVEGIARVTQLPSNASFTDLGVSVKKIALDGDVGMISYHHPTGTY 1017

Query: 1018 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGG 1076
              +V+   L+P           E+    D H   +   +    T     ++++ P     
Sbjct: 1018 --VVACTKLEPF----------ELPRDDDYHKEWAKETIKFPPTTARGILKLINPVT--- 1062

Query: 1077 PWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
             W     + ++  E+  +++ + L  +  TKE + L+A+GTA  +GED+  RGRV +F
Sbjct: 1063 -WTVIHELELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQVF 1119


>gi|46120520|ref|XP_385083.1| hypothetical protein FG04907.1 [Gibberella zeae PH-1]
          Length = 1436

 Score =  164 bits (414), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 262/1148 (22%), Positives = 437/1148 (38%), Gaps = 183/1148 (15%)

Query: 72   RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
            R  ++   ES   G     V  D  +   L LV    L G V  LA +       GG   
Sbjct: 68   RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124

Query: 127  SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
                +++++A++ AK+ +  +D     L   S+H +E  E LH      SF      ++ 
Sbjct: 125  ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEKEE-LHGAPWEVSFDEYANYLEA 179

Query: 186  DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIESS 233
            DP  RC         + IL   Q    L  D+            +T     G S  +E  
Sbjct: 180  DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETAAVANGDSDTVEPP 239

Query: 234  HV------INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            +       + L D  + H   F F+H Y EP   IL   +          H T  +  L 
Sbjct: 240  YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLD 299

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
            +    +    I S  +LP D +K+LA+P+P+GG L++G N  IH      +  +A+N+ A
Sbjct: 300  LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357

Query: 347  VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
              + S     ++  ++ L+        ++N   LL    G + +++ + DGR V  L + 
Sbjct: 358  RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSIK 417

Query: 405  ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
                +   +VL S  +T   +G + FF+GS +GDS+++ +T   G        K    D 
Sbjct: 418  MVTDENGGNVLKSRASTASKLGKNTFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474

Query: 458  EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
            +      +       D   D+   E  +   + + N        SF + D+L++I P+KD
Sbjct: 475  DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534

Query: 518  FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
             + G                + +D   A   G  K  +  ++            E P  +
Sbjct: 535  LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594

Query: 549  GIWTVYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETADLLT----- 597
            G WT+  K         S  A   DEY A      Y+I++ +      ET+D+       
Sbjct: 595  GFWTMSVKKPLPKALGGS--AGVGDEYEAFGQHDKYMIVA-KVDLDGYETSDVYALTGAG 651

Query: 598  -EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGS 654
             E  +  ++    G T+ AG +  + R+IQV +   R  DG   +TQ L     + E+G+
Sbjct: 652  FETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA 709

Query: 655  GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
                  V S SI DPY+LL   D S+ L   D +     V+   A   + K  + C LY 
Sbjct: 710  ---EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYA 765

Query: 715  DKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-C 773
            D                 T      +  D G  ++  I   +   +GAL ++ +P+ +  
Sbjct: 766  D-----------------TEGAFQFNANDKGETEK--IMMFLLSSTGALHVYALPDLSKP 806

Query: 774  VFTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
            V+  +       H+  D  +R  L                   KE +  + V +L     
Sbjct: 807  VYVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG---- 844

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLR 888
                  P+L        +  Y+               P+   R    SN+SA+    ++ 
Sbjct: 845  DTISQSPYLILRNQTDDLTIYE---------------PIHHVRPGGESNLSAALSFKKMS 889

Query: 889  NLRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
            N+  + TP      + E P   P +R     NI+G+   FL GS P + +   + +    
Sbjct: 890  NVTLATTPAQTEDDDVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVI 946

Query: 948  QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
             L    I   +  H   C+ GFIY   +GI ++ Q PS + +       K +PL +    
Sbjct: 947  GLQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRG 1006

Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEV 1066
            I Y      Y  I      +P           E+    D H   +   L    T+    +
Sbjct: 1007 IAYHQPTGAY--IAGCMTSEPF----------ELPKDDDYHKEWAKETLSFPPTMPRGVL 1054

Query: 1067 RILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVA 1125
            +++ P      W     I ++S E+   ++ + L  +  TKE   L+A+GTA  +GED+ 
Sbjct: 1055 KLISPIT----WTVIHDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLP 1110

Query: 1126 ARGRVLLF 1133
             RGRV ++
Sbjct: 1111 IRGRVHVY 1118


>gi|426194401|gb|EKV44332.1| hypothetical protein AGABI2DRAFT_187183 [Agaricus bisporus var.
            bisporus H97]
          Length = 1413

 Score =  163 bits (412), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 247/1150 (21%), Positives = 459/1150 (39%), Gaps = 151/1150 (13%)

Query: 57   NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD-------GIS 97
            N+VV  +N++ I+ VR +      +   E +  G+T+R        V MD        I+
Sbjct: 49   NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108

Query: 98   AASLE-----------LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
             ++++            +  +RLHG V   E + I+    A    + D ++++F+DAKI+
Sbjct: 109  KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164

Query: 144  VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            +LE+ D+IH L   S+H +E +P+ + L        R  L +VDP  RC  + +    + 
Sbjct: 165  LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220

Query: 203  ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
            IL   Q  + L V ++D   S          S +++L    + ++++V DF+F+ G+  P
Sbjct: 221  ILPFYQTQAELDVMEQDQSQSK---DVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277

Query: 259  VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
             + IL + + TW GR+     T  +   ++    +   +I S   LP+DA+ LL   + I
Sbjct: 278  TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337

Query: 319  GGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
            GGV+V+  N++ Y  QS+   +L +N +A  +      P    ++  + L+   +  + +
Sbjct: 338  GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDAALKLHLEGCRSAMVDD 397

Query: 375  DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
                L  K G +  + ++ DG+ V +L ++   P++  + I T+   +     F+GS +G
Sbjct: 398  KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
             S+L++            G K     +  D          + D   D + G+        
Sbjct: 455  PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505

Query: 491  SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
            +N     +KT      ++RD L   GP+   ++ L +N +       +ATG      + L
Sbjct: 506  ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565

Query: 542  VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
             +  LP  K    +Y   SRG  +   R     +  H +    LI+S +       +   
Sbjct: 566  FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625

Query: 596  LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
            +  +    S++    G TI A   F R  ++ V     R+L  DG+          +  +
Sbjct: 626  VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
                     +   SIADP+VL+   D SI + V  D         +P   +SS+  ++ C
Sbjct: 679  KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737

Query: 711  TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
                  G    L + + D  +  +T   +   GA          + ++    G LEI+ +
Sbjct: 738  FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786

Query: 769  PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
            P  +  F+     S ++ + DT+   A                 Q        + + ++ 
Sbjct: 787  PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833

Query: 829  MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
            +         P L   L  G +  Y+A +    +N    D P +TS  +    ++A    
Sbjct: 834  LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
              R            +  +      +T  +    + G F +G RP W +   R  ++V+P
Sbjct: 892  IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951

Query: 948  QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
                  + AFT          F+  T  G + +  +P        +     +P+++ P  
Sbjct: 952  S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPD-------FQFDGPLPMRSIPRG 1003

Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI---DNHNLSSVDLHRTYTVEEY 1064
              Y     L+    S+ V     Q      D++ G+ I   D  N+SS       +V+  
Sbjct: 1004 RAY--SNVLFDPSTSLIVAASSLQSTFTSFDED-GNNIWEPDAPNISSP------SVDCS 1054

Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGED 1123
             + ++ PD     W T       ++E    + +VTL    T+   +  +A+GT   +GED
Sbjct: 1055 ALELIAPDI----WATMDGFEFATNEYINDMTIVTLETAATETGTKDFIAVGTTIDRGED 1110

Query: 1124 VAARGRVLLF 1133
            +A +G   +F
Sbjct: 1111 LAVKGATYIF 1120


>gi|393220097|gb|EJD05583.1| cleavage factor protein [Fomitiporia mediterranea MF3/22]
          Length = 1450

 Score =  163 bits (412), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 225/962 (23%), Positives = 394/962 (40%), Gaps = 157/962 (16%)

Query: 76  EGSKESKNSGETKRRVLMDGISAASLEL--VCHYRLHGNV---ESLAILSQGGADNSRRR 130
           EG  E    GE    +    +S  + +   +  +RLHG V   E + ILS          
Sbjct: 89  EGEVEMDTQGEGFVNMASKPLSMTTYQFHFIREHRLHGIVTGLEPVKILSS----TEDSL 144

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQG 189
           D ++++F+DAK+++LE+   +H L   S+H +E +P+   L   + +   G L +VDP  
Sbjct: 145 DRLLVSFKDAKLALLEWSPELHDLVTVSIHTYERAPQMTFLDPSKFT---GQL-RVDPLS 200

Query: 190 RCGGVLVYG--LQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
           RC  + +    L ++    SQ    LV  + T      +S       + N  D  +++V 
Sbjct: 201 RCAALSLPCDCLAILPFYHSQVDLDLVDADQTVSRDIPYSPSF-ILDLFNQVDHRIRNVI 259

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           DF F+ G+  P + +L + + TW GR+     TC +   ++      +P+I S  NLPHD
Sbjct: 260 DFAFLPGFNNPTLAVLFQTQHTWTGRLKEFKDTCNLFIFTLDLVTHMYPIITSVENLPHD 319

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQ-SASCALALNNYA--VSLDSSQELPRSSFS--V 362
            + +L   S +GGV+++  N++ Y  Q S    L +N +A  VS    Q+L     +  +
Sbjct: 320 CFAMLPCDSSLGGVVIISCNSLIYVDQASRKTVLPVNGWAARVSDMPMQQLRPEEMNRDL 379

Query: 363 ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI 418
            L+ AHAT++ +    + T+ G ++ + +V DGR   RL L    S+T    L  ++   
Sbjct: 380 HLEGAHATFVDSRTFFIITRDGLVLPVEIVMDGRTALRLALHPAMSQTTTPALVRNVAFR 439

Query: 419 GNS--------LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF------GDIEAD-APS 463
             S        + F+GS +G S+L++ T           ++EE       GDI A  A +
Sbjct: 440 SASGDQAPRSQILFVGSTVGPSVLLRVTW----------VEEEIQKDKQQGDIPAAVADN 489

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS---FAVRDSLVNIGPLKDFSY 520
              +     D +   V  E  + +G  +  +++A +T S    ++ DSL   GP+   ++
Sbjct: 490 PMAVDFDDEDDIYGDVAKETQTTHGQPTAASQAAVETKSVIHLSLCDSLSAYGPINSMAF 549

Query: 521 GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTV-YHKSSRG 560
            L  N D       +ATG ++   + L +             + G +GIW +   +S + 
Sbjct: 550 ALTRNGDRPTAELVAATGYARLGGFTLFQRDVPTRSKRKLHAVGGARGIWCIPVRQSLKV 609

Query: 561 HNADSSR-MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TIAAG 615
           + ++ SR +     E    +I+S +A      T       +   D  +  R    TI A 
Sbjct: 610 NGSERSRNLLPGSSEVVDTVIVSTDANPSPGLTR--FAAKSSRNDIAITARRTETTIGAA 667

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
             F R  +I V     R+L+      D S      +    ++   +    I+DP++L+  
Sbjct: 668 PFFQRTAIIHVTTDLIRVLE-----PDCSERQCIRDMDGSNKRPKIRFCCISDPFILVIR 722

Query: 676 SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKG------PEPWLRKTST 727
            D S+ L VGD     +  +  TP   E   +  + C      G       E      ++
Sbjct: 723 EDESLGLFVGDAERGRIRRKDMTPMG-EKVSRYSAGCFFLDQSGIFELHMSESSPTTGTS 781

Query: 728 DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
           D     G G      D     +G  + V+C   G +EI+ +P    VF+        + +
Sbjct: 782 DDKQRMGTGSLESAVDA---QRGTQWLVLCRPQGVVEIWTLPKLALVFSTSSLKDLPSVV 838

Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
            D++   AL        S  E+   + ++ +I  ++  ++        +  P L  +L  
Sbjct: 839 SDSFDPPAL--------SLPEDPPRKPQEADIELLQFAQIG-----ELYPHPHLIVMLRC 885

Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSV----------------------SNVSAS 885
           G +  YQA   +      K D P ST R+ ++                      S+V A 
Sbjct: 886 GQLAIYQAVAVD------KDDFPESTVRTSTLKIKFIKMGTRSFEPRQLEPAEKSSVIAE 939

Query: 886 RLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLR 944
           + R LR S  P       E             K +S   G F++G  PCW +   ++ L+
Sbjct: 940 QRRALR-SLVPFIVSPNSE-------------KRVS---GVFVTGDEPCWIVATDKDGLK 982

Query: 945 VH 946
           +H
Sbjct: 983 IH 984


>gi|429851266|gb|ELA26469.1| protein cft1 [Colletotrichum gloeosporioides Nara gc5]
          Length = 1411

 Score =  163 bits (412), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 267/1152 (23%), Positives = 452/1152 (39%), Gaps = 179/1152 (15%)

Query: 69   YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSR 128
            Y  R+ ++   ES   G     V  D  +   L LV  Y + G V  LA +      NS+
Sbjct: 66   YDRRLNDDDGLESSFLGGDGMLVRADRTNNTKLVLVAEYPIFGVVAGLARIK---IQNSK 122

Query: 129  RR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL----- 182
               +++++A   A++S++++D   H L   S+H +E  E         S   GPL     
Sbjct: 123  SGGEALLIATRVARLSLVQWDPEKHALEDVSIHFYEKEEL------EGSPFDGPLSNYPT 176

Query: 183  -VKVDPQGRCGGV---------LVYGLQMIILKASQGGSGLVGDED---------TFGSG 223
             +  DP  RC  +         L + L    +        + G            T G+ 
Sbjct: 177  HLAADPGSRCAALRFGSRYIAFLPFKLNDEDIDMDDWDEDVDGPRPAKEPSATAATNGTS 236

Query: 224  GGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTC 281
                    +S+V+ L  LD  + H     F+H Y EP   I+   +          H + 
Sbjct: 237  NLADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSMQRRSNTLPRKDHFSY 296

Query: 282  MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
             +  L +    +    I S  NLP D +K++A+P PIGG L+VG N  IH         +
Sbjct: 297  KVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPIGGALLVGTNELIHIDQSGKPNGV 354

Query: 341  ALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVV 398
            A+N +     S     +S   + L+  +   +  +N   L+    G L ++    DGR V
Sbjct: 355  AVNAFTKETTSFPLADQSELDLRLEHCYIEQMSPENGELLMVLSDGRLAIIAFKIDGRTV 414

Query: 399  QRLDL----SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
              L +    ++   +V+    S I+ +  + FF+GS   DSL+V      G +   +   
Sbjct: 415  SGLSVRIVPAEAGGNVVQCGASSISRLSKNAFFIGSTGSDSLVV------GVTRKQTQNA 468

Query: 452  EEFGDIEADAPSTKRLRRSSSDALQDMVNGE-ELSLYGSASNNTESAQKTFSFAVRDSLV 510
             +   +  D+ +         D   D + GE   ++  S + N        SF V DSL+
Sbjct: 469  RKKTRLVDDSFADDLEDEDIDDDDDDDLYGETTTTVQSSTAANGVPKGGEISFRVHDSLL 528

Query: 511  NIGPLKDFSYG--------------------LRINA-----DASATGISKQSNYELV--- 542
            ++ P+KD + G                    L++ A     +A+A  I  Q+    V   
Sbjct: 529  SLAPVKDMTTGKQAFIPESEDEKNSVGVVADLQLAAAVGKGNAAAIAIMNQNIQPKVIGK 588

Query: 543  -ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETA 593
             E P  +G WT  V     +    D    AA   E+ A      ++I+S +      ET+
Sbjct: 589  FEFPEARGFWTMCVQKPIPKSLQGDKGANAAVGSEFDASSIYDKFMIVS-KVDLDGYETS 647

Query: 594  DLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSF 645
            D+        + F         G T+ AG +    R+IQV +   R  DG   ++Q L  
Sbjct: 648  DVYALTGAGFEAFTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQILPM 707

Query: 646  GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
               + E+G+      V+S SIADPY+LL   D SI +   D +     ++       S +
Sbjct: 708  --LDEETGA---EPRVVSASIADPYLLLVRDDASIMVAQIDNNNELEEMEKQDDTILSTQ 762

Query: 706  PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
             ++ C LY D                +TGV   I    G P  Q  I+  +    GAL I
Sbjct: 763  WLAGC-LYTD----------------TTGVFAPIQTDKGTPESQ-SIFMFLLSAVGALYI 804

Query: 766  FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
            + +P+ +    V    +G T+ V  ++           + +   GT Q   E +  + V 
Sbjct: 805  YALPDLSKPVYV---AAGMTY-VPPFL---------SADYAVRRGTVQ---ETLTEVLVA 848

Query: 826  ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
            +L      A  S P+L     +  I  Y+    E        D     +++L    ++  
Sbjct: 849  KLG----DATESSPYLILRHANDDITIYEPIRLE------SQDKSEGLAKTLHFQKIT-- 896

Query: 886  RLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
               N   +++P++    +  E P   P +      NI+G+   FL G+ P + +   +  
Sbjct: 897  ---NPALAKSPVEVADDDANEQPRFVPLRPCA---NINGYSTVFLPGASPSFIIKSAKSA 950

Query: 944  RVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKA 1003
                 L    +   +  H   C  GFIY  S+G  ++ QLP+ ++++    ++K IP+  
Sbjct: 951  PKVLGLQGIGVRGMSSFHTEGCERGFIYADSEGHTRVTQLPADTSFELGVSIRK-IPVGD 1009

Query: 1004 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY-TVE 1062
                I Y      Y +  SV   +P           E+    D H   + +   T+  +E
Sbjct: 1010 AIGLIAYHPPMETYAVACSVS--EPF----------ELPKDDDYHKEWAKETITTFPQME 1057

Query: 1063 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQG 1121
               +++L P      W    T+ +   E A+ ++ + L  +  TKE   L+AIGTA  +G
Sbjct: 1058 RGIIKLLSP----ATWSVIDTVELDPHEVAMCMKTLHLEVSEETKERRMLIAIGTAINRG 1113

Query: 1122 EDVAARGRVLLF 1133
            ED+  RGR+L++
Sbjct: 1114 EDLPIRGRILVY 1125


>gi|408396642|gb|EKJ75797.1| hypothetical protein FPSE_03977 [Fusarium pseudograminearum CS3096]
          Length = 1427

 Score =  162 bits (411), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 262/1147 (22%), Positives = 434/1147 (37%), Gaps = 181/1147 (15%)

Query: 72   RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
            R  ++   ES   G     V  D  +   L LV    L G V  LA +       GG   
Sbjct: 68   RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124

Query: 127  SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
                +++++A++ AK+ +  +D     L   S+H +E  E LH      SF      ++ 
Sbjct: 125  ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEKEE-LHGAPWEVSFDEYANYLEA 179

Query: 186  DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIESS 233
            DP  RC         + IL   Q    L  D+            +T     G S  +E  
Sbjct: 180  DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETATVANGDSDTVEPP 239

Query: 234  HV------INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            +       + L D  + H   F F+H Y EP   IL   +          H T  +  L 
Sbjct: 240  YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQEPAHSLGQKDHLTYKVFTLD 299

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
            +    +    I S  +LP D +K+LA+P+P+GG L++G N  IH      +  +A+N+ A
Sbjct: 300  LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357

Query: 347  VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
              + S     ++  ++ L+        ++N   LL    G + +++ + DGR V  L + 
Sbjct: 358  RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSVK 417

Query: 405  ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
                +   +VL S  +T   +G + FF+GS +GDS+++ +T   G        K    D 
Sbjct: 418  MVTDENGGNVLKSRASTASKLGKNAFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474

Query: 458  EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
            +      +       D   D+   E  +   + + N        SF + D+L++I P+KD
Sbjct: 475  DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534

Query: 518  FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
             + G                + +D   A   G  K  +  ++            E P  +
Sbjct: 535  LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594

Query: 549  GIWTVYHKSSRGHNADSSRMAAYDDEY-----HAYLIISLEARTMVLETADLLT------ 597
            G WT+  K         S  A   DEY     H   +I  +      ET+D+        
Sbjct: 595  GFWTMSVKKPLPKALGGS--AGVGDEYETFGQHDKYMIVAKVDLDGYETSDVYALTGAGF 652

Query: 598  EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
            E  +  ++    G T+ AG +  + R+IQV +   R  DG   +TQ L     + E+G+ 
Sbjct: 653  ETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA- 709

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
                 V S SI DPY+LL   D S+ L   D +     V+   A   + K  + C LY D
Sbjct: 710  --EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYAD 766

Query: 716  KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CV 774
                             T     +  +D G  ++  I   +   +GAL ++ +P+ +  V
Sbjct: 767  -----------------TKGAFQLSASDKGETEK--IMMFLLSSTGALHVYALPDLSKPV 807

Query: 775  FTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
            +  +       H+  D  +R  L                   KE +  + V +L      
Sbjct: 808  YVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG----D 845

Query: 834  AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLRN 889
                 P+L        +  Y+               P+   R    SN+SA+    +  N
Sbjct: 846  TISQSPYLILRNQTDDLTIYE---------------PIRHVRPGGESNLSAALSFKKTSN 890

Query: 890  LRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
            +  + TP      E E P   P +R     NI+G+   FL GS P + +   + +     
Sbjct: 891  VTLATTPAQTEDDEVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVIG 947

Query: 949  LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQI 1008
            L    I   +  H   C+ GFIY   +GI ++ Q PS + +       K +PL +    I
Sbjct: 948  LQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNFTELGISVKKVPLGSDVRGI 1007

Query: 1009 TYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVR 1067
             Y      Y  I      +P           E+    D H   +   L    T+    ++
Sbjct: 1008 AYHQPTGAY--IAGCMTSEPF----------ELPKDDDYHKEWAKETLSFPPTMPRGILK 1055

Query: 1068 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1126
            ++ P      W     I ++S E+   ++ + L  +  TKE   L+A+GTA  +GED+  
Sbjct: 1056 LISPIT----WTVIHDIELESCESIECMKTLHLEVSEDTKERRFLVAVGTAVSKGEDLPI 1111

Query: 1127 RGRVLLF 1133
            RGRV ++
Sbjct: 1112 RGRVHVY 1118


>gi|409076059|gb|EKM76433.1| hypothetical protein AGABI1DRAFT_108759 [Agaricus bisporus var.
            burnettii JB137-S8]
          Length = 1413

 Score =  162 bits (410), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 246/1150 (21%), Positives = 458/1150 (39%), Gaps = 151/1150 (13%)

Query: 57   NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD-------GIS 97
            N+VV  +N++ I+ VR +      +   E +  G+T+R        V MD        I+
Sbjct: 49   NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108

Query: 98   AASLE-----------LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
             ++++            +  +RLHG V   E + I+    A    + D ++++F+DAKI+
Sbjct: 109  KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164

Query: 144  VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            +LE+ D+IH L   S+H +E +P+ + L        R  L +VDP  RC  + +    + 
Sbjct: 165  LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220

Query: 203  ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
            IL   Q  + L V ++D              S +++L    + ++++V DF+F+ G+  P
Sbjct: 221  ILPFYQTQAELDVMEQD---QSQAKDVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277

Query: 259  VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
             + IL + + TW GR+     T  +   ++    +   +I S   LP+DA+ LL   + I
Sbjct: 278  TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337

Query: 319  GGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
            GGV+V+  N++ Y  QS+   +L +N +A  +      P    ++  + L+   +  + +
Sbjct: 338  GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDATLKLHLEGCRSAMVDD 397

Query: 375  DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
                L  K G +  + ++ DG+ V +L ++   P++  + I T+   +     F+GS +G
Sbjct: 398  KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
             S+L++            G K     +  D          + D   D + G+        
Sbjct: 455  PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505

Query: 491  SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
            +N     +KT      ++RD L   GP+   ++ L +N +       +ATG      + L
Sbjct: 506  ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565

Query: 542  VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
             +  LP  K    +Y   SRG  +   R     +  H +    LI+S +       +   
Sbjct: 566  FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625

Query: 596  LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
            +  +    S++    G TI A   F R  ++ V     R+L  DG+          +  +
Sbjct: 626  VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
                     +   SIADP+VL+   D SI + V  D         +P   +SS+  ++ C
Sbjct: 679  KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737

Query: 711  TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
                  G    L + + D  +  +T   +   GA          + ++    G LEI+ +
Sbjct: 738  FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786

Query: 769  PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
            P  +  F+     S ++ + DT+   A                 Q        + + ++ 
Sbjct: 787  PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833

Query: 829  MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
            +         P L   L  G +  Y+A +    +N    D P +TS  +    ++A    
Sbjct: 834  LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
              R            +  +      +T  +    + G F +G RP W +   R  ++V+P
Sbjct: 892  IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951

Query: 948  QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQ 1007
                  + AFT          F+  T  G + +  +P        +     +P+++ P  
Sbjct: 952  S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPD-------FQFDGPLPMRSIPRG 1003

Query: 1008 ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI---DNHNLSSVDLHRTYTVEEY 1064
              Y     L+    S+ V     Q      D++ G+ I   D  N+SS       +V+  
Sbjct: 1004 RAY--SNVLFDPSTSLIVAASSLQSTFTSFDED-GNNIWEPDAPNISSP------SVDCS 1054

Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGED 1123
             + ++ PD     W T       ++E    + +VTL    T+   +  +A+GT   +GED
Sbjct: 1055 ALELIAPDI----WATMDGFEFATNEYINDMTIVTLETAATETGTKDFIAVGTTIDRGED 1110

Query: 1124 VAARGRVLLF 1133
            +A +G   +F
Sbjct: 1111 LAVKGATYIF 1120


>gi|449543656|gb|EMD34631.1| hypothetical protein CERSUDRAFT_116804 [Ceriporiopsis subvermispora
            B]
          Length = 1440

 Score =  162 bits (409), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 262/1189 (22%), Positives = 481/1189 (40%), Gaps = 200/1189 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDGISAASLE- 102
            N+VV  ++++ I+ VR +E     S+   E +RR             V MDG     L  
Sbjct: 49   NVVVARSSLLRIFEVR-EEPAPISSQKEDERERRASVRKGTEAVEGEVEMDGSGEGFLNM 107

Query: 103  ---------------------LVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFE 138
                                 L+  +RLHG    +E + I++        R D ++++F+
Sbjct: 108  GSVKSTAQNGSVQPPTINRFYLIREHRLHGIVTGIEGVRIVTS----LEDRLDRLLVSFK 163

Query: 139  DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
            DAKI++LE+ D++H L   S+H +E +P+ + L    +S    P ++ DP  RC  +L+ 
Sbjct: 164  DAKIALLEWSDAVHDLVTVSIHTYERAPQLMAL----DSSLFRPTLRADPLSRCAALLLP 219

Query: 198  GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
               + IL   Q  + L V ++DT             S +++L    D  +++V DF+F+ 
Sbjct: 220  RDSIAILPFYQSQAELDVVEQDT---SQLRDVPYSPSFIVDLSAEVDDRIRNVIDFVFLP 276

Query: 254  GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
            G+  P + +L +++ TW GR+     T  +   ++    + +P+I S   LP+D + + A
Sbjct: 277  GFNNPTIAVLFQKQQTWTGRLREYKDTVSLYIFTLDLVTRNYPVITSTEGLPYDCFAVAA 336

Query: 314  VPSPIGGVLVVGANTIHYHSQSA-SCALALNNY-------AVSLDSSQELPRSSFSVELD 365
              + +GGV+++ +N I Y  QS+   AL +N +        V   S+QE  R    + L+
Sbjct: 337  CSTALGGVVILASNAIIYVDQSSRRVALPVNGWPPRVSDMPVQALSAQEQLR---DLRLE 393

Query: 366  AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGN 420
             +H  ++ +    +  K G +  + +V DG+ V +L +S      T P+V    +  +  
Sbjct: 394  GSHFVFVDDRTLFIILKDGTVYPVELVLDGKSVSKLTMSSAVARTTIPTV----VRRVQT 449

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
               F+GS +G S+L++      T+ +   + +E  D+E     T     +  D+   M  
Sbjct: 450  DHLFIGSTVGPSVLLK------TARVEEDIADE--DVEMSVAPT-----AVVDSTDTMDL 496

Query: 481  GEELSLYGSASNNTE------------SAQKT-FSFAVRDSLVNIGPLKDFSYGLRINAD 527
             +E  LYGS    T             S ++T    ++ DSL   GP+ D ++ L  N D
Sbjct: 497  DDEDDLYGSTKETTHRVDGLVNGAADASKKRTVVHLSLCDSLPAHGPIADMTFALAKNGD 556

Query: 528  ------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRM 568
                   +ATG      + L +  LP           G +G+W++  + +   N  +   
Sbjct: 557  RAVPELVAATGSGTLGGFTLFQRDLPTRVKRKLHAIGGGRGMWSLPVRQAVKVNGSTYEK 616

Query: 569  AAYDDEYHAY---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
             A  + +H+    +IIS +A     +   A        ++   + G TI A   F    +
Sbjct: 617  PA--NPFHSVNDSVIISTDANPSPGLSRIASRNQNGDITITTRIPGTTIGAAPFFQGTAI 674

Query: 624  IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
            + V      ++    +  D S      +         + + SI DP+VL+   D +I L 
Sbjct: 675  LHVMYNVTNVI--RVLEPDGSERQIIKDVDGNVARPKIRACSICDPFVLIIREDDTIGLF 732

Query: 684  VGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGP-EPWLRKTSTDAWLSTGVGEAIDG 741
            +G+P    +  +  + + + + + ++ C      G  +  L   +     +T   ++   
Sbjct: 733  IGEPERGKIRRKDMSPMGDKTSRYLTGCFFTDTTGTFQTHLNPLAAGTEAATSTLQS--A 790

Query: 742  ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
             + G   Q   + ++C   G LEI+ +      F+     S  + +VDTY    L     
Sbjct: 791  INAGSRSQ---WLILCRPQGTLEIWTLSKLTLAFSTTLIPSLESVVVDTYDVPHL----- 842

Query: 802  EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
               S  ++   + ++ +I  + V  L          RP+L   L  G +  Y+      P
Sbjct: 843  ---SLPQDPPRKPQELDIEQIVVAPLG-----ESSPRPYLTVFLRSGQLAVYETIPVAPP 894

Query: 862  ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
                   DP+  SRS ++          +RF +    A+  ++         +   K IS
Sbjct: 895  A------DPLPNSRSCTIL---------VRFRKVLSKAFDIQQQNEEVEKSVLAEQKRIS 939

Query: 922  ------------GH--QGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIV-AFTVLHNVNC 965
                        G    G F +G RPCW +   +  ++V P     S+V AFT       
Sbjct: 940  RLLIPFVTSPNPGQTLSGVFFTGDRPCWILSTDKGGVKVFPS--GHSVVHAFTASSVWES 997

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
               F+  + +G   +  +P G   D + P  + +P       + Y    +L      V  
Sbjct: 998  KSDFLLYSEEGPSLLEWIP-GVQLDGHLP-SRTVPRNKAYSNVVYDPSTSLI-----VAA 1050

Query: 1026 LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIP 1085
                ++  S   D  +  + D  N+ S+    T T+E     +L PD     W T     
Sbjct: 1051 SSSQSRFASYDEDGNIVWEPDASNI-SLPFCETSTLE-----LLSPDG----WVTLDGYE 1100

Query: 1086 MQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLF 1133
               +E    +  VTL  ++T+   +  + +GT   +GED+A +G   +F
Sbjct: 1101 FAPNEFVNCLDCVTLETSSTESGTKDYIVVGTTINRGEDLAVKGAAYVF 1149


>gi|336388105|gb|EGO29249.1| hypothetical protein SERLADRAFT_445076 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 1424

 Score =  161 bits (407), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 204/968 (21%), Positives = 383/968 (39%), Gaps = 149/968 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
           N+VV  +NV+ I+ VR +E     ++   E  RR             V MDG        
Sbjct: 47  NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105

Query: 96  ----------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKI 142
                      + +    V  + LHG V   E++ I+S     N    D ++++F+DAKI
Sbjct: 106 GTVKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFKDAKI 161

Query: 143 SVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQM 201
           ++LE+ D IH L   S+H +E +P+ + L    +S      ++VDP  RC  + +    +
Sbjct: 162 ALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLPKDAI 217

Query: 202 IILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIE 257
            IL   Q  + L V ++D              S +++L    D +++HV DF+F+ G+  
Sbjct: 218 AILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLPGFNN 274

Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
           P + +L + E TW+GR+     T  +   ++      +P+I +   LP D   L+   + 
Sbjct: 275 PTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVPCVAS 334

Query: 318 IGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAAHATW 371
           +GGV+++ +NTI Y    S   AL +N ++ S  S   +P      +S ++ L+  HA  
Sbjct: 335 LGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGCHAVL 393

Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLG 430
           + +    +  K G +  + +V DG+ V +L ++     + + S +  I     FLGS +G
Sbjct: 394 VDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLGSIVG 453

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
            S+L++             L      ++A           +  ++  + +         +
Sbjct: 454 ASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH---------S 504

Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG------------ 532
           +N     +     ++ DSL   GP+ D ++ L    D       +ATG            
Sbjct: 505 NNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFTLFQR 564

Query: 533 -ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM-- 588
            +  ++  +L  + G +GIW+    +  R +     R     +  +  +IIS +A     
Sbjct: 565 DLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDANPSPG 624

Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLSFGP 647
           V   A   ++   ++   + G TI AG+ F R  ++ V     R+L+ G  + +DL    
Sbjct: 625 VSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDLD--- 681

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIESSKK 705
                        + + SI DP+VL+   D +I L +G+     +  +  +P   +SS+ 
Sbjct: 682 ------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDKSSRY 735

Query: 706 PV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
                   +SC         P       D  +++ +   ++       +    + ++   
Sbjct: 736 LAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLILVRP 789

Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
            G +EI+ +P     F+          + D+Y   AL   +              RK N 
Sbjct: 790 QGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPRKSN- 837

Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
             + V ++ +         P+L   L  G I+ Y+A     P +            S+  
Sbjct: 838 -DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------SIPP 884

Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLS 929
           S VS  +++ ++ +    +    EET      ++  I +            S   G F +
Sbjct: 885 SRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSGVFFT 944

Query: 930 GSRPCWCM 937
           G RP W +
Sbjct: 945 GDRPSWIV 952


>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1393

 Score =  160 bits (406), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 250/1101 (22%), Positives = 432/1101 (39%), Gaps = 182/1101 (16%)

Query: 131  DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
            ++I+LAF +AK++++E+D   +G+   S+H +E  +        +  + G ++ VDP  R
Sbjct: 88   EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147

Query: 191  CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
            C  V  +G++ + IL   Q G  LV D+      +  GS G                 A 
Sbjct: 148  CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206

Query: 230  IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              SS V+ L  LD  + H     F++ Y EP   IL+ +  T    +  +      +  +
Sbjct: 207  YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
            +    +    + S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++
Sbjct: 267  LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326

Query: 347  VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
              + S     +S  ++ L+      L   N   LL   TG++VL+    DGR V  + + 
Sbjct: 327  RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386

Query: 405  KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
               P           S    +G+   FLGS   DS+L+      G S+ SSG K+   + 
Sbjct: 387  PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440

Query: 455  GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
               E D+       +S  D  +D +     E+ + G   +        ++F   D L+NI
Sbjct: 441  RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500

Query: 513  GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
            GPLKD ++G    +          S  ELV                            +L
Sbjct: 501  GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560

Query: 545  PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
              C  +WT    +S  H  ++   +   A + E H Y+++S +A +   E +++     +
Sbjct: 561  ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613

Query: 602  SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
             +  F        +  TI  G L G+ RV+Q+     R  DG     DL      P   E
Sbjct: 614  ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
                SE    +S S+ DPYV +   D ++ LL  D S     V+    I +SK   +SC 
Sbjct: 669  --DTSEERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724

Query: 712  LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
            LY DK                TG+  +I  A    L Q  +   +  +   L I+ +P+ 
Sbjct: 725  LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767

Query: 772  NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
              +      + G   +      E  K S T              +E +  + V +L    
Sbjct: 768  KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808

Query: 832  WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
            WS   S P+L        +  Y+ ++      T    +P +    L  +N+   R+    
Sbjct: 809  WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857

Query: 892  FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
             +    D  + EE     P   + I  NISG    F  G  P + +           L  
Sbjct: 858  -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGI------LKICQLPSGSTYDNYWP--VQKVIPLKA 1003
            G   + +      C  GFI + S+ +      L  C L   +   +Y+P  +Q+ IP+  
Sbjct: 914  GYAQSLSKFQTSECGEGFILLDSKVLCFILLCLTYCILSFHTGCHSYYPWTIQQ-IPIGE 972

Query: 1004 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE 1063
                + Y +   +Y +  S            L  D E+  +  N   S         V+ 
Sbjct: 973  QVDHLAYSSSSGMYVIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQR 1021

Query: 1064 YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGE 1122
              ++++ P      W    T+    +E+ + V+ ++L  +  T E + ++ +GTA+ +GE
Sbjct: 1022 SSLKVVSPKT----W----TVIDSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGE 1073

Query: 1123 DVAARGRVLLFSTGRNADNPQ 1143
            D+A+RG V +F   +   +P+
Sbjct: 1074 DIASRGCVYVFEVIKVVPDPK 1094


>gi|19112233|ref|NP_595441.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe
            972h-]
 gi|74582544|sp|O74733.1|CFT1_SCHPO RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|3738146|emb|CAA21247.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe]
          Length = 1441

 Score =  160 bits (405), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 268/1202 (22%), Positives = 469/1202 (39%), Gaps = 214/1202 (17%)

Query: 57   NLVVTAANVIEIYVV-RVQEEGS-----------------KESKNSGETKRRVL-MDGIS 97
            NLVV+  N + ++ + ++Q++ S                  ES+   ET   ++  +  +
Sbjct: 29   NLVVSKVNSLHLFEIEKIQKDESSFPLDDSLQNEFSTSIIDESQAFMETNMHLIRTNEQT 88

Query: 98   AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
               L LV   ++ G +  ++ L   G++     D +I+  + AK+S LE+D         
Sbjct: 89   TYVLRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTN 145

Query: 158  SMHCFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
            S+H +E      +K      +  P  + VDP   C  +L +   M+ +        L  +
Sbjct: 146  SLHYYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDME 199

Query: 217  EDTF-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
            E     S    S   + S V+    LD  +  + D  F++GY EP + IL+  E T    
Sbjct: 200  EAAIENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVT 259

Query: 274  VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-H 332
            +  +  T + S +++    +   +I +  +LP+D Y  +++P+P+GG L++G N + Y  
Sbjct: 260  LPLRKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVD 319

Query: 333  SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDLV 387
            S   +  + +N+Y           +S F++EL+   A  L     +    +L   +G   
Sbjct: 320  SAGRTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFF 379

Query: 388  LLTVVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCG 440
             L  + DG+ V+ L L     + N   L S IT     G +L FLGS+  DS L++++  
Sbjct: 380  YLDFLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWSRR 439

Query: 441  SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
            +          EE    E D      L  ++   + DM++  E      +          
Sbjct: 440  TT--------NEEVRLDEGD----DTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGP 487

Query: 501  FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY---ELV--------------- 542
                + D L NIGP+ DF+ G      A +     Q N+   ELV               
Sbjct: 488  LRLEICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAGADGAGGLVVFR 542

Query: 543  -----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVL 590
                       +  GC+ +WTV   S +  N  S   A Y + E   YL++S E  + + 
Sbjct: 543  RNIFPLIAGEFQFDGCEALWTV-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIF 601

Query: 591  ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSN 649
               +   EV  S D+    +T+  G+L    R++Q+     R+ D +  +TQ  +F    
Sbjct: 602  LAGETFDEVQHS-DFSKDSKTLNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNF---- 656

Query: 650  SESGSGSENSTVLSVSIADPYVLLGMSDGSI----------RLLVGDPSTCTVSVQTPAA 699
                  S+   V+S SI DP +++    G I          RL+  D       V+T A+
Sbjct: 657  ------SKKQIVVSTSICDPCIIVVFLGGGIALYKMDLKSQRLIKTDLQNRLSDVKT-AS 709

Query: 700  IESSKKPVSSCTLY----------------HDKGPEPWL-----RKTSTDAWLSTGVGEA 738
            + S         L+                +D   E  L      KTS +  +  G  ++
Sbjct: 710  LVSPDSSALFAKLFTYNETLNAKGQIANGMNDSASETDLDIQPNHKTSNNDQM--GYDQS 767

Query: 739  IDGADGGP--------------LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKF 780
            +  AD  P              LDQ  +          + G L+++++ +F+ +   D F
Sbjct: 768  V-SADDVPEVDNTIVTEKNVSNLDQESLEKHPILFALTDEGKLKVYNLADFSLLMECDVF 826

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                T      +   ++   T  N  S             S ++VEL +         P 
Sbjct: 827  DLPPT------LFNGMESERTYFNKES-------------SQELVELLVADLGDDFKEPH 867

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            LF       I  Y+A+L+    NT K  + ++ ++   V   + +R        TP DA 
Sbjct: 868  LFLRSRLNEITVYKAFLYS---NTDKHKNLLAFAK---VPQETMTREFQANVG-TPRDAE 920

Query: 901  TREETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAF 957
            +  E    +     ++T  + +  H   F++G +P   +       +  P   +  I++ 
Sbjct: 921  STMEKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSV 980

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
               H  +   G+IYV     ++IC+      YDN WP +KV  L    + I Y   K +Y
Sbjct: 981  APFHAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKKV-SLGKQINGIAYHPTKMVY 1039

Query: 1018 PLIVSVPVLKPL-----NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1072
             +  +VP+   +     N+  ++  D +    +   N  S+DL    T            
Sbjct: 1040 AVGSAVPIEFKVTDEDGNEPYAITDDNDY---LPMANTGSLDLVSPLT------------ 1084

Query: 1073 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1131
                 W    +   Q  E  L+V +V L  + TTK  +  +A+GT+  +GED+A RG   
Sbjct: 1085 -----WTVIDSYEFQQFEIPLSVALVNLEVSETTKLRKPYIAVGTSITKGEDIAVRGSTY 1139

Query: 1132 LF 1133
            LF
Sbjct: 1140 LF 1141


>gi|226290902|gb|EEH46330.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides brasiliensis Pb18]
          Length = 1343

 Score =  160 bits (404), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 264/1174 (22%), Positives = 457/1174 (38%), Gaps = 197/1174 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V    ++++Y +     GS   ++  +T+ +        + L LV  Y L G V  L
Sbjct: 28   NLIVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGTVTDL 80

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +     D+    ++I++A  +AK+S++E+D   H +  TS+H +E  + +H+     +
Sbjct: 81   GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137

Query: 177  FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GDEDTFGSGGGFSARIESS 233
             A  P  + VDP  RC  VL +G + + IL   Q G  LV GD   F S      +I+++
Sbjct: 138  LAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGD---FDSDHDEERQIDTN 193

Query: 234  HVINLRD--------------------------LDMKHVKDFIFVHGYIEPVMVILHERE 267
            H    RD                            M H     F++ Y EP   IL+ + 
Sbjct: 194  HTAEERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQV 253

Query: 268  LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
               +  +  +      S  ++    +    + S   LP+D +K++ +P P+GG L+VG+N
Sbjct: 254  AASSALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSN 313

Query: 328  T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
              +H      + A+ +N +A    S     +S   + L+      L  +N   LL    G
Sbjct: 314  ELVHVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNG 373

Query: 385  DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
             + +++   DGR V  + L            +T PS        +G    F GS  GDS+
Sbjct: 374  VMAVVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEEGDSM 429

Query: 434  LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
            L+ ++  S  + +     E   D  A+    +       DA +D +    ++  G  S N
Sbjct: 430  LIGWSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGINSRN 487

Query: 494  TESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELVELPG 546
            T S   T    + F + D L N+GP++D + G      + D   +  S  +  ELV   G
Sbjct: 488  TTSVNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELVTTQG 547

Query: 547  --------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
                                        G+ +V+ K  +      S        Y  YL+
Sbjct: 548  YGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYDHYLL 607

Query: 581  ISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARIL 634
            +S      + +++V + +    E T + ++   + RTI  G L G  RV+QV +   R  
Sbjct: 608  LSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGEVRSY 667

Query: 635  DGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            D + +   L      P   E    SE  +V+  S ADPYVL+   D SI LL  D S   
Sbjct: 668  DSANLHLGLGLAQIYPVWDE--DTSEERSVVHASFADPYVLIIRDDSSILLLQADESGDL 725

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ-- 749
              ++T   IES+     S +LY DK            ++LS          +G P  +  
Sbjct: 726  DEIETDGIIESTT--WISGSLYQDK----------YRSFLSY---------EGTPNRKPS 764

Query: 750  GDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM---REALKDSETEINSS 806
             ++   +      L IF +PN        + V     I+ T +   R   ++  TEI   
Sbjct: 765  DNVLLFLLNSESKLYIFHLPNAKEPVYTAESVDLLPQILPTELPPRRTTYRECLTEI--- 821

Query: 807  SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
                                L      +    P+L        ++ Y+ Y          
Sbjct: 822  --------------------LVADLGDSVSRTPYLILRSNSNELILYEPY-----HTVQS 856

Query: 867  SDDPVSTSRSLSVSN------VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
            ++  +S  R L ++N      +  S L NL  S   L    R     G  C   T+F   
Sbjct: 857  TEKRLSDLRFLKIANHHFPKFLPESNLGNLSDSDRQL---ARPLRALGDVCGYRTVF--- 910

Query: 921  SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
                   + G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++++
Sbjct: 911  -------MPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRM 963

Query: 981  CQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1040
            C+ P  + +D  W  +K I L      + Y +    Y L  S            L  D E
Sbjct: 964  CRFPRNTHFDGSWAARK-IGLGEQVDSVEYSSSSETYVLGTS------QKADFKLPEDDE 1016

Query: 1041 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100
            +  +  N  +S         +++  V++L P      W    +  ++++E  + V+ + L
Sbjct: 1017 IHPEWRNEVISFFP-----QIDKGSVKLLNPRT----WSIIDSYQLRTAERVMCVKCLNL 1067

Query: 1101 -FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
              +  T E + ++A+GTA  +GED+AARG + +F
Sbjct: 1068 EASEITHERKEMIAVGTALTRGEDIAARGCIYVF 1101


>gi|336375160|gb|EGO03496.1| hypothetical protein SERLA73DRAFT_165174 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 1428

 Score =  159 bits (403), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 204/972 (20%), Positives = 383/972 (39%), Gaps = 153/972 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
           N+VV  +NV+ I+ VR +E     ++   E  RR             V MDG        
Sbjct: 47  NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105

Query: 96  --------------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFE 138
                          + +    V  + LHG V   E++ I+S     N    D ++++F+
Sbjct: 106 GTVKSTGKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFK 161

Query: 139 DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
           DAKI++LE+ D IH L   S+H +E +P+ + L    +S      ++VDP  RC  + + 
Sbjct: 162 DAKIALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLP 217

Query: 198 GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
              + IL   Q  + L V ++D              S +++L    D +++HV DF+F+ 
Sbjct: 218 KDAIAILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLP 274

Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
           G+  P + +L + E TW+GR+     T  +   ++      +P+I +   LP D   L+ 
Sbjct: 275 GFNNPTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVP 334

Query: 314 VPSPIGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAA 367
             + +GGV+++ +NTI Y    S   AL +N ++ S  S   +P      +S ++ L+  
Sbjct: 335 CVASLGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGC 393

Query: 368 HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLG 426
           HA  + +    +  K G +  + +V DG+ V +L ++     + + S +  I     FLG
Sbjct: 394 HAVLVDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLG 453

Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
           S +G S+L++             L      ++A           +  ++  + +      
Sbjct: 454 SIVGASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH------ 507

Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------- 532
              ++N     +     ++ DSL   GP+ D ++ L    D       +ATG        
Sbjct: 508 ---SNNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFT 564

Query: 533 -----ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR 586
                +  ++  +L  + G +GIW+    +  R +     R     +  +  +IIS +A 
Sbjct: 565 LFQRDLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDAN 624

Query: 587 TM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDL 643
               V   A   ++   ++   + G TI AG+ F R  ++ V     R+L+ G  + +DL
Sbjct: 625 PSPGVSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDL 684

Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIE 701
                            + + SI DP+VL+   D +I L +G+     +  +  +P   +
Sbjct: 685 D---------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDK 735

Query: 702 SSKKPV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSV 755
           SS+         +SC         P       D  +++ +   ++       +    + +
Sbjct: 736 SSRYLAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLI 789

Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
           +    G +EI+ +P     F+          + D+Y   AL   +              R
Sbjct: 790 LVRPQGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPR 838

Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
           K N   + V ++ +         P+L   L  G I+ Y+A     P +            
Sbjct: 839 KSN--DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------ 884

Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQG 925
           S+  S VS  +++ ++ +    +    EET      ++  I +            S   G
Sbjct: 885 SIPPSRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSG 944

Query: 926 FFLSGSRPCWCM 937
            F +G RP W +
Sbjct: 945 VFFTGDRPSWIV 956


>gi|348679545|gb|EGZ19361.1| putative cleavage and polyadenylation specificity factor CPSF
           [Phytophthora sojae]
          Length = 1752

 Score =  159 bits (401), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 162/634 (25%), Positives = 260/634 (41%), Gaps = 155/634 (24%)

Query: 235 VINLRDLD-MKHVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+L+ M  V D  F+ GY+EP +++LHE   + +  GR++    T  I+ +SI+  
Sbjct: 277 LLRLRELEIMGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 336

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
            + HP IW+  NLP D +KL    +P+GGV+V+ AN   Y +Q+    LA N +A     
Sbjct: 337 TRLHPKIWTVKNLPSDCFKLFPCRAPLGGVVVLSANAFLYFNQTQFHGLATNVFASKTVN 396

Query: 350 -------DSSQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
                  D+  E P    +   + L      +L     LL+   GD  +L++ Y+    +
Sbjct: 397 QSVFPLSDAVYETPDHEMAQLHIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDTSSR 456

Query: 400 RL----DLSKTNPSVLTSDITTIG-----------NSLFFLGSRLGDSLLVQFTCGSGTS 444
            L      S +  + L+  +   G               F+GSR GDS+L        TS
Sbjct: 457 GLYGFGGASSSRNASLSLRMLRSGIQAHCLCVNEEKKTLFVGSRSGDSVLYALDQKKLTS 516

Query: 445 MLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA--------SN 492
                 K    EE    E            +  A ++  + ++L LYG+A        S 
Sbjct: 517 AGGEASKQQEDEEMLIKEEVVKEEVTAEVKAEPAEEEEEDEDDLFLYGAAPTKEEPTTSG 576

Query: 493 NTESAQKTFSFAVR----------------------DSLVNIGPLKDFSYGLRINADASA 530
           +TE+   T   AV+                      D L +IG +     G+  NAD   
Sbjct: 577 STEAVNGTNGSAVKKEENGHAVEEESGPYDYVLHQIDVLPSIGQITSIELGIENNAD--- 633

Query: 531 TGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNAD 564
              S +   ELV                          EL GC+ +WTV         + 
Sbjct: 634 ---SNEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSS 690

Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
             R       Y+AYLI+S+  RTMVL T + +  + +   ++  G T+AA NLF ++R++
Sbjct: 691 DGR------SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQRIV 744

Query: 625 QVFERGARIL------------DGS----------------------YMTQDLSFGPSNS 650
           Q+F++GAR++            DG+                        TQ+++      
Sbjct: 745 QIFKQGARVMMEVPDEETSNGNDGAEKTAKPEDEEVDDEDDGPKVKLVCTQEITLEGDVE 804

Query: 651 ESGSGSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP----------- 697
             G   + +TV  +SV + DPY+LL ++DGS+RLL+GD     ++V  P           
Sbjct: 805 CGGMNVDTATVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDMELTVIDPEIDYLDGVTES 864

Query: 698 -AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
               ++SK   SS  L++D     W      +AW
Sbjct: 865 NGTADASKHGSSSACLFYD-----WAGMFRENAW 893



 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 68/280 (24%), Positives = 119/280 (42%), Gaps = 65/280 (23%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNH 967
            +T F N++   G F  G+ P W +  R +    P +C  +      +++FT  H+ NC +
Sbjct: 1159 LTTFYNVNNMSGAFFRGAHPMWILGDRGQPTFIP-MCSAAPKVSVPVLSFTPFHHWNCPN 1217

Query: 968  GFIYVTSQGILKICQLPSGSTYDNYWP-----VQKVIPLKATPHQITYFA---------- 1012
            GFIY  S+G L++C+LPS  T     P     V +     AT H + Y            
Sbjct: 1218 GFIYFHSRGALRVCELPSSKT-STILPSSGGFVLQKAEFGATLHHMLYLGNHGPGGVSEA 1276

Query: 1013 -EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS------------------- 1052
             E   Y ++ SV  +KP +   +  + ++   + +  NL +                   
Sbjct: 1277 LEAPTYAVVCSVK-MKPTDAERATEV-EDADEEKEPENLDANGNPVGSNVMAPTAEMFPD 1334

Query: 1053 --VDLHRTYTVEEYEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT---- 1104
              +D       E YE+R+++ +   G W  R    +  +  E  L+V+++ L++++    
Sbjct: 1335 FEIDQMAHTEEEVYELRLVQTNEF-GEWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMKE 1393

Query: 1105 ---------TKENETLLAIGTAYV--QGEDVAARGRVLLF 1133
                      K+    L IGT +V   GED + RGR+LL+
Sbjct: 1394 EVASTSAEWNKKKRPYLVIGTGWVGPHGEDESGRGRLLLY 1433


>gi|310789917|gb|EFQ25450.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1439

 Score =  158 bits (399), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 265/1174 (22%), Positives = 445/1174 (37%), Gaps = 221/1174 (18%)

Query: 69   YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
            Y  R+ ++   ES   G     V  D      L LV  Y + G V  LA +    S+ G 
Sbjct: 66   YDRRLNDDDGLESSFLGGDGMLVRADRAVNTKLVLVAEYPIFGVVTGLARIKIQHSKSGG 125

Query: 125  DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
            +      ++++A   A++S+++++   H L   S+H +E  E       + S   GPL  
Sbjct: 126  E------ALLIATRVARLSLVQWNSEKHALEDISIHYYEKEEL------QGSPFDGPLAN 173

Query: 183  ----VKVDPQGRCGGVLVYGLQMI-ILKASQG--------------GSGLVGDEDTFGSG 223
                +  DP  RC   L +G + I  L   Q               G     +  T  + 
Sbjct: 174  YRTHLAADPGSRCAA-LSFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPPTTAAT 232

Query: 224  GGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
             G S        +S+V+ L  LD  + H     F+H Y EP   I+   +          
Sbjct: 233  NGTSNIADVPYSTSYVLPLPQLDPSLLHPVYLAFLHEYREPTFGIISSTQRRSNTLPRKD 292

Query: 278  HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
            H +  +  L +    +    I S  NLP D +K++A+P P+GG L+VG N  IH      
Sbjct: 293  HFSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVVALPGPVGGALLVGTNELIHIDQSGK 350

Query: 337  SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYD 394
               +A+N +     +     +S   + L+  H   +  +N   L+    G L ++T   D
Sbjct: 351  PNGVAVNAFTKETTNFPLADQSDLDLRLEHCHIELMSAENGELLMVLSDGRLAIITFKID 410

Query: 395  GRVVQRLDLSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
            GR V  + +      V         S I+ +  ++FF+GS   DSL++ +T         
Sbjct: 411  GRTVSGVSVKPVAAEVGGNIVQCSVSTISKLSRNVFFVGSTGSDSLVLGWT--------- 461

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS---------------LYGSASN 492
                         A + +R  R   D+ +  +  E++                +   A+ 
Sbjct: 462  ----------RKQAQNARRKTRLVDDSFEYDLEDEDMDDGDDDDLYGETTTTMIQPGATA 511

Query: 493  NTESAQKTFSFAVRDSLVNIGPLKDFSYGLR-INAD------------------------ 527
            N  S     +F V DSL++I P+KD + G +  N D                        
Sbjct: 512  NGVSKGGDLTFRVHDSLLSIAPVKDMTSGKQAFNPDSEEANNSVGVVADLQLACVVGRGN 571

Query: 528  ASATGISKQSNYELV----ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHAYLII 581
            A A  I  Q+    V    E P  +G WT  V     +    D    AA   E+ A    
Sbjct: 572  AGAVAILNQNIQPKVIGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVGSEFDAS--- 628

Query: 582  SLEARTMVLETADLLTEVTESVDYFV-----------------QGRTIAAGNLFGRRRVI 624
            S+  + M++   DL  +  E+ D +                   G T+ AG +    R+I
Sbjct: 629  SIYDKFMIVSKVDL--DGYETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRII 686

Query: 625  QVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
            QV +   R  DG   ++Q L     + E+G+      V+S SIADPY+LL   D SI + 
Sbjct: 687  QVLKSEVRCYDGDLGLSQILPM--LDEETGA---EPRVVSASIADPYLLLVRDDSSIMVA 741

Query: 684  VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGAD 743
              D +     V+       S K ++ C LY D                +TG    +    
Sbjct: 742  QIDNNCELEEVEKQDDAILSTKWLAGC-LYAD----------------TTGRFAPVQTDK 784

Query: 744  GGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEI 803
            G P  Q +I+  +   +GAL I+ +P+ +    V    +G T++                
Sbjct: 785  GTPEGQ-NIFMFLLSAAGALYIYALPDLSKPVYV---AAGLTYVPPLL----------SA 830

Query: 804  NSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPEN 863
            + +   GT Q   E +  + V +L         + P+L     +  +  Y+    E  + 
Sbjct: 831  DYAVRRGTVQ---ETLTELLVADLG----DTTTTSPYLILRHANDDLTIYEPIRLESQDK 883

Query: 864  TSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNIS 921
            T      V  S++L    ++     N   +++P++    E  E P   P +      NI+
Sbjct: 884  T------VGLSKTLHFQKIT-----NPALAKSPVEVADDEANEQPRFVPLRPC---PNIN 929

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            G+   FL G+ P + +   +       L    +   +  H   C  GFIY  S+G  ++ 
Sbjct: 930  GYSTVFLPGASPSFIIKSSKSSPKVIGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVT 989

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
            QLP+ + +       + IP+      I Y      Y +  SV            L   E+
Sbjct: 990  QLPADTNFTELGVAVRKIPIGDNVGLIAYHPPMETYAVACSV------------LERFEL 1037

Query: 1042 GHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100
                D H   + +   +Y   E   ++++ P      W    T+ ++  E A+ ++ + L
Sbjct: 1038 PKDDDYHKEWAKEATTSYPQTERGIIKLMSPTT----WSVIDTVELEPHEVAMCMKTLHL 1093

Query: 1101 -FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
              +  TKE   L+ IGTA  +GED+  RGR+L++
Sbjct: 1094 EVSEETKERRMLITIGTAINRGEDLPIRGRILVY 1127


>gi|347838999|emb|CCD53571.1| similar to Cleavage and polyadenylation specificity factor subunit 1
            [Botryotinia fuckeliana]
          Length = 1447

 Score =  158 bits (399), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 257/1196 (21%), Positives = 461/1196 (38%), Gaps = 215/1196 (17%)

Query: 57   NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
            NLVV  +++++I+  +        + E+ S  +K+      RV  D G+           
Sbjct: 28   NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87

Query: 98   -------AASLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
                      L LV  Y L G V SL     I S+ G +      +I++ F+DAK+S++E
Sbjct: 88   QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141

Query: 147  FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
            +D    G+   S+H +E  E                + VDP  RC  +      + IL  
Sbjct: 142  WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201

Query: 207  SQGGSGLVGDEDTFGSG--------------GGFSARIESSHVINLRDLDMK-----HVK 247
             Q     + D D    G              G       SS V+ L  LD       H++
Sbjct: 202  KQDEDVNMDDWDEELDGPRPAKISQKAAAEDGQLDTPYGSSFVLRLSSLDPSIIFPIHLE 261

Query: 248  DFIFVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLP 305
               F++ Y EP   IL       +  +  +  H T M+  L +    K    I S   LP
Sbjct: 262  ---FLYEYREPTFGILSSTMAPSSALLQERRDHLTYMVFTLDMHQ--KASTTILSVGGLP 316

Query: 306  HDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
            +D ++++ +  P+GG L+VG N  IH      +  +A+N +A        L ++   + L
Sbjct: 317  YDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTGFSLLDQADLDLRL 376

Query: 365  DAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT---SDI 415
            +      L  +N   L+   +GD+ +L+   DGR V  L + + +     ++LT   S +
Sbjct: 377  EGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGAILTGAASCV 436

Query: 416  TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
            +++G    F+GS + DS+++ +   SG +       +     E D            +  
Sbjct: 437  SSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVDE---AMFDEEDLEDD 493

Query: 476  QDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSYG---LRINADASA 530
             D + G+  ++  + +N T S  KT  ++F + DS+VNI P+ + ++G   L +  D   
Sbjct: 494  DDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAFGEAALSLGKDEEL 553

Query: 531  TGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGHN 562
                 QS  +LV                          +LP  +GIWT+  K  + +G  
Sbjct: 554  KSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIWTMSAKRPAPKGLQ 613

Query: 563  ADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLTEVTESVDYF-VQG 609
             +  +         D +Y   +I+S   +A   + E+A     D   E     ++    G
Sbjct: 614  VNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGFEALTGTEFEPAAG 673

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
             TI AG L    RV+Q+ +   R  DG   + Q L     + E+G+      ++S S AD
Sbjct: 674  STIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA---EPKIISASFAD 728

Query: 669  PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
            P++LL   D SI +   D       ++    I  S K ++ C LY D           +D
Sbjct: 729  PFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD------YSGAFSD 781

Query: 729  AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK---FVSGRT 785
            +  S   GE             ++   +    GAL I+ +P+ +    V +   FV    
Sbjct: 782  SK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPVYVAEGICFVPPVL 827

Query: 786  HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
                   + A +++ TEI                       L      +    P+L    
Sbjct: 828  SADYAARKSAARETLTEI-----------------------LVANLGDSVSQSPYLILRP 864

Query: 846  TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET 905
            ++  +  Y+ +  +            S S  L  S +   +++N   ++ P    + EE 
Sbjct: 865  SNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHLTQAP--DVSAEEQ 910

Query: 906  PHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV 959
              GA      P + I+   N+ G+   F+ G  P + +   +       L    + + + 
Sbjct: 911  VDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKVLSLQGTGVRSLSS 967

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPL 1019
             H   C+ GFIY +++GI ++ Q P  +T+ +     + I +    H + Y      Y +
Sbjct: 968  FHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDVHAVAYHPPLQTYVI 1027

Query: 1020 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL-SSVDLHRTYTVEEYEVRILEPDRAGGPW 1078
              S               D E+    D+       ++    ++E+  ++++ P      W
Sbjct: 1028 GTST------------FTDFELPKDDDHRKTWQEENIALKPSIEKSFLKLVSPVN----W 1071

Query: 1079 QTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
                 I ++  E    ++ + L  +  T E + L+ +GTA  +GED+A  GR+ ++
Sbjct: 1072 SVIDAIELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDLATTGRLYVY 1127


>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
          Length = 1377

 Score =  157 bits (397), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 242/1120 (21%), Positives = 443/1120 (39%), Gaps = 173/1120 (15%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV  Y L G +  L  +     D+    +++++A  +AK+S++E+D   H +  TS+H
Sbjct: 65   LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122

Query: 161  CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
             +E  + +++     + A  P  + VDP  RC  VL +G + + IL   Q G  LV    
Sbjct: 123  YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180

Query: 215  -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                              DE    +G  F     SS V+ +  L+  M H     F++ Y
Sbjct: 181  DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240

Query: 256  IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
             EP   IL+ +  T +  +  +      S  ++    +    + S   LP+D +K++ +P
Sbjct: 241  REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300

Query: 316  SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
             P+GG L++G+N  +H      + A+ +N +A    S     +S   + L+ +    L  
Sbjct: 301  PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360

Query: 373  QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
            +N   LL    G + +L+   DGR V  + L     +   S+L +  +    +     F 
Sbjct: 361  ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
            GS  GDS+L+ ++  S  +      +   G+I   +           D            
Sbjct: 421  GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480

Query: 478  MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
            M  G    + +S+ G+  N+       + F + D L N+GP++D + G      + D   
Sbjct: 481  MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533

Query: 531  TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
               S  +N ELV   G                            G  +VY K  +  +  
Sbjct: 534  PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593

Query: 565  SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
             S        Y  YL++S      + +++V   +    E T++ ++   + RTI  G L 
Sbjct: 594  GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653

Query: 619  GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
               RV+QV +   R  D G  + Q       +      SE  +V+  S ADPYVL+   D
Sbjct: 654  SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708

Query: 678  GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
             SI LL  D S      +T   I S+     S +LY DK                     
Sbjct: 709  QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDK-------------------YR 747

Query: 738  AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
            + +  +G P + Q D +   +      L +F +PN    VFT +                
Sbjct: 748  SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793

Query: 795  ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
               D   +I S+         +E I  + V +L      +    P+L    ++  +  Y+
Sbjct: 794  ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
             Y +     TS ++   S  R + ++N    +      S + ++ +    T    P +  
Sbjct: 847  PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895

Query: 915  TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             +  ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +
Sbjct: 896  -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
              ++++C+ P  + +D  W  +K I L      + Y +    Y +  +  V        +
Sbjct: 955  DNVVRMCRFPRNTHFDGSWAARK-IGLGEQVDAVEYSSSSETYVIGTNQKV------DFN 1007

Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
            L  D E+  +  N  +S +       +++  V++L P      W    +  ++++E  + 
Sbjct: 1008 LPEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMC 1058

Query: 1095 VRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            V+ + L  +  T E +  + +GTA  +GED+AARG + +F
Sbjct: 1059 VKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIF 1098


>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
          Length = 1408

 Score =  157 bits (397), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 242/1120 (21%), Positives = 443/1120 (39%), Gaps = 173/1120 (15%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV  Y L G +  L  +     D+    +++++A  +AK+S++E+D   H +  TS+H
Sbjct: 65   LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122

Query: 161  CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
             +E  + +++     + A  P  + VDP  RC  VL +G + + IL   Q G  LV    
Sbjct: 123  YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180

Query: 215  -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                              DE    +G  F     SS V+ +  L+  M H     F++ Y
Sbjct: 181  DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240

Query: 256  IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
             EP   IL+ +  T +  +  +      S  ++    +    + S   LP+D +K++ +P
Sbjct: 241  REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300

Query: 316  SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
             P+GG L++G+N  +H      + A+ +N +A    S     +S   + L+ +    L  
Sbjct: 301  PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360

Query: 373  QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
            +N   LL    G + +L+   DGR V  + L     +   S+L +  +    +     F 
Sbjct: 361  ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
            GS  GDS+L+ ++  S  +      +   G+I   +           D            
Sbjct: 421  GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480

Query: 478  MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
            M  G    + +S+ G+  N+       + F + D L N+GP++D + G      + D   
Sbjct: 481  MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533

Query: 531  TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
               S  +N ELV   G                            G  +VY K  +  +  
Sbjct: 534  PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593

Query: 565  SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
             S        Y  YL++S      + +++V   +    E T++ ++   + RTI  G L 
Sbjct: 594  GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653

Query: 619  GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
               RV+QV +   R  D G  + Q       +      SE  +V+  S ADPYVL+   D
Sbjct: 654  SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708

Query: 678  GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
             SI LL  D S      +T   I S+     S +LY DK                     
Sbjct: 709  QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDK-------------------YR 747

Query: 738  AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
            + +  +G P + Q D +   +      L +F +PN    VFT +                
Sbjct: 748  SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793

Query: 795  ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
               D   +I S+         +E I  + V +L      +    P+L    ++  +  Y+
Sbjct: 794  ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
             Y +     TS ++   S  R + ++N    +      S + ++ +    T    P   +
Sbjct: 847  PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKP---L 894

Query: 915  TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             +  ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +
Sbjct: 895  RVLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
              ++++C+ P  + +D  W  +K I L      + Y +    Y +  +  V        +
Sbjct: 955  DNVVRMCRFPRNTHFDGSWAARK-IGLGEQVDAVEYSSSSETYVIGTNQKV------DFN 1007

Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
            L  D E+  +  N  +S +       +++  V++L P      W    +  ++++E  + 
Sbjct: 1008 LPEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMC 1058

Query: 1095 VRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            V+ + L  +  T E +  + +GTA  +GED+AARG + +F
Sbjct: 1059 VKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIF 1098


>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 1408

 Score =  157 bits (397), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 242/1120 (21%), Positives = 444/1120 (39%), Gaps = 173/1120 (15%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV  Y L G +  L  +     D+    +++++A  +AK+S++E+D   H +  TS+H
Sbjct: 65   LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQICTTSIH 122

Query: 161  CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
             +E  + +++     + A  P  + VDP  RC  VL +G + + IL   Q G  LV    
Sbjct: 123  YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180

Query: 215  -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                              DE    +G  F     SS V+ +  L+  M H     F++ Y
Sbjct: 181  DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240

Query: 256  IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
             EP   IL+ +  T +  +  +      S  ++    +    + S   LP+D +K++ +P
Sbjct: 241  REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300

Query: 316  SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
             P+GG L++G+N  +H      + A+ +N +A    S     +S   + L+ +    L  
Sbjct: 301  PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360

Query: 373  QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFL 425
            +N   LL    G + +L+   DGR V  + L     +   S+L +  +    +     F 
Sbjct: 361  ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD-------- 477
            GS  GDS+L+ ++  S  +      +   G+I   +           D            
Sbjct: 421  GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480

Query: 478  MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
            M  G    + +S+ G+  N+       + F + D L N+GP++D + G      + D   
Sbjct: 481  MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533

Query: 531  TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
               S  +N ELV   G                            G  +VY K  +  +  
Sbjct: 534  PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593

Query: 565  SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
             S        Y  YL++S      + +++V   +    E T++ ++   + RTI  G L 
Sbjct: 594  GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653

Query: 619  GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
               RV+QV +   R  D G  + Q       +      SE  +V+  S ADPYVL+   D
Sbjct: 654  SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708

Query: 678  GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
             SI LL  D S      +T   I S+     S +LY DK                     
Sbjct: 709  QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDK-------------------YR 747

Query: 738  AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
            + +  +G P + Q D +   +      L +F +PN    VFT +                
Sbjct: 748  SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793

Query: 795  ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
               D   +I S+         +E I  + V +L      +    P+L    ++  ++ Y+
Sbjct: 794  ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLILYE 846

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
             Y +     TS ++   S  R + ++N    +      S + ++ +    T    P   +
Sbjct: 847  PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTTLSKP---L 894

Query: 915  TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             +  ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +
Sbjct: 895  RVLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
              ++++C+ P  + +D  W  +K I L      + Y +    Y +  +  V        +
Sbjct: 955  DNVVRMCRFPRNTHFDGSWAARK-IGLGEQVDAVEYSSSSETYVIGTNQKV------DFN 1007

Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
            L  D E+  +  N  +S +       +++  V++L P      W    +  ++++E  + 
Sbjct: 1008 LPEDDEIHPEWRNEVISFLP-----QIDKGSVKLLTPRT----WSIIDSYNLRNAERIMC 1058

Query: 1095 VRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            V+ + L  +  T E +  + +GTA  +GED+AARG + +F
Sbjct: 1059 VKCLNLEVSEITHERKDTIVVGTALTKGEDIAARGCIYIF 1098


>gi|322704830|gb|EFY96421.1| Cleavage factor two protein 1 [Metarhizium anisopliae ARSEF 23]
          Length = 1433

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 265/1142 (23%), Positives = 432/1142 (37%), Gaps = 172/1142 (15%)

Query: 72   RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
            R  ++   ES   G     V  D  +   L L+    L G V  LA +     +     +
Sbjct: 70   RANDDDGLESSFLGGESLIVRADPSNITKLVLITEIPLAGTVIGLARVKV--KNTPSGGE 127

Query: 132  SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
            +++LA++ AK+ + E+    H L  TS+H +E  E   L+      + G  V   + DP 
Sbjct: 128  ALLLAYKAAKMCLTEWHPQRHTLETTSIHYYEKDE---LQGAPWEMSFGDYVNYLEADPG 184

Query: 189  GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
             RC         + IL  +Q    L  D   ED             T G G G      +
Sbjct: 185  SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPRPVKEELPLTNGDGPGDHDLVKS 244

Query: 229  RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
            R   S V+ L  LD  + H     F+H Y EP   IL   +   A      H T  +  L
Sbjct: 245  RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSAALGIKDHLTYKVFTL 304

Query: 287  SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
             +    +    I S   LP D ++++A+P+P+GG L+VG N  IH         +A+N+ 
Sbjct: 305  DLQQ--RASTTILSVTGLPQDLFRVMALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362

Query: 346  AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
            A  + S   + +S   + L+      L ND+   LL    G L ++    DGR V ++ +
Sbjct: 363  AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIVCFHIDGRTVSKISI 422

Query: 404  ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
                ++   +++ S ++ I   G++  FLGS   DS+++ ++   G        K +   
Sbjct: 423  RLVSAEYGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476

Query: 457  IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
            +     +         D   D + G + SL   S + N  S     SF ++DSL++I P+
Sbjct: 477  LLDPDLALDVDDLDLDDDEDDDLYGNDASLAKPSQTINGGSKPGEVSFRIQDSLLSIAPI 536

Query: 516  KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
            +D + G   +  D+    +SK    EL                             + P 
Sbjct: 537  RDVACGAPALVPDSEEATLSKGVTAELELACAVGRGSSGSVAILNREIQPKVIGRFDFPE 596

Query: 547  CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
             +G WT+  K      A  +       +Y  Y+I++       E   +   TA     + 
Sbjct: 597  ARGFWTMCAKKPLSKGAAVASDFDTTGQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656

Query: 601  ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
            ++      G T+ AG +  + R+IQV +   R  DG   ++Q L   P   E       +
Sbjct: 657  DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713

Query: 660  TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
            T  S SI DPY+LL   D SI +     +     V  P     S K  S C LY+D    
Sbjct: 714  T--SASIVDPYLLLIRDDSSIFIAQIHSNNELEEVLKPDGTLKSTKWASGC-LYND---- 766

Query: 720  PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD-IYSVVCYESGALEIFDVPNFN-CVFTV 777
                   T       V E          D+ D I   +    GAL ++ +P+ +  VF  
Sbjct: 767  -------TQGIFQNNVNEQ-------QADETDRIMMFLLSSVGALHVYALPDVSRPVFVA 812

Query: 778  DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
            +   S     +  ++  A           + +G     KE+I  + V +L      A   
Sbjct: 813  EALTS-----IPPFLSAAF---------VARKGAS---KESITEILVADLG----DAISQ 851

Query: 838  RPFLFAILTDGTILCYQA--YLFEGPENTSKS---DDPVSTSRSLSVSNVSASRLRNLRF 892
             P+L        +  Y+   Y  EG    S S      V+TS + +   VS         
Sbjct: 852  TPYLIVRHASDDLTIYEPVRYQAEGDAELSASLLFKKCVNTSLAKTAPEVSED------- 904

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG 952
                 DA    E P   P +R     N++G+   FL  + P + +           L   
Sbjct: 905  -----DA----EPPRFVPLRRCA---NVNGYGAVFLPNASPSFVLKSSHSEPRVMGLQGL 952

Query: 953  SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFA 1012
             +   +  H   C+ GFIYV  +GI ++ QLPS +         K I L      I+Y  
Sbjct: 953  GVRGMSTFHTEGCDRGFIYVDMEGIARVTQLPSNANLTELGVSVKKIALDGDVGMISYHH 1012

Query: 1013 EKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1072
                Y  +V    L+           +E   +  N            T+    ++++ P 
Sbjct: 1013 PTGTY--VVGCTKLEQFELPRDDDYHKEWAKETSNF---------PPTMARGILKLINPV 1061

Query: 1073 RAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVL 1131
                 W     + ++  E+  +++ + L  +  TKE + L+A+GTA  +GED+  RGRV 
Sbjct: 1062 T----WTVIHELELEPCESIESMKTLHLEVSEETKERKMLVAVGTALSKGEDLPTRGRVQ 1117

Query: 1132 LF 1133
            +F
Sbjct: 1118 VF 1119


>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
           [Schistosoma mansoni]
          Length = 1825

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 146/586 (24%), Positives = 252/586 (43%), Gaps = 123/586 (20%)

Query: 4   AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
           A +K +  PT + NC    +TH +                           + NLV+T  
Sbjct: 15  AVFKHISPPTAVDNCLYCHLTHPK---------------------------LKNLVITRG 47

Query: 64  NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
             IEIY V+        S  SGET+               V    ++ N+  +  +   G
Sbjct: 48  GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 87

Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
                   S++L+F +AK++V+ F+     LR  S+H +E   + +LK GR +F + P++
Sbjct: 88  DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 140

Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
           ++DP  RC  +LVY   + +L   +    +  + D    +  +   +  R  +  +    
Sbjct: 141 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 200

Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
                     + +V D  F++G+ EP +++L+E   TWAGRVS +  TC I ALS +   
Sbjct: 201 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 260

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
           + +P+IW   +LP D   +++VP PIGGV+V+ AN+I Y  Q+  SC L LN YA    +
Sbjct: 261 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 320

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
               Q++P S   + +D      L     L+ T++G+L LL++  +   + V  L   K 
Sbjct: 321 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 379

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
             +V    +  + +   F+GSR  DS+L++                      + G+  ++
Sbjct: 380 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 439

Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
             S L +    +E D                        + STKR     +D + D    
Sbjct: 440 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 499

Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
            +  ++ LYG +  +  S  +    +SF V D L+N+GP+   + G
Sbjct: 500 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 545



 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 90/404 (22%), Positives = 154/404 (38%), Gaps = 49/404 (12%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            ++ + + +G LEI+ +P+F  ++ V  F      ++D      +   +     ++ +   
Sbjct: 1086 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1142

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                ++I    ++E+ +        RP L  + T   I  ++A L   P+          
Sbjct: 1143 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1190

Query: 873  TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
             S  L        RLR  R     PL A  R  T P     Q   +        F+NI  
Sbjct: 1191 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1249

Query: 923  HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            H+G F+ G  P W       +LRV P   DG + +F  L+   C+ GF+Y T    +++ 
Sbjct: 1250 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1309

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
             LP G +++ +  + K I L   P+ + Y  E   Y  +V +   +P   V  L  +   
Sbjct: 1310 TLPPGYSFNEHLGI-KWITLDPVPYYVQYHVESKTY-AVVGIHS-EPCKSVFRLNAEGNK 1366

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG------PWQTRATIPMQSSENALTV 1095
               +     + V      T++ Y +++  P+          PW     IP    E     
Sbjct: 1367 EEDVLVRPKTCV----LPTLDYYSLQMYAPNLNANHRNKQPPW---LLIPNTLIEFEPWE 1419

Query: 1096 RVVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLF 1133
             V  L        ET       LA+G     GE++  RGR+L+ 
Sbjct: 1420 VVTCLITAQLASEETFHGTKDYLALGANLTYGEEIPVRGRILIL 1463


>gi|224135031|ref|XP_002321966.1| predicted protein [Populus trichocarpa]
 gi|222868962|gb|EEF06093.1| predicted protein [Populus trichocarpa]
          Length = 180

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/152 (55%), Positives = 100/152 (65%), Gaps = 22/152 (14%)

Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
           TG+ EAIDGADGG  DQGDIY V+CYE+GALEIFDVPNFN VF VDKFVSG+TH+VD++M
Sbjct: 4   TGISEAIDGADGGAHDQGDIYRVICYETGALEIFDVPNFNSVFIVDKFVSGKTHLVDSFM 63

Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
            E  +D    +N   EE  G GRKE      +V L              F ILT GTILC
Sbjct: 64  GEPPRDLTKGMN---EEVAGAGRKE------IVLL-------------FFGILTYGTILC 101

Query: 853 YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA 884
           Y A LFEGP+  SK +DPVS   S+  S++SA
Sbjct: 102 YHACLFEGPDGNSKLEDPVSAQNSVGDSSISA 133



 Score = 67.0 bits (162), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 31/50 (62%), Positives = 41/50 (82%)

Query: 1020 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
            I +  V +P+NQVLS + DQE GHQI+N NLSSV++HRT +V+E+EVRIL
Sbjct: 131  ISAFAVQRPVNQVLSSMADQEFGHQIENPNLSSVEIHRTDSVDEFEVRIL 180


>gi|440466842|gb|ELQ36086.1| hypothetical protein OOU_Y34scaffold00669g71 [Magnaporthe oryzae Y34]
 gi|440481991|gb|ELQ62520.1| hypothetical protein OOW_P131scaffold01068g7 [Magnaporthe oryzae
            P131]
          Length = 1475

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 261/1197 (21%), Positives = 455/1197 (38%), Gaps = 207/1197 (17%)

Query: 57   NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
            NLVV  +++++I+  R+   + +G+ +S  +       L D  G+ A+            
Sbjct: 51   NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 110

Query: 100  ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
                 L LV  + L G +  LA +       S            D +++AF+DAK+S++E
Sbjct: 111  RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 170

Query: 147  FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
            +D     L   S+H +E  E       + S    PL      +  DP  RC   L +G +
Sbjct: 171  WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 223

Query: 201  MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
             + +   +   G +G +D                T  + G     +E      S V+ L 
Sbjct: 224  SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 283

Query: 240  DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            +LD  + H     F++ Y EP   IL    +T +  ++ K H    +  ++    K    
Sbjct: 284  NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 341

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
            I S   LP D  +++A+P+P+GG L+VG+N  IH      +  +A+N    S  S     
Sbjct: 342  ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 401

Query: 357  RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNP------ 408
            +S   + L+      L  +         D  L T+V+  DGR V  L +    P      
Sbjct: 402  QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 461

Query: 409  -SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
                 S +T +G +  F GS  GDS++  +          S  K +  D + D       
Sbjct: 462  LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQ---VSKRKPKIQDPDLDLDIDYDD 518

Query: 468  RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
                 D   D+    E +   ++++  E+      F V D +V+I P++D ++G      
Sbjct: 519  LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDRMVSIAPIRDVTFGKPPPPT 578

Query: 523  ---RINADASA----------TGISKQSNYELV------------ELPGCKGIWTV---- 553
               R   D +A           G  K S+  ++            E P  +G+WT+    
Sbjct: 579  DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 638

Query: 554  -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
               K  +  N +    AA +     +Y  Y+I++ E      ET+D+        +    
Sbjct: 639  PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 697

Query: 607  -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
                   G TI AG +    ++IQV +   R  DG   +TQ +     + E+G       
Sbjct: 698  TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 752

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
              S SIADPY+L+   D S  +   +  +    ++    I SS K  + C LY D     
Sbjct: 753  ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 806

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
                       S G   A       P     I   +   +GAL I+ +P+ +        
Sbjct: 807  -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 847

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                      Y+ E L      +++      G  R E I  + V +L    + + H    
Sbjct: 848  -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 895

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            +    ++  +  Y+ Y          ++D  S ++ L +      +L N   ++ P +A 
Sbjct: 896  VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRL-----RKLPNPAVAKAP-EAT 941

Query: 901  TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIVAF 957
              E+ P  +    +    NI+G+   F+ G  P + +   +  +  P+   L    + A 
Sbjct: 942  NSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVRAM 998

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
            +  H   C  GFIY  S G+ ++ Q+P  +++       K +PL      I Y +   +Y
Sbjct: 999  SSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGIAYHSPTGVY 1058

Query: 1018 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1077
             L  S        +   L  D +   +    N+S   +     VE   ++++ P      
Sbjct: 1059 VLTCS------YWEPFELPKDDDYHCEWAKENISFKPM-----VERSVLKVINPIN---- 1103

Query: 1078 WQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            W    T   +  E A+ +R + L  + +T E   L+ +GTA  +GED+  RG + +F
Sbjct: 1104 WSDIWTEEFEQHEVAMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVF 1160


>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
           mansoni]
          Length = 1958

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 147/586 (25%), Positives = 255/586 (43%), Gaps = 106/586 (18%)

Query: 4   AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
           A +K +  PT + NC    + H          +     +D+ L        + NLV+T  
Sbjct: 15  AVFKHISPPTAVDNCLYCHLKH----------ISPPTAVDNCLYCHLTHPKLKNLVITRG 64

Query: 64  NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
             IEIY V+        S  SGET+               V    ++ N+  +  +   G
Sbjct: 65  GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 104

Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
                   S++L+F +AK++V+ F+     LR  S+H +E   + +LK GR +F + P++
Sbjct: 105 DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 157

Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
           ++DP  RC  +LVY   + +L   +    +  + D    +  +   +  R  +  +    
Sbjct: 158 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 217

Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
                     + +V D  F++G+ EP +++L+E   TWAGRVS +  TC I ALS +   
Sbjct: 218 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 277

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
           + +P+IW   +LP D   +++VP PIGGV+V+ AN+I Y  Q+  SC L LN YA    +
Sbjct: 278 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 337

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
               Q++P S   + +D      L     L+ T++G+L LL++  +   + V  L   K 
Sbjct: 338 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 396

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
             +V    +  + +   F+GSR  DS+L++                      + G+  ++
Sbjct: 397 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 456

Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
             S L +    +E D                        + STKR     +D + D    
Sbjct: 457 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 516

Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
            +  ++ LYG +  +  S  +    +SF V D L+N+GP+   + G
Sbjct: 517 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 562



 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 90/404 (22%), Positives = 154/404 (38%), Gaps = 49/404 (12%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            ++ + + +G LEI+ +P+F  ++ V  F      ++D      +   +     ++ +   
Sbjct: 1103 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1159

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                ++I    ++E+ +        RP L  + T   I  ++A L   P+          
Sbjct: 1160 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1207

Query: 873  TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
             S  L        RLR  R     PL A  R  T P     Q   +        F+NI  
Sbjct: 1208 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1266

Query: 923  HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            H+G F+ G  P W       +LRV P   DG + +F  L+   C+ GF+Y T    +++ 
Sbjct: 1267 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1326

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
             LP G +++ +  + K I L   P+ + Y  E   Y  +V +   +P   V  L  +   
Sbjct: 1327 TLPPGYSFNEHLGI-KWITLDPVPYYVQYHVESKTY-AVVGIHS-EPCKSVFRLNAEGNK 1383

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGG------PWQTRATIPMQSSENALTV 1095
               +     + V      T++ Y +++  P+          PW     IP    E     
Sbjct: 1384 EEDVLVRPKTCV----LPTLDYYSLQMYAPNLNANHRNKQPPW---LLIPNTLIEFEPWE 1436

Query: 1096 RVVTLFNTTTKENETL------LAIGTAYVQGEDVAARGRVLLF 1133
             V  L        ET       LA+G     GE++  RGR+L+ 
Sbjct: 1437 VVTCLITAQLASEETFHGTKDYLALGANLTYGEEIPVRGRILIL 1480


>gi|389641257|ref|XP_003718261.1| cft-1 [Magnaporthe oryzae 70-15]
 gi|351640814|gb|EHA48677.1| cft-1 [Magnaporthe oryzae 70-15]
          Length = 1452

 Score =  154 bits (388), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 263/1199 (21%), Positives = 456/1199 (38%), Gaps = 211/1199 (17%)

Query: 57   NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
            NLVV  +++++I+  R+   + +G+ +S  +       L D  G+ A+            
Sbjct: 28   NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 87

Query: 100  ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
                 L LV  + L G +  LA +       S            D +++AF+DAK+S++E
Sbjct: 88   RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 147

Query: 147  FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
            +D     L   S+H +E  E       + S    PL      +  DP  RC   L +G +
Sbjct: 148  WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 200

Query: 201  MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
             + +   +   G +G +D                T  + G     +E      S V+ L 
Sbjct: 201  SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 260

Query: 240  DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            +LD  + H     F++ Y EP   IL    +T +  ++ K H    +  ++    K    
Sbjct: 261  NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 318

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
            I S   LP D  +++A+P+P+GG L+VG+N  IH      +  +A+N    S  S     
Sbjct: 319  ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 378

Query: 357  RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNP------ 408
            +S   + L+      L  +         D  L T+V+  DGR V  L +    P      
Sbjct: 379  QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 438

Query: 409  -SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
                 S +T +G +  F GS  GDS++  +          S  K +  D + D       
Sbjct: 439  LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQ---VSKRKPKIQDPDLDLDIDYDD 495

Query: 468  RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
                 D   D+    E +   ++++  E+      F V D +V+I P++D ++G      
Sbjct: 496  LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDLMVSIAPIRDVTFGKPPPPT 555

Query: 523  ---RINADASA----------TGISKQSNYELV------------ELPGCKGIWTV---- 553
               R   D +A           G  K S+  ++            E P  +G+WT+    
Sbjct: 556  DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 615

Query: 554  -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
               K  +  N +    AA +     +Y  Y+I++ E      ET+D+        +    
Sbjct: 616  PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 674

Query: 607  -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
                   G TI AG +    ++IQV +   R  DG   +TQ +     + E+G       
Sbjct: 675  TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 729

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
              S SIADPY+L+   D S  +   +  +    ++    I SS K  + C LY D     
Sbjct: 730  ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 783

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
                       S G   A       P     I   +   +GAL I+ +P+ +        
Sbjct: 784  -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 824

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                      Y+ E L      +++      G  R E I  + V +L    + + H    
Sbjct: 825  -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 872

Query: 841  LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
               IL  ++  +  Y+ Y          ++D  S ++ L +      +L N   ++ P +
Sbjct: 873  --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRL-----RKLPNPAVAKAP-E 916

Query: 899  AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
            A   E+ P  +    +    NI+G+   F+ G  P + +   +  +  P+   L    + 
Sbjct: 917  ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 973

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
            A +  H   C  GFIY  S G+ ++ Q+P  +++       K +PL      I Y +   
Sbjct: 974  AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSFSELGLSVKKVPLGIDADGIAYHSPTG 1033

Query: 1016 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1075
            +Y L  S        +   L  D +   +    N+S   +     VE   ++++ P    
Sbjct: 1034 VYVLTCS------YWEPFELPKDDDYHCEWAKENISFKPM-----VERSVLKVINPIN-- 1080

Query: 1076 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
              W    T   +  E A+ +R + L  + +T E   L+ +GTA  +GED+  RG + +F
Sbjct: 1081 --WSDIWTEEFEQHEVAMCIRSLNLEVSQSTNERRQLITVGTAMCKGEDLPVRGGIYVF 1137


>gi|358056450|dbj|GAA97624.1| hypothetical protein E5Q_04302 [Mixia osmundae IAM 14324]
          Length = 1305

 Score =  152 bits (385), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 169/673 (25%), Positives = 300/673 (44%), Gaps = 98/673 (14%)

Query: 55  VPNLVVTAANVIEIY-----VVRVQE---EGSKESKNSGETKRRVLMDGISAASLELVCH 106
           V NLVV  +N +++Y      V VQ    +GS  S    +T+            L+L+  
Sbjct: 37  VRNLVVARSNFLQVYEVLEEPVPVQSSVTDGSSASMREDQTR------------LQLLAE 84

Query: 107 YRLHGNVESLAILSQGGADNSRR--RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
           +  HG V  LA LS     ++R+  R  ++++F DAK++V+E+ D +H L   SMH FE 
Sbjct: 85  HVCHGIVTGLARLS---TLDTRQDGRHRLVISFRDAKMTVMEWSDQLHDLAPVSMHSFE- 140

Query: 165 PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGG 224
                L +G +  A   +++VD   RC  +L+    + IL   Q  S L   ED     G
Sbjct: 141 -RLPQLSQG-DLGAFQAVLRVDQASRCVALLLPDNTLGILPFFQDLSEL---ED-MTREG 194

Query: 225 GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCM 282
             S     S  I+L ++   +++V DF F+ G+ EP + IL +R+ TW GR+ +      
Sbjct: 195 LQSLPYAPSLTIDLSEIGPGIRNVVDFAFLPGFSEPTIAILFQRKPTWTGRIDFAKDITS 254

Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALA 341
           +  +++    + +P+I+ A  LP+DA  L   P  +GGV+++ AN+ +H    S    +A
Sbjct: 255 LVMVTLDIGSRNYPVIFEADGLPYDALSLSVCPRELGGVVILCANSLVHIDQSSKMTGIA 314

Query: 342 LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
           +N +  +L  ++   R +  + L+ A   ++   VA+L T+TG+   L +  DGR V  +
Sbjct: 315 VNGWTSTLTDARLDSRPTLRLVLEGAQCAFVGQQVAVLCTRTGETFSLHLEKDGRNVSSM 374

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
           D      + + + I T+G +  F+GS  G S+L+++   SG             DI    
Sbjct: 375 DCRPRAVTCIPACIETVGAAYVFVGSAQGQSVLLRWASQSGAG----------ADILDIT 424

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSA-SNNTESAQ-----KTFSFAVRDSLVNIGPL 515
            S   L +  SDA+ D        LY +A ++N    Q     K     + D+L   G +
Sbjct: 425 ESGTGLVQ--SDAMDD-------DLYATAGAHNGNGHQIAPTGKDVQLELCDTLPGYGTI 475

Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
           +  +       D ++  + + S      +    G+ T++         D     A D  +
Sbjct: 476 RHIAV-----LDHTSASLDEPSLVACTGVQAMAGLTTIHRHVPSVRQVDLDLPTARDIRH 530

Query: 576 HAYLIISLEAR----------TMVLETADLLTEVTESVDYFVQGRT---------IAAGN 616
                + LE R           ++  T    + +  ++D   Q  T         +AAG+
Sbjct: 531 --IWTVGLEQRQKMGRGPITHQIICSTGS--SSMVYTLDQDTQAATLARKSAEVPLAAGS 586

Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
            F R +V++V E   R+            G   +E+  G  ++  + V+++DP+V +  +
Sbjct: 587 FFSRSQVLEVTEDMLRLYSPD--------GQITTEAPHGQADA--IDVTVSDPFVAVLSA 636

Query: 677 DGSIRLLVGDPST 689
             ++ +  GDP+T
Sbjct: 637 ARNVTVFFGDPTT 649


>gi|302924728|ref|XP_003053954.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256734895|gb|EEU48241.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1429

 Score =  152 bits (385), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 256/1141 (22%), Positives = 432/1141 (37%), Gaps = 174/1141 (15%)

Query: 75   EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSII 134
            ++G + S   G     V  D  +   L LV    L G V  LA +      +    ++++
Sbjct: 72   DDGLESSFLGGGESMLVRTDRTNNTKLVLVAELPLTGTVIGLAKIKTKYTKSGG--EALL 129

Query: 135  LAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQG 189
            LA++ AK+ + E+D   + L   S+H +E  E     W   +   + +     ++ DP  
Sbjct: 130  LAYKAAKMCLCEWDPKKNTLETLSIHYYEKDELQGAPW---EVAFDEYVN--FLEADPGS 184

Query: 190  RCGGVLVYGLQMIILKASQGGSGLVGD---EDTFGS---------GGGFSARIESSHV-- 235
            RC         + IL   Q    L  D   ED  G            G S  +E+++   
Sbjct: 185  RCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKESTAVANGDSDTLEAAYTPS 244

Query: 236  ----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
                + L D  + H     F+H Y EP   IL   +          H T  +  L +   
Sbjct: 245  FVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ- 303

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
             +    I S  +LP D YK++A+P+P+GG L++G N  IH      +  +A+N+ A  + 
Sbjct: 304  -RASTTILSVTDLPRDLYKMIALPAPVGGALLIGENEFIHIDQSGKANGVAVNSMARQMT 362

Query: 351  SSQELPRSSFSVELDAA--HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS---K 405
            S     ++  ++ L+       +++N   LL    G L +++   DGR V  + +    +
Sbjct: 363  SFSLSDQADLNLRLEGCIIEQLYIENGELLLILNDGRLGIVSFRIDGRTVSGISIKMIPE 422

Query: 406  TNPSVL----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
             N   L     S  + +G + FF+GS  GDS+++    G    M     ++         
Sbjct: 423  ENGGRLIKSRASTASKLGKNTFFIGSETGDSVVL----GWSRKMSQEKRRKTRLVDADLG 478

Query: 462  PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
                 L     D   D + G E +   + + N        SF + D+L++I P++D + G
Sbjct: 479  LDVDDLDLEDDDDEDDDLYGTETAAKPTQALNGAGKSGELSFRIHDTLLSIAPIRDLTSG 538

Query: 522  -LRINADASATGISKQ--SNYELV--------------------------ELPGCKGIWT 552
                  D+    +SK   S+ +L                           E P  +G WT
Sbjct: 539  KAAFLPDSEEATLSKGVVSDLQLACVVGRGNSGSLAILNRHIQPKIIGRFEFPEARGFWT 598

Query: 553  VYHK----SSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVTES 602
            +  K     S G N           ++  Y+I++       E   +   TA     + E+
Sbjct: 599  MCVKKPVPKSLGGNVTVGNDYETFGQHDKYMIVAKVDLDGYETSDVYALTAAGFETLKET 658

Query: 603  VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTV 661
                  G T+ AG +  + RVIQV +   R  DG   +TQ L     + E+G+      V
Sbjct: 659  EFDPAAGFTVEAGTMGKQMRVIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRV 713

Query: 662  LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 721
            +S SIADPY+LL   D S+ +   D +     V+   +   S K  + C LY D      
Sbjct: 714  ISASIADPYLLLIRDDSSVLIAQIDSNNELEEVEKTDSTLQSTKWHAGC-LYTD------ 766

Query: 722  LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKF 780
                      + GV +   G  G   D   I   +   +GAL ++ +P+ +  V+  +  
Sbjct: 767  ----------TKGVFQPSVGDKGA--DTSKIMMFLLSSTGALHVYALPDLSKPVYVAEGL 814

Query: 781  VSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
                 H+  D  +R  L                   KEN+  + V +L           P
Sbjct: 815  CYVPPHLSADYTLRRGLA------------------KENLRELLVADLG----DTVSQSP 852

Query: 840  FLFAILTDGTILCYQA--YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
            +L        +  Y+   Y  EG E T         S +L+    S + L       +  
Sbjct: 853  YLILRNQTDDLTIYEPLRYQPEGAEPT--------LSATLTFKKTSNAALATSPVETSQE 904

Query: 898  DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
            DA    + P   P +      N++G+   FL G  P + +   + +     L    I   
Sbjct: 905  DAV---QQPRFVPLRTCA---NVNGYSTVFLPGPSPSFILKSSKSIPRVIGLQGLGIRGM 958

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
            +  H   C+ GFIY   +GI ++ QLPS + + +     K +PL +    I Y      Y
Sbjct: 959  STFHTEGCDRGFIYADDEGIARVTQLPSETNFTDLGISVKKVPLDSDVCGIAYHQPTGTY 1018

Query: 1018 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1077
                +       N+   L  D +   +     L+        T+    ++++ P      
Sbjct: 1019 IAGCTT------NEPFELPRDDDYHKEWAKETLTFAP-----TMPRGVLKLISP------ 1061

Query: 1078 WQTRATI----PMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
                 T+     ++S E+   ++ + L  +  TKE   LL +GTA  +GED+  RGRV +
Sbjct: 1062 --VSLTVIHDQELESCESIECMKTLQLEVSEETKERRFLLTVGTALSKGEDLPIRGRVHV 1119

Query: 1133 F 1133
            F
Sbjct: 1120 F 1120


>gi|320591495|gb|EFX03934.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
            clavigera kw1407]
          Length = 1461

 Score =  152 bits (383), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 251/1128 (22%), Positives = 429/1128 (38%), Gaps = 185/1128 (16%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
            S + L LV  + L G V  LA +   G  +    +++++A +DA++S+LE+D   + L  
Sbjct: 100  SISKLVLVAEFPLAGTVTGLARIKIPGTKSGG--EAVLVALKDARLSLLEWDPDQNDLTT 157

Query: 157  TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
             S+H +E  E                +  DP  RC  +      + IL   Q     V  
Sbjct: 158  ISIHYYEQEELQGAPWAAPLSDYANFLVADPGSRCAALKFGARNLAILPFRQADEEDVDM 217

Query: 217  ED-------------------TFGSGGGFS-ARIESSHVINLRDLD--MKHVKDFIFVHG 254
            +D                     G G G        S V+ L +LD  + H     F+H 
Sbjct: 218  DDWDEELDGPRPAKDPSSAAVVSGPGDGIEDTPFAPSFVLRLSNLDTTLLHPVHLAFLHE 277

Query: 255  YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
            Y EP   IL     T A  V  +         ++    K    I S  NLP D ++++ +
Sbjct: 278  YREPTFGILSSSVSTSA--VIGRRDKLSYLVFTLDLQQKASTTILSVANLPQDLFRVVPI 335

Query: 315  PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
            PSPIGG ++VGAN  IH      +  +A+N +     S     +S  ++ L+      L 
Sbjct: 336  PSPIGGAILVGANELIHIDQSGRANGVAVNPFTKQSTSFGLADQSDLALRLEGCTVDVLS 395

Query: 374  NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLS----KTNPSVLTSDITT---IGNSLFF 424
             +    L+    G L +LT+  DGR V  L +     +    V+ S IT    IG  + F
Sbjct: 396  AEAGELLIVLHDGQLAVLTIRVDGRTVSGLSVKMVRREAGGDVIQSGITCLSRIGRQMLF 455

Query: 425  LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVNGEE 483
             GS   DS+++ ++   G +          G D+ AD       R    +   D  + + 
Sbjct: 456  AGSDQADSVVLGWSRKQGQTARRKPRANRAGLDLGADEEYFDDEREEGEELDDDEDDDDL 515

Query: 484  LSLYGSAS------NNTESAQKTFSFAVRDSLVNIGPLKDFSYG-------LRINADAS- 529
                 SA+      N T       SF + D L++I P++D   G       L   +D + 
Sbjct: 516  YGDGPSAAQTLGIDNTTGRGGDDLSFRIHDRLLSIAPIRDMVIGKPALVGELAKRSDQAT 575

Query: 530  ---------ATGISKQSNYELV------------ELPGCKGIWTV-----YHKSSRGHNA 563
                     A G  +     L+            E    + +WTV       ++ +G   
Sbjct: 576  IHSELNLVCAVGSGRAGALALLSREINPDPLGAFEFAEAQALWTVSSSKPIPRTIQGEKG 635

Query: 564  DSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQGRTIAA 614
             ++    Y+    +  Y+I++ E      ET+D+        +           G T+ A
Sbjct: 636  GATVGEDYESPAMHDKYMIVAKEDDDG-FETSDVYAVTASGFETLKGTEFEPAAGFTVQA 694

Query: 615  GNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
            G +   RR+IQV +   R  DG   ++Q L       +  +G+E   VL  SIADPY+LL
Sbjct: 695  GTMGRNRRIIQVLKSEVRCYDGDLGLSQILPM----VDEDTGAE-PRVLFASIADPYLLL 749

Query: 674  GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
               D S+ +   +       ++      +S K V+ C LYHD        KTS  A+L +
Sbjct: 750  IRDDASVLVAEMNKDFELEELERDDGSLASTKWVAGC-LYHDTA--SVFSKTSILAFLLS 806

Query: 734  GVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS--GRTHIVDTY 791
                                      SG   I+ +P+      V + ++   R  + D  
Sbjct: 807  A-------------------------SGTFYIYALPDLKQPVYVAEGLNYVPRLFLPDHT 841

Query: 792  MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTIL 851
            +R  +                   KE +  + V +L      A    P+L     +  + 
Sbjct: 842  VRRGMA------------------KEPLTEILVADLG----DAVSKAPYLIVRHANDDLT 879

Query: 852  CYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGA 909
             YQ               P+ T  SL   + S   L+  N  F+++P+ + + ++     
Sbjct: 880  IYQ---------------PLRTPSSLGSLSESLRFLKVPNPVFAKSPV-SISSDDASSQL 923

Query: 910  PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCN 966
                + + +NI G+   FL GS   + +   +  +  P+   L   ++ + +  H  +  
Sbjct: 924  RAMPLRVCENIGGYSTVFLPGSSASFVL---KSAKSQPRVVSLQGTAVRSLSPFHTESSE 980

Query: 967  HGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVL 1026
              FIYV  +G  ++C +P+G          + + L    + + Y      Y +  S    
Sbjct: 981  RSFIYVDVEGSGRVCSMPAGWNLTELGVCARKVALDTDANALAYHPPTGTYAVGTSA--- 1037

Query: 1027 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
                 + +  + ++  H+ D +  S+    R    E   + ++ P    G W T  T+ M
Sbjct: 1038 -----LEAFELPKDDPHRADWNKESTA--FRPL-AERGRLLLMSP----GSWSTIDTVEM 1085

Query: 1087 QSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            +  E  + V+ + L  +  T E + L+A+GTA  +GED+A RGRV +F
Sbjct: 1086 EPYEVVMCVKTLNLEVSEATNERKQLVAVGTAISRGEDLAIRGRVYVF 1133


>gi|257215708|emb|CAX83006.1| Cleavage and polyadenylation specificity factor subunit 1
           [Schistosoma japonicum]
          Length = 462

 Score =  151 bits (382), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 114/397 (28%), Positives = 199/397 (50%), Gaps = 45/397 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLV+T +  IEIY ++        S  SGET+     + +   S+        + N+  +
Sbjct: 41  NLVITRSGFIEIYNIK--------SSVSGETR----FNWVYGTSV--------YENIADI 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G        S++L+F +AK++V+ F+     LR  S+H +E   + +LK GR +
Sbjct: 81  VSVRFAGDLLD----SLLLSFSEAKVAVMNFNPITFELRTLSLHNYE---FENLKSGRMN 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK--------ASQGGSGLVGDEDTFGSGGGFSA 228
           F + P++++DP  RC  +LVY   + +L         +++     +G  +        +A
Sbjct: 134 FTKLPILRLDPYQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHIGVRNFLLWQQRATA 193

Query: 229 RIESSHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
            + ++    L       + +V D  F+HG+ EP +++L+E   TWAGRVS +  TC I A
Sbjct: 194 PLLATFTTCLSTSTGEKINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVA 253

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNN 344
           LS +   + +P+IW   +LP D   ++ VP PIGGV+++ AN+I Y  Q+  SC+L LN 
Sbjct: 254 LSFNLQKRTNPVIWFQESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNC 313

Query: 345 YA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQ 399
           YA    +    Q++P S   + +D      L     L+ T++G+L LL++  +   + V 
Sbjct: 314 YAQISTNFPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVT 372

Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
            L   K   +V    +  + +   F+GSR  DS+L++
Sbjct: 373 SLLFHKVGHAVPPHCMVLLESKYLFIGSRFCDSVLMK 409


>gi|170102106|ref|XP_001882269.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642641|gb|EDR06896.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1406

 Score =  151 bits (381), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 243/1189 (20%), Positives = 457/1189 (38%), Gaps = 219/1189 (18%)

Query: 57   NLVVTAANVIEIYVVR-----VQEEGSKESKNSGETKR-------RVLMD---------- 94
            N+VV  +N++ I+ VR     +Q +   E +   + +R        V MD          
Sbjct: 51   NVVVARSNLLRIFEVREEPCPIQNQADDERERRSKVRRGTEAVEGEVAMDEQGDGFINIA 110

Query: 95   --------GISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFEDAKIS 143
                      +      V  + LHG    +E + I+S        R D ++++F+DAKI+
Sbjct: 111  KSQKCPTHTPTVTRFYFVREHHLHGIVTGIEGVKIMSS----LEDRLDRLLISFKDAKIA 166

Query: 144  VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            +LE+ D++H L   S+H +E +P+ + +     S  R  L + DP  RC  + +    + 
Sbjct: 167  LLEWSDAVHDLITVSIHTYERAPQLMSID---SSLFRTEL-RTDPISRCAALSLPRHALA 222

Query: 203  ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
            IL   Q  + L V D+D              S +++L    D ++++V DF F+ G+  P
Sbjct: 223  ILPFYQSQAELEVMDQD---QSQAKDVPYSPSFILDLPAQVDQNIRNVIDFAFLPGFNNP 279

Query: 259  VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
             + +L + + TW GR+     T  +   ++    + +P+I S   LPH+   LL   + +
Sbjct: 280  TIAVLFQTQQTWTGRLREFKDTVRLVIFTLDIVTQNYPIITSVEGLPHECLALLPCGTSL 339

Query: 319  GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPRSSFSVE-------LDAAHAT 370
            GGV+++ +N I Y  QS+    L +N +   +    ++P  S + E       L+ + A 
Sbjct: 340  GGVVIITSNAIIYTDQSSKRVVLPVNGWVSRI---SDIPLPSLTPEEQLRNICLEGSRAV 396

Query: 371  WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLG 426
            ++ +    +  K G +  L +V DG+ V +L +S   P +  + I ++   L    F +G
Sbjct: 397  FVDDRNLFVILKDGTVYPLEIVVDGKTVSKLTMS---PPLAQTSIPSVLRKLDDDHFLVG 453

Query: 427  SRLGDSLLVQFTCGSGTSMLSSGLKEEFG---DIEADAPSTKRLRRSSSDALQDMVNGEE 483
            S +G S+L++          ++ ++EE     D+EA AP+T        +   D  N   
Sbjct: 454  SSVGPSVLLK----------AAHIEEEVAEDHDMEA-APATVVYDADDMEFDDDDGNLPR 502

Query: 484  LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQS 537
            ++          +       ++RDSL   GP+ D ++ L  N D       +ATG     
Sbjct: 503  VA-------QPMAKPTVIHLSLRDSLPAYGPISDMTFSLAKNGDRPVPELVAATGSGFLG 555

Query: 538  NYELVE--LP-----------GCKGIWTV------------YHKSSRGHNADSSRMAAYD 572
             + L +  LP           G +G+W++            Y K+     A++  +    
Sbjct: 556  GFTLFQRDLPVRTKRKLHVIGGARGLWSLPIRQPVKASGISYEKAVNPFQAENDSLIIST 615

Query: 573  DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            D  +    +S   +  V+ TA             + G TI A   F R  V+ V     R
Sbjct: 616  D-INPSPGLSRAGKNDVMITAR------------IPGTTIGAAPFFQRTTVLHVMTNALR 662

Query: 633  ILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            +L+ G  + +D+                 + + SI+DP+VL+   D SI L +G+     
Sbjct: 663  VLEPGMQIIKDMD---------GNMPRPRIRACSISDPFVLILREDDSIGLFIGETERGK 713

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKG-PEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
            +  +  + +        SC      G  E     ++T   +++ +  A++    G     
Sbjct: 714  IRRKDMSPMGDK----VSCFYTDTTGLLESNFENSTTPVGVTSTLSAAVNAGSKGQ---- 765

Query: 751  DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
              + ++    G +E++ +P     F+ D   S +  +VD++   A               
Sbjct: 766  --WLILVRPQGIVELWTLPKLTLGFSADGLTSLQNVLVDSHDPPA-------------PS 810

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
              Q          V ++ +        RP L   L  G +  Y+                
Sbjct: 811  LPQDPPRKPQEFDVEQILVAPIGESSPRPHLCVFLRSGQLTIYEVLPLG----------- 859

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-------- 922
              T+ +L     +  +++ ++ S    +    EE   G   ++  I++            
Sbjct: 860  -RTTEALPKVRPAHVKIKFVKISSMAFEIQRPEEGEKGIIAEQKRIYRMFVPFVTSASPG 918

Query: 923  --HQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
                G F +G RP W     +  ++++P      + AFT          F+  T +    
Sbjct: 919  VTFSGVFFTGDRPNWIFGTDKGGVQIYPS-GHAVVNAFTPCSLFESKGDFLMYTEEA--S 975

Query: 980  ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVL---SLL 1036
            + +      YD        +PL++ P    Y           S  V  P   +L   S L
Sbjct: 976  VSKWLPDFHYDG------PLPLRSVPRGRAY-----------SSLVFDPSTSLLVAASSL 1018

Query: 1037 IDQEVGHQIDNHNLSSVDLHR--TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
              +   +  D++ +   +         +   + ++ PD     W T       ++E    
Sbjct: 1019 QAKFASYDDDDNKIWEPETPNIGNPMCDTSTLELISPDM----WITMDGFEFATNEYIND 1074

Query: 1095 VRVVTLFNTTTK-ENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1142
            V  VTL    T+  ++  +A+GT   +GED+AARG   ++       +P
Sbjct: 1075 VACVTLETAGTEVGSKDFIAVGTTIDRGEDLAARGATYIYEIVEVVPDP 1123


>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
            [Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
            nidulans FGSC A4]
          Length = 1339

 Score =  151 bits (381), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 256/1156 (22%), Positives = 444/1156 (38%), Gaps = 210/1156 (18%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V   ++++I+ +R        S ++ +T+ R          L L   Y+L G V  +
Sbjct: 28   NLIVARTSLLQIFSLR------DVSLSALDTEVRPAQHRQETCKLVLEREYQLPGTVTDI 81

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + ++ G D      ++++AF DAK+S++E+D   +GL   S+H +E  +      
Sbjct: 82   CRVKILKTKSGGD------AVLVAFRDAKLSLVEWDPERYGLSTISIHYYERDDMTRSPW 135

Query: 173  GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE- 231
              +    G ++  DP  RC         + I+   Q G  LV D+  FGS   +  R+E 
Sbjct: 136  ASDLSTCGSILSADPGSRCAIFQFGARSLAIIPFHQPGDDLVMDD--FGSEPDYENRVEG 193

Query: 232  --------------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
                                SS V+ L  LD  + H     F++ Y EP   IL+ +  T
Sbjct: 194  NSRSHEAKDKDAAEYQTPYASSFVLPLTALDPSVIHPISLAFLYEYREPTFGILYSQVAT 253

Query: 270  WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
                +  +      + +++    +    + S   LP D +K++A+P P+GG L++G+N  
Sbjct: 254  SHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVALPPPVGGSLLIGSNEL 313

Query: 329  IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
            +H      + A+ +N ++    S     +S  ++ L+        +D    LL+  TG  
Sbjct: 314  VHIDQAGKTNAVGVNEFSRQASSFSMTDQSDLALRLENCVVERFSDDNGDLLLALSTGVF 373

Query: 387  VLLTVVYDGRVVQRLD---LSKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCG 440
             L++   DGR V  +    LS  +   L S  ++   +GN   F GS   DS+L+     
Sbjct: 374  ALVSFKLDGRSVSGISVRPLSGPSKEFLASTASSSAFLGNGKVFFGSESADSVLL----- 428

Query: 441  SGTSMLSSGLKEEF-GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
             G S  SS  K+ F G    D         S  DA +D +     +       N  S   
Sbjct: 429  -GWSSASSATKKSFSGSTSND--------ESEDDAYEDDLYSSAPAAMTDNPQNQPSNSS 479

Query: 500  TFSFA---VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVY 554
              +F    + D L + GP++D   G    A +  T   K    ELV   G    G   + 
Sbjct: 480  VAAFGDLRIHDRLSSPGPIRDIVLGRSSEASSRDT---KDGVLELVAAQGSDEGGTMVIM 536

Query: 555  HK--------SSRGHNADS----SRMAAYDDEYHAYLIISL-------EARTMVLETADL 595
             +        S     A+S    S +   +D+   Y+I+S        E+   VLE  D 
Sbjct: 537  KREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVILSKQEKPDKEESEVFVLE--DK 594

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            L  +T          T+  G L  + RVIQV     R  D  +   D             
Sbjct: 595  LRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVRSYDAVWDEDD------------- 641

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
            S+    ++ ++ DPY+ +   D ++ LL  D S                  +   TL  D
Sbjct: 642  SDERVAVNATLVDPYLAIIRDDSTLLLLQADDS----------------GDLDEVTLSED 685

Query: 716  KGPEPWLRKT--STDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
               + WL     S +A   T    +I                +  +   L ++ +P+F  
Sbjct: 686  VVSQKWLSACFYSDNAGFFTAPFASI--------------LFLLNQDHQLYVYRLPDF-A 730

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
            V +V + V     I+ T   E  K S T              +EN+  + VVEL      
Sbjct: 731  VISVIEGVGCLPPILST---EPPKRSTT--------------RENVLQIAVVELG----D 769

Query: 834  AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
            ++ S PFL     +  ++ Y+ +     E T          R L  +N +  +  N    
Sbjct: 770  SYSSLPFLILRTENDDLVVYKPFFTNSKELTGL--------RFLKEANHTLPKTPNTT-- 819

Query: 894  RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLC 950
                D    E  P       + I  NI+G    F+ G  P    +FR      P   +L 
Sbjct: 820  ----DELQSEMKP-------LRILPNIAGCSSIFMPG--PSAGFIFRAS-TTSPHFIRLR 865

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
             G I         + + GF Y+ S G L + +LP G+     W + + +P+     ++TY
Sbjct: 866  GGFIKGLGCFD--SPDKGFAYLDSHG-LHLAKLPEGTQLGYPW-IMRTVPIGQQIDKLTY 921

Query: 1011 FAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
             +  + Y       VL    +    L  D E+  +  N  +S +       V +  ++++
Sbjct: 922  VSASDTY-------VLGTCQRCEFRLPEDDELHPEWRNEEISFLP-----EVNQSSLKVV 969

Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARG 1128
             P      W    + P++ +E+ + ++ ++L  +  T E   ++ +GT+  +GED+ +RG
Sbjct: 970  SPKT----WSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRG 1025

Query: 1129 RVLLFSTGRNADNPQN 1144
             + +F       +P+ 
Sbjct: 1026 CIYVFEVIEVVPDPEQ 1041


>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
 gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
          Length = 1397

 Score =  150 bits (380), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 255/1165 (21%), Positives = 453/1165 (38%), Gaps = 193/1165 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V   ++++++ +     GS  +   G+  ++   D +  A L L   Y + G +  L
Sbjct: 28   NLIVAKTSLLQVFSLVNVTYGSAPA---GQPDQKGRHDRLQHAKLVLAAEYEVPGTITGL 84

Query: 117  AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
              +      NS+   D+I+++  +AK+S++E+D   HG+   S+H +E  E        +
Sbjct: 85   ERVR---ISNSKSGGDAILVSSRNAKLSLIEWDPQKHGITTISIHYYEGEESHMSPWVPD 141

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-DTFGSGGGFSARIES- 232
              +    + VDP G C  +  +G+  + IL   Q G  LV D+ D   +G   +  +   
Sbjct: 142  LGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDAIPNGDDTTDAVNDA 200

Query: 233  ----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
                            S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201  QKPAPGNAVHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSMSLT 260

Query: 275  SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
              +      S  ++    K    + +   LP D +K++ +P PIGG L++G N  +H   
Sbjct: 261  IDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDIFKVVPLPPPIGGALLIGTNELVHVDQ 320

Query: 334  QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
               + A+ +N +A    +     +S   + L+      L +     LL    G + +LT 
Sbjct: 321  AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCMVEQLGSGAGDVLLILSDGRMAILTF 380

Query: 392  VYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS--- 441
              DGR V  + L     ++  S++ S  +   ++G +  F GS  GDS+L+ ++  S   
Sbjct: 381  KVDGRSVAGISLHFVAEQSGGSIIKSRPSCSASLGRNKLFYGSEEGDSILLGWSKHSSAT 440

Query: 442  -------------GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
                         GT+ LS   +++  D +           +++   + +VNG+     G
Sbjct: 441  KKPSKAAGGGNEDGTANLSDEEEQDDDDDDMYEDDLYSANPTTTQQEKQVVNGD-----G 495

Query: 489  SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK 548
            +A+         F+    D L ++GP +D + G    + +     S       +EL   +
Sbjct: 496  AAN---------FTLRAHDRLWSLGPYRDITLGRPPKSKSKDRQDSVPEISAPLELVAAR 546

Query: 549  GI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA------------RTMVLE 591
            G       TV  +       DS +M   DD Y  + I  ++             R ++L 
Sbjct: 547  GFGKAGGLTVLKREIDPFTIDSLKM---DDVYGVWSIRVIDPKSKDAGLSRSYDRYLLLA 603

Query: 592  TADLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILD 635
             A    +  ESV Y V                +  TI  G L    RV+QV     R  D
Sbjct: 604  KAK-GDDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGSRVVQVLRTEIRSYD 662

Query: 636  GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVS 693
             +     +   P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V 
Sbjct: 663  CNLGLAQIY--PVWDE--DTSEERTVIQASFAEPYLLTIRDDNSLLILQADKNGDLDEVE 718

Query: 694  VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 753
            +Q  AA   S K VS C LY DK      +  S+D              D       +I 
Sbjct: 719  IQGSAA---SAKWVSGC-LYEDK-----TKIFSSD-------------LDTEHAATPNIL 756

Query: 754  SVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
              +    G L IF +PN    +  VD                 L  S     SSS     
Sbjct: 757  LFLLDSDGNLSIFRLPNITEPLCRVDNL--------------NLLPSNLPYESSSRRPV- 801

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
               +E +  + V +L      A H  P++        ++ Y+ Y   G    SK      
Sbjct: 802  --NRETLTELLVADLG----DAIHKSPYMILRTKHDDLVLYEPYRITGENGRSKLQ--FI 853

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
             + +  V     ++  N   +R+P            +P + +    ++ G++  F+SG  
Sbjct: 854  KAVNHVVMGPRTNQPMNKDINRSP------------SPSKLLRALSDVCGYKTVFMSGQN 901

Query: 933  PCWCM---VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            PC+ +   + R  +    +L   ++ + T  H   C  GF YV    ++++ +LPS + +
Sbjct: 902  PCFILKSAIARPNVL---RLRGKAVQSLTGFHIAACERGFAYVDEDNVIRMSRLPSNTRF 958

Query: 990  DNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN 1049
            D+ W  +K IPL      I Y +    Y +  S        +   L  D E   +  N  
Sbjct: 959  DSAWATRK-IPLGEQVDCIVYSSASESYVIGTST------KEDFKLPEDDESHTEWRNEF 1011

Query: 1050 LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKEN 1108
            ++ +       ++   V++LEP      W       ++ +E    ++++ L  + TT E 
Sbjct: 1012 ITFLP-----QLDRGTVKLLEPKN----WSAIDIYEVEPAERITCIKIIRLEISETTHER 1062

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLF 1133
            + ++ +G+A  +GED+  +G + +F
Sbjct: 1063 KDMVVVGSAVAKGEDIVPKGCIRVF 1087


>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor FP-101664
            SS1]
          Length = 1431

 Score =  150 bits (378), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 247/1117 (22%), Positives = 455/1117 (40%), Gaps = 161/1117 (14%)

Query: 103  LVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC 161
            LV  +RLHG V  L A+ +    ++  + D ++++F+DAKI++LE+ D+IH +   S+H 
Sbjct: 123  LVREHRLHGTVTGLEAVRTVHSLED--KLDRLLVSFKDAKIALLEWSDAIHDVMTVSIHT 180

Query: 162  FE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK--ASQGGSGLVGDED 218
            +E +P+ + L        RG L +VDP  RC  + +    + IL    SQ    L+  E 
Sbjct: 181  YERAPQLMALD---SPLFRGEL-RVDPLSRCAALSLPKDSLAILPFYQSQAELDLMEQES 236

Query: 219  TFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
            +      +S     S V++L  D+D  +++V DF F+ G+  P + +L + + TW GR+ 
Sbjct: 237  SQARDVPYSP----SFVLDLANDVDQRIRNVIDFAFLPGFNNPTVAVLCQYQQTWTGRLK 292

Query: 276  WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ- 334
                T  +   ++      +PLI +   LP+D   L    + IGGV ++ +N I +  Q 
Sbjct: 293  EYKDTVGLFIFTLDLVTNNYPLITAVDGLPYDCLSLTPCSTAIGGVFILASNAIIFVDQA 352

Query: 335  SASCALALNNY---AVSLDSSQELPRSSF-SVELDAAHATWLQNDVALLSTKTGDLVLLT 390
            S    L +N +      L      P+    +++L+ A  T++ +    +  K G +  + 
Sbjct: 353  SRRVILPVNGWPPRTSDLTMPSLTPQEQLRNLQLEGARFTFVDDKTLFVILKDGTVHPVE 412

Query: 391  VVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
            +V DG+ V RL ++      T P+V    +  + +   F+GS +G S+L++      T+ 
Sbjct: 413  LVLDGKTVSRLSMADALARTTIPAV----VARVRDDYLFVGSMVGPSVLLR------TAH 462

Query: 446  LSSGLKEEFGDIEAD-----APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK- 499
            +   +KEE  D++A      AP+         D      NGE+ S  G+ +   +S +K 
Sbjct: 463  VEEVIKEEDVDMDAGPATVVAPADTMDLDDDDDLYGPSGNGEQPSANGATNGTVDSVKKR 522

Query: 500  -TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------------ISKQSNY 539
                 ++ D+L   G + D ++GL  N D       +ATG             +  +S  
Sbjct: 523  TVVRLSLCDALPAHGAISDMAFGLARNGDRVVPELIAATGSGELGGFHLFQRDMPTRSKR 582

Query: 540  ELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLL 596
            +L  + G +G+W++   ++ +       R ++ +D     +IIS +A     +   A   
Sbjct: 583  KLHAIGGARGMWSLAVRQAMKVSGGTLERPSSQNDS----VIISTDANPSPGLSRIATRS 638

Query: 597  TEVTESVDYFVQGRTIAAGNLFGRRRVIQVF---ERGARIL--DGS--YMTQDLSFGPSN 649
                 ++   + G T+ A   F    ++ +        R+L  DG+   + +DL      
Sbjct: 639  AHSDIAITTRIPGTTLGAAPFFQGTAILHILFNVTNAIRVLEPDGTERQIIKDLE----- 693

Query: 650  SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
                  +    + S SI DP+VL+   D +I L +G+     +  +  + +        +
Sbjct: 694  ----GTAPRPKIKSCSICDPFVLIIREDDTIGLFIGELERGKIRRKDMSPMGDKTSRYVA 749

Query: 710  CTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI-------YSVVCYESGA 762
               + D           T   L T V E     +     QG +       + ++    G 
Sbjct: 750  GGFFTD-----------TSGLLQTFVNEQAPAENVTSTLQGAMNAGNKSQWLILVRPQGV 798

Query: 763  LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
            +E++ +P     F+     +    + D+Y   AL        S  ++   + ++ +I  +
Sbjct: 799  VELWTLPKLTLAFSTTLLATLDPILTDSYDGPAL--------SLPQDPPRKPQELDIDQI 850

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
             +  L   R      RP L  +L  G +  Y+A     P     S    + S +L V  V
Sbjct: 851  VIAPLGESR-----PRPHLIVLLRSGQLAVYEAVAIPPPPEPLPS----TRSSTLLVKFV 901

Query: 883  S-ASRLRNLRFSRTPLDAYTREE----------TPHGAPCQRITIFKNISGHQGFFLSGS 931
              AS+  +++       +   E+              AP Q  +         G F +G 
Sbjct: 902  KVASKAFDIQHPEEEQKSVLAEQKRISRLLVPFVTSPAPGQTFS---------GVFFTGD 952

Query: 932  RPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
            RP W +   +  ++V P      + AFT          F+  + +G   +  +P     D
Sbjct: 953  RPSWILSTDKGGVKVFPS-GHSVVQAFTTSSLWESRGDFLLYSEEGPSLVEWMPD-VQLD 1010

Query: 991  NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
             + P + V   ++ P+    F       LIV+    +  N+  S   D  V  + D+ N+
Sbjct: 1011 GHLPARSVP--RSRPYSNVVFDAST--SLIVAASSFQ--NRFASYDEDGNVVWEPDSPNI 1064

Query: 1051 SSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-E 1109
            SS  L    T+E     ++ PD     W T        +E    +  + L   +T+   +
Sbjct: 1065 SS-PLCECSTLE-----LISPDG----WITMDGYEFAPNEFVNCIVSIPLETMSTESGMK 1114

Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
              +A+GT   +GED+A +G V +F       +P   V
Sbjct: 1115 DFIAVGTTINRGEDLAVKGAVYIFEIVEVVPDPSTHV 1151


>gi|358390357|gb|EHK39763.1| hypothetical protein TRIATDRAFT_48211 [Trichoderma atroviride IMI
            206040]
          Length = 1441

 Score =  149 bits (375), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 250/1173 (21%), Positives = 442/1173 (37%), Gaps = 175/1173 (14%)

Query: 57   NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
            NLVV   ++++I+ V+                         V ++   ES   G     +
Sbjct: 28   NLVVAKGSLLQIFTVKAISTELDPEFQPSQPTETETRFDRQVNDDDGLESSFLGGESMFM 87

Query: 92   LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
              D  +   L L+    L G V  LA +     + +   ++++LA++ AK+ + E+D   
Sbjct: 88   RTDRTNNTKLVLIAEIPLAGTVIGLARVKT--KNTASGGEALLLAYKAAKMCLAEWDPKK 145

Query: 152  HGLRITSMHCFESPEWLHLKRGRESFARG-PLVKVDPQGRCGGVLVYGLQMIILKASQGG 210
            + L   S+H +E  E +      E F      ++ DP  RC         + IL  ++  
Sbjct: 146  NELETISIHYYEKEE-MQGSPWEEVFGEYVNYLEADPGSRCAAFKFGTRNLAILPFTRSE 204

Query: 211  SGLVG---DED-------------TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFV 252
              L     DED               G G    A    S V+ L  LD  + H     F+
Sbjct: 205  EDLEMEDWDEDLDGPRPVKEHTAAANGDGNNVEAAYTPSFVLRLPLLDPSLLHPVHLTFL 264

Query: 253  HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
            H Y EP   +L   +   +   S  H +  +  L +    +    I S   LPHD YK++
Sbjct: 265  HEYREPTFGVLSSSQAPASSLGSKDHLSYKVFTLDLQQ--RASTTILSVTGLPHDLYKVI 322

Query: 313  AVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD--AAHA 369
            A+P+P+GG L+VG N  IH         +A+N  A    S     +S  ++ L+  A   
Sbjct: 323  ALPAPVGGALLVGQNELIHVDQSGKPNGVAINPMAKLATSFNLTDQSDLNLRLESCAIEL 382

Query: 370  TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI---GNSL 422
              ++N   LL    G L +++   DGR V  L +    +    +++ S +T I   G + 
Sbjct: 383  LAIENGELLLILNDGRLGIISFKIDGRTVSGLGVKLVGADCGGNIIKSRVTCISRLGKNA 442

Query: 423  FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
            FFLGS   DS+++ +   S         K    D +      +       +      +  
Sbjct: 443  FFLGSETSDSVVLGW---SRKQTQEKRRKSRLIDTDLALDVDELDLEDDEEDDDLYGDDS 499

Query: 483  ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------INAD- 527
              +     +N         SF + D+L++I P++D + G                ++AD 
Sbjct: 500  ATTKPNQTANGGTVKSGDISFRIHDTLLSIAPIQDITCGQSAFLPDSEEATLNKGVSADL 559

Query: 528  --ASATGISKQSNYELV------------ELPGCKGIWTVYHK----SSRGHNADSSRMA 569
              A A G  +  +  ++            E P  +G WT+  K     S G NA ++   
Sbjct: 560  QLACAVGRGEAGSIAVINREIQPKVIGRFEFPEARGFWTMCVKKPVPKSLGTNAGAAGDY 619

Query: 570  AYDDEYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
                ++  ++I++       E   +   TA     + E+      G T+ AG +  +  V
Sbjct: 620  DAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGNQMVV 679

Query: 624  IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
            IQV +   R  +G   + Q L       +  +G+E   V S SI DPY+L+   D S+ L
Sbjct: 680  IQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDASVFL 734

Query: 683  LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGA 742
               D +     ++   +  +S K  + C LY D                + GV +A  G 
Sbjct: 735  AQIDSNNEIEEIEKTDSGLTSTKWAAGC-LYKD----------------TKGVFQANQG- 776

Query: 743  DGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSET 801
            D       ++   +   +GAL I+ +P+ +  V+  +   S   H+   ++ + +     
Sbjct: 777  DQAKKSGEEVMMFLLNTAGALHIYALPDLSKPVYVAEGLSSIPPHLSADFVAKKV----- 831

Query: 802  EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
                          +E +  + V +L        H  P+L    +   +  Y+       
Sbjct: 832  ------------ASREALTELVVADLG----DTVHYSPYLILRHSTDDLTIYEPIRL--- 872

Query: 862  ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
                 +D P            SA+ +        PL+   ++  P   P   + I  N+ 
Sbjct: 873  ----PTDSPTRNLSDTLFFKKSANSILAKSTVEDPLEDTAQQ--PRYVP---LRICANVG 923

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            G+   FL G  P + +   + +     +    +   +  +   C+ GFIY  S+GI ++ 
Sbjct: 924  GYSTVFLPGPSPAFILKSSKSVPRVVGVQGLGVRGMSTFNTEGCDRGFIYSDSEGIARVT 983

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
            QLPS + +       K +PL      + Y      Y    +V       +   L  D + 
Sbjct: 984  QLPSKTNFTELGVSVKKVPLGNDVRHVAYHHPTETYIAGCAV------TEGFELPKDDDY 1037

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL- 1100
              +    +LS    H +       ++++ P      W    +I M+  E+   ++ + L 
Sbjct: 1038 HKEWAKESLS---FHPSTV--RGSLKLISPVT----WTVIHSIDMEPGESIECMKTLHLE 1088

Query: 1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
             +  TKE   LLA+GTA  +GED+  RGRV ++
Sbjct: 1089 VSEETKERRMLLAVGTALTRGEDLPTRGRVQVY 1121


>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
          Length = 1406

 Score =  149 bits (375), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 238/1182 (20%), Positives = 469/1182 (39%), Gaps = 195/1182 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +L+V   ++++IY +  +     E  ++ +   ++L++            Y L G V  L
Sbjct: 28   DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++AF +AK+S++E+D    G+   S+H +E  +      
Sbjct: 77   CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130

Query: 173  GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
              +    G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS         
Sbjct: 131  VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187

Query: 223  ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
               GGG           +      S V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 188  HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247

Query: 267  ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
              T +  +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+
Sbjct: 248  VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307

Query: 327  NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
            N  +H      + A+ +N ++  + S     +S  ++ L+      L +     LL   T
Sbjct: 308  NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367

Query: 384  GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
            G++ ++    DGR V  + +         + I       T IG+   FLGS  GDS+L+ 
Sbjct: 368  GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427

Query: 437  FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
            ++  S ++       ++  D  AD        +S  D  +D +     + +L G   +  
Sbjct: 428  YSYSSSSTKKHRLQAKQVIDDSADMSEED---QSDDDVYEDDLYSTSPDTTLTGRRPSGE 484

Query: 495  ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
             SA   + F + D L+NIGPL+D + G R++ +   TG    S    +++   +G     
Sbjct: 485  SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNPEKTGDRTNSTSPELQIVASQGSHKSG 544

Query: 551  -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
               V  +    H   S  + + D  + A L    EA             R  V+ T    
Sbjct: 545  GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604

Query: 597  TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSYMT 640
            ++  ES+ + V G                 TI+ G    R+RV+QV +   R  D    T
Sbjct: 605  SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISIGTQESRKRVVQVLKNEVRSYDFGKFT 664

Query: 641  -----QDLSFGPSNSES-------GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
                 ++ + G   S +          ++    +S S+AD  + +   D ++  L  D S
Sbjct: 665  PSRCRRNFADGTDLSLTQIYPIWDDDTNDERMAVSASLADSCLAILRDDSTLLFLQADDS 724

Query: 689  TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
                 V     + S K    SC LY DK                TG+  +ID     P+ 
Sbjct: 725  GDLDEVVFGEDVASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV- 765

Query: 749  QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
            + D++  +      L +       C+      + G   ++      +   S+  I++   
Sbjct: 766  KNDMFLFLLSHDCKLFV------KCLLWSSFALRGWHLMLSKSSGLSRPRSKAAIDN--- 816

Query: 809  EGTGQGRKENIHSMKVVELAM----QRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
                +G +  + S+ ++E  +    + WSA    P+L        I+C+     EG    
Sbjct: 817  ----RGDRRFVASVNLIEAIVADLGETWSAS---PYL--------IVCHH---IEG---- 854

Query: 865  SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
                          + ++  S+  N    R P    + + +      + + I  +ISG  
Sbjct: 855  --------------IHSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLS 900

Query: 925  GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
              F+ G+   + +          +L   +  + + L    C+ GFIY+ SQ  ++ C+LP
Sbjct: 901  AVFMPGASAGFIIRTSASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLP 960

Query: 985  SGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGH 1043
              + +D  W +++V   +   H + Y     +Y       VL   +     L  D E+  
Sbjct: 961  PMTRFDYQWTLKRVHLGEQVDH-LAYSTSSGMY-------VLGTCHATDFKLPEDDELHP 1012

Query: 1044 QIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FN 1102
            +  N +  ++      +     ++++ P+     W    +  + + E  + ++ ++L  +
Sbjct: 1013 EWRNEDCLAISFFP--SARGSFIKLVSPNT----WSIIDSFSLGADEYVMAIKNISLEVS 1066

Query: 1103 TTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
              T E + ++ +GTA+ +GED+ +RG + +F   +   +P +
Sbjct: 1067 ENTHERKDMIVVGTAFARGEDIPSRGCIYVFEVVQVVPDPDH 1108


>gi|358387835|gb|EHK25429.1| hypothetical protein TRIVIDRAFT_32877 [Trichoderma virens Gv29-8]
          Length = 1440

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 258/1178 (21%), Positives = 457/1178 (38%), Gaps = 186/1178 (15%)

Query: 57   NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
            NLVV   ++++I+ V+                         V ++   ES   G     +
Sbjct: 28   NLVVAKGSLLQIFTVKSISTELDPEFQPNQPAEVDTRFDRQVNDDDGLESSFLGGETMFM 87

Query: 92   LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
              D  +   L L+    L G V  LA L       +   + ++LA++ AK+ + ++D   
Sbjct: 88   RTDRTNNTKLVLIAEIPLAGTVIGLARLKTN--KTASGGEVLLLAYKAAKMCLAQWDPKK 145

Query: 152  HGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMIILKASQ 208
            + L   S+H +E  E L      E F  G  V   + DP  RC         + IL   +
Sbjct: 146  NELETISIHYYEKEE-LQGSPWEEVF--GEYVNHLEADPGSRCAAFKFGTRNLAILPFRR 202

Query: 209  GGSGLVG---DEDTFG---------SGGGFSARIESSHV------INLRDLDMKHVKDFI 250
                L     DED  G         +  G S  +E+++       + L D  + H     
Sbjct: 203  SEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTPSFVLRLPLLDPSLLHPVHLT 262

Query: 251  FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
            F+H Y EP   +L   +   A  +  K H       ++    +    I S   LPHD YK
Sbjct: 263  FLHEYREPTFGVLSSSQAP-AASLGLKDHLSY-KVFTLDLQQRASTTILSVTGLPHDLYK 320

Query: 311  LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD--AA 367
            ++A+P+P+GG L+VG N  IH         +A+N  A  + S     ++  ++ L+  A 
Sbjct: 321  VIALPAPVGGALLVGQNELIHVDQSGKPNGVAVNPMAKLVTSFSLTDQADLNLRLENCAI 380

Query: 368  HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSD---ITTIGN 420
                ++N   LL    G L +++   DGR V     RL  +    +V+ S    I+ +G 
Sbjct: 381  ELLAVENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVGADCGGNVIKSRAACISRLGK 440

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD-APSTKRLRRSSSDALQDMV 479
            + FF+GS  GDS+++ +     +   +   + +   I+ D A     L     +   D+ 
Sbjct: 441  NTFFVGSETGDSVVLGW-----SRRQTQEKRRKSRLIDPDLALEVDELDLEDDEEDDDLY 495

Query: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------IN 525
              +  +     +N   +     SF + D L++I P++D + G                ++
Sbjct: 496  GDDSAATKPQTTNGGAAKSGDLSFRIHDVLLSIAPIQDITCGQAACLPDSEEATLIKGVS 555

Query: 526  AD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAA 570
            +D   A A G  +  +  ++            E P  +G WT+  K     +  S+   A
Sbjct: 556  SDLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFWTMCVKKPVPKSLGSNVGVA 615

Query: 571  YDDE---YHAYLIISLEARTMVLETADLLTEVTESVDYFVQ-------GRTIAAGNLFGR 620
             D +    H   +I  +      ET+D+        +   +       G T+ AG +  +
Sbjct: 616  GDYDAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGKQ 675

Query: 621  RRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
              VIQV +   R  +G   + Q L       +  +G+E   V S SI DPY+L+   DGS
Sbjct: 676  MMVIQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDGS 730

Query: 680  IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
            + L   D +     ++      +S K V+ C LY D                + GV ++ 
Sbjct: 731  VFLAQIDSNNEIEEMEKADGGLTSTKWVAGC-LYKD----------------TKGVFQSN 773

Query: 740  DGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--REAL 796
              +  G  D+G +   +   +GAL I+ +P+ +  V+  +   S   H+   ++  R A 
Sbjct: 774  LNSAAGKADEG-VMMFLLNSAGALHIYSLPDLSKAVYIAEGLSSIPPHLSAGFVARRGAT 832

Query: 797  KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
            +++ TEI                    V +L      + HS P+L    +   +  Y+  
Sbjct: 833  RETLTEI-------------------VVADLG----DSVHSSPYLILRHSTDDLTIYEPI 869

Query: 857  LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
                   T    D +   +S + S+++ S + +      P D     + P   P +    
Sbjct: 870  RLPTASATHALSDTLFFKKSAN-SSLAKSAVED------PSDDTA--QPPRYVPLRTCA- 919

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
              N+ G+   FL G  P + +   + +     L    +   +  H   C+ GFIY  S+G
Sbjct: 920  --NVGGYSAVFLPGPSPAFIIKSSKSIPRVVGLQGLGVRGMSTFHTEGCDRGFIYADSEG 977

Query: 977  ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
            I ++ QLPS +         K +PL      + Y      Y    ++       +   L 
Sbjct: 978  IARVTQLPSKTNLTELGVSVKKVPLGHDIRHVAYHHPTETYIAGCTI------TENFELP 1031

Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1096
             D +   +    +LS +      ++    ++++ P      W    +I M+  E+   ++
Sbjct: 1032 KDDDYHKEWARESLSFLP-----SMARGALKLINPIT----WTVIHSIDMEPGESIECMK 1082

Query: 1097 VVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
             + L  +  TKE   LLA+GTA  +GED+  RGRV ++
Sbjct: 1083 TLHLEVSEETKERRMLLAVGTALTRGEDLPTRGRVQVY 1120


>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
            CBS 112818]
          Length = 1398

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 266/1161 (22%), Positives = 447/1161 (38%), Gaps = 183/1161 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V   ++++++ +     GS  +    +  R    D    A L L   Y + G +  L
Sbjct: 28   NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +    + +    D+I+++  +AK+S++E+D   HG+   S+H +E  E  H+      
Sbjct: 85   QRVRISNSKSGG--DAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141

Query: 177  FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
                P  + VDP G C  +  +G+  + IL   Q G  LV D+               D 
Sbjct: 142  LGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSDA 200

Query: 220  FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
              S  G +A  +    S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201  QKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260

Query: 275  SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
              +      S  ++    +    + +   LP D +K++ +P P+GG L++G N  +H   
Sbjct: 261  IDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320

Query: 334  QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
               + A+ +N +A    +     +S   + L+      L +     LL    G + +L+ 
Sbjct: 321  AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380

Query: 392  VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
              DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+ ++  
Sbjct: 381  KVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSRP 436

Query: 441  SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
            S T+   S  K   G  E+ A           D   D +  ++L     AS   E     
Sbjct: 437  SSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLHSVNPASIRQEKQVVN 494

Query: 497  --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
              +   F+F   D L ++GP +D + G    + +     S  +    +EL   +G     
Sbjct: 495  GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKSG 554

Query: 551  -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
              TV  +    +  DS +M   DD Y  + I  ++ +   T +  + D   LL +     
Sbjct: 555  GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTRLSRSYDKYLLLAKAKGDD 611

Query: 601  --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
              ESV Y V                +  T+  G L    RV+QV     R  D  Y    
Sbjct: 612  KEESVVYSVGSSGLDSIDAPEFNPNEDCTVDIGTLATGTRVVQVLRTEIRSHD--YNLGL 669

Query: 643  LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
                P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ  AA 
Sbjct: 670  AQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726

Query: 701  ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYES 760
              S K VS C LY DK           + + S    E     + GP    +I   +    
Sbjct: 727  --SGKWVSGC-LYEDK----------MNIFFSDFDIE----NEAGP----NILLFLLDVD 765

Query: 761  GALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
            G L IF +PN +  +  VD                 L  S     SSS        +E +
Sbjct: 766  GNLSIFRLPNISEPLCRVDNL--------------NLLPSNLPYESSSRRPV---NRETL 808

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
              + + +L      A H  P++        ++ Y+ Y   G    S         R L  
Sbjct: 809  TELLIADLG----DAIHKSPYMILRTKHDDLVLYEPYRIAGESGHSG-------LRFLKA 857

Query: 880  SN--VSASRLR---NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
             N  V   R     N   +R+P            + C+ +    ++ G++  F+SG  PC
Sbjct: 858  VNHVVMGPRTDQGVNHDINRSP------------SSCKLLRALPDVCGYKTVFMSGHNPC 905

Query: 935  WCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
            + ++     R H     G  V + +  H   C  GF YV    ++++ +LPS + +D+ W
Sbjct: 906  F-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSGW 964

Query: 994  PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
              +K I L      I Y +    Y +  S        +   L  D E   +  N  ++ +
Sbjct: 965  ATRK-IALGEQVDSIVYSSASECYVIGTSA------KEDFKLPEDDESHTEWRNEFITFL 1017

Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLL 1112
                   +E   V++LEP      W T  +  ++ +E    + V+ L  +  T E + ++
Sbjct: 1018 P-----QLERGTVKLLEPKN----WSTIDSHELKPAERITCIEVIRLEISELTHERKDMV 1068

Query: 1113 AIGTAYVQGEDVAARGRVLLF 1133
             +G++ V+GED+  +G + +F
Sbjct: 1069 VVGSSIVKGEDIVPKGFIRVF 1089


>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
            alecto]
          Length = 820

 Score =  148 bits (373), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 109/411 (26%), Positives = 184/411 (44%), Gaps = 31/411 (7%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 162  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 217

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 218  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLVYEAF----PHDSQLGQGNLK 267

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++  R +  R S+   +    E         R   F++I G+ G F+ G  
Sbjct: 268  VRFKKVPHNINF-REKKPRPSKKKAEGGAEEGPGARGRVARFRYFEDIYGYSGVFICGPS 326

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 327  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 386

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             WPV+K IPL+ T H + Y  E  +Y +  S              I +  G + +   + 
Sbjct: 387  PWPVRK-IPLRCTAHYVAYHVESKVYAVATS-------TNTPCTRIPRMTGEEKEFETIE 438

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN- 1108
              D +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    
Sbjct: 439  RDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGL 494

Query: 1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
            +  +A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct: 495  KGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 545


>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1433

 Score =  147 bits (371), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 262/1194 (21%), Positives = 476/1194 (39%), Gaps = 212/1194 (17%)

Query: 57   NLVVTAANVIEIYVVR-------VQEEGSKES-----KNSGETKRRVLMD---------- 94
            N+VV  ++++ I+ VR        Q+E  KE      K +   +  V MD          
Sbjct: 42   NVVVARSSLLRIFEVREEPAPVSTQKEVEKERRAAVRKGTEAVEGEVEMDTSGEGFVNMG 101

Query: 95   ---GISAAS-------LELVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKIS 143
               G++ A+         LV  +RLHG V  L A+ +    D+  + D ++++F+DAKI+
Sbjct: 102  TSAGLNGAAHPPTVNRFYLVREHRLHGTVTGLEAVRTVHSLDD--KLDRLLVSFKDAKIA 159

Query: 144  VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            +LE+  S+H +   S+H +E +P+ + +        R  L + DP  RC  + +    + 
Sbjct: 160  LLEWSLSLHDVITVSIHTYERAPQLIAID---SPLFRSEL-RADPLSRCAALSLPKDSLA 215

Query: 203  ILK--ASQGGSGLVGDEDTFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIE 257
            IL    SQ    ++  E +      +S     S +++L  D+D  +++V DF F+ G+  
Sbjct: 216  ILPFYQSQAELDILEQEASQARDVPYSP----SFILDLANDVDKRIRNVIDFTFLPGFHN 271

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            P + +L + + TW GR+     T  +   ++      +P+I +   LP+D + L    + 
Sbjct: 272  PTVAVLCQYQQTWTGRLKEYKDTVGLYIFTLDFVTNNYPVITAVDGLPYDCFALTPCSTA 331

Query: 318  IGGVLVVGANTIHYHSQSA-SCALALNNYAVSLD-------SSQELPRSSFSVELDAAHA 369
            IGGV+++ +N + +  QS     L +N +   +        ++QE  R    ++L+ A  
Sbjct: 332  IGGVVILASNAVLFVDQSGRRVILPVNGWPPRVSDLPMPPLTAQEQTR---DLQLEGARF 388

Query: 370  TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFF 424
             ++ +    L  K G +  + ++ DGR V +L +S      T P+V    +  IG+   F
Sbjct: 389  VFVDDKKLFLILKDGTVYPIELIQDGRTVSKLTMSDALARTTIPAV----VKRIGDDHIF 444

Query: 425  LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE---ADAPST--------------KRL 467
            +GS +G S+L++          ++ ++EE  D +   A+ P+T                L
Sbjct: 445  IGSIVGPSVLLK----------TARVEEEIHDEDVAMAEGPATVVDTSKTVDMMDDDDDL 494

Query: 468  RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
               S+ A Q   NG        A +N  + +     ++ D++   GP+ D ++GL  N D
Sbjct: 495  YGPSTIADQPAANGTA----NGAVDNVRT-RTVVHLSLCDAIPAHGPISDMTFGLSRNGD 549

Query: 528  ------ASATGISKQSNYELVE--LP-----------GCKGIW------------TVYHK 556
                   +ATG     ++ L +  +P           G +G+W            T + +
Sbjct: 550  RLVPELVAATGSGHLGSFSLFQRDMPTRFKRKLHAIGGGRGMWSLPVRQQVKTGGTTFER 609

Query: 557  SSRGHNADSSRMAAYDDEYHAYLIISLEART--MVLETADLLTEVTESVDYFVQGRTIAA 614
             S   +AD+  +    D   +  +  +  R+    +     +  VT     F QG  I  
Sbjct: 610  PSNPFHADNDTVIISTDANPSPGLSRIATRSSHSDITITTRIPGVTLGAAPFFQGTAILH 669

Query: 615  GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
              +F    VI+V E      DG+          S  +    +    + S SI DP++L+ 
Sbjct: 670  -VMFNVTNVIRVLEP-----DGTERQ-------SIKDLDGNAARPRIKSCSICDPFILII 716

Query: 675  MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
              D +I L +G+     +  +  + +        +   Y D      L +T  +A     
Sbjct: 717  REDDTIGLFIGEIERGKIRRKDMSPMGEKTSKYLAGYFYTDTS---GLFQTFLNA---EA 770

Query: 735  VGEAIDGADGGPLDQGDI--YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
             GEA      G ++ G+   +  +    G +EI+ +P     F+     +    I D+  
Sbjct: 771  PGEAATSTLQGAMNAGNKTHWLTLVRPQGVVEIWTLPKLTLAFSTTTLATLDPVISDSLE 830

Query: 793  REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
              AL                Q        + V +L +      H RP L  +L  G +  
Sbjct: 831  PPALS-------------LPQDPPRKPQELDVDQLVIAPLGESHPRPHLIVLLRSGQLAI 877

Query: 853  YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL-RFSRTPLDAYTREETPHGAPC 911
            Y+A     P       DP+  +RSL++       L NL +      D    EE       
Sbjct: 878  YEAVAASPPA------DPLPPTRSLTL-------LVNLVKVKSKAFDIQHTEEEQKSVLA 924

Query: 912  QRITIFKNI----------SGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVL 960
            ++  I + +            + G F +G RP W +   +  +RV P      + AFT  
Sbjct: 925  EQKRISRLLLPFVTSPAPGQTYSGVFFTGDRPSWIVSTDKGGVRVFPS-GHNVVHAFTTC 983

Query: 961  HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI 1020
                    F+  + +G   +  +P     D + P + V   ++ P+    F   +   LI
Sbjct: 984  SLWESRGDFLLYSEEGPSLVEWMPD-IILDAHLPARSVP--RSRPYSHVVFDASS--SLI 1038

Query: 1021 VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT 1080
            V+      +N+  S   D  +  + D+ N+S      T T+E     ++ PD     W T
Sbjct: 1039 VAASSF--MNRFASYDEDGNIVWEPDSPNISFPHCE-TSTLE-----LISPDG----WIT 1086

Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLF 1133
                   ++E    V  V L   +T+   +  +A+GT   +GED+A +G V +F
Sbjct: 1087 MDGYEFAANEFVSCVVSVPLETVSTESGMKDFIAVGTTINRGEDLAVKGAVYIF 1140


>gi|242075246|ref|XP_002447559.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
 gi|241938742|gb|EES11887.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
          Length = 389

 Score =  147 bits (371), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 74/111 (66%), Positives = 89/111 (80%), Gaps = 1/111 (0%)

Query: 1036 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1095
            + DQE+G  I+N   S  DL + YTV+E+EVRI+E  +  G W+TR TIPMQS ENALTV
Sbjct: 1    MADQELGLHIENDVTSGDDLQKVYTVDEFEVRIMELGKPSGHWETRFTIPMQSFENALTV 60

Query: 1096 RVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
            R+VTL NT+TKENETL+AIGTAYVQGEDVAARGRVLL+S  R ++N QNLV
Sbjct: 61   RIVTLQNTSTKENETLMAIGTAYVQGEDVAARGRVLLYSFSR-SENSQNLV 110


>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit, partial [Rhipicephalus pulchellus]
          Length = 726

 Score =  147 bits (371), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 118/391 (30%), Positives = 179/391 (45%), Gaps = 61/391 (15%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGTGQG 814
            V  E+G LEI+ +P +   F V  F  G+  +VD+    A   +++E ++  S E     
Sbjct: 73   VARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHESMPV- 131

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                +H + VV L ++     HSRP L A + D  +L Y+A+ F              T 
Sbjct: 132  ----VHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF------------YETQ 169

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIFKNISG 922
            R   +          LRF +   D + RE     + P     ++       +  F +ISG
Sbjct: 170  REGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPFSDISG 220

Query: 923  HQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G FL G RP W  M  R  LR HP   DG I  F   HNVNC  GF++   QG L+I 
Sbjct: 221  YSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQGELRIS 280

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEV 1041
             LP+  TYD  WPV+KV PL+ TPH + Y  +   Y ++ S P   P N ++        
Sbjct: 281  TLPTHLTYDAPWPVRKV-PLRCTPHFVNYHVDSKTYCVVTSQP--DPCNHLVRF-----T 332

Query: 1042 GHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVT 1099
            G + +   L     +   T++++ +++L P      W+T     + +   E+   ++ V 
Sbjct: 333  GEEKEYELLERDSRYIFPTMDKFSLQLLSP----VSWETIPNTRVDLDEWEHLTCLKNVM 388

Query: 1100 LFNT-TTKENETLLAIGTAYVQGEDVAARGR 1129
            L +  TT   +  LA+GT Y  GEDV +RGR
Sbjct: 389  LSSEGTTTGMKGYLALGTNYCYGEDVTSRGR 419


>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
 gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
          Length = 1187

 Score =  144 bits (363), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 95/299 (31%), Positives = 144/299 (48%), Gaps = 19/299 (6%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+      I +Y +   +E       S      +  D      LE +  Y  +G +  +
Sbjct: 29  NLLTAGPTCIRVYDIIKDQEDIDLDNRSDNADNHLNKDNKLHPELEFLASYSFYGKIYGI 88

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
               +        RDS+ + F DAK+S++E+D     L   S+H FE  E   LK G   
Sbjct: 89  ----ESVRFRHHHRDSLFICFADAKLSLVEYDADNSNLTTLSLHTFEDDE---LKNGFSR 141

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-DTFGSGGGFSARIESSHV 235
               P+++VDP  RC  ++V  + + IL     G      + D   + G +   +  S+V
Sbjct: 142 NLSIPIIRVDPDNRCAAMVVSNVHLAILPFRHRGPAEQQVQIDPKNTSGKYP--LMPSYV 199

Query: 236 INLRDLDMKHVKDFI---FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
           +++RDL  + V   I   F+ GY EP ++IL E   TW+GRV+ +  TC I A+S++T  
Sbjct: 200 VDVRDLGNEKVSRLIDIRFLEGYYEPTILILCEILRTWSGRVAVRQDTCSILAVSLNTID 259

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
           K HP+IWS  NLP D    + VP PIGGVL+  AN + + +QS         YA SL+S
Sbjct: 260 KVHPVIWSLNNLPFDCLGAITVPRPIGGVLIFAANCLLHLNQSKP------PYAESLNS 312



 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 88/391 (22%), Positives = 159/391 (40%), Gaps = 100/391 (25%)

Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT--ESAQKTFSFAVRDSLVNIGPL 515
           + D P++K+LR       +++       LY + ++ T  ES  ++++F V D ++++GP 
Sbjct: 336 DTDEPTSKKLRTDDEKEDEELE-----KLYSAHTSCTAKESYLRSYTFEVCDRILHVGPC 390

Query: 516 KDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCKG 549
              + G        +T + ++S+ E+V                          +LPGC  
Sbjct: 391 ASIAIG------QISTFVQEESDVEVVICSGHDKNGALSVLNKGIKPQVVASYDLPGCVD 444

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
           +WTV  K  R ++ +        +  H +LIIS +  TM+L T   +TEV E + +  Q 
Sbjct: 445 MWTV--KDIRLNDENDGDFET--ENTHKFLIISRDNLTMILRTGKEITEV-EQLGFLTQT 499

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
           +T+ AGNL     +IQV      ++      Q L               S ++  S+ DP
Sbjct: 500 KTVFAGNLDNGNCIIQVTPYEVILVSKGEKIQQLEL----------ENESPIVFCSLQDP 549

Query: 670 YVLLGMSDGSIRLL---VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------- 715
           Y+ L +  GSI +L   + D     V +     +  S+  +++C L+ D           
Sbjct: 550 YISLLLEGGSIMMLAFELSDNGEKQVKLVNTTPLNHSR--IAACCLFQDNNGRMSVSDGI 607

Query: 716 --KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI--------------------- 752
             + P P    T+  A L       ID  +   LD  D                      
Sbjct: 608 SIRTPSP----TNEPAELMEDEKFTIDDDELLYLDVNDTNLQTNDVPVASTSYTDNLERK 663

Query: 753 ---YSVVCYESGALEIFDVPNFNCVFTVDKF 780
              +  +C ++G LE++ +P+++ V+TV+ F
Sbjct: 664 VSYWLFLCLDNGKLEVYSIPSYDKVYTVNGF 694



 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 60/190 (31%), Positives = 91/190 (47%), Gaps = 19/190 (10%)

Query: 949  LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQI 1008
            L DG +  F   +  NC +GF+Y  S+  L+IC L    TYD  WPV KV PL+ T H I
Sbjct: 767  LVDGYVKCFAPFNIANCPNGFLYFNSEEDLRICVLDQRFTYDCPWPVHKV-PLRNTLHFI 825

Query: 1009 TYFAEKNLYPLIVS-VPVLKPLNQVLS---LLIDQEVGHQIDNHNLSSVDLHRTYTVEEY 1064
            T+      Y +I S + V + +  + +     I  E G +  +  +    L +  T E +
Sbjct: 826  THHFVTKTYVIISSTMTVCEKMPHITTEDKEFIPVEKGDRFIHAPVEKFCL-QLITSETW 884

Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGED 1123
            E+    PD         A I M   E+   ++ V L +  T    +  +A+GT  V GE+
Sbjct: 885  EII---PD---------AEIQMAEWEHVTCLKSVKLKSEETVSGLKEFIAVGTTNVCGEE 932

Query: 1124 VAARGRVLLF 1133
            VA RGR+++F
Sbjct: 933  VACRGRIVIF 942


>gi|340515387|gb|EGR45642.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1441

 Score =  143 bits (361), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 258/1147 (22%), Positives = 437/1147 (38%), Gaps = 178/1147 (15%)

Query: 72   RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
            +V ++   ES   G     V  +  +   L L+    L G V  LA L    +  +   +
Sbjct: 68   QVNDDDGLESSFLGGETMLVRTERTNNTKLVLITEIPLAGTVIGLARLRT--SRTASGGE 125

Query: 132  SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
             +++A++ AK+ + E+D   + L   S+H +E  E L      E F  G  V   + DP 
Sbjct: 126  VLLIAYKAAKLCMAEWDPRKNELETISIHYYEKEE-LQGAPWEEVF--GEYVNHLEADPG 182

Query: 189  GRCGGVLVYGLQMIILKASQGGSGLVG---DEDTFG---------SGGGFSARIESSHV- 235
             RC  +      + IL   +    L     DED  G         +  G S  +E+++  
Sbjct: 183  SRCAALKFGTRNLAILPFRRSEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTP 242

Query: 236  -----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
                 + L D  + H     F+H Y EP   +L   +   A   +  H +  +  L +  
Sbjct: 243  SFVLRLPLLDPSLLHPVHLTFLHEYREPTFGVLSSSQAPAASLGARDHLSYKVFTLDLQQ 302

Query: 291  TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
              +    I S   LPHD Y+++A+P+P+GG L+VG N  IH      S  +A+N  A   
Sbjct: 303  --RASTTILSVTGLPHDLYRVIALPAPVGGALLVGQNELIHVDQSGKSNGVAVNPMAKLA 360

Query: 350  DSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDL 403
             S     +S   + L+  A     ++N   LL    G L +++   DGR V     RL  
Sbjct: 361  TSFSLTDQSDLKLRLENCAIEVLAIENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVG 420

Query: 404  SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
            +    +VL S  T +   G +  F+GS   DS+++ +   S         K    D +  
Sbjct: 421  ADCGGNVLKSRATCVSRLGKNTLFVGSETSDSVVLGW---SRRQTQEKRKKSRLIDPDLA 477

Query: 461  APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                +       +      +    +      N   +     +F + D L++I P++D + 
Sbjct: 478  LEVDELDLEDDEEDDDLYGDDSVATKPQQLPNGGPAKSGDLTFRIHDVLLSIAPIQDVTC 537

Query: 521  GLR--------------INAD---ASATGISKQSNYELV------------ELPGCKGIW 551
            G                + AD   A A G  +  +  ++            E P  +G W
Sbjct: 538  GQAAFPPDSEEATLNRGVRADLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFW 597

Query: 552  TVYHKSS--RGHNADSSRMAAYDD--EYHAYLIIS------LEARTMVLETADLLTEVTE 601
            T+  K    +   A++     YD   ++  ++I++       E   +   TA     + E
Sbjct: 598  TMCVKKPVPKSLGANAGVAGDYDTPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKE 657

Query: 602  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
            +      G T+ AG +  +  VIQV +   R  +G   + Q L       +  +G+E   
Sbjct: 658  TEFEPAAGFTVEAGTMGKQMVVIQVLKSEVRCYNGDLNLIQILPM----LDEETGAEPRA 713

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
            V S SI DPY+ +   DGS+ L   D +     ++   +  +S K V+ C     KG   
Sbjct: 714  V-SASIVDPYLFIVRDDGSVFLAQIDSNNEIEEMEKTDSSLTSTKWVAGCLYKDTKG--- 769

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDK 779
              + + +D+   T   EA+                +   +GAL IF +P+ +  V+  + 
Sbjct: 770  IFQSSYSDSTKQTS--EAV-------------MMFLLNSTGALHIFALPDLSKAVYVAEG 814

Query: 780  FVSGRTHIVDTYM--REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
              S   H+   Y   R A +++ TEI                    V +L      A H+
Sbjct: 815  LSSIPPHLSAGYAARRGATRETLTEI-------------------VVADLG----DAVHA 851

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTP- 896
             P+L    +   +  Y+      P N         T+ +LS           L F ++P 
Sbjct: 852  SPYLILRHSTNDLTIYEPIRL--PAN--------ETAHTLS---------DTLFFKKSPN 892

Query: 897  -LDAYTREETPHGAPCQR-----ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
             + A +  E P     Q      + I  N+ G+   FL G  P + +     +     L 
Sbjct: 893  AVLAKSAVEDPSDDTAQPPRYVPLRICANVGGYSSVFLPGPSPAFVIKSSRSVPRVVGLQ 952

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
               +   +  H   C+ GFIY  S+GI ++ QLPS + +       K +PL      + Y
Sbjct: 953  GHGVRGMSTFHTEGCDRGFIYADSEGIARVTQLPSKTNFTELGISVKKVPLGFDVRHVAY 1012

Query: 1011 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN---LSSVDLHRTYTVEEYEVR 1067
                  Y  I    V +          + E+    D H      SV L  T       ++
Sbjct: 1013 HHPTETY--IAGCAVTE----------NFELPKDDDYHKEWARESVPLPPTAV--RGALK 1058

Query: 1068 ILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAA 1126
            ++ P      W    +I M++ E+   ++ + L  +  TKE   LLA+GTA  +GED+  
Sbjct: 1059 LINPIT----WTVIHSIDMEAGESIECMKTLHLEVSEETKERRMLLAVGTALSRGEDLPT 1114

Query: 1127 RGRVLLF 1133
            RGRV ++
Sbjct: 1115 RGRVQVY 1121


>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
            [Aspergillus kawachii IFO 4308]
          Length = 1372

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 238/1168 (20%), Positives = 461/1168 (39%), Gaps = 201/1168 (17%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +L+V   ++++IY +  +   + E  ++ +   ++L++            Y L G V  L
Sbjct: 28   DLIVVRTSLLQIYSLH-KVTSNAEGADAQQELTKLLLEK----------EYSLSGTVTGL 76

Query: 117  AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
              +      NSR   +++++AF +AK+S++E+D     +   S+H +E  +        +
Sbjct: 77   CRVK---VLNSRSGGEAVLVAFRNAKLSLIEWDPERRSISTISIHYYERDDLTRSPWVPD 133

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS------------ 222
                G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS            
Sbjct: 134  LKNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGMSTDHDL 190

Query: 223  GGG---------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
            GGG         +      S V+ L  LD  + H     F++ Y EP   IL+ +  T +
Sbjct: 191  GGGPDKAKEGIAYQTPYAPSFVLPLTALDPSILHPISLAFLYEYREPTFGILYSQVATSS 250

Query: 272  GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
              +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+N  +H
Sbjct: 251  ALLPERKDVVFYTVFTLDLEQQASTILLSVSRLPSDLFRVVALPPPVGGALLIGSNELVH 310

Query: 331  YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVL 388
                  + A+ +N ++  + S     +S  ++ L+      L +     LL   TG++ +
Sbjct: 311  IDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLSTGEMAI 370

Query: 389  LTVVYDGRVVQRLD---------LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
            +    DGR V  +          L+  N +   S  T IG+   FLGS  GDS+L+ ++C
Sbjct: 371  MKFKLDGRSVSGISVHLLPAHAGLTSMNSAAAAS--TFIGDGKIFLGSEDGDSVLLGYSC 428

Query: 440  GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNTESA 497
             S +S       ++  D  AD        +S  D  +D +     + +L G   +   SA
Sbjct: 429  SSSSSKKHRLQAKQAIDDSADMSEED---QSEDDVYEDDLYSTSPDTTLTGRRPSGESSA 485

Query: 498  QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WT 552
               + F + D L+NIGPL+D + G ++  +    G    S    +++   +G        
Sbjct: 486  FGLYDFRMHDKLINIGPLRDITIGRKLPTNQEKGGDRTNSTSPELQIVASQGSHKSGGLV 545

Query: 553  VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLLTEV 599
            V  +    H   S  + + D  + A L    EA             R  V+ T    ++ 
Sbjct: 546  VMAREIDPHVVASISLESVDSIWTASLTWEEEAVSRTSENIGQRSQRCYVIATEAKASDR 605

Query: 600  TESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-MTQD 642
             ES+ + V G                 TI  G    R+RV+QV +   R  D    +TQ 
Sbjct: 606  EESLIFVVDGHDLKPFRAPDFNPNEDVTINIGTQESRKRVVQVLKNEVRSYDIDLGLTQI 665

Query: 643  LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
                  ++     ++    +S S+AD  + +   D ++  L  D S     V     + S
Sbjct: 666  YPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVLGEDVAS 720

Query: 703  SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
             K    SC LY DK                TG+  +ID     P+ + D++  +      
Sbjct: 721  GK--WISCCLYSDK----------------TGLFSSIDRTLSEPV-KNDMFLFLLSHDSK 761

Query: 763  LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
            L ++ V +   + ++ + + G + ++                 SSE     G +EN+   
Sbjct: 762  LFVYRVRD-QKLLSIIEGLDGLSPLL-----------------SSEPPKRSGTRENLVEA 803

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
             V +L  + WSA    P+L     +  ++ Y+ ++             + T  +  +  +
Sbjct: 804  IVADLG-ETWSAS---PYLILRSENDDLIIYKPFV-------------IPTGPTGEIHTL 846

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
              S+  N        D  + + +      + + I  +ISG    F+ G+           
Sbjct: 847  KFSKENNSVLPMISPDVDSTQPSGSDYRVRPLRILPDISGLSAVFMPGAS---------- 896

Query: 943  LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ----GILKICQLPSGSTYDNYWPVQKV 998
                          F +  + +  H F+ +  +      ++ CQLP  + +D  W ++KV
Sbjct: 897  ------------AGFVLRTSASAPH-FLRLRGESPRCSTVRFCQLPPMTRFDYQWTLKKV 943

Query: 999  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHR 1057
               +   H + Y     +Y       VL   +     L  D E+  +  N  +S     R
Sbjct: 944  HLGEQVDH-LAYSTSSGMY-------VLGTCHATDFKLPDDDELHPEWRNEAISFFPSAR 995

Query: 1058 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGT 1116
                    ++++ P+     W    +  + + E  + ++ ++L  +  T E + L+ +GT
Sbjct: 996  GSF-----IKLVSPNT----WSIIDSYSLGTDEYVMAIKNISLEISENTHERKDLIVVGT 1046

Query: 1117 AYVQGEDVAARGRVLLFSTGRNADNPQN 1144
            A+ +GED+ +RG + +F   +   +P +
Sbjct: 1047 AFARGEDIPSRGCIYVFEVVQVVPDPDD 1074


>gi|378734083|gb|EHY60542.1| histone H2A [Exophiala dermatitidis NIH/UT8656]
          Length = 1361

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 238/1133 (21%), Positives = 424/1133 (37%), Gaps = 229/1133 (20%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLR 155
            S   L LV  Y L G + SL  +      NS+   D++++AF DAK+S++E+D ++H + 
Sbjct: 49   SETKLVLVAEYNLAGTITSLGRVK---IPNSKSGGDAVLVAFRDAKLSLIEWDPALHSIS 105

Query: 156  ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
              S+H +E  +   +    +       + VDP  RC         + I+   Q    L  
Sbjct: 106  TLSIHYYEHHDLQSIPWQPDLSKCVSHLTVDPSSRCAAFNFGVSNLAIIPLHQVRDELAM 165

Query: 216  DE---------DTFGSGGGFSARIES-------SHVINLRDLD--MKHVKDFIFVHGYIE 257
            DE         +     G    + +S       S V+ L  LD  + H  D  F+H Y +
Sbjct: 166  DEFDEVDGEVKERLSPDGQNENKHDSPDTPFKPSFVLPLTALDPGLLHPVDMAFLHEYRD 225

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            P + IL+      +     +    + +  ++    K    + S   LP+D Y+++A+P P
Sbjct: 226  PTVGILYSTAARSSNMNHERRDVTIYAVYALDIGQKASTALQSVQKLPNDLYRVMALPPP 285

Query: 318  IGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
            +GG L++G N  IH      + A+A+N  A    S      +++ ++L+      L N  
Sbjct: 286  VGGALLIGGNELIHIDQSGKTIAIAVNELAKEASSFPMADHANYRLKLEGCQIEHLGNPS 345

Query: 377  A--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV-------LTSDITTIGNSLFFLGS 427
               L+  KTG+L LL+   DGR+V  + L +             ++  T +G++  F+GS
Sbjct: 346  GDMLVILKTGELALLSFRMDGRMVSSMALRRVGEGQSQGLALGASTCSTNLGSNRLFIGS 405

Query: 428  RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-----LQDMVNGE 482
               DS+L+    G  T+ L                +  R++  + DA      ++    +
Sbjct: 406  EESDSILL--ATGRKTTQLRR--------------TNSRIQSQADDAGLFDDNEEDGIED 449

Query: 483  ELSLYGSASN----NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ-- 536
            E  LY   ++    N  +     +F + D L +I P+ D +        A  + +++Q  
Sbjct: 450  EDDLYAELADELNGNASTDVSGHNFRLLDRLPSIAPINDVALANVGKRRAEESEVTRQEL 509

Query: 537  ---------------------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
                                 S    ++     G+W  +   +RG         A ++ +
Sbjct: 510  AVAYGRGHAGGLAFLSRKLEPSVTRQIKFERPIGVW-CFSSGNRGQQ------GAEEENF 562

Query: 576  HAYLIISL-----EARTMVLETAD-LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
               ++IS        RT +L   D  L  + ES      G  I    L      IQV   
Sbjct: 563  DDLVMISQTTDDGAGRTKLLRLIDGDLNSMGESEFDESAGAAIGVFKLEATNHTIQVLPT 622

Query: 630  GARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPST 689
              R+ D  +    + F   + E G   + +  + VS  DPY+++   DGS+ LL  D + 
Sbjct: 623  ELRVYDAGFALSQI-FPIVDEEEG---QTARAVKVSFVDPYLVVVKDDGSMSLLKADKAG 678

Query: 690  CTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG-ADGGPLD 748
                V+ P  + +    + S TLY D           TD    T      +G A  GP+ 
Sbjct: 679  ELDEVELPENLRAWS--ILSATLYQD-----------TDDMFQT--SRFYNGTATPGPI- 722

Query: 749  QGDIYSVVCYESGALEIFDVPNFNC-VFTVDKFVSGRTHIV-DTYMREALKDSETEINSS 806
                   +  + G   +  +PN +  VF  D      TH++ D  + +  ++        
Sbjct: 723  -----LTILTQDGHFCLLSLPNVSIQVFQCDSLPFLPTHLMQDLQLPKHWRN-------- 769

Query: 807  SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
                     K+++  + + +L     ++   +P+L      G ++ Y+++          
Sbjct: 770  ---------KDDLGEVLLADLG----NSTDRQPYLVVRNLVGDVIIYESFAMP------- 809

Query: 867  SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
              D + + R   V   +A  L +             EE    +  Q +    N++GH   
Sbjct: 810  --DVLGSFRFKKVFTKAAGELED------------GEEVGQPSTLQPMQAVTNVAGHASV 855

Query: 927  FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
            F+ G +P   M     +    +L    + +   +H   C  G + V +   +K C +P  
Sbjct: 856  FIPGRQPLLIMREASTMPRVYELNPTKLKSMNSVHTGTCRQGLVLVDADDEIKFCNIPDS 915

Query: 987  STYD-NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI 1045
            +    + W +++V PL      + YFA  + Y L                          
Sbjct: 916  TVLGLSDWVIRRV-PLGQDITSVAYFAPTDSYILAT------------------------ 950

Query: 1046 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRAT--IP--MQSSENALTV------ 1095
             NH                E ++ + D     WQ  AT  +P  +QSS   L+       
Sbjct: 951  -NHT--------------TEFQLPQDDEWHPEWQGEATKFLPSSIQSSLKLLSAKTHSII 995

Query: 1096 ---------RVVTL------FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
                     RV+ L       +  T E + L+ +GTA V+GE+V  RG + +F
Sbjct: 996  SQYSFDACERVLCLESLNLEVSEETHERKDLIVVGTAIVKGENVTTRGNLYIF 1048


>gi|346319828|gb|EGX89429.1| protein CFT1 [Cordyceps militaris CM01]
          Length = 1452

 Score =  142 bits (357), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 255/1144 (22%), Positives = 435/1144 (38%), Gaps = 207/1144 (18%)

Query: 91   VLMDGISAASLELVCHYRLHGNVESLAIL-----SQGGADNSRRRDSIILAFEDAKISVL 145
            +L D      L LV    + G +  LA L     S GG       ++++LA+  AK+ + 
Sbjct: 94   LLRDRSQHTKLVLVAELPVAGTIIGLARLKLPHTSSGG-------EALLLAYRGAKMCLT 146

Query: 146  EFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMI 202
            E++     L   S+H +E  E   L+        G  V   + DP  RC         + 
Sbjct: 147  EWNPRRAALETVSIHFYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNLA 203

Query: 203  ILKASQGG--------------------SGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
            IL   Q                      + L  + D  G   G  ++   S V+ L  LD
Sbjct: 204  ILPFRQAEEDLEMEDWDEALDGPKPPKEASLATNGDANGDANGTQSQHSPSFVLRLPLLD 263

Query: 243  --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
              + H     F+H Y EP   IL   + T        H T  +  L +    +    I S
Sbjct: 264  PTLLHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHLTYKVFTLDLKQ--RASTTILS 321

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA-----VSLDSSQE 354
               LP D  +++ +P+P+GG L+VGAN  IH      +  +A+N  A      SL+   E
Sbjct: 322  VTGLPQDLSRVIPLPTPVGGALLVGANELIHIDQSGKANGVAVNPMARQMTSFSLNDQSE 381

Query: 355  LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL 411
            L   ++ +E  A     +++   LL      L +++   DGR V  + L   S+ N   L
Sbjct: 382  L---NYRLEGCAIEPVSMESGELLLILNDASLAIVSFKIDGRTVSGISLVPVSQENGGNL 438

Query: 412  ----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
                 S I+ IG S  F+GS  GDS+++ +     +   S   +++   ++A+       
Sbjct: 439  LKSHVSCISRIGKSSMFIGSEYGDSVVLGW-----SRKQSQEKRKKSRVLDAELALDVDD 493

Query: 468  RRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                     D + G E  +   S + N  +      F ++DSL+ + P+ D + G     
Sbjct: 494  IDLDDFDEDDDLYGTESTAAKPSLATNGVTKGGELIFRLQDSLLCLAPIHDVAPGKAVFP 553

Query: 522  -------LRINAD-----ASATGISKQS-----NYEL-------VELPGCKGIWTVYHK- 556
                   LR         A A G  K       N E+        E P  +G WT+  K 
Sbjct: 554  LDSEEVVLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWTMCVKK 613

Query: 557  ---SSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
                + G NA  S  + YD  E +   +I  +      ET+D+        +        
Sbjct: 614  PLPKALGSNAVVS--SEYDSMELYDRFMIVAKVDLDGYETSDVYALTDAGFESLKDTEFE 671

Query: 607  -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
               G T+ AG +  + R+IQV +   R  DG   ++Q L       +  +G+E   V+S 
Sbjct: 672  PAAGFTVMAGTMGKQMRIIQVLKSEVRCYDGDLGLSQILPM----MDEDTGAE-PRVVSA 726

Query: 665  SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
            SIADPY+++   D SI +               A I+S+ +      +  DKGP   ++ 
Sbjct: 727  SIADPYLMVIRDDNSIFI---------------AKIDSNDE---LDEVEKDKGPLASIK- 767

Query: 725  TSTDAWLSTGVGEAIDGA-DGGPLDQGDIYSVVCY---ESGALEIFDVPNFNCVFTVDKF 780
                 W +  +    DG       D+G    ++ +    +GAL I+D+ N +        
Sbjct: 768  -----WQTGCLYADHDGHFQPKQPDEGSSPRILMFLMSTTGALHIYDLDNLS-------- 814

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                      Y+ E L  S     S++  G     KE +  + V +L           P+
Sbjct: 815  -------EPVYVAEGLT-STPPFLSANFTGRKAAAKETLTEILVADLG----DVVAKSPY 862

Query: 841  LFAILTDGTILCYQAYLFEGPENTS---------KSDDPVSTSRSLSVSNVSASRLRNLR 891
            L        +  Y+   +  P ++S         K     + ++S   S+      +  R
Sbjct: 863  LILRHDTDDLTLYEPVRYHEPNSSSAPLSDTLFFKKSTNSTIAKSAPASDKEDDETQQKR 922

Query: 892  FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
            F                 P Q   +  N+ G+   FLSG  P + +   + +     L  
Sbjct: 923  F----------------VPLQ---LCANVGGYSAVFLSGDSPSFILKSAKSIPRIVGLQG 963

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
              +   +  H   C+ GFIY  ++GI ++ QLP+ + Y       K IPL    +++++ 
Sbjct: 964  QGVQGMSTFHTEGCDRGFIYADTKGIARVSQLPTDTNYAELGISVKKIPLDCDVNRVSFH 1023

Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILE 1070
            +    Y  I +    +P           E+    D H   +   ++   T+    ++++ 
Sbjct: 1024 SHTATY--IAACSTREPF----------ELPKDDDYHKEWARETVNFAPTMPRGILKLIS 1071

Query: 1071 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1129
            P      W    ++ ++S E   ++  + L  +  TKE   ++A+G+A  +GED+  RGR
Sbjct: 1072 P----AAWTVIHSLDLESCETIESMMALHLEISEETKERRMVVAVGSAICKGEDLPTRGR 1127

Query: 1130 VLLF 1133
            V +F
Sbjct: 1128 VQVF 1131


>gi|400597740|gb|EJP65470.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1444

 Score =  141 bits (356), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 255/1144 (22%), Positives = 437/1144 (38%), Gaps = 200/1144 (17%)

Query: 85   GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
            GET   +L D      L LV    + G V  LA L     ++    ++++LA+  AK+ +
Sbjct: 85   GET--LLLRDRAQNTKLVLVAEIPVAGTVIGLARLKLQNTESGG--EALLLAYRGAKMCL 140

Query: 145  LEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQM 201
             E++     L   S+H +E  E   L+        G  V   + DP  RC         +
Sbjct: 141  TEWNPQKAALDTVSIHYYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNL 197

Query: 202  IILKASQGGSGL-VGDEDTFGSG-----------GGFSARIESSH----VINLRDLD--M 243
             IL   Q    L + D D    G            G     ES H    V+ L  LD  +
Sbjct: 198  AILPFRQAEEDLEMEDWDEALDGPKPAKEAALATNGDDHETESQHSPSFVLRLPLLDPTL 257

Query: 244  KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
             H     F+H Y EP   IL   + T        H T  +  L +    +    I S   
Sbjct: 258  LHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHMTYKVFTLDLKQ--RASTTILSVTG 315

Query: 304  LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            LP D  +++ +P+P+GG L+VG N  IH      +  +A+N  A  + S     +S  + 
Sbjct: 316  LPQDLKRVIPLPTPVGGALLVGENELIHIDQSGKANGVAVNPMARQMTSFSLADQSELNY 375

Query: 363  ELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL----TS 413
             L+  A     +++   LL      L +++   DGR V  + L   S+ N   L     S
Sbjct: 376  RLEGCAIEPISMESGELLLILNDASLAIISFKIDGRTVSGISLAAVSQENGGNLLKSRVS 435

Query: 414  DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
             I+ IG +  F+GS  GDS+++ +     +   S   +++   ++ D      L     D
Sbjct: 436  CISRIGKASMFIGSESGDSVVLGW-----SRKQSQEKRKKSRALDTD------LALDVED 484

Query: 474  ALQDMVNGEELSLYGSASNNTESAQKTFS--------FAVRDSLVNIGPLKDFSYGLRIN 525
               D    E+  LYG+ S   + +Q            F ++D+L+ + P+ D + G  + 
Sbjct: 485  IDLDDDFDEDDDLYGTESAAAKPSQAGAGATKGGEPVFRLQDALLCLAPIHDVAPGKAVF 544

Query: 526  AD-----------------ASATGISKQS-----NYEL-------VELPGCKGIWTVYHK 556
                               A A G  K       N E+        E P  +G W +  K
Sbjct: 545  PSDSEEAFLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWAMCVK 604

Query: 557  SSRGHNADSSRM--AAYD--DEYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
                    SS +  + YD  ++Y  ++I++ +      ET+D+        +        
Sbjct: 605  KPVPKALGSSAVISSEYDSTEQYDRFMIVA-KVDLDGYETSDVYALTDAGFESLKDTEFE 663

Query: 607  -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
               G T+ AG +  + R++QV +   R  DG   ++Q L       +  +G+E   V+S 
Sbjct: 664  PAAGFTVMAGTMGKQMRIVQVLKSEVRCYDGDLGLSQILPM----LDEDTGAE-PRVVSA 718

Query: 665  SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
            SIADPY+++   D S+ +           + +   +E  +K         DKGP      
Sbjct: 719  SIADPYLMIIRDDNSVFI---------AKIGSNDELEEVEK---------DKGP------ 754

Query: 725  TSTDAWLSTGVGEAIDGA--DGGPLDQGDIYSVVCYES--GALEIFDVPNFNCVFTVDKF 780
              +  W +  +    DG      P D     +++   S  GAL ++D+ N +        
Sbjct: 755  LVSTKWQTGCLYTDYDGTFQAKKPDDNASPRTMMFLMSTAGALHMYDLDNLS-------- 806

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                      Y+ E L  S     S++  G     KE +  + V +L           PF
Sbjct: 807  -------EPVYVAEGLT-STPPFLSANFTGRKAAAKERLTEILVADLG----DVVSKSPF 854

Query: 841  LFAILTDGTILCYQAYLFEGPENTS---------KSDDPVSTSRSLSVSNVSASRLRNLR 891
            L        +  Y+   ++ P ++S         K     + ++S S  +      +  R
Sbjct: 855  LILRHDTDDLTLYEPVRYQEPNSSSPPLTDTLFFKKSANATIAKSASAFDKEEDETQQRR 914

Query: 892  FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
            F   PL            PC       N+ G+   FLSG  P + +   + +     L  
Sbjct: 915  F--VPLQ-----------PC------GNVGGYSTVFLSGDSPSFVLKSAKSIPRIVGLQG 955

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011
              +   +  H   C+ GFIY  ++GI ++CQLP+ + Y       K IPL    +++++ 
Sbjct: 956  QGVQGMSTFHTAGCDRGFIYADTKGIARVCQLPTDTNYAELGISVKKIPLDCDVNRVSFH 1015

Query: 1012 AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVD-LHRTYTVEEYEVRILE 1070
            +    Y  I +    +P           E+    D H   + + +    T+    ++++ 
Sbjct: 1016 SHTATY--IAACSTREPF----------ELPKDDDYHKEWAREVVSFAPTMPRGMLKLIS 1063

Query: 1071 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1129
            P      W    ++ ++S E   ++  + L  +  TKE   L+A+G+A  +GED+  RGR
Sbjct: 1064 P----AAWTVIHSLDLESCETIESMMALHLEISEETKERRMLVAVGSAICKGEDLPTRGR 1119

Query: 1130 VLLF 1133
            V +F
Sbjct: 1120 VQVF 1123


>gi|390599704|gb|EIN09100.1| hypothetical protein PUNSTDRAFT_67240 [Punctularia strigosozonata
            HHB-11173 SS5]
          Length = 1439

 Score =  140 bits (353), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 245/1112 (22%), Positives = 443/1112 (39%), Gaps = 159/1112 (14%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
            + A L LV  +RLHG V  L  +    + N    D ++++FEDAKI+VLE+ +  H L  
Sbjct: 118  TVARLRLVREHRLHGMVTGLGRIKILSSLNDGL-DRLLISFEDAKIAVLEWSEEQHDLLT 176

Query: 157  TSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
             S+H +E +P+ + L     S   G L +VDP  RC  + +      I+   Q       
Sbjct: 177  VSIHTYERAPQLMSLN---ASLFHGWL-RVDPISRCAALALPCDAFAIIPFHQTLE---- 228

Query: 216  DEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
                        A    S +++L    D  + +V D  F+ G+  P + +L +   TW G
Sbjct: 229  -----------EAPYAPSFILDLTSEVDQRIHNVVDMSFLPGFNNPTVAVLFQPTQTWTG 277

Query: 273  RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
            R++    T  +   ++    + +P+I S  NLP+D   + A  + +GGV+V+ +N+I + 
Sbjct: 278  RLTEYKDTMKLLVFTLDAVTRNYPVITSVDNLPYDCLSVHACSAAVGGVIVITSNSIIHV 337

Query: 333  SQSA-SCALALNNYAVSLDSSQELP----RSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
            SQS+   AL++N +A  +      P     ++ ++ L+ +   ++ +    L  K G + 
Sbjct: 338  SQSSRRVALSVNGWASRVTDMSLAPVQAEYATRNLALEGSRLAFVDDRTFFLFLKDGTVY 397

Query: 388  LLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
             + +  DG VV  + +      S + + +T +     F+GS  G S+L++ T        
Sbjct: 398  PVELSLDGAVVSTISMGHALAQSAIPAVVTPVTQEHIFVGSTAGTSVLLKIT-------- 449

Query: 447  SSGLKEEFGDIEADAPSTKRLRRSSSDALQDM-------VNGEELSLYGSASNNTESAQK 499
               ++EE  D  +DA +   +  + S  + D        +  +  SL    +N T  + K
Sbjct: 450  --SVEEEVEDNASDAVAAAVVDTADSMVMDDDDDIYGVSMKTDAQSLSNGHANGTHLSVK 507

Query: 500  TFS---FAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP--- 545
              S    ++ DSL   G + D S+ L  N +       +ATG      + L +  LP   
Sbjct: 508  KRSVTHLSLSDSLPGYGSISDMSFSLAKNGEKVVPELVAATGSGSMGGFTLFQRDLPART 567

Query: 546  --------GCKGIW------------TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
                    G +G+W            T Y ++     AD+  +    D   A  +     
Sbjct: 568  KRKLHAIGGGRGMWSLSLRPTVKVNGTSYERAVNPFQADNDTVVVSTDANPAPGLSRFSH 627

Query: 586  RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQ 641
            RT   E          S+   V G+TI A   F R  ++ V     R+L  DGS   + +
Sbjct: 628  RTPRTEI---------SITTRVPGQTIGAAPFFQRTAILHVMSNAIRVLEPDGSERQVIK 678

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI- 700
            DL                 +   SI DP+VL+   D +I L +G+     +  +  + + 
Sbjct: 679  DLD---------GNMARPKIRHCSICDPFVLIVREDDTIGLFIGESERGKIRRKDMSPMG 729

Query: 701  ESSKKPVSSCTLYHDKGPEPWLRKTSTDAW---LSTGVGEAIDGADG-----------GP 746
            + + + ++ C    + G      + + ++     +T   + +  AD            G 
Sbjct: 730  DKTSRYLTGCFFTDNAGVFDLRSQANGNSGADKTATSTLQGVVNADSRSQWLLLVRPQGV 789

Query: 747  LDQGDIYSVV-CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
            L+  D+  +  C      +I+ +P  + VF+V    +    + D+    AL         
Sbjct: 790  LEASDLSPIPGCRRLNEKQIWTLPKLSIVFSVRLASTLDWVLADSGDGPAL--------- 840

Query: 806  SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
             S  G    R +    + V +  +        +P L   L  G +  YQA     P   S
Sbjct: 841  -SMPGESPRRPQE---LDVEQAVIAPLGETAPQPHLLLFLRSGQLAIYQAI----PMQAS 892

Query: 866  KSDDPVS-TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
              D+ +S  S  +  + V+       R   +       ++         +T     +   
Sbjct: 893  SVDESLSRPSLGVRFAKVATRVFEIQRQDDSEKSILAEQKKISRVLIPFLTSPSPTTTFS 952

Query: 925  GFFLSGSRPCWCMVF-RERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKICQ 982
            G F +G  PCW +   R  +R+HP     S+V AFT          F+  + +G   +  
Sbjct: 953  GVFFTGDHPCWILKPDRSGIRIHPS--GHSVVHAFTSCSLWESKGDFLLYSDEGPSLLEW 1010

Query: 983  LPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVG 1042
            +P  +  +   P  + IP   +  ++T+ A      LIV+   L+   +  +   D  + 
Sbjct: 1011 MPD-TDVETELP-SRSIPQPRSYSKVTFDASTG---LIVAAAHLE--AEFATYDEDNNIV 1063

Query: 1043 HQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFN 1102
             + D+ N+ S       T+E     ++ PD     W T       ++E   +V  V L  
Sbjct: 1064 WEPDSANV-SFPRSSCSTLE-----LISPDE----WITMDGFEFANNEFVTSVESVPLET 1113

Query: 1103 TTTKE-NETLLAIGTAYVQGEDVAARGRVLLF 1133
            ++T+  ++  +A+GT   +GED+A RG   +F
Sbjct: 1114 SSTESGSKDFIAVGTTIDRGEDLAVRGTTYVF 1145


>gi|189203597|ref|XP_001938134.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187985233|gb|EDU50721.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 1407

 Score =  139 bits (351), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 168/697 (24%), Positives = 292/697 (41%), Gaps = 86/697 (12%)

Query: 57  NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NLVV   ++++I+ ++     V     + S+N+      E     L    + A L LV  
Sbjct: 28  NLVVAKNSLLQIFELKSTTTEVTPGAGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
           + L G V SLA +    A N++ + +++++AF DAK+S++E+D   + L   S+H +E+P
Sbjct: 88  FPLAGTVISLARVK---ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144

Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ----------- 208
           +      W    +   +F     +  DP  RC  +      + IL   Q           
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLVEDDYDSD 199

Query: 209 -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
             G      +   G+ G       SS V+ L +LD  + H     F+H Y EP   I+  
Sbjct: 200 ADGPKETKADQANGTNGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 259

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
              T    ++ +      S  ++    K    + S   LP+D  K++ +PSPIGG L+VG
Sbjct: 260 SRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALLVG 319

Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
            N  IH      +  +A+N +A +  S     +S  ++ L+      L  +    L+   
Sbjct: 320 GNEIIHVDQGGKTNGVAVNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIVLN 379

Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
            G L++LT   DGR V  + +                S  + +G    F+GS  G+S+++
Sbjct: 380 NGRLLILTFTLDGRTVSGMTIQTVAADHGGHLLKSAASCTSNLGRGRLFIGSEDGESVML 439

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
            +T       L++ L+ +  + + D            D   D+ N    +++  +A+ + 
Sbjct: 440 GWTG------LTNQLRRKLSNADLDG-EDDSEEEEIDDMEDDLYNDTAPTMHKITAAVSE 492

Query: 495 ESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL- 541
            +A  T++F + D L +I P+KD            + G + ++    A G     + EL 
Sbjct: 493 PTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDRELH 552

Query: 542 ------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EART 587
                  ELP   G+W V+ +       +     D+    A D +Y  YL++S   E  T
Sbjct: 553 PISVATKELPSAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGEDGT 612

Query: 588 MVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
           +V E   D LTE  +      +G T+  G L    +V+QV     RI D       +   
Sbjct: 613 VVYEVNGDKLTETDKGDFEREEGTTLLVGILAAGTKVVQVMRTEVRIYDSELNLVHIQSM 672

Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
               E GS  E + +++ S ADPY+L+   D S+++ 
Sbjct: 673 EEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708



 Score = 67.0 bits (162), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 63/268 (23%), Positives = 116/268 (43%), Gaps = 34/268 (12%)

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPC 934
            S S S++    LR ++ S+  +  Y  +        +   +   ++ G+   F  G+ P 
Sbjct: 831  SRSASDLWTKNLRWVKLSQQHVPRYIEDNGSEDPGFESTLVALDDVCGYSTVFQRGTTPA 890

Query: 935  WCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            +  +F+E     P+   L    + + T  H   C  GF Y+ S   L+ICQLP  + Y +
Sbjct: 891  F--IFKEASSA-PRVIGLSGKPVKSLTSFHTSKCQRGFAYLDSTDTLRICQLPPQTHYGH 947

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQID----- 1046
                 + +P+ +  H +TY     LY  IV                 Q   +Q+D     
Sbjct: 948  LGWATRRMPMDSEVHALTYHP-SGLY--IVGT--------------GQTEDYQLDPTETY 990

Query: 1047 NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTT 1105
            +++L   DL    ++E   V++L+       W    T  +   E  L+++ + L  +  T
Sbjct: 991  HYDLPKEDLTFKPSIERGVVKLLDEKS----WTIIDTHILDPQEIVLSIKTLNLEVSEIT 1046

Query: 1106 KENETLLAIGTAYVQGEDVAARGRVLLF 1133
             + + L+A+GT+ V GED+A +G + +F
Sbjct: 1047 HQRKDLIAVGTSVVHGEDLATKGCIRIF 1074


>gi|409046890|gb|EKM56369.1| hypothetical protein PHACADRAFT_93103 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1417

 Score =  139 bits (351), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 231/1099 (21%), Positives = 446/1099 (40%), Gaps = 144/1099 (13%)

Query: 103  LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
             V  +RLHG V   ES+ I+S          D ++++F+DAKI++LE+ D+++ L   S+
Sbjct: 120  FVREHRLHGTVTGMESIRIVSS----QEDGLDRLLVSFKDAKIALLEWSDAVNDLLTVSI 175

Query: 160  HCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGVLVYGLQMIILKASQGGSGL- 213
            H +E +P+ + L+         PL    ++ DP  RC  +++    + IL   Q  + L 
Sbjct: 176  HTYERAPQMMALE--------APLFHSQLRTDPLSRCAALMLPKDSLAILPFYQSQADLD 227

Query: 214  VGDEDTFGSGGGFSARIESSHVINLR-DLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
            + ++DT  S          S V+++  D+D  +KHV D +F+ G+  P + +L +   TW
Sbjct: 228  IMEQDTQTSCRDIP--YSPSFVLDMTTDVDERIKHVIDLVFLPGFNSPTIAVLFQNTQTW 285

Query: 271  AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
              R+     T  +   ++    +  P++ +  NLP+D   L+   + +GGV++V AN + 
Sbjct: 286  TSRLREYKDTVGLIIFTLDLVTRNCPVLTAVDNLPYDCLYLVPCSAQLGGVVIVSANALI 345

Query: 331  YHSQ-SASCALALNNYAVSLDSSQELPR-----SSFSVELDAAHATWLQNDVALLSTKTG 384
            Y +Q S    L +N +   + S   LP+      S +++L+ ++A ++ ++   +    G
Sbjct: 346  YVAQTSRRVILPVNGWQARV-SDHPLPQLTEEEKSRNLKLEGSYAVFVDDNKLFVLLSDG 404

Query: 385  DLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
             +  + V  DGR V RL + S    + + + +  + +   F+GS  G S+L++      T
Sbjct: 405  TVYPMEVHADGRTVSRLTMGSALAQTTIPAIVRRVTDENLFIGSTAGPSVLLK------T 458

Query: 444  SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
            S +   +KEE  +++  AP+      +  D   D  +GE       A   T         
Sbjct: 459  SHVEEDVKEEDVEMDT-APAAVVDEANEMDLDDD--DGELCHWVHFAKKRT-----VVHL 510

Query: 504  AVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP---------- 545
            ++ DS+   GP+ D ++ L    D       +ATG      + L +  LP          
Sbjct: 511  SLCDSIPAYGPVSDMTFSLTRVGDRPVAELVAATGSGGLGGFTLFQRDLPSRVKRKLHAV 570

Query: 546  -GCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTE 601
             G +G+W++   ++ R + +   R +      +  +IIS +A     +   A   ++   
Sbjct: 571  GGGRGMWSLAVRQAVRVNGSTYERPSNPHHGGNDAVIISTDANPSPGLSRIASRSSKSDI 630

Query: 602  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQDLSFGPSNSESGSGSE 657
             +   + G T+ A + F    ++ V     R+L  DG+   + +DL              
Sbjct: 631  QITTRIPGTTVGAASFFQGTAILHVMSNAIRVLEPDGTERQIIKDLD---------GSVP 681

Query: 658  NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDK 716
               +   S+ DP++++   D S+ L +G+P    +  +  + + E + K ++ C  + D 
Sbjct: 682  RPKIRYCSMCDPFIMVIREDDSLGLFIGEPERGKIRRKDMSPMGEKTSKYIAGC-FFMDT 740

Query: 717  GPEPWLRKTSTDAWLSTGVGEAIDGA-DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
                  R  +  A     V   +    + G   Q   + ++    G LE++ +P    VF
Sbjct: 741  TGIFQSRVNAAAAAADKNVTSTLQTVMNAGTRTQ---WLLLVRPQGVLEVWSLPKLALVF 797

Query: 776  TVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
            +     +  + +VD+    AL                Q        + V ++A+      
Sbjct: 798  STSHVSALESVLVDSGDSPAL-------------SLPQDPPRKPQDLDVEQIAIAPLGES 844

Query: 836  HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
             S+ +L   L  G    Y+A     P   S    P + + +L V  V        +    
Sbjct: 845  SSKLYLLVFLRCGLFAVYEAL----PAPASTDPPPPTRTSTLCVKFV--------KVVTR 892

Query: 896  PLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLSGSRPCWCM-VFRERLR 944
              D    EE       ++  I + +              G FL+G RPCW +   +  ++
Sbjct: 893  AFDIQQSEEVEKSVLAEQKRISRQLIPFVTSPTPGRAFSGVFLTGDRPCWILSTDKGGVK 952

Query: 945  VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKAT 1004
            + P      + AFT          F+  + +G   I  +P    ++ + P + +      
Sbjct: 953  IMPS-GHQVVHAFTACSLWESKGDFLLYSDEGPSLIEWVPE-IQFEGHLPSRSI----PR 1006

Query: 1005 PHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY 1064
            P   ++   +    L+V+   L+  +   S   D+ V  + D  N+S          E  
Sbjct: 1007 PRPYSHVVFEPTTTLLVAASSLQ--STFTSYDEDRNVVWEPDEPNMS------LPVCETS 1058

Query: 1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGED 1123
             + ++ PD     W T        +E    +  +TL   +T+   +  +A+ T   +GED
Sbjct: 1059 ALELISPDT----WTTMDGYEFAQNEFVTCMECITLETLSTETGTKDFVAVSTTINRGED 1114

Query: 1124 VAARGRVLLFSTGRNADNP 1142
            +A +G V +F       +P
Sbjct: 1115 LAVKGAVYIFEVVEVVPDP 1133


>gi|342877552|gb|EGU79002.1| hypothetical protein FOXB_10431 [Fusarium oxysporum Fo5176]
          Length = 1399

 Score =  137 bits (346), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 245/1126 (21%), Positives = 418/1126 (37%), Gaps = 161/1126 (14%)

Query: 69   YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GG 123
            Y  R  ++   ES   G     V  D  +   L LV    L G V  LA +       GG
Sbjct: 65   YDHRANDDDGLESSFLGGESMLVRTDRTNLTKLVLVAELPLSGTVTGLAKVKTKHSKCGG 124

Query: 124  ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPL 182
                   +++++A++ AK+ +  +D     L   S+H +E  E LH      SF      
Sbjct: 125  -------EALLIAYKAAKLCMAVWDPEKSNLETISIHYYEKEE-LHGAPWEVSFDEYTNY 176

Query: 183  VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
            ++ DP  RC         + IL   Q    L  D+      G    + ES+ V N     
Sbjct: 177  LEADPGSRCAAFQFGSRNLAILPFRQAEEDLEMDDWDEDLDGPRPVK-ESTTVANGDSDT 235

Query: 243  MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
            ++            EP   IL   +          H T  +  L +    +    I S  
Sbjct: 236  LEPA----------EPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ--RASTTILSVT 283

Query: 303  NLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS 361
            +LP D +K++ +P+P+GG L++G N  IH      S  +A+N+ A  + S     ++  +
Sbjct: 284  DLPRDLFKIIPLPAPVGGSLLIGENELIHVDQSGKSNGVAVNSMARQITSFSLTDQADLN 343

Query: 362  VELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSDI 415
            + L+        ++N   LL    G + ++T   DGR V     R+   +   +++ S  
Sbjct: 344  LRLEHCVIETLSIENGELLLVLNDGRIGIVTFQIDGRTVSGLTVRMVADENGGNLIKSRA 403

Query: 416  TT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
            +T   +G + +F+GS +GDS+++ +T   G        K    D E              
Sbjct: 404  STASKLGKNAYFVGSEVGDSVVLGWTRKMGQEKRR---KPRLIDAEIGLEMDDLDLEDED 460

Query: 473  DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-LRINADASAT 531
            D   D+   E  +   + + N        SF + D+L++I P+KD + G +  + D+   
Sbjct: 461  DEDDDLYGTESAAAKPAQALNGGGKTGELSFRIHDTLLSIAPIKDLTPGKVSFHPDSEEA 520

Query: 532  GISKQSNYEL----------------------------VELPGCKGIWTVYHK----SSR 559
             +S+    +L                             E P  +  WT+  K     + 
Sbjct: 521  TLSQGVVSDLHLACVVGRGKAGSLAILNRNIQPKIIGRFEFPEARDFWTMSVKKPMPKAL 580

Query: 560  GHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT------EVTESVDY-FVQGRTI 612
            G N           ++  Y+I++ +      ET+D+        E  +  ++    G T+
Sbjct: 581  GGNVGMGNEYETFGQHDKYMIVA-KVDLDGYETSDVYALTGAGFETLKDTEFDPAAGFTV 639

Query: 613  AAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
             AG +  + R+IQV +   R  DG   +TQ L     + E+G+      V S SIADPY+
Sbjct: 640  EAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRVTSASIADPYL 694

Query: 672  LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
            LL   D S+ L   D +     V+   A   + K  S C     KG     +  ++D   
Sbjct: 695  LLIRDDSSLMLAQIDSNNELEEVEKMDATLQNTKWHSGCLYADTKGA---FQPNASDKGA 751

Query: 732  STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHI-VD 789
             T                  I   +   +GAL ++ +P+ +  V+  +       H+  D
Sbjct: 752  ET----------------EKIMMFLLSSTGALHVYALPDLSKPVYVAEGLCYVPPHLSAD 795

Query: 790  TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGT 849
              +R  L                   KEN+  + V +L           P+L        
Sbjct: 796  YTLRRGLA------------------KENLREILVADLG----DTTSQSPYLILRNQTDD 833

Query: 850  ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
            +  Y+      P    +     S S +L+    S + L  +       D     E P   
Sbjct: 834  LTIYE------PLRHVRDGGETSLSATLTFKKTSNTTLATIPVETEQDDV----EQPRFV 883

Query: 910  PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGF 969
            P +      NI+G+   FL G  P + +   + +     L    +   +  H   C+ GF
Sbjct: 884  PLRPCA---NINGYSTVFLPGPSPSFVIKSSKSIPRVIGLQGLGVRGMSTFHTEGCDRGF 940

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
            IY   +GI ++ QLP  + +       K +PL A    I Y      Y  I    + +P 
Sbjct: 941  IYADDKGIARVTQLPPDTNFTELGISVKKVPLGADVRGIAYHQPTGAY--IAGCMISEPF 998

Query: 1030 NQVLSLLIDQEVGHQIDNHN-LSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1088
                      E+    D H   +   L    T+    ++++ P      W     + ++S
Sbjct: 999  ----------ELPKDDDYHKEWAKETLTFPPTMPRGVLKLISPVS----WTVIHEVELES 1044

Query: 1089 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
             E+   ++ + L  +  TKE   L+ +GTA  +GED+  RGRV +F
Sbjct: 1045 CESIECMKTLHLEVSEDTKERRFLVTVGTAVSKGEDLPIRGRVHVF 1090


>gi|353234640|emb|CCA66663.1| related to cleavage and polyadenylation specificity factor, 160 kDa
            subunit [Piriformospora indica DSM 11827]
          Length = 1324

 Score =  137 bits (345), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 252/1134 (22%), Positives = 454/1134 (40%), Gaps = 194/1134 (17%)

Query: 55   VPNLVVTAANVIEIYVVR---VQEEGSKES--KNSGETKRRVLMDGISAASLELVCHYRL 109
            V NLVV   N + IY VR     EE   ES  K+SG ++         +  L LV  + L
Sbjct: 36   VTNLVVGRNNRLRIYDVRRTIYTEETHVESDLKSSGPSRH--------SHRLCLVREHLL 87

Query: 110  HGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
            HG +  LA +           D ++++F+D+K++++E+ ++++ +   S+H +E    L 
Sbjct: 88   HGIIIGLAAVRTANPGLGSP-DRLLVSFQDSKLALMEWSNTLYDISTVSIHSYERSPLLL 146

Query: 170  LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL-VGDEDTFGSGGGFSA 228
                 E  A    ++ DP  RC  +++    + +L   Q  + L V D        G  +
Sbjct: 147  NSDFTECRA---YLRTDPANRCAALVMPRDNIALLPWYQPQTELDVQD--------GIQS 195

Query: 229  RIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTC 281
              E      S+V N+  +D  ++++ D +F+ G+  P + IL + + TW GR+       
Sbjct: 196  IAEELPYSPSYVTNVSAMDERIRNILDLVFLPGFNVPTIAILFQEQRTWTGRLKENKDNT 255

Query: 282  MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCAL 340
             +  +S+    + + +I +   LP+D+  +    + +GGVLVV AN+I H    S    L
Sbjct: 256  SLFFISLDLVSRSYQVIATIEKLPYDSLYMSPCHAKLGGVLVVTANSILHVDQASKITTL 315

Query: 341  ALNNYAVSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
             ++ +A  + D+S     +   + L+ +   ++ +   +LS   G  + + + ++GR V 
Sbjct: 316  PMSGWAARVSDTSHGFQDAVDDIHLEGSRMGYISDSQVILSLSNGKCLHIRIDHEGRTVW 375

Query: 400  RLDLSKT-----NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF 454
             L    T      PSVL +      + L FLGS  GDS+L ++                 
Sbjct: 376  GLTAVHTFGISSPPSVLIAK-----DGLVFLGSTAGDSVLFEYA---------------- 414

Query: 455  GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
                                 QD+ +  +  L     N +E+   +FS    D+L + G 
Sbjct: 415  ---------------------QDLSSHRDFML----PNASETIPTSFSLLPVDNLQDSGS 449

Query: 515  LKDFS-YGLRINADA---SATGISKQSNYELV----------ELP---GCKGIWTVYHKS 557
                S +GLR + +    +A G+     +  V          +LP   G +GIW     S
Sbjct: 450  YTAASFFGLRGSEEPALIAANGLDDLGGFSTVHKTMPLRLRKKLPAIAGRQGIW-----S 504

Query: 558  SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TIA 613
             R H  +   +    +      ++S +A T     + + T+    +D  +  R    TIA
Sbjct: 505  MRVHQGNGIELPLGHNT-----LLSTDA-TPTPGASRIATKSQARLDINITTRIPMLTIA 558

Query: 614  AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
                F    ++QV     R+L     T D S      +  + +  + +   +I DPYVL+
Sbjct: 559  VAPFFDGTHLLQVTSNSLRLL-----TTDGSEKQVIPDRDNSTARARIRHAAICDPYVLI 613

Query: 674  GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-----KGPEPWLRKTSTD 728
               D ++ L VG+P+   +  +  + +   K    + T Y D     K  E  +R+T   
Sbjct: 614  LREDDTLGLFVGEPTRGKLRRKDMSPLGDKKLCYWAATFYDDLTGRLKIDEDLMRETK-- 671

Query: 729  AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
                  VG           ++G+ +  +C  +G LEI+ +P    VF V     G +   
Sbjct: 672  -----AVG-----------NRGEKWLALCRSTGTLEIWSLPKLALVF-VSSISLGPS--- 711

Query: 789  DTYMREALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
                   LK D + E++S+++     G    +  + + +L     S H     L  +   
Sbjct: 712  ------VLKHDQKKEVDSATKTELPVG-ATTLQQVIITDLGEIEPSPH-----LIVLYDS 759

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVS-ASRLRNLRFSRTPLDAYTREETP 906
              ++ YQ      P    K+  P    RS+    +S   R+ +   + TP +  T   + 
Sbjct: 760  NLLIVYQMV----PLEPDKAGLPQLDRRSVPSLRISFVKRMVHHLANPTPDENQTSGGSN 815

Query: 907  HGAPCQRITIFKNISGH----QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHN 962
                 + I  F  +        G F++G  P W +       +H      ++ +FT    
Sbjct: 816  EKRLPKTIVPFSVLDWEGNSIYGAFVTGDNPAWILSKNHSGLLHLPCGYEAVHSFTPCSM 875

Query: 963  VNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
             + +  F+  T +G   + Q   G T+   +P  K    +      T  A  N   L+V+
Sbjct: 876  WDFSPTFLMSTEEGSC-LVQWTPGITFHGQYPCSKTRKGRTQ----TNIAYSNTTGLLVA 930

Query: 1023 VPVLKPLNQVLSLLIDQEVGH--QIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT 1080
                   N    LL D+E  +  + D  N+S   L  +       + +L+P+     W T
Sbjct: 931  ASS----NDRDFLLFDEEGTNSWEPDGVNVSLPKLGAS------ALELLDPET----WVT 976

Query: 1081 RATIPMQSSENALTVRVVTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLF 1133
                   ++E    V  V L   +T+  N+  +A+GT+  +GED+A RG   +F
Sbjct: 977  IDGYEFAANEVVNIVESVKLETLSTQTGNKEFIAVGTSIHRGEDLAVRGGTYIF 1030


>gi|426235955|ref|XP_004011942.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Ovis aries]
          Length = 819

 Score =  137 bits (344), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 112/384 (29%), Positives = 176/384 (45%), Gaps = 54/384 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T  +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAK--------------ISVLEFDDSIHGLRITSMHCF 162
           A +   GA    +RD+++L+F+DAK              +    FD +   +  TSM   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTM 140

Query: 163 ESPEWL----------------HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           E P +L                 L+    S  R    K +P  +   V         ++ 
Sbjct: 141 E-PGYLFLGSRLGNSLLLKYTEKLQEPPASTTREAADKEEPPSKKKRVDATTGWAGRVRE 199

Query: 207 SQGGSGLVGDEDTFGSGG---------GFSARIESSH---VINLRDLDMK--HVKDFIFV 252
            +     V + + +GS            F  R  S     +I++R LD K  ++ D  F+
Sbjct: 200 GELPQDEVDEIEVYGSEAQSGTQLATYSFEVRWGSEWLPGIIDVRALDEKLLNIVDLQFL 259

Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
           HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K HP+IWS  +LP D  + L
Sbjct: 260 HGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQAL 319

Query: 313 AVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
           AVP PIGGV++   N++ Y +QS     +ALN+      +     +    + LD A A +
Sbjct: 320 AVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAF 379

Query: 372 LQNDVALLSTKTGDLVLLTVVYDG 395
           +  D  ++S K G++ +LT++ DG
Sbjct: 380 ISYDKMVISLKGGEIYVLTLITDG 403



 Score = 87.0 bits (214), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 64/254 (25%), Positives = 108/254 (42%), Gaps = 21/254 (8%)

Query: 907  HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
            +G    R+ + K    H   F+ G  P W +V  R  LR+HP   DG I +F   HN+NC
Sbjct: 486  YGGRHHRLALHKPPLHH--VFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINC 543

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV 1025
              GF+Y   QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S   
Sbjct: 544  PRGFLYFNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST 602

Query: 1026 LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIP 1085
              P  +V  +      G + +   +   + +     E + ++++ P      W+      
Sbjct: 603  --PCTRVPRM-----TGEEKEFETIERDERYVHPQQEAFCIQLISPVS----WEAIPNAR 651

Query: 1086 MQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1145
            ++  E              ++ +   +  G+   +GE+V  RGR+L+         P   
Sbjct: 652  IELEEXXXXX------XXGSRGHVYSVPAGSCLKEGEEVTCRGRILIMDVIEVVPEPGQP 705

Query: 1146 VLSGSYGPLFSSVQ 1159
            +    +  L+   Q
Sbjct: 706  LTKNKFKVLYEKEQ 719



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/160 (33%), Positives = 76/160 (47%), Gaps = 22/160 (13%)

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDL-VLLTVVYDG-RVVQRLDLSKTNPSVLTSDI 415
           S  SV+L  A     + D  LLS K      +LT++ DG R V+     K   SVLT+ +
Sbjct: 83  SMASVQLAGA-----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSM 137

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL-------- 467
            T+     FLGSRLG+SLL+++T         +    E  D E      KR+        
Sbjct: 138 VTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTTREAADKEEPPSKKKRVDATTGWAG 195

Query: 468 RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVR 506
           R    +  QD V+  E+ +YGS A + T+ A  T+SF VR
Sbjct: 196 RVREGELPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVR 231


>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
          Length = 1259

 Score =  137 bits (344), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 164/666 (24%), Positives = 278/666 (41%), Gaps = 103/666 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV--- 113
           N+V TA N ++IY + +                        +A L L   Y+LHG +   
Sbjct: 30  NIVTTANNTLKIYEIDIDS-------------------NTPSAKLILRREYQLHGEIIGI 70

Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
           +S+ ILS         +D +++AF DAKI++LE+ D I+ +   S+H +E  + + + + 
Sbjct: 71  QSIKILST----TEDGKDRLLIAFRDAKIALLEWSDEINDIVTVSIHTYERSQQV-ISQD 125

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
              F    +++ DP+ RC  +L+    + IL      + L  D D   S          S
Sbjct: 126 MSRFK--AILRSDPENRCSALLLPDDSLAILPVHSAHAEL-EDLDQDVSNAIKDVPYAPS 182

Query: 234 HVINLR--DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++ L+  D D+ +V D+ F+ G+  P + +L E   TW GR+S    TC +  L++   
Sbjct: 183 FILPLKSIDSDICNVIDYTFLPGFHNPTLAVLCEPRQTWTGRLSDSQDTCQVFFLTLDLV 242

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
            + +P+I +  NLP+D+  L A P  IGGV ++ AN  IH          A N +A +L 
Sbjct: 243 TQVYPIIATVDNLPYDSMSLKAAPKEIGGVAILSANAIIHVDQNGRPVGRATNGWA-TLT 301

Query: 351 SSQEL--PRSSFSVELDAAHATWLQ------NDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
           S++    P     V L+ A   +LQ      +  ALL    G++  +    +GR + R+D
Sbjct: 302 SARNFDAPPKDLFVRLEGASIEFLQPKSKQTHPQALLFLPNGEIHAVQFYREGRTISRID 361

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
           +SK         I +    L   G  L     V      GTS L   + +   D+E    
Sbjct: 362 ISK---PFAKGSIPSGAYRLDIDGQGLSGGQFVFIPSMVGTSFLIR-VGKSLNDLEL--- 414

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS------FAVRDSLVNIGPLK 516
              +  +  + A  DM    +  LYGS+    +  ++         F + D + + GP++
Sbjct: 415 -FPKQEKVGTTAYDDMDVDVDEELYGSSDKKADEKEEEEEISSEPPFTICDYIESYGPIQ 473

Query: 517 DFSYG---------LRINADASA------TGISKQSNYE---LVELPGCKGIWTVYHKSS 558
           D + G         L+I A   A      T   ++  +E    +++ G  G+WT ++ + 
Sbjct: 474 DITIGRYMQTRNSPLQILAATGAGHVGGITAFHQEVPFESKHKLDVQGNHGLWT-FNVTG 532

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
            G+      + A D +    +   L +  + L   D   EV           TIAA    
Sbjct: 533 VGN-----VLVATDSKSKTKISKLLPSNEVALIAED--NEV-----------TIAADTAA 574

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
              R++ +     ++L    + Q               EN  V   SI+DPY+L   S+G
Sbjct: 575 NSTRILMITSNAIKVLKEDGIEQ----------QSLQIENGEVQRASISDPYILTLQSNG 624

Query: 679 SIRLLV 684
           SI L +
Sbjct: 625 SISLFI 630


>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
          Length = 1383

 Score =  136 bits (343), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 236/1168 (20%), Positives = 459/1168 (39%), Gaps = 190/1168 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +L+V   ++++IY +  +     E  ++ +   ++L++            Y L G V  L
Sbjct: 28   DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76

Query: 117  A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                 + S+ G +      ++++AF +AK+S++E+D    G+   S+H +E  +      
Sbjct: 77   CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130

Query: 173  GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
              +    G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS         
Sbjct: 131  VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187

Query: 223  ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
               GGG           +      S V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 188  HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247

Query: 267  ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
              T +  +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+
Sbjct: 248  VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307

Query: 327  NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
            N  +H      + A+ +N ++  + S     +S  ++ L+      L +     LL   T
Sbjct: 308  NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367

Query: 384  GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
            G++ ++    DGR V  + +         + I       T IG+   FLGS  GDS+L+ 
Sbjct: 368  GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427

Query: 437  FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
            ++  S ++       ++  D  AD        +S  D  +D +     + +L G   +  
Sbjct: 428  YSYSSSSTKKHRLQAKQVIDDSADMSEED---QSDDDVYEDDLYSTSPDTTLTGRRPSGE 484

Query: 495  ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
             SA   + F + D L+NIGPL+D + G R++ +   TG    S    +++   +G     
Sbjct: 485  SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544

Query: 551  -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
               V  +    H   S  + + D  + A L    EA             R  V+ T    
Sbjct: 545  GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604

Query: 597  TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
            ++  ES+ + V G                 TI+ G    R+RV+QV +   R  D    +
Sbjct: 605  SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664

Query: 640  TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
            TQ       ++     ++    +S S+AD  + +   D ++  L  D S     V     
Sbjct: 665  TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719

Query: 700  IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
            + S K    SC LY DK                TG+  +ID     P+ + D++  +   
Sbjct: 720  VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760

Query: 760  SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
               L ++ V +   + ++ +   G + ++                 SSE     G +EN+
Sbjct: 761  DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
                V +L  + WSA    P+L        ++ Y+ ++             VST     +
Sbjct: 803  IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
             ++  S+  N    R P    + + +      + + I  +ISG    F+ G+   + +  
Sbjct: 846  HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFII-- 903

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
                         S   F  L   N        +    ++ C+LP  + +D  W +++V 
Sbjct: 904  ---------RTSASAPHFLRLRGEN--------SRSSTVRFCKLPPMTRFDYQWTLKRVH 946

Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQV-LSLLIDQEVGHQIDNHNLSSVDLHR- 1057
              +   H + Y     +Y       VL   +     L  D E+  +  N  +S     R 
Sbjct: 947  LGEQVDH-LAYSTSSGMY-------VLGTCHATDFKLPEDDELHPEWRNEAISFFPSARG 998

Query: 1058 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGT 1116
            ++    ++  +   D     +    +  + + E  + ++ ++L  +  T E + ++ +GT
Sbjct: 999  SFIKLVWDHHLQRQDSVILIFHLH-SFSLGADEYVMAIKNISLEVSENTHERKDMIVVGT 1057

Query: 1117 AYVQGEDVAARGRVLLFSTGRNADNPQN 1144
            A+ +GED+ +RG + +F   +   +P +
Sbjct: 1058 AFARGEDIPSRGCIYVFEVVQVVPDPDH 1085


>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus fumigatus A1163]
          Length = 1401

 Score =  136 bits (343), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)

Query: 57  NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV   +V++I+ +++VQ     E+  +   +     D +    L L   Y L G V  
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83

Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
           +     + S+ G +      +++LAF +AK+S++E+D   HG+   S+H +E  +     
Sbjct: 84  ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
              +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D+  F          
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196

Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
               G G  ++          SS V+ L  LD  + H     F++ Y EP   IL+ +  
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
           T    +S +  +   +  ++    +    + S   LP D +K++A+P P+GG L++G+N 
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316

Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
            +H      + A+ +N +A  + +   + +S  ++ L+      + +     LL   +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376

Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
           +VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
                       +    ++  D        +S  DA + D+   E E    G   +   +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492

Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
               + F   D L NIGPL+D + G   +   +   + K +  EL EL   +G       
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551

Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
                                 +WT       G     D  ++   + EY  Y+I+S + 
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607

Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
            T   ET+++    T+ +  F           TI  G L  ++RV+QV     R    SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663

Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
              D+  G     P   E    S+    +S S+ADPY+ +   D ++ +L  D S     
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718

Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
           V+   A  + K    SC LY DK 
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740



 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/235 (22%), Positives = 100/235 (42%), Gaps = 19/235 (8%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
            + I  NIS     F+ G RP   ++   +   H     G  V   +   L + + + GFI
Sbjct: 884  LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942

Query: 971  YVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1030
            YV S+ +L+IC+ PS + +D  W ++K+   +   H + Y      Y L  S       +
Sbjct: 943  YVDSKDVLRICRFPSDTLFDYTWALRKISIGEQVDH-LAYATSSETYVLGTS------HS 995

Query: 1031 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1090
                L  D E+     N  L    L     + +  ++++ P      W    +  +   E
Sbjct: 996  ADFKLPDDDELHPDWRNEGLVISFLPE---LRQCSLKVVSPRT----WTVIDSYSLGPDE 1048

Query: 1091 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
              + V+ + L  +  T E   ++ +GTA+ +GED+ +RG + +F   +   +P+ 
Sbjct: 1049 YVMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIYVFEVIKVVPDPEK 1103


>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus fumigatus Af293]
 gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
           protein 1
 gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus fumigatus Af293]
          Length = 1401

 Score =  136 bits (342), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)

Query: 57  NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV   +V++I+ +++VQ     E+  +   +     D +    L L   Y L G V  
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83

Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
           +     + S+ G +      +++LAF +AK+S++E+D   HG+   S+H +E  +     
Sbjct: 84  ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
              +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D+  F          
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196

Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
               G G  ++          SS V+ L  LD  + H     F++ Y EP   IL+ +  
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
           T    +S +  +   +  ++    +    + S   LP D +K++A+P P+GG L++G+N 
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316

Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
            +H      + A+ +N +A  + +   + +S  ++ L+      + +     LL   +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376

Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
           +VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
                       +    ++  D        +S  DA + D+   E E    G   +   +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492

Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
               + F   D L NIGPL+D + G   +   +   + K +  EL EL   +G       
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551

Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
                                 +WT       G     D  ++   + EY  Y+I+S + 
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607

Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
            T   ET+++    T+ +  F           TI  G L  ++RV+QV     R    SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663

Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
              D+  G     P   E    S+    +S S+ADPY+ +   D ++ +L  D S     
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718

Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
           V+   A  + K    SC LY DK 
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740



 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 54/235 (22%), Positives = 100/235 (42%), Gaps = 19/235 (8%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
            + I  NIS     F+ G RP   ++   +   H     G  V   +   L + + + GFI
Sbjct: 884  LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942

Query: 971  YVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1030
            YV S+ +L+IC+ PS + +D  W ++K+   +   H + Y      Y L  S       +
Sbjct: 943  YVDSKDVLRICRFPSETLFDYTWALRKISIGEQVDH-LAYATSSETYVLGTS------HS 995

Query: 1031 QVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1090
                L  D E+     N  L    L     + +  ++++ P      W    +  +   E
Sbjct: 996  ADFKLPDDDELHPDWRNEGLVISFLPE---LRQCSLKVVSPRT----WTVIDSYSLGPDE 1048

Query: 1091 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
              + V+ + L  +  T E   ++ +GTA+ +GED+ +RG + +F   +   +P+ 
Sbjct: 1049 YVMAVKNMDLEVSENTHERRNMIVVGTAFARGEDIPSRGCIYVFEVIKVVPDPEK 1103


>gi|330919204|ref|XP_003298516.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
 gi|311328242|gb|EFQ93393.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
          Length = 1388

 Score =  135 bits (340), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 170/699 (24%), Positives = 293/699 (41%), Gaps = 90/699 (12%)

Query: 57  NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NLVV   ++++I+ ++     V     + S+N+      E     L    + A L LV  
Sbjct: 28  NLVVAKNSLLQIFELKSTTTEVTPGSGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
           + L G V SLA +    A N++ + +++++AF DAK+S++E+D   + L   S+H +E+P
Sbjct: 88  FPLAGTVISLARVK---ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144

Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-- 217
           +      W    +   +F     +  DP  RC  +      + IL   Q    LV D+  
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQ--RDLVEDDYD 197

Query: 218 ------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVIL 263
                           + G       SS V+ L +LD  + H     F+H Y EP   I+
Sbjct: 198 SDAEVPKETKADQANDTSGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIV 257

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
                T    ++ +      S  ++    K    + S   LP+D  K++ +PSPIGG L+
Sbjct: 258 AASRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALL 317

Query: 324 VGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLS 380
           VG N  IH      +  +ALN +A +  S     +S  ++ L+      L  +    L+ 
Sbjct: 318 VGRNEIIHVDQGGKTNGVALNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIV 377

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSL 433
              G L++LT   DGR V  + +                S  + +G    F+GS  G+S+
Sbjct: 378 LNNGRLLILTFTLDGRTVSGMTIQTVAADHGGHLVKSAASCTSNLGRGRLFIGSEDGESV 437

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASN 492
           ++ +T       L++ L+ +  + + D            D   D+ N    +++  +A+ 
Sbjct: 438 MLGWTG------LTNQLRRKLSNADLDG-EDDSDEEEIDDMEDDLYNDTAPTMHKITAAV 490

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYE 540
           +  +A  T++F + D L +I P+KD            + G + ++    A G     + E
Sbjct: 491 SEPTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDRE 550

Query: 541 L-------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EA 585
           L        ELP   G+W V+ +       +     D+    A D +Y  YL++S   E 
Sbjct: 551 LHPISVATKELPLAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGED 610

Query: 586 RTMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
            T+V E   D LTE  +      +G T+  G L    +V+QV     RI D       + 
Sbjct: 611 GTVVYEVNGDQLTETDKGDFEREEGTTLLVGVLAAGTKVVQVMRTEVRIYDSELNLVHIQ 670

Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
                 E GS  E + +++ S ADPY+L+   D S+++ 
Sbjct: 671 SMEEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708



 Score = 65.1 bits (157), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 53/226 (23%), Positives = 97/226 (42%), Gaps = 27/226 (11%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT 973
            +    +I G+   F  G+ P + +           L    + + T  H  +C  GF Y+ 
Sbjct: 870  LVALDDICGYSTVFQRGTTPAFILKEASSAPRVIGLSGKPVKSLTSFHTSSCQRGFAYLD 929

Query: 974  SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVL 1033
            S   L+ICQLP  + Y +     + +P+ +  H +TY     LY  IV            
Sbjct: 930  STDTLRICQLPPQTHYGHLGWATRRMPMDSEVHTLTYHP-PGLY--IVGT---------- 976

Query: 1034 SLLIDQEVGHQID-----NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1088
                 Q   +Q+D     +++L   DL    ++E   +++L+       W    T  +  
Sbjct: 977  ----GQAEDYQLDPTETYHYDLPKEDLTFKPSIERGVIKLLDEKS----WTIIDTHVLDP 1028

Query: 1089 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
             E  L+++ + L  +  T + + L+A+GT+ V GED+A +G + +F
Sbjct: 1029 QEVVLSIKTLNLEVSEITHQRKDLIAVGTSVVHGEDLATKGCIRIF 1074


>gi|449299306|gb|EMC95320.1| hypothetical protein BAUCODRAFT_25380 [Baudoinia compniacensis UAMH
            10762]
          Length = 1437

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 248/1174 (21%), Positives = 438/1174 (37%), Gaps = 230/1174 (19%)

Query: 52   IGP-VPNLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
            IGP   NLVV   ++++++ V R+ +       +  + + R          L L+  Y L
Sbjct: 22   IGPQADNLVVAKTSLLQVFEVKRISQAKDNGHHDHADAQSR----------LSLIGEYTL 71

Query: 110  HGNVESLAILS-----QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
             G V +L+ ++      GGA       +++ AF+DAK+S++E+D   + +   S+H +E 
Sbjct: 72   SGTVTALSPITLPSSRTGGA-------ALVCAFKDAKLSLIEWDPEHYRISTISIHYYEG 124

Query: 165  PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------- 211
               L    G        ++ VDP  RC  +     Q+ IL   Q G              
Sbjct: 125  DNVLLPPFGAALSECESILTVDPGSRCAALKFGERQLAILPFRQQGDELADEAAEDADMA 184

Query: 212  --------GLVGDEDTFGSGGGFSAR----IESSHVINLRDLD--MKHVKDFIFVHGYIE 257
                    G V  + T  +    S       +SS V+ L  LD  + H     F+H Y E
Sbjct: 185  EAESEEQPGNVTLKRTSTTQALDSKDDITPYKSSFVLPLITLDPSLTHPVHLAFLHEYRE 244

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            P   IL   +      +  +      +  ++    +    + S   LP D +K++A+P P
Sbjct: 245  PTFGILSAPQQPSLALLDERKDCLSYTVFTLDLEQRASTNLMSVSKLPSDLWKVIALPPP 304

Query: 318  IGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
            +GG L+VG N  IH      + A+A+N +A    +      S  +++L+      L +  
Sbjct: 305  VGGALLVGTNELIHIDQSGKTTAVAVNEFAKVASNFSMADHSDLNMKLEGCEIEMLDSST 364

Query: 377  --ALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSD----ITTIGNSLFFLGS 427
              AL+    G    L+    GR V  L +S+   TN   + +     + ++     F+GS
Sbjct: 365  GNALIVLNDGSFATLSFKMLGRTVGGLTVSRVADTNGGNVNASAPSCVASMQQQKLFVGS 424

Query: 428  RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD---------------APSTKRLRRSSS 472
              G S LV++   + T        +  G                    APS   ++R++S
Sbjct: 425  EDGSSSLVRWAKDTPTLSRKRSHAQMLGQDAPMDDADDAEELDEDDLYAPSAVAVKRAAS 484

Query: 473  DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RINAD--- 527
                             A+     A  T++F + DSL ++ P+ +   G   R   +   
Sbjct: 485  ----------------VANAAAVDASTTYTFELEDSLNSLAPMNNVCLGRSPRTGKEKLE 528

Query: 528  -ASATGISKQS-----NYELV-------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
              +  G  K S     N E++       ++ G K IW+V  +S  G    S+      D 
Sbjct: 529  LVAGIGRGKASSLAFMNREIIPNEIRSRDVAGAKDIWSVCARSREGDKVSSA------DT 582

Query: 575  YHAYLIISLEARTMVLETAD----LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
            Y   L +     T   + AD     + E+ E+ D+   G T+  G L     ++Q     
Sbjct: 583  YDNLLFVFDGESTKTYKYADSAEGSIIELDET-DFEGDGETVCVGTLANGSCIVQCRRTE 641

Query: 631  ARILDGSYMTQDLSFGPSNSESGSGSENST---VLSVSIADPYVLLGMSDGSIRLLVGDP 687
             R       T D   G S     S  E      +++ S  DPY+L+   D S+++L  D 
Sbjct: 642  IR-------TYDHQLGLSQIIPMSDDETDAELKIVATSFCDPYLLVIQDDSSVQILQVDK 694

Query: 688  STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
                         +   +P+ +         E  LR+     WL+  +         G L
Sbjct: 695  -------------QGDVEPLDAA--------ESDLREGK---WLTGSL-------YAGEL 723

Query: 748  DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
              G   + +  + G L++F +P    V++              ++   L         S+
Sbjct: 724  SDGQSAAFLLGQEGGLQVFSLPETKLVYSAPTL---------PFLPPVL---------SA 765

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
            +    +G K  +  + VV+L  +      +RP+L        ++ Y+ + +         
Sbjct: 766  DAPQRRGGKVTLTEVLVVDLGAEGV----TRPYLIVRTAMDDLILYEPFHY--------- 812

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE----ETPHGAPCQRITIFKNISGH 923
                    S +  +  A+   +LRF + P     +     +T  G P Q       I G 
Sbjct: 813  --------SATTLDARATGFTDLRFRKVPFTYLPKYDEGLDTADGRPAQLQPAV--IGGR 862

Query: 924  QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
               +L G  P + +     L     L    + +F+ LH   C  GF  V   G LK  QL
Sbjct: 863  NALYLPGGTPSFLVKEATSLPKVLGLRARGVRSFSPLHRAGCQQGFALVDGDGKLKEYQL 922

Query: 984  PSGSTYDNYWPVQKVIPLKATPH---QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE 1040
            P   ++   W V + + L   P    Q+ Y  ++ +Y       V+     V   L D +
Sbjct: 923  PGHVSFATGWSV-RTLTLGEPPQEVRQVAYHEQRGIY-------VVATCRDVDFTLHDLD 974

Query: 1041 VGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100
               + D  NL          V +Y + +L    +    +   ++ M  +E   +++++ L
Sbjct: 975  ERQRDDEPNLKP-------QVPQYTLHLL----SATSHKVIQSLEMPYAEIVTSLKIMPL 1023

Query: 1101 -FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
              +  T E + +L +G A  +GED  A+G + +F
Sbjct: 1024 EVSEHTHEQKLMLVVGAAAQRGEDAPAKGLLTVF 1057


>gi|147772179|emb|CAN73417.1| hypothetical protein VITISV_017053 [Vitis vinifera]
          Length = 609

 Score =  134 bits (336), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 69/122 (56%), Positives = 80/122 (65%), Gaps = 26/122 (21%)

Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
           F V DSL+N+GPLK F+Y LRINAD  ATGI KQSN+EL                     
Sbjct: 430 FEVNDSLINVGPLKVFAYALRINADLKATGIVKQSNFELMCCSGHGKNGALCILQQSIRP 489

Query: 542 -----VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
                VEL GC+ IWTVYHK++RGHNADS++M   DDEY AYLIIS E+RTMVLET +LL
Sbjct: 490 EMITEVELSGCERIWTVYHKNTRGHNADSTKMVTKDDEYCAYLIISPESRTMVLETVELL 549

Query: 597 TE 598
            E
Sbjct: 550 GE 551


>gi|115490949|ref|XP_001210102.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114196962|gb|EAU38662.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 908

 Score =  132 bits (332), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 177/731 (24%), Positives = 297/731 (40%), Gaps = 148/731 (20%)

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
           ++++LAF +AK+S++E+D   HG+   S+H +E  +        +  + G ++ V+P  R
Sbjct: 62  EAVLLAFRNAKLSLIEWDPERHGISTISIHYYERDDLTCSPWVPDLSSCGSILDVEPSSR 121

Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGD-------------EDTFGSGGGFSARIESSHVI 236
           C  V  +G++ + I+   Q G  LV D             ++T      ++    SS V+
Sbjct: 122 CA-VFNFGIRNLAIIPFHQPGDDLVMDDYDSDLDERKHVDQETTRESPAYATPYASSFVL 180

Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            L   D  + H     F+H Y EP   IL+ +  T    +  +      S  ++    + 
Sbjct: 181 PLTAFDPSILHPISLAFLHEYREPTFGILYSQVATSNALLHERKDVVFYSVFTLDLEQRA 240

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQ 353
              + S   LP D + ++A+P P+GG L++G+N  +H      + A+ +N ++  + +  
Sbjct: 241 STTLLSVARLPSDLFHVVALPPPVGGSLLIGSNELVHVDQAGKTNAVGVNEFSRQVSAFS 300

Query: 354 ELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRL---------- 401
              +S  ++ L+      L ++    +L   TG++VL+    DGR V  +          
Sbjct: 301 MTDQSDLALRLEGCRVERLADNSGDMILILSTGNMVLIKFKLDGRSVSGISVHPVPVHAG 360

Query: 402 -DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
            DL K+      S    +GN   FLGS   DSLL+      G S LSSG           
Sbjct: 361 GDLMKS----AASSSAFLGNGEVFLGSEDADSLLL------GWSDLSSG----------- 399

Query: 461 APSTKRLRRSSSDA-------LQDMVNGE--ELSLYGSASNNTESAQKT---------FS 502
              TKRLR   +DA         +M + +  E  LY ++ + T   ++          ++
Sbjct: 400 ---TKRLRSHKNDANDSGDVSDDNMSDDDVYEDDLYSTSPDATADGRRVSADPSSFGLYN 456

Query: 503 FAVRDSLVNIGPLKDFSYG--------LRINADASATGISKQS---NYELVEL------- 544
           F + D L+NI PL+D + G         + N  A    ++ Q    N  L+ +       
Sbjct: 457 FRINDRLLNIAPLRDITLGKPSTFDKDRKDNVSAELELVASQGSDRNGGLIAMRREIDPE 516

Query: 545 -------PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA---- 593
                       +WT   +SS G ++            H Y+I+S +      ET     
Sbjct: 517 VLASFTIDSANCVWTACVESSGGKDS------------HQYVIVSKQTNIDKEETEIFRV 564

Query: 594 ---DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---P 647
              DL       V+   +  TI  G L  + RV+QV +   R  D      DL      P
Sbjct: 565 DGLDLKPIKAPEVNPN-EEVTIDVGTLAKQSRVVQVLKNEVRCYDA-----DLGLAQIYP 618

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
              E    S+    +S S+ DPYV +   D ++ LL  D S     V+ P  + ++ K +
Sbjct: 619 VWDE--DTSDEHPAVSASVTDPYVAILRDDSTLLLLHVDDSGDVDEVEMPDNM-AAHKWL 675

Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
           SSC LY DK                TGV  +     G    Q D++  +  +   L I+ 
Sbjct: 676 SSC-LYLDK----------------TGVFASNTDTKGS--RQNDMFLFLLGQDCRLFIYR 716

Query: 768 VPNFNCVFTVD 778
           +P+   V T+D
Sbjct: 717 LPDLLLVSTID 727


>gi|169603229|ref|XP_001795036.1| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
 gi|160706354|gb|EAT88382.2| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
          Length = 1338

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 167/727 (22%), Positives = 295/727 (40%), Gaps = 142/727 (19%)

Query: 57  NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NL+V   ++++++     V  V   G  E+ N+      E     L    + A L LV  
Sbjct: 28  NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + L G V SLA +     +   R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88  FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145

Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
                 W    +   +F     +  DP  RC  +      + IL                
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
           +A+Q G      E   G+ G  + +    SS V+ L +LD  + H     F+H Y EP  
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            ++   + T A  ++ +      +  ++    K    + S   LP+D  +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316

Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
            L+VG+N  IH      +  +A+N  A +  S     ++  ++ L+      L  D    
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI---TTIGNSLFFLGSRLG 430
           ++    G + +LT   DGR V  + +    +    ++L +     T +G    F+GS  G
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPVPADNGGNILKTRASCSTNLGRGRLFIGSEDG 436

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
           +S+L+ +T                        ++ +LRR  S+  Q   + E++S     
Sbjct: 437 ESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEEE 472

Query: 491 S---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD----------- 517
                     N+T +  K             T++F V D L +I P++D           
Sbjct: 473 EVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTES 532

Query: 518 FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------GH 561
            + G + ++    A G     N EL        ELP   G+W V+ K           G 
Sbjct: 533 LTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFGQ 592

Query: 562 NADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAAG 615
           +A+++  A+ D +Y  YL++S         T+V E        TE  D+   +G T++ G
Sbjct: 593 DAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSVG 650

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
            L    +V+QV     R  D     + +   P   E      N  +++ S ADPY+L+  
Sbjct: 651 VLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQR 706

Query: 676 SDGSIRL 682
            D S+++
Sbjct: 707 EDSSVKI 713



 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 53/222 (23%), Positives = 91/222 (40%), Gaps = 25/222 (11%)

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
              NI+G+      G  P + +           L    + + T  H  +C  GF Y+ S  
Sbjct: 846  LDNINGYSTVIQRGRSPAFILKESSSAPRVIGLSGNPVKSLTRFHTSSCQRGFAYLDSTD 905

Query: 977  ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVL 1033
             L+I QLP  + Y +     + +P+ A  H + Y     LY +    P    L P +   
Sbjct: 906  TLRISQLPPSTHYGHLGWAARRMPMDAEVHALAYHP-SGLYVIGTGQPEEYTLDPNDTFH 964

Query: 1034 SLLIDQEVGHQID-NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENA 1092
              L  +E   +    H +  V   +T+TV   +  +L+P                  E  
Sbjct: 965  YELPKEETSFKPKVEHGIIKVMDEKTWTV--IDTHVLDP-----------------QEVI 1005

Query: 1093 LTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            L ++ + L  + TT + + ++A+GTA V GED+A +G + +F
Sbjct: 1006 LCIKTLNLEVSETTHQRKDVIAVGTAIVLGEDLATKGNIRIF 1047


>gi|414587800|tpg|DAA38371.1| TPA: hypothetical protein ZEAMMB73_571351 [Zea mays]
          Length = 108

 Score =  132 bits (331), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 68/98 (69%), Positives = 77/98 (78%)

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
           MVL+T D L EVTE+VDY VQG TIAAGNLFGR RVIQV+ +GAR+LDGS+MTQ+L+F  
Sbjct: 1   MVLQTGDDLGEVTETVDYNVQGSTIAAGNLFGRCRVIQVYAKGARVLDGSFMTQELNFSM 60

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
             SES   SE     S SIADPYVLL MSDGSIRLL+G
Sbjct: 61  HTSESSLNSEPLAAASASIADPYVLLKMSDGSIRLLIG 98


>gi|121925707|sp|Q0UUE2.1|CFT1_PHANO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
          Length = 1375

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 167/727 (22%), Positives = 295/727 (40%), Gaps = 142/727 (19%)

Query: 57  NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NL+V   ++++++     V  V   G  E+ N+      E     L    + A L LV  
Sbjct: 28  NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + L G V SLA +     +   R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88  FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145

Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
                 W    +   +F     +  DP  RC  +      + IL                
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
           +A+Q G      E   G+ G  + +    SS V+ L +LD  + H     F+H Y EP  
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            ++   + T A  ++ +      +  ++    K    + S   LP+D  +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316

Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
            L+VG+N  IH      +  +A+N  A +  S     ++  ++ L+      L  D    
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDI---TTIGNSLFFLGSRLG 430
           ++    G + +LT   DGR V  + +    +    ++L +     T +G    F+GS  G
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPVPADNGGNILKTRASCSTNLGRGRLFIGSEDG 436

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
           +S+L+ +T                        ++ +LRR  S+  Q   + E++S     
Sbjct: 437 ESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEEE 472

Query: 491 S---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD----------- 517
                     N+T +  K             T++F V D L +I P++D           
Sbjct: 473 EVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTES 532

Query: 518 FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------GH 561
            + G + ++    A G     N EL        ELP   G+W V+ K           G 
Sbjct: 533 LTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFGQ 592

Query: 562 NADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAAG 615
           +A+++  A+ D +Y  YL++S         T+V E        TE  D+   +G T++ G
Sbjct: 593 DAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSVG 650

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
            L    +V+QV     R  D     + +   P   E      N  +++ S ADPY+L+  
Sbjct: 651 VLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQR 706

Query: 676 SDGSIRL 682
            D S+++
Sbjct: 707 EDSSVKI 713



 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 61/269 (22%), Positives = 103/269 (38%), Gaps = 25/269 (9%)

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
            P  +S  L   N+   +L      R   D    E          +    NI+G+      
Sbjct: 836  PSRSSSDLWTHNLRWVKLSQQHVPRYMEDGAQEEAADEPGFESTLLALDNINGYSTVIQR 895

Query: 930  GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            G  P + +           L    + + T  H  +C  GF Y+ S   L+I QLP  + Y
Sbjct: 896  GRSPAFILKESSSAPRVIGLSGNPVKSLTRFHTSSCQRGFAYLDSTDTLRISQLPPSTHY 955

Query: 990  DNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP---VLKPLNQVLSLLIDQEVGHQID 1046
             +     + +P+ A  H + Y     LY +    P    L P +     L  +E   +  
Sbjct: 956  GHLGWAARRMPMDAEVHALAYHP-SGLYVIGTGQPEEYTLDPNDTFHYELPKEETSFKPK 1014

Query: 1047 -NHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTT 1104
              H +  V   +T+TV   +  +L+P                  E  L ++ + L  + T
Sbjct: 1015 VEHGIIKVMDEKTWTV--IDTHVLDP-----------------QEVILCIKTLNLEVSET 1055

Query: 1105 TKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            T + + ++A+GTA V GED+A +G + +F
Sbjct: 1056 THQRKDVIAVGTAIVLGEDLATKGNIRIF 1084


>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Neosartorya fischeri NRRL 181]
 gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
           protein 1
 gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Neosartorya fischeri NRRL 181]
          Length = 1400

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 179/747 (23%), Positives = 309/747 (41%), Gaps = 119/747 (15%)

Query: 57  NLVVTAANVIEIY-VVRVQEE---GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           NLVV   +V++I+ +++VQ     G+ E K++         D +    L L   Y L G 
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHLRGGTIEGKSARP-------DRVETTKLVLEREYPLSGT 80

Query: 113 VESLA---ILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
           V  +    IL+   GG       ++++LAF +AK+S++E+D   HG+   S+H +E  + 
Sbjct: 81  VVDICRVKILNPKSGG-------EALLLAFRNAKLSLVEWDPERHGISTLSIHYYERDDL 133

Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGD-------EDT 219
                  +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D       +D 
Sbjct: 134 TRSPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDD 192

Query: 220 FGS-----GGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
           F       G    ++          SS V+ L  LD  + H     F++ Y EP   +L+
Sbjct: 193 FNQVSDHVGNDLKSKDRTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGVLY 252

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
            +  T    +  +  +   +  ++    +    + S   LP D +K++A+P P+GG L++
Sbjct: 253 SQIATSHALLPERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLI 312

Query: 325 GANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
           G+N  +H      + A+ +N +A  + +   + +S  ++ L+      L +     LL  
Sbjct: 313 GSNELVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHLSDSTGDLLLVL 372

Query: 382 KTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLL 434
            +G++VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L
Sbjct: 373 SSGNMVLVHFQLDGRSVSGISLRPLPAQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVL 432

Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASN 492
           + ++  S         +    ++  D        +S  D  + D+   E E    G   +
Sbjct: 433 LSWSSMSSN---PKKPRPRMSNVAEDREEASVDSQSEEDVYEDDLYTAEPETPALGRRPS 489

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG-----------LRINADASATGISKQS---N 538
              S    + F + D L NIGPL+D + G           L  NA +    I+ Q    N
Sbjct: 490 AETSGVGVYIFQILDRLPNIGPLRDITLGKPASTVENTGRLIENACSELELIAAQGSGRN 549

Query: 539 YELV--------------ELPGCKGIWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIIS 582
             LV              +    +G+WT       G     D  R+   + EY  Y+I+S
Sbjct: 550 GGLVLMKREIEPDVAASFDAQSVQGVWTAVVALGSGAPLVPDEQRI---NQEYRQYVILS 606

Query: 583 LEARTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILD 635
            +      E +++     + +  F           TI  G L  +RRV+QV     R   
Sbjct: 607 -KPEAPDKEQSEVFIADKQDLKPFKAPEFNPNNDVTIEIGTLSCKRRVVQVLRNEVR--- 662

Query: 636 GSYMTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC 690
            SY   D+  G     P   E    S+    +S S+ADPY+ +   D ++ LL  D S  
Sbjct: 663 -SY---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMLLQADDSGD 716

Query: 691 TVSVQTPAAIESSKKPVSSCTLYHDKG 717
              V+   +  + K    SC LY DK 
Sbjct: 717 LDEVELDDSTRAGK--WRSCCLYWDKA 741



 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 51/236 (21%), Positives = 99/236 (41%), Gaps = 23/236 (9%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRER----LRVHPQLCDGSIVAFTVLHNVNCNHGF 969
            + I  NIS     F+ G    + +   +      R+  +   G  ++   L + + + GF
Sbjct: 885  LRILPNISDLSAVFMPGPSASFILKTAKSCPHVFRLRGEFVRG--LSIFDLASPSLDKGF 942

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
            IYV S+ +L+IC+ PS + +D  W ++K+   +   H + Y      Y L  S       
Sbjct: 943  IYVDSKDVLRICRFPSETLFDYTWALRKIGIGEQVDH-LAYATSSETYVLGTS------H 995

Query: 1030 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1089
            +    L  D E+     N  +S +   R  +++    R          W    +  +  +
Sbjct: 996  SADFKLPDDDELHPDWRNEVISFLPELRQCSLKVVSPRT---------WTVIDSYSLGPA 1046

Query: 1090 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
            E  + V+ + L  +  T E   ++ +GTA+  GED+ +RG + +F   +   +P+ 
Sbjct: 1047 EYVMAVKNMDLEVSENTHERRNMIVVGTAFAWGEDIPSRGCIYVFEVIKVVPDPEK 1102


>gi|380488833|emb|CCF37111.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
          Length = 1062

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 232/1047 (22%), Positives = 400/1047 (38%), Gaps = 173/1047 (16%)

Query: 69   YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
            Y  R+ ++   ES   G     V  D      L LV  Y + G V  LA +    S+ G 
Sbjct: 66   YDHRLNDDDGLESSFLGGDGMLVRADRAINTKLVLVAEYPIFGIVTGLAKIKLQYSKSGG 125

Query: 125  DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
            +      ++++A   A++S++++D   H L   S+H +E  E         S   GPL  
Sbjct: 126  E------ALLIATRVARLSLVQWDPEKHALEDISIHYYEKEEL------EGSPFDGPLNN 173

Query: 183  ----VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES------ 232
                +  DP  RC  +      +  L   Q    +  D+      G   A+  S      
Sbjct: 174  YRTHLAADPGSRCAALRFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPSATAATN 233

Query: 233  ------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                        S+V+ L  LD  + H     F+H Y EP   I+   +          H
Sbjct: 234  GTSNIADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSTQRRSNTLPRKDH 293

Query: 279  HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSAS 337
             +  +  L +    +    I S  NLP D +K++A+P P+GG L+VG N  IH       
Sbjct: 294  FSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPVGGALLVGTNELIHIDQSGKP 351

Query: 338  CALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDG 395
              +A+N +     +     +S   + L+  +   +  +N   L+    G L ++T   DG
Sbjct: 352  NGVAVNPFTKETTNFPLADQSDLDLRLEHCYIELMSAENGELLMILSDGRLAIITFKIDG 411

Query: 396  RVVQ----RLDLSKTNPSVLTSDITTIGN---SLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
            R V     +L  ++    ++   ++TI     ++FF+G+   DSL++ +T     +    
Sbjct: 412  RTVSGVGVKLVPTEVGGGIVQCSVSTISRLSRNVFFVGTTGSDSLVLGWTRKQAQNARK- 470

Query: 449  GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVR 506
              K    D   D+           D   D + GE  +  +   A+ N  S     +F V 
Sbjct: 471  --KTRLVD---DSFEYDLEDEDMEDDDDDDLYGETTTTMIQPGATANGVSKGGDLTFRVH 525

Query: 507  DSLVNIGPLKDFSYGLRI-------------------------NADASATGISKQSNYEL 541
            DSL++I P+KD + G +                            +A A  I  Q+    
Sbjct: 526  DSLLSIAPVKDMTSGKQAFIPDSEEEKNSVGVVADLQLACVVGRGNAGAVAIVNQNIQPK 585

Query: 542  V----ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMV 589
            V    E P  +G WT+       KS +G    ++ +A+  D   +Y  ++I+S +     
Sbjct: 586  VIGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVASEFDASSKYDKFMIVS-KVDLDG 644

Query: 590  LETADLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQ 641
             ET+D+        +           G T+ AG +    R+IQV +   R  DG   ++Q
Sbjct: 645  YETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQ 704

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
             L     + E+G+      V+S SI DPY+LL   D SI +   D +     V+      
Sbjct: 705  ILPM--LDEETGA---EPRVISASITDPYLLLVRDDSSIMVAQIDNNCELEEVEKQDDTI 759

Query: 702  SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
             S K ++ C LY D                +TG+   +    G P  Q + +  +   +G
Sbjct: 760  LSTKWLAGC-LYTD----------------TTGLFAPMQTDKGTPEGQ-NTFMFLLSAAG 801

Query: 762  ALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
            AL I+ +PN +    V    +G T+ V  ++           + +   GT Q   E +  
Sbjct: 802  ALYIYALPNLSKPVYV---AAGLTY-VPPFL---------SADYAVRRGTVQ---ETLTE 845

Query: 822  MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
            + V +L         + P+L     +  +  Y+    E  + T      +  +++L    
Sbjct: 846  LLVADLG----DTTATSPYLIVRHANDDLTIYEPIRLESQDKT------LGLAKTLHFQK 895

Query: 882  VSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
            ++     N   +++P++    E  E P   P +      NI+G+   FL G+ P   +  
Sbjct: 896  IT-----NPALAKSPVEVADDEANEQPRFVPLRPCA---NINGYSTVFLPGASPSLIV-- 945

Query: 940  RERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
             +  +  P+   L    +   +  H   C  GFIY  S+G  ++ QLP+ S +       
Sbjct: 946  -KSAKSSPKVVGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSV 1004

Query: 997  KVIPLKATPHQITYFAEKNLYPLIVSV 1023
            + IP+      I Y      Y +  S+
Sbjct: 1005 RKIPIGDAVGLIAYHPPMETYAVACSI 1031


>gi|154320778|ref|XP_001559705.1| hypothetical protein BC1G_01861 [Botryotinia fuckeliana B05.10]
          Length = 1153

 Score =  130 bits (327), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 191/909 (21%), Positives = 356/909 (39%), Gaps = 153/909 (16%)

Query: 293  KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDS 351
            K    I S   LP+D ++++ +  P+GG L+VG N  IH      +  +A+N +A     
Sbjct: 10   KASTTILSVGGLPYDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTG 69

Query: 352  SQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP- 408
               L ++   + L+      L  +N   L+   +GD+ +L+   DGR V  L + + +  
Sbjct: 70   FSLLDQADLDLRLEGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAE 129

Query: 409  ---SVLT---SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
               ++LT   S ++++G    F+GS + DS+++ +   SG +       +     E D  
Sbjct: 130  LGGAILTGAASCVSSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVD-- 187

Query: 463  STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
                      +   D + G+  ++  + +N T S  KT  ++F + DS+VNI P+ + ++
Sbjct: 188  -EAMFDEEDLEDDDDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAF 246

Query: 521  G---LRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
            G   L +  D        QS  +LV                          +LP  +GIW
Sbjct: 247  GEAALSLGKDEELKSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIW 306

Query: 552  TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
            T+  K  + +G   +  +         D +Y   +I+S   +A   + E+A     D   
Sbjct: 307  TMSAKRPAPKGLQVNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGF 366

Query: 598  EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
            E     ++    G TI AG L    RV+Q+ +   R  DG   + Q L     + E+G+ 
Sbjct: 367  EALTGTEFEPAAGSTIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA- 423

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
                 ++S S ADP++LL   D SI +   D       ++    I  S K ++ C LY D
Sbjct: 424  --EPKIISASFADPFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD 480

Query: 716  KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
                       +D+  S   GE             ++   +    GAL I+ +P+ +   
Sbjct: 481  ------YSGAFSDSK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPV 520

Query: 776  TVDK---FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
             V +   FV           + A +++ TEI                       L     
Sbjct: 521  YVAEGICFVPPVLSADYAARKSAARETLTEI-----------------------LVANLG 557

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
             +    P+L    ++  +  Y+ +  +            S S  L  S +   +++N   
Sbjct: 558  DSVSQSPYLILRPSNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHL 605

Query: 893  SRTPLDAYTREETPHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
            ++ P    + EE   GA      P + I+   N+ G+   F+ G  P + +   +     
Sbjct: 606  TQAP--DVSAEEQVDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKV 660

Query: 947  PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPH 1006
              L    + + +  H   C+ GFIY +++GI ++ Q P  +T+ +     + I +    H
Sbjct: 661  LSLQGTGVRSLSSFHTEGCDRGFIYASTEGIARVAQFPPNTTFADIGMALRKIEIGEDVH 720

Query: 1007 QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL-SSVDLHRTYTVEEYE 1065
             + Y      Y +  S               D E+    D+       ++    ++E+  
Sbjct: 721  AVAYHPPLQTYVIGTST------------FTDFELPKDDDHRKTWQEENIALKPSIEKSF 768

Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDV 1124
            ++++ P      W     I ++  E    ++ + L  +  T E + L+ +GTA  +GED+
Sbjct: 769  LKLVSPVN----WSVIDAIELEPCELITCIKTMNLVISEVTNERKHLIVVGTAITKGEDL 824

Query: 1125 AARGRVLLF 1133
            A  GR+ ++
Sbjct: 825  ATTGRLYVY 833


>gi|403411348|emb|CCL98048.1| predicted protein [Fibroporia radiculosa]
          Length = 1437

 Score =  130 bits (326), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 239/1177 (20%), Positives = 462/1177 (39%), Gaps = 176/1177 (14%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLE-------------- 102
            N+VV  +N++ I+ VR +       K     +R  +  G  A   E              
Sbjct: 45   NVVVARSNLLRIFEVREEPAPFSTQKEDERDRRASMRKGTEAVEGEVEMDASGEGFVNMG 104

Query: 103  --------------------LVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFED 139
                                L+  +RLHG    +E + I++    ++S   D ++++F+D
Sbjct: 105  SVKSTGQNGILHQPTVNRFYLIREHRLHGIVTGIEGVRIIT--SIEDSF--DRLLVSFKD 160

Query: 140  AKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGV 194
            AKI++LE+ +++H L   S+H +E +P+ + +          PL    ++ DP  RC  +
Sbjct: 161  AKIALLEWSEAMHDLITVSIHTYERAPQLMAID--------APLFRSQLRADPLSRCAAL 212

Query: 195  LVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIF 251
             +    + IL   Q  + L  D     +          S +++L    D  +++V DF+F
Sbjct: 213  SLPKDSIAILPFYQSQAEL--DIMEHETSQARDVPYSPSFILDLSADVDTRIRNVIDFVF 270

Query: 252  VHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKL 311
            + G+  P + +L + + TW GR+     T  +   ++    + +P+I +   LPHD + +
Sbjct: 271  LPGFNSPTIAVLFQYQQTWTGRLKEYKDTVGLILFTLDLVTRHYPVITAIDGLPHDCFAM 330

Query: 312  LAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFS-------VE 363
                + +GGV+V+ +N+I Y  Q+     L ++ +   L    +LP  S S       ++
Sbjct: 331  APCSTALGGVVVLASNSIIYVDQATRRVILPVSGW---LPRISDLPIPSLSHQDQQRDLQ 387

Query: 364  LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS-KTNPSVLTSDITTIGNSL 422
            L+ +   ++ +    +  K G +  + ++ DG+ V RL ++     + + S +  + +  
Sbjct: 388  LEGSQFVFVDDRTLYVVLKDGTVYPVEIIVDGKTVSRLSMAPPVARTTMPSLVRKMQDDY 447

Query: 423  FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD--APSTKRLRRSSSDALQDMVN 480
             F+GS +G S+L++ T          G   E   + A   AP+         D       
Sbjct: 448  LFVGSIIGPSVLLKTT---RVEEDIEGDDVEMASVPATVVAPNNAMDLDDDDDLYGGSAV 504

Query: 481  GEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASAT 531
             E+  + G   N + +  K       +  DSL   GP+ D ++ L  N D       +AT
Sbjct: 505  IEQPHMNGITQNGSTAISKKRTVVQLSFCDSLPAYGPIADMTFTLAKNGDRAVPELVAAT 564

Query: 532  GISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
            G      + L++  LP           G +GIW++  + +   N  +    A  + YHA 
Sbjct: 565  GSGMLGGFTLLQRDLPTRTKRKMHAIGGGRGIWSLLVRQAVKVNGSTYERPA--NPYHAE 622

Query: 579  ---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
               ++IS +A     +   A    +    +   + G TI A   F    ++ V      +
Sbjct: 623  NDSIVISTDANPSPGLSRIASRNAQGDIQITTRIPGTTIGAAPFFQGTAILHVMINVTNV 682

Query: 634  LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
            +    +  D +      +         +   SI DP+VL+   D SI L +G+     + 
Sbjct: 683  I--RVLEPDGTERQVIKDWDGNIPRPKIRFCSICDPFVLIIRDDDSIGLFIGESERGKIR 740

Query: 694  VQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
             +  + + E + + ++ C  + D      + + +  A +           + G   Q   
Sbjct: 741  RKDMSPMGEKTSRYLAGC-FFTDTSGIFQVHQNAQAAGIEGATSTLQSVMNAGNRTQ--- 796

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++C   G +EI+ +P     F+        + + D Y   AL        S  ++   
Sbjct: 797  WLILCRPQGVIEIWTLPKLGLAFSTTHAAGLESVLTDLYDPPAL--------SVPQDPPR 848

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            + ++ +I  + V  L          RP L   L  G +  Y+ +    P        P +
Sbjct: 849  KPQELDIEQLLVAPLG-----ESSPRPHLMLFLRSGQLAVYEVHSTPVPAEPL----PAA 899

Query: 873  TSRSLSVSNVSA-SRLRNLRFSRTPLDAYTREETPHGAPCQRIT-----IFKNISGHQ-- 924
             S +L V  V   SR  N++ S     +   E+       +RI+        + S  Q  
Sbjct: 900  RSSTLLVKFVKVLSRAFNIQHSDEVEKSVLAEQ-------KRISHLLIPFATSPSPGQTF 952

Query: 925  -GFFLSGSRPCWCMVF-RERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKIC 981
             G FL+G RP W +   +  ++V P     S+V AFT        + F+  + +G   + 
Sbjct: 953  SGVFLTGDRPSWLLCTDKGGVKVLPS--GHSVVHAFTASSVWESKNDFLLYSEEGPSLME 1010

Query: 982  QLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE- 1040
             LP     D +      +P ++ P   +Y           S  V  P   ++     Q+ 
Sbjct: 1011 WLPD-VQLDGH------LPSRSVPRPRSY-----------SNVVYDPSTSLIVAASSQQS 1052

Query: 1041 --VGHQIDNHNLSSVDLHRTY-TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
                +  D + +   D + ++ + E   + ++ P+     W T        +E    +  
Sbjct: 1053 KFASYDEDGNIVWEPDTNISFPSCECSALELISPEG----WVTMDGYEFAQNEFVNCLDC 1108

Query: 1098 VTLFNTTTKE-NETLLAIGTAYVQGEDVAARGRVLLF 1133
            +TL   +T+   +  +A+GT   +GED+A +G V +F
Sbjct: 1109 ITLETMSTETGTKDFIAVGTTINRGEDLAVKGAVYIF 1145


>gi|346971831|gb|EGY15283.1| cft-1 [Verticillium dahliae VdLs.17]
          Length = 1445

 Score =  129 bits (323), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 256/1199 (21%), Positives = 459/1199 (38%), Gaps = 225/1199 (18%)

Query: 57   NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
            NL+V+  ++++I+ V+         +  +K + N+GET  R + D  G+ +A        
Sbjct: 28   NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSTSNAGETYDRRINDDDGLESAFLGGDGML 87

Query: 100  ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
                      L LV  Y +HG +  LA +      +SR   +++++    A++S+L++D 
Sbjct: 88   MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144

Query: 150  SIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMII 203
              HG+   S+H +E  EW      + S   GPL      ++ DPQ RC   L +GL+ I 
Sbjct: 145  EKHGVEDISIHFYEKEEW------QGSPMDGPLRQHATILQADPQSRCAA-LKFGLRKIA 197

Query: 204  L-------------KASQGGSGLVGDEDTFGSGGGFSARIES----------SHVINLRD 240
                             +   G    E+   +     +   S          S V+ L  
Sbjct: 198  FLPFRQIDGDIDMDDWDEEVDGPRPQEEPPAAAAVHGSSSNSSSLAPVPYTPSFVLALPQ 257

Query: 241  LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
            LD  + H   F F+H Y EP + I+              H T  +  + +        L 
Sbjct: 258  LDPEILHPVHFAFLHEYREPTLGIISSTNRRLKMEPQKDHFTFKVFTVDL--------LQ 309

Query: 299  WSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPR 357
             +++N      K++A+P P+GG L++G N  IH      +  +A+N YA  +       +
Sbjct: 310  KASLN------KVIALPKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKMTKFPLADQ 363

Query: 358  SSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVL 411
            S   + L+      +  +N   LL T+ G++ ++T   DGR V  + +    ++    VL
Sbjct: 364  SELKLRLEHCEVELMSPENGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVATENGGDVL 423

Query: 412  ---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
                + +T +  +  F G+  GDS ++    G     + +  K+     E+         
Sbjct: 424  PFRAACLTKVTKNSMFYGTIGGDSKVI----GWSRQHVQTARKKARLLDESLDYDLDDDE 479

Query: 469  RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
                D   D + GE       ++    +      F V DSL+++ P+ D +YG       
Sbjct: 480  ADDDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMTYGKTAFFPG 539

Query: 522  ---------LRINAD-ASATGISKQSNYELV------------ELPGCKGIWT--VYHKS 557
                     +R   D   A G  +  +  L+            + P  +G WT  V    
Sbjct: 540  SEDAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFDFPEARGFWTTRVQKTI 599

Query: 558  SRGHNADS-SRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTESVDYF----- 606
            ++    D  + +A  +D     +Y  ++I++ +      ET+D+        +       
Sbjct: 600  AKSLQGDKGANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGAGFEALSGTEF 658

Query: 607  --VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLS 663
                G TI AG +    R+IQV     R  DG   ++Q L     + E+G+      V+S
Sbjct: 659  DPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETGA---EPRVIS 713

Query: 664  VSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK----GPE 719
             SI DPY+LL   D SI +           +        S K +S C LY D      P 
Sbjct: 714  ASIVDPYLLLLREDSSILVAQITNHNELEELDKEDETIVSTKWLSGC-LYKDSRGLFAPV 772

Query: 720  PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK 779
               + TST   +   +  AI          G+++  +C       ++ +PN +    V  
Sbjct: 773  QTDKGTSTSESVFLFLLNAI----------GELHVRIC-------VYALPNLSKSIYV-- 813

Query: 780  FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
              +G ++I           S    + ++  GT     E +  + V +L     ++ H   
Sbjct: 814  -AAGLSYI----------PSLLSADYTARRGTS---PETLTEILVADLGDSTSASAH--- 856

Query: 840  FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
             L     +  +  Y+ +   G E   K D     + SL    VS S L     +++P++A
Sbjct: 857  -LILRHANDDMTIYEPFRIGGQEE--KED----LANSLFFKKVSNSHL-----AKSPVEA 904

Query: 900  YTREETPHGAPCQRITIFK---NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVA 956
               E         R+   +   NI G+   FL G+ P + +   +       L    +  
Sbjct: 905  AEDEAVQE----NRVIPLRACDNIGGYSTVFLPGASPSFILKSSKSTPKVIGLQGLGVNG 960

Query: 957  FTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNL 1016
             +  H   C  GFIY  S+G  ++ Q P  +         + +P+      + +     +
Sbjct: 961  MSSFHTEGCERGFIYADSKGCARVTQFPDAANVAELGVSVRKVPIDTAVSHVAWHPNMEV 1020

Query: 1017 YPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEY-EVRILEPDRAG 1075
            Y   V+   L+P           E+    D H   + +      ++E+  +++  P    
Sbjct: 1021 Y--AVASSKLEPF----------ELPKDDDYHKEWAKEECPMPPMKEHGSIKLYSPIT-- 1066

Query: 1076 GPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
              W       ++  E A+ ++ + L  +  TKE   L A+GTA ++GED+  RGR+L+F
Sbjct: 1067 --WNVIDEFELEQYEVAMCMKTLLLEVSEETKERRMLFAVGTAILRGEDLPVRGRILVF 1123


>gi|156040479|ref|XP_001587226.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980]
 gi|154696312|gb|EDN96050.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 1447

 Score =  128 bits (322), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 184/745 (24%), Positives = 307/745 (41%), Gaps = 153/745 (20%)

Query: 57  NLVVTAANVIEIYV-----VRVQEEGSKES---KNSGETKRRVL-MDGISAA-------- 99
           NLVV  A++++I+      V + E   K+S   K+   T  R    DG+ A+        
Sbjct: 28  NLVVAKASLLQIFTTKTVSVDLDELSGKDSSTVKDVTSTDPRAHDEDGVEASFLGADSIL 87

Query: 100 ---------SLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
                     L L+  Y L G V SL     I S+ G +      ++++ F+DAK+S++E
Sbjct: 88  PRSELARTTKLVLIAEYNLSGTVTSLVRVKTISSKTGGE------ALLVGFKDAKLSLVE 141

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           +D    G+   S+H +E  E                + VDP  RC  +      + IL  
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201

Query: 207 SQGGSGLVGDED--------------TFGSGGGFSARIESSHVINLRDLDMKHV--KDFI 250
            Q     + D D              +    G  +    SS V+ L  LD   +      
Sbjct: 202 KQDEDVNMDDWDEELDGPRPAKISQKSAAENGILATPYGSSFVLRLSSLDPSLIFPIHLE 261

Query: 251 FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
           F++ Y EP   IL       +  +  +  H T M+  L I    K    I S   LP+D 
Sbjct: 262 FLYEYREPTFGILSSTMAPSSALLQERKDHLTYMVFTLDIHQ--KASTTILSVGGLPYDL 319

Query: 309 YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           + ++ +  P+GG L+VGAN  IH      +  +A+N +A    +   L +S  ++ L+  
Sbjct: 320 FMIVPLAPPVGGALLVGANELIHIDQAGKANGVAVNMFAKQCTNFSLLDQSDLALRLEGC 379

Query: 368 HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLT---SDITTI 418
               L  +N   L+   +GD+ +L+   DGR V  L + + +      +LT   S ++++
Sbjct: 380 KIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGDILTGAASCVSSL 439

Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
           G    F+GS + DS+++ ++  SG                   PS ++ R  SS A+ D+
Sbjct: 440 GAGALFVGSEVSDSVILGWSRKSGQ------------------PSRRKSRLDSS-AIADV 480

Query: 479 ----------------VNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
                           + G+  ++  +A+N T S  K   ++F++ DS+VNI P+ + ++
Sbjct: 481 DEAMLDEEDLEDDDDDLYGDGPTISPTAANVTASNSKAGDYTFSIHDSMVNIAPITNITF 540

Query: 521 G-----------LRINADAS------ATGISKQSNYELV------------ELPGCKGIW 551
           G           L++N   S      A G  K  +  ++            ELP  +GIW
Sbjct: 541 GEVALSSDKEEELKLNGVQSELQLLAAVGREKGGSLAVINRNIQPNVIGRFELPEARGIW 600

Query: 552 TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
           T+  K  + +G   +  +         D +Y   +I+S   EA   + E+A     +   
Sbjct: 601 TMSAKKPAPKGLQVNKEKTVIGGDYGVDAQYDRLMIVSKASEAEDAIDESAVYALTNAGF 660

Query: 598 EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
           E     ++    G TI AG L    RVIQV +   R  DG   + Q L     + E+G+ 
Sbjct: 661 EALSGTEFEPAAGSTIEAGTLGNGMRVIQVLKSEVRSYDGDLGLAQILPM--LDDETGA- 717

Query: 656 SENSTVLSVSIADPYVLLGMSDGSI 680
                ++S S ADP++LL   D SI
Sbjct: 718 --EPKIISASFADPFLLLIRDDASI 740



 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 56/266 (21%), Positives = 113/266 (42%), Gaps = 21/266 (7%)

Query: 872  STSRSLSVSNVSASRLRNLRFSRTP-LDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
            STS +L  S +   ++ N   ++ P + A  + +       + +    N+ G+   F+ G
Sbjct: 879  STSPNLLSSTLQFLKIHNTHLAQAPDVSAEEQADETQQTSDKPMRAVSNLGGYSVVFMPG 938

Query: 931  SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
              P + +   + L     L    +   +  H   C+ GFIY  ++GI+++ Q P  +T+ 
Sbjct: 939  GSPSFIVKSSKTLPKVLSLQGTGVRGLSSFHTEGCDRGFIYADTEGIVRVAQFPPTTTFA 998

Query: 991  NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNL 1050
            +     + + +    H + Y +    Y +  S               D E+    D+H  
Sbjct: 999  DIGMALRKVEIGEDVHAVAYHSPLQTYVIGTST------------FTDFELPKD-DDHRR 1045

Query: 1051 S--SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE 1107
            S    D+    ++E+  ++++ P      W    TI ++  E    ++ + L  +  T E
Sbjct: 1046 SWQEEDIAFKPSIEKSSLKLISPVN----WSVIDTIELEPCEVITCIKTMNLVVSEVTNE 1101

Query: 1108 NETLLAIGTAYVQGEDVAARGRVLLF 1133
             + LL +GTA  +GED+A  GR+ ++
Sbjct: 1102 RKPLLVVGTAITKGEDLATTGRLYVY 1127


>gi|336463425|gb|EGO51665.1| hypothetical protein NEUTE1DRAFT_89273 [Neurospora tetrasperma FGSC
           2508]
          Length = 1437

 Score =  127 bits (318), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96  DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + +S   + L+      L 
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      ++    I+              D   D + GEE 
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
           +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567

Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
              A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740



 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 80/323 (24%), Positives = 139/323 (43%), Gaps = 39/323 (12%)

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
            KE++  + V +L        H  P+L     +  +  YQ Y  +     + +  P S S 
Sbjct: 791  KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
                  +   ++ N  F++ P +    ++ PH A    P +R +   NISG+   FL GS
Sbjct: 841  ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
             P + +   +       L    + A +  H   C HGFIY  + GI ++ Q+P+ S+Y  
Sbjct: 892  SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
                 K IP+      + Y      Y  +V    ++P      L  D +   +    N++
Sbjct: 952  LGLSVKKIPVGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1110
               +     V+   +++L    +G  W    T+ M+  E  L V  + L  + +T E + 
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056

Query: 1111 LLAIGTAYVQGEDVAARGRVLLF 1133
            L+A+GTA ++GED+  RGRV +F
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVF 1079


>gi|350297359|gb|EGZ78336.1| protein cft-1 [Neurospora tetrasperma FGSC 2509]
          Length = 1437

 Score =  127 bits (318), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96  DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + +S   + L+      L 
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      ++    I+              D   D + GEE 
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
           +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEKERNSEGVRSDLQ 567

Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
              A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740



 Score = 95.5 bits (236), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 80/323 (24%), Positives = 139/323 (43%), Gaps = 39/323 (12%)

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
            KE++  + V +L        H  P+L     +  +  YQ Y  +     + +  P S S 
Sbjct: 791  KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
                  +   ++ N  F++ P +    ++ PH A    P +R +   NISG+   FL GS
Sbjct: 841  ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
             P + +   +       L    + A +  H   C HGFIY  + GI ++ Q+P+ S+Y  
Sbjct: 892  SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
                 K IP+      + Y      Y  +V    ++P      L  D +   +    N++
Sbjct: 952  LGLSVKKIPVGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1110
               +     V+   +++L    +G  W    T+ M+  E  L V  + L  + +T E + 
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056

Query: 1111 LLAIGTAYVQGEDVAARGRVLLF 1133
            L+A+GTA ++GED+  RGRV +F
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVF 1079


>gi|167526060|ref|XP_001747364.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774199|gb|EDQ87831.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1324

 Score =  126 bits (316), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 92/331 (27%), Positives = 153/331 (46%), Gaps = 29/331 (8%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LEL   +RL+G   ++  ++       + RD+++L+F DAKIS ++F+ S   L    + 
Sbjct: 34  LELAASFRLNGVATAMVAITL----PKQLRDTVVLSFADAKISAIQFEPSTRTLITQKLI 89

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
             E  E ++  +        P+++ DP  RC G LVYG +++I+ A              
Sbjct: 90  NLEI-EAVYGSKVNADLP--PVLQADPLHRCIGALVYGCRLVIIPAH------------- 133

Query: 221 GSGGGFSARIESS-HVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
                   R      VI+L  L   +   K F F+ GY  P  ++LHE    W GR +  
Sbjct: 134 ----ALQPRTNVQFRVIDLEKLSSPLGQAKSFCFLTGYTTPTALLLHEPRPVWVGRHAVG 189

Query: 278 HHTCMISALS--ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
             +C++SALS  + TT    P +W+  +LP D + L+  P P+GG L+V  N + + +Q+
Sbjct: 190 RDSCVLSALSCELDTTDDFAPTVWAKDSLPSDCFALVPTPQPLGGALIVSPNMVLHTNQA 249

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
           +S A+A+N  A          ++  S+ LD A  T++ +  A+ S ++G L  + +  D 
Sbjct: 250 SSSAVAVNAIAARATGYPHTTQAGLSLNLDNARVTFITSVDAIFSLQSGQLYRIGLRSDD 309

Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLG 426
                    K   S  T D+   G  L   G
Sbjct: 310 GTPAPKRPHKAVASRSTEDLDIYGADLHHRG 340



 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 88/417 (21%), Positives = 158/417 (37%), Gaps = 87/417 (20%)

Query: 739  IDGADGGP-LDQGDIYSVVCY------ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTY 791
             D A+ GP +   D Y+   Y      ESGAL I  VP+    F    F  G + + D  
Sbjct: 661  FDAAELGPSVATSDDYAAPTYWALITTESGALYICTVPDLKIAFHCPSFGDGHSLVWDRL 720

Query: 792  MREALK---DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
              +A+    D     + +  +    G +E I    ++ L   +      RP L A  +D 
Sbjct: 721  PNQAIPQAGDGADAPDEARNDDDAHGSEEYIVETLLIGLGQGQ------RPHLLARTSDH 774

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L Y+ +             PV       V +V+ + +R                    
Sbjct: 775  HLLMYEVF-------------PV-------VPSVTEASVR-------------------- 794

Query: 909  APCQRITIFKNISGHQGFFLSGSRPCW--CMVFRERLRVHPQLCDGSIVAFTVLHNVNCN 966
                R+  F+NI+G  G  ++G RP    C    + + + P   + ++  F  LH  +  
Sbjct: 795  ----RLKPFQNIAGCDGVCVTGPRPLLVACGHQLKAITIVPLALEDAVKTFHPLHMDDVE 850

Query: 967  HGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVL 1026
            +GFIY T  G L     P G   +     ++ + L  T  +I +  +  L  L++  P  
Sbjct: 851  NGFIYFTKAGTLCCATAPDGLMLNRGVLARRAV-LGRTIQKIAFDLDSRLAALLLMEP-- 907

Query: 1027 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHR-TYTVEE-------YEVRILEPDRAGGPW 1078
            +P           E+     N++  S +L   +Y  +E       +++++L P       
Sbjct: 908  RP-----------ELKPSRGNNDPPSNELPNISYRPDEPKALTPFFQLQLLSPKSMKLLP 956

Query: 1079 QTRATIPMQSSENAL-TVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFS 1134
             TR    +     +   VR+ +  N+T K+N   +A+G   ++G+     G V  ++
Sbjct: 957  DTRIEYDLHHHVTSFAAVRLSSSLNSTGKQN--YIAVGVTLLEGQRATTTGFVDFYT 1011


>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1408

 Score =  125 bits (313), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 184/750 (24%), Positives = 304/750 (40%), Gaps = 121/750 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V    ++++Y +     GS   ++  +T+ +        + L LV  Y L G +  L
Sbjct: 28  NLIVAKTTLLQVYNLVNVVYGSSPGQSDEKTRSQY-------SKLVLVAEYALSGTITDL 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    ++I++A  +AK+S++E+D   H +  TS+H +E  + +H+     +
Sbjct: 81  GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137

Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-------------------- 214
            A  P  + VDP  RC  VL +G + + IL   Q G  LV                    
Sbjct: 138 LAACPSHLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMDDFDSDHDDERQIDTNHTA 196

Query: 215 --GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
              DE     G  +     SS V+ +  L+  M H     F++ Y EP   IL+ +    
Sbjct: 197 EERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQVAAS 256

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-I 329
           +  +  +      S  ++    +    + S   LP+D +K++ +P P+GG L+VG+N  +
Sbjct: 257 SALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSNELV 316

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLV 387
           H      + A+ +N +A    S     +S   + L+      L  +N   LL    G + 
Sbjct: 317 HVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNGVMA 376

Query: 388 LLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
           +++   DGR V  + L            +T PS        +G    F GS  GDS+L+ 
Sbjct: 377 VVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SAPVGRGKIFFGSEEGDSILI- 431

Query: 437 FTCGSGTSMLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY----- 487
                G S LS+G K     E G+   D  +         D   D  +  E  LY     
Sbjct: 432 -----GWSRLSAGAKVSPAPETGE---DNVAELSEDEEDDDDDDDEEDAYEDDLYATPVT 483

Query: 488 -GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNY 539
            G    NT S   T    + F + D L N+GP++D + G      + D   +  S  +  
Sbjct: 484 PGINPRNTASMNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYL 543

Query: 540 ELVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDD 573
           ELV   G                            G+ +V+ K  +  +   S  A    
Sbjct: 544 ELVTTQGYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPSQSGSLPANAGS 603

Query: 574 EYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVF 627
            Y  YL++S      + +++V + +    E T + ++   + RTI  G L G  RV+QV 
Sbjct: 604 NYDHYLLLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVL 663

Query: 628 ERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
           +   R  D G  + Q       ++     SE  +V+  S A+PYVL+   D SI LL  D
Sbjct: 664 KGEVRSYDSGLGLAQIYPVWDEDT-----SEERSVMHASFAEPYVLIIRDDSSILLLQAD 718

Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
            S     ++T   I+S+     S +LY DK
Sbjct: 719 ESGDLDEIETDGIIKSTT--WISGSLYQDK 746



 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/216 (23%), Positives = 101/216 (46%), Gaps = 17/216 (7%)

Query: 919  NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
            ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++
Sbjct: 900  DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 959

Query: 979  KICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1038
            ++C+ P  + +D  W  +K I L      + Y +    Y L  S  V   L +      D
Sbjct: 960  RMCRFPRNTHFDGSWAARK-IGLGEQVDSVEYSSSSETYVLGTSQKVDFKLPE------D 1012

Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV 1098
             E+  +  N  +S         +++  V++L P      W    +  +++SE  + V+ +
Sbjct: 1013 DEIHPEWRNEVISFFP-----QIDKGSVKLLNPRT----WSIIDSYQLRTSERVMCVKCL 1063

Query: 1099 TL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
             L  +  T E + ++A+GTA  +GED+AARG + +F
Sbjct: 1064 NLEASEITHERKEMIAVGTALTRGEDIAARGCIYVF 1099


>gi|312077399|ref|XP_003141287.1| hypothetical protein LOAG_05705 [Loa loa]
          Length = 316

 Score =  124 bits (312), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 80/266 (30%), Positives = 132/266 (49%), Gaps = 34/266 (12%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE +   RL   V+S AI        +   DS++L F+DAK+S++  + +   L+  S+H
Sbjct: 62  LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G       P+++VDP  RC  +LV+G  + +L  +  G+ L       
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168

Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+ + L  +D  + +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
            T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337 -SCALALNNYAVSLDSSQELPRSSFS 361
             C ++LN+    +D   + P   F 
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDFK 298


>gi|164429683|ref|XP_964609.2| hypothetical protein NCU02082 [Neurospora crassa OR74A]
 gi|157073577|gb|EAA35373.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 1437

 Score =  124 bits (311), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 154/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96  DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + ++   + L+      L 
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      ++    ++              D   D + GEE 
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
           +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567

Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
              A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740



 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 80/323 (24%), Positives = 139/323 (43%), Gaps = 39/323 (12%)

Query: 816  KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
            KE++  + V +L        H  P+L     +  +  YQ Y  +     + +  P S S 
Sbjct: 791  KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840

Query: 876  SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
                  +   ++ N  F++ P +    ++ PH A    P +R +   NISG+   FL GS
Sbjct: 841  ------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
             P + +   +       L    + A +  H   C HGFIY  + GI ++ Q+P+ S+Y  
Sbjct: 892  SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSYAE 951

Query: 992  YWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
                 K IP+      + Y      Y  +V    ++P      L  D +   +    N++
Sbjct: 952  LGLSVKKIPIGVDTQSVAYHPPTQAY--VVGCNDVEPFE----LPKDDDYHKEWARENIT 1005

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENET 1110
               +     V+   +++L    +G  W    T+ M+  E  L V  + L  + +T E + 
Sbjct: 1006 FKPM-----VDRGVLKLL----SGITWTVIDTVEMEPCETVLCVETLNLEVSESTNERKQ 1056

Query: 1111 LLAIGTAYVQGEDVAARGRVLLF 1133
            L+A+GTA ++GED+  RGRV +F
Sbjct: 1057 LIAVGTALIKGEDLPTRGRVYVF 1079


>gi|50552095|ref|XP_503522.1| YALI0E03982p [Yarrowia lipolytica]
 gi|74634000|sp|Q6C740.1|CFT1_YARLI RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|49649391|emb|CAG79101.1| YALI0E03982p [Yarrowia lipolytica CLIB122]
          Length = 1269

 Score =  124 bits (311), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 235/1074 (21%), Positives = 412/1074 (38%), Gaps = 184/1074 (17%)

Query: 98   AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
            A  LEL+  Y L G V  +  +     DN    DS+ ++ + AK  ++ ++ S   +   
Sbjct: 51   APRLELITEYYLDGTVTGVTRIKT--IDN-YDLDSLYISVKHAKAVIVAWNASSFTIDTK 107

Query: 158  SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
            S+H +E  + L      E       V  +       +L    +M  L   + G   + D+
Sbjct: 108  SLHYYE--KGLVESNFFEPECSSVAVSDEANSFYTCLLFQNDRMAFLPIIEKG---LDDD 162

Query: 218  DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
            +   SG  F    + S ++    LD  +++V D  F+H Y E  M IL + +  W G  +
Sbjct: 163  EMPESGQVF----DPSFIVKASRLDKRIENVMDICFLHEYRETTMGILFQPKRAWVGMKN 218

Query: 276  WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
                T   + +S+    K   +I +   LP DA K++ +P+P+GG L++ ANTI Y   S
Sbjct: 219  ILKDTVSYAIVSVDVHQKNSTVIGTLNGLPVDAQKVIPLPAPLGGSLIICANTILYIDSS 278

Query: 336  ASCALALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQN--DVALLSTKTGDLVLLTV 391
            AS    + N     +S   + R  S+  + L+ A   ++Q   + ALL T+ G    L  
Sbjct: 279  ASYTGVMVNNTHRQNSDLIVSRDQSTLDLRLEGAEVCFIQELGNTALLVTEDGQFFSLLF 338

Query: 392  VYDGRVVQRLDLSKTNPS--VLT--SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
              DGR V  L+L    P   +L+  S +    +   FLGSR GDSLLV++  G   S   
Sbjct: 339  NKDGRRVASLELRPIEPDNFILSQPSSVAAGPDGTIFLGSRAGDSLLVKWYHGEPESQPE 398

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVR 506
              L                          D  N  +  LYG  +  TE +  +     + 
Sbjct: 399  ETL--------------------------DDGNESDDDLYGGDTAQTEDTTNRPLKLRLA 432

Query: 507  DSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV--------------ELPGCK 548
            D ++ +GP++  + G    +      + TG+   S   ++              ++PG +
Sbjct: 433  DRMLGMGPMQSLALGKNRGSQGVEFVTTTGVGANSALAILTSALMPYKRKSLYKDMPGGQ 492

Query: 549  GIWTVYHK-SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
              W+V  +    G  A S       D  ++YL     A   V+E   L T+  ++  +FV
Sbjct: 493  -FWSVPVRFEEEGEVAKSRTYVVSSDSENSYLYYVDAAG--VIEDVSLSTKKKKTKKHFV 549

Query: 608  QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
               T    +      ++QV      I D                  S  + +T +   + 
Sbjct: 550  SNVTTIFSSSMLDSALLQVCLETVNIYDAKI---------GQPHKYSLPQGTTAVEARVL 600

Query: 668  DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
              YVL+ +SDG +++L        VS+     +++++  +   +     G        +T
Sbjct: 601  GNYVLVLLSDGQVKILEA------VSINKRPFLKAAQVSIEPASESKAIG------IYAT 648

Query: 728  DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
            D+ L+ G         G P        VVCY  G+L             +    S    I
Sbjct: 649  DSSLTFGAPSKKRTRQGSPAQDSRPVVVVCYADGSL------------LLQGLNSDDRLI 696

Query: 788  VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
            +D           ++++   +E  GQ        +++V++A+      H       +LT 
Sbjct: 697  LDA----------SDLSGFIKEKDGQLYDA---PLELVDIALSPLGDDHILRDYLVLLTP 743

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
              ++ Y+ Y +                               LRF +  L     E TP 
Sbjct: 744  QQLVVYEPYHYND----------------------------KLRFRKIFL-----ERTPT 770

Query: 908  GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD----GSIVAFTVLHNV 963
                +R+T    I+G     ++G       +  + L   P+L +       VAFT     
Sbjct: 771  INSDRRLTQVPLINGKHTLGVTGET---AYILVKTLHTSPRLIEFGETKGAVAFT----- 822

Query: 964  NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
            + +  F Y+T  G +  C+     + +  WPV+ V     T  ++TY    ++Y      
Sbjct: 823  SWDGKFAYLTQAGEVAECRFDPSFSLETNWPVKHVQLCGETISKVTYHETMDVY------ 876

Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSV--DLHRTYTVEEYEVRILEPDRAGGPWQTR 1081
             V+     V  ++ D+      D+  + S+  D+    T +   +RI+ P      W   
Sbjct: 877  -VIATHKTVPHVVRDE------DDEVIESLTPDIMPATTYQG-AIRIVNP----YSWTVI 924

Query: 1082 ATIPMQ-SSENALTVRVVTLFNTTTK-ENETLLAIGTAYVQGEDVAARGRVLLF 1133
             +   +  +E AL    V L  +  K +   ++A+GT+ ++GED+AARG + LF
Sbjct: 925  DSYEFEMPAEAALCCESVKLSISDRKSQKREVVAVGTSILRGEDLAARGALYLF 978


>gi|392572878|gb|EIW66021.1| hypothetical protein TREMEDRAFT_70300 [Tremella mesenterica DSM
           1558]
          Length = 1408

 Score =  123 bits (309), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 150/656 (22%), Positives = 279/656 (42%), Gaps = 94/656 (14%)

Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
           L L+C + LHG +  LA L   +   D     D ++++F+DAK+++LE+  S   +   S
Sbjct: 117 LHLLCQHTLHGWITGLAPLRTIESSVDG---LDRLLVSFKDAKMALLEW--SRGDIATVS 171

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
           +H +E  +   +  G   F   PL++ DP  R   + +    + IL   Q  S L   E+
Sbjct: 172 LHTYERCQ--QMVTGDLQFYT-PLLRSDPLSRLAVLTLPEDSLAILPVLQEQSDLDPLEN 228

Query: 219 TFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
            F     +S     S V++L D+   +K+++D +F+ G+  P + +L+    TWAGR   
Sbjct: 229 -FTKDAPYSP----SFVLSLADVAPTIKNLQDLLFLPGFHSPTLAVLYSPYHTWAGRYHS 283

Query: 277 KHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGV-LVVGANTIHYHSQ 334
           +  T  +   +   T    +PL+ S   LP D+  ++A P+ +GGV LV     +H    
Sbjct: 284 QRDTFCLEVRTFDITAGGSYPLLTSVSGLPSDSLYIVACPAELGGVVLVTTTGLLHIDQS 343

Query: 335 SASCALALNNY-----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
             + A ++N +      +  D S E    S  + L+ + + ++     LL  + GD+  +
Sbjct: 344 GRTVATSVNAWWSHITTLPCDKSSE----SRKISLEGSKSVFVTERDMLLVLQNGDVHQV 399

Query: 390 TVVYDGRVVQRLDLSKTNPSV-LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
               +GR +  + + + + +V   S + T GN   F+G   GDSLL         +    
Sbjct: 400 RFEMNGRAIGAIKVDEQSSNVPAPSSMVTTGNQAIFVGCAEGDSLLANVDIKRAVA---- 455

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE----SAQKTFSFA 504
            +++    IEA+A           D  +D+    ++ L   A+N  +    +       +
Sbjct: 456 -IEDRKPAIEAEA---------EVDWDEDLYGDIDVPLTNGATNGAKYQAITGPANIVLS 505

Query: 505 VRDSLVNIGPLKDFSYGLRINADASAT--------GISKQSNY-------------ELVE 543
             D L  +G + D  +G+    + + T        G SK+S +                E
Sbjct: 506 PADVLTGVGKIVDMEFGIASTDEGTRTYPQLVTIGGGSKRSTFNAFRRGIPISKRRRFNE 565

Query: 544 LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA---RTMVLETADLLTEVT 600
           L   + +W +  +   G +     + +  ++    ++ S EA   R   L       ++ 
Sbjct: 566 LFNTESVWFLPIQRPSGQH-----LKSIPEDRRTTMLFSSEATQTRIFSLSAKPNPEQIG 620

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
                 + G+++  G  F R  V+ V +    +LD    TQ             G+E   
Sbjct: 621 R-----ISGKSLTVGPFFQRSNVLVVTQTEVLLLDSDGKTQ----------QSIGNEGEE 665

Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHD 715
           ++S SI+DPYV++   +GS  + VGD     +S V+ P+  +S + P  +  ++ D
Sbjct: 666 IVSASISDPYVVIRRVNGSGSMFVGDTVARQLSEVKIPS--DSLQPPYQAIEVFSD 719


>gi|402219312|gb|EJT99386.1| hypothetical protein DACRYDRAFT_17537 [Dacryopinax sp. DJM-731 SS1]
          Length = 1620

 Score =  123 bits (308), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 162/667 (24%), Positives = 267/667 (40%), Gaps = 112/667 (16%)

Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
           L LV  +R+HG V  L  +     G D     D ++++F+DAKI++LE+ D+I+ L   S
Sbjct: 136 LHLVREHRMHGFVTGLEKVRTLASGEDGM---DRLLVSFKDAKIALLEWSDAIYDLSTVS 192

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD-- 216
           +H +E    +      E     PL++ DP+ RC  +L+    + IL   Q     + D  
Sbjct: 193 LHTYERSSQVSTSEASE---HRPLLRADPESRCAALLLPKDALAILPFVQRTGLDLADPA 249

Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
            D       ++     S+V  L D D  ++HV DF F+  +  P + IL++    W GR+
Sbjct: 250 RDKEREHQPYTP----SYVFPLSDADDTLRHVLDFCFLPSFHTPTLAILYQPAQNWTGRL 305

Query: 275 SWKHHTCMISALSISTTLK----------QHPLIWSAMNLPHDAYKLLAVP--SPIGGVL 322
           S       ++ +++    K             +I     LP+DA+ LL     S  GGV+
Sbjct: 306 SQTKDNTSLAIVTLDLVGKGAAAGGGAGGGGAVISRTHGLPYDAFSLLPAREGSTFGGVV 365

Query: 323 VVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--------LDAAHATWLQ 373
           V+  N++ H         LA + +     S+   P  +F+ E        L+ +   W  
Sbjct: 366 VLAGNSVLHVDPAGRIVGLAASGWHAQ-SSALRFPLWAFTAEEGETEERKLEGSRLCWAG 424

Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLD----LSKTN-PSVLT-------SDITTIGNS 421
               +L    G    L V  +GR V  L     L +T+ P+VL          +   G  
Sbjct: 425 EQQLILVGAQGWARELKVGVEGRNVSSLSAGRRLGRTSAPAVLCPVGEQSGRALKPTGRD 484

Query: 422 LFFLGSRLGDSLLVQFTCGSG--TSMLSSGLKEEF--GDIEADAPSTKRLRRSSSDALQD 477
           L +L S  G S+L+Q   G      +  +G ++E    D+E DA S K      +D L D
Sbjct: 485 LVWLASEAGQSVLLQVHKGEPRVEEVKPNGEEKEIEGEDMEIDADSDK------NDDLAD 538

Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
           +     L    ++      A    +  V D+L   G + D S+ L        +G  + +
Sbjct: 539 IYGDSGLPAAAASGVTAGPALPWLTLEVLDALQGHGQIADMSFALSFR-----SGPDRPT 593

Query: 538 NYELVELP-GCKGIWTVYHK--------------SSRG---------------HNADSSR 567
              +   P G +G WTVY                 +RG                  +   
Sbjct: 594 PKLVCSTPEGERGAWTVYENGLPIRVKRRVPAVAGTRGIWSLRVRRGDRARRGGRRERGE 653

Query: 568 MAAYDDEYHAYLIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
               D E    LI+S +A       RT+ +++   L  ++      +   T+AAG  F  
Sbjct: 654 REWADGEERDNLIVSTDATPSPGISRTITVDSRGELQIISR-----LPALTLAAGVFFSH 708

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
             V+QV      +LDG    ++L     N       E S ++   + DP+V++   +GS+
Sbjct: 709 TCVMQVTPDSLHLLDGD--GKELQVLKDNE---GNKEASPIIKACVEDPWVVVTRENGSV 763

Query: 681 RLLVGDP 687
            L +GDP
Sbjct: 764 ALYLGDP 770


>gi|452979579|gb|EME79341.1| hypothetical protein MYCFIDRAFT_104419, partial [Pseudocercospora
            fijiensis CIRAD86]
          Length = 1342

 Score =  122 bits (307), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 247/1126 (21%), Positives = 424/1126 (37%), Gaps = 232/1126 (20%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV  Y L G V SLA       DN+   D+II+AF DAK+S++E+D   H +   S+H
Sbjct: 46   LSLVAEYPLAGTVISLA--RTKPRDNASGGDAIIIAFRDAKLSLVEWDPENHRISTISLH 103

Query: 161  CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
             +E    +    G        ++ VDP  RC  +     Q+ IL     G  L G+E+  
Sbjct: 104  YYEGDNVITPPFGPTLAESESILTVDPSSRCAALKFGARQLAILPFRHFGDELAGEEEED 163

Query: 221  GSG----GGFSARIESSH--------------VINLRDLD--MKHVKDFIFVHGYIEPVM 260
            G         S R ES+H              V+ L  LD  + H     F+H Y EP  
Sbjct: 164  GFENEPMSAVSKRRESTHLNGEEEQTPYKASFVLPLTALDPTLSHTVHLAFLHEYREPTF 223

Query: 261  VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
             IL          +  +      +  ++    +    + +   LP   +K+  +P PIGG
Sbjct: 224  GILSAPMEPSNALLEERKDVLTYTVYTLDLEQRASTNLITVPKLPSTLWKVKPLPLPIGG 283

Query: 321  VLVVGANTIHYHSQSASC-ALALNNYA-------VSLDSSQELPRSSFSVELDAAHATWL 372
             L+VG N + +  QS    A A+N +A       +S  S   L     S+E     +  L
Sbjct: 284  ALLVGTNELVHVDQSGKANATAVNEFAKLESDFGMSDQSHLNLKLEDCSIETIDPKSGQL 343

Query: 373  QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-------TNPSVLTSDITTIGNSLFFL 425
                 LL T  G L ++     GR +  ++++        T+ S   S I  + N   F+
Sbjct: 344  -----LLVTSDGALAIIEFKLLGRSISAINVTPVTEDNGVTSLSAAPSCIANLANGSVFI 398

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD------IEADAPSTKRLRRSSSDALQDMV 479
            GS  G S L+ ++  +          +  G        +A       L  ++ +A +  V
Sbjct: 399  GSEDGASSLMGWSQPTAPLTRKRSHAQMLGKDGDEEDEDAIEEDDDDLYDAAPEAKKRAV 458

Query: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD-----ASATGIS 534
            +  EL           S+   + F +RD L ++GP+     G +  +      A+ATG  
Sbjct: 459  SDTELG----------SSNAAYQFEIRDHLQSLGPIHRMCVGRQGKSSDKLQLAAATG-R 507

Query: 535  KQS------NYELVELPGCKGIWTVYHKSSRGHNADSS---RMAAYDDEYHAYLIISLEA 585
            KQS      N ++V  PG         ++SR  NA S+   R     DE       +L+ 
Sbjct: 508  KQSGRLTLLNRDVVPTPG---------RASRFENAKSAWAVRAHQAGDES------TLDN 552

Query: 586  RTMVLETADLLT-EVTESVDYFV--------------QGRTIAAGNLFGRRRVIQVFERG 630
            +  V E A+    E++ + ++FV              +G T+    L   + ++Q  ++ 
Sbjct: 553  KLFVFEGANTKAYEISSADEHFVEDRYPEHAKSEWESEGETLEVVALADGKIIVQFRKQE 612

Query: 631  ARILDGSY-MTQDLSFGPSNSESGSGSENS-TVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
             R  D +  M Q L   P   E    +EN   ++ +++ DPYVL+   D SI++L     
Sbjct: 613  VRTYDANLAMNQIL---PMEDE----AENELNIVHIAVCDPYVLVIRDDSSIQIL----- 660

Query: 689  TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
                SVQ      +  +P+ +     +K             WL+  +         G L 
Sbjct: 661  ----SVQG-----NELEPLEAEGSVAEK------------KWLTGSLY-------AGTLT 692

Query: 749  QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI-VDTYMREALKDSETEINSSS 807
            QG     +    G L  F +P+   +F +         I VD   R A            
Sbjct: 693  QGSAAVFLLNADGGLHAFALPDLQPLFAIPTLPHLPPVIAVDAAQRRA------------ 740

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
                  G +E +  + V +L         ++P+L        ++ Y+ + +         
Sbjct: 741  ------GTRETLTEVLVSDLGQHGV----TQPYLVLRTAMDDVVLYEPFHY--------- 781

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
              P ++ R     ++   R R + FS  P  + +  E+    P    ++   I  +    
Sbjct: 782  --PQTSGRKSWHQDL---RFRKVPFSHIPKYSESIAESQSARPPPLKSV--KIDTYSAIA 834

Query: 928  LSGSRPCWCM----VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
            + G+ PC  +       + L +        +     ++ V C +GF  + +   L+  QL
Sbjct: 835  IPGAPPCLLLKEPSTLPKVLEIRQSAELNRLSMLCPINRVGCENGFFMINADEELEEQQL 894

Query: 984  PSGSTYDNYWPVQKVIPLKATPHQ------ITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
            P  + Y   W V +V P+   P+Q      I Y  E+ LY       V+    +V     
Sbjct: 895  PLNTWYGTGWSVHQV-PI-GHPNQIEDVRRIAYHEERGLY-------VVATCREVDFYFA 945

Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP------DRAGGPWQTRAT----IPMQ 1087
            +++  H   +      D+     V +Y V ++        D    P+    T    + ++
Sbjct: 946  EEDGRHPEQD------DITLRPKVPQYNVHLISAISHHIIDTVHMPYLAAITDLQVMMLE 999

Query: 1088 SSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            +SEN             T E + L+ +  A  +GED+ A+G + ++
Sbjct: 1000 ASEN-------------THEQKPLVVVSAAAQRGEDMPAKGTLYVY 1032


>gi|403170487|ref|XP_003329830.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375168746|gb|EFP85411.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 1513

 Score =  122 bits (306), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 173/771 (22%), Positives = 308/771 (39%), Gaps = 167/771 (21%)

Query: 48  SKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKE---SKNSGETKRRVLMDGISAASLELV 104
           SK    P+ NL+V  + +++++ + + E+   E   ++N  E K +          L  +
Sbjct: 37  SKTRPRPITNLIVARSTLLQVFELCLVEDDQAENNHTRNHHELKNK-------NYKLFHL 89

Query: 105 CHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
           C +RLHG V  L  L+          D ++++F+DAK+++LE+ +S   L   S+H FE 
Sbjct: 90  CEHRLHGRVTGLQRLTTLDTQEDGL-DRLLVSFQDAKMTLLEWSNSAADLVPISLHTFEK 148

Query: 165 -PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
            P+       R+   +   ++VDP  RC  +L+    + +L   Q    L    D+ G  
Sbjct: 149 LPQITQGDLPRDFQGQ---LEVDPLSRCAVLLLPQATLAVLPFFQDQLDL----DSLGLS 201

Query: 224 GGFSARI------------ESSHVINLRD----------LDMKH--VKDFI---FVHGYI 256
           GG  + +             SS +++             LD +H  +K  I   F+ G+ 
Sbjct: 202 GGLKSALGSEQQRFQTFPYASSFILDFNQQLLNHLPPSSLDSQHRPIKSVIALKFLPGFS 261

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           EP + +L++ + TW+ R+    +T  +  L++       P+I    NLP+DA+ L+A P 
Sbjct: 262 EPTLAVLYQSQYTWSARLENHANTAALIVLTLDLGSNHFPIISHTTNLPYDAHGLVACPK 321

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS------------------ 358
            + GVLV+ A+ I +  QS+       N  V   S  ++PR                   
Sbjct: 322 ELAGVLVLCADMILHVDQSSKIIGLATNGWVKHTSELQIPRQDTVRLITPTNKISGHRST 381

Query: 359 ---------------------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                                         V L+ A   + + D A +  +TG++  L  
Sbjct: 382 TNKSDERPEDLEDGEEQDESGVPEGHEKLLVRLENAKIVFSRADRAFVFLRTGEVFSLQF 441

Query: 392 VYDGRVVQRLDLSKTN-PSVLTSDITTIGNSLFFLGSRLGDSLL-----------VQFTC 439
           + DGR + +L L K +  S++ S +  + N   F+GS  GDS L                
Sbjct: 442 LRDGRTLTKLVLEKLDLLSIIPSTVLKVNNECLFVGSMAGDSALYILDHLRPRSSSDDDN 501

Query: 440 GSGTSMLSSGLKE-----------EFG-DIEADAPSTKRLRRSSSDALQD--MVNGEELS 485
             G  + SS + +           +F  DI  D   T  +RR+    L D    NG +  
Sbjct: 502 DDGHQLPSSSIIQPDKAAKNQSSLDFDEDIYGDRTETDPVRRTDHSQLYDDRPSNGADDG 561

Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
             G+ ++  E   +     + D +   GP++DF+         +ATG+        +EL 
Sbjct: 562 RPGAGAHLAEPFLR-----LGDVIQAHGPIRDFTM--------AATGVENMP----LELL 604

Query: 546 GCKGI-----WTVYH-----KSSRGHNADSSRMAAYDDEYHAYLII-----SLEARTMVL 590
            C G       TV+H     +  R  + +S   +  +  +   L++     S E + + +
Sbjct: 605 ACTGTGDLGGLTVFHREIPLRKRRKLSFESPSASHINALFFTSLVVESGGLSEERKVVWM 664

Query: 591 ETADLLTEVT---ES-----VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
             +   TE+    ES     ++ F + +T+A    FG++ V+QV     ++   S     
Sbjct: 665 GRSGPRTEIATYGESGELSLINTFPE-KTLAVSPFFGKQFVVQVTNTAIKLFTSSL---- 719

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
                  ++         +L  SI D YV+L    G   +  GD  + T+S
Sbjct: 720 -----EEAQVIQPEPAVKILRASIVDDYVMLETHCGLKLIYQGDHDSKTLS 765


>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 1377

 Score =  122 bits (305), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 181/747 (24%), Positives = 302/747 (40%), Gaps = 110/747 (14%)

Query: 53  GPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           G V    V    ++++Y +     GS   ++  +T+ +        + L LV  Y L G 
Sbjct: 4   GAVAAFRVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGT 56

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           V  L  +     D+    ++I++A  +AK+S++E+D   H +  TS+H +E  + +H+  
Sbjct: 57  VTDLGRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISP 113

Query: 173 GRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GDEDTFGSGGGFSAR 229
              + A  P  + VDP  RC  VL +G + + IL   Q G  LV GD   F S      +
Sbjct: 114 WTPNLAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGD---FDSDHDEERQ 169

Query: 230 IESSHVINLRD--------------------------LDMKHVKDFIFVHGYIEPVMVIL 263
           I+++H    RD                            M H     F++ Y EP   IL
Sbjct: 170 IDTNHTAEERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGIL 229

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
           + +    +  +  +      S  ++    +    + S   LP+D +K++ +P P+GG L+
Sbjct: 230 YSQVAASSALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALL 289

Query: 324 VGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLS 380
           VG+N  +H      + A+ +N +A    S     +S   + L+      L  +N   LL 
Sbjct: 290 VGSNELVHVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLV 349

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRL 429
              G + +++   DGR V  + L            +T PS        +G    F GS  
Sbjct: 350 LLNGVMAVVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEE 405

Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
           GDS+L+ ++  S  + +     E   D  A+    +       DA +D +    ++  G 
Sbjct: 406 GDSMLIGWSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGI 463

Query: 490 ASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELV 542
            S NT S   T    + F + D L N+GP++D + G      + D   +  S  +  ELV
Sbjct: 464 NSRNTASVNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELV 523

Query: 543 ELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYH 576
              G                            G+ +V+ K  +      S        Y 
Sbjct: 524 TTQGYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYD 583

Query: 577 AYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERG 630
            YL++S      + +++V + +    E T + ++   + RTI  G L G  RV+QV +  
Sbjct: 584 HYLLLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGE 643

Query: 631 ARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPST 689
            R  D G  + Q       ++     SE  +V+  S ADPYVL+   D SI LL  D S 
Sbjct: 644 VRSYDSGLGLAQIYPVWDEDT-----SEERSVVHASFADPYVLIIRDDSSILLLQADESG 698

Query: 690 CTVSVQTPAAIESSKKPVSSCTLYHDK 716
               ++T   IES+     S +LY DK
Sbjct: 699 DLDEIETDGIIESTT--WISGSLYQDK 723



 Score = 69.7 bits (169), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 48/216 (22%), Positives = 100/216 (46%), Gaps = 27/216 (12%)

Query: 919  NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
            ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++
Sbjct: 877  DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 936

Query: 979  KICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1038
            ++C+ P  + +D  W  +K I L      + Y +    Y L  S                
Sbjct: 937  RMCRFPRNTHFDGSWAARK-IGLGEQVDSVEYSSSSETYVLGTS---------------- 979

Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV 1098
            Q+V  ++   +    ++H  +  EE  V++L P      W    +  ++++E  + V+ +
Sbjct: 980  QKVDFKLPEDD----EIHPEWRNEE-SVKLLNPRT----WSIIDSYQLRTAERVMCVKCL 1030

Query: 1099 TL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
             L  +  T E + ++A+GTA  +GED+AARG + +F
Sbjct: 1031 NLEASEITHERKEMIAVGTALTRGEDIAARGCIYVF 1066


>gi|392585051|gb|EIW74392.1| hypothetical protein CONPUDRAFT_133073 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 1490

 Score =  121 bits (303), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 196/913 (21%), Positives = 350/913 (38%), Gaps = 130/913 (14%)

Query: 57  NLVVTAANVIEIYVVR-------VQEEGSKESKN---------SGETKRRVLMDGI---- 96
           NLV   +N+I IY VR        Q E  KE K+          GE +     DG     
Sbjct: 40  NLVTARSNIIRIYEVREDAASLSSQVEAEKERKSHVRKGTEAVEGEVEMDTGGDGWVNMG 99

Query: 97  ----------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLE 146
                     +      V  + +HG V  +  + +  + N  R D ++++F+DAKI++LE
Sbjct: 100 SVKSTSSGPPTVTRFHFVREHVVHGIVTGMDCI-RTISSNEDRMDRLLVSFKDAKIALLE 158

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           + D+ H L   S+H +E  E   L        R  L +VDP  RC  + +    + IL  
Sbjct: 159 WSDAAHDLITVSIHTYERSE--QLMSIDAPLFRSSL-RVDPLSRCAALSLPNNALAILPF 215

Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVIL 263
            Q  +     E    + G        S +++L    D  + +V DF F+ G+  P + +L
Sbjct: 216 YQTQAEFDVIEGEGETEGMRDVPYSPSFILDLPVDVDSSLCNVIDFAFLPGFNNPTLAVL 275

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP------ 317
            + E TWAGR+     T ++   ++       P++ +   LP DA+ L     P      
Sbjct: 276 CQSEQTWAGRLKEHRDTTLVVTFTLDLLSCTFPILSTLRGLPSDAFSLSPATLPPDFTSG 335

Query: 318 -------IGGVLVVGANTIHYHSQSASCA-------------LALNNYAVSLDSSQELPR 357
                    GV+V+  + + Y  Q A C              L+++N  ++  ++++   
Sbjct: 336 LSGGASNAHGVVVLTPDAVLYADQ-ARCVGAAVSGWATRTSDLSISNAYLTGGTAKDAEG 394

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDIT 416
               + L+ A    L     LL  ++G++ ++ +V +GR V R+D+      +V+ + + 
Sbjct: 395 DVKPLALEGAFPLLLTPTALLLVLRSGEMHVVRLVTEGRSVGRVDVGPCVGQTVMPATVV 454

Query: 417 TIGNSLFFLGSRLGDS--------LLVQFTCGSGTSMLSSGLKEEFGDIEADA--PSTKR 466
            +      LG   G+         + V    G  T +LS+   EE      +    S   
Sbjct: 455 RVKAPQRALGQGQGEGEKAKERRMVFVGSIVGPAT-LLSAERVEETAAANGNGVNGSGAN 513

Query: 467 LRRSSSDALQDM--VNGEELSLYGSASNNTE----SAQKTFSFAVRDSLVNIGPLKDFSY 520
               + DA  +M     ++  LYG  +  ++    SA++   FA  D++   GP+ D ++
Sbjct: 514 GHVENKDAGMEMDVDLDDDDDLYGPTTLTSQPSSGSAEEALRFAFCDAIPAHGPILDMAF 573

Query: 521 GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTVYHKSS-RG 560
            L    D       ++TG      + L +             L G +GIW++  K S RG
Sbjct: 574 ALGKWGDRYVPELVASTGAEHLGGFTLFQRDLPIRTKRKLHVLGGARGIWSISVKQSPRG 633

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLF 618
             A S+      +  +  ++IS +A     V   A   T    ++   + G T+ AG  F
Sbjct: 634 SAASSAGAGPNPELANDTVVISTDANPSPGVSRIATRSTRTDLAIPTRIPGTTVGAGPFF 693

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           GR  ++ V     R+L+      D +   S  ++      + +   SI DP VL+   D 
Sbjct: 694 GRTAILHVMTNSIRVLE-----PDGTERQSIKDTDGNMPRAKIRWCSICDPVVLIIREDD 748

Query: 679 SIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKG-----PEPWLRKTSTDAWLS 732
           ++ L +G+P    +  +  + + E S + ++ C      G      +P     S+     
Sbjct: 749 TLGLFIGEPERGRIRRKDMSPMGEKSSRYIAGCFFADTSGLFEAFMDPKAAAASSKGDKD 808

Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
            G  + +        +    + V+    G LEI+ +P    VF+     +      D+Y 
Sbjct: 809 KGATQTMQSVVNAATNSQ--WLVLVRPQGVLEIWTLPKLTLVFSTTLIATLDNVCADSYD 866

Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
             AL   +       E             + V  + M +    +  P L   L  G +  
Sbjct: 867 PAALSLPQDPPRKPQE-------------LDVENIVMAQLGESNPTPHLMVFLRSGQVAI 913

Query: 853 YQAYLFEGPENTS 865
           Y+      P + S
Sbjct: 914 YETVHHPPPPDPS 926


>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Talaromyces marneffei ATCC 18224]
 gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Talaromyces marneffei ATCC 18224]
          Length = 1383

 Score =  121 bits (303), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 181/751 (24%), Positives = 305/751 (40%), Gaps = 137/751 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++IY +  +   +   +N  +    V  +   A  L L   Y L+G V  +
Sbjct: 28  NLIVVKTSLLQIYTLVAETSTTLILENDQQADDDVKNE---ATKLHLHAEYDLYGTVTDI 84

Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           +    + S+ G D      +++L+F +AK+S++E++    G+   S+H +E         
Sbjct: 85  SPVKILKSRSGGD------ALLLSFRNAKLSLIEWNPETQGISTMSIHYYE--------- 129

Query: 173 GRESFARGPLV----------KVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE---- 217
            +E     P V           VDP  RC  +L +G++ I IL   Q G  LV DE    
Sbjct: 130 -KEDITLSPWVPDLSQCDSHLTVDPSSRCA-LLNFGVRNIAILPFHQAGDDLVMDEYDPD 187

Query: 218 ------------------DTFGSGGGF--SARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                             D+  + G         +S V+ L  LD  + H     F+H Y
Sbjct: 188 LDMDDLTDQEENKKPSHTDSKKAEGDLIHQTPYAASFVLPLTALDPTLIHPIGLTFLHEY 247

Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
            EP   IL+    T A  +  +    + S  ++    +    + S   LP D   ++A+P
Sbjct: 248 REPTFGILYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALP 307

Query: 316 SPIGGVLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
           +P+GG L++G+N  IH      + A+A+N +A  + +   + +S   + L+ +    + N
Sbjct: 308 APVGGTLLIGSNEMIHIDQSGKASAVAVNEFAKQVSAFPMVDQSDLELRLEGSVVEVINN 367

Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT--------IGNSLFF 424
           +    LL+  TG+LVL+    DGR V    +    P+V   D+ +        +G+   F
Sbjct: 368 ESGDILLTLSTGELVLVHFKIDGRSVSGFVVFPI-PAVSGGDVVSAVASCAVALGSGKVF 426

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGE 482
           +GS   +S+L+     S  S  S     +  D E +      +      S A ++ VN  
Sbjct: 427 IGSEDAESVLLDCYLPSAVSKKSRDYDRDHFDEEMNNEEDDDMYEDDLYSSAPKEAVN-- 484

Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
           +    G  S+N       ++F V D L+++GPL+  + G   + D++A     Q + + +
Sbjct: 485 KTVSNGRISDN-------YTFKVIDRLLSLGPLRAVAVGKPASRDSNAE--DAQQSVDDL 535

Query: 543 ELPGCKG-----------------------------IWTVYHKSSR-GHNADSSRMAAYD 572
           EL    G                             +W +   +++ GHN DS       
Sbjct: 536 ELAAAYGSGRGGGVALLQRTLHLDDVFTLGAESADSVWNITTSNTKSGHN-DSG------ 588

Query: 573 DEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQV 626
           +E  +Y+I++         T+V    +   E   + D    G  TI    L G  RV+QV
Sbjct: 589 EENQSYVILTKANSPENEETLVYAVNERNLEPFNAPDVNPNGDPTIDIDVLAGNSRVVQV 648

Query: 627 FERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
                RI D +  M Q     P   E   G E   V S S AD Y+L+   D S+ LL  
Sbjct: 649 LTGEVRIYDTNLGMAQ---IYPVWDED-EGDERFAV-SASFADHYLLIIRDDSSVLLLHS 703

Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
           D S     +  P  +  S +P     LY D+
Sbjct: 704 DESGDLDELTKPETV--SSQPWLCGCLYTDR 732



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/227 (19%), Positives = 89/227 (39%), Gaps = 35/227 (15%)

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
              ++ G+    +SG+ P   +   + L     +    I   +  +   C  G +YV ++ 
Sbjct: 873  LSDLGGYAAVVMSGASPNLIVRTSKSLPHVYSIQSDFIRGISGFNGAGCKKGLVYVDNER 932

Query: 977  ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
            +++ CQL + +  D  WP+++ IPL      + Y      Y +  +              
Sbjct: 933  LVRTCQLYNNAQLDFSWPIRR-IPLNEQVDHLAYSTASGTYVVGTT-------------- 977

Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYE---------VRILEPDRAGGPWQTRATIPMQ 1087
                  H+ D       +LH  +  EE           ++++ P      W+   +    
Sbjct: 978  ------HEQDFKLPDDDELHPEWATEEISLLPKVAYGSIKLINPKT----WKVIDSYTFS 1027

Query: 1088 SSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
             +E    V  + L  +  T + + ++ +GT Y +GED+AARG V +F
Sbjct: 1028 PAERITAVENINLEISEKTGKRKDMIVVGTTYAKGEDIAARGNVYVF 1074


>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
 gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
          Length = 1212

 Score =  120 bits (302), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 152/591 (25%), Positives = 264/591 (44%), Gaps = 75/591 (12%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           +RL G V  +  + Q  A     RD ++++F DAK++++E+DD    L   S+H FE   
Sbjct: 22  HRLFGQVTGIQSV-QTLASQVDGRDRLLVSFRDAKLALMEWDDVYGDLNSISIHTFERAP 80

Query: 167 WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF 226
            L +     SF   P + VDP  RC  +L+    + IL   Q  S L G +D   +    
Sbjct: 81  QL-VDGLPPSFV--PRLLVDPASRCAALLLPQDALAILPFVQEASEL-GADDPRDAALLD 136

Query: 227 SARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
            A    S +++     D  +++V+D +F+ G+ +P++ +L+E ELTW G +S    T  +
Sbjct: 137 QAPYAPSFILSFSEDVDASIRNVRDCVFLPGFQKPMLAVLYEPELTWTGSLSRARLTTRV 196

Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALAL 342
             +++  T+ ++P+  ++  LP+D   L+A P  +GGVLVV  + + +  Q+A    L++
Sbjct: 197 CFITLDLTVTKYPVTVTSEALPYDTLYLVACPDSLGGVLVVTPSALLHLDQTARLVGLSV 256

Query: 343 NNYAVSLDSSQELPRSSFSV---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
           + +     S   LP ++ ++   +L ++  T+ + +  LL  + G ++      +GR V 
Sbjct: 257 SRWTDFTSSELMLPNATATLGDCDLQSSVLTFTEANGGLLVLRDGRMLTFQCALEGRTVT 316

Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
            L L+     VL  +    G + F     L + L++  +    T + +  L E   +I A
Sbjct: 317 SLSLN----VVLVPERQ--GGASFV--QALPERLILCASFQDDTYLYAMNLLEAPTEIAA 368

Query: 460 D-APSTKRLR-RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
              P  + L   +  DA    + G+      + S   + A       V D L  +GPL D
Sbjct: 369 STGPDQQSLEPDADVDADALDLYGDSFKPDVATSKQAQPA----GLDVLDVLPTLGPLND 424

Query: 518 FSYGLRINADASA---TGISKQSNYELVE----------LPGCKGIWTVYHKSSRGHNAD 564
            +YG+  NA   A      + Q +  ++E          +     IWTV        N  
Sbjct: 425 MTYGVVRNAHGKAHPHMVATMQHHLAVIEPRLRCDVVQNIAPAHAIWTV------SINGK 478

Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
              + A+D+E    L+ SLE+ +            T  +   +Q RTIA G+   +  VI
Sbjct: 479 WLLLTAWDEE---CLVYSLESNS------------THFLSQHLQ-RTIACGS--TQAGVI 520

Query: 625 QVFERGARILD--GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           +V  + A +LD  G  MT   +F   ++ +  G         SI D YV L
Sbjct: 521 RVTSKRAEVLDEHGRIMT---TFAECDANASYG-------DASIQDSYVAL 561


>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus clavatus NRRL 1]
 gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
           protein 1
 gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus clavatus NRRL 1]
          Length = 1401

 Score =  119 bits (297), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 167/723 (23%), Positives = 294/723 (40%), Gaps = 127/723 (17%)

Query: 57  NLVVTAANVIEIYV---VRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
           NLVV   +V++I+    V    EG   +  S         D + +  L L   Y L G V
Sbjct: 28  NLVVVKTSVLQIFSLLNVSCSAEGEIIAAKSARP------DQLQSTKLILEREYSLSGTV 81

Query: 114 ESLA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
             L     + ++ G D      +I+LAF +AK+S++E+D   +G+   S+H +E  +   
Sbjct: 82  SDLCRVKLLKTKSGGD------AILLAFRNAKLSLVEWDPERYGISTISIHYYERDDITR 135

Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GD----------- 216
                +  + G ++ VDP  RC  V  +G++ + IL   Q G  LV GD           
Sbjct: 136 SPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLVMGDYESDSQKQSHE 194

Query: 217 ---EDTFGS-----GGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
              +D+ G+     G        SS V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 195 HEMDDSAGNSKSKEGAVHQTPYASSFVLPLTALDSAILHPVSLAFLYEYREPTFGILYSQ 254

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
             T    +  +      +  ++    +   ++ S   LP D +K++A+P P+GG L++G 
Sbjct: 255 IATSNSLLHERKDAIFYTVFTLDLEQRASTMLLSVTRLPSDLFKVVALPPPVGGALLIGY 314

Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
           N  +H      + A+ +N ++  + +     +S  ++ L+      L N     LL+  +
Sbjct: 315 NELVHVDQAGKTNAVGVNEFSRQVSTFSMADQSELALRLEGCVVELLGNSSGDLLLALSS 374

Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLV 435
           G +VL+    DGR V  + + +  P     +I         ++G+   F GS   +S+L+
Sbjct: 375 GTMVLVHFKLDGRSVSGISI-RPLPGHAGGNILKAAASASASLGSDKVFFGSEDAESVLL 433

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN-- 493
            ++  S  +  S   + E   IE D         S  D  +D        LY +A +   
Sbjct: 434 GWSLSSSNARKS---RSESKRIEKDHEEGSDDSESEEDVYED-------DLYSAAPDTPA 483

Query: 494 -------TESAQKTFSFAVRDSLVNIGPLKDFSYG-------------------LRINAD 527
                    S   ++ F V D L N  PL+D + G                   L + A 
Sbjct: 484 LGHRLSVAPSTFASYKFKVHDVLPNTAPLRDIALGQPAMPVEDTGSHLDNICSELELVAA 543

Query: 528 ASATG-----ISKQSNYELVE----LPGCKGIWT---VYHKSSRGHNADSSRMAAYDDEY 575
             + G     + K+    +V+    +    G+WT       +++  + D + +    +E+
Sbjct: 544 YGSNGNGGLVVMKRELEPVVKASLNVGPIHGVWTASIALGSAAKPMSGDQTNI----EEW 599

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFE 628
             Y+I++ + +T+  E +++      ++  F           +I  G L  R+RV+QV  
Sbjct: 600 RQYVILT-KPQTIDKEESEVFIVDGLNLKPFKAPEFNPNNDISIQVGTLSNRKRVVQVLR 658

Query: 629 RGARILDGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
              R  D      DL      P   E    S+    LS S+ADPY+ +   D ++ LL  
Sbjct: 659 NEVRSYD-----SDLELAQIYPVWDE--DTSDERMALSASLADPYIAILRDDSTLLLLQA 711

Query: 686 DPS 688
           D S
Sbjct: 712 DDS 714



 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 66/266 (24%), Positives = 111/266 (41%), Gaps = 33/266 (12%)

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG---------SRPCWCMVF 939
            N    R P D+ T       +  + + I  +ISG+   F+ G         SR C   + 
Sbjct: 861  NHVLPRIPPDSDTNISDKEPSNHRPLCILPDISGYSAVFMPGTSASFIFKTSRSC-PHIL 919

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
            R R  V   L D     FT   + +   GFIYV S+ +++ICQLP  + YD  W ++KV 
Sbjct: 920  RLRGGVVRSLSD---FDFT---DPSLGRGFIYVDSKDVVRICQLPPETIYDYSWTLKKVA 973

Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY 1059
              +   H + Y      Y L  S       +    L  D E+  +  N  +S +   R  
Sbjct: 974  IGEHVDH-LAYSISSETYVLGTS------HSADFKLPEDDELHPEWRNEAISFLPELRQC 1026

Query: 1060 TVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAY 1118
                  ++++ P      W    +  +   E  + V+ + L  +  T E + ++ +GTA 
Sbjct: 1027 C-----LKVVHPKT----WTVIDSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTAL 1077

Query: 1119 VQGEDVAARGRVLLFSTGRNADNPQN 1144
             +GED+ ARG + +F   +   +P+ 
Sbjct: 1078 ARGEDIPARGCIYVFEVIKVVPDPEK 1103


>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
           Silveira]
          Length = 1387

 Score =  116 bits (291), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 175/731 (23%), Positives = 291/731 (39%), Gaps = 94/731 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     G+    N+ +  R   ++      L LV  Y L G +  L
Sbjct: 28  NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   HG+   S+H +E  E +H       
Sbjct: 85  GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
               P L+ VDP  RC  +L +G+  + IL   Q G  LV     GD D    G      
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200

Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                     +     SS V+ L  LD  + H     F++ Y EP   IL+    T +  
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +  +      S  ++    +    + +   LP D +K++ +P PIGG L++G+N  IH  
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L  D    LL    G + +L 
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380

Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           +  DGR V     +L   K   S+L +  +   ++G    F GS   DSLL+ +   S  
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGW---SRP 437

Query: 444 SMLSSGLKEEFGDI---EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
           S L    K E  D    +              D     VN   LS   S +N     +  
Sbjct: 438 SQLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495

Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
           F F   D L N+GP+ D + G               R +AD        + N        
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555

Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                   + +++    G+W++   +      DS+        Y  YL+ S      + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610

Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
           ++V        E  ++ ++   +  T+  G L G  RV+QV +   R  D +     +  
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669

Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
            P   E    S+  +V+S S A+PYVL+   D S+ LL  D S     V     I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725

Query: 706 PVSSCTLYHDK 716
            +S C LY DK
Sbjct: 726 WLSGC-LYLDK 735



 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 55/227 (24%), Positives = 105/227 (46%), Gaps = 17/227 (7%)

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
            + +I G++  F+SGS PC+ M          +L   ++ + +  H   C  GF YV +  
Sbjct: 878  YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASN 937

Query: 977  ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLL 1036
            ++++C+LP  + +DN W  +KV  +      + YFA   +Y L  S  V   L +     
Sbjct: 938  MVRMCRLPGNTRFDNSWVTRKV-HVGDQIDCVEYFAHSEIYALGSSHKVDFKLPE----- 991

Query: 1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1096
             D E+  +  +  +S +       +E   +++L P      W    +  +  +E  + ++
Sbjct: 992  -DDEIHPEWRSEVISFMP-----QLERGCIKLLSPRT----WSVVDSYELGDAERVMCMK 1041

Query: 1097 VVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1142
             + +  +  T E + +L +GTA V+GED+  RG + +F     A +P
Sbjct: 1042 TINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEIIEVAPDP 1088


>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
           delta SOWgp]
 gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
           delta SOWgp]
          Length = 1394

 Score =  116 bits (291), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 175/731 (23%), Positives = 291/731 (39%), Gaps = 94/731 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     G+    N+ +  R   ++      L LV  Y L G +  L
Sbjct: 28  NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   HG+   S+H +E  E +H       
Sbjct: 85  GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
               P L+ VDP  RC  +L +G+  + IL   Q G  LV     GD D    G      
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200

Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                     +     SS V+ L  LD  + H     F++ Y EP   IL+    T +  
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +  +      S  ++    +    + +   LP D +K++ +P PIGG L++G+N  IH  
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L  D    LL    G + +L 
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380

Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           +  DGR V     +L   K   S+L +  +   ++G    F GS   DSLL+ +   S  
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGW---SRP 437

Query: 444 SMLSSGLKEEFGDI---EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
           S L    K E  D    +              D     VN   LS   S +N     +  
Sbjct: 438 SQLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495

Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
           F F   D L N+GP+ D + G               R +AD        + N        
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555

Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                   + +++    G+W++   +      DS+        Y  YL+ S      + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610

Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
           ++V        E  ++ ++   +  T+  G L G  RV+QV +   R  D +     +  
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669

Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
            P   E    S+  +V+S S A+PYVL+   D S+ LL  D S     V     I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725

Query: 706 PVSSCTLYHDK 716
            +S C LY DK
Sbjct: 726 WLSGC-LYLDK 735



 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 56/246 (22%), Positives = 106/246 (43%), Gaps = 36/246 (14%)

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-- 974
            + +I G++  F+SGS PC+ M          +L   ++ + +  H   C  GF YV +  
Sbjct: 866  YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASV 925

Query: 975  -----------------QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
                             Q ++++C+LP  + +DN W  +KV  +      + YFA   +Y
Sbjct: 926  CVPKQYFVPWNKLILVIQNMVRMCRLPGNTRFDNSWVTRKV-HVGDQIDCVEYFAHSEIY 984

Query: 1018 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1077
             L  S  V   L +      D E+  +  +  +S +       +E   +++L P      
Sbjct: 985  ALGSSHKVDFKLPE------DDEIHPEWRSEVISFMP-----QLERGCIKLLSPRT---- 1029

Query: 1078 WQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
            W    +  +  +E  + ++ + +  +  T E + +L +GTA V+GED+  RG + +F   
Sbjct: 1030 WSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGSIYVFEII 1089

Query: 1137 RNADNP 1142
              A +P
Sbjct: 1090 EVAPDP 1095


>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1392

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 170/731 (23%), Positives = 296/731 (40%), Gaps = 151/731 (20%)

Query: 57  NLVVTAANVIEIY--VVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRL 109
           NLVV   ++++++  V  V  +  KE     S  S + + +++++            Y L
Sbjct: 28  NLVVVRTSLLQVFSLVKIVSSQPQKEVPEPLSSQSSQPETKLVLEK----------EYPL 77

Query: 110 HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
            G V  L   S+    N+R   ++I++A  +AK+S++E+D    G+   S+H +E  +  
Sbjct: 78  SGTVTDL---SRVKILNTRSGGEAILIAVRNAKLSLIEWDPERRGISTISIHYYERDDLT 134

Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDED 218
                 +    G ++ VDP  RC  V  +G++ + IL   Q G  LV         G+  
Sbjct: 135 RSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELDGERP 193

Query: 219 TFGSGGGFSARIE-------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVIL 263
           +  SGGG  A+IE             SS V+ L  LD  + H     F++ Y EP   IL
Sbjct: 194 SQNSGGG--AQIEKRKEEPDHQTPYSSSFVLPLTALDPSLLHPISLAFLYEYREPTFGIL 251

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
           + +  T    +  +      +  ++    +    + S   LP D +K++A+P P+GG L+
Sbjct: 252 YSQVATSTALLHERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALL 311

Query: 324 VGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLS 380
           +G+N I H      + A+ +N ++  + S     +S  +  L+      L  D    LL+
Sbjct: 312 LGSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLA 371

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDS 432
             +G++ L+    DGR V  + +    P+    DI        T +G+   F+GS   DS
Sbjct: 372 LASGNMALIKFKLDGRSVSGITVHSL-PAYAGGDILKSAASCSTCLGDGNVFIGSEDADS 430

Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR---RSSSDALQDM----VNGEELS 485
           +L++++  S                     ST++ R   + ++D L D+       E+  
Sbjct: 431 VLLEWSHTSA--------------------STRKARLESKQTADGLDDLSDEDDQMEDDD 470

Query: 486 LYGSASNNTE---------SAQKTFSFAVRDSLVNIGPLKDFSYGL---RINADASATGI 533
           LY SA    +         S  + ++F + D L +IGPL+D + G      N  + AT  
Sbjct: 471 LYSSAPGPIQVDNRMGTDSSTPEFYNFRLNDKLSSIGPLRDITLGKAFSNTNRKSQATTG 530

Query: 534 SKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADSSR 567
           +  +  ELV   G                             +W+      RG       
Sbjct: 531 TVAAELELVASQGSDRGGGLVVIKREIDPLTTMSLKVDDADAVWSASVTKRRG------- 583

Query: 568 MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGR 620
            ++ D+    Y++IS    +   E  ++     +S+  F        +  T+  G+L G 
Sbjct: 584 ASSTDNPSCQYVVISRSTDSE-QEVNEVFIVEEQSLKPFKAPEFNPNEDCTVDIGSLAGN 642

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSE---SGSGSENSTVLSVSIADPYVLLGMSD 677
            R++QV     R    SY   D+  G S          S+     S S  DPY+++   D
Sbjct: 643 TRLVQVLRNEVR----SY---DIDLGLSQIYPVWDEDTSDERVAASASFIDPYLVIIRDD 695

Query: 678 GSIRLLVGDPS 688
            S+ LL  D S
Sbjct: 696 SSVLLLQADES 706



 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 62/237 (26%), Positives = 104/237 (43%), Gaps = 25/237 (10%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLCDGSIVAFTVLHNVNC--NHG 968
            + I  NISG    F+ G+   +  VFR   +  P   +L  G     +   +V+   ++G
Sbjct: 877  LRILPNISGFSTIFMPGASSSF--VFRTA-KSSPHIIRLRGGFTRWLSSFDSVDTGRDNG 933

Query: 969  FIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1028
            FIYV SQ  ++ CQLPS + +D  W ++KV P++     + Y      Y L  S      
Sbjct: 934  FIYVDSQNCVRACQLPSQTQFDYPWTLRKV-PIEEQVDFLAYSTSSETYVLGTS------ 986

Query: 1029 LNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQS 1088
                  L    ++  +  N  LS         + E  ++++ P      W    + P+  
Sbjct: 987  REGDFKLPEGDDLHPEWRNEELSFCP-----KIPESSIKVVSPKT----WTIIDSYPLDP 1037

Query: 1089 SENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQN 1144
             E    V+ V +  +  T E   L+ +GTA V+GED+ ARG + +F   + A +P+ 
Sbjct: 1038 DEQVTAVKNVNIEVSENTHERRDLIVVGTAIVKGEDMPARGTIYVFDVIKVAPDPEK 1094


>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
           [Penicillium digitatum Pd1]
 gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
           [Penicillium digitatum PHI26]
          Length = 1271

 Score =  114 bits (285), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 175/758 (23%), Positives = 301/758 (39%), Gaps = 153/758 (20%)

Query: 57  NLVVTAANVIEIYVV------RVQEEGSK---ESKNSGETKRRVLMDGISAASLELVCHY 107
           NL+V   ++++I+ +      ++Q+EGS+      +  ETK            L L   Y
Sbjct: 28  NLIVIRTSLLQIFSLVKIVSSQLQKEGSEPHGSQFSQPETK------------LVLEKEY 75

Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            L G V  L+ +     +N    ++I++A  +AK+S++E+D   HG+   S+H +E  + 
Sbjct: 76  PLSGTVTDLSRVKI--LNNKSGGEAILIAVRNAKLSLIEWDPERHGISTISIHYYERDDL 133

Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDE 217
                  +    G ++ VDP  RC  V  +G++ + IL   Q G  LV         G+ 
Sbjct: 134 TRSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELEGER 192

Query: 218 DTFGSGGGFSARIE-----------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
               SGGG   +             SS V+ L  LD  + H     F++ Y EP   IL 
Sbjct: 193 PIQNSGGGAEPKKSKEGPAYQTPYCSSFVLPLTALDPSLLHPISLAFLYEYREPTFGILF 252

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
            +  T    +  +      +  ++    +    + S   LP D +K++A+P P+GG L++
Sbjct: 253 SQVATSTALLYERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALLL 312

Query: 325 GANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
           G+N I H      + A+ +N ++  + S     +S  +  L+      L  D    LL+ 
Sbjct: 313 GSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLAL 372

Query: 382 KTGDLVLLTVVYDGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
            +GD+ L+    DGR V  +           D+ K+  S      + +G+   F+GS   
Sbjct: 373 ASGDMALIKFKLDGRSVSGITIHLLPAHAGGDMLKSAASC----SSCLGDGNVFIGSEDA 428

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEE 483
           DS+L++++  S                     STK+ R  S            D    E+
Sbjct: 429 DSVLLEWSRSSA--------------------STKKARLESKQTADGFDDLEDDDDQMED 468

Query: 484 LSLYGSASNNTESAQKT---------FSFAVRDSLVNIGPLKDFSYGLRIN---ADASAT 531
             LY SA  +T+   +          ++F ++D L +IGPL+D + G   +    +  AT
Sbjct: 469 DDLYSSAPGSTQVDNRMGTENLTTEFYNFRLKDCLPSIGPLRDITLGKVFSNTYREKQAT 528

Query: 532 GISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADS 565
             +  +  ELV   G                            G+W+   K  RG     
Sbjct: 529 CEAVSAELELVASQGSDRGGGLVVIKREIDPLTTMSLKIDDADGVWSASVKKRRG----- 583

Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLF 618
              ++ D+    Y+++S    +   E  ++     +++  F        +  T+  G+  
Sbjct: 584 --ASSTDNPSRQYVVVSRSTDSEQ-ELNEVFVAEEQNLKPFRAPEFNPNEDCTVDIGSFA 640

Query: 619 GRRRVIQVFERGARILDGSYMTQDLS-FGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
           G  R++QV     R  D   M   LS   P   E    S+    +S S  DPY+++   D
Sbjct: 641 GDTRLVQVLRNEVRSYD---MELGLSQIYPVWDE--DTSDERVAVSASFIDPYLMIIRDD 695

Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
            S+ LL  D +     V     I SS+    S  LY+D
Sbjct: 696 SSVLLLQADENGDLDEVPLSTLIISSR--WRSGCLYYD 731



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/167 (24%), Positives = 72/167 (43%), Gaps = 17/167 (10%)

Query: 978  LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
            ++ CQLPS + +D  W ++KV P++   + + Y      Y L  S            L  
Sbjct: 822  IRACQLPSQTQFDYSWTLRKV-PIEEQVNFLAYSTSSETYVLGTS------RQGDFKLPE 874

Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRV 1097
              E+  +  N  LS         + E  ++++ P      W    + P+   E    V+ 
Sbjct: 875  GDELHPEWRNEELSFCP-----KIPESSIKVVSPKT----WTIIDSYPLDPDEQVTAVKN 925

Query: 1098 VTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ 1143
            V +  +  T E   L+ +GTA  +GED+ ARG + +F   + A +P+
Sbjct: 926  VNIEVSENTHERMDLIVVGTAIAKGEDMPARGTIYVFDVIKVAPDPE 972


>gi|258575565|ref|XP_002541964.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902230|gb|EEP76631.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1376

 Score =  113 bits (283), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 186/745 (24%), Positives = 298/745 (40%), Gaps = 123/745 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   +V++++ +     G+  S ++ +  R   ++      L L+  Y L G V  L
Sbjct: 28  NLIVAKTSVLQVFSLVNVAYGASTSPSTDDKTR---VERQQYTRLVLLAEYDLPGTVTGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   HG+   S+H +E  E LH       
Sbjct: 85  GRVKT--LDSKSGGEALLVATRNAKLSLVEWDHERHGISTVSIHYYER-EDLHNSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD---EDTFG---------- 221
               P L+ VDP  RC  +L +G+  + IL   Q G  LV D   ED  G          
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDDFDEDLRGEKPEDMDNAL 200

Query: 222 ---SGGGFSAR----IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
              +     AR      SS V+ L  LD  + H     F++ Y EP   IL+    T   
Sbjct: 201 VESTAANDVARHKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHVATSFA 260

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHY 331
            +  +      +  ++    +    + +   LP D + ++ +P PIGG L++G+N  IH 
Sbjct: 261 LLGERKDVVSYAVFTLDIQQRTSTTLVTVSRLPSDLWNVVPLPPPIGGSLLIGSNELIHV 320

Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL---QNDVALLSTKTGDLVL 388
                + A+ +N +A          +S   + L+      L     D+AL+   +G + +
Sbjct: 321 DQAGKTNAVGVNEFARQASEFSMADQSDLELRLEGCVIEQLGTESGDIALV-LASGRMAI 379

Query: 389 LTVVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS 441
           +    DGR V     +L  ++   S+L +  +   ++G    FLGS   DS+LV +T  S
Sbjct: 380 VRFKVDGRSVSGIFVQLVSTQAGGSILKARPSCSASLGRGKIFLGSEETDSVLVGWTRPS 439

Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGSASNNT 494
                                S KRL+R SS       D   D  +  E  LY + +N T
Sbjct: 440 Q--------------------SIKRLKRDSSGPRAGETDTDDDEDDIYEDDLYSTPTNQT 479

Query: 495 ESAQKT----------FSFAVRDSLVNIGPLKDFSYGLRINA-DASATGISKQS-NYELV 542
              +            F F   D L ++GP+KD + G      D ++   SK S + ELV
Sbjct: 480 TVPKTVSQTNGLIKDEFVFRCHDRLWSLGPMKDITLGRTPGTRDQASKKTSKPSTDLELV 539

Query: 543 EL--PGCKGIWTVYHKSSRGHNADSSRMAAYDD-------------------EYHAYLII 581
                G  G  T+  K    +  DS +M   D                     Y  YL+ 
Sbjct: 540 VTHGQGDAGGLTILRKELDPYIIDSMKMDNVDGVWSVQIAPSNTSNPSTTSRNYDKYLVF 599

Query: 582 SLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARIL 634
           S ++R    E + + T     +D          +  T+  G L G  RV+QV     R  
Sbjct: 600 S-KSRGHAKEQSVVYTVGGNGIDEMKAPEFNPNEDHTVDIGTLAGGTRVVQVLTSEVRSY 658

Query: 635 DGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
           D      DL+     P   E    S+  +V   S A+PY+L+   D S+ LL  D S   
Sbjct: 659 D-----TDLALAQIYPVWDE--DTSDELSVTGASFAEPYLLITRDDQSLLLLQPDSSGDL 711

Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
             V     + +S K +  C LY DK
Sbjct: 712 DEVNIDGLL-TSNKWLCGC-LYFDK 734



 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 56/237 (23%), Positives = 106/237 (44%), Gaps = 29/237 (12%)

Query: 913  RITIFKNISGHQGFFLSGSRPCWCMVFRE------RLRVHPQLCDGSIVAFTVLHNVNCN 966
            R+    ++ G++  F+ GS PC+ M          RL+  P      + + +  H   C 
Sbjct: 876  RLRAIPDLCGYKTMFMPGSNPCFIMKSSTSSPHVLRLKGEP------VSSLSSFHMPACE 929

Query: 967  HGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVL 1026
             GF YV ++ ++++C+LP  + +DN W  +K I +      + YFA    Y L  S    
Sbjct: 930  KGFAYVDAKNMVRMCRLPGNTRFDNAWAARK-IHIGEQVDCVEYFARSETYVLGTS---- 984

Query: 1027 KPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
               ++   L  D EV  +  +  +S +       ++   V++L P      W       +
Sbjct: 985  --YHEDFKLPEDDEVHTEWRSEVISFMP-----QLDRGRVKLLSPRT----WSIIDCYDL 1033

Query: 1087 QSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1142
             ++E  L ++ + +  +  T E + ++ +GTA V+GED+  RG + +F     A +P
Sbjct: 1034 GATERILCLKTINMEVSEITHERQDMVVVGTAIVRGEDITPRGSIYVFEIIDVAPDP 1090


>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
 gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
          Length = 1387

 Score =  113 bits (283), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 172/731 (23%), Positives = 294/731 (40%), Gaps = 94/731 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     G+    N+ +  R   ++      L LV  Y L G +  L
Sbjct: 28  NLIVAKTSILQVFSLVNVAYGTSAPPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    ++++++  +AK+S++E+D   HG+   S+H +E  E +H       
Sbjct: 85  GRVKI--LDSRSGGEALLVSTRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE-----DTFGSGGG---- 225
               P L+ VDP  RC  +L +G+  + IL   Q G  LV DE     D    G      
Sbjct: 142 LRLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDEDLDEKPEGASNIPA 200

Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                     +     SS V+ L  LD  + H     F++ Y EP   IL+    T +  
Sbjct: 201 QAAVANDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +  +      +  ++    +    + +   LP D +K++ +P PIGG L++G+N  IH  
Sbjct: 261 LHDRKDIVSYAVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L  D    LL    G + +L 
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380

Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           +  DGR V     +L   K   S+L +  +   ++G    F GS   DSLL+ ++  S  
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGWSRPS-Q 439

Query: 444 SMLSSGLK---EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
           SM    ++   + FG  +              D     VN   LS   S +N     +  
Sbjct: 440 SMRKPKVESADDVFG--DHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495

Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
           F F   D L N+GP+ D + G               R +AD        + N        
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555

Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                   + +++    G+W++   +      DS+        Y  YL+ S      + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610

Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
           ++V        E  ++ ++   +  T+  G L G  RV+QV +   R  D +     +  
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669

Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
            P   E    S+  +V+S S A+PYVL+   D S+ LL  D S     V     I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725

Query: 706 PVSSCTLYHDK 716
            +S C LY DK
Sbjct: 726 WLSGC-LYLDK 735



 Score = 79.7 bits (195), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 63/253 (24%), Positives = 116/253 (45%), Gaps = 25/253 (9%)

Query: 891  RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
            RF  +P  AY     PH    + +  + +I G++  F+SGS PC+ M          +L 
Sbjct: 860  RFDPSP-KAYM----PHS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLR 911

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
              ++ + +  H   C  GF YV +  ++++C+LPS + +DN W  +KV  +      + Y
Sbjct: 912  GEAVSSLSSFHIPACEKGFAYVDASNMVRMCRLPSNTRFDNSWVTRKV-HVGDQIDCVEY 970

Query: 1011 FAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILE 1070
            FA   +Y L  S  V   L +      D E+  +  +  +S +       +E   +++L 
Sbjct: 971  FAHSEIYALGSSHKVDFKLPE------DDEIHPEWRSEVISFMP-----QLERGCIKLLS 1019

Query: 1071 PDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGR 1129
            P      W    +  +  +E  + ++ + +  +  T E + +L +GTA V+GED+  RG 
Sbjct: 1020 PRT----WSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITPRGS 1075

Query: 1130 VLLFSTGRNADNP 1142
            + +F     A +P
Sbjct: 1076 IYVFEIIEVAPDP 1088


>gi|406602601|emb|CCH45811.1| hypothetical protein BN7_5397 [Wickerhamomyces ciferrii]
          Length = 1287

 Score =  112 bits (280), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 129/613 (21%), Positives = 261/613 (42%), Gaps = 62/613 (10%)

Query: 96  ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
           I + + +L+ ++    N + + I S    D+   + +I+ +   AK+S++ FD  ++ ++
Sbjct: 42  IDSKNDKLILNHEFKLNGKIIGIKSIKLPDSQYDQLAILTSL--AKLSIVSFDHDLNTIQ 99

Query: 156 ITSMHCFESPEWLH-LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
             S+H +ES  +   + +  ES      +K+DP  +   ++VY   +  L   Q    ++
Sbjct: 100 TNSLHYYESEFYTKSISKINES-----QLKIDPNNQTS-LVVYNDLLAFLPFKQDDDEII 153

Query: 215 GDEDTFGSGGGFSARIESSH--VI---NLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
            D+    S       IE  H  +I   N  +  + ++ D  F+H Y +P + ILH +E T
Sbjct: 154 DDDHHTQSNDQQQQNIELFHNSIILPANKLESTVSNIIDCDFLHSYRDPTLAILHNKEQT 213

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-T 328
           WA  +S K  T     LS+         I    NLP+D + +  +P PI G L++G N  
Sbjct: 214 WASDLSIKKDTVNFVVLSLDLLNDSSTAILLVENLPYDLWFVKPLPDPINGTLLIGCNEI 273

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           IH  +   +  + LN Y   +   +   +S  ++ L+ +    L +   L+  + G+   
Sbjct: 274 IHIDNSGNTKGIGLNKYYQDITDFKLKDQSDLNIFLEHSKVEILNDKNILIIDQFGESYN 333

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L    DG+ V+ L ++K    +       IT I     F+G +  DS+L+++        
Sbjct: 334 LQFFIDGKSVKDLLITKFEKDLQIRSPISITNIDEQNIFIGCQSSDSILIKY-------- 385

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
               LK+E  + +   P+  +      D           +      N        F+  +
Sbjct: 386 --EKLKQETNEAKPTTPAATKTNNDDDDEDLYEDEDLNNNNDDELIN--------FNLQI 435

Query: 506 RDSLVNIGPLKDFSYGLRINADASATGIS--KQSNYELVEL--PGCKGIWTVYHKSSRGH 561
           +D L N GPL  F+ G +IN ++   G++   Q++  +V     G +G  T++++S +  
Sbjct: 436 KDKLFNAGPLSSFTLG-KINPNSLIQGLTNPNQNDVSIVGTSGEGKQGKLTLFNQSIQPK 494

Query: 562 NADSSRMAAYDDEY---HAYLIIS-LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
              S +    +  +   + YLI + L+     +   +   +  +S D+     TI    +
Sbjct: 495 IHSSLKFNNINKTWNILNKYLITTDLQNFKSEIFLINENFKNFQSFDFKNNNITINIDTI 554

Query: 618 FGRRRVIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
             ++R++Q+      + D ++   +  +  F               +++  I DP++++ 
Sbjct: 555 QSQKRILQITSNNVYLFDLNFKKLLQINFDF--------------EIINGKIFDPFIIIT 600

Query: 675 MSDGSIRLLVGDP 687
            S G +++   DP
Sbjct: 601 SSKGEVKIFEMDP 613


>gi|58268668|ref|XP_571490.1| cleavage and polyadenylation specific protein [Cryptococcus
           neoformans var. neoformans JEC21]
 gi|134113364|ref|XP_774707.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|338817789|sp|P0CM63.1|CFT1_CRYNB RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|338817790|sp|P0CM62.1|CFT1_CRYNJ RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|50257351|gb|EAL20060.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57227725|gb|AAW44183.1| cleavage and polyadenylation specific protein, putative
           [Cryptococcus neoformans var. neoformans JEC21]
          Length = 1431

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 159/715 (22%), Positives = 296/715 (41%), Gaps = 106/715 (14%)

Query: 57  NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---------------- 96
           NLVV  A V+ ++ +R +     E  K  ++  E ++ V M+ +                
Sbjct: 48  NLVVAGAEVLRVFEIREESVPIIENVKLEEDVAEGEKDVQMEEVGDGFFDDGHAERAPLK 107

Query: 97  --SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
             +   L L+  + L+G +  LA  ++         D +I++F+DAK+++LE+  S   +
Sbjct: 108 YQTTRRLHLLTQHELNGTITGLAA-TRTLESTIDGLDRLIVSFKDAKMALLEW--SRGDI 164

Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
              S+H +E    ++     +S+   PL++ DP  R   + +    + +L   Q  S L 
Sbjct: 165 ATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDSLAVLPLIQEQSEL- 220

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDM--KHVKDFIFVHGYIEPVMVILHERELTWAG 272
              D    G    A    S V++L D+ +  K+++D +F+ G+  P + +L     TW+G
Sbjct: 221 ---DPLSEGFSRDAPYSPSFVLSLSDMSITIKNIQDLLFLPGFHSPTIALLFSPMHTWSG 277

Query: 273 RV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
           R+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS +GG+++V +  I 
Sbjct: 278 RLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSELGGIVLVTSTGIV 336

Query: 331 YHSQ----SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
           +  Q    +A+C  A  +   SL  S  +   S  + L+ +   ++     LL  + G +
Sbjct: 337 HVDQGGRVTAACVNAWWSRITSLKCS--MASVSQKLTLEGSRCVFVTPHDMLLVLQNGAV 394

Query: 387 VLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
             +    +GR   V++ LD     P    SD+T  G+   F+GS  GDS L +       
Sbjct: 395 HQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSWLAKVNVVRQV 452

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
              S   K+E  +++ D    + L    +DA  D    E   L+G A+          + 
Sbjct: 453 VERSEKKKDEM-EVDWD----EDLYGDINDAALDEKAQE---LFGPAA---------ITL 495

Query: 504 AVRDSLVNIGPLKDFSYGL-----------------------RINADASATGISKQSNYE 540
           +  D L  +G + D  +G+                        IN       I+K+  + 
Sbjct: 496 SPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTINVFRRGIPITKRRRFN 555

Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
             EL   +G+W +      G      +     +   A +++S E          L ++ T
Sbjct: 556 --ELLNAEGVWFLPIDRQTGQ-----KFKDIPEAERATILLSSEGNAT--RVFALFSKPT 606

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
                 + G+T++A   F R  +++V      +LD +        G          +   
Sbjct: 607 PQQIGRLDGKTLSAAPFFQRSCILRVSPLEVVLLDNN--------GKIIQTVCPRGDGPK 658

Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
           +++ SI+DP+V++  +D S+   VGD    TV+ + P   E       +  ++ D
Sbjct: 659 IVNASISDPFVIIRRADDSVTFFVGDTVARTVA-EAPIVSEGESPVCQAVEVFTD 712


>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
 gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
          Length = 1458

 Score =  111 bits (277), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 200/917 (21%), Positives = 360/917 (39%), Gaps = 164/917 (17%)

Query: 57  NLVVTAANVIEIYVVRVQE-------EGSKESKN---------SGETKRRVLMDGISAAS 100
           NLVV  +N++ I+ VR +        E  +E K           GE       DG    S
Sbjct: 40  NLVVARSNLLRIFEVREEPCAVPHGVEDERERKGGIRRGTEAVEGELAMDAQGDGFINVS 99

Query: 101 ----------------LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
                           L LV  ++LHG V  L+ + +  A    + D ++++F+DAKI++
Sbjct: 100 KGMAMKSDVEHPKTTRLYLVREHKLHGMVTGLSGV-RIIASLEDKLDRLLVSFKDAKIAL 158

Query: 145 LEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVK----VDPQGRCGGVLVYGL 199
           LE+ D++H L   S+H +E +P+   L          PL K    VDPQ RC  + +   
Sbjct: 159 LEWSDAVHDLVPVSIHTYERAPQLTSLT--------APLFKSQLRVDPQSRCAALGLPNH 210

Query: 200 QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYI 256
            + IL               F            S +++L    + ++++V DF F+ G+ 
Sbjct: 211 ALAILP--------------FLDDAVSDVPYSPSFILDLAVSVNPNIRNVADFCFLPGFN 256

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           +P + ++ E   TW GR+     T  +   ++      +P+I S   LP D+  L  VP+
Sbjct: 257 KPTLAVMFEPLQTWMGRIGEYKDTVKLVIFTLDIKTSSYPIITSVDGLPMDSLGL--VPA 314

Query: 317 PIGGVLVVGANTIHYHSQSAS--CALALNNYA--VSLDSSQELPRSSFSVELDAAHATWL 372
             GGV++   N++ Y  QS+S   A+ +N +A  ++      LP    ++ L+ +    +
Sbjct: 315 -FGGVVITTPNSLIYIDQSSSRQIAVPVNGWASRITDLPLLPLPSPDLNLTLEGSKTVVV 373

Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGS 427
                 +    G +  + V+ DG+ V +L + K     T PSV+ S              
Sbjct: 374 DEKTLFVILANGIIYPIEVMADGKTVTKLQVGKPLAQATIPSVVES-------------- 419

Query: 428 RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR-----RSSSDALQDMVNGE 482
            LGD  L   +      +L +   EE  D E +  + K +          D      + +
Sbjct: 420 -LGDGHLFVGSTVGVGVVLKTAWVEEEVDDEEEGTNAKVVEDDIDMDLYDDDDDLYGDSK 478

Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG---- 532
             +   +   +T+  +     ++RD+L   GP+   ++ L    D       +ATG    
Sbjct: 479 NKTQVTAEVKDTKKYRSVLHLSLRDTLPAYGPISSLTFSLATEGDKPVPELVTATGSGIL 538

Query: 533 ---------ISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----- 578
                    +  ++  +++ + G +G+W++  + S      SS   A +   HA      
Sbjct: 539 GGFTLFQRDLPTRTKKKILAVGGTRGLWSLPIRQSVKKGGSSSSTTAIE---HAKTERDT 595

Query: 579 LIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
           LI+S +A       R          TEV  ++   V G T+ A   F R  ++ V     
Sbjct: 596 LILSTDATPSPGVSRIATRAPPGGKTEV--NITTRVPGTTVGAAPFFQRTAILVVMTNSI 653

Query: 632 RILDGSYMTQDLSFGPSNSESGSGSE------NSTVLSVSIADPYVLLGMSDGSIRLLVG 685
           ++L+           P  +E  +  +         + S SI DP+VL+   D S+ L +G
Sbjct: 654 KVLE-----------PDGTERQTIQDMDGKLLRPKIRSCSICDPFVLIIREDDSLGLFIG 702

Query: 686 DPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGA 742
           +     +  +  + + E + K ++ C      G      +TS     +T   + +   G+
Sbjct: 703 ETERGKIRRKDMSPMGEKTSKYLAGCFFTDTSGLFGQQFETSVPVEGATATLQNVVSGGS 762

Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
             G   Q   + ++    G +EI+ +P     F+V    S    +VD++ + AL  S   
Sbjct: 763 TSGGKPQHTQWLLLVRPQGVMEIWTLPKLTLAFSVSAVPSLFNVLVDSHDKPAL--SVPN 820

Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
                +   G+   E +   +V E           R  LF  L +G +  Y+A     P 
Sbjct: 821 PGDPPQRKPGEFDVEQVCVSRVGE-------DGRGRVCLFVFLRNGQLTIYEAL----PL 869

Query: 863 NTSKSDDPVSTSRSLSV 879
           +T+ S    S   ++ V
Sbjct: 870 STTASQPAASVDGAMDV 886


>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
            melanogaster]
          Length = 813

 Score =  110 bits (276), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 132/530 (24%), Positives = 217/530 (40%), Gaps = 81/530 (15%)

Query: 659  STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD--- 715
            S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D   
Sbjct: 10   SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKDLSG 69

Query: 716  ----KG----------------------PEPWLRKTSTDAWLSTGVGEAI------DGAD 743
                KG                       EP ++    +  L    G A       D A 
Sbjct: 70   LFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMADLAK 129

Query: 744  GGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTY 791
                   D +             VV  +SG LEI+ +P+   V+ V+   +G   + D  
Sbjct: 130  QSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLTDAM 189

Query: 792  MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTIL 851
              E +  S T   +S          ++ +S   +EL++     +  RP L  + T   +L
Sbjct: 190  --EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRVELL 246

Query: 852  CYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP- 910
             YQ  +F  P+   K        R L   N+   +  ++       D     E+    P 
Sbjct: 247  IYQ--VFRYPKGHLKI-----RFRKLDQLNLLDQQPTHIELDEN--DEQEEIESYQMQPK 297

Query: 911  -CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG 968
              Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN  +G
Sbjct: 298  YVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNG 357

Query: 969  FIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKP 1028
            F+Y  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y  E  +Y LI      +P
Sbjct: 358  FLYFDTTYELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE--EP 414

Query: 1029 LNQVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RATI 1084
            + +       D+E+  +       S D    Y +  ++E+ ++ P+     W+    A+I
Sbjct: 415  MTKYYRFNGEDKELSEE-------SRDERFIYPIGSQFEMVLISPET----WEIVPDASI 463

Query: 1085 PMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
              +  E+    ++V L +  T    +  L IGT +   ED+ +RG + ++
Sbjct: 464  TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 513


>gi|405121446|gb|AFR96215.1| cleavage and polyadenylation specific protein [Cryptococcus
           neoformans var. grubii H99]
          Length = 1431

 Score =  110 bits (276), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 160/726 (22%), Positives = 301/726 (41%), Gaps = 107/726 (14%)

Query: 45  ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
           + P  + IG   NLVV  A V+ ++ +R +     E +K  ++  E ++ V M+ +    
Sbjct: 39  DTPDVKVIG---NLVVAGAEVLRVFEIREESVPIIEKAKLEEDVAEGEKDVQMEEVGDGF 95

Query: 97  --------------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKI 142
                         +   L L+  + L+G V  LA  ++         D +I++F+DAK+
Sbjct: 96  FDDGHAERAPLKYQTTRRLHLLTQHELNGTVTGLAA-TRTLESTIDGLDRLIVSFKDAKM 154

Query: 143 SVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
           ++LE+  S   +   S+H +E    ++     +S+   PL++ DP  R   + +    + 
Sbjct: 155 ALLEW--SRGDIATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLWRLAVLTLPEDSLA 209

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
           +L   Q  S L    D    G    A    S V++L D+   +K+++D +F+ G+  P +
Sbjct: 210 VLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFLPGFHSPTI 265

Query: 261 VILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            +L     TW+GR+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS +
Sbjct: 266 ALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSEL 324

Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQNDV 376
           GG+++V +  I +  Q    A A  N   S  +S +   +S S  + L+ +   ++    
Sbjct: 325 GGIVIVTSTGIVHVDQGGRVAAACVNAWWSRITSLKCSTASVSQKLTLEGSRCVFVTPHD 384

Query: 377 ALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
            LL  + G +  +    +GR   V++ LD     P    SD+T  G+   F+GS  GDS 
Sbjct: 385 MLLVLQNGAVHQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSW 442

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
           L +          +   K+E  +++ D    + L    +DA  D    E+   +G A+  
Sbjct: 443 LAKVNVVRQVVERAEKKKDEM-EVDWD----EDLYGDINDAALDEKAQEQ---FGPAA-- 492

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSNYELV--- 542
                   + +  D L  +G + D  +G+  +           + +G S+ S + +    
Sbjct: 493 -------ITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTFNVFRRG 545

Query: 543 ----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA---RTMV 589
                     EL    G+W +      G      +     +   A +++S E    R   
Sbjct: 546 IPITKRRRFNELLNADGVWFLPIDRQTGQ-----KFKDIPEAERATMLLSSEGNATRVFA 600

Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
           L +     ++       + G+T++A   F R  ++ V      +LD +        G   
Sbjct: 601 LSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNN--------GKII 647

Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
                  +   +++ SI+DP+V++  +D S+   VGD    TV  + P   E       +
Sbjct: 648 QTVCPRGDGPKIVNASISDPFVIIRRADDSVTFFVGDTVARTVG-EAPIVSEGESPVCQA 706

Query: 710 CTLYHD 715
             ++ D
Sbjct: 707 VEIFTD 712


>gi|356527660|ref|XP_003532426.1| PREDICTED: disease resistance response protein 206-like [Glycine
           max]
          Length = 281

 Score =  110 bits (275), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 56/106 (52%), Positives = 76/106 (71%), Gaps = 8/106 (7%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
           MSFAAYKMM  PTGI NC  GF+THSR+D+VP    +Q +++D E PS+    +G +PNL
Sbjct: 1   MSFAAYKMMQCPTGIDNCAVGFLTHSRSDFVP----LQPDDIDVEWPSRPCHHVGSLPNL 56

Query: 59  VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELV 104
           +VT ANV+E+Y VR+QE+ S   K + +++   L+DGI  ASLELV
Sbjct: 57  IVTVANVLEVYAVRLQEDQSP--KAAIDSRSDTLLDGIVGASLELV 100


>gi|401889164|gb|EJT53104.1| cleavage and polyadenylation specific protein [Trichosporon asahii
           var. asahii CBS 2479]
          Length = 1358

 Score =  110 bits (275), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 171/682 (25%), Positives = 284/682 (41%), Gaps = 115/682 (16%)

Query: 45  ELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---------G 95
           E+P  + +G   NLVV     + ++ +R  EE +    +     ++  MD          
Sbjct: 39  EVPDVKVVG---NLVVAGGQDLRVFEIR--EESTPLPDDESAVPKQEDMDVGDSFFDSAP 93

Query: 96  ISAAS--------LELVCHYRLHGNVESLAILSQGGADNS-RRRDSIILAFEDAKISVLE 146
           I  A         L L+  + LHG V  LA L     D+S    D ++++FE AK S  +
Sbjct: 94  IERAPVRYKTTRRLHLLTRHTLHGVVTGLAGLRT--IDSSVDGLDRLLVSFEHAKWSRGD 151

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
                  +   S+H +E  + + +    + +   P+++ DP  R   + +    + +L  
Sbjct: 152 -------IATVSLHTYERCQQM-INGNFQGYV--PMLRSDPLSRLAILTLPEDALAVLPI 201

Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
            Q  S L   +D+  S                   ++K++KDF+F+ G+  P + +L   
Sbjct: 202 VQEQSELDAMQDSVSSP------------------EIKNIKDFLFLPGFHSPTIALLFAP 243

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
             TWAGR      T  +   +I T+    +PLI S   LP D+  L+A PS +GGV+VV 
Sbjct: 244 MNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLITSVTGLPSDSQYLVACPSEVGGVVVVT 303

Query: 326 ANTIHYHSQSAS-CALALN---NYAVSL--DSSQELPRSSFSVELDAAHATWLQNDVALL 379
           A+ I +  QS    + ++N   NY  ++  DSS E    S  + LD +HA ++  +  LL
Sbjct: 304 ASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSYE----SQKLALDNSHAQFVTENDMLL 359

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT-SDITTIGNSLFFLGSRLGDSLLVQFT 438
             +TG++  +    DGR V  + + + + +V   S +   G+   F+GS  GDSLL    
Sbjct: 360 VLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPPSTLVPAGSDGIFVGSVEGDSLLAMVE 419

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE---ELSLYGSASNNTE 495
                S      +EE        P TK+      D  +++  G     +      S    
Sbjct: 420 KARDQSA-----QEE--------PETKQQEMDVDDWDEEVATGPVTVSVKAQDVLSGIGR 466

Query: 496 SAQKTFSFAVRD-------SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK 548
            A   F  AV D        LV IG     S G  +N       I+K+  +E  +L    
Sbjct: 467 IADMEFGIAVTDLGTRTYPQLVCIG---GGSQGSTMNVFRRGIPITKRRLFE--QLRTAV 521

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
             W +  + +   NA   +     ++    +  + E  T +   +    +V E +  F +
Sbjct: 522 ATWFLPVERA---NAPKFKDIPESEQSTIAIAATQEGSTQIFALS--TRKVQERIAEFPE 576

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES-GSGSENS---TVLSV 664
              IA G    R R++ V      +LD            SN+   G+  E S    +++ 
Sbjct: 577 P-AIATGTWLRRTRIVLVLPSQVLLLD------------SNANPVGTICEMSDAPPIVAA 623

Query: 665 SIADPYVLLGMSDGSIRLLVGD 686
           SIADPYVL+  +DGS+ + VGD
Sbjct: 624 SIADPYVLIRRADGSVSVFVGD 645


>gi|328864890|gb|EGG13276.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
          Length = 1627

 Score =  110 bits (274), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 68/243 (27%), Positives = 114/243 (46%), Gaps = 38/243 (15%)

Query: 912  QRITIFKNISGHQGFFLSG-SRPCWCMVFRERLRVHPQ---------------LCDGSIV 955
            +RI  F NI   +G F+SG S P W    +   R+HP                     I 
Sbjct: 1018 RRIIPFSNIGNKRGIFVSGVSTPIWIFSEKNFPRIHPMKQQQQTTSSSSSSSSSSKRPIT 1077

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
             FT  HN+NC HGFIY    G+L IC+LP G+ Y+N WP++K + ++ T H+I+Y   + 
Sbjct: 1078 TFTTFHNINCKHGFIYFDHTGMLCICRLPDGTNYENEWPIRK-LAIRMTCHKISYHPVQK 1136

Query: 1016 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE-YEVRILEPDRA 1074
             Y L++S P     ++      ++E+             L +   +EE Y++++++P   
Sbjct: 1137 CYVLVLSYPQAPQSDEDEQEEQEREL-------------LKKPLVLEEKYQLKLIDP--- 1180

Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN----ETLLAIGTAYVQGEDVAARGRV 1130
               W    +  +   E  L  +++ L +    +     +  + +GTAY  GED   +GR+
Sbjct: 1181 ANNWNIIDSFSLAEKETVLCSKIIYLRHADESDIIPKLKPFVIVGTAYTHGEDTVCKGRI 1240

Query: 1131 LLF 1133
            L+F
Sbjct: 1241 LIF 1243


>gi|321260384|ref|XP_003194912.1| cleavage and polyadenylation specific protein [Cryptococcus gattii
           WM276]
 gi|317461384|gb|ADV23125.1| cleavage and polyadenylation specific protein, putative
           [Cryptococcus gattii WM276]
          Length = 1431

 Score =  109 bits (273), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 162/734 (22%), Positives = 302/734 (41%), Gaps = 123/734 (16%)

Query: 45  ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
           + P  + IG   NLVV  A  + ++ +R +     E  K  ++  E K+ V M+ +    
Sbjct: 39  DTPDVKVIG---NLVVAGAEALRVFEIREESVPIIEKVKLEEDVAEGKKDVQMEEVGDGF 95

Query: 97  --------------SAASLELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDA 140
                         +   L L+  + L+G V  LA     +   D     D +I++F+DA
Sbjct: 96  FDDGHAERAPLKYQTTRRLYLLAQHELNGTVTGLAATRTLESAIDG---LDRLIVSFKDA 152

Query: 141 KISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ 200
           K+++LE+  S   +   S+H +E    ++     +S+   PL++ DP  R   + +    
Sbjct: 153 KMALLEW--SRGDIATVSLHTYERCPQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDS 207

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEP 258
           + +L   Q  S L    D    G    A    S V++L D+   +K+++D +FV G+  P
Sbjct: 208 LAVLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFVPGFHSP 263

Query: 259 VMVILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
            + +L     TW+GR+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS
Sbjct: 264 TIALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPS 322

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQN 374
            +GG+++V +  I +  Q    A A  N   S  +S +   +S S  + L+ +   ++  
Sbjct: 323 ELGGIVLVTSTGIVHIDQGGRVAAACVNAWWSRITSLKCSMASVSQKLTLEGSRCVFVTP 382

Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLD-LSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
              LL  + G +  +    +GR V  ++ L K       SD+   G+   F+GS  GDS 
Sbjct: 383 HDMLLILQNGAVHQVRFSMEGRAVGLIEVLDKGCVVPPPSDLIVTGDGAVFVGSAEGDSW 442

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
           L +                            +R+ R+     +  V+ +E  LYG  ++ 
Sbjct: 443 LAKVNV-----------------------VRQRVERAEEKKDEMEVDWDE-DLYGDINDA 478

Query: 494 T--ESAQKTF-----SFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSN 538
              E AQ+ F     + +  D L  +G + D  +G+  +           + +G S+ S 
Sbjct: 479 ALDEKAQEQFGPAAITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNST 538

Query: 539 YELV-------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
           + +              EL   +G+W +      G      +     +   A +++S E 
Sbjct: 539 FNVFRRGIPITKRRRFNELLNAEGVWFLSIDRQTGQ-----KFKDIPEAERATILLSSEG 593

Query: 586 ---RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQ 641
              R   L +     ++       + G+T++A   F R  ++ V      +LD +  + Q
Sbjct: 594 NATRVFALSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNNGKIIQ 648

Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
            +         G G +   +++ SI+DP+ ++  +D S+   VGD    TV+ + P   E
Sbjct: 649 TV------CPRGDGPK---IVNASISDPFAIIRRADDSVTFFVGDTVARTVA-EAPIVSE 698

Query: 702 SSKKPVSSCTLYHD 715
                  +  ++ D
Sbjct: 699 GESPVCQAVEVFTD 712


>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
           owczarzaki ATCC 30864]
          Length = 1725

 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 76/261 (29%), Positives = 135/261 (51%), Gaps = 23/261 (8%)

Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKHHTCMISA 285
           R+  S+ I L +L   + HV D  F+ GY EP + +L E    +W GR   +  TC + A
Sbjct: 294 RLRPSYEIKLTELQRHIHHVIDIEFLTGYFEPTLALLFEPNAPSWTGRTVQRKDTCSMVA 353

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNN 344
           LSI+T+   HP++WS   LP ++ +++AVP P+ G ++V  + I + SQS+ +  ++LN 
Sbjct: 354 LSINTSSHSHPVVWSVDKLPFNSMRVMAVPRPVCGTVIVTPDAILHLSQSSPTVGVSLNE 413

Query: 345 Y-AVSLDSSQELPR------SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV 397
             ++S +    +P       SS    +      +L  +  L  T+ G++ + T++ +GR 
Sbjct: 414 LSSMSTELRLGIPENKHPDGSSVVYNMQEGRCCFLTPETLLAVTEGGEMFVATLLTEGRT 473

Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLF-FLGSRLGDSLLVQF----TCGSGTSMLSSGLKE 452
           V R+ +     SVL   +T++ N  + F+GSR  DS+L++     T  +    L+S   +
Sbjct: 474 VVRIRIEPAGASVLPCCMTSLYNGQYCFIGSRASDSVLLRVMNNATAAADKRRLASAALD 533

Query: 453 EFGDIEADAPSTKRLRRSSSD 473
           +F        + KR R S ++
Sbjct: 534 DFS-------ANKRSRSSDTN 547



 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 71/223 (31%), Positives = 113/223 (50%), Gaps = 23/223 (10%)

Query: 920  ISGHQ---GFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
            + GHQ   G F+ G RP W ++   R+ LR H  L DGS+ AF+  +N  C  GF+Y T+
Sbjct: 1218 LGGHQLCSGVFVCGRRPLWLLMSPTRKALRAHLMLTDGSVSAFSAFNNNACPGGFVYFTT 1277

Query: 975  QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLS 1034
            QG L+ CQL   + +DN WPV++V PL+AT H I Y      Y L+ S P  KP   +  
Sbjct: 1278 QGTLRFCQLAPTTNHDNPWPVRRV-PLRATAHYIGYHEVFRTYVLVTSHP--KPYFNLPR 1334

Query: 1035 LLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALT 1094
            L  D+            ++      T + + ++++ P      W++  +  + + E   +
Sbjct: 1335 LTNDETYTPVPYTPKPRAI----PATFDTFSLQLISPVT----WESIHSFDLPAFERVTS 1386

Query: 1095 VRVVTLFNTTTKENETLL----AIGTAYVQGEDVAARGRVLLF 1133
            V +  +   T++E  T L     IGT  ++GEDV   GR+++F
Sbjct: 1387 VDIAAI---TSQETVTGLKDYVVIGTTVIEGEDVTCHGRIIVF 1426



 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 66/253 (26%), Positives = 120/253 (47%), Gaps = 30/253 (11%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHS--RADYVPQIPLIQTEELDSELPSKRGIGPVPNL-- 58
           FA ++  H PT + +C     T++  R   V +  L++   +D+   S  G G    L  
Sbjct: 2   FAYFRQQHPPTAVEHCVEASFTNAAERQLVVARANLLEVYRIDAATAS--GSGWRSELSS 59

Query: 59  --VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAA---------SLELVCHY 107
              +TA     +++ R    G  +   S +    +    + +A          LELV  +
Sbjct: 60  GSALTAQTAGAMHLGRAAGYGGNDGGRSDDAATEINTRSLHSAPATPPALQHKLELVASF 119

Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            L GNVES+ +          +RDS++LAF++AK++V+++D +   L+  S+H +E    
Sbjct: 120 NLSGNVESIGVARLAHC----KRDSLLLAFKEAKVAVVDYDPATLDLKTISLHMYED--- 172

Query: 168 LHLKRGRESFARG----PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
           + ++ GR++ A      P+++VDP  +C   LVYG ++IIL   Q     + ++D + S 
Sbjct: 173 IEMRGGRDATALQAVWPPVIRVDPMRQCAAFLVYGTKLIILPFRQESH--LDEDDDYQSA 230

Query: 224 GGFSARIESSHVI 236
              +A +  S  I
Sbjct: 231 QAPAASVPPSAQI 243



 Score = 67.8 bits (164), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 57/192 (29%), Positives = 96/192 (50%), Gaps = 25/192 (13%)

Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET-ADLLTEVTE 601
           EL G +G+W+V+   S   +   + +++ D   H+ L+ S +  T+V  T  + L ++ E
Sbjct: 739 ELTGGRGLWSVF---STALDPSLAALSSLDGASHSLLVASRDDSTLVFTTTGEELEQIAE 795

Query: 602 SVDYFVQGRTIAAGNLF---GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
           S  +F  G TIA GN+F   G+  ++ VF  G R++DG  + Q+L     +S        
Sbjct: 796 S-GFFTAGATIAIGNVFAANGKILIVDVFAHGIRLVDGVNLRQELLLAQLSSV------- 847

Query: 659 STVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
           S ++  SIA+  VL   +DG++  +   GD      S  T AA     +PV + +LY D+
Sbjct: 848 SEIIHASIAESSVLALHADGAVSFVQFTGDTQELVASTATVAA----GQPVVAVSLYADR 903

Query: 717 G----PEPWLRK 724
                PE  L++
Sbjct: 904 SGLFVPEAVLQR 915


>gi|76157351|gb|AAX28300.2| SJCHGC08809 protein [Schistosoma japonicum]
          Length = 225

 Score =  107 bits (267), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 58/154 (37%), Positives = 91/154 (59%), Gaps = 5/154 (3%)

Query: 242 DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
            + +V D  F+HG+ EP +++L+E   TWAGRVS +  TC I ALS +   + +P+IW  
Sbjct: 52  KINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQ 111

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VSLDSSQELPR 357
            +LP D   ++ VP PIGGV+++ AN+I Y  Q+  SC+L LN YA    +    Q++P 
Sbjct: 112 ESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNCYAQISTNFPMRQDVP- 170

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
           S   + +D      L     L+ T++G+L LL++
Sbjct: 171 SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSL 204


>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
 gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
          Length = 1398

 Score =  106 bits (265), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 176/734 (23%), Positives = 287/734 (39%), Gaps = 96/734 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  +    +  R    +    A L L   Y + G + SL
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSTTAAQPDQKGRN---ERSQHAKLVLAAEYEVPGTITSL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +    + +    D+II++  +AK+S++E+D   HG+   S+H +E  E  H+      
Sbjct: 85  QRVKISNSKSGG--DAIIVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141

Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIES-- 232
               P  +  DP G C  +  +G+  + IL   Q G  LV D+      G  SA + S  
Sbjct: 142 LGSCPSSLTADPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGNDSADVVSDP 200

Query: 233 ----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
                           S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201 QKSAPENTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
             +      S  ++    K    + +   LP D +K++ +P P+GG L++G N  +H   
Sbjct: 261 IDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
              + A+ +N +A    +     +S   + L+      L +     LL    G + +L+ 
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIIEQLGSGTGDILLILADGRMSILSF 380

Query: 392 VYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
             DGR V  + L     ++N S+  +  T   ++G +  F GS  GDS+L+ ++  S T 
Sbjct: 381 KVDGRSVSGISLHFVAEQSNGSITIARPTCSASLGRNKLFCGSEEGDSILLGWSRPSSTI 440

Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES------AQ 498
              S  K   G  E  A           D   D +  ++L     AS   E       + 
Sbjct: 441 KRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSANLASTRQEKQVVNGDSP 498

Query: 499 KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTV 553
             F F   D L ++GP +D + G    + +     S       +EL   +G        V
Sbjct: 499 ADFIFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPEIAAPLELVAARGFGKSGGLAV 558

Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-------------MVLETADLLTEVT 600
             +    +  DS +M   DD Y  + I  ++ ++             ++ +T D   +  
Sbjct: 559 LKREIDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAKTKD--DDKE 613

Query: 601 ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
           ESV Y V                +  TI  G L    RV+QV     R  D  Y      
Sbjct: 614 ESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLAAGTRVVQVLRTEIRSYD--YNLGLAQ 671

Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAIES 702
             P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ  AA   
Sbjct: 672 IYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA--- 726

Query: 703 SKKPVSSCTLYHDK 716
           S K +S C LY DK
Sbjct: 727 SGKWISGC-LYEDK 739



 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 52/225 (23%), Positives = 102/225 (45%), Gaps = 19/225 (8%)

Query: 911  CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
            C+R+    ++ G++  F+SG  PC+ ++     R H     G  V + +  H   C  GF
Sbjct: 882  CKRLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 940

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
             YV    ++++ +LPS + +D+ W  +K I        I Y +    Y +  S       
Sbjct: 941  AYVDEDNVIRMSRLPSNTRFDSGWATRK-IAFGEQVDSIVYSSASECYVIGTSA------ 993

Query: 1030 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1089
             +   L  D E   +  N  ++ +       +E   V++LEP      W T  +  ++ +
Sbjct: 994  KEDFKLPEDDESHTEWRNEFITFLP-----QLERGTVKLLEPRN----WSTIDSHELEPA 1044

Query: 1090 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            E  + + V+ L  +  T E + ++ +G++ V+GED+  +G + +F
Sbjct: 1045 ERIMCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVF 1089


>gi|344229600|gb|EGV61485.1| hypothetical protein CANTEDRAFT_109087 [Candida tenuis ATCC 10573]
          Length = 1300

 Score =  106 bits (264), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 104/458 (22%), Positives = 200/458 (43%), Gaps = 45/458 (9%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  Y+L G + S+  +       + + D +++A + AKIS++ +D + H +R  S+H
Sbjct: 51  LNLVDQYKLFGTITSIKPIR---TIENPKLDYLLVATQLAKISLVRWDHASHSIRTVSLH 107

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            +E+   +      +      L+ V+P+  C  V    L   +             ++  
Sbjct: 108 YYEN---VIQTSTFDKLNSAELI-VEPKNACLCVRYKNLLTFLPFTRLKTEEDEYADEED 163

Query: 221 GS-GGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
           G+    +    +SS +IN ++LD +   + D  F+H Y +P + +L  ++  WAG + +K
Sbjct: 164 GAVTNSYDGIYDSSFLINGQNLDSRIGTIVDADFLHNYRQPTVALLSSKDQVWAGNLFFK 223

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
                   LS+    K+   +    +LP+D  +L+++PSP+ G L+VGAN  IH  +   
Sbjct: 224 KDNISYIVLSLDLNTKKSTTVLKIDDLPYDIDRLISLPSPLNGSLLVGANQLIHIDNGGI 283

Query: 337 SCALALNNYA--VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVY 393
           +  +++N +    + +S   +  S  ++ L+      L N+  +L    TG+  +LT   
Sbjct: 284 TRKISVNPFTDLTTKNSKNYINYSHMNLRLENCSVVPLPNENKVLVILSTGEFYMLTFEI 343

Query: 394 DGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
           DG+ ++RL       S               + N+L F+G++ G+S L+Q+         
Sbjct: 344 DGKTIKRLTFEVVETSRYNGINVTFPGQFAALDNNLLFVGNKNGNSPLIQYKY------- 396

Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
             G KE+   ++ DA   +           D    +  S            ++   F + 
Sbjct: 397 -EGAKEK--AVKEDAKDEEDNDGDEELYEDDEEKVKSFS------------KEKLDFTLC 441

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
           D L+N GP+  F++G   N    +  I+   NY+ V +
Sbjct: 442 DELINHGPISAFTFGFYSNEKFKSNLIN--PNYQEVSI 477


>gi|302652141|ref|XP_003017930.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
 gi|291181516|gb|EFE37285.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
          Length = 844

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 183/741 (24%), Positives = 288/741 (38%), Gaps = 110/741 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS       +  R    D    A L L   Y + G +  L
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSTTGTQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHL 170
             +      NS+   D+I+++  DAK+S++E+D   HG+   S+H +E  E     W+  
Sbjct: 85  QRVR---ISNSKSGGDAILVSSRDAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVP- 140

Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDEDT- 219
                S + G  + VDP G C  +  +G+  + IL   Q G  LV         GD+ T 
Sbjct: 141 --DLGSCSSG--LTVDPNGNC-AIFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTD 195

Query: 220 FGSGGGFSARIESSH--------VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
             S    SA   +SH        V+ +  LD  + H     F+H Y EP   IL+ +   
Sbjct: 196 MVSDAQKSAPGNTSHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVAR 255

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
                  +      S  ++    K    + +   LP D +K++ +P P+GG L++G N  
Sbjct: 256 STSLTIDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNEL 315

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
           +H      + A+ +N +A    +     +S   + L+      L +     LL    G +
Sbjct: 316 VHVDQAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRM 375

Query: 387 VLLTVVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
            +L+   DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+
Sbjct: 376 SILSFKVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLL 431

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE 495
            ++  S T+   S  K   G  E  A           D   D +  ++L     AS   E
Sbjct: 432 GWSRPSSTTKRPS--KSVDGVDENGAADLSDEADQDDDGDDDDMYEDDLYSVNPASTRQE 489

Query: 496 S------AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
                  +   F+F   D L ++GP +D + G    + +     S       +EL   +G
Sbjct: 490 KQVVNGDSPADFTFRAYDRLWSLGPYRDITLGKPSKSKSKDQQDSVPEIAAPLELVAARG 549

Query: 550 I-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-----------MVLETA 593
                  TV  +    +  DS +M   DD Y  + I  ++ ++            +L   
Sbjct: 550 FGKSGGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAK 606

Query: 594 DLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGS 637
               +  ESV Y V                +  TI  G L    RV+QV     R  D  
Sbjct: 607 SKGEDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD-- 664

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQ 695
           Y        P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ
Sbjct: 665 YNLGLAQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQ 722

Query: 696 TPAAIESSKKPVSSCTLYHDK 716
             AA   S K +S C LY DK
Sbjct: 723 GSAA---SGKWISGC-LYEDK 739


>gi|320583269|gb|EFW97484.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
           [Ogataea parapolymorpha DL-1]
          Length = 1309

 Score =  105 bits (262), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 107/465 (23%), Positives = 212/465 (45%), Gaps = 46/465 (9%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            L+L+  YRL+G + ++    +  ++ +   D +I++ + AK+SV+++D  +H +   S+
Sbjct: 50  KLQLIGEYRLNGQIINI---DKFRSNENESLDYLIVSTKLAKLSVIKWDSQLHAISTVSL 106

Query: 160 HCFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII--LKASQGGSGLVGD 216
           H +++  + L +++  ++  +    + DP   C  + +  L   +   K       L  D
Sbjct: 107 HYYDTALDALTVEKLEKTSVQH---RTDPNSLCTCLRLNELFTFLPFYKEYLDEEELKDD 163

Query: 217 EDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERE-LTWAGR 273
            +              S ++N   L  D+K++ D+ F+H Y +P M IL+  E +TWAG 
Sbjct: 164 AEEAKDIKKRKKLFTESFILNASSLYPDIKNIVDYQFLHSYRDPTMAILYAPETMTWAGH 223

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYH 332
           +     T  +  LS+    K+   I    NLP+D   +  + SP  G L+VG+N  IH +
Sbjct: 224 LPKAKDTLKVIVLSLDLENKKASAIMELTNLPYDVDYIYPLESPTNGFLLVGSNEIIHVN 283

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
           S  +   +  N Y   + + +   +SS  + L+ +    ++ D  L+ T++G+   L   
Sbjct: 284 SLGSVRGIYTNEYFTDISNLKLKDQSSLGLMLENSRVGLVKEDQVLIITESGEFYQLNFE 343

Query: 393 YDG-----RVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
             G       +Q+++ S       N  ++ + + ++   LFF+  + GDS L++ +  SG
Sbjct: 344 KIGGNSTITGLQKVETSNYKGIIVNHPIMITSVPSL--DLFFVCCQGGDSSLIRISSKSG 401

Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
             +L    KE+ GD +        L            + E+   + S+  N++       
Sbjct: 402 --VLPQETKEQNGDTKETKDDDDWL-----------YDEEDQKSHKSSLVNSQ------- 441

Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
           F   D+++N GPL DF+ G R++ +    G+   +  E V +  C
Sbjct: 442 FKKMDNILNCGPLVDFTLG-RVSIEQKIMGLPNPNYNEDVLVAAC 485



 Score = 60.8 bits (146), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 68/297 (22%), Positives = 112/297 (37%), Gaps = 44/297 (14%)

Query: 837  SRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTP 896
            S+ +L A+   G +L Y+ +            DP+  +  L   N            + P
Sbjct: 750  SKDYLVALTFGGEVLIYETFF-----------DPIERTYKLMKIN---------EMCQFP 789

Query: 897  LDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV 955
            +         H    +R  I   N  G++   ++G+     +     +    Q    S +
Sbjct: 790  IVGAPDNSYAHATKIERYLISVDNFQGYKAVLVTGASAFVILKEYNSIPRMLQFTKRSSL 849

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
             F   +   C +G I +      +ICQL S  TY N  P+ K      T ++I Y +  N
Sbjct: 850  YFAEYNTDRCPNGVISIDETKACRICQLDSSYTYSNRLPIAKYKIGDKTINKIRYHSLSN 909

Query: 1016 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQI----DNHNLSSVDLHRTYTVEEYEVRILEP 1071
             Y  I+S     P N V       E G  +    D+  L S  L  T       V ++ P
Sbjct: 910  TY--IISTLEEGPYNPV------DEDGEPLPGLRDDRKLKSTSLKGT-------VHLVSP 954

Query: 1072 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG 1128
                  W    TI ++ +E   ++ V+ L  + T   +T++ IGTA  + ED+A  G
Sbjct: 955  ----ANWTIIDTIELEDNEYVTSIEVIELKVSETIATKTVVLIGTARCRNEDLATHG 1007


>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
 gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
          Length = 1370

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 181/736 (24%), Positives = 287/736 (38%), Gaps = 100/736 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  +    +  R    D    A L L   Y + G +  L
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
             +      NS+   D+I+++  +AK+S++E+D   HG+   S+H +E  E        +
Sbjct: 85  QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVPD 141

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
             +    + VDP G C  +  +G+  + IL   Q G  LV D+               D 
Sbjct: 142 LGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDLVSDA 200

Query: 220 FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
             S  G +A  +    S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201 QKSAPGNTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
             +      S  ++    K    + +   LP D +K++ +P P+GG L++G N  +H   
Sbjct: 261 IDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
              + A+ +N +A    +      S   + L+      L +     LL    G + +L+ 
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADHSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380

Query: 392 VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
             DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+ ++  
Sbjct: 381 KVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLLGWSRP 436

Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
           S T+   S  K   G  E  A           D   D +  ++L     AS   E     
Sbjct: 437 SSTTKRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASTRQEKQVVN 494

Query: 497 --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
             +   F+F   D L ++GP +D + G    + +     S       +EL   +G     
Sbjct: 495 GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQQDSVPEIAAPLELVAARGFGKSG 554

Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
             TV  +    +  DS +M   DD Y  + I  L+ +   T +  + D   LL +     
Sbjct: 555 GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVLDPKSKDTGLSRSYDKYLLLAKAKGED 611

Query: 601 --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
             ESV Y V                +  TI  G L    RV+QV     R  D  Y    
Sbjct: 612 KEESVVYSVGSSGLDSIDTPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD--YNLGL 669

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
               P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ  AA 
Sbjct: 670 AQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726

Query: 701 ESSKKPVSSCTLYHDK 716
             S K +S C LY DK
Sbjct: 727 --SGKWISGC-LYEDK 739


>gi|390358537|ref|XP_001201130.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Strongylocentrotus purpuratus]
          Length = 283

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 126/273 (46%), Gaps = 51/273 (18%)

Query: 3   FAAYKMMHWPTGIANC-GSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVT 61
           +A Y+ +H PTG+ +C    F +                      P ++      NLVV 
Sbjct: 2   YAFYREIHPPTGVEHCVYCHFFS----------------------PDQQ------NLVVA 33

Query: 62  AANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ 121
             + + +Y +   +      K+S    +           LE    + + G V S+    Q
Sbjct: 34  KGSELTVYSMITVDSNKPTDKDSKPKNK-----------LEEAATFHIFGKVMSM----Q 78

Query: 122 GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGP 181
                   RD+++L+F +AK+S++E+D ++H L+  SMH FE  E    K G       P
Sbjct: 79  SAQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDE---TKEGVYRNIFHP 135

Query: 182 LVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL 241
           +VKVDP  RC  +L YG ++++L   +   GLV D D   S       +  S+VI L ++
Sbjct: 136 VVKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEM 193

Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
           D  + +V D  F+HGY EP ++IL+E   TWAG
Sbjct: 194 DDPICNVLDIQFLHGYYEPTLLILYEPLRTWAG 226


>gi|328848896|gb|EGF98089.1| hypothetical protein MELLADRAFT_96156 [Melampsora larici-populina
           98AG31]
          Length = 1427

 Score =  104 bits (260), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 193/933 (20%), Positives = 350/933 (37%), Gaps = 171/933 (18%)

Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
           V  ++LHG V  L  ++     +    D ++++F+DAKI++LE+      L   S+H FE
Sbjct: 73  VLEHQLHGIVTGLQPITTIDT-HVDGLDRLLVSFKDAKITLLEWSHQQSDLVPISLHTFE 131

Query: 164 SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
               +        F +   ++ DPQ RC  + +    + +L   Q  +    D +T  S 
Sbjct: 132 KLPQITQGDFPTIFDQ---LETDPQSRCAILKLPQSTIAVLPFFQENN---LDLETLFSN 185

Query: 224 GGFSA---RIES-----SHVINLRD------------------LDMKHVKDFIFVHGYIE 257
              SA   RI+S     S +I+L                      +K +  F F+ G+ +
Sbjct: 186 SNPSANNQRIQSFPYAPSFIIDLNQSQSFKSQTQTHSQTQTQQKSIKSIISFKFLPGFSQ 245

Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
           P + IL+  + TWAGR+     +C +  +++  +     +I+   NLP+ A+ ++A P  
Sbjct: 246 PTLAILYTYQHTWAGRLENTTDSCSLIFITLDLSSNHFTIIFQIDNLPYHAHSIMACPKE 305

Query: 318 IGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP---------------RSSFSV 362
           +GGVLV+ A+ I +  QS+       N    L +  ++P                    V
Sbjct: 306 VGGVLVICADMILHIDQSSKLIGIATNGWSKLSTHLDVPTQQMVKIVTEDGQDQEERLKV 365

Query: 363 ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDITTIGNS 421
            L+ +   ++  D AL+    G +  L +  DGR + +L L K    SV+ S    I + 
Sbjct: 366 RLENSKLVFVTIDRALMFLTDGQIFRLCLYQDGRTLIKLCLEKFPVVSVIPSVAVKISDH 425

Query: 422 LFFLGSRLGDSLLVQFTC------------------------GSGTSMLSSGLKEEFGDI 457
             F+GS LGDS+++                            G+   +  +   E +G  
Sbjct: 426 SVFVGSMLGDSIVMGIEFEGEKEVEVVEEVEVEVEAEVVHQNGNEMEIDQAEEDEIYGKE 485

Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
           E D   TK       D +  ++           + N +  ++  S  + DS+   GP++D
Sbjct: 486 EPDDKKTK-----DQDGIDSIIK----------ATNKKIHREIRSLRLHDSISGHGPIRD 530

Query: 518 FSYGLRINADASATGISKQSNYE-LVELPGCKGI-----WTVYHKS---SRGHNADSSRM 568
           F+             +SK   +E  +E+ GC G       T+++K     +    DS+  
Sbjct: 531 FT-------------MSKIGGFEDSLEMVGCTGSGETGGLTIFYKEMPLMKRKKLDSTNE 577

Query: 569 A---------AYDDEYHA----YLIISLEARTMVLETADLLTEVTESVDY----FVQGRT 611
           +         A++D   +       IS+  RT +        E   + D      +   T
Sbjct: 578 SMKITNLNSIAFNDPTGSPGCELAWISIHDRTKIFSMIKNPEEGNRTSDLKFMKTLNAST 637

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           I     F +   +Q+     ++L      +     P  +E    ++ + ++   +   Y+
Sbjct: 638 IYVAMFFDQTCFLQITSYEIKLLKVVGFGEVQVIRPIETE----NKKNKIIRAKVVQDYI 693

Query: 672 LLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
           LL  SD  + L  G   + T+  +Q P       KPV+  +L+    P  +     T+  
Sbjct: 694 LLETSDHRVMLYKGQVDSLTIDRIQLPQL----SKPVTYASLFSAHLP-LYDHDDQTN-- 746

Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF-----VSGRT 785
              G+G   D     P      +  V    G L I  +P    VFTV        +    
Sbjct: 747 ---GIGLDNDEDAEKP------WLFVTDLGGVLHILSLPELEIVFTVKGIENLPDLLDED 797

Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
              +   + A++    + +   EE      KEN     +         A  +RP L+  L
Sbjct: 798 EDEEQQQQPAIEYEHEDGDVKMEEDEKVEPKENSSIQMIYGFVT---GAKVARPHLYVEL 854

Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF-SRTPLDAYTREE 904
            +G +  YQ  +        K  DP ++       ++  +++   +F S  P+    R  
Sbjct: 855 NNGALAVYQISI----AYDRKPGDPSTSKPRRQALSIRLNKVLGYQFESSEPISNLDR-- 908

Query: 905 TPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM 937
                   ++ + K  +   G  LSG  P W +
Sbjct: 909 --------KVKVVKKNATFSGIHLSGLEPIWIV 933


>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 1348

 Score =  104 bits (260), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 162/745 (21%), Positives = 298/745 (40%), Gaps = 116/745 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  + +  +  R    D    A L LV  Y++ G +  L
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSSLANHPDQKSRH---DRSQHAKLVLVAEYQVSGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
             +      NS+   D+I+++  +AK+S++E+D   HG+   S+H +E  E        +
Sbjct: 85  ERVK---ISNSKSGGDAILVSSRNAKLSLIEWDPRNHGISTISIHYYEGEESHMSPWVPD 141

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE----------------- 217
             +    + VDP G C  +  +G+  + IL   Q G  LV D+                 
Sbjct: 142 LGSCASNLTVDPNGNCA-IFNFGIHSLAILPFHQTGDDLVMDDYDSVLNGDSAADTINDT 200

Query: 218 --DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
              T G     S   E S V+ L  LD  + H     F+H Y EP   IL+ +       
Sbjct: 201 QKPTAGDSTVHSKPYEPSFVLPLAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
              +      +  ++    +    + +   LP D +K++++P P+GG L++G N  +H  
Sbjct: 261 SIDRKDVVSYAIFTLDLQQRASTSLLTVSRLPSDMFKVVSLPPPVGGALLIGTNELVHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L +D    LL    G + +LT
Sbjct: 321 QAGKTNAVGVNEFARQASAFSMVDQSDLEMRLEDCVVEQLGSDAGEVLLILTDGRMAILT 380

Query: 391 VVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGT 443
              DGR V  + L     ++  S++ +  +    +G S  F GS  GDS+L+ ++  S  
Sbjct: 381 FKVDGRSVSGISLHYVAEQSGGSIIKARPSCSAGLGRSKLFCGSEEGDSILLGWSKPSSN 440

Query: 444 SMLSSGLKE---EFGDIEADAPSTKRLRR-----------SSSDALQD--MVNGEELSLY 487
           +   +   E   E G  E      +               + +  LQ+  +VNG++ +  
Sbjct: 441 TKKPTKANEDTNEDGTTEFSGEDEQDDDDDDIYEDDLYSANPAPTLQEKRVVNGDDTA-- 498

Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------LRINAD-----------ASA 530
                        F F + D L ++GP +D + G      L+   D            +A
Sbjct: 499 ------------DFVFKIHDRLWSLGPFRDITLGRPPKSKLKDKRDNVPSISASLELVAA 546

Query: 531 TGISKQSNYEL------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
            G  K     +            +++    G+W++     +   A ++       +Y  Y
Sbjct: 547 RGFGKSGGLAVLKREIDPFTIDSLKMDNVYGVWSIRVTDPKSKEASAT---GNSRDYDKY 603

Query: 579 LII-----SLEARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGAR 632
           L++     S +  ++V    +   +  ++ ++   +  TI  G L    RV+QV     R
Sbjct: 604 LLLAKAKCSDKEESVVYSVGNSGLDSIDAPEFNPNEDCTIDIGTLAAGSRVVQVLRTEIR 663

Query: 633 ILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
             D +  +TQ       ++     SE  TV+  S A+PY+L    D S+ +L  D +   
Sbjct: 664 SYDYNLGLTQIYPVWDEDT-----SEERTVVQASFAEPYLLAIRDDHSLLVLQADKTGDL 718

Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
             V+    + +S   VS C LY D+
Sbjct: 719 DEVEI-QGLATSADWVSGC-LYEDR 741



 Score = 70.5 bits (171), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 56/243 (23%), Positives = 104/243 (42%), Gaps = 37/243 (15%)

Query: 904  ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM--------VFRERLRVHPQLCDGSIV 955
            E  H  P + +    +I G++  F+ G  PC+ +        V R R +        ++ 
Sbjct: 820  EGKHPFPRKPLRALSDICGYKTVFMPGQNPCFILKSAITQPHVLRLRGK--------AVQ 871

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
            + +  H   C  GF YV    I+++ +LPS + +D+ W  +K IPL      I Y +   
Sbjct: 872  SLSGFHIAACERGFAYVDEDNIIRMSRLPSNTRFDSTWATRK-IPLGEQVDCIVYSSASE 930

Query: 1016 LYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAG 1075
             Y +  SV       +   L  D E   +  N  ++ +       +E   V++L+P    
Sbjct: 931  SYVIGTSV------KEDFKLPEDDESHTEWQNEFITFLP-----QLERGTVKLLDPKN-- 977

Query: 1076 GPWQTRATIP----MQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRV 1130
              W      P    ++ +E    + V+ L  +  T E + ++ +G+A V+GED+  +G +
Sbjct: 978  --WSIADIAPSSHELEPAERITCIEVIRLEISEITHERKDMVVVGSAIVKGEDIVPKGCI 1035

Query: 1131 LLF 1133
             +F
Sbjct: 1036 RVF 1038


>gi|348679451|gb|EGZ19267.1| hypothetical protein PHYSODRAFT_492468 [Phytophthora sojae]
          Length = 736

 Score =  104 bits (259), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 111/427 (25%), Positives = 169/427 (39%), Gaps = 102/427 (23%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+L++   V D  F+ GY+EP +++LHE   + +  GR++    T  I+ +SI+  
Sbjct: 261 LLRLRELEITGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 320

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
            + HP IW+  NLP D +KL+   +P+GGV+V+ AN   Y +Q+    LA N     L  
Sbjct: 321 TRLHPKIWTVKNLPSDCFKLIPCRAPLGGVVVLSANAFLYFNQTQFHGLATN----VLRE 376

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL-DLSKTNPSV 410
             +   +  ++ L      +L     LL+   GD  +L++ Y+ R V+    + KT    
Sbjct: 377 QDDHEMAQLNIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDRSVRFWRSIKKT---- 432

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF------TCGSGTSMLSSG----LKEEFGDIEAD 460
                        F+GSR GDS+L         + G   S L       +KEE    E  
Sbjct: 433 ------------LFVGSRSGDSVLYALDQKKLTSAGGEASKLQEDEEMLIKEEVVKEEVT 480

Query: 461 -----------------------APSTKRLRRSSSDALQDMVNGEELSLYGSASN----N 493
                                  AP+ +    S S    + VNG   S      N     
Sbjct: 481 AEVKAEPAEEEEEDEDDLFLCGAAPTKEEPTTSGS---TEAVNGTNGSAVKKEENGHAVE 537

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV----------- 542
            ES    +     D L +IG +      +  NAD      S +   ELV           
Sbjct: 538 EESGPYDYVLHQIDVLPSIGQITSIELSIENNAD------SNEKREELVISGGYEHSGAI 591

Query: 543 ---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                          EL GC+ +WTV         +   R       Y+AYLI+S+  RT
Sbjct: 592 SVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSSDGR------SYNAYLILSVAHRT 645

Query: 588 MVLETAD 594
           MVL T +
Sbjct: 646 MVLRTGE 652


>gi|406699110|gb|EKD02327.1| cleavage and polyadenylation specific protein [Trichosporon asahii
           var. asahii CBS 8904]
          Length = 1339

 Score =  104 bits (259), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 137/528 (25%), Positives = 223/528 (42%), Gaps = 80/528 (15%)

Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
           P+++ DP  R   + +    + +L   Q  S L   +D+  S                  
Sbjct: 157 PMLRSDPLSRLAILTLPEDALAVLPIVQEQSELDAMQDSVSSP----------------- 199

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK-QHPLIW 299
            ++K++KDF+F+ G+  P + +L     TWAGR      T  +   +I T+    +PLI 
Sbjct: 200 -EIKNIKDFLFLPGFHSPTIALLFAPMNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLIT 258

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALN---NYAVSL--DSSQ 353
           S   LP D+  L+A PS +GGV+VV A+ I +  QS    + ++N   NY  ++  DSS 
Sbjct: 259 SVTGLPSDSQYLVACPSEVGGVVVVTASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSY 318

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT- 412
           E    S  + LD +HA ++  +  LL  +TG++  +    DGR V  + + + + +V   
Sbjct: 319 E----SQKLALDNSHAQFVTENDMLLVLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPP 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           S +   G+   F+GS  GDSLL         S      +EE        P TK+      
Sbjct: 375 STLVPAGSDGIFVGSVEGDSLLAMVEKARDQSA-----QEE--------PETKQQEMDVD 421

Query: 473 DALQDMVNGE---ELSLYGSASNNTESAQKTFSFAVRD-------SLVNIGPLKDFSYGL 522
           D  +++  G     +      S     A   F  AV D        LV IG     S G 
Sbjct: 422 DWDEEVATGPVTVSVKAQDVLSGIGRIADMEFGIAVTDLGTRTYPQLVCIG---GGSQGS 478

Query: 523 RINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
            +N       I+K+  +E  +L      W +  + +   NA   +     ++    +  +
Sbjct: 479 TMNVFRRGIPITKRRLFE--QLRTAVATWFLPVERA---NAPKFKDIPESEQSTIAIAAT 533

Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
            E  T +   +    +V E +  F +   IA G    R R++ V      +LD       
Sbjct: 534 QEGSTQIFALS--TRKVQERIAEFPEP-AIATGTWLRRTRIVLVLPSQVLLLD------- 583

Query: 643 LSFGPSNSES-GSGSENS---TVLSVSIADPYVLLGMSDGSIRLLVGD 686
                SN+   G+  E S    +++ SIADPYVL+  +DGS+ + VGD
Sbjct: 584 -----SNANPVGTICEMSDAPPIVAASIADPYVLIRRADGSVSVFVGD 626


>gi|452841862|gb|EME43798.1| hypothetical protein DOTSEDRAFT_79774 [Dothistroma septosporum NZE10]
          Length = 1347

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 225/1103 (20%), Positives = 416/1103 (37%), Gaps = 188/1103 (17%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            L LV  Y L G V +LA +     D     D+++LAF+DAK++++E+D   H +   S+H
Sbjct: 51   LVLVGEYSLSGTVTNLAQVKL--PDTKTAGDALLLAFKDAKLTLIEWDPENHRISTISIH 108

Query: 161  CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
             +E    +    G        ++ VDP  RC  +     Q+ +L   Q    L  +ED  
Sbjct: 109  YYEGDNVVSQPFGPGLGECENILTVDPNWRCAALKFGTRQLAVLPFRQLDDELGVEEDGD 168

Query: 221  GSGGGFSAR-----------------IESSHVINLRDL--DMKHVKDFIFVHGYIEPVMV 261
                  + +                  ++S V+ L  L  D+++  D  F++GY E  + 
Sbjct: 169  AEPASTTLKRSESILQNVNGEVQQTPYKASFVLALSTLLEDIRYTVDLGFLYGYRESTLG 228

Query: 262  ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
            IL       +  +  +          +     +   +     LP+  +K++ +P+P+GG 
Sbjct: 229  ILSSSLQPSSSLLDIRKDELEYRMFKLELEQGESTELQVVKQLPNSLWKVVPLPAPVGGA 288

Query: 322  LVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP-RSSFSVELDAAHATWL--QNDVA 377
            L+VG N+ +H    +   ++A+N +A +L+S + +  +S  +++L+      L  ++   
Sbjct: 289  LLVGTNSFVHVDLNAKVNSVAVNEFA-ALESDRGMEDQSDLNLKLEGCSVEILDAESRQV 347

Query: 378  LLSTKTGDLVLLTVVYDGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLG 426
            L+  + G L  +     GR +Q L           DL KT PS     +  + ++  F+G
Sbjct: 348  LVVLRDGSLATIYFEQSGRSIQGLKVSRVREEHGGDLVKTAPSC----VARLDHNKVFVG 403

Query: 427  SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
            S  G S LV+++     S LS   K   G +               D         E   
Sbjct: 404  SEDGASSLVRWS--RSISTLSR--KRTHGQMLGQHGDEDDEEALEDDDDDLYDAAPETK- 458

Query: 487  YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ATGISKQS--- 537
               A++ T++ +   SF ++D L ++GP+ D   G    A          TG  + S   
Sbjct: 459  -KRATSTTDAFETPPSFQIQDVLHSLGPINDVCLGKSDGAQVDKLQMMLGTGRGRSSRIS 517

Query: 538  --NYELVELPG-------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
              N ++V +          K  W V+ K +             DD++H  L+ + + +  
Sbjct: 518  CLNRDIVPVSARKSTIGRAKSAWAVHAKRND-----------RDDDFHDNLLFAYDGQET 566

Query: 589  VLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFG 646
             +   D +  +  +   F  +G TI    L     V+Q  +   R  D    ++Q +   
Sbjct: 567  KIYDVDEVGYMERTAQEFEHEGETIDVQMLAKDTIVVQCRKSEIRTYDADLALSQIIPMV 626

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
               ++     E   ++ +S  DPY+L+  +D                             
Sbjct: 627  DEETD-----EEYEIVYLSFCDPYLLVVRND----------------------------- 652

Query: 707  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIF 766
             SS  + H +G E    +   D      +G +I     G L +      +    G + +F
Sbjct: 653  -SSIQVLHVRGKEIEPLEGEGDIAEKKWLGGSIHT---GSLTKDVPALFLLSAQGTMHVF 708

Query: 767  DVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVE 826
             +P+   V                Y   AL      ++S + +    G KE +  + V E
Sbjct: 709  SLPSLEPV----------------YHAPALPHLPPVLSSDAPQRRA-GPKEALTELLVAE 751

Query: 827  LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
            L     ++    P+L A      ++ Y+   F  PE       P  + +  +        
Sbjct: 752  LG----ASGVDTPYLVARTALDDLVLYEP--FRHPE-------PAPSDQWYT-------- 790

Query: 887  LRNLRFSRTPL-------DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
              NLRF + P+       +A  +EE+    P + I    ++  +    + GS P   ++ 
Sbjct: 791  --NLRFRKVPVTYIPKYNEAIAQEESTRPLPLRSI----HVGDYDAVTIPGSPPL--LLV 842

Query: 940  RER------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
            +E       L V        +     +H  +C  GF  V + G+L+   LP  + Y   W
Sbjct: 843  KEASSLPRVLEVRISNESNRVATLLPIHLDHCKKGFAAVNADGLLEEYHLPLSAWYGTGW 902

Query: 994  PVQKVI--PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051
             VQ+V           + Y   + +Y +     V     +     + Q  G Q D     
Sbjct: 903  SVQQVDLGSEDLEVRHLAYHETRGVYVVATCKDVDFYFAEDDHRHLGQSGGGQDDITLRP 962

Query: 1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL 1111
             V  +  + V     R+++         +RA +P   +  AL V  + + +  T E +  
Sbjct: 963  QVKQYSIHLVSSKTHRVID---------SRA-MPYLEAITALQVMPLEV-SELTHEQDLR 1011

Query: 1112 LAIGTAYVQGEDVAARGRVLLFS 1134
            + + TA ++GED+ ARG +++F+
Sbjct: 1012 ILVSTAAMRGEDMPARGAIIVFN 1034


>gi|159470705|ref|XP_001693497.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283000|gb|EDP08751.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 461

 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/164 (34%), Positives = 97/164 (59%), Gaps = 7/164 (4%)

Query: 230 IESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
           + +S+V+NL + + ++ V+D +F+HGY EPV+++LHE + TWAGR+  +  TC ++A+S+
Sbjct: 128 VGNSYVLNLHKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWAGRLRERKDTCCLTAISV 187

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAV 347
           S  LK+H ++W A  LP+D Y+LL +P      LV+  + +   SQ++   A ALN+ A+
Sbjct: 188 SLRLKRHTVLWRAAGLPYDCYRLLPLPQ-RPAALVLSPSLVMLTSQASQPQAAALNSTAL 246

Query: 348 SLDSSQEL----PRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
             ++   L     R + SV      A +  ND A    ++  LV
Sbjct: 247 PGEAPPPLVFDPAREAPSVTAARMAAEFALNDCAPALGRSAALV 290


>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus flavus NRRL3357]
 gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus flavus NRRL3357]
          Length = 1204

 Score = 99.0 bits (245), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/355 (25%), Positives = 159/355 (44%), Gaps = 40/355 (11%)

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
           ++I+LAF +AK++++E+D   +G+   S+H +E  +        +  + G ++ VDP  R
Sbjct: 88  EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147

Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
           C  V  +G++ + IL   Q G  LV D+      +  GS G                 A 
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206

Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
             SS V+ L  LD  + H     F++ Y EP   IL+ +  T    +  +      +  +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
           +    +    + S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326

Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
             + S     +S  ++ L+      L   N   LL   TG++VL+    DGR V  + + 
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386

Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
              P           S    +G+   FLGS   DS+L+      G S+ SSG K+
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKK 435



 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 57/247 (23%), Positives = 104/247 (42%), Gaps = 20/247 (8%)

Query: 898  DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
            D  + EE     P   + I  NISG    F  G  P + +           L  G   + 
Sbjct: 678  DQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKGGYAQSL 734

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLY 1017
            +      C  GFI + S+G++ +CQ+P G   D  W +Q+ IP+      + Y +   +Y
Sbjct: 735  SKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQ-IPIGEQVDHLAYSSSSGMY 793

Query: 1018 PLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGP 1077
             +  S            L  D E+  +  N   S         V+   ++++ P      
Sbjct: 794  VIGTS------HRTEFKLPEDDELHPEWRNEMTSFFP-----EVQRSSLKVVSPKT---- 838

Query: 1078 WQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
            W    +  +  +E+ + V+ ++L  +  T E + ++ +GTA+ +GED+A+RG V +F   
Sbjct: 839  WTVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIASRGCVYVFEVI 898

Query: 1137 RNADNPQ 1143
            +   +P+
Sbjct: 899  KVVPDPK 905


>gi|150951283|ref|XP_001387581.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
           polyadenylation factor II complex, subunit CFT1 (CPSF
           subunit) RNA processing and modification
           [Scheffersomyces stipitis CBS 6054]
 gi|149388465|gb|EAZ63558.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
           polyadenylation factor II complex, subunit CFT1 (CPSF
           subunit) RNA processing and modification
           [Scheffersomyces stipitis CBS 6054]
          Length = 1341

 Score = 97.4 bits (241), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 98/415 (23%), Positives = 183/415 (44%), Gaps = 55/415 (13%)

Query: 55  VPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
           V +LVV  A +++I+ V VQ + S  SK                  L+L+  ++LHG + 
Sbjct: 26  VKHLVVGKATLLQIFEV-VQLKSSTPSK--------------PQHRLKLIDQFKLHGLIT 70

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            +  +    + N    D ++++ + AK SV+++D  +H +   S+H +E+          
Sbjct: 71  DIKPIRTVESPNF---DYLLVSTKSAKFSVIKWDHHLHTISTVSLHYYENAIQ---NSTY 124

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGL----------QMIILKASQGGSGLVGDEDTFGSGG 224
           E  ++  L+ ++P G C  +    L          ++    A      +V  E      G
Sbjct: 125 EKLSKSELL-LEPYGSCSCLRFKNLLCFLPFETAEELDDDDADSENEDMVKSEKKEHENG 183

Query: 225 GFSARI--------ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
             +  +        ++S +I+ + LD  +  + D  F+  Y EP   IL +R+  WAG +
Sbjct: 184 TVNVPVTDQPGSFFDTSFLIDGQSLDSSIGSIIDMQFLFKYREPTFGILSQRQQAWAGNL 243

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
                      L++  T K    +    NLP+D  +++ +PSP+ G L++G N  IH  +
Sbjct: 244 PKIKDNVQFCILTLDLTTKSTVSVLKIDNLPYDVDRIVPLPSPLNGCLLLGCNEIIHVDN 303

Query: 334 QSASCALALNNYAVSLDSSQEL--PRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLT 390
                 +A+N +   + +S +    ++  +++L+      L ND  ALL   TG+   L 
Sbjct: 304 GGIVRRIAVNQFTSLITASTKAYQDQTHLNLKLEDCSVVALPNDHRALLVLSTGEFYYLN 363

Query: 391 VVYDGRVVQRLDLSKTNPSVLTSD--------ITTIGNSLFFLGSRLGDSLLVQF 437
              DG+ +++  +   +  +L SD        I T+ N+L F  +  G+S LVQF
Sbjct: 364 FEVDGKSIKKFTIESVD-KLLYSDIKLTFPGQIATLDNNLLFFANHNGNSPLVQF 417



 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 67/306 (21%), Positives = 126/306 (41%), Gaps = 53/306 (17%)

Query: 836  HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
            H   +L  +   G +L Y+ Y F+G                    N    + ++L  +  
Sbjct: 786  HKEEYLTILTIGGEVLLYKLY-FDG-------------------ENYEFKKEKDLAITGA 825

Query: 896  PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
            P +AY     P G   +R +  F N++G+   F++G  P   +     +    Q      
Sbjct: 826  PENAY-----PIGTAVERRLAYFPNLNGYTCIFVTGVTPYLILKSLHSIPRIYQFSKIPA 880

Query: 955  VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEK 1014
            V+ +  H+    +G I++ +Q   +ICQLP    Y+N WP+ K+I +  +   ITY    
Sbjct: 881  VSISPFHDSKVANGLIFLDNQQNARICQLPLDFNYENTWPM-KLIHIGESIRAITYHESS 939

Query: 1015 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTY---TVEEYEVRILEP 1071
            + Y       V+     +     D+E G  I       V LH+     +  +  ++++ P
Sbjct: 940  HTY-------VVSTFKDIDYECFDEE-GKPI-------VGLHKDKPPSSAYKGSIKLISP 984

Query: 1072 DRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAIGTAYVQGEDVAAR 1127
                  W    TI +  +E  +TV+ + L   ++TK+    +  + IG+   + ED++A 
Sbjct: 985  FN----WSVIDTIELADNELGMTVKSMILDVGSSTKKFKHKKEFIVIGSGKYRMEDLSAN 1040

Query: 1128 GRVLLF 1133
            G   ++
Sbjct: 1041 GSFRIY 1046


>gi|9794906|gb|AAF98387.1| cleavage and polyadenylation specificity factor [Drosophila
           melanogaster]
          Length = 279

 Score = 97.4 bits (241), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 81/300 (27%), Positives = 141/300 (47%), Gaps = 65/300 (21%)

Query: 425 LGSRLGDSLLVQFTCGSGTSMLS------------SGLKEEFGDIEADAPSTKRLRRSSS 472
           LGSRLG+SLL+ FT    +++++              L++E  ++E +     +L  + +
Sbjct: 1   LGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDEDQNLE-EIFDVDQLEMAPT 59

Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD----- 527
            A    +  EEL +YGS +  +    + F F V DSL+N+ P+     G R+  +     
Sbjct: 60  QAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAPINYMCAGERVEFEEDGVT 119

Query: 528 ---------------ASATGISKQS---------NYELV---ELPGCKGIWTVYHKSSRG 560
                           +ATG SK           N +++   EL GC  +WTV+      
Sbjct: 120 LRPHAESLQDLKIELVAATGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVFD----- 174

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              D+++ ++ +D+ H ++++S    T+VL+T   + E+ E+  + V   TI  GNL  +
Sbjct: 175 ---DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQ 229

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           R ++QV  R  R+L G+ + Q++               S V+ VSIADPYV L + +G +
Sbjct: 230 RFIVQVTTRHVRLLQGTRLIQNVPIDVG----------SPVVQVSIADPYVCLRVLNGQV 279


>gi|452825139|gb|EME32137.1| cleavage and polyadenylation specificity factor subunit-like
           protein [Galdieria sulphuraria]
          Length = 1454

 Score = 96.7 bits (239), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 119/529 (22%), Positives = 200/529 (37%), Gaps = 91/529 (17%)

Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           KVDP+     VL+    ++++        ++   D+  +    +  +    +++LR L  
Sbjct: 166 KVDPEHGLIAVLIRKKNLLLI----AKYPILSHRDSLSAECSSNKLLSDPVILDLRRLGH 221

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
                F F+ GY  P + +L E+  TW+G  S    + ++S +    + K+   IW    
Sbjct: 222 FETIHFCFMFGYSLPTLALLEEKTPTWSGSFSVTRDSRLVSVVQFDLSDKKMKRIWQVEE 281

Query: 304 LPHDAYKLLAVPS-PIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFS 361
           LPH+ + + +VP    GG LV G N I Y    +    L+ N+      S   L      
Sbjct: 282 LPHECFMVSSVPFLQGGGFLVFGWNIILYFRDGSFVDGLSCNDLGDVYLSKWSLRSQDAP 341

Query: 362 VELDA--------AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
           + LD         +H T+++N V +L  + G    L +   G     + L      +  S
Sbjct: 342 ISLDGCEVVSEFDSHDTFMKNPVIIL--RDGAFFELCIPKKGG-DSVISLRYCKILIQPS 398

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            ++  GN L FLGS +  S L++    + T +                           D
Sbjct: 399 TVSYCGNGLIFLGSHVSPSALLEIIWKNSTEL-----------------------HPEDD 435

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            L+        S +G +SN     +   S   RDSL  IGP++D      I   +    +
Sbjct: 436 ELE--------SFFGKSSNKNFVVETIDS---RDSLFCIGPIQDLEVFDNIIGSSRKMEL 484

Query: 534 SK---QSNYELV---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
                  NY  V                L  C+ IW V  +   G    S  +       
Sbjct: 485 IAAVGSRNYGAVIIFRRTVSPSLLTSIRLEDCQQIWNVLCQRKMGERNGSVPL------- 537

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL- 634
              LI+S +  T+VL  +D + E+ +S  +    RT+    +   R +IQVF+ G RIL 
Sbjct: 538 ---LILSTQRNTIVLSVSDTIDELVDS-QFQTSSRTLWVSRVLHDRYIIQVFDEGLRILG 593

Query: 635 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
           +   +       P +           V    + DPYV+L +S   + +L
Sbjct: 594 NWDSLISLYELPPGD----------VVTQAFVCDPYVMLHLSSSYLVIL 632


>gi|159470707|ref|XP_001693498.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283001|gb|EDP08752.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 366

 Score = 96.3 bits (238), Expect = 9e-17,   Method: Composition-based stats.
 Identities = 69/217 (31%), Positives = 93/217 (42%), Gaps = 54/217 (24%)

Query: 921  SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILK 979
            + H G F++G+RP W +  R  L  H    +G + A T  HNVNC  GFI   S +G LK
Sbjct: 164  ANHSGVFVAGARPLWLVAGRGGLAAHAMWSEGPVAALTPFHNVNCPLGFITACSARGQLK 223

Query: 980  ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
            +C LP  +  D  W  ++V PLK TPH++ +F E      IV+    +P           
Sbjct: 224  VCCLPPHTRLDGAWATRRV-PLKVTPHRLAWFREAG----IVAAITSRPAPS-------- 270

Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
                             R    EE           GG             E AL ++ V 
Sbjct: 271  -----------------RPRPAEE----------PGG-------------EQALCLKFVY 290

Query: 1100 LFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136
            L N TT + +TLLA+GT    GED    GR+LL+S  
Sbjct: 291  LRNATTGDTDTLLAVGTGTPLGEDYPCLGRLLLYSVA 327



 Score = 56.6 bits (135), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 36/111 (32%), Positives = 57/111 (51%), Gaps = 7/111 (6%)

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL------SFGPSNSESGSGSE 657
           +Y     TIAAGNLF    ++Q    G R+L+G  + QDL      + G   + S  G  
Sbjct: 4   EYITDQPTIAAGNLFHNAVIVQACPGGVRLLEGMSLVQDLPLSELQALGGVAAASRPGVA 63

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPA-AIESSKKPV 707
             T+  + +ADPYVL+ +S+G+  LL  D  + T+ +   A  +   ++PV
Sbjct: 64  PPTITHMQVADPYVLVSLSNGTACLLEADLLSMTLGLGGCAGGVPDCERPV 114


>gi|301093651|ref|XP_002997671.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110061|gb|EEY68113.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 478

 Score = 96.3 bits (238), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 101/387 (26%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 106 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 165

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
            + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A     
Sbjct: 166 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 225

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
           + +L     +V L      +LQ    LL+   G + +L++ Y+    + L          
Sbjct: 226 TAQL-----NVVLYDCQFEYLQEKELLLTMPCGQVYVLSLPYEDTSSRGL---------- 270

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE----------ADA 461
                  G    F+GSR GDS+L           L +  +EE  D E            A
Sbjct: 271 ---YGFGGKQTLFIGSRSGDSVLFVLD----KKKLVTATEEEPKDEEMPIKEVVIKQESA 323

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--------------------- 500
           P  K     S  A ++  + ++L LYG+A    E A  +                     
Sbjct: 324 PEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAP 378

Query: 501 ------FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV---------- 542
                 + + +R  D L +IG +     G+  NAD      S +   ELV          
Sbjct: 379 EQDTGPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGA 432

Query: 543 ----------------ELPGCKGIWTV 553
                           EL GC+ +WTV
Sbjct: 433 ISVLHNGLRPIVGTEAELNGCRAMWTV 459


>gi|301103688|ref|XP_002900930.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
 gi|262101685|gb|EEY59737.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
          Length = 613

 Score = 94.4 bits (233), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 95/383 (24%), Positives = 159/383 (41%), Gaps = 93/383 (24%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 241 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 300

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
            + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A     
Sbjct: 301 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 360

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
           + +L     +V L      +LQ    LL+  +G + +L++ Y+    + L          
Sbjct: 361 TVQL-----NVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSRGL---------- 405

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI------EADAPSTK 465
                  G    F+GSR GDS+L         +      K+E   I      +  AP  K
Sbjct: 406 ---YGFGGKQTLFIGSRSGDSVLFVLDKKKLVTATEEEPKDEEMPIKEVVIKQESAPEIK 462

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------------------------- 500
                S  A ++  + ++L LYG+A    E A  +                         
Sbjct: 463 -----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAPEQDT 517

Query: 501 --FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV-------------- 542
             + + +R  D L +IG +     G+  NAD      S +   ELV              
Sbjct: 518 GPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGAISVL 571

Query: 543 ------------ELPGCKGIWTV 553
                       EL GC+ +WTV
Sbjct: 572 HNGLRPIVGTEAELNGCRAMWTV 594


>gi|443894082|dbj|GAC71432.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
           [Pseudozyma antarctica T-34]
          Length = 1543

 Score = 94.4 bits (233), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 101/406 (24%), Positives = 170/406 (41%), Gaps = 58/406 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            LV    +++ IY V  +      +  S  T      D     +L +   + L G V  L
Sbjct: 46  QLVTARDDLLTIYDVYDRSSSQSAASTSNGTANGTAGDAKPRHTLIVTRRHSLFGTVTGL 105

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +    +D   R   ++++F DAK+++LE++D+   L   S+H +E      L  G   
Sbjct: 106 QRVDTLASDKDARH-RLLVSFADAKLALLEWNDTTDDLETVSIHTYERAT--QLLNGTPP 162

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS--------- 227
             R P + VDP  RC  +L+    + IL   +  +     E  F  G GF          
Sbjct: 163 LFR-PNLNVDPLSRCAALLLPHDALAILPFYRDNA-----EFDFDDGLGFDLANDALDAS 216

Query: 228 --------ARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
                   A +ES     S V+ +R++D  ++++KDF F+ G+ +P + +L +   TW G
Sbjct: 217 DAAAMAAAAHMESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFDHSPTWTG 276

Query: 273 RVSWKHHTCMIS--ALSISTTL------------------KQHPLIWSAMNLPHDAYKLL 312
            ++ +  +  +    L +S +L                    HP++ ++  LP+D   +L
Sbjct: 277 LLTHRKDSFAVYLFTLDLSASLDGATLGSAAALLDDGNMRSAHPVVTTSSQLPYDCLYML 336

Query: 313 AVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV----ELDAAH 368
             P  +GGVLVV  + I +  QS    +   N      S+ E P S   V    +L A+ 
Sbjct: 337 PCPQSLGGVLVVCMSAILHVDQSGRVVVTALNRWFKTTSAIE-PESVLDVPGLADLQASQ 395

Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
             +  +  A+LS   GDL  L    DGR V+   L + +     SD
Sbjct: 396 LVFTTDTDAVLSLSNGDLYRLRCHMDGRSVEGFRLERIDQLTAGSD 441


>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum
           CBS 127.97]
          Length = 1267

 Score = 94.4 bits (233), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 133/579 (22%), Positives = 232/579 (40%), Gaps = 65/579 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  +    +  R    D    A L L   Y + G +  L
Sbjct: 28  NLIVVKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
             +      NS+   D+I+++  +AK+S++E+D   HG+   S+H +E  E  H+     
Sbjct: 85  QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVP 140

Query: 176 SFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------D 218
                P  + VDP G C  +  +G+  + IL   Q G  LV D+               D
Sbjct: 141 DLGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSD 199

Query: 219 TFGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
              S  G +A  +    S V+ +  LD  + H     F+H Y EP   IL+ +       
Sbjct: 200 AQKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSL 259

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
              +      S  ++    +    + +   LP D +K++ +P P+GG L++G N  +H  
Sbjct: 260 TIDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVD 319

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +     +S   + L+      L +     LL    G + +L+
Sbjct: 320 QAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILS 379

Query: 391 VVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
              DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+ ++ 
Sbjct: 380 FKVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSR 435

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES--- 496
            S T+   S  K   G  E+ A           D   D +  ++L     AS   E    
Sbjct: 436 PSSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASIRQEKQVV 493

Query: 497 ---AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI--- 550
              +   F+F   D L ++GP +D + G    + +     S  +    +EL   +G    
Sbjct: 494 NGDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKS 553

Query: 551 --WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
              TV  +    +  DS +M   DD Y  + I  ++ ++
Sbjct: 554 GGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKS 589



 Score = 69.7 bits (169), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 52/225 (23%), Positives = 101/225 (44%), Gaps = 19/225 (8%)

Query: 911  CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
            C+ +    ++ G++  F+SG  PC+ ++     R H     G  V + +  H   C  GF
Sbjct: 751  CKLLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 809

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPL 1029
             YV    ++++ +LPS + +D+ W  +K I L      I Y +    Y +  S       
Sbjct: 810  AYVDEDNVIRMSRLPSNTRFDSGWATRK-IALGEQVDSIVYSSASECYVIGTSA------ 862

Query: 1030 NQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSS 1089
             +   L  D E   +  N  ++ +       +E   V++LEP      W T  +  ++ +
Sbjct: 863  KEDFKLPEDDESHTEWRNEFITFLP-----QLERGTVKLLEPKN----WSTIDSHELKPA 913

Query: 1090 ENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            E    + V+ L  +  T E + ++ +G++ V+GED+  +G + +F
Sbjct: 914  ERITCIEVIRLEISELTHERKDMVVVGSSIVKGEDIVPKGFIRVF 958


>gi|298715583|emb|CBJ28136.1| cleavage and polyadenylation specificity factor CG10110-PA
           [Ectocarpus siliculosus]
          Length = 1906

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 93/303 (30%), Positives = 136/303 (44%), Gaps = 77/303 (25%)

Query: 215 GDEDTFGSGGGFSARIESSHVINLR-------DLDMKHVKDFI----FVHGYIEPVMVIL 263
           G+ED  G G G +A+ +     NL        DL+   +  FI    F+ G+ EP + +L
Sbjct: 265 GEEDG-GLGNGATAKGDGGAGGNLAVSKPFTIDLEEAGITGFIKAAAFLEGFHEPALALL 323

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
           +E   T AGR++ K  TC ++ LSI+ T  + P+IW   NLPHD++ L+ VPSPIGG+ V
Sbjct: 324 YEPIQTCAGRLASKRSTCRLALLSINLTQGRAPVIWQVENLPHDSWDLVPVPSPIGGLQV 383

Query: 324 VGANTIHYHSQS-ASCALALNNYA-VSLDSS-QELP------------------------ 356
           +  N + + +QS     LA+N YA  ++D +  E P                        
Sbjct: 384 ISTNAVMHVNQSEVRSILAVNGYARATVDPALLECPLRGGDSDWGWTSFRRSHPEREVVD 443

Query: 357 RSSFSV--ELDAAHATWLQNDVALLSTKTGD-----LVLLTVVY---------------- 393
            SS+ V  ELD     +L     LLS +TG+     L L TV                  
Sbjct: 444 LSSYDVCIELDVVRCAFLTPTSMLLSLRTGEVYALRLHLTTVTAAAADAAGCSRPPGGAA 503

Query: 394 ---DGRVVQR--LDLSKTNP-SVLT---------SDITTIGNSLFFLGSRLGDSLLVQFT 438
                RVV +    + + +P SVL             +     L F+GSR+GDSLLV ++
Sbjct: 504 FGTPNRVVGQSMRPVGRASPCSVLAVAASGGSGGDGGSGASKGLVFMGSRVGDSLLVDYS 563

Query: 439 CGS 441
             S
Sbjct: 564 VAS 566



 Score = 45.4 bits (106), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 50/225 (22%), Positives = 91/225 (40%), Gaps = 60/225 (26%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +  Y+ +H PTG+ +   G +T + +                            +LVV  
Sbjct: 7   YTCYRQLHPPTGVDHAVFGSVTAAGSR---------------------------DLVVAK 39

Query: 63  ANVIEIYVVRVQEEGS----------KESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           A+ +E+Y V   +  S          +++ N  E       D  S   LEL   + L GN
Sbjct: 40  ASTLELYRVHRDDHSSTAAAAAAAAARDTSNGDERDD----DDASGYYLELAGTFPLAGN 95

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           + +LA++           D ++++F  AK++++ +D  +  L   S+H F++        
Sbjct: 96  ITALAVIP----------DILVVSFGVAKMALVAYDSVLGRLETISIHNFDAGAIGPGAG 145

Query: 173 GRES-FARGPLVK--------VDPQGRCGGVLVYGLQMIILKASQ 208
           G ES +     +K         DP GRC   +V G Q+++L A +
Sbjct: 146 GVESGYGLAAALKDRPRTISSSDPAGRCLAAVVAGCQLVVLPARR 190


>gi|402085944|gb|EJT80842.1| cft-1 [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 1450

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 157/686 (22%), Positives = 262/686 (38%), Gaps = 126/686 (18%)

Query: 91  VLMDGISAASLELVCHYRLHGNVESLA------ILSQGGADNSRRRDSIILAFEDAKISV 144
           V  D  S   + L+  + L G V  LA      +   GG   S   D +++AF+DAK+S+
Sbjct: 86  VRSDRASHTKIVLIAEFPLSGTVTGLARVKPPNVSKTGGG--SGVGDLLLIAFKDAKLSL 143

Query: 145 LEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGL 199
           + +D     L   S+H +E  E     W        +F     +  DP  RC  +     
Sbjct: 144 VAWDSERRSLETFSIHYYEQDELQGNPWECPLSDYANF-----LVADPGSRCAALKFGPR 198

Query: 200 QMIILKASQGGSGL-VGDEDTFGSGG------------GFSARIES-----SHVINLRDL 241
            + IL   Q    + +GD D    G               ++ IE      S V+ L +L
Sbjct: 199 SLAILPFKQADEDIGMGDWDEALDGPRPAQSQSAAVAINGTSTIEDTPYSPSFVLRLPNL 258

Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
           D  + H     F++ Y EP   IL    +T +  +  K H    +  ++    K    I 
Sbjct: 259 DPALLHPVHLAFLYEYREPTFGILSS-SITPSNCLDRKDH-LTYTVFTLDLQQKASTTIL 316

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRS 358
           S   LP D  +++A+P+P+GG L+VGAN  IH      +  +A+N +     S      S
Sbjct: 317 SVGGLPKDLTRVIALPAPVGGALLVGANELIHIDQSGKANGVAVNPFTKQCTSFGLADHS 376

Query: 359 SFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT 412
             ++ L+      L  ++   L+    G L  +T   DGR V  L +    P    ++L 
Sbjct: 377 DLNLRLEGCTIEVLSAEHGELLVVLDDGRLATITFHIDGRTVSGLKVRIIPPEAGGNILP 436

Query: 413 SDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
           + ++    IG +  F GS  GDS+++      G +  SS +  +   ++        L  
Sbjct: 437 TSVSCLSRIGRNAMFAGSERGDSIVI------GWNRKSSQVSRKKSRVQ-----DPDLDL 485

Query: 470 SSSDALQDMVNGEELSLYGSASNNT--------ESAQKTFSFAVRDSLVNIGPLKDFSYG 521
                  +    ++  LYG     T        ++  +   F   D L++I P++D +YG
Sbjct: 486 DIDFDDLEDDEDDDDDLYGDTEKTTTVAGLASGQAKLEDLVFRCHDRLISIAPIRDMAYG 545

Query: 522 LRINADASATGISK----QSNYELV--------------------------ELPGCKGIW 551
                    TG       QS  +LV                          + P  +G+W
Sbjct: 546 KPPPPAEGETGSRNSTPIQSELQLVAVVGRDRASSLAIMNREMTPVSIGRFDFPEARGLW 605

Query: 552 TVY-----------HKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETAD 594
           T+             K ++    D      YD     +++++       E+  + + TA 
Sbjct: 606 TLACQKPLPKVLQGEKGTKPVGGDFGVPVQYDK----FMVVAKEDDDNFESSNIYVLTAA 661

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
              ++  +      G TI AG +    ++IQV +   R  DG   +TQ +   P   E  
Sbjct: 662 GFEKLVGTEFEPAAGFTIEAGTMGNHTKIIQVLKSEVRCYDGDLGLTQII---PMLDEET 718

Query: 654 SGSENSTVLSVSIADPYVLLGMSDGS 679
           +    +T  S SIADPY+L+   D S
Sbjct: 719 NHEPRAT--SASIADPYLLIIRDDSS 742



 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 61/275 (22%), Positives = 113/275 (41%), Gaps = 23/275 (8%)

Query: 867  SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-----GAPCQRITIFK--N 919
            S+D ++      ++  S S    LRF + P  A  + +         AP +R+ +    N
Sbjct: 871  SNDDLTIYEPFKIAESSQSLSGTLRFRKLPNPAVAKSQDTKVSDDAPAPMRRMPLRACGN 930

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            I+G+   FL G  P + +   +       L    + A +  H   C+ GFIY   +G+ +
Sbjct: 931  IAGYSCVFLPGHSPSFLIKSSKSTPRVIGLQGPGVRAMSPFHTKGCDRGFIYADYEGVAR 990

Query: 980  ICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
            + Q+P+  ++       K +PL      I Y     +Y  +V+    +P      L  D 
Sbjct: 991  VAQIPNDCSFAELGLSVKKVPLNMDADGIAYHTPSGVY--VVTCSFWEPFE----LPSDD 1044

Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
            E   +    N++          E   ++++ P      W    T     +E A+ ++ + 
Sbjct: 1045 ESHREWAKENITF-----KPQTEHSVLKVINPVN----WSEIWTEEFDKNEVAMCIKSLN 1095

Query: 1100 L-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            L  + +T E   L+ +GTA  +GED+  RG V ++
Sbjct: 1096 LEVSQSTNERRHLITVGTAICKGEDLPVRGCVYVY 1130


>gi|448105510|ref|XP_004200513.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|448108635|ref|XP_004201144.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|359381935|emb|CCE80772.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|359382700|emb|CCE80007.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
          Length = 1344

 Score = 93.6 bits (231), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 105/502 (20%), Positives = 196/502 (39%), Gaps = 83/502 (16%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL- 116
           LVV  + +++++ +    + SKE K                  L+LV  ++LHG +  L 
Sbjct: 29  LVVGKSTLLQVFDIVQSNKKSKEYK------------------LKLVEQFKLHGLITDLK 70

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A+ +    D     D ++++ + AK+S++++D   + +   S+H +E+          E 
Sbjct: 71  AVRTVENPD----LDYLLVSTKSAKMSLVKWDHHENSISTVSLHYYENSIQ---SSTYEK 123

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMII-------------LKASQGGSGLVGDEDTFGSG 223
                L+ ++P   C  +    L   +              +   G SG  G  D   + 
Sbjct: 124 LTTTELI-MEPNNTCACLRFKNLLTFLPFEMPDEDDEEDGYENVDGASGSRGKHDNKATQ 182

Query: 224 GGFS-ARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
              + A   SS VI+ ++LD +  +V D  F++ Y EP + I+  +  TW G +      
Sbjct: 183 QDENQALFYSSFVIDAQNLDSRIGNVIDMKFLYNYKEPTLAIISSKNHTWTGLLPLTKDN 242

Query: 281 CMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSASCA 339
                LS+    K    +    NLP D   ++ +P P+ G L++G N  IH      +  
Sbjct: 243 ISFIVLSLDLVTKTSTTVLKIDNLPFDIDTIVPLPKPLNGTLLIGCNEIIHVDHGGITRR 302

Query: 340 LALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGR 396
           LA+N +  S+ SS +  R  S  +++L+      + ND    L  K GD   +    DG+
Sbjct: 303 LAVNQFTSSITSSIKNYRDQSELNLKLENCCVKPIPNDHRVFLILKNGDFYYINFAIDGK 362

Query: 397 VVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
            ++   L K N             D+  + N+L F+ ++ G+S L++             
Sbjct: 363 TIKNFYLEKVNSINQNEIGISYPEDVVHLDNNLMFICNKNGNSPLIELKF---------- 412

Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES----------AQK 499
                    +++   +   +     +QD  NG          ++                
Sbjct: 413 ---------SESKDNQNAEQQKDTEMQDTENGTTDKNDNDDDDDIYEDDEDNEKVLIKNS 463

Query: 500 TFSFAVRDSLVNIGPLKDFSYG 521
              F   D L+N GP+  F++G
Sbjct: 464 VIEFTKHDELINNGPVSSFTFG 485



 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 41/196 (20%), Positives = 81/196 (41%), Gaps = 28/196 (14%)

Query: 824  VVELAMQRWSAHHSRPFLFAILT-DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
            +  +        HS+     ILT  G ++ Y+ + F+G                    N 
Sbjct: 774  IKNIVFNELGDEHSKDEYLTILTIGGEVIIYKLF-FDG-------------------DNF 813

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMVFRE 941
               + ++L+ +  P +AY     P G   +R  ++  N++G+   F++G  P +      
Sbjct: 814  KFIKEKDLKITGAPDNAY-----PLGTTLERRLVYVPNVNGYSSIFVTGIIPYFITKTVH 868

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPL 1001
             +    +      V+F+   + N  +GFIY+ +    ++C++P    Y+N WP++K I +
Sbjct: 869  SVPRIFRFTKLPAVSFSSYSDSNIKNGFIYLDNSKNARMCEIPLDFNYENNWPIKK-IQM 927

Query: 1002 KATPHQITYFAEKNLY 1017
              T   I Y    N +
Sbjct: 928  PETVKAIAYHELSNTF 943


>gi|294659889|ref|XP_462318.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
 gi|218511978|sp|Q6BHK3.2|CFT1_DEBHA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|199434312|emb|CAG90824.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
          Length = 1342

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 91/415 (21%), Positives = 178/415 (42%), Gaps = 64/415 (15%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           L+V  A V++++ +   E  +++ K                  L+LV  ++LHG +  + 
Sbjct: 29  LIVGKATVLQVFEIITTETKTQQYK------------------LKLVEQFKLHGLITDIK 70

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +     +NS+  D ++++ + AK+S++++D  ++ +   S+H +E+          E  
Sbjct: 71  AIRT--VENSQL-DYLLVSSKGAKMSLIKWDHHLNSISTVSLHYYENSIQ---SSTYEKL 124

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMII-----------------LKASQGGSGLVGDEDTF 220
               LV V+P   C  +    L   +                 +  S G      +++  
Sbjct: 125 TTTDLV-VEPNNNCTCLRFKNLLTFLPFETLDEEEEDDDDDEEMNGSSGSDKKATNKENG 183

Query: 221 GSGGG-FSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
            S G   S   ESS +I+ R LD +   + D  F++ Y EP + I+  +   WAG +   
Sbjct: 184 NSNGEEVSELFESSFMIDGRTLDSRIGDIIDMQFLYNYREPTIAIIFSKAHAWAGNLPKV 243

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSA 336
                   LS+    K    +    NLP D  K++ +P P+ G L++G N  IH  +   
Sbjct: 244 KDNINFIVLSLDLVTKASTTVLKIDNLPFDIDKIIPLPQPLNGSLLMGCNEIIHVDNGGI 303

Query: 337 SCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVY 393
           +  LALN +  S+ +S +    +S  +++L+      + ND   L+    GD   +    
Sbjct: 304 TRRLALNQFTSSITTSLKNYHDQSDLNLKLENCSVKPIPNDNKVLMILNNGDFYYINFKI 363

Query: 394 DGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           DG+ +++            D+  T P     +I T+ N+L F+ ++ G++ L++ 
Sbjct: 364 DGKTIKKFFVEKVSDLNYDDIQLTYP----GEIATLDNNLMFISNKNGNNPLLEL 414



 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 58/259 (22%), Positives = 114/259 (44%), Gaps = 29/259 (11%)

Query: 881  NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
            N    + ++L  +  P +AY+   T      +R+  F N++G    F++G  P +     
Sbjct: 810  NFKLVKEKDLIITGAPDNAYSLGTTIE----RRLVYFPNVNGFTSIFVTGITPYYISKTT 865

Query: 941  ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIP 1000
              +    +      V+F    +    +G IY+ +    +IC++P    Y+N WP++K IP
Sbjct: 866  HSVPRIFKFTKLPAVSFAPYSDDKIKNGLIYLDNSKNARICEIPVDFNYENNWPIKK-IP 924

Query: 1001 LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS--SVDLHRT 1058
            +K +   +TY    N +       V+    ++    +D+E G  I   + S  S + ++ 
Sbjct: 925  IKESIKSVTYHELSNTF-------VISTYEEIPYDCLDEE-GKPIVGVDKSKPSANSYKG 976

Query: 1059 YTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETLLAI 1114
            Y      ++++ P      W    TI +   E  + V+ + L   ++TK+    + L+ I
Sbjct: 977  Y------IKLISPYN----WSVIDTIELVDGEIGMNVQSMVLDVGSSTKKFKNKKELIVI 1026

Query: 1115 GTAYVQGEDVAARGRVLLF 1133
            GT   + ED++A G   +F
Sbjct: 1027 GTGKYRMEDLSANGSFKIF 1045


>gi|254564833|ref|XP_002489527.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
           [Komagataella pastoris GS115]
 gi|238029323|emb|CAY67246.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
           [Komagataella pastoris GS115]
 gi|328349950|emb|CCA36350.1| Protein cft1 [Komagataella pastoris CBS 7435]
          Length = 1388

 Score = 90.1 bits (222), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 105/444 (23%), Positives = 177/444 (39%), Gaps = 51/444 (11%)

Query: 93  MDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
           +D      L LV  Y+L G V  L  +           D +++A +  K S++++D S +
Sbjct: 80  IDFSQNVKLSLVAEYKLDGLVTDLCKIR---TIEDSHHDYVLVATKGVKFSMIKWDQSSN 136

Query: 153 GLRITSMHCFESPEWLHLKRGRES-----FARGPLVKVDPQGRCGGVLVYGLQMIILKAS 207
            +   S+H        H K+  E+     F     +  DP   C  +L   + +  L   
Sbjct: 137 SISTVSLH--------HYKKIVENSLIDKFNVDTKLIADPNNHCSCLLANEI-LFFLPFL 187

Query: 208 QGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
           Q       DE+  G          ++ +    DL  ++K + D  F+HGY EP + +L+ 
Sbjct: 188 QHEV----DEELDGKFVENKKLYSNTFLQFSNDLQPNIKTIIDIEFLHGYSEPTLAVLYT 243

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
              T  G +     T  +   S++   K    I    NLP+D  ++L + SP+ G L++G
Sbjct: 244 SFPTCTGALPKAKDTVSLQVFSLNLQNKASTSIIEVNNLPYDTDRILPLSSPLNGCLLIG 303

Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTG 384
           AN  IH +S   +  ++ N +A    + +   +S+  + L+      + ND  +L T+ G
Sbjct: 304 ANQIIHLNSMGTAKGISCNLFAAKCSNFKLSDQSNLDLRLEKCVLGQVYNDKVILITEKG 363

Query: 385 DLVLLTVVYDGRV-----VQRLDLSKTNPSVLT--SDITTIGNSLFFLGSRLGDSLLVQF 437
                +    G V     +Q++   K    VL+  +  T I    FF+G +  DS+L   
Sbjct: 364 AFYAFSFDIVGGVSSINEIQKIAAEKYQGLVLSLPTMFTNIDGKTFFIGCQGSDSVLF-- 421

Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
                      G K        D     ++  +  DAL       E  LY     N    
Sbjct: 422 -----------GSKARLNTQNVDVNGKSKV-ITEEDALY------EEDLYADDIQNVAQG 463

Query: 498 QKTFSFAVRDSLVNIGPLKDFSYG 521
                F   DSL+NIGP+ +F+ G
Sbjct: 464 IDHIDFVKLDSLLNIGPITNFTTG 487


>gi|343425828|emb|CBQ69361.1| related to cleavage and polyadenylation specificity factor, 160 kDa
           subunit [Sporisorium reilianum SRZ2]
          Length = 1567

 Score = 89.0 bits (219), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 85/349 (24%), Positives = 158/349 (45%), Gaps = 48/349 (13%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  + L G V  L  + Q  A +   RD ++++F+DAK+++LE++D    L   S+H
Sbjct: 92  LVLVRRHTLFGVVTGLQRV-QTLATDKDARDCLLVSFKDAKLALLEWNDLTDDLETVSIH 150

Query: 161 CFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-------KASQGGSG 212
            +E +P+ L+   G  +    P++ VDP  RC  +L+    + +L               
Sbjct: 151 TYERAPQLLN---GTPNLFH-PILNVDPLSRCAALLLPHDALAVLPFYRDAADFDFDLDD 206

Query: 213 LVGDEDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
            +       +    +A +E+     S V+ +R++D  ++++KDF F+ G+ +P + +L  
Sbjct: 207 RLDLAKDDAAAVAAAAEMETLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFS 266

Query: 266 RELTWAGRVSWKHHTCMI-------------------SALSISTTLKQHPLIWSAMNLPH 306
              TW G ++ +  T  +                    AL   T    HP++ ++  LP+
Sbjct: 267 HTPTWTGLLAERKDTFSVYLFTLDLSASLDGTLSSAADALDDGTVRSAHPVVTTSTALPY 326

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSF 360
           D   +++ P  +GGVLVV  +++ +  QS    + ALN +     A+  +S  +LP    
Sbjct: 327 DCLYMVSCPQTLGGVLVVCMSSVLHVDQSGRVVVTALNGWFKTISAIEPESVLDLPEIP- 385

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             +L  +   +      +L+   GDL       DGR V+   L + + S
Sbjct: 386 --DLQGSQLVFTAETAGVLALVDGDLYRFRCQMDGRSVEGFRLERMDQS 432


>gi|402913617|ref|XP_003919276.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Papio anubis]
          Length = 132

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 48/121 (39%), Positives = 72/121 (59%), Gaps = 12/121 (9%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LEL   +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 18  LELAASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 73

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVG 215
            FE PE   L+ G       P V+VDP GRC  +LVYG ++++L       ++   GLVG
Sbjct: 74  YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVG 130

Query: 216 D 216
           +
Sbjct: 131 E 131


>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1283

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 143/677 (21%), Positives = 261/677 (38%), Gaps = 123/677 (18%)

Query: 501  FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYELVELPG----------- 546
            + F + D L N+GP++D + G      + D      S  +N ELV   G           
Sbjct: 376  YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQPVSSILANLELVTTQGYGKAGGLAILR 435

Query: 547  ---------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                             G  +VY K  +  +   S        Y  YL++S      + +
Sbjct: 436  REIDPFVIDSLMIKDTDGARSVYVKDPKLPSQSGSLPLNPGSNYDHYLLLSKSKGLDKEK 495

Query: 587  TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLS 644
            ++V   +    E T++ ++   + RTI  G L    RV+QV +   R  D G  + Q   
Sbjct: 496  SVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLASGTRVVQVLKGEVRSYDSGLGLAQIFP 555

Query: 645  FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
                +      SE  +V+  S ADPYVL+   D SI LL  D S      +T   I S+ 
Sbjct: 556  VWDEDM-----SEEKSVVHTSFADPYVLIIRDDQSILLLQADDSGDLDEAETDGIINSTT 610

Query: 705  KPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP-LDQGD-IYSVVCYESGA 762
                S +LY DK                     +    +G P + Q D +   +      
Sbjct: 611  --WISGSLYQDKY-------------------RSFKSHEGPPNMKQSDNVLLFLLSSESK 649

Query: 763  LEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
            L +F +PN    VFT +                   D   +I S+         +E I  
Sbjct: 650  LYVFHLPNAREPVFTTESI-----------------DLLPQILSTEPPPRRVTYRETITE 692

Query: 822  MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
            + V +L      +    P+L    ++  ++ Y+ Y +             ST R  S   
Sbjct: 693  LLVADLG----DSVSRSPYLILRSSNSDLILYEPYHYTS-----------STERQFS--G 735

Query: 882  VSASRLRNLRFSRTPLDAYTREETPHGAPCQRIT----IFKNISGHQGFFLSGSRPCWCM 937
            +   ++ N  F ++  ++   +   H A C  I+    +  ++ G++  F+ G+ PC+ +
Sbjct: 736  LRFVKIANHHFPKSHSESNAGK---HPANCTAISKPLRVLGDVCGYRTVFMPGNSPCFII 792

Query: 938  VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
                 +     L   ++ + +  +   C  GF+YV +  ++++C+ P  + +D  W  +K
Sbjct: 793  KSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARK 852

Query: 998  VIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHR 1057
             I L      + Y +    Y +  +  V        +L  D E+  +  N  +S +    
Sbjct: 853  -IGLGEQVDAVEYSSSSETYVIGTNQKV------DFNLPEDDEIHPEWRNEVISFLP--- 902

Query: 1058 TYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGT 1116
               +++  V++L P      W    +  ++++E  + V+ + L  +  T E +  + +GT
Sbjct: 903  --QIDKGSVKLLTPRT----WSIIDSYNLRTAERIMCVKCLNLEVSEITHERKDTIVVGT 956

Query: 1117 AYVQGEDVAARGRVLLF 1133
            A  +GED+AARG + +F
Sbjct: 957  ALTKGEDIAARGCIYIF 973



 Score = 47.4 bits (111), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 45/180 (25%), Positives = 84/180 (46%), Gaps = 21/180 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V    +++++ +     GS   +   +T+ +          L LV  Y L G +  L
Sbjct: 28  NLIVAKTTLLQVFNLVNVVYGSGPGQPDEKTRSQY-------TKLVLVAEYALSGTITDL 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   H +  TS+H +E  + +++     +
Sbjct: 81  GRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIHYYERDD-VNISPWTPN 137

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIESSH 234
            A  P  + VDP  RC  VL +G + + IL   Q G  LV D+        F + +E  H
Sbjct: 138 LASCPSYLTVDPNSRC-AVLNFGKKNLAILPFHQVGDDLVMDD--------FDSDVEEQH 188


>gi|354547787|emb|CCE44522.1| hypothetical protein CPAR2_403250 [Candida parapsilosis]
          Length = 1334

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 98/458 (21%), Positives = 181/458 (39%), Gaps = 64/458 (13%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L+LV  ++L G V  L  L       + + D II++ + AK S+++++  +H +   S+H
Sbjct: 57  LKLVEQFKLQGTVTGLKPLR---TSENPQLDYIIVSTKYAKFSIIKWNHQLHSISTVSLH 113

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
            +E+          E  A   L+ V+P       L Y   +  L                
Sbjct: 114 YYEN---CIQHSTFEKLAISDLI-VEPTYSSVSCLRYKNLLCFLPFEGVNDHDDDDDDDD 169

Query: 205 -----KASQG-GSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
                   +G    + G + + G+        +SS +I+   L+  +  V D  F+H Y 
Sbjct: 170 DDDDTDDEKGVAENVAGVDKSNGASNDNQPFYDSSFIIDAGTLESSVDSVLDLQFLHHYQ 229

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           E  + IL  +  +WAG +           +++    K    +++  NLP+D  +++ +  
Sbjct: 230 ETTIAILSSKSNSWAGNLIKNKDNVQFQVMTLDIQSKSTLPVFTIDNLPYDIDRIIPLSK 289

Query: 317 PIGGVLVVGAN-TIHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATW 371
           P+ G L++G N  IH  +   +  +A+N +      S+ S Q+    +  +E D +    
Sbjct: 290 PLNGCLLLGCNEIIHVDNGGIAKRIAVNAFTSLITASVKSYQDESELNLKLE-DCSIVPI 348

Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFF 424
            ++   LL   TG+   L    DG+ ++R+ L               + ++ T+ N+L F
Sbjct: 349 PEDHRVLLILATGEFYFLNFELDGKSIKRIHLEAVEQKAYDAIKLTYSGEVATLDNNLLF 408

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
             +  GDS LV+    S   +           +E      K       D   +   GEE 
Sbjct: 409 FANMNGDSPLVEIKYSSSAKV-----------VEKQVLDKKEEDSDEEDLYNEDEEGEEQ 457

Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
            +   +            F + DSL+N GP+  F+ GL
Sbjct: 458 KVMRKSH---------IEFKLHDSLINNGPVSSFTLGL 486


>gi|71021721|ref|XP_761091.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
 gi|46100541|gb|EAK85774.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
          Length = 1597

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 95/399 (23%), Positives = 177/399 (44%), Gaps = 54/399 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRLHG 111
            LV    +V+ IY V  Q   S       S+++  +         S  +L +  ++ L G
Sbjct: 46  QLVTARDDVLTIYDVYGQPHASASTIPGISRHTATSSVSSNTSACSHKNLVISRNHTLFG 105

Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHL 170
            V  L  + Q  A +   RD ++++F+DAK+++LE++D+I  L   S+H +E +P+ L+L
Sbjct: 106 AVTGLQRV-QTLASDKDNRDRLLVSFKDAKLALLEWNDAIDDLETISIHTYERAPQLLNL 164

Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-------DTFGSG 223
                     P++ VDP  RC  +L+    + IL   +  +    D            + 
Sbjct: 165 A----PHLFHPILNVDPLSRCAALLLPHDSLAILPFYRDAADFDFDLDDHLEIAKDDVAA 220

Query: 224 GGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
              +A ++S     S V+ +R++D  ++++K F F+ G+ +P + +L     TW G +S 
Sbjct: 221 VVAAADLQSLPYSPSFVLTMREVDPKIRNLKHFCFLPGFQKPTVAVLFSHNPTWTGLLSE 280

Query: 277 KHHTCMI--------------------SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           +  T  +                     AL   T    HP++ ++  LP+D   ++A P 
Sbjct: 281 RKDTFSVYLFTLDLSASLDGATFSSSAEALDDGTARSAHPVVTTSTPLPYDCLYMVACPQ 340

Query: 317 PIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSFSVELDAAHAT 370
            +GGV+VV  +++ +  QS    + ALN +     A+  +S  EL   S   +L  +   
Sbjct: 341 TLGGVIVVCMSSLLHVDQSGRVMVTALNQWFKTTSAIEPESILEL---SDIADLQGSQLV 397

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
           +      +L+   G++       DGR V+ + L +   S
Sbjct: 398 FTSKTQGVLTLVNGEIYRFRCQTDGRSVEGIRLERMQES 436


>gi|260941626|ref|XP_002614979.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
 gi|238851402|gb|EEQ40866.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
          Length = 1363

 Score = 87.0 bits (214), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 60/220 (27%), Positives = 110/220 (50%), Gaps = 15/220 (6%)

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
           SS ++    LD K   + D  F+H Y +P + +L +++ TWAG +       + S LS+ 
Sbjct: 224 SSFILEASALDNKIGDIIDLQFLHHYRQPTIAVLSQQKSTWAGLLPQTKDNVIFSVLSLD 283

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVS 348
              +    +    NLP+D  K++A+PSP+ G L++G N  IH  +   +  +A+N Y   
Sbjct: 284 MQTRLTTTVLQIENLPYDLEKIIALPSPLNGSLLIGCNELIHVDTGGITRRIAVNQYTED 343

Query: 349 LDSSQE--LPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVYDGRVVQRLDLSK 405
           + +S +    ++S  ++L+      + ND  LL   +TG++  +    DG+ ++R+ + +
Sbjct: 344 ITASLKNYADQTSLDLKLEDCSILPIPNDNKLLMVLRTGEMYFIVFEVDGKTIKRMSVEE 403

Query: 406 TNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLVQF 437
             PS   S I         ++ N+L FL  R  +S LV+ 
Sbjct: 404 I-PSETYSQIKLMDPSSFASLDNNLLFLTGRSSNSHLVEL 442



 Score = 47.0 bits (110), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 34/116 (29%), Positives = 55/116 (47%), Gaps = 16/116 (13%)

Query: 889 NLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
           +L  +  P +AY+     HG   +R  I F ++SG     ++G  P   M+ R R   H 
Sbjct: 837 DLPITGAPFNAYS-----HGTSIERRMIYFPDVSGTTCIMVTGVIPY--MITRSR---HS 886

Query: 948 QL-----CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           Q+         IV+F         +G IY+ ++   +I +LPS  +YD  WP++KV
Sbjct: 887 QVKVFKFSKIPIVSFVPFSTDKIKNGLIYLDTKKNARIVELPSEFSYDYNWPIRKV 942


>gi|388856288|emb|CCF50097.1| related to cleavage and polyadenylation specificity factor, 160 kDa
           subunit [Ustilago hordei]
          Length = 1568

 Score = 85.9 bits (211), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 91/358 (25%), Positives = 164/358 (45%), Gaps = 61/358 (17%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L+  + L G V  L  + Q  + +   RD ++++F DAK+++LE++ +   L   S+H
Sbjct: 84  LVLIRKHSLFGTVTGLQRI-QTLSTSKDSRDRLLVSFTDAKLALLEWNHTTDDLETVSIH 142

Query: 161 CFE-SPEWL----HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS---- 211
            +E +P+ L    HL +        P + +DP  RC  +L+    + IL   +  +    
Sbjct: 143 TYERAPQLLNGIPHLFQ--------PNLNIDPLSRCAALLLPHDALAILPFYRDAAEFEF 194

Query: 212 --GLVGDEDTFGSGGGFSA-----RIES-----SHVINLRDLD--MKHVKDFIFVHGYIE 257
             GL  D +   +G   +A     +IES     S V+ +R++D  ++++KDF F+ G+ +
Sbjct: 195 DHGLHLDLNLDFAGEDKAAMQAAVQIESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQK 254

Query: 258 PVMVILHERELTWAGRVSWKHHT----------------CMISALSIS----TTLKQHPL 297
           P + +L     T  G ++ +                    M+ + S S    T    HP+
Sbjct: 255 PTVALLFAHSPTCTGLLAERKDNFSVYLFTLDLAASLDGAMLGSASYSFDDATLRSMHPV 314

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY-----AVSLDS 351
           + ++ +LP+D   +L  P  +GGVLVV  ++I +  QS    A ALN +     A+  +S
Sbjct: 315 LTTSSSLPYDCLYMLPCPQTLGGVLVVCMSSILHVDQSGRVVATALNGWFNLVSAIQPES 374

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             +LP  +   +L  +   +      +L+   GD+   T   DGR +Q   L +   S
Sbjct: 375 LLDLPEIA---DLQGSQLVFTAETEGVLTLVHGDVYTFTCQMDGRNIQGFRLERMQQS 429


>gi|154421858|ref|XP_001583942.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
 gi|121918186|gb|EAY22956.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
          Length = 1297

 Score = 84.3 bits (207), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 153/368 (41%), Gaps = 49/368 (13%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV   +  G +  +     GG       DSII+  + +K+ VL+  D+   L+ T  H
Sbjct: 48  LRLVWEKKFWGEIFGVYRHKSGG-----EYDSIIVGCDTSKVIVLQVIDN--DLKETEYH 100

Query: 161 CFESPEWLHLKRGRES--------FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
            F  P        +               ++  DP G C  +L+    + +L  +     
Sbjct: 101 EFNRPGPPEPDPPKPERPFDISTRLRNKTIMDADPTGTCLALLLAQNILYVLPLANK--- 157

Query: 213 LVGDEDTFGSGGGFSAR---IESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHE 265
            +  E T  +G  + +    I+ +   ++   D K     ++D +F+ GY  P + I+HE
Sbjct: 158 -IKIESTEKAGDEYHSSWKVIKDAFAYDVH-TDFKSPLYRIRDMVFLDGYKNPTLAIIHE 215

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLI---------WSAMNLPHDAYKLLAVPS 316
              TW+ R+  +  T  +S +S     K+  LI         W++  LPH+++ L+ VP 
Sbjct: 216 LIPTWSVRLPLQKSTVAVSIVSPPLKKKETVLISASIDKVTMWTSRALPHNSFGLVHVPD 275

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYA-----VSLDSSQELPRSSFSVELDAAHATW 371
           PIGG LV+  N I Y   +   ALALN  A     V +D +   P      EL +   T 
Sbjct: 276 PIGGFLVLSKNAIIYMDHTNIVALALNKLAYLDDEVPVDITANGPGCH---ELYSKVGTA 332

Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVV-----QRLDLSKTNPSVLTSDITTIGNSLFFLG 426
           +     LL+     L +LT+ Y+G  V           + +PS   S   T   SL F+G
Sbjct: 333 IDKSHILLTVDQHYLSILTLHYNGVKVTNLSLNVNLNLEFHPSCFLSLNYTNNRSLVFMG 392

Query: 427 SRLGDSLL 434
           S   DS L
Sbjct: 393 STTHDSTL 400


>gi|255718033|ref|XP_002555297.1| KLTH0G05984p [Lachancea thermotolerans]
 gi|238936681|emb|CAR24860.1| KLTH0G05984p [Lachancea thermotolerans CBS 6340]
          Length = 1307

 Score = 84.0 bits (206), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 127/653 (19%), Positives = 262/653 (40%), Gaps = 131/653 (20%)

Query: 99  ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI-HGLRIT 157
             L L+  ++LHG +  +A++ Q         D ++++   AK+S++ FD S+   L   
Sbjct: 45  GKLVLLHEFKLHGQITGMALVPQMEGP----LDCLVVSTGKAKLSLVRFDPSMPMCLETL 100

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG---LQMIILKASQGGSGLV 214
           S+H +E+      ++     A+   +++DP+ RC  VL++    L ++ L  ++      
Sbjct: 101 SLHYYEAE---FTRKNLIELAKTSKLRLDPERRC--VLLFNSDVLALLPLNINEEDE--- 152

Query: 215 GDEDTFGSGGGFSARIES---------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVIL 263
            D++   +      ++E+         S V+++ DL  ++K+V D  F++ + +P + +L
Sbjct: 153 -DDNQEPTHQAKKRKVENGDARRLAKQSSVLHVSDLSAELKNVVDIQFLNSFSQPTLAVL 211

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
           ++  L W+G         M   ++I+   K++  I+    LPHD + ++ + +     ++
Sbjct: 212 YQPRLAWSGNDKVAGKGSM-RLMAITPHEKKNTTIYQVKELPHDVHTIIPLAN---SCVL 267

Query: 324 VGANTIHY--HSQSASCALALNNYAVSLDSSQELPRSSFSVELD-----AAHATWLQNDV 376
           VG N I    ++ +    + LN+++     S+++  SS  V        A+       ++
Sbjct: 268 VGVNEIVSVDNTGAIQSTIQLNSFSPKFTGSKQIDNSSLEVMFTEPIVWASAMVSKDREI 327

Query: 377 ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV--------LTSDITTIGNSL------ 422
            +L     D+  +T+  +GR++    L +  P V        L + I  +   +      
Sbjct: 328 LILMDHKADMYSITLQSEGRLLIDFTLVRL-PIVNDIFKDQNLPTCIVALSGGIRLKTCQ 386

Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
           FF+G   GD+++V+          S+ L+  F     +A           DAL     G+
Sbjct: 387 FFIGFSSGDAVVVK----------SNNLRSAFESQYREAIELPNDEDEDYDALY----GD 432

Query: 483 ELSLYGSASNNTESAQKTFSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
           +  L    ++N  + +    F +   DSL+N+GP+     G   + +A+  G+   +  E
Sbjct: 433 DEDLARPVNDNKATVETAVPFEIELMDSLINVGPITSICTGRVSSINATIEGLPNPNRNE 492

Query: 541 L----------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
           L                            ++      IW +  +    +   +   A   
Sbjct: 493 LAIVSTSGHDSGTYLNVMEPSVRPLVQQALKFTSVTKIWNLKIRKKDKYLVTTDSGAEKS 552

Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           D Y       + A+   ++       VT          T+    L G +R++QV  +   
Sbjct: 553 DVYE------IGAKIASIKPKHFKRNVT----------TVEIAILGGGKRIVQVTTKAVY 596

Query: 633 ILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
           + +  +   MT    F               V+ VSI DP++LL  S G I++
Sbjct: 597 LFNLGFKKLMTISFDF--------------EVVHVSILDPFILLTNSKGEIKI 635


>gi|190348091|gb|EDK40482.2| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 1320

 Score = 83.6 bits (205), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 127/612 (20%), Positives = 237/612 (38%), Gaps = 68/612 (11%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            L L+  ++L+G V +L    +    +S   D I++A + AK+S++ +D   H +   S+
Sbjct: 51  KLRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESL 107

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV----- 214
           H +E           E+     L+ V+P       + +   +  L  S            
Sbjct: 108 HYYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDE 163

Query: 215 GDEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
           G+        GF + +  SS +++ + L+  +  + D  F+H Y EP + IL  +  TW 
Sbjct: 164 GETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSSKAATWT 223

Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIH 330
           G +           ++I    K    +    NLP D  +L+ +  P+ G L++G N  IH
Sbjct: 224 GLLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIH 283

Query: 331 YHSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLV 387
             +      LA+N Y   + +S +    ++  ++ L+      L ND   LLS  TG L 
Sbjct: 284 VDNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLSTGSLF 343

Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
            +    D + ++R  L     +  +S D+T  G   F       DS L+     +G S L
Sbjct: 344 HINFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL 397

Query: 447 SSGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
                     +E      + +  +  S+  +DM   EEL    +A       Q    +  
Sbjct: 398 ----------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFK 447

Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT------- 552
            D L+N GP+ DF+ G        A  I+   N   +   G      C  I+        
Sbjct: 448 HDELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPII 507

Query: 553 ----VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
                + + +R  N ++  +   DD      I  +E     L++ D + +          
Sbjct: 508 RSSLTFSQVNRMWNINNKYLITSDDVNSKSEIFQIEKSYSRLKSKDFIND---------- 557

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
             TIA   L   + ++Q+  +   + +  +  + +SF     E     +   ++S ++ D
Sbjct: 558 EMTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHD 613

Query: 669 PYVLLGMSDGSI 680
            Y+++  + G +
Sbjct: 614 DYLMIFFASGEV 625



 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 62/263 (23%), Positives = 115/263 (43%), Gaps = 35/263 (13%)

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMV 938
            SN    + ++L  +  P +AY     P G   +R  ++   +SG    F++G  P +   
Sbjct: 787  SNFKLKKEKDLLITGAPDNAY-----PAGTSIERRLVYIPLVSGFSSIFVTGVVPYFITR 841

Query: 939  FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             R  +    +    +  +F    +   ++G I++ +    +IC+LP    YDN  PV+KV
Sbjct: 842  TRHSIPRIFKFTKIAAQSFASFSDSKVSNGLIFLDNAKNARICELPRDFNYDNNLPVKKV 901

Query: 999  IPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVD 1054
             P+  T   +TY    N Y  +VS     P N      +D+E     G + D  + +S  
Sbjct: 902  -PIGETVKSVTYHELSNTY--VVSTYREIPYNA-----LDEEGNPIAGLKKDKPSANSY- 952

Query: 1055 LHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NET 1110
                    +  ++++ P      W    T+ ++ +E A+TV+ + L   ++TK     + 
Sbjct: 953  --------KGSLKLISPYN----WTVIETVELRDNEIAMTVKSMVLDIGSSTKRFKHRKE 1000

Query: 1111 LLAIGTAYVQGEDVAARGRVLLF 1133
            LL +GT   + ED+ A G   ++
Sbjct: 1001 LLVVGTGRYRMEDLGANGAFKIY 1023


>gi|398397855|ref|XP_003852385.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici
           IPO323]
 gi|339472266|gb|EGP87361.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici
           IPO323]
          Length = 1333

 Score = 82.8 bits (203), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 133/320 (41%), Gaps = 32/320 (10%)

Query: 97  SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
           + + L L+  Y L G V S+A +     D     ++I+LAF++AK+S++E+D   H +  
Sbjct: 45  AQSKLVLIGGYPLAGTVTSIARVKT--LDTRTGGEAILLAFKNAKLSLIEWDPENHRIST 102

Query: 157 TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL------------ 204
            S+H +E    +    G        ++ VDP  RC  +     Q+ IL            
Sbjct: 103 VSIHYYEGENVIAQPYGPSLGEYESILTVDPGSRCAALKFGARQLAILPFRQFGDELLGE 162

Query: 205 ------KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
                  A+ G +    D    G         + S V+ L  LD  + H  D  F+H Y 
Sbjct: 163 EEGEFENANDGTTSKKHDAMQNGEDEAEQTPYKQSFVLPLTTLDPALSHTIDLAFLHEYR 222

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           EP   I+             +      +  ++    K    + +  NLP   +K++ +PS
Sbjct: 223 EPTFGIISSAIEPSYALFDERKDILSYTVFTLDLEQKASTNLITVPNLPSTLWKVVPLPS 282

Query: 317 PIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND 375
           PIGG L++G N  IH      + A A+N +A+         +S  +++L+          
Sbjct: 283 PIGGALLIGTNEFIHVDQSGKANATAVNEFAMKESDFGMADQSGLNLKLEGC-------S 335

Query: 376 VALLSTKTGDLVLLTVVYDG 395
           V +L+  TG+  +L V+ DG
Sbjct: 336 VEILNASTGE--MLVVLRDG 353



 Score = 42.0 bits (97), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 49/257 (19%), Positives = 102/257 (39%), Gaps = 31/257 (12%)

Query: 889  NLRFSRTPLDAYTR-EETPHGAPCQRITIFK--NISGHQGFFLSGSRPCWCMVFRER--- 942
            NLRF + P+    +  ++P   P  R    +   I G+    + G+  C  ++ +E    
Sbjct: 775  NLRFRKVPVPYIPKYNDSPLEDPNARPPALRRMQIGGYNTVSIPGAPSC--LLLKEASGP 832

Query: 943  ---LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
               L V+      +    T L+ + C +GF  V   G L  CQLP  + +   W +++ I
Sbjct: 833  PKILEVNEPKRSNATTILTPLNRIGCENGFATVDVNGALHECQLPPDAWFSTGWSIRQ-I 891

Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS---SVDLH 1056
             L     ++ + A      + V+          +     +E G   +  ++S    V  +
Sbjct: 892  DLGDDAREVRHLAYHEARGIFVAATC-----TTVDFYFAEEDGRHPEQDDISIRPQVPQY 946

Query: 1057 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGT 1116
              + +     +I+   +          +P   +  AL V    + +  + E + ++ + T
Sbjct: 947  SVHLISAKTHKIIHTHK----------LPYLETVTALKVMPAEV-SELSHEVKPVVVVST 995

Query: 1117 AYVQGEDVAARGRVLLF 1133
               +GED+ A+G +++F
Sbjct: 996  GAQRGEDMPAKGALIVF 1012


>gi|159155577|gb|AAI54419.1| Cpsf1 protein [Danio rerio]
          Length = 400

 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 68/240 (28%), Positives = 110/240 (45%), Gaps = 51/240 (21%)

Query: 483 ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINADAS---- 529
           E+ +YGS A + T+ A  T+SF V DS++NIGP    S G         + N +      
Sbjct: 54  EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVV 111

Query: 530 -ATGISKQSNYELV------------ELPGCKGIWTVYHKSSR---------GHNADSSR 567
             +G  K     ++            ELPGC  +WTV +   +         G + +  +
Sbjct: 112 VCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEK 171

Query: 568 MAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
                  D + H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   + +I
Sbjct: 172 REPTIEDDKKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYII 230

Query: 625 QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
           QV   G R+L+G      L F P +         S ++  S+ADPYV++  ++G + + V
Sbjct: 231 QVSPMGIRLLEG---VNQLHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 280


>gi|146415762|ref|XP_001483851.1| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 1320

 Score = 81.3 bits (199), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 127/612 (20%), Positives = 237/612 (38%), Gaps = 68/612 (11%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            L L+  ++L+G V +L    +    +S   D I++A + AK+S++ +D   H +   S+
Sbjct: 51  KLRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESL 107

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV----- 214
           H +E           E+     L+ V+P       + +   +  L  S            
Sbjct: 108 HYYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDE 163

Query: 215 GDEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
           G+        GF + +  SS +++ + L+  +  + D  F+H Y EP + IL  +  TW 
Sbjct: 164 GETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSLKAATWT 223

Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIH 330
           G +           ++I    K    +    NLP D  +L+ +  P+ G L++G N  IH
Sbjct: 224 GLLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIH 283

Query: 331 YHSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLV 387
             +      LA+N Y   + +S +    ++  ++ L+      L ND   LLS  TG L 
Sbjct: 284 VDNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLLTGSLF 343

Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
            +    D + ++R  L     +  +S D+T  G   F       DS L+     +G S L
Sbjct: 344 HINFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL 397

Query: 447 SSGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
                     +E      + +  +  S+  +DM   EEL    +A       Q    +  
Sbjct: 398 ----------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFK 447

Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT------- 552
            D L+N GP+ DF+ G        A  I+   N   +   G      C  I+        
Sbjct: 448 HDELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPII 507

Query: 553 ----VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
                + + +R  N ++  +   DD      I  +E     L++ D + +          
Sbjct: 508 RSSLTFSQVNRMWNINNKYLITSDDVNLKSEIFQIEKSYSRLKSKDFIND---------- 557

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
             TIA   L   + ++Q+  +   + +  +  + +SF     E     +   ++S ++ D
Sbjct: 558 EMTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHD 613

Query: 669 PYVLLGMSDGSI 680
            Y+++  + G +
Sbjct: 614 DYLMIFFASGEV 625



 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 114/262 (43%), Gaps = 35/262 (13%)

Query: 881  NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMVF 939
            N    + ++L  +  P +AY     P G   +R  ++   +SG    F++G  P +    
Sbjct: 788  NFKLKKEKDLLITGAPDNAY-----PAGTSIERRLVYIPLVSGFSSIFVTGVVPYFITRT 842

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI 999
            R  +    +    +  +F    +   ++G I++ +    +IC+LP    YDN  PV+KV 
Sbjct: 843  RHSIPRIFKFTKIAAQSFASFSDSKVSNGLIFLDNAKNARICELPRDFNYDNNLPVKKV- 901

Query: 1000 PLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQE----VGHQIDNHNLSSVDL 1055
            P+  T   +TY    N Y  +VS     P N      +D+E     G + D  + +S   
Sbjct: 902  PIGETVKSVTYHELSNTY--VVSTYREIPYNA-----LDEEGNPIAGLKKDKPSANSY-- 952

Query: 1056 HRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKE---NETL 1111
                   +  ++++ P      W    T+ ++ +E A+TV+ + L   ++TK     + L
Sbjct: 953  -------KGSLKLISPYN----WTVIETVELRDNEIAMTVKSMVLDIGSSTKRFKHRKEL 1001

Query: 1112 LAIGTAYVQGEDVAARGRVLLF 1133
            L +GT   + ED+ A G   ++
Sbjct: 1002 LVVGTGRYRMEDLGANGAFKIY 1023


>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
          Length = 1092

 Score = 80.1 bits (196), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 128/601 (21%), Positives = 236/601 (39%), Gaps = 127/601 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+++  ++   L+ V    ++G + ++ + +  GA+    R+S+ +  E  K  ++E+D 
Sbjct: 37  RLVIYTLTPEGLQPVLDTGIYGRIAAIELYTVAGAE----RESLYILTERLKFCIVEYDS 92

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY-GLQMIIL 204
           S   L   +M   +           +S  R    GP+  +DP+ R  G L+Y GL  +I 
Sbjct: 93  STGELITKAMGDVQ-----------DSVGRPVDGGPIAHIDPERRMIGFLLYDGLFKVIP 141

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
             ++ G               F+ R+E   V++++           F++GY +P +V+L 
Sbjct: 142 IDTRNGQ----------LREAFNIRLEELQVLDVQ-----------FLYGYAQPTIVLL- 179

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGV 321
                      ++    M    +   +++    I   WS   +   A  ++ VP+PIGG 
Sbjct: 180 -----------YQDPKEMRHLKTYQVSIRDKDFIAGPWSQTGVEIGATMIIPVPTPIGGC 228

Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
           +++G  TI Y +         +   + +D +  + R+   ++ D            LL  
Sbjct: 229 ILLGEQTISYLNGDKG-----DTKTIHMDMT--VIRAWGKIDEDGRR--------YLLGD 273

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
             G L +L + +DG  V  L L     +     IT + + + F+GS  GDS L++     
Sbjct: 274 HLGQLYVLVLEFDGNKVLGLKLDTLGETSSAKTITYLDSGVVFIGSCFGDSQLIRL---- 329

Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
                                  K    S+ + L+   N   +  +       +   +  
Sbjct: 330 --------------------HPDKDENDSNIEVLESFTNLGPIQDFCVVDLERQGQGQVV 369

Query: 502 SFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH 561
           + +        G LKD S  LR+  +    GI++Q+    VELPG KG+W++        
Sbjct: 370 TCS--------GTLKDGS--LRVVRN--GIGINEQAA---VELPGIKGLWSLRE------ 408

Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
                   + D +Y  YLI S    T VLE AD     TE   +    +TI   N+ G  
Sbjct: 409 --------SIDAQYDKYLIQSFVNETRVLEIADEELSETEIDGFDHNAQTIFCSNVLG-D 459

Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSE--SGSGSENSTVLSVSIADPYVLLGMSDGS 679
            ++Q+ E   R++          + P N E  + +G     V+  S     + L +S+G 
Sbjct: 460 CLLQITEVSLRLVSTKSKQLLKEWFPPNGERITVAGGNVQQVVLTSGKRTLIYLDVSNGD 519

Query: 680 I 680
           +
Sbjct: 520 V 520


>gi|224135035|ref|XP_002321967.1| predicted protein [Populus trichocarpa]
 gi|222868963|gb|EEF06094.1| predicted protein [Populus trichocarpa]
          Length = 60

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/48 (75%), Positives = 41/48 (85%)

Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
           + DPSTC VSV TP+A +SSKK VS+CTLYHDKGPEP LRKTS +AWL
Sbjct: 1   MTDPSTCMVSVNTPSAFQSSKKSVSACTLYHDKGPEPLLRKTSPNAWL 48


>gi|255720869|ref|XP_002545369.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240135858|gb|EER35411.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 1351

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 29/318 (9%)

Query: 222 SGGGFSAR--IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
           +G  F  R   +SS +I+   LD  +  V D  F+H Y EP + +L  +   WAG +   
Sbjct: 198 NGNSFEPRQFYDSSFIIDATTLDSTVGTVIDMQFLHNYREPTIGVLSSKSEVWAGNLLKS 257

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
                   L++    K    ++   NLP++  +++ +PSP+ GV++VG N  IH  +   
Sbjct: 258 KDNIQFQVLTLDLNSKSTVSVFKIDNLPYEIDRVIPLPSPLNGVILVGCNELIHVDNGGV 317

Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTV 391
              +A+N +      S+ S Q+  +S  +++L+ +    + ND   LL  KTG+   +  
Sbjct: 318 MKRIAVNKFTGLTTASIKSFQD--QSDLNLKLEDSTIVPIPNDHRVLLVLKTGEFYYINF 375

Query: 392 VYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
             DG+ ++R+ +   +  +          ++  +  +L F  +  G+S LVQ       S
Sbjct: 376 ELDGKSIKRVHIDVIDKKLYEKVKLTYPGEVAVLDKNLLFFANSSGNSPLVQVKYRDSLS 435

Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
               G   E  D E +               ++    E+ +L          ++    F 
Sbjct: 436 DAKIGAPIEESDEEDETQKADEDDDEDDLYKEEEEEEEQKNL----------SKTHIEFV 485

Query: 505 VRDSLVNIGPLKDFSYGL 522
             D L+N GP   F+ G+
Sbjct: 486 YHDELINNGPSSSFTLGV 503


>gi|453082807|gb|EMF10854.1| CPSF_A-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 1349

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 144/680 (21%), Positives = 266/680 (39%), Gaps = 93/680 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   ++++++       G K + N G  ++ VL           V  Y L G V S+
Sbjct: 28  NLVVAKTSLLQVF-------GVKAAGNDGGNEKLVL-----------VGEYSLAGTVTSI 69

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +     D     ++++L+F+DAK+S++E+D   + +   S+H +E    +    G   
Sbjct: 70  ARVKT--LDTKSGGEAVLLSFKDAKLSLVEWDPENYRISTISLHFYEGDNVISAPFGPPL 127

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQ---------------GGSGLVGDEDT-- 219
                ++ VDP  RC  +     Q+ IL   Q                   L   + T  
Sbjct: 128 ADCDSILTVDPSSRCAALKFGARQLAILPFRQFGDELAGEEEEGEFDADHALATSKRTES 187

Query: 220 --FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
               +G       ++S  + L  LD  + H     F+H Y EP   IL          + 
Sbjct: 188 VPHANGDTEHTPYKASFTLALTALDPSVSHAVHLAFLHEYREPTFGILSATVEPSYSLLE 247

Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
            +      + L++    +    + S   LP   ++++ +P P+GG L++G N + +  QS
Sbjct: 248 ERKDILTYTVLTLDLEQRASTNLISVPKLPSTLWEVVPLPLPVGGALLLGTNELVHVDQS 307

Query: 336 ASC-ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVV 392
               A A+N +A          +S  +++L+      L +     L+ T  G L +L+  
Sbjct: 308 GKANATAVNEFAKLESDFGMADQSHLNLKLEDCRVEVLDSKTGELLIVTNDGSLAILSFQ 367

Query: 393 YDGRVVQRLDLSKTNPSVLTSDITT-------IGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
             GR +  L++ +      ++ I T       +  S  F+GS  G S L+ ++    TS 
Sbjct: 368 MHGRSISALNVKRATSENGSTTIHTAPSCMARLEGSKIFIGSEDGASSLLGWS--RPTSA 425

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
           L+   K     +       +       D        E      S +  T +AQ TFS  +
Sbjct: 426 LNR--KRSHAQMLDKEADDEDEEMEEDDDDLYDAAPEPKKRASSETAVTSTAQYTFS--I 481

Query: 506 RDSLVNIGPLKDFSYG--------LRINADASATGISKQS--NYELV-------ELPGCK 548
            D L++ GP+ +   G        L I A A     S+ +  + ++V       +L   +
Sbjct: 482 IDELLSTGPIHEVCLGRSGPWKDRLEIAAGAGRKQASRLTLMHRDIVPTVRRKCKLGAAR 541

Query: 549 GIWTVYHKSSRGHNADSSRMA-AYD-DEYHAYLIISL--EARTMVLETADLLTEVTESVD 604
             W +  K       +   +   +D D+   Y I S   +  +    +A       E++D
Sbjct: 542 ATWALRPKQRNAALPEYDNLLFVFDGDDTKVYDIPSQDEDGSSYTERSAPEFESAGETLD 601

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARI-LDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
                 T+A G +  + R  ++    A++ LD           P   E     E+ +++ 
Sbjct: 602 M----ATVADGTIVVQTRRTELRTYNAKLGLD--------QIIPMTDE--ETDEDLSIVH 647

Query: 664 VSIADPYVLLGMSDGSIRLL 683
           ++++DPYVL+   D S+++L
Sbjct: 648 IAVSDPYVLVIRGDNSVQVL 667



 Score = 47.0 bits (110), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 39/175 (22%), Positives = 78/175 (44%), Gaps = 19/175 (10%)

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPL 1019
            +H   C HGF+ + +   ++  QLP  + Y   W +Q+V   +   H I Y AE+ +Y  
Sbjct: 875  VHRPGCEHGFLTINADEEVQENQLPEKTWYGTGWSIQQVDIGEDVRH-IAYHAEREVY-- 931

Query: 1020 IVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1079
                 V+     +      ++  H   +      D+     V +Y + ++    +    Q
Sbjct: 932  -----VVATCRDIDFYFAGEDGRHPEQD------DIELRPQVPQYTIHLV----SAKSHQ 976

Query: 1080 TRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
               ++ +   E    ++V++L  +  T E + L+ + TA  +GED+ A+G V+L+
Sbjct: 977  RLQSVELGYLETVTALKVMSLEVSENTHEQKDLVVVSTAAQRGEDMPAKGAVILY 1031


>gi|254580509|ref|XP_002496240.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
 gi|238939131|emb|CAR27307.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
          Length = 1331

 Score = 78.2 bits (191), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 140/656 (21%), Positives = 270/656 (41%), Gaps = 124/656 (18%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L   ++  G +  LA++ Q  +      D ++L    AKISV+ +D++ + +   S+H
Sbjct: 48  LILTHEFKFEGRITDLAVVPQKDSP----LDCLLLCTSIAKISVVRYDEASNSIETLSLH 103

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
            +E        R     A+   ++VDP  RC   L++   +I L   Q  S         
Sbjct: 104 YYEDS---FKDRSILELAKESTMRVDPGKRCA--LLFNNDVIALLPLQTTSLNDGEEEDE 158

Query: 212 ---GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
                  D+    + G  +A    S + N ++L  DM +V D  F+  +  P + ++ E 
Sbjct: 159 DMDDERPDKRQKNNKGRITA---PSAIFNAKELHQDMNNVIDVTFLRNFTRPTLAVIFEN 215

Query: 267 ELTWAGR-------VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
           +  WAG        V++   T  +++   ST +K   +I +   L  D + ++ + +   
Sbjct: 216 KPVWAGTSQVLPLPVTYMAFTLEVTSNEQSTDIKS-TVIATVKELSWDFHTMIPIAN--- 271

Query: 320 GVLVVGANTIHYHSQSASCA--LALNNYA-VSLDSSQELPRSSFSVELDAAHA-TWLQND 375
           G ++VG+N + Y   + S    + LN+YA  ++  ++ + RS   + L       W  +D
Sbjct: 272 GCIIVGSNEMAYIDNTGSLQSIIFLNSYANKNMKKARIVDRSKSKILLHKPTTYNWSVSD 331

Query: 376 VALLSTKTGDLVLLT----------VVYDGRVVQRLDL-----------SKTNPSVLTSD 414
                ++TG+ +L+           + Y+GR++ + D+           + +N + ++  
Sbjct: 332 ---QKSETGETLLIMDHQAAFYYIQLEYEGRLLTKFDIINLPIVNDTLKNNSNATCISRL 388

Query: 415 ITTI-GNSL-FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
            +T+ GN +  F+G R GD+ +++       + L + ++      E  +P    + +   
Sbjct: 389 NSTLSGNYVDLFVGFRSGDASVLRL------NNLKAAIESRDEHKEITSPPENDIEKFED 442

Query: 473 DALQDMVNGEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADAS 529
              +D +  EE S       N E   +T   F   V  SL NI P+   + G   + D  
Sbjct: 443 ---EDDLYSEEASDADKEKENKEVVVETVLPFDIEVLSSLRNIAPITSLTPGKICSVDKF 499

Query: 530 ATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-EARTM 588
             G+S  +  E V L    G  T       G +    +M+   +   A   IS+ +   +
Sbjct: 500 VEGLSNPNRNE-VSLVATSGNGT-------GSHLTEIQMSVRPEVQLALKFISITQMWNL 551

Query: 589 VLETADLLTEVTES---------VD----YFVQGR-----TIAAGNLFGR-RRVIQVFER 629
            ++  D     T+S         +D     + +GR     T  + ++FG  +R++QV   
Sbjct: 552 KIKNKDKYLITTDSNKNKSDIYLIDKNFALYKEGRFRRDATTVSISMFGSDKRIVQVTTN 611

Query: 630 GARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
              + D ++    T    F               V+ VS+ DPY+L+ +S G I++
Sbjct: 612 HLYLYDTNFKRLTTMKFEF--------------EVVHVSVMDPYILITVSRGDIKV 653


>gi|241954348|ref|XP_002419895.1| subunit of the mRNA cleavage and polyadenylation factor, putative
           [Candida dubliniensis CD36]
 gi|223643236|emb|CAX42110.1| subunit of the mRNA cleavage and polyadenylation factor, putative
           [Candida dubliniensis CD36]
          Length = 1420

 Score = 77.8 bits (190), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 55/236 (23%), Positives = 110/236 (46%), Gaps = 17/236 (7%)

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
           +EDT G+        +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WAG 
Sbjct: 197 EEDTNGTNKESHLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGN 256

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +           L++    K    ++   NLP++  +++ +PSP+ G L+VG N  IH  
Sbjct: 257 LIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIVPLPSPLNGTLLVGCNELIHVD 316

Query: 333 SQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLV 387
           +      +A+N +      S+ S Q+  +S  +++L+      + +D   LL  +TG+  
Sbjct: 317 NGGVLKRIAVNKFTRLITASIKSFQD--QSDLNLKLENCSIVPIPDDHRVLLILQTGEFY 374

Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
            +    DG+ ++R+ +   +             ++  +  ++ F+ +  G+S L+Q
Sbjct: 375 FINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAVLDKNMLFIANSNGNSPLIQ 430


>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
          Length = 1137

 Score = 77.0 bits (188), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 109/467 (23%), Positives = 182/467 (38%), Gaps = 108/467 (23%)

Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
           R+S  R   +     +DP+GR  G+ +Y     ++    G   L   +DTF         
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---KDTF--------- 154

Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
                  N+R LD   V D  F+HGY +P + +L+E             H      L   
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYTKPTICVLYED-------YKAARHIKTYHILLKE 199

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
               + P  WS  N+   A  L+ VP+P+GGVL+V   TI YH+ S   A+ + +  + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPVGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             + +   S F                 LL+ + G L ++ + + G+ V  + L     +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVTGVHLEVLGET 300

Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
            + S ++ + N + F+GS  GDS L++              ++E G              
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIKLNAD----------RDENG-------------- 336

Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           S  + L   VN   +  +     + +   +  T S A +D     G L+    G+ IN  
Sbjct: 337 SYIEVLDTYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           ASA            ELPG KG+W +               AA  D+Y     +S E R 
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKYLLQSYVS-EIRI 427

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           + +   D + E  + +  F   +T+   N++G    +QV E   R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDVW-LQVTESEVRLI 471


>gi|68471006|ref|XP_720510.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           [Candida albicans SC5314]
 gi|74591422|sp|Q5AFT3.1|CFT1_CANAL RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|46442380|gb|EAL01670.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           [Candida albicans SC5314]
          Length = 1420

 Score = 76.6 bits (187), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 104/221 (47%), Gaps = 17/221 (7%)

Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
           +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WAG +           L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY-- 345
              LK    ++   NLP++  +++ +PSP+ G L+VG N  IH  +      +A+N +  
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336

Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
               S  S Q+  +S  +++L+      + +D   LL  +TG+   +    DG+ ++R+ 
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394

Query: 403 LSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
           +   +             ++  +  ++ F+ +  G+S L+Q
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 435


>gi|302403950|ref|XP_002999813.1| cft-1 [Verticillium albo-atrum VaMs.102]
 gi|261361315|gb|EEY23743.1| cft-1 [Verticillium albo-atrum VaMs.102]
          Length = 1349

 Score = 76.3 bits (186), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 117/507 (23%), Positives = 199/507 (39%), Gaps = 75/507 (14%)

Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S V+ L  LD  + H   F F+H Y EP + I+              H T  +   ++  
Sbjct: 197 SFVLALPQLDPEILHPVHFAFLHEYREPTLGIISSSNRRLKMEPQMDHFTFKV--FTVDL 254

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
             K    I +  NLP    K++A+  P+GG L++G N  IH      +  +A+N YA  +
Sbjct: 255 LQKASTAILTVSNLPQSLKKVVALSKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKM 314

Query: 350 DSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---- 403
                  +S   + L+      +   N   LL T+ G++ ++T   DGR V  + +    
Sbjct: 315 TKFPLADQSELKLRLEHCEVELMSPDNGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVA 374

Query: 404 SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
           ++    +L    + +T +  +  F G+  GDS ++ +   S   + ++  K    D   +
Sbjct: 375 TENGGDILPFRAACLTKVSKNSMFYGTIGGDSQVIGW---SRQHVQTARKKARLLD---E 428

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +           D   D + GE       ++    +      F V DSL+++ P+ D +Y
Sbjct: 429 SLDYDLDEDELDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMAY 488

Query: 521 G----------------LRINAD-ASATGISKQSNYELV------------ELPGCKGIW 551
           G                +R   D   A G  +  +  L+            E P  +G W
Sbjct: 489 GKTAFFPGSEEAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFEFPEARGFW 548

Query: 552 TV-----YHKSSRGHNADSSRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTE 601
           T        KS +G     + +A  +D     +Y  ++I++ +      ET+D+      
Sbjct: 549 TTRVQKTIAKSLQGEKG--ANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGA 605

Query: 602 SVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
             +           G TI AG +    R+IQV     R  DG   ++Q L     + E+G
Sbjct: 606 GFEALSGTEFDPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETG 663

Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSI 680
           +      V+S SI DPY+LL   D SI
Sbjct: 664 A---EPRVISASIVDPYLLLLREDSSI 687



 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 35/138 (25%), Positives = 66/138 (47%), Gaps = 30/138 (21%)

Query: 57  NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
           NL+V+  ++++I+ V+         +  +K S  +GET  R + D  G+ +A        
Sbjct: 28  NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSSSKAGETYDRRINDDDGLESAFLGGDGML 87

Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
                     L LV  Y +HG +  LA +      +SR   +++++    A++S+L++D 
Sbjct: 88  MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144

Query: 150 SIHGLRITSMHCFESPEW 167
             HG+   S+H +E  EW
Sbjct: 145 EKHGVEDVSIHFYEKEEW 162



 Score = 42.0 bits (97), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 33/57 (57%), Gaps = 1/57 (1%)

Query: 1078 WQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
            W       ++  E A+ ++ + L  +  TKE   L A+GTA ++GED+  RGR+L+F
Sbjct: 985  WNVIDEFELEQYEVAMCMKTLLLEVSEETKERRMLFAVGTAILRGEDLPVRGRILVF 1041


>gi|50305395|ref|XP_452657.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74606921|sp|Q6CTT2.1|CFT1_KLULA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|49641790|emb|CAH01508.1| KLLA0C10274p [Kluyveromyces lactis]
          Length = 1300

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 136/639 (21%), Positives = 257/639 (40%), Gaps = 111/639 (17%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           A  L L   ++L G +  + +L Q G   S  +   IL+   +K+S++ FD     L   
Sbjct: 45  AQKLVLAYEWKLAGKIIDMQLLPQIG---SPLKMLAILS-SKSKVSLVRFDPVAESLETL 100

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD 216
           S+H +   ++++L     S     ++ VDP  RC  +LV+   ++ IL        +  D
Sbjct: 101 SLHYYHD-KFVNL--STSSLKTESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEID 155

Query: 217 EDTFGSGGGFSARIESSHVINLRDLDM---------KHVKDFIFVHGYIEPVMVILHERE 267
           ED  G     + R++ +  I    + M         KHV D  +++ + +P + IL++  
Sbjct: 156 EDENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPV 215

Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
           L W G      +T     LS+    ++  +I    +LP+D + L  VP   G VL +G N
Sbjct: 216 LAWCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLPNDLHTL--VPLKRGYVL-IGVN 272

Query: 328 TIHYHSQSA---SCALALNNYAVSLDSSQELPRSSFSVELDAA----HATWLQNDVALLS 380
            + Y S S    SC + LN +A S  +++    S  ++ L  +    +    ++D+ +L 
Sbjct: 273 ELLYISASGALQSC-IRLNTFATSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILI 331

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL-----GD---- 431
            +   +  +    +G ++ + D  +            I N + F  SRL     GD    
Sbjct: 332 DENCRMYNIITESEGNLLTKFDCVQ----------VPIVNEI-FKNSRLPLSVCGDLNLE 380

Query: 432 --SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
              +L+ F  G    +    LK  F        + ++L  +  D        E  +LYG 
Sbjct: 381 TGRVLIGFLSGDAMFLQLKNLKVAFA-------AKRQLVETVDDDDD-----EYSALYGE 428

Query: 490 ASNNTES----AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
           + NNT +     Q+ F  ++ DS+ NIGPL   + G   + + +   +   +  E   + 
Sbjct: 429 SQNNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEF-SIV 487

Query: 546 GCKGI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
              G+      T  H + + H   + +  +    ++    + ++ +   L T D   E +
Sbjct: 488 ATSGVGRGSHLTALHSTVQPHIEQALKFTSATRIWN----LKIKGKDKYLVTTDADKEKS 543

Query: 601 E------------SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-----MTQDL 643
           +            + D+    RTI    +   +R++QV   G  + D  +     +T D+
Sbjct: 544 DVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTIDI 603

Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
                            ++   I DPY+L   + G+I++
Sbjct: 604 E----------------IVHACIIDPYILFTDARGNIKI 626


>gi|363750592|ref|XP_003645513.1| hypothetical protein Ecym_3197 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356889147|gb|AET38696.1| Hypothetical protein Ecym_3197 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 1318

 Score = 76.3 bits (186), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 135/646 (20%), Positives = 266/646 (41%), Gaps = 105/646 (16%)

Query: 97  SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
           +   L L   ++L G+V S+A++ Q G++       +++     K+S+L+FD     L  
Sbjct: 44  AKGQLVLSYEWKLSGHVHSMALIPQPGSE----LYCLVILTGCGKLSILKFDHMSQSLDT 99

Query: 157 TSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGV----LVYGLQMIILKASQGGS 211
            S+H +E   + L L       +  P + VD   RC  V     +  L + + K  +   
Sbjct: 100 LSLHYYEDKFKELSLLE----ISNTPSLIVDRSFRCLLVRNNDCIAILPLNVTKEEEEEE 155

Query: 212 GLVGDEDTFGSGGGFSAR------------IESSHVINLRDL--DMKHVKDFIFVHGYIE 257
                ++   +GG FS +            + SS ++    L  D+K+V D  F+HG+ +
Sbjct: 156 EDNEKDEDRSNGGRFSFKRHKLNGGSVKQFVNSSTIMPASHLHSDIKNVLDVQFLHGFNK 215

Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
           P + IL++  L W+G    +  T  +  LS+    ++  +I     LP+D + L+ + + 
Sbjct: 216 PTLAILYQPILAWSGNEKLRSQTVKVIILSLDFEDEKSTVINIIQGLPNDLHTLIPLSN- 274

Query: 318 IGGVLVVGANTIHYHSQSASC--ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ-- 373
               +VVG N + Y   + +    ++LN+++ ++ +++    SS     +     +    
Sbjct: 275 --ASIVVGVNELIYIDNTGALQGTVSLNSFSKTVLNTKVKDNSSLQAFFNRPVCQYTTIS 332

Query: 374 --NDVALLSTKTGDLVLLTVVYDGRVVQ-----RL----DLSKTN--PSVLTSDITTIGN 420
              D+ LL  +   +  + +  +GR+V      RL    D+ K N  P+ +  D+     
Sbjct: 333 KGKDIMLLMDEKSQMYNVIIESEGRLVTAFNCVRLPIVNDIFKNNHLPTCICGDVDLETG 392

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
           +L F+G + GD++ V+      +S+ S G   E   +EAD    +               
Sbjct: 393 NL-FIGFKSGDAMRVRLN-NLRSSLASKGNVVE--TMEADEDYDE--------------- 433

Query: 481 GEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
                LYG ++   +    T   F     D+L+NIGPL   + G   + + +   ++  +
Sbjct: 434 -----LYGGSTEVEKKNMDTETPFDIETLDNLINIGPLTSLAVGKVSSIEPTIAKLTNPN 488

Query: 538 NYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD------------DEYHAYLIISLEA 585
             EL  +    G  T  H +   +    +   A               +   YL+ +  +
Sbjct: 489 RCEL-SIVATSGNSTGSHLTVFENTIVPTVEKALKFISVTQIWNLKIKDKDKYLVTTDSS 547

Query: 586 RTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQ 641
           ++   + + D   +  +S D+     T++       +R++QV  +G  + D ++   MT 
Sbjct: 548 QSKSDIYSIDRDFKPFKSFDFKKNDTTVSTAVTGAGKRIVQVTSKGVYLFDINFKRMMTM 607

Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
           +  F               V+ V I DP++LL  S G I++   +P
Sbjct: 608 NFDF--------------EVVHVCINDPFLLLTNSKGDIKIYELEP 639


>gi|167384458|ref|XP_001736962.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165900458|gb|EDR26769.1| hypothetical protein EDI_171140 [Entamoeba dispar SAW760]
          Length = 836

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 83/326 (25%), Positives = 141/326 (43%), Gaps = 50/326 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+S+L +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSILRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTNPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++                    GF    ++S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIPL------------------GFD---KTSYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL---KQHPLIWSAMNL 304
            I + GY  P +  LH +  TW GR V+    T  I  LS+   +   KQ  +   +   
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIVLSLEPDVIHEKQDIVATVSYQF 228

Query: 305 PHDAYKLLAVPS--PIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF 360
            +  Y  L +    P  G+L++  N+I Y S ++  S  L    + V +  +   P SSF
Sbjct: 229 SYVPYNALQIVDCYPTNGILILTINSIIYLSTTSFESFILPFGKFFV-IPKNNNRPLSSF 287

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSVL 411
            +       T + N V  +   T  L ++         V+   +  R+ D+  TN P   
Sbjct: 288 QI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYKY 344

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
                TI ++  F+GS + DS +  +
Sbjct: 345 HPTTFTISSNHLFIGSTVHDSYIYNY 370


>gi|238881599|gb|EEQ45237.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1423

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/238 (23%), Positives = 110/238 (46%), Gaps = 19/238 (7%)

Query: 216 DEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
           +ED  G+      R+  +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WA
Sbjct: 203 EEDKNGTTTNQEPRLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWA 262

Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
           G +           L++    K    ++   NLP++  +++ +PSP+ G L+VG N  IH
Sbjct: 263 GNLIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIH 322

Query: 331 YHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGD 385
             +      +A+N +      S  S Q+  +S  +++L+      + +D   LL  +TG+
Sbjct: 323 VDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGE 380

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
              +    DG+ ++R+ +   +             ++  +  ++ F+ +  G+S L+Q
Sbjct: 381 FYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 438


>gi|449019486|dbj|BAM82888.1| similar to cleavage and polyadenylation specificity factor subunit
           [Cyanidioschyzon merolae strain 10D]
          Length = 1880

 Score = 74.7 bits (182), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 112/499 (22%), Positives = 194/499 (38%), Gaps = 128/499 (25%)

Query: 243 MKHVK--DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI----STTLKQHP 296
           + HV+  D  F+ G   P MV+L+E   TWAGRV    ++C ++A+ +    +    + P
Sbjct: 345 LGHVRILDCCFLTGTALPTMVMLYEERPTWAGRVEAVSNSCALAAIVLPPLPAGAAGEEP 404

Query: 297 LI-WSAMNLPHDAYKLLAVPS------PIGGVLVVGANTIHYHSQSASCALAL--NNYA- 346
           L+ W    LP DA K++ +PS         G+L++ AN + +   +     +L  N++  
Sbjct: 405 LVAWRIQGLPFDAEKVVPLPSVEWDRAAEQGLLLIAANVLFWIRGNGQIGASLSGNHFGD 464

Query: 347 --VSLDSSQELP---------------RSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
             + LD  Q LP               R+S  +    A    ++     L    G++  L
Sbjct: 465 TFMELDGCQ-LPGALYGGTDSDIISRCRTSQVLHFRGACIAPVRLHRYGLFLADGNVYQL 523

Query: 390 TVVYDGRVVQRLDL------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
            +  D     RL+       S+  P+ L  D   +   L F+ + LG S+L + T     
Sbjct: 524 ALHADAEYPLRLEALRVRGESRLAPAPL--DAKLLSRDLLFVAAHLGSSVLYRMT----- 576

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
                             P  +R R S+++              G+   N  + +  +  
Sbjct: 577 ---------------QVHPHGRRTRTSAAE-------------NGTLHKNATTKEAQWEL 608

Query: 504 AVRDSLVNIGPLKDF---------SYGLRINADA--SATGISKQS------------NYE 540
             RD++  +GP+ D            G  ++     +ATG   QS             ++
Sbjct: 609 QQRDTIFQLGPIVDLVVIPPRYSPPAGTLLDPGEILAATGHQHQSCLARCTYQVQTREWQ 668

Query: 541 LVELPGCKGIWTVY--HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
            +   GC+ +W++Y  H  +  H  +     A+     +   + L+ R    + AD  T 
Sbjct: 669 RIPSAGCRRVWSLYADHDGTGMHQEEQ----AFLLLSLSKSSVILDIRRGFEQAAD--TR 722

Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQDL---SFGPSNSES 652
           V       +   TIAAGNL  RR + QV   G R+LD +      +D+   +  P  + S
Sbjct: 723 V------LLPSPTIAAGNLAQRRLIAQVHRTGIRLLDANLDVVYEEDMLLAALEPGTAVS 776

Query: 653 GSGSENSTVLSVSIADPYV 671
           G+          S+ DPY+
Sbjct: 777 GA----------SVVDPYI 785


>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
 gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
          Length = 1150

 Score = 74.7 bits (182), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 110/467 (23%), Positives = 183/467 (39%), Gaps = 108/467 (23%)

Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
           R+S  R   +     +DP+GR  G+ +Y     ++    G   L    DTF         
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---RDTF--------- 154

Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
                  N+R LD   V D  F+HGY +P + +L+E +   A  V   H       L   
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYNKPTICVLYE-DYKAARHVKTYH------ILLKE 199

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
               + P  WS  N+   A  L+ VP+P GGVL+V   TI YH+ S   A+ + +  + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPTGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             + +   S F                 LL+ + G L ++ + + G+ V  + L     +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVSGVHLEVLGET 300

Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
            + S ++ + N + F+GS  GDS L++              ++E G              
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIKLNAD----------RDETG-------------- 336

Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           S  + L   VN   +  +     + +   +  T S A +D     G L+    G+ IN  
Sbjct: 337 SYIEVLDSYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           ASA            ELPG KG+W +               AA  D++     +S E R 
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKFLLQSYVS-EVRI 427

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           + +   D + E  + +  F   +T+   N++G    +QV E   R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDYW-LQVTESEVRLI 471


>gi|448530371|ref|XP_003870046.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis Co
           90-125]
 gi|380354400|emb|CCG23915.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis]
          Length = 1327

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 83/375 (22%), Positives = 157/375 (41%), Gaps = 57/375 (15%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L+LV  ++L G V  L  L      +    D ++++ + AK S++ ++  +H +   S+H
Sbjct: 57  LKLVEQFKLQGTVSGLKALRTSECPH---LDYVVVSTKYAKFSIIRWNHQLHNISTVSLH 113

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
            +E+          E  A   L  V+P       L Y   +  L                
Sbjct: 114 YYEN---CIQHSTFEKLAISDLT-VEPTYSSVSCLRYKNLLCFLPFEGVHEEDDEDDTDD 169

Query: 205 ----KASQGGS----GLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHG 254
                  +GGS    GL  +   F          ++S +I+   LD  +  V D  F+H 
Sbjct: 170 EDIDNDKKGGSITKNGLSYENQPF---------YDASFIIDAGILDSTIDTVLDVQFLHN 220

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y EP + IL  +  +WAG +           +++    K    +++  NLP+D  +++ +
Sbjct: 221 YQEPTIAILSAKSNSWAGNLIKNKDNVQFQVMTLDVQSKSTLPVFNIDNLPYDIDRVIPL 280

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHA 369
           P+P+ G L++G N  IH  +   +  +A+N +      S+ S Q+   S  +++L+    
Sbjct: 281 PNPLNGCLLIGCNELIHVDNGGIAKRIAVNAFTSLITASVKSYQD--ESDLNLKLENCAI 338

Query: 370 TWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNS 421
             + +D   LL   TG+   L    DG+ ++++ L   +  +  S        + ++  +
Sbjct: 339 VPIPDDHRVLLILATGEFYYLNFDLDGKSIKKIHLELVDQKMYDSIRLTYPGQVASLDKN 398

Query: 422 LFFLGSRLGDSLLVQ 436
           L F  +  GDS LV+
Sbjct: 399 LLFFANLNGDSSLVE 413


>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1120

 Score = 74.3 bits (181), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 127/549 (23%), Positives = 224/549 (40%), Gaps = 113/549 (20%)

Query: 96  ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
           + A  L+ V    ++G + ++++   G  D   R   + L  E    +VL +D++   L+
Sbjct: 58  LHAEGLKPVLDVPINGRIATMSLCQTGSGDGKAR---LYLTTERYGFTVLSYDEANEELK 114

Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLV 214
             +    +         GR +   G +  VD   R  G+ +Y GL  +I    +GG    
Sbjct: 115 TEAFGDVQD------NIGRPA-DDGQIGIVDDTCRAIGLRLYDGLFKVIPCDEKGG---- 163

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
                          ++ +  I L +L    V+D  F+HG  +P + +L+ R+   A  V
Sbjct: 164 ---------------VKEAFNIRLEEL---RVEDIKFLHGTPKPTIAVLY-RDTKDA--V 202

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
             K +   I      ++    P  W+  +L   + K++ VP+PIGGV+V+G   I Y   
Sbjct: 203 HIKTYEIGIREKEFVSS----P--WAQNDLEGGSNKIIPVPAPIGGVVVLGQEIIVY--- 253

Query: 335 SASCALALNNYAVSLD---SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                  LN +    D    +  +P       +    A        LL    G L LL +
Sbjct: 254 -------LNKFEDDADVFLKAINIPNIPDRTNITCYGAIDPDGSRYLLGDADGMLYLLVI 306

Query: 392 VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
           ++DG+ V+ L + +   + + S ++ + N + F+GS  GDS L++             L 
Sbjct: 307 LHDGKRVRELKIERLGDTSIASTLSYLDNGVVFVGSTYGDSQLIK-------------LH 353

Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGE--ELSLYGSASNNTESAQKTFSFAVRDSL 509
            E   I+ D   T          L  +V+    +L  +G     T S             
Sbjct: 354 AEKTSIDKDGNPTYVQILEEFTNLGPIVDFAFVDLERHGQGQVVTCS------------- 400

Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMA 569
              G LKD S  LR+  +    GI +Q+   +++LPG KG++++        ++D S+M 
Sbjct: 401 ---GALKDGS--LRVVRN--GIGIDEQA---VIQLPGVKGLFSL-------RDSDDSQM- 442

Query: 570 AYDDEYHAYLIISLEARTMVL----ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
                   YL+++    T +L    +  D L E TE   +  + +T+  GN+ G    +Q
Sbjct: 443 ------DKYLVVTFINETRILGFVGDEGDTLDE-TEIAGFDAEAQTLCCGNMQG-NVFLQ 494

Query: 626 VFERGARIL 634
           V  RG R++
Sbjct: 495 VTHRGVRLV 503


>gi|367014525|ref|XP_003681762.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
 gi|359749423|emb|CCE92551.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
          Length = 1327

 Score = 73.9 bits (180), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 130/638 (20%), Positives = 269/638 (42%), Gaps = 86/638 (13%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           +A L L   ++ HG +  LA++ Q  +      D ++L    AK+S+++FD   + +   
Sbjct: 45  SAKLFLTNEFKFHGKITDLALIPQVNSS----LDCLLLCTSIAKVSIVKFDPLSNSIETA 100

Query: 158 SMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG-LQMIILKASQGGSGLV 214
           S+H +E    +   L+  ++S+ R     +DP  RC  +L    L ++  +A+       
Sbjct: 101 SLHYYEDKFRDLSLLEIAQQSYFR-----LDPSKRCAIILNNDVLALLPFRAA------T 149

Query: 215 GDEDTFGSGGGFSARIES--------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILH 264
            D++   +      R+++        S +   ++L  ++++V D  F++ + +P + IL 
Sbjct: 150 DDDEEADAENNDVKRMKTSSDKVTYPSKIFVAKELHSEIRNVIDVQFLNNFSKPTIAILF 209

Query: 265 ERELTWAG--RVSWKHHTCMISALSISTTLKQHPL----IWSAMNLPHDAYKLLAVPSPI 318
           E  L WAG  +++ +  + MI  L IS+T          I     L  D + L+ + +  
Sbjct: 210 EPTLIWAGNRQLNPQPISYMIFTLEISSTDNTTKFGATTIGKLTGLSWDFHSLVPISN-- 267

Query: 319 GGVLVVGANTIHYHSQSASC--ALALNNYA-VSLDSSQELPRSSFSVELDAAHA-TW--- 371
            G ++VGAN + +   S +    + LN+++  +L   + +  S + + L  + A  W   
Sbjct: 268 -GCMIVGANELAFADNSGALQSVILLNSFSDRNLRQGRIIDNSKYEILLPQSIARCWSPP 326

Query: 372 ----LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFF 424
               + ++  LL     ++  + +  +GR++ + D+ K    N ++  +   T  + L  
Sbjct: 327 TSDKVNDETLLLMDANSNVYYVQLESEGRLLIKFDIIKLPIVNDTLKNNQGCTCMSRLNS 386

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-SSDALQDMVNGEE 483
             S     LL+ F  G    +  + LK      +     ++ +  S   D  +D +  +E
Sbjct: 387 RSSNNNMDLLMGFKSGDALVVRLNNLKSAAESRDEHKIFSEAMESSFDKDEDEDNLYSDE 446

Query: 484 LSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGI--SKQSN 538
            S  G A +N E   +T   F   +  ++ NIGP+   + G   + +    G+    ++ 
Sbjct: 447 ASDAGKADDNKEVIVETVTPFDIELLSTIKNIGPITSLAVGKVCSVEKYVKGLLNPNRNE 506

Query: 539 YELVELP--GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL- 595
           Y +V     G     T    S R     + +  +    ++    + ++ R   L T D  
Sbjct: 507 YSMVATSGNGSGSHLTEIQGSVRPTVEVALKFISVTQIWN----LKIKNRDKYLVTTDSN 562

Query: 596 -----LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
                + E+  +     +GR      T+      G +R++QV      + D ++  + L+
Sbjct: 563 KAKSDIYEIDNNFALHKEGRFRRDATTVCISMFGGDKRIVQVTTNNLILYDTNF--RRLT 620

Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
               + E         V+ VS+ DPY+L+ +S G I++
Sbjct: 621 TMKFDYE---------VVHVSVMDPYILITVSRGDIKI 649


>gi|344305212|gb|EGW35444.1| pre-mRNA 3'-end processing factor CF II [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 1348

 Score = 73.9 bits (180), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 99/500 (19%), Positives = 195/500 (39%), Gaps = 77/500 (15%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           L+V   N+++I+   + ++ S  +K                  L+++  ++L+G +  L 
Sbjct: 29  LIVAKGNLLQIFEPVLIKQQSTPTK--------------PKYKLQIIGQFKLNGLITDLH 74

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR--E 175
            L       +   D +I++ + AK S+++++  +H +   S+H +E     H  R    E
Sbjct: 75  PLR---TVENPHLDYLIVSTKYAKFSIIKWNHHLHTISTVSLHYYE-----HAIRNSTFE 126

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE----------------DT 219
                 L+ V+P       L +   +  L  +        D+                D 
Sbjct: 127 KLGISELI-VEPTFNSCSCLRFKNLLCFLPFAVSDEEEEEDDEEDMDLDNKKEKKEKLDI 185

Query: 220 FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
            G      +  +SS +I+ + LD  ++ V D  F+H Y EP + IL  +   WAG +   
Sbjct: 186 NGKPADAVSFYDSSFIIDAQTLDSSIETVVDIQFMHNYREPTIAILSSKSNVWAGNLLKV 245

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSA 336
                   +++    K    ++   NLP++  +++ +PSP+ G L++G N I H  +   
Sbjct: 246 KDNVSFQVMTLDLVSKSTVSVFKIDNLPYEIDRIIPLPSPLNGCLLLGCNEIFHVDNGGI 305

Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
              +A+N++      S  S Q+    S S+E D        +   L+   TG    +   
Sbjct: 306 IKRIAVNSFTSLVTASTKSYQDQTDLSLSLE-DCCIIPIPGDHRVLMVLTTGQFFYINFE 364

Query: 393 YDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
            DG+ ++++ +   + ++ +        ++  + ++L F  +  G+S LVQF        
Sbjct: 365 LDGKAIKKVHIDTVDQALYSQIKLCYPGEVAVLDHNLLFFANENGNSPLVQF-------- 416

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ----KTF 501
                   + D+  D     +         +     +E  LY    N  E  Q       
Sbjct: 417 -------RYTDV--DQKRITQEAAKEEKKEEKDDEEDEDDLYMDEENEEEQKQIISNSPI 467

Query: 502 SFAVRDSLVNIGPLKDFSYG 521
            F   D L+N GP+  F+ G
Sbjct: 468 EFIHHDELINNGPISSFTLG 487



 Score = 45.4 bits (106), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 62/305 (20%), Positives = 119/305 (39%), Gaps = 50/305 (16%)

Query: 836  HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
            H   +L  +   G ++ Y+ Y F+G                    N    + ++LR +  
Sbjct: 783  HKEEYLTILTIGGEVIMYKLY-FDG-------------------ENYIFKKEKDLRITGA 822

Query: 896  PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
            P +AY     P G   +R +  F N++G+   F++G  P   M     +    Q      
Sbjct: 823  PENAY-----PLGTTIERRLVYFPNLNGYTSIFVTGIIPYLIMKPMHSIPRIFQFSKIPA 877

Query: 955  VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEK 1014
            ++ +   +    +G I++ +    +IC+L    TY+  WP+++ I +  +   ITY    
Sbjct: 878  LSISAFSDSKIKNGLIFLDNSKNARICELSLDFTYEFNWPMRQ-IHIGDSIKSITYHETS 936

Query: 1015 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPD 1072
            N Y  +VS     P +           G   D   +      +T     Y+  ++++ P 
Sbjct: 937  NTY--VVSTFREIPYD-----------GLDEDGKLIVGTLPDKTPRPVAYKGSIKMISPL 983

Query: 1073 RAGGPWQTRATIPMQSSENALTVRVVTLFNTTT----KENETLLAIGTAYVQGEDVAARG 1128
                 W    TI +  +E A+ V+ + L   ++    K  +  + IG+   + ED+ A G
Sbjct: 984  N----WTVIDTIELDDTEVAMNVQSMMLDVGSSMKKFKNKKEFIVIGSGKYRNEDLVANG 1039

Query: 1129 RVLLF 1133
               +F
Sbjct: 1040 SFKIF 1044


>gi|430810873|emb|CCJ31593.1| unnamed protein product, partial [Pneumocystis jirovecii]
          Length = 301

 Score = 72.4 bits (176), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 128/292 (43%), Gaps = 41/292 (14%)

Query: 245 HVKDFIFV-HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
           H+ D  F+ + Y EP + IL+    T  G + ++  T   +A            I++   
Sbjct: 6   HIVDLWFIFYDYREPTLAILYSAFQTSTGLLPYRQDTMTSTA------------IYTVDK 53

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFSV 362
           LP+D + +L +P+PIGG L++G N + Y  Q+A   A+++N++A        +     ++
Sbjct: 54  LPYDLFSVLPLPNPIGGTLLIGNNELVYVDQAARVKAVSVNSFARKCTHLDFIEDYDLNL 113

Query: 363 ELDAAHATWL-----QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV----LTS 413
            L+ A   +L     Q    LL  + G  V +    DGRVV  L +   + SV    L S
Sbjct: 114 RLNGAVGVYLELLDDQPGAVLLVIEDGRFVQVGFKLDGRVVSSLSVKILDQSVKNDFLKS 173

Query: 414 D---ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
           +   I  + N   F+GS++ +S+L+++   S  +             E      + +   
Sbjct: 174 EASCIVLLNNEQLFIGSKVSNSVLLEWKRQSEIA-------------EKLLSEPRVIFDE 220

Query: 471 SSDALQDMVNGEELSLYGSASN-NTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             + L D+  GE+  +  ++S            F + D+L + GP+ D + G
Sbjct: 221 DREVLNDLY-GEDFDIVDTSSILQRNGVFGDIQFRLFDTLYSCGPIVDMTIG 271


>gi|119580419|gb|EAW60015.1| hCG2010549, isoform CRA_a [Homo sapiens]
          Length = 323

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 33/74 (44%), Positives = 46/74 (62%), Gaps = 1/74 (1%)

Query: 943  LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLK 1002
            LR+HP   +G + +F + HNVNC  GF+Y   QG L+I  LP+  +YD+ WPV+K IPL 
Sbjct: 184  LRLHPVGINGPVNSFALFHNVNCPRGFLYFNRQGKLRISVLPAYLSYDSPWPVRK-IPLC 242

Query: 1003 ATPHQITYFAEKNL 1016
             T H + Y  E  +
Sbjct: 243  CTVHCVAYHVESKI 256


>gi|407035910|gb|EKE37921.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
          Length = 836

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+SVL +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++         +G + T             S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
            I + GY  P +  LH +  TW GR+     T    +I +L      ++  ++    +  
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
             +P++A +++    P  G+L++  N+I Y S ++  S  L    + V +  +   P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTVNSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
           F +       T + N V  +   T  L ++         V+   +  R+ D+  TN P  
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
                 TI ++  F+GS + DS +  +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370


>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1090

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 126/557 (22%), Positives = 220/557 (39%), Gaps = 121/557 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A+ L+ +    ++G + +L +    G      +D + ++FE  K  VL++D 
Sbjct: 39  RIEIHLLTASGLQPMLDVPIYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
              GL +T      S      + GR +   G +  VDP  R  G+ +Y GL  +I   ++
Sbjct: 95  ET-GLLVTRAMGDVSD-----RIGRPT-DNGQIGIVDPDCRLIGLHLYDGLFKVIPIDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCAKPTIAVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H              + P  W   NL + A  L+ VP P+GG +++G  T
Sbjct: 186 D-------ARHVKTYEVQLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQT 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y++ S   A+ +            + ++   V+ D +          LLS   G L L
Sbjct: 237 IVYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + +D   V  L++     +   S ++ + N + F+GS  GDS L++            
Sbjct: 280 LVISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
                  + +ADA      + S  + L+  VN G  + L           Q  T S A +
Sbjct: 329 -------NHQADA------KNSYVEVLESYVNLGPIVDLCVVDLERQGQGQVVTCSGAFK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  ASA            EL G KG+W++   SS        
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSLRASSS-------- 410

Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
                 D Y  +L++S   E R + + T D L E TE   +  + +T+   N     +++
Sbjct: 411 ------DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCYNAV-HDQLV 462

Query: 625 QVFERGARILDGSYMTQ 641
           QV     R++D     Q
Sbjct: 463 QVTAGSLRLVDAKTRRQ 479


>gi|68471462|ref|XP_720279.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           fragment [Candida albicans SC5314]
 gi|46442139|gb|EAL01431.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           fragment [Candida albicans SC5314]
          Length = 423

 Score = 71.2 bits (173), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 45/199 (22%), Positives = 93/199 (46%), Gaps = 15/199 (7%)

Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
           F+H Y EP + +L  ++  WAG +           L++   LK    ++   NLP++  +
Sbjct: 3   FLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTLDLNLKSTISVFKIDNLPYEIDR 62

Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELD 365
           ++ +PSP+ G L+VG N  IH  +      +A+N +      S  S Q+  +S  +++L+
Sbjct: 63  VIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLE 120

Query: 366 AAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITT 417
                 + +D   LL  +TG+   +    DG+ ++R+ +   +             ++  
Sbjct: 121 NCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAI 180

Query: 418 IGNSLFFLGSRLGDSLLVQ 436
           +  ++ F+ +  G+S L+Q
Sbjct: 181 LDKNMLFIANSNGNSPLIQ 199


>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
          Length = 1148

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 114/511 (22%), Positives = 192/511 (37%), Gaps = 115/511 (22%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +D I L  +  +  VL +D ++  +   +           + R  E    G    +DP G
Sbjct: 74  QDWIFLVTQRFQFCVLAYDTTLQQIITKANGSLRDT----IGRNSEILTNG---NIDPDG 126

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDF 249
           R  G+ +Y     ++        L            F+ R++      LR LD+K     
Sbjct: 127 RLIGMNIYEGYFKVIPIDNHSKSL---------KAAFNIRLD-----ELRILDIK----- 167

Query: 250 IFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAY 309
            F++GY +P + +L+E             H      L       + P  WS  N+   A 
Sbjct: 168 -FLYGYNKPTICVLYED-------FKAARHVKTYFILLKEKDFAEGP--WSQSNVEAGAN 217

Query: 310 KLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
            L+ VP P GGVL++   TI YH+ +   A+ + N  + +  +     S F         
Sbjct: 218 LLIPVPMPYGGVLIISNQTIVYHNGTYFHAIPMQNTMIQVYGAVGDDGSRF--------- 268

Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
                   LL+ + G L ++ +  +G+ V  + L     + + S ++ + N + F+GS  
Sbjct: 269 --------LLADQYGALHVVALQTEGKEVLDVYLEVLGQTSIASCVSYLDNGVVFVGSTF 320

Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
           GDS LV+              ++E G              S  + L   VN   +  +  
Sbjct: 321 GDSQLVKLNSK----------RDESG--------------SYIEVLDSYVNIGPIIDFCV 356

Query: 490 ASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
              + +   +  T S A +D     G L+    G+ IN  ASA            ELPG 
Sbjct: 357 MDLDRQGQGQIVTCSGADKD-----GSLRVIRNGIGINEQASA------------ELPGI 399

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDY 605
           KG+W +    +               EY  YL+ S   E R M +  +D + EV   ++ 
Sbjct: 400 KGMWALRESLA--------------SEYDKYLVQSYLNEIRIMTIGDSDEMEEV--EIEA 443

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
           F+  +T+   N+      +QV E   RI+D 
Sbjct: 444 FLDAKTLYCRNV-NEDGWLQVTETEVRIIDA 473


>gi|149237256|ref|XP_001524505.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146452040|gb|EDK46296.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 1380

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 71/328 (21%), Positives = 139/328 (42%), Gaps = 60/328 (18%)

Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
           +SS +I   +LD  +  + D  F+H Y +P + +L  R  +WAG +        +  +S+
Sbjct: 223 DSSFIIEAGNLDSSIDTIIDLQFLHNYRDPTIALLSSRSHSWAGSLLKSKDNVHLEVMSL 282

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNY-- 345
               K    I+   NLP++  +++ + +P+ G L+VG N I H  +   +  +++N++  
Sbjct: 283 DLLTKLSTSIFKIENLPYEVDRIVPLSAPLNGCLLVGCNEIMHVDNGGIAKRISVNDFTS 342

Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
               S+ S+Q+  +S+  ++L+      + +D   L+ T+ G         DG+ ++R+ 
Sbjct: 343 LTTASVKSNQD--QSNLGLKLENCSVVQIPDDHRVLIVTEQGSFYFANFELDGKSIKRVF 400

Query: 403 LSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
           +   + ++          +I  +  +L F+ +  GDS LVQ             +K    
Sbjct: 401 IDVVDKNMYDKIKFTFPGEIAVLSKNLLFMSNLNGDSPLVQ-------------VKYRNS 447

Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA----------------QK 499
            I  D   T+R+ +           G E +    +SN  +                  QK
Sbjct: 448 KILEDTRGTRRVEKGK---------GAEKNKNNVSSNEVDDDDDDDDDLYKEEEEEEQQK 498

Query: 500 TFS-----FAVRDSLVNIGPLKDFSYGL 522
             S     F ++D L+N  P+  F+ GL
Sbjct: 499 VLSKSHIEFILQDRLINNSPISTFTLGL 526


>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
           C-169]
          Length = 1101

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 125/551 (22%), Positives = 206/551 (37%), Gaps = 113/551 (20%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ V    ++G V ++ +    G      +D + L+ E  K  VLE+D 
Sbjct: 44  RIEIHTLTPEGLKGVADVAIYGRVATMELFRPVG----ESKDLLFLSTERYKFCVLEYDS 99

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
               L   +    E       + GR     G +  VDP          G +MI L    G
Sbjct: 100 ETGELVTRANGDIED------QVGRPC-DNGQIGIVDP----------GCRMIGLHLYDG 142

Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
              ++  +D       F+ RI+  +VI           D IF+ G  +P + +L++    
Sbjct: 143 LFKVIPIDDKGQLHEAFNMRIDELNVI-----------DMIFLEGCAKPTIAVLYQDN-- 189

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
                    H      +     L + P  W   NL   A +++AVP P+GG LVVG + I
Sbjct: 190 -----KDARHIKTYEVVLKEKDLTEGP--WRQSNLDAGASRVIAVPEPLGGALVVGESVI 242

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVL 388
            Y  Q                  Q +  +     +  AH    ++    LL    G+L L
Sbjct: 243 AYMGQ-----------------GQAMKCTPIKATIIRAHGRVDEDGSRYLLGDYVGNLYL 285

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + +DG  V  L +     +   S +T + N + F+GS  GDS LV+            
Sbjct: 286 LVLQHDGEHVAGLKVEPLGRTSAPSTLTYLDNGVVFVGSSGGDSQLVRL----------- 334

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDS 508
                        P T +   +  + L+ M N   +  +       +   +    +    
Sbjct: 335 ----------HPTPVTPQEPSNFVEVLETMTNLGPIIDFVVVDLERQGQGQVVMCS---- 380

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
               G + D S  LRI  +    G+ +Q+    VELPG KG+W +           +S M
Sbjct: 381 ----GIMADGS--LRIVRN--GIGMIEQAT---VELPGIKGMWALR----------ASHM 419

Query: 569 AAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQV 626
            A+D     +L+IS   E R + +   D L E  E   +    +T+  GN      ++QV
Sbjct: 420 DAFD----TFLVISFVGETRILAINADDELDE-AELPGFSADAQTLCCGNTVS-DHLVQV 473

Query: 627 FERGARILDGS 637
                R++D S
Sbjct: 474 AGADVRLVDAS 484


>gi|449710759|gb|EMD49776.1| cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba histolytica KU27]
          Length = 836

 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+SVL +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++         +G + T             S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
            I + GY  P +  LH +  TW GR+     T    +I +L      ++  ++    +  
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
             +P++A +++    P  G+L++  N+I Y S ++  S  L    + V +  +   P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
           F +       T + N V  +   T  L ++         V+   +  R+ D+  TN P  
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
                 TI ++  F+GS + DS +  +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370


>gi|67463896|ref|XP_648489.1| cleavage and polyadenylation specificity factor subunit [Entamoeba
           histolytica HM-1:IMSS]
 gi|56464653|gb|EAL43100.1| cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba histolytica HM-1:IMSS]
          Length = 1150

 Score = 70.9 bits (172), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+SVL +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++         +G + T             S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
            I + GY  P +  LH +  TW GR+     T    +I +L      ++  ++    +  
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
             +P++A +++    P  G+L++  N+I Y S ++  S  L    + V +  +   P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
           F +       T + N V  +   T  L ++         V+   +  R+ D+  TN P  
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
                 TI ++  F+GS + DS +  +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370


>gi|365984967|ref|XP_003669316.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
 gi|343768084|emb|CCD24073.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
          Length = 1388

 Score = 70.5 bits (171), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 119/565 (21%), Positives = 224/565 (39%), Gaps = 108/565 (19%)

Query: 58  LVVTAANVIEIYVVR-VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           L+V   N++ IY +  +    S  S +  ET     +     A L L+  ++L+G V+ +
Sbjct: 29  LLVIRTNILSIYHLETILSPRSNTSSSQLETIEDATVTTSKQAKLFLINEFKLNGKVQDI 88

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +  G   NS   + I+L+   AK+S+L FD SI+     S+H +E            S
Sbjct: 89  ASIPLG---NSSSLECILLSTGTAKLSILNFDPSINSFETLSLHYYEEK---FKDISLVS 142

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL--------------KASQGGSGLVGDEDTFGS 222
            A+   +++DP  RC  +L++   ++ L              +       ++ + +   S
Sbjct: 143 LAKKSQLRMDPLNRC--LLMFNNDVMALLPLHSNNEDEEEEEEDENEEDEVLDNYEANLS 200

Query: 223 GGGFSARIE--------SSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
               + RI+         S + N+  L  D+K++ D  F++ + +P + +L++  LTWAG
Sbjct: 201 KTSPNKRIKYNNNQFEGKSKIFNINKLHEDVKNISDIQFLNNFNKPTIAVLYQPTLTWAG 260

Query: 273 RVSWKHHTC--MISALSI----STTLKQHP-----------LIWSAMNLPHDAYKLLAVP 315
            V         MI  L I    ST    H            +I     L  D +K++ + 
Sbjct: 261 NVQLNPLPTHFMIFTLDILSENSTNNANHTTENNNNDLNLIIIAKLKELAWDWFKIIPIS 320

Query: 316 SPIGGVLVVGANTIHYHSQSA--SCALALNNYA-VSLDSSQELPRSSFSVELD------- 365
           +   G +V+G N I Y   +      + LN++A  +L  ++ +  S F +  +       
Sbjct: 321 N---GCVVIGNNEIAYIDNTGVLQSIILLNSFADKNLKKTRIIDESKFQIFFNENVTHVW 377

Query: 366 ----AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-----------SKTNPSV 410
               + + T   ++  LL     +L  + +  +GR++ + D+              NP+ 
Sbjct: 378 SPSTSKNKTTEDDETLLLMDAQSNLYYVRLEAEGRLLTKFDIINLPIVNDVLRENCNPTC 437

Query: 411 LTSDITTIGNSL--FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
           ++   +   NS    F+G   GDSL+V+           + LK      +  + S +  +
Sbjct: 438 ISRLDSNATNSTMDLFIGFLSGDSLVVRL----------NNLKSAIDTRDEHSESNEHTQ 487

Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQ------------KTFSFAVRDSLVNIGPLK 516
            +  D        +E +LY     + E A+            + F      SL NIGP+ 
Sbjct: 488 LNGFDE------EDEDNLYSDDEVDVEDARSKRDMETIIHTVQPFDIEYLTSLKNIGPIT 541

Query: 517 DFSYGLRINADASATGISKQSNYEL 541
             + G   + D +  G+   +  E 
Sbjct: 542 SLTVGKVSSLDLNVKGLQNPNKNEF 566


>gi|302769568|ref|XP_002968203.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
 gi|300163847|gb|EFJ30457.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
          Length = 1089

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 123/557 (22%), Positives = 218/557 (39%), Gaps = 121/557 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+    ++A  L+ +    ++G + +L +    G      +D + ++ E  K  VL++D 
Sbjct: 39  RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  VDP+ R  G+ +Y GL  +I   ++
Sbjct: 95  ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H              + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WSQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y+S SA  A+ +            + ++   V+ D +          LLS  TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     +   S ++ + N + ++GS  GDS L++            
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
                  + + D+      R S  + L+  VN G  + L           Q  T S A +
Sbjct: 329 -------NAQVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  ASA            EL G KG+W++             
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
             A   D +  +L++S   E R + +   D L E TE   +  + +T+   N     ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462

Query: 625 QVFERGARILDGSYMTQ 641
           QV     R++D +   Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479


>gi|385304555|gb|EIF48567.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
           [Dekkera bruxellensis AWRI1499]
          Length = 353

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 67/293 (22%), Positives = 129/293 (44%), Gaps = 39/293 (13%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERE-LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
           +K++ D+ F++ Y EP + IL+  E L+WAG +        +  LS++    +   I   
Sbjct: 69  VKNIMDYQFLYSYREPTIAILYAPEGLSWAGYLXKLKDNMKVVVLSLNLDTHKADSIMVL 128

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            NLP+D   +  +PSPI G L++G+N I H +S  +   +  N Y       +    S  
Sbjct: 129 PNLPYDLNSIYPLPSPINGFLLIGSNEILHVNSLGSIKGVYTNKYFPETSDMKLRDESDL 188

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG------RVVQRLDLSKTNPSVLTSD 414
           ++E +    +++ +D  LL ++ G   +L+    G      ++++  + +  N SV  ++
Sbjct: 189 NLECEGCSVSFVGDDQVLLISQIGKFYVLSFNESGGISNLNKIIEIPEANYCNVSV--NN 246

Query: 415 ITTIGN----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
           +  I N    +  FL  +  DS+L+ +                      + P+   + +S
Sbjct: 247 VLQITNIEDCNSAFLCCQGSDSILLHWN--------------------YNVPTRGTVSKS 286

Query: 471 SSDALQDMVNGEELSLY--GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           ++   ++    E+  LY     S  +     + +F   D LVN GP  DF+ G
Sbjct: 287 NAGIEKE---DEDSWLYHEDETSQTSNRPLTSCTFTXIDKLVNCGPTSDFTIG 336


>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
          Length = 1065

 Score = 70.1 bits (170), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 90/387 (23%), Positives = 149/387 (38%), Gaps = 49/387 (12%)

Query: 349 LDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
           +D   + P   F    + LD    T +  +  LL  + G L  L +V D    V+ L+L 
Sbjct: 1   MDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLVTDATNSVKSLELK 60

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
               +V+   +T+      F+GSRL DS+ +       T             ++  AP  
Sbjct: 61  FQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------------LDESAPKK 107

Query: 465 KRLRRSSSDALQDMVNGEELSLYGSA---SNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
            +L  +  +A +D    E+  LYG         +SA++  +  V D L+N+GP K  + G
Sbjct: 108 IKLN-TELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDKLLNVGPCKKITGG 162

Query: 522 LRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSSRMAAY-------- 571
               +        K   ++LV   G    G   ++ +S R     SS +           
Sbjct: 163 CPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSSSIEGVVQYWAVGR 222

Query: 572 -DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L      +QV    
Sbjct: 223 REDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGELADGGLAVQVTTSS 281

Query: 631 ARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPS 688
             ++      Q +                 V S SI DPY+ +   +G + +  L   P 
Sbjct: 282 LVMVAEGQQIQHIPL----------QLTFPVRSASIVDPYIAICTQNGRLLMYELTSHPH 331

Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHD 715
                +     +     P++S ++Y D
Sbjct: 332 VHLKEIDISKRLRHETSPITSLSIYRD 358



 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/239 (24%), Positives = 104/239 (43%), Gaps = 29/239 (12%)

Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
           +  E+G + I+ +P  + V+ V K     +H+ D    +   D E       +  S++  
Sbjct: 445 IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMT 500

Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
            T   + E +    ++EL M     +  RP LF +L D T+  Y+ + +    N      
Sbjct: 501 DTFAAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGH 551

Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
                + L  + V+    R+ RF    LD     E+   A   +  +  F+ I     G 
Sbjct: 552 LAVRFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 605

Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLP 984
           F+  S PC   +     R+HP   DG I++FT  +N  C +GFIY+T  + ++++ +LP
Sbjct: 606 FICSSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLP 664


>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
 gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
          Length = 1138

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 117/519 (22%), Positives = 195/519 (37%), Gaps = 126/519 (24%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D+DT                 +L
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DKDT-----------NELKATSL 157

Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
           R  +M HV+D  F++G   P ++++H+        ++ +H    I    IS   K+   I
Sbjct: 158 RMEEM-HVQDVEFLYGTTHPTLIVIHQD-------INGRH----IKTHEISLKDKEFTKI 205

Query: 299 -WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAVSL 349
            W   N+  +A  L+AVP P+GG +V+G  +I YH   +  A+A        +N YA   
Sbjct: 206 AWKQDNVETEATMLIAVPMPLGGAIVIGQESIVYHDGDSYVAVAPAIIKQSTINCYA--- 262

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLS 404
                         +D+    +L  ++A      G+L ++ +  +        V+ + + 
Sbjct: 263 -------------RIDSKGLRYLLGNMA------GNLFMMFLETEENAKGQTTVRDIKVE 303

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
                 +   IT + N + F+GSR GDS LV+    +G +     L E F ++   AP  
Sbjct: 304 LLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVMLMETFTNL---APIV 360

Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI 524
                     L+    G+ ++  GS                       G L+    G+ I
Sbjct: 361 DMCVVD----LERQGQGQMITCSGSFKE--------------------GSLRIIRNGIGI 396

Query: 525 NADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
              A             ++LPG KG+W +             R+   D  Y   LI+S  
Sbjct: 397 QEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLILSFV 431

Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQD 642
             T VL  +    E TE        +T    N+    +++QV    AR++  D   M  +
Sbjct: 432 GHTRVLMLSGDEVEETEIAGILGDQQTFYCANV-SHGQILQVTPSSARLISCDNKAMICE 490

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
               P N   G    N+T +  + A     + + DG + 
Sbjct: 491 WK-PPDNKRIGVVGANTTQIVCASAQDVYYVEIGDGKLE 528


>gi|242089089|ref|XP_002440377.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
 gi|241945662|gb|EES18807.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
          Length = 1783

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 108/464 (23%), Positives = 189/464 (40%), Gaps = 112/464 (24%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +D + +A E  K  VL++D     L   +M           + GR +   G +  +DP  
Sbjct: 75  QDFLFIATERYKFCVLQWDAEKSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDC 127

Query: 190 RCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           R  G+ +Y GL  +I   ++G                F+ R+E   V++++         
Sbjct: 128 RLIGLHLYDGLFKVIPFDNKGQLK-----------EAFNIRLEELQVLDIK--------- 167

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
             F+HG ++P +V+L++           +H      AL       + P  WS  N+ + A
Sbjct: 168 --FLHGCVKPTIVVLYQ------DNKDVRHVKTYEVALK-DKDFVEGP--WSQNNVDNGA 216

Query: 309 YKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAH 368
             L+ VP+P+GGV+++G   I Y + +++          ++   Q + R+   V+ D + 
Sbjct: 217 GLLIPVPAPLGGVIIIGEEQIVYCNANSTFK--------AIPIKQSIIRAYGRVDPDGSR 268

Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSR 428
                    LL   TG L LL + ++   V  L +     + + S I+ + N + ++GSR
Sbjct: 269 Y--------LLGDNTGILHLLVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSR 320

Query: 429 LGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
            GDS LV+                   +++ADA        S  + L+  VN   +  + 
Sbjct: 321 FGDSQLVKL------------------NLQADASG------SFVEILERYVNLGPIVDFC 356

Query: 489 SASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
               + +   +  T S A +D     G L+    G+ IN  AS            VEL G
Sbjct: 357 VVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQG 399

Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
            KG+W++  KSS            ++D Y  YL++S  + T  L
Sbjct: 400 IKGLWSL--KSS------------FNDPYDMYLVVSFISETRFL 429


>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
          Length = 1149

 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 105/481 (21%), Positives = 180/481 (37%), Gaps = 110/481 (22%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G L  +DP+ R  G+ +Y     I+   +  S L                       N+R
Sbjct: 119 GILAVIDPKARVIGLRLYDGLFKIIPLEKDNSELKAS--------------------NIR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG   P ++++H+        V+ +H    +    IS   K+   + 
Sbjct: 159 -IDELQVHDVEFLHGCANPTLILIHQD-------VNGRH----VKTHEISLREKEFVKVP 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VPSP+GG +++G   I YH       +A             + +S
Sbjct: 207 WRQDNVETEASMIIPVPSPLGGAIIIGQENILYHDGITPVVVA----------PAVIKQS 256

Query: 359 SFS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVL 411
           +     ++D     +L  D+A      G L +L +  D R     VV+ L +        
Sbjct: 257 TIVCYAKVDPGGLRYLLGDMA------GHLFMLFLEVDNRGDGNDVVKDLKVELLGEIAT 310

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
              IT + N + F+GSRLGDS LV+ T     S     + E F ++   AP    L    
Sbjct: 311 PECITYLDNGVLFIGSRLGDSQLVKLTTKPNESGSYVTVMESFTNL---API---LDMCV 364

Query: 472 SDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
            D L+    G+ ++  G+                       G L+    G+ I   AS  
Sbjct: 365 VD-LERQGQGQLVTCSGAFKE--------------------GSLRIIRNGIGIQEHAS-- 401

Query: 532 GISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE 591
                     ++LPG KG+W +                A D  Y   L+++   +T VL 
Sbjct: 402 ----------IDLPGIKGMWAL--------------QVASDGRYDNTLVLAFVGQTRVLS 437

Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
                 E T+   +    +T   GN+    +++Q+    AR++     T    + P + +
Sbjct: 438 LNGEEVEETDIAGFASDQQTFFCGNVI-HEQIVQITPISARLISAQNKTLLAEWKPPSDK 496

Query: 652 S 652
           +
Sbjct: 497 N 497


>gi|403218521|emb|CCK73011.1| hypothetical protein KNAG_0M01580 [Kazachstania naganishii CBS
           8797]
          Length = 1345

 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 149/708 (21%), Positives = 285/708 (40%), Gaps = 150/708 (21%)

Query: 61  TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
           T A+ +E+ +VR       +   SG+              L L   ++L   +  LA++ 
Sbjct: 22  TTADYVELLIVRTNLLSIYKVTESGK--------------LLLTHEFKLQARITDLALV- 66

Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF--- 177
            G  +N+   + ++L   + K+S+++F+   + L   S+H +E       K    SF   
Sbjct: 67  -GSVENTGL-NYLLLGIGNCKLSIVKFNSLNNSLETISLHYYEE------KFKANSFIEL 118

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA--------R 229
           A+   +++DPQ RC  +L     ++IL  SQ        E+                  +
Sbjct: 119 AKKTELRIDPQNRCA-LLFNNDNIVILPFSQQQEEEDYGEEEEEEDNYNMEDGPNVKKLK 177

Query: 230 IES--------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH- 278
           +ES        S + + + LD  +++V D  F+  +  P + IL++ +LTWAG +     
Sbjct: 178 LESASTNLTLPSIITDSKKLDSTIENVVDIQFLRNFSRPTLGILYQPKLTWAGNLQLNPL 237

Query: 279 -HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
               ++ +L+I+ +  +  +I     LP D++ L  +P+  G VL+ G+N + Y   +  
Sbjct: 238 PTKFLVISLNIAVSELEGTVITKLEGLPWDSHTL--IPTWNGCVLL-GSNEVSYIDNTGV 294

Query: 337 -SCALALNNYA-VSLDSSQELPRSSFSVEL--DAAHATWLQ--------NDVALLSTKTG 384
              A+ LN+YA  SL   + +  +   + L  D   + W          +++ LL  ++ 
Sbjct: 295 LQSAIFLNSYADASLRKVRVVDHTDQQITLNKDLVKSLWSAPTKESGGADEILLLMDESS 354

Query: 385 DLVLLTVVYDGRVVQRLDL-----------SKTNPSVLT--SDITTIGNSLFFLGSRLGD 431
           +L  + + ++GR++ + D+              +P+ +T   +     N   F+G + GD
Sbjct: 355 NLYYIQLEFEGRLMTKFDMINLPIVNDIFVHNLHPTCITRIDESKHNININLFIGFQTGD 414

Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS---LYG 488
           SL+V+                   +I +   +    +++SS++    V  E+     LYG
Sbjct: 415 SLVVRL-----------------NNIRSAIETRHEYKQTSSESGLGKVEDEDEDEDDLYG 457

Query: 489 -------SASNNTESAQ----KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK-- 535
                  +AS N ++A     + F   +   L NIGP+     G   +      G+    
Sbjct: 458 DDGAHDKNASVNNDNAVVHTVQPFDIEMMSCLRNIGPVTSLVIGEASSVQPVIKGLPNPN 517

Query: 536 QSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--------EART 587
           +  Y LV   G          +  G N    +++   +   A   IS+        + R 
Sbjct: 518 KGEYSLVATCG----------NGTGSNLMVGQISVQPEVELALKFISVTQIWNLKVKNRD 567

Query: 588 MVLETADL------LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILD 635
             L T D       + E+  +   + QGR      T+      G +R++QV      + D
Sbjct: 568 KYLITTDSTKTKSDIYEIENNFALYKQGRLRRDATTVYISMFGGEKRIVQVTTNHLYLYD 627

Query: 636 GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
            ++  + L     N E         V+ VS+ DPY+L+ +S G I + 
Sbjct: 628 TNF--RRLFLNKFNYE---------VVHVSVMDPYLLITLSRGDIMIF 664


>gi|440302955|gb|ELP95261.1| hypothetical protein EIN_430670 [Entamoeba invadens IP1]
          Length = 1175

 Score = 68.9 bits (167), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 82/362 (22%), Positives = 153/362 (42%), Gaps = 46/362 (12%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCG 192
           +IL F+ A++SV+ ++   +   + S+HCFE PE    ++   +    P + +D +GRC 
Sbjct: 74  LILLFKQARLSVMRYNTETNRFVVHSLHCFEYPELRIREKCTPTAYDDPRMFIDKKGRCI 133

Query: 193 GVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFV 252
            +L Y   + ++                GS         SS+ ++L    +  + D I +
Sbjct: 134 SLLCYDRLLWVIP--------------LGSN-------RSSYRVDLEKFGVSRIVDVISL 172

Query: 253 HGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL--KQHPLIWSAMN----LP 305
            GY  P +  LH    TW  R V+    T  I+ ++++  +  ++     + +N    LP
Sbjct: 173 SGYETPTLAFLHMTVPTWDARTVNTGEATNEIAIINVNPGVVGEEEQECANVVNRISRLP 232

Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL---------NNYAVSLDSSQELP 356
           ++  K++    P+ G+L++ + ++ Y S ++S +  L           +   +   + LP
Sbjct: 233 YNTLKMVEC-YPLPGILLLASVSVLYISTTSSESFILPFGTYFNPPEVWKGVVPFLKLLP 291

Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
                ++L  +     QN + L  T  GD   + +     +VQ + LS   P     +  
Sbjct: 292 MKIRIIQLVKSIHQLSQN-LYLTFTDKGDSYYIHLNCVEGIVQEIVLSNA-PYKFIPNTV 349

Query: 417 TIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
           ++ +   FLGS   DS L  +T C  G      G  + FG    DA   K L+  S    
Sbjct: 350 SLYDDYIFLGSVFHDSYLFNYTICEYG-----KGDIKPFGIHCGDAVRIKNLQERSGQME 404

Query: 476 QD 477
           +D
Sbjct: 405 ED 406


>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName:
           Full=UV-damaged DNA-binding protein 1
 gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
          Length = 1095

 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 119/540 (22%), Positives = 204/540 (37%), Gaps = 143/540 (26%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL++     IEI+++  Q                    G+    L+ +    ++G + +L
Sbjct: 31  NLIIAKCTRIEIHLLTPQ--------------------GLQCICLQPMLDVPIYGRIATL 70

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            +    G      +D + +A E  K  VL++D     +   +M           + GR +
Sbjct: 71  ELFRPHG----ETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSD------RIGRPT 120

Query: 177 FARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
              G +  +DP  R  G+ +Y GL  +I   ++G                F+ R+E   V
Sbjct: 121 -DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK-----------EAFNIRLEELQV 168

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           ++++           F++G  +P +V+L++           +H        +   +LK  
Sbjct: 169 LDIK-----------FLYGCPKPTIVVLYQ------DNKDARH------VKTYEVSLKDK 205

Query: 296 PLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS 352
             I   W+  NL + A  L+ VP P+ GVL++G  TI Y S SA  A+ +          
Sbjct: 206 DFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR--------- 256

Query: 353 QELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT 412
             + R+   V+ D +          LL    G L LL + ++   V  L +     + + 
Sbjct: 257 PSITRAYGRVDADGSR--------YLLGDHNGLLHLLVITHEKEKVTGLKIELLGETSIA 308

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           S I+ + N+  F+GS  GDS LV+                         P TK    S  
Sbjct: 309 STISYLDNAFVFIGSSYGDSQLVKLNL---------------------QPDTK---GSYV 344

Query: 473 DALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
           + L+  VN   +  +       +   +  T S A +D     G L+    G+ IN  AS 
Sbjct: 345 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIGINEQAS- 398

Query: 531 TGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
                      VEL G KG+W++               +A DD Y  +L++S  + T VL
Sbjct: 399 -----------VELQGIKGMWSL--------------RSATDDPYDTFLVVSFISETRVL 433


>gi|12082087|dbj|BAB20761.1| UV-damaged DNA binding protein [Oryza sativa Japonica Group]
          Length = 1090

 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +       ++  +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G ++P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y + +++          ++   Q + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N + ++GSR GDS LV+            
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++AD         S  + L+  VN   +  +     + +   +  T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++  KSS        
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
               ++D Y  YL++S  + T  L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429


>gi|115465791|ref|NP_001056495.1| Os05g0592400 [Oryza sativa Japonica Group]
 gi|48475231|gb|AAT44300.1| putative DNA damage binding protein 1 [Oryza sativa Japonica Group]
 gi|113580046|dbj|BAF18409.1| Os05g0592400 [Oryza sativa Japonica Group]
 gi|215694552|dbj|BAG89545.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222632766|gb|EEE64898.1| hypothetical protein OsJ_19757 [Oryza sativa Japonica Group]
          Length = 1090

 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +       ++  +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G ++P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y + +++          ++   Q + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N + ++GSR GDS LV+            
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++AD         S  + L+  VN   +  +     + +   +  T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++  KSS        
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
               ++D Y  YL++S  + T  L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429


>gi|218197365|gb|EEC79792.1| hypothetical protein OsI_21216 [Oryza sativa Indica Group]
          Length = 1089

 Score = 67.8 bits (164), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +       ++  +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G ++P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y + +++          ++   Q + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N + ++GSR GDS LV+            
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++AD         S  + L+  VN   +  +     + +   +  T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++  KSS        
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
               ++D Y  YL++S  + T  L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429


>gi|357132340|ref|XP_003567788.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
           distachyon]
          Length = 1090

 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 152/367 (41%), Gaps = 83/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G + P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCLRPTIVVLYQ------DNKDARHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  WS  NL + A  L+ VP+P+GGV+++G  TI Y + +++        
Sbjct: 197 LK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEETIVYCNANSTFK------ 247

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
             ++   Q + R+   V+ D +          LL   TG L LL +  +   V  L +  
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSR--------YLLGDNTGILHLLVLTQERERVTGLKIEH 297

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N + ++GSR GDS LV+                   +++ADA    
Sbjct: 298 LGETSVASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADATG-- 337

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
               S  + L+  VN   +  +     + +   +  T S A +D     G ++    G+ 
Sbjct: 338 ----SFVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSIRVVRNGIG 388

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++  KSS            ++D Y  +L++S 
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------FNDPYDTFLVVSF 422

Query: 584 EARTMVL 590
            + T  L
Sbjct: 423 ISETRFL 429


>gi|302788810|ref|XP_002976174.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
 gi|300156450|gb|EFJ23079.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
          Length = 1089

 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 122/557 (21%), Positives = 216/557 (38%), Gaps = 121/557 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+    ++A  L+ +    ++G + +L +    G      +D + ++ E  K  VL++D 
Sbjct: 39  RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  VDP+ R  G+ +Y GL  +I   ++
Sbjct: 95  ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H              + P  W   NL + A  L+ VP+P+GGV+++G  T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WLQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y+S SA  A+ +            + ++   V+ D +          LLS  TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     +   S ++ + N + ++GS  GDS L++            
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKLNA--------- 330

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
                    + D+      R S  + L+  VN G  + L           Q  T S A +
Sbjct: 331 ---------QVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  ASA            EL G KG+W++             
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
             A   D +  +L++S   E R + +   D L E TE   +  + +T+   N     ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462

Query: 625 QVFERGARILDGSYMTQ 641
           QV     R++D +   Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479


>gi|380494933|emb|CCF32776.1| cft-1, partial [Colletotrichum higginsianum]
          Length = 542

 Score = 67.4 bits (163), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 60/251 (23%), Positives = 106/251 (42%), Gaps = 27/251 (10%)

Query: 889  NLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
            N   +++P++    E  E P   P +      NI+G+   FL G+ P   +   +  +  
Sbjct: 1    NPTIAKSPVEVADDEANEQPRFVPLRPCA---NINGYSTVFLPGASPSLIV---KSAKSS 54

Query: 947  PQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKA 1003
            P+   L    +   +  H   C  GFIY  S+G  ++ QLP+ S +       + IP+  
Sbjct: 55   PKVVGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNFAELGVSVRKIPIGD 114

Query: 1004 TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEE 1063
                I Y      Y +  S+      ++   L  D +   +      +S  L      E 
Sbjct: 115  AVGLIAYHPPMETYAVACSI------SEHFELPKDDDYHKEWAKETTTSYPL-----TER 163

Query: 1064 YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGE 1122
              V+++ P      W    T+ ++  E A+ ++ + L  +  TKE   L+ IGTA  +GE
Sbjct: 164  GIVKLMSPTT----WSVIDTVELEPHEVAMCMKTLHLEVSEETKERRMLITIGTAINRGE 219

Query: 1123 DVAARGRVLLF 1133
            D+  RGR+L++
Sbjct: 220  DLPIRGRILVY 230


>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
 gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
           pigmentation protein 1; AltName: Full=UV-damaged
           DNA-binding protein 1
 gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
          Length = 1090

 Score = 67.4 bits (163), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 113/507 (22%), Positives = 196/507 (38%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               +   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EASEVITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    I   W+  NL + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S SA  A+ +            + R+   V+ D +          LL    G 
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L LL + ++   V  L +     + + S I+ + N+  F+GS  GDS LV+         
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL------ 330

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                           P TK    S  + L+  VN   +  +       +   +  T S 
Sbjct: 331 ---------------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++          
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                +A DD Y  +L++S  + T VL
Sbjct: 406 ----RSATDDPYDTFLVVSFISETRVL 428


>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1088

 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 112/513 (21%), Positives = 203/513 (39%), Gaps = 135/513 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    L+G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQTILDVPLYGRIATLELFRPHG----EAQDFLFVATERYKFCVLQWD- 93

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
                       +ES E +    G  S         G +  +DP  R  G+ +Y GL  +
Sbjct: 94  ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKV 141

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I   ++G                F+ R+E   V++++           F++G  +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK+   +   WS  NL + A  L+ VPSP+ 
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKEKDFVEGPWSQNNLDNGADLLIPVPSPLC 227

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S +A  A+ +            + ++   V+LD +          LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
              +G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++   
Sbjct: 271 GDHSGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           +++ DA        S  + L+  VN   +  +       +   +
Sbjct: 329 ----------------NLQPDATG------SYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428


>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1089

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 122/556 (21%), Positives = 214/556 (38%), Gaps = 119/556 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A+ L+ +    L+G + +L +    G      +D + ++FE  K  VL++D 
Sbjct: 39  RIEIHLLTASGLQSMLDVPLYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
              G  IT      S      + GR +   G +  VDP  R  G+ +Y     ++     
Sbjct: 95  ET-GSPITRAMGDVSD-----RTGRPT-DNGQIGIVDPDCRLIGLHLYDGMFKVIPIDNK 147

Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
           G               F+ R+E   V++++           F++G   P + +L++    
Sbjct: 148 GQ----------LKEAFNIRLEELQVLDIK-----------FLYGCANPTIAVLYQDNKD 186

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
                    H              + P  W   NL + A  L+ VP P+GG +++G  TI
Sbjct: 187 -------ARHVKTYEVNLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQTI 237

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
            Y++ S   A+ +            + ++   V+ D +          LLS   G L LL
Sbjct: 238 VYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYLL 280

Query: 390 TVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
            + +D   V  L++     +   S ++ + N + F+GS  GDS L++             
Sbjct: 281 VISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL------------ 328

Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRD 507
                 + +AD      ++ S  + L+  VN G  + L           Q  T S A +D
Sbjct: 329 ------NHQAD------VKGSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAFKD 376

Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
                G L+    G+ IN  AS            VEL G KG+W++   SS         
Sbjct: 377 -----GSLRIVRNGIGINEQAS------------VELQGIKGMWSLRASSS--------- 410

Query: 568 MAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
                D Y  +L++S   E R + + T D L E TE   +  + +T+   N     +++Q
Sbjct: 411 -----DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCHNAV-HDQLVQ 463

Query: 626 VFERGARILDGSYMTQ 641
           V     R+++     Q
Sbjct: 464 VTAGSLRLVNAKTRKQ 479


>gi|357135348|ref|XP_003569272.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
           distachyon]
          Length = 1074

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 135/605 (22%), Positives = 240/605 (39%), Gaps = 141/605 (23%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   N +EIY++  Q                          L+L+    L+G + +L
Sbjct: 31  NLIVAKCNRMEIYLLTPQ-------------------------GLQLMVDVPLYGTIATL 65

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            +        S  +D + ++ E  +  VL +D     L   S    +  +++    GR +
Sbjct: 66  ELFRS----RSETQDFLFISMERYRCIVLHWDGRNSELITRSGG--DVSDFI----GRPT 115

Query: 177 FARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
              G +  +DPQ R  G+ +Y GL  +I                F + G           
Sbjct: 116 -DNGQIGVIDPQNRLIGLSLYDGLFKVI---------------PFDNKGNLK------EA 153

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           +N+R L    V D  F++G   P +V+LH+           +H      AL     ++  
Sbjct: 154 LNIR-LQEFLVLDIKFLYGCARPTVVVLHQ------DNKDSRHVKTYEVALEDKDFVEGS 206

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
              WS  NL + A+  L +P P+GGV+++G +TI Y S +   AL++          Q +
Sbjct: 207 ---WSQSNLDNSAH--LLIPVPLGGVIIIGEHTIVYCSATTFKALSIK---------QSI 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
            R+   V+ D +   +  N        TG L L+ + ++   V  L       + + S I
Sbjct: 253 IRAVGRVDPDGSRYLYGDN--------TGALHLIVITHEWGRVTDLKTHYMGETSIASTI 304

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
           + + + L ++GSR GDS L++                   +I+ADA +      S  + L
Sbjct: 305 SYLDSGLVYIGSRFGDSQLIKL------------------NIQADASA------SFVEIL 340

Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
           +  +N   +  +           +  + +        G  KD S    I A  +   I+ 
Sbjct: 341 EQFMNTGPIVDFCVVDTERRGQGQVITCS--------GAYKDGS----IRAVRNGVVITD 388

Query: 536 QSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
           Q++   VEL G KG+W++  KSS     D+  +  + +E H +L +++E     LE  D+
Sbjct: 389 QAS---VELRGMKGLWSM--KSSLNDPYDTFLVVTFINETH-FLAMNMENE---LEEVDI 439

Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGS 654
               +E+       +T+A G+     ++IQV  R  R++   S    D  F P+      
Sbjct: 440 KGFDSET-------QTLACGSAI-HNQLIQVTSRSVRLVSSVSLELLDQWFAPARFSVNV 491

Query: 655 GSENS 659
            + N+
Sbjct: 492 AAANA 496


>gi|219109892|ref|XP_002176699.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411234|gb|EEC51162.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1678

 Score = 65.9 bits (159), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 82/333 (24%), Positives = 135/333 (40%), Gaps = 79/333 (23%)

Query: 251 FVHGYIEPVMVILHE--RELTWAGRVSWKHHTC-----MISALSISTTLKQHPLIWSAMN 303
           F+ GY+EPV+V+LH       W+GR+  +          ++ALSIS    +  ++WS + 
Sbjct: 243 FLSGYLEPVLVLLHSDVEGPVWSGRLGRERGVAGAPPLFVTALSISVVHGRTAVLWSQV- 301

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSS------QELP 356
           +  DA K+L+      G LVVGANT +          +A+N +A S   +      Q  P
Sbjct: 302 VSADATKILSFGKT--GCLVVGANTLVILEIGKVQQVIAMNGWARSTCPAALQTALQANP 359

Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG--------------------- 395
               +++LD    TWL    A+++ +TG L +L    D                      
Sbjct: 360 VVKLAIQLDGCCVTWLSEHSAIMALRTGQLYVLQRTDDRWAVMPLGQTLGAVGEVAHLAS 419

Query: 396 ------RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
                 R ++++ + +   S +   +        F GSR GDSL + +      +M  + 
Sbjct: 420 LPIGGLRWLEKMKMDENKASEMQMGV-------LFAGSRTGDSLFLGYAL-EIVTMPWAA 471

Query: 450 LKEE---FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS----------ASNNTES 496
           +K E   F + E    S        ++ L  ++  EE +LYG+           S   E+
Sbjct: 472 IKSEGQTFINFEGSELSKVATTAPIANGLDRILQLEEEALYGTDRSTPLHIVRDSEEEET 531

Query: 497 AQ--------KTFSFAV------RDSLVNIGPL 515
           A         +  +F V       D LVN+GPL
Sbjct: 532 ADIPSDAKRLRPVAFTVVRTIVPLDVLVNLGPL 564


>gi|147779836|emb|CAN63685.1| hypothetical protein VITISV_020449 [Vitis vinifera]
          Length = 64

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/47 (74%), Positives = 42/47 (89%)

Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
           +PRSSFSVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGR+   L
Sbjct: 1   MPRSSFSVELDAANATWLSNDVAMLSTKTGELLLLTLXYDGRLFTDL 47


>gi|413946716|gb|AFW79365.1| hypothetical protein ZEAMMB73_562969 [Zea mays]
          Length = 1089

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 151/367 (41%), Gaps = 83/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F+HG  +P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLHGCAKPTIVVLYQ------DNKDVRHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  WS  N+ + A  L+ VP+P+GGV+++G   I Y + +++        
Sbjct: 197 LK-DKDFVEGP--WSQNNVDNGAGLLIPVPAPLGGVIIIGEEQIVYCNANSTFK------ 247

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
             ++   Q + R+   V+ D +          LL   TG L LL + ++   V  L +  
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSRY--------LLGDNTGILHLLVLTHERERVTGLKIEY 297

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N + ++GSR GDS LV+                   +++ADA    
Sbjct: 298 LGETSIASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADASG-- 337

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
               S  + L+  VN   +  +     + +   +  T S A +D     G L+    G+ 
Sbjct: 338 ----SFVEILERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIG 388

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++  KSS             +D +  YL++S 
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------INDPFDMYLVVSF 422

Query: 584 EARTMVL 590
            + T  L
Sbjct: 423 ISETRFL 429


>gi|157128864|ref|XP_001655231.1| DNA repair protein xp-e [Aedes aegypti]
 gi|108882186|gb|EAT46411.1| AAEL002407-PB [Aedes aegypti]
          Length = 1138

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D DT              H +  
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154

Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
             L M+  HV+D  F++G   P ++++H+        ++ +H    I    I+   K   
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
            I W   N+  +A  L+ VP+P+GG +V+G  ++ YH   +  A+A        +N YA 
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                      + S  L        +N   LLS K   + LL  +             T 
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
           P      IT + N + F+GSR GDS LV+    +G +     + E F ++   AP     
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
                  L+    G+ ++  GS                       G L+    G+ I   
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           A             ++LPG KG+W +             R+   D  Y   L++S    T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
            +L  +    E TE   +    +T    N+ FG  ++IQV    AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480


>gi|449488592|ref|XP_004158102.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
           [Cucumis sativus]
          Length = 570

 Score = 65.1 bits (157), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 108/504 (21%), Positives = 194/504 (38%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A  L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR + + G +  +DP  R  G+ +Y GL  +I     
Sbjct: 95  ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVI----- 142

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
                            F  + +     N+R L+   V D  F++G   P +V+L++   
Sbjct: 143 ----------------PFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSRPTIVVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H      +       + P  WS  NL + A  L+ VP P+ GV+++G  T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS LV+            
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|157128866|ref|XP_001655232.1| DNA repair protein xp-e [Aedes aegypti]
 gi|108882187|gb|EAT46412.1| AAEL002407-PA [Aedes aegypti]
          Length = 980

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D DT              H +  
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154

Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
             L M+  HV+D  F++G   P ++++H+        ++ +H    I    I+   K   
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
            I W   N+  +A  L+ VP+P+GG +V+G  ++ YH   +  A+A        +N YA 
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                      + S  L        +N   LLS K   + LL  +             T 
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
           P      IT + N + F+GSR GDS LV+    +G +     + E F ++   AP     
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
                  L+    G+ ++  GS                       G L+    G+ I   
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           A             ++LPG KG+W +             R+   D  Y   L++S    T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
            +L  +    E TE   +    +T    N+ FG  ++IQV    AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480


>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
 gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName:
           Full=UV-damaged DNA-binding protein 1b; Short=DDB1b
 gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
 gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
          Length = 1088

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    L+G + ++ +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
                       +ES E +    G  S         G +  +DP  R  G+ +Y GL  +
Sbjct: 94  ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I   ++G                F+ R+E   V++++           F++G  +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK    +   WS  NL + A  L+ VPSP+ 
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S +A  A+ +            + ++   V+LD +          LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
               G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++   
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           +++ DA      + S  + L+  VN   +  +       +   +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428


>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine
           max]
          Length = 1089

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 110/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ETAELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL     L + P  WS  NL + A  L+ VP P+ GVL++G  T
Sbjct: 183 ---DNKDARHVKTYEVALKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + ++   V+ D +          LL   TG L L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLLSL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS L++            
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NLQPDA------KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
          Length = 1088

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    L+G + ++ +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
                       +ES E +    G  S         G +  +DP  R  G+ +Y GL  +
Sbjct: 94  ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I   ++G                F+ R+E   V++++           F++G  +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK    +   WS  NL + A  L+ VPSP+ 
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S +A  A+ +            + ++   V+LD +          LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
               G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++   
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           +++ DA      + S  + L+  VN   +  +       +   +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428


>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine
           max]
          Length = 1089

 Score = 64.3 bits (155), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 108/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ETGELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP P+ GVL++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + ++   V+ D +          LL   TG + L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLVSL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L ++++   V  L +     + + S I+ + N+  ++GS  GDS L++            
Sbjct: 280 LVIIHEKEKVTGLKIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
 gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
          Length = 1140

 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 114/267 (42%), Gaps = 51/267 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+++Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D     +      LL    G L +L +       G  V+ + + K     +   
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
           IT + N   ++G+R GDS LV+ +  S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339


>gi|357623954|gb|EHJ74904.1| putative DNA repair protein xp-e [Danaus plexippus]
          Length = 1128

 Score = 63.5 bits (153), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 102/460 (22%), Positives = 176/460 (38%), Gaps = 107/460 (23%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G L  +DPQ R  G+ +Y     I+   +  + L             S R+E    +N+ 
Sbjct: 120 GILAVIDPQARVIGLRLYDGLFKIIPLDKDSTEL----------KAASLRLEE---LNVY 166

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
           DL+        F+HG   P ++++H+        ++ +H    I    I+   K+   I 
Sbjct: 167 DLE--------FLHGCSNPTLILIHQD-------LNGRH----IKTHEINLRDKEFMKIP 207

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSP+GG +V+G  +I YH   +  A+A       ++        
Sbjct: 208 WKQDNVETEASILIPVPSPLGGAIVIGQESIVYHDGQSYVAVAPPQIKTPINC------- 260

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----VVQRLDLSKTNPSVLTSD 414
                +D     +L  D+A      G L +L +    R     V+ L +       +   
Sbjct: 261 --YCRVDVRGLRYLLGDIA------GRLFMLLLELSERDGTASVRDLKVELLGDIPIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           +T + N + F+GSRLGDS LV+                    +  DA    +   + + +
Sbjct: 313 MTYLDNGVVFVGSRLGDSALVRLAA-----------------VRDDASQYVQPMETFT-S 354

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L     N   +    F          +G L+    G+ I   AS     
Sbjct: 355 LAPIVDMCVVDLERQGQNQLITCSGAF---------KMGSLRIIRNGIGIQEQAS----- 400

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +    + G              +H  L++S   +T VL    
Sbjct: 401 -------IDLPGIKGMWAL----TLGQGP-----------HHDTLVLSFVGQTRVLTLNG 438

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
              E TE   +    +T   GN+    ++IQV + G R++
Sbjct: 439 EEVEETEIKGFVSDRQTFFTGNVC-HDQLIQVTDEGIRLI 477


>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
 gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
          Length = 1140

 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 113/267 (42%), Gaps = 51/267 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+++Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELSVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D     +      LL    G L +L +       G  V+ + + K     +   
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
           IT + N   ++G+R GDS LV+ +  S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339


>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
          Length = 1093

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 107/504 (21%), Positives = 197/504 (39%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A  L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR + + G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G   P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSRPTIVVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H      +       + P  WS  NL + A  L+ VP P+ GV+++G  T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS LV+            
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|167998730|ref|XP_001752071.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697169|gb|EDQ83506.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 172

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 8/96 (8%)

Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
           E+ A+ ++  A    RR S+ +        +L F +     R   +HCFE PE+ +L R 
Sbjct: 28  ETAALRTEAAAPGIHRRPSLTMRLR----IILAFTEC----RCLLIHCFEYPEYQYLNRS 79

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
           RE FA    V+ D  GRC  VL+Y  Q++ LKA  G
Sbjct: 80  RERFAMDLSVRADLVGRCASVLIYNSQLVTLKAGHG 115


>gi|340059653|emb|CCC54046.1| putative mitochondrial carrier protein [Trypanosoma vivax Y486]
          Length = 1481

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 61/231 (26%), Positives = 103/231 (44%), Gaps = 41/231 (17%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMISALSISTTL 292
           +++V+D  F+    EP++ IL ER+ TWAGRV    W+         T  ++ + IS ++
Sbjct: 268 IRYVRDLQFIGSSGEPLLAILCERQPTWAGRVKLVEWRTKVVESNTLTMHVTWVQISASM 327

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHYHSQSASCALALNNY- 345
             HP   LI     +P++   +L V      + GV+  G N I + +         N+  
Sbjct: 328 TAHPKLLLIGEVEGVPYNVTHMLPVEPFSQTMSGVVCFGTNVIMHITTKRGYGAYFNDTG 387

Query: 346 ----------AVS----------LDSSQELPRSSFSVELDAAHAT--WLQNDVALLSTKT 383
                     AVS          LD S  L R + S+   AA +    + +++ +L+   
Sbjct: 388 REECINSKFSAVSFGKAVWSDPQLDKSSALARVNMSLANCAATSMVGKMGDELQVLALLE 447

Query: 384 GDLVLLTV--VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
            D V++T+  V  G  V+ + ++        S ++ IG  L FLGS +GDS
Sbjct: 448 EDGVVITLHFVARGSSVEEVRITMLGSGCYCSSVSRIGRQLVFLGSTVGDS 498


>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
 gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
          Length = 1102

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 85/370 (22%), Positives = 150/370 (40%), Gaps = 90/370 (24%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P + +L++           +H       
Sbjct: 158 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCTKPTIAVLYQ------DNKDARH------V 204

Query: 286 LSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
            +   +LK    +   WS  NL + A  L+ VPSP+ GVL++G  TI Y S +A  A+ +
Sbjct: 205 KTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSANAFKAIPI 264

Query: 343 NNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
                       + ++   V+LD +          LL    G + LL + ++   V  L 
Sbjct: 265 R---------PSITKAYGRVDLDGSR--------YLLGDHAGLIHLLVITHEKEKVTGLK 307

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
           +     + + S I+ + N++ F+GS  GDS L++                   +++ DA 
Sbjct: 308 IELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL------------------NLQPDA- 348

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSY 520
                + S  + L+  VN   +  +       +   +  T S A +D     G L+    
Sbjct: 349 -----KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRN 398

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
           G+ IN  AS            VEL G KG+W++  KSS             D+ +  +L+
Sbjct: 399 GIGINEQAS------------VELQGIKGMWSL--KSS------------IDEAFDTFLV 432

Query: 581 ISLEARTMVL 590
           +S  + T +L
Sbjct: 433 VSFISETRIL 442


>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine
           max]
          Length = 1068

 Score = 61.6 bits (148), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 85/367 (23%), Positives = 148/367 (40%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L     L + P  WS  NL + A  L+ VP P+ GVL++G  TI Y S +A  A+ +   
Sbjct: 176 LKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL   TG L LL + ++   V  L +  
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLLSLLVITHEKEKVTGLKIEP 275

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                   +++ DA    
Sbjct: 276 LGETSIASTISYLDNAFVYIGSSYGDSQLIKL------------------NLQPDA---- 313

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 314 --KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400

Query: 584 EARTMVL 590
            + T +L
Sbjct: 401 ISETRIL 407


>gi|410079681|ref|XP_003957421.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
 gi|372464007|emb|CCF58286.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
          Length = 1350

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 144/669 (21%), Positives = 270/669 (40%), Gaps = 133/669 (19%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L   ++  G +  +A+L +  A      D ++L    AKIS+++FD   + +   S+H
Sbjct: 48  LFLTNEFKFDGRITDIALLPRQDA----ALDYLLLCTAVAKISIVKFDLESNSIETVSLH 103

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC------GGVLVYGLQMIILKASQGGSGLV 214
            +E  ++  L        R   +++DP  RC        + V    M   +      G  
Sbjct: 104 YYED-KFKDLSLAE--LTRESKLRLDPASRCLVLFNEDNIAVLPFVMKEDEEDDDEEGEE 160

Query: 215 GDEDTFGSG-GGFSARIES-----SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
            DEDT+      F A I       S +++ + +  D++++ D  F++ Y +P + IL++ 
Sbjct: 161 EDEDTYEPRIKRFRANINGRVTFPSTILSAKTIHEDIQNIIDIEFLNNYSKPTVAILYQP 220

Query: 267 ELTWAGRVSWKH--HTCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
           +LTW G +         +I  L  +T    T   H +I     LP D ++L+ V +   G
Sbjct: 221 KLTWVGNLQLHPLPTKLLIVTLECNTNGFETSLSHIVIARLNELPWDWHRLIPVTN---G 277

Query: 321 VLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQND 375
           +++VG N + Y   +      + LN++A      + L +S     S E    + + +++ 
Sbjct: 278 IVIVGINELAYVDNTGVLQTVILLNSFA-----DRNLKKSRIIDHSKEESVFNHSAMKH- 331

Query: 376 VALLSTKTGD---------------LVLLTVVYDGRVVQRLDLSK--------------T 406
           + +L T  G+               L  + ++ +GR++ + D+ K              T
Sbjct: 332 ICILKTTDGNEDDADLLLLMDDRSNLYYVQMISEGRLMTQFDIIKLPIINNIFINNLNPT 391

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
           + S L S  + +   LFF G + GD+    F C       +   ++E  D+  D PS   
Sbjct: 392 SISRLDSSSSRVNLDLFF-GFQSGDA----FVCRLNNIKSAVETRKEHKDV-LDYPS--- 442

Query: 467 LRRSSSDALQDMVNGEEL----SLYGSASNNTESAQ-------------KTFSFAVRDSL 509
               ++D   +  +G +L     LY   + +T+ A              + F  A+  SL
Sbjct: 443 ----NADEYDE--DGADLYGDDDLYSDEATSTQRANSKENGRSNMIETVEPFDIALLSSL 496

Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADS 565
            NIGPL   + G     D +  G+S  +N EL  +     G     T    S R     +
Sbjct: 497 NNIGPLTSLTSGKVSAVDQNNKGLSNPNNNELSIVATSGNGTGSHLTAVLPSVRPEIELA 556

Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADL------LTEVTESVDYFVQGR-----TIAA 614
            +  +    ++    +  + +   L T D       + E+  +     +GR     T  +
Sbjct: 557 LKFISITQIWN----LKFKGKDKFLVTTDSTKSKSDIYEIDNNFALHREGRLRRDATTVS 612

Query: 615 GNLFGR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
             +FG  +R++QV      +LD ++               +   +  V+ VS+ DPY+L+
Sbjct: 613 IAMFGSDKRIVQVTTNHLYLLDTTF-----------RRLNTIKFDYEVVHVSVMDPYILI 661

Query: 674 GMSDGSIRL 682
            +S G I++
Sbjct: 662 TVSRGDIKV 670


>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
 gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
          Length = 1140

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 65/263 (24%), Positives = 114/263 (43%), Gaps = 49/263 (18%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       +LR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATSLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
            +D   V D  F+HG + P ++++H+           +H  C    L     +K   L W
Sbjct: 159 -MDELIVYDVEFLHGCLNPTVIVIHKDN-------DGRHVKCHEINLRDKEFMK---LAW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   + 
Sbjct: 208 KQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRV--VQRLDLSKTNPSVLTSDI 415
           ++  +D+    +      LL    G L +L +  +  G+V  V+ + + +     +   I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGINETGKVPTVKDIKVEQLGEISIPECI 313

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
           T + N   ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336


>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine
           max]
          Length = 1068

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 83/367 (22%), Positives = 148/367 (40%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  WS  NL + A  L+ VP P+ GVL++G  TI Y S +A  A+ +   
Sbjct: 176 LK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL   TG + LL ++++   V  L +  
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLVSLLVIIHEKEKVTGLKIEP 275

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                   +++ DA    
Sbjct: 276 LGETSIASTISYLDNAFVYVGSSYGDSQLIKL------------------NLQPDA---- 313

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 314 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400

Query: 584 EARTMVL 590
            + T +L
Sbjct: 401 ISETRIL 407


>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
 gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
          Length = 1140

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 113/264 (42%), Gaps = 51/264 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKEASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLITVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D+    +      LL    G L +L +       G  V+ + + +     +   
Sbjct: 260 CYA-RVDSKGFRY------LLGNMDGQLYMLFLGTSETSKGITVKDIKVEQLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
           IT + N   ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336


>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
 gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
          Length = 1088

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 109/507 (21%), Positives = 193/507 (38%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ ++ ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEINLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ETSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+HG  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLHGCSKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +    LK    I   WS  NL + A  L+ VP P  GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVALKDKDFIEGPWSQNNLDNGADLLIPVPPPFCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S +   A+ +            + ++   V+ D +          LL    G 
Sbjct: 234 EETIVYCSANVFRAIPIR---------PSITKAYGRVDADGSR--------YLLGDHAGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L LL + ++   V  L +     + + S I+ + N+  F+GS  GDS LV+         
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA  T        + L   VN   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDAKGT------YVEVLDRYVNLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++          
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                +  DD +  +L++S  + T +L
Sbjct: 406 ----RSLTDDPFDTFLVVSFISETRIL 428


>gi|345498295|ref|XP_001607743.2| PREDICTED: DNA damage-binding protein 1-like [Nasonia vitripennis]
          Length = 1140

 Score = 60.8 bits (146), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 90/420 (21%), Positives = 160/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D ++V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MDEQNVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +         Y   +    +    S
Sbjct: 211 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTT--------YVTVVPPIIKQSTIS 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L +  D +     V++ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDLA------GHLFMLFLEQDKKADGSMVIKDLKVELLGEVSIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS L++       +       E F ++   AP           A
Sbjct: 317 ITYLDNGVIFIGSRLGDSQLIKLNTKPDENGSYCSTMETFTNL---APIVDM-------A 366

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + D+                +    T S A ++     G L+    G+ I   AS     
Sbjct: 367 VVDL------------ERQGQGQIVTCSGAFKE-----GSLRIIRNGIGIQEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +   S    N                L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWALKVDSVNFDNT---------------LVLSFVGQTRILMLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E TE   +    +T   GN+     +IQ+    AR++     +    + P N  + S
Sbjct: 443 EEVEETEIPGFVADEQTFHTGNV-TNDVIIQITPTSARLISNKSSSVISEWEPDNKRTIS 501


>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1138

 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 105/473 (22%), Positives = 175/473 (36%), Gaps = 129/473 (27%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D DT              H +  
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154

Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
             L M+  HV+D  F++G   P ++++H+        ++ +H    I    I+   K   
Sbjct: 155 TSLRMEEMHVQDVEFLYGTAHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
            I W   N+  +A  L+ VP+P+GG +V+G  ++ YH   +  A+A        +N YA 
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYA- 262

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                           +D+    +      LL    G L ++ +  +     +L +    
Sbjct: 263 ---------------RVDSRGFRY------LLGNMIGHLFMMFLETEENTRGQLTVKDIK 301

Query: 408 PSVL-----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
             +L        IT + N + F+GSR GDS LV+    +  S     + E F ++   AP
Sbjct: 302 VELLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAAASGAYVTVMETFTNL---AP 358

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
                       L+    G+ ++  GS                       G L+    G+
Sbjct: 359 IIDMCIVD----LERQGQGQMITCSGSYKE--------------------GSLRIIRNGI 394

Query: 523 RINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
            I   A             ++LPG KG+W +             R+   D  Y   L++S
Sbjct: 395 GIQEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLS 429

Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
               T +L  +    E TE   +    +T    N+ FG  ++IQV    AR++
Sbjct: 430 FVGHTRILMLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPMTARLI 480


>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
          Length = 1135

 Score = 60.1 bits (144), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 96/418 (22%), Positives = 166/418 (39%), Gaps = 94/418 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
             A+ +     N+R L+   V D  F+ G  +  + +L++ +       + +H    I  
Sbjct: 143 MDAKGQLKDAFNIR-LEELEVLDIQFLSGCPKATIAVLYQDQR------NARH----IKT 191

Query: 286 LSISTTLKQHPL-IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
            +IST  K+     W+ +N+ H+A +L+ VP+P GGVL++G  TI YHS  A   + + N
Sbjct: 192 YTISTRDKEFDTGPWAQLNVEHNASELIPVPAPFGGVLILGHQTICYHSGKAFITIPIQN 251

Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL--VLLTVVYDGRVVQR 400
             +                   A+  W+  D +  L+S  +G L  V+LT       V+ 
Sbjct: 252 TRM------------------CAYG-WVDADGSRLLVSDHSGGLHVVILTPDATNTAVET 292

Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
             +     +   S I+ + N + F+GS  GDS L++                   + E D
Sbjct: 293 AHIEALGETSCASSISYLDNGVVFIGSASGDSQLIKL------------------NPEKD 334

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           A  T      + D L  +++              +    T S   +D     G L+    
Sbjct: 335 AQGTYIQVLETYDNLGPILD----MCVADLDRQGQGQAVTCSGCSKD-----GSLRIIRN 385

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
           G+ IN  A+            +EL G KG+W++     R  N +  +          YL+
Sbjct: 386 GIGINEHAA------------IELAGIKGMWSL-----RPSNTNHDK----------YLV 418

Query: 581 ISLEARTMVL---ETADLLTEVTE-SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
            +  + T VL   E  D   ++ E  +  F +G T+  G   G    +QV +RG  ++
Sbjct: 419 QAFISETRVLAFEEDEDGDHQLAEGEIAGFQEGCTLFCG-CVGGNMAVQVTKRGVVLI 475


>gi|194901554|ref|XP_001980317.1| GG19434 [Drosophila erecta]
 gi|190652020|gb|EDV49275.1| GG19434 [Drosophila erecta]
          Length = 1140

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCMNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
 gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
          Length = 1140

 Score = 59.7 bits (143), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
 gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
           Full=Damage-specific DNA-binding protein 1; AltName:
           Full=Protein piccolo
 gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
           melanogaster]
 gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
 gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
          Length = 1140

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|405970039|gb|EKC34976.1| DNA damage-binding protein 1 [Crassostrea gigas]
          Length = 1160

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/231 (23%), Positives = 101/231 (43%), Gaps = 45/231 (19%)

Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW--------AGRVSW 276
            F+ R+E   VI+++           F+HG   P ++++H+  L             +S+
Sbjct: 154 AFNIRLEELTVIDIQ-----------FLHGCTTPTLILIHQANLNCYHLMTLCITNLLSF 202

Query: 277 KH--HTCMISALSISTTLKQ-HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           K   H   +    IS   K+     W   N+  +A  L+AVP P GG L++G  +I YH 
Sbjct: 203 KQDQHGRHVKTYEISLRDKEFQKGPWKQDNVETEACMLIAVPEPFGGALIIGQESITYHK 262

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                 +A             + +S+ +   ++DA  + +L  D+       G L +L +
Sbjct: 263 GDNFIPIA----------PPAIKQSTLTCYGKVDANGSRYLLGDMM------GRLFMLML 306

Query: 392 VYDGRV-----VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
             + ++     V+ L +     + +   IT + N++ ++GSRLGDS LV+ 
Sbjct: 307 EKEEKMDSTVTVKDLKVELLGETTIAECITYLDNAVVYIGSRLGDSQLVKL 357


>gi|407410979|gb|EKF33219.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi marinkellei]
          Length = 1436

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 63/260 (24%), Positives = 105/260 (40%), Gaps = 54/260 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
           NN  +                   D   E   + F V L  A  T     + N+   + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVSWYDPKMETSTALFKVNLSLASCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S IT IG+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQDIRIAILGTGCYCSGITRIGDQIVFLGSAFGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIE 458
           C +   M  S   + F  IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499



 Score = 47.8 bits (112), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 66/306 (21%), Positives = 120/306 (39%), Gaps = 43/306 (14%)

Query: 912  QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
            +RI  F +I G+ G ++ G  P    W    RE L  +     G +  F     +N    
Sbjct: 910  RRIVPFDSIGGNAGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968

Query: 965  -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
             C  GF+   S      C+ P+G  +     + + I L  TPH + Y        ++ S 
Sbjct: 969  YCCEGFVDFASMDTY--CR-PTGQGW-----LTRRIHLGVTPHFVVYHPPARSCFVVTSK 1020

Query: 1024 -----PVLKPLNQVLSLLIDQEVG--HQIDNH----NLSSVDLH---RTYTVEEYEVRIL 1069
                 P   P +  L+++ D+E G    I       N+  +  +   R    + +E+ ++
Sbjct: 1021 KEPFRPQRAPFDVQLNIVYDEESGGVQSITTEAPVCNMPPIPPNAGIRVPMADRFEICLM 1080

Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE---NETLLAIGTAYVQGEDVAA 1126
                +   W    T+ ++ +E  L  +++ +      E      +  + TA+  GED+ +
Sbjct: 1081 ----STTDWACTDTLLLEENERVLGAQMMEIHCEKDAEGLHTAPVCVVSTAFPLGEDITS 1136

Query: 1127 RGRVLLFSTGRNADNPQNLVLSGS--YGPLFSSVQIDFASHFFAICSNSFVFVFLFSFLR 1184
            RGR+LL ST       + L+       GP  + V I    H  A+     + +F F +  
Sbjct: 1137 RGRILLLSTMCTKKKRKILLFHSEPLNGPATAVVGI---RHHIAVAVGGTIKLFRFDWEN 1193

Query: 1185 SLFIIG 1190
               ++G
Sbjct: 1194 RKLVVG 1199


>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
 gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
          Length = 1171

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 79/349 (22%), Positives = 145/349 (41%), Gaps = 60/349 (17%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A  L+ +    L+G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTAQGLQSILDVPLYGRIATLELFRPHG----ETQDFLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   SM           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELVTRSMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  +L + A  L+ VP P+ GVL++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNSLDNGADLLIPVPPPLCGVLIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +   A+ +            + ++   V+ D +          LL   TG L L
Sbjct: 237 IVYCSANGFKAIPIR---------AAITKAYGRVDPDGSRY--------LLGDHTGLLSL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS L++ 
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL 328


>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
 gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
          Length = 1140

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 112/264 (42%), Gaps = 51/264 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMEKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELMVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D+    +      LL    G L +L +       G  V+ + + +     +   
Sbjct: 260 CYA-RVDSKGLRY------LLGNMHGQLYMLFLGTSESSKGITVKDIKVEQLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
           IT + N   ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336


>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
 gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
          Length = 1033

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 146/362 (40%), Gaps = 85/362 (23%)

Query: 231 ESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           E+S +I    L+   V D  F++G  +P +V+L++           +H      AL    
Sbjct: 95  ETSELIT--RLEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVALK-DK 145

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
              + P  W+  NL + A  L+ VP P+ GVL++G  TI Y S +A  A+ +        
Sbjct: 146 DFGEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR------- 196

Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV 410
               + R+   V+ D +          LL    G L LL + ++   V  L +     + 
Sbjct: 197 --PSITRAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETS 246

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
           + S I+ + N++ ++GS  GDS LV+                   +++ DA      + S
Sbjct: 247 IASTISYLDNAVVYIGSSYGDSQLVKL------------------NLQPDA------KGS 282

Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADA 528
             + L+  VN   +  +       +   +  T S A +D     G L+    G+ IN  A
Sbjct: 283 YVEVLESYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIGINEQA 337

Query: 529 SATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
           S            VEL G KG+W++               ++ DD +  +L++S  + T 
Sbjct: 338 S------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISETR 371

Query: 589 VL 590
           +L
Sbjct: 372 IL 373


>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
 gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
          Length = 1140

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 107/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVMAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTSETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
           queenslandica]
          Length = 1142

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 104/447 (23%), Positives = 165/447 (36%), Gaps = 92/447 (20%)

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           I L DL   ++ D  F+HG   P +  + E      GRV        +    IS   K+ 
Sbjct: 156 IRLEDL---YITDIQFLHGTENPTIAYISEEPSVATGRV--------LKTFVISQRDKEL 204

Query: 296 -PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
            P  W    +   A  L +VPSP  G++VVGA+++ Y           N+ + ++D    
Sbjct: 205 LPGPWKPNTIEGQASLLCSVPSPYNGLIVVGADSVAY----------FNDTSHTVDPIV- 253

Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV------VQRLDLSKTNP 408
           +  S  S      H+ +L  D        G L+ L + +   +      +  + L     
Sbjct: 254 IKESVISCIEPLDHSRYLLGDFR------GRLLTLFLEFSEEMESGMTNIVNMKLEVLGE 307

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
             +   ++ + N + F+GS  GDS LV+         LSS   E  G I           
Sbjct: 308 ISIPHTLSYLDNGVVFVGSTKGDSQLVK---------LSSSPLENGGYI----------- 347

Query: 469 RSSSDALQDMVN-GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
               D L+ M N G  L +    S      Q          L   G L+    G+ IN  
Sbjct: 348 ----DVLESMTNIGPILDM----SVVDLDKQGRDVLVCCSGLGKDGALRIVKSGIGINEA 399

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           AS            ++LPG KGIW++             + A  +DE    ++++   +T
Sbjct: 400 AS------------IDLPGIKGIWSL-------------KCAGREDELDDTVVLTFVGQT 434

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
           M L  A    E TE        +T    N+ G   +IQ+  +  R++D   M     + P
Sbjct: 435 MALRLAGEEVEETELPALVTDQQTFYCSNVTG-NAIIQITTKSVRLMDDKAMELICDWSP 493

Query: 648 SNSE--SGSGSENSTVLSVSIADPYVL 672
            +    S +   +S V+     D Y L
Sbjct: 494 PDGRGISTAACNSSQVMVAVGCDLYYL 520


>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
 gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
          Length = 1114

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 84/197 (42%), Gaps = 22/197 (11%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
           L+  +V D  F+HG   P + +L+E             H           TL+  P  WS
Sbjct: 157 LEELNVVDVKFMHGCATPTICVLYED-------TKEARHVKTYEVDVKEKTLRDGP--WS 207

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
             ++   +  ++ VP+P+GG +VVG + I Y ++                +      ++ 
Sbjct: 208 QSDVEGGSSLIIPVPAPLGGAIVVGESVIVYLNKDGG-------------NGAGGAIATK 254

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
           SV + A           LLS  TG L LL +V+D R V  L L     + + S ++ + N
Sbjct: 255 SVNVMAHGVVDADGSRYLLSDSTGMLHLLVLVHDRRRVHALKLESLGQTSIASTLSYLDN 314

Query: 421 SLFFLGSRLGDSLLVQF 437
            + ++GS  GDS LV+ 
Sbjct: 315 GVVYVGSAYGDSQLVRL 331


>gi|71654693|ref|XP_815961.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
           strain CL Brener]
 gi|50363265|gb|AAT75335.1| cleavage polyadenylation specificity factor CPSF160 [Trypanosoma
           cruzi]
 gi|70881056|gb|EAN94110.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 1436

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 62/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
           NN  +                   D   E   + F V L  A+ T     + N+   + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S I  IG+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
           C +   M  S + + F  IE+
Sbjct: 480 CIAKVDMFHSDVAKRFQIIES 500



 Score = 47.4 bits (111), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 67/307 (21%), Positives = 121/307 (39%), Gaps = 45/307 (14%)

Query: 912  QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
            +RI  F  I G+ G ++ G  P    W    RE L  +     G +  F     +N    
Sbjct: 910  RRIVPFDAIGGNTGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968

Query: 965  -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
             C  GF+   S      C+ P+G  +     + + I L  TPH + Y        ++ S 
Sbjct: 969  YCCEGFVDFASMDTY--CR-PTGQGW-----LTRRIHLGVTPHFVVYHPPARSCFVVTSK 1020

Query: 1024 -----PVLKPLNQVLSLLIDQEVG------HQIDNHNLSSVDLH---RTYTVEEYEVRIL 1069
                 P   P +  L+++ D+E G       +    N+  +  +   R    + +E+R++
Sbjct: 1021 KEPFRPQRAPFDVQLNIVYDEESGGVQSITTEAPVSNMPPIAPNAGIRVPMADRFEIRLM 1080

Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE---NETLLAIGTAYVQGEDVAA 1126
                +   W    T+ ++ +E  L  +++ +      E      +  + TA+  GED+  
Sbjct: 1081 ----STTDWACTDTLLLEENERVLGAQMMEIQCERDAEGLHTAPVCVVSTAFPLGEDITC 1136

Query: 1127 RGRVLLFSTGRNADNPQNLVLSGSY---GPLFSSVQIDFASHFFAICSNSFVFVFLFSFL 1183
            RGR+LL +T       + +VL  S    GP  + V I    H  A+     + +F F + 
Sbjct: 1137 RGRILLLAT-ICTKKKRKIVLFHSEPLNGPATAVVGI---RHHIAVAVGGTIKLFRFDWS 1192

Query: 1184 RSLFIIG 1190
                ++G
Sbjct: 1193 NRKLVVG 1199


>gi|350410909|ref|XP_003489174.1| PREDICTED: DNA damage-binding protein 1-like [Bombus impatiens]
          Length = 1141

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 90/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501


>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
 gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
          Length = 1140

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 66/264 (25%), Positives = 109/264 (41%), Gaps = 51/264 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H       GR    H         I+   K+   I 
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHRDN---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPI GV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPICGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D     +      LL    G L +L    T    G  V+ + + +     +   
Sbjct: 260 CYA-RIDEKGLRY------LLGNMDGQLYMLFLGTTETSKGITVKDIKVEQLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
           IT + N   ++GSR GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGSRHGDSQLVRLS 336


>gi|224000243|ref|XP_002289794.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975002|gb|EED93331.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1820

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 91/410 (22%), Positives = 145/410 (35%), Gaps = 141/410 (34%)

Query: 246 VKDFIFVHGYIEPVMVILHERE-----LTWAGRVSWKHHTCM------------------ 282
           + D  F+ GYIEP +++LH          WAGR+       +                  
Sbjct: 398 IVDIAFLSGYIEPTLLVLHSNPKRGGGRAWAGRLGRTEEVPLSNNGGSGESKDDYGEDID 457

Query: 283 -----------------------ISALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPSPI 318
                                  ++A+S++   ++  ++WS ++ LP DA+KL  VP P 
Sbjct: 458 LEGGDAAKKGPDLVSTGTKYGLSLTAISLAIHQRRSVVLWSLLDALPADAWKL--VPHPS 515

Query: 319 GGVLVVGANTIHYHSQSA--SCALALNNY------------------AVSLDSSQELPRS 358
            GV+V G NT  Y S     SCALA N +                  AV L+ +   P  
Sbjct: 516 DGVIVWGVNTAVYVSMGGKISCALAANGFAKIGCPIGLIPPSGRIGSAVYLEPNPS-PLP 574

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV--------------------------- 391
             +++LD A   ++  DVA++    G L  L +                           
Sbjct: 575 MLALQLDGARVGFVTEDVAIVCLGNGSLYSLELHRAKSMVSPSMFLSMSPLGHRVGGLGV 634

Query: 392 -------------------VYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFFLGSRL 429
                              + D   V+  D +K   +  SV    I + G  L F GSR+
Sbjct: 635 ASCLSVLAMACHSNSVGHFLVDNEGVKDEDHAKETISKESVSGPKIRSRG--LIFAGSRM 692

Query: 430 GDSLLVQFT---------------CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SD 473
           GD  L+ F+                G+G   L     E+   +    P+ K+L++   S 
Sbjct: 693 GDCSLLAFSLNVPIHLVITDVDSETGAGKRKLGGSRPEQLSSMP--EPAQKQLKKEEISP 750

Query: 474 ALQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           +  D  +GEE  +    S   +  +     + +  DSL  +GPL    YG
Sbjct: 751 SRTDSEDGEEDIVCAMSSPRRSVRTLSMFRTVSALDSLTGLGPLGQGCYG 800


>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
 gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
          Length = 1140

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 64/263 (24%), Positives = 107/263 (40%), Gaps = 49/263 (18%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
            +D   V D  F+HG   P ++++H+      GR    H   +     I          W
Sbjct: 159 -MDELTVYDVEFLHGCQNPTVIVIHKDN---DGRHVKSHEINLRDKEFIKVA-------W 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  L+ VPS IGGV+V+G  +I YH  S       N +AV+  + ++   + 
Sbjct: 208 KQDNVETEATMLIPVPSSIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDI 415
           ++  +D+    +      LL    G L +L    T    G  V+ + + +     +   I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGTTETSKGTTVKDIKVEQLGEISIPECI 313

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
           T + N   ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336


>gi|328788389|ref|XP_396048.3| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Apis
           mellifera]
          Length = 1141

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKADGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501


>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
          Length = 1088

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 109/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPMYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+ G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    +   WS  NL + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S +A  A+ +            + ++   V++D +          LL    G 
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+         
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA      + S  + L+  VN   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++  KSS     
Sbjct: 373 AFKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                   D+ +  +L++S  + T VL
Sbjct: 409 -------IDEAFDTFLVVSFISETRVL 428


>gi|380025901|ref|XP_003696702.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
           [Apis florea]
          Length = 1141

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKTDGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501


>gi|307205760|gb|EFN83990.1| DNA damage-binding protein 1 [Harpegnathos saltator]
          Length = 1138

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 50/205 (24%), Positives = 94/205 (45%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYIAVV----------PPIIKQST 257

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +  + +     VV+ L +       + 
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTQVVKDLKVELLGEISIP 311

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + F+GSRLGDS L++ 
Sbjct: 312 ECITYLDNGVIFVGSRLGDSQLIKL 336


>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1156

 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 90/401 (22%), Positives = 154/401 (38%), Gaps = 81/401 (20%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
           V+D  F+HG   P +V+LH+         S   H       +   +LK    +   W   
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHM-----KTYEVSLKDKEFVKGPWKQD 210

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  +A  ++AVP P  G L++G  +I YH+         + Y V    +  L R S  V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259

Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
              ++DA  + +L  D+A      G L +L +  + ++     V+ L L       +   
Sbjct: 260 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 313

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + ++GSRLGDS L++             + E F ++                 
Sbjct: 314 ITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFTNL---------------GP 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + DM                +    T S A ++     G L+    G+ I+  AS     
Sbjct: 359 IVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS----- 401

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +        +    R      E    L++S   +T VL  + 
Sbjct: 402 -------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAERDNTLVLSFVRQTRVLMLSG 454

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
              E TE   +    +T   GN+   +++IQV     R++D
Sbjct: 455 EEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 494


>gi|342186481|emb|CCC95967.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1456

 Score = 57.8 bits (138), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 130/334 (38%), Gaps = 77/334 (23%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
           +++V+D  FV    EP++ +L ER  TWAGRV    W+      + LS       IS  L
Sbjct: 254 LRYVRDLQFVGSSGEPLLGVLCERRPTWAGRVKLVEWRTKAVDTNTLSMQVAWVQISGAL 313

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPSP---IGGVLVVGANTIHYHSQSASCALALNNYA 346
             HP   L+    ++P++   ++ V S      GV+  G NT+ + +      +  N+  
Sbjct: 314 TTHPKLLLVGEVDSVPYNVTHMIPVESSSQTPSGVICFGINTVMHITTKRGYGVYFNSTG 373

Query: 347 V---------------------SLDSSQELPRSSFSVELDAAHATWLQN------DVALL 379
           +                      L+SS  L R +FS  L    AT           +  +
Sbjct: 374 MEECGSNKSSAMSYGKMSWCDAKLESSTALFRVNFS--LANCTATIFSPRSSDSLQILAV 431

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
           S + G + +L  +  G  V  + +S        S +T I ++LFFLGS       V F+C
Sbjct: 432 SEEDGVVAVLEFLSQGANVHDIQISVLASGCYCSSLTPISDNLFFLGS------AVSFSC 485

Query: 440 GSGTSMLSSGLKEEFGDIE--------------------ADAPSTKRLRRSSSDALQDMV 479
            +  +  +SG   +F  +E                    AD  S  R  +S+S  L+D  
Sbjct: 486 IASITPTNSGAIGKFKVVESIEAIGSIRDVDVVDCSNDAADCISGPRGNQSNSSWLEDTP 545

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIG 513
             E       A N T       S A R +++++ 
Sbjct: 546 FAE------LAGNTTLDPMPNLSVAQRRAIMDLA 573



 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 60/279 (21%), Positives = 120/279 (43%), Gaps = 34/279 (12%)

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
            + ++ A + R L+  RT ++  T + T H A  +RI  F  ++G  G ++ G  P + + 
Sbjct: 899  IESIEARKCR-LQRERTMIENDT-QSTRHCA--RRIIPFACVAGQSGAYVCGQHPVFLLW 954

Query: 939  FRERLRV--HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
             + + R+  +     G++  F     +    GFIY   +G +   ++ +    +    + 
Sbjct: 955  DKRKRRIAAYRHQSPGAVRGFVSFPQMA--GGFIYC-CEGFVDFARMNTYCAPNGQGWLT 1011

Query: 997  KVIPLKATPH-----------------QITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQ 1039
            + I + ATPH                 + T+  ++  + + + +   + LN V S+  + 
Sbjct: 1012 RRIAIGATPHFLVYDPPGKSCFVVTSEKKTFRPQRAFFDVQLKIHYDEELNTVQSVTAEP 1071

Query: 1040 EVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVT 1099
             V H    +  + V   R   VE++EVR+L     G  W+      ++ +E  L  + V 
Sbjct: 1072 PVCHMPPINPGAGV---RVPMVEQFEVRLL--STTGEQWECTHKFALEENEKVLGAQAVE 1126

Query: 1100 LFNTTT---KENETLLAIGTAYVQGEDVAARGRVLLFST 1135
            L          +  +  + TA+  GEDV  RGR++L ++
Sbjct: 1127 LRQDEAIAGAPSAPVCVLCTAFPLGEDVTCRGRIILLAS 1165


>gi|50288865|ref|XP_446862.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74609915|sp|Q6FSD2.1|CFT1_CANGA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|49526171|emb|CAG59795.1| unnamed protein product [Candida glabrata]
          Length = 1361

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 132/674 (19%), Positives = 265/674 (39%), Gaps = 122/674 (18%)

Query: 96  ISAASLELVCHYRLHGNVESLAIL---SQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
           I +  L L+  ++L G +  +A++   S G   N      ++L+   AK+S+L +++   
Sbjct: 43  IRSGRLYLMEEHKLSGRINDVALIPKHSNGSNGNGINLSYLLLSTGVAKLSLLMYNNMTS 102

Query: 153 GLRITSMHC----FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
            +   S+H     FES   L L       AR   ++++P G     +++   ++ +    
Sbjct: 103 SIETISLHFYEDKFESATMLDL-------ARNSQLRIEPNGNYA--MLFNNDVLAILPFY 153

Query: 209 GGSGLVGDED----------------TFGSGGGFSARIESSH---VINLRDL--DMKHVK 247
            G     DED                 F    G +   + +H   +IN  +L   +K++K
Sbjct: 154 TGINEDEDEDYINNDKSKINDNSKKSLFKRKKGKTQNNKVTHPSIIINCSELGPQIKNIK 213

Query: 248 DFIFVHGYIEPVMVILHERELTWAGR---VSWKHHTCMIS---ALSISTTLKQHPLIWSA 301
           D  F+ G+ +  + +L++ +L W G    V    +  +IS     SI  T     +I   
Sbjct: 214 DIQFLCGFTKSTIGVLYQPQLAWCGNSQLVPLPTNYAIISLDMKFSIDATTFDKAIISEI 273

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVS-LDSSQELPRS 358
             LP D +    +   + G L++G N I +   +      L LN+Y+   L   + + +S
Sbjct: 274 SQLPSDWH---TIAPTLSGSLILGVNEIAFLDNTGVLQSILTLNSYSDKVLPKVRVIDKS 330

Query: 359 SFSVELDAAHATWL----QNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLS------ 404
           S  V  +      L    +N+ +    LL  + G +  + +  +GR++ + +++      
Sbjct: 331 SHEVFFNTGSKFALIPSNENERSVENILLFDENGCIFNVDLKSEGRLLTQFNITKLPLGE 390

Query: 405 -----KTNP---SVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFG 455
                K+NP   S++ +D   +     F+G + GD+ +++     S   +      +++ 
Sbjct: 391 DVLSQKSNPSSVSIIWAD-GRLDTYTIFIGFQSGDATMLKLNHLHSAIEVEEPTFMKDYV 449

Query: 456 DIEADAPSTKRLRRSS-------SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
           + +A A                 SD   D VN +    +G+  SN   +AQ+        
Sbjct: 450 NKQASAAYNNEDDDDDDDDFNLYSDEENDQVNNKNDRTFGTNESNEPFTAQELM------ 503

Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
            L NIGP+     G   + + +  G+   +  E+        + T  +      NA  + 
Sbjct: 504 ELRNIGPINSMCVGKVSSIEDNVKGLPNPNKQEI------SIVCTSGYGDGSHLNAILAS 557

Query: 568 MAAYDDEYHAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR----- 610
           +    ++   ++ I+      ++ +   L T D       + E+  +     QGR     
Sbjct: 558 VQPRVEKALKFISITKIWNLHIKGKDKFLITTDSTQSQSNIYEIDNNFSQHKQGRLRRDA 617

Query: 611 -TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            TI    +   +R++QV      + D ++               +   +  V+ VS+ DP
Sbjct: 618 TTIHIATIGDNKRIVQVTTNHLYLYDLTF-----------RRFSTIKFDYEVVHVSVMDP 666

Query: 670 YVLLGMSDGSIRLL 683
           YVL+ +S G I++ 
Sbjct: 667 YVLITLSRGDIKVF 680


>gi|407850337|gb|EKG04765.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 1436

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 61/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
           NN  +                   D   E   + F V L  A+ T     + N+   + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S I  +G+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTDCYCSGIARLGDQIVFLGSACGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
           C +   M  S + + F  IE+
Sbjct: 480 CIAKVDMFHSDVAKRFRIIES 500



 Score = 47.8 bits (112), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 70/309 (22%), Positives = 122/309 (39%), Gaps = 49/309 (15%)

Query: 912  QRITIFKNISGHQGFFLSGSRPC---WCMVFRERLRVHPQLCDGSIVAFTVLHNVN---- 964
            +RI  F  I G+ G ++ G  P    W    RE L  +     G +  F     +N    
Sbjct: 910  RRIVPFDAIGGNAGAYVCGQHPLFLFWDRRTRE-LEAYRHQTLGPVRGFVPFRIINSGYI 968

Query: 965  -CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023
             C  GF+   S      C+ P+G  +     + + I L  TPH + Y        ++ S 
Sbjct: 969  YCCEGFVDFASMDTY--CR-PTGQGW-----LTRRIHLGVTPHFVVYHPPARSCFVVTSK 1020

Query: 1024 -----PVLKPLNQVLSLLIDQEVG--HQIDNH----NLSSVDLH---RTYTVEEYEVRIL 1069
                 P   P +  L ++ D+E G    I       N+  +  +   R    + +E+R++
Sbjct: 1021 KEPFRPQRSPFDVQLKIVYDEESGGVQSITTEAPVCNMPPIAPNAGIRVPMADRFEIRLM 1080

Query: 1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETL-----LAIGTAYVQGEDV 1124
                +   W    T+ ++ +E  L  +++ +     K+ E L       + TA+  GED+
Sbjct: 1081 ----STTDWACTDTLLLEENERVLGAQMMEI--QCEKDAEGLHTAPVCVVSTAFPLGEDI 1134

Query: 1125 AARGRVLLFSTGRNADNPQNLVLSGSY---GPLFSSVQIDFASHFFAICSNSFVFVFLFS 1181
              RGR+LL +T       + +VL  S    GP  + V I    H  A+     + +F F 
Sbjct: 1135 TCRGRILLLAT-MCTKKKRKIVLFHSEPLNGPATAVVGI---RHHIAVAVGGTIKLFRFD 1190

Query: 1182 FLRSLFIIG 1190
            +     ++G
Sbjct: 1191 WNNRKLVVG 1199


>gi|383863765|ref|XP_003707350.1| PREDICTED: DNA damage-binding protein 1-like [Megachile rotundata]
          Length = 1138

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 95/205 (46%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 211 RQDNVEREATMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 260

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DG-RVVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +      DG +VV+ L +       + 
Sbjct: 261 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKNPDGTQVVKDLKVELLGEISIP 314

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + F+GSRLGDS L++ 
Sbjct: 315 ECITYLDNGVIFVGSRLGDSQLIKL 339


>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1067

 Score = 57.4 bits (137), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 100/453 (22%), Positives = 178/453 (39%), Gaps = 108/453 (23%)

Query: 147 FDDSIHGLRITSMHCF----ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            D  I+G RI ++  F    E+ ++L +   R  F    +++ DP+             +
Sbjct: 54  LDVPIYG-RIATLELFRPHGEAQDFLFIATERYKFC---VLQWDPES----------SEL 99

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I +A    S  +G     G    F  + +     N+R L+   V D  F+ G  +P + +
Sbjct: 100 ITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLFGCAKPTIAV 158

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK    +   WS  +L + A  L+ VP P+ 
Sbjct: 159 LYQ------DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLC 206

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S SA  A+ +            + ++   V++D +          LL
Sbjct: 207 GVLIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLL 249

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
               G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+   
Sbjct: 250 GDHAGMIHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-- 307

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           ++  DA      + S  + L+  +N   +  +       +   +
Sbjct: 308 ----------------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQ 345

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 346 VVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KS 386

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 387 S------------IDEAFDTFLVVSFISETRIL 407


>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
          Length = 1068

 Score = 57.0 bits (136), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 88/391 (22%), Positives = 153/391 (39%), Gaps = 84/391 (21%)

Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMV 261
           +I +A    S  +G     G    F  + +     N+R L+   V D  F++G  +P +V
Sbjct: 99  VITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIV 157

Query: 262 ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
           +L++           +H      AL       + P  W+  NL + A  L+ VP P+ GV
Sbjct: 158 VLYQ------DNKDARHVKTYEVALK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGV 208

Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
           L++G  TI Y S SA  A+ +            + ++   V+ D +          LL  
Sbjct: 209 LIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDADGSR--------YLLGD 251

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
             G L LL + ++   V  L +     + + S I+ + N+  ++GS  GDS L++     
Sbjct: 252 HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKI---- 307

Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK-- 499
                          ++ DA      + S  + L+  VN   +  +       +   +  
Sbjct: 308 --------------HLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVV 347

Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSR 559
           T S A +D     G L+    G+ IN  AS            VEL G KG+W++      
Sbjct: 348 TCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL------ 384

Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
                    ++ DD +  +L++S  + T +L
Sbjct: 385 --------RSSTDDPHDTFLVVSFISETRIL 407


>gi|297740793|emb|CBI30975.3| unnamed protein product [Vitis vinifera]
          Length = 1043

 Score = 57.0 bits (136), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  W+  NL + A  L+ VP P+ GVL++G  TI Y S SA  A+ +   
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL    G L LL + ++   V  L +  
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                    ++ DA    
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421

Query: 584 EARTMVL 590
            + T +L
Sbjct: 422 ISETRIL 428


>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
          Length = 1089

 Score = 57.0 bits (136), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  W+  NL + A  L+ VP P+ GVL++G  TI Y S SA  A+ +   
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL    G L LL + ++   V  L +  
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                    ++ DA    
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421

Query: 584 EARTMVL 590
            + T +L
Sbjct: 422 ISETRIL 428


>gi|301124447|ref|XP_002909707.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262106897|gb|EEY64949.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 328

 Score = 57.0 bits (136), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 65/281 (23%), Positives = 109/281 (38%), Gaps = 75/281 (26%)

Query: 925  GFFLSGSRPCWCMVFRERLRVHPQLCDGS-------------------IVAFTVLHNVNC 965
            G F  G+ P W +  R      P     S                   +++FT  H+ +C
Sbjct: 3    GAFFRGAHPMWILGDRGHASFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSC 62

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWP-----VQKVIPLKATPHQITYFA-------- 1012
             +GFIY  S+G L++C+LPS  T     P     V +     AT H + Y          
Sbjct: 63   PNGFIYFHSRGALRVCELPSSKT-STILPSSGGFVLQKAEFGATLHHMLYLGSHGPGGVA 121

Query: 1013 ---EKNLYPLIVSVPVLKPLNQVLSLL------------IDQEVGHQIDNHNLSSVDL-- 1055
               E   Y ++ S   LKP +   +              +D        N    + ++  
Sbjct: 122  EALEAPTYAVVCSA-RLKPADADRATEVEGAEEELEPENLDPNGNPLGSNVMAPTAEMFA 180

Query: 1056 -----HRTYTVEE-YEVRILEPDRAGGPWQTRAT--IPMQSSENALTVRVVTLFNTT--- 1104
                 H  +T E+ YE+R+++ D   G W  R    +  +  E  L+V+++ L++++   
Sbjct: 181  DYETDHMAHTEEDVYELRLVQTDEF-GEWGRRGVFRVHFERYEVVLSVKLMYLYDSSLMK 239

Query: 1105 ----------TKENETLLAIGTAYV--QGEDVAARGRVLLF 1133
                       K+    L +GT +V   GED + RGR+LL+
Sbjct: 240  EEVASTSPEWNKKKRPYLVVGTGWVGPHGEDESGRGRLLLY 280


>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
 gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
          Length = 1140

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 92/207 (44%), Gaps = 33/207 (15%)

Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
           NLR +D  +V D  F+HG + P ++++H+      GR    H         I+   K+  
Sbjct: 156 NLR-MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFM 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
            I W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+       
Sbjct: 204 KIAWKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA------- 249

Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSV 410
              +F       +A    N +  LL    G L +L +       G  V+ + + +     
Sbjct: 250 -PLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEIS 308

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
           +   IT + N   ++G+R GDS LV+ 
Sbjct: 309 IPECITYLDNGFLYIGARHGDSQLVRL 335


>gi|307186138|gb|EFN71863.1| DNA damage-binding protein 1 [Camponotus floridanus]
          Length = 1136

 Score = 56.6 bits (135), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    I+   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEINLREKEFSKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +  + +     VV+ L +       + 
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLELEKKPDGTQVVKDLKVELLGEISIP 311

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + ++GSRLGDS L++ 
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336


>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1088

 Score = 56.6 bits (135), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+ G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    +   WS  NL + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S +A  A+ +            + ++   V++D +          LL    G 
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+         
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA      + S  + L+  +N   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++  KSS     
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                   D+ +  +L++S  + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428


>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
          Length = 1158

 Score = 56.6 bits (135), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 89/395 (22%), Positives = 149/395 (37%), Gaps = 83/395 (21%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-WSAMNL 304
           ++D  F++G   P +VI+H+             H   +    IS   K+     W   N+
Sbjct: 164 IQDIAFLYGCANPTVVIIHQ-----------DAHGRHVKTREISLRDKEFAKTSWKQDNV 212

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
             +A  LL VP P GG L++G  +I YH+          NY        +    +   ++
Sbjct: 213 ETEAAMLLPVPEPYGGALIIGQESITYHNG--------QNYVTIAPPIIKQSTVTCYGKV 264

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDITTIG 419
           D   + +L  D+A      G L +L +      DG V V+ + +       +   +T + 
Sbjct: 265 DPNGSRYLLGDLA------GHLFMLVLEKEEKMDGTVTVRDIKIELLGEVSIPECLTYLD 318

Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
           N + F+GSR GDS LV+       +     + E F ++   AP            + DM 
Sbjct: 319 NGVVFIGSRFGDSQLVKLNVTPDDNNSYVTVMETFTNL---AP------------IVDMT 363

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
                          +    T S A ++     G L+    G+ I+  AS          
Sbjct: 364 -------IVDLDRQGQGQLVTCSGAYKE-----GSLRIIRNGIGIHEQAS---------- 401

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
             ++LPG KGIW +   SS   + D +            +++S   +T VL       E 
Sbjct: 402 --IDLPGIKGIWALKMGSSGNPSVDDT------------VVLSFVGQTRVLMLNGEEMEE 447

Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           TE        +T   GN+ G+  V+Q+     R++
Sbjct: 448 TEIPGLTADQQTFFCGNV-GKDSVLQITTGSVRLI 481


>gi|358338734|dbj|GAA31211.2| DNA damage-binding protein 1, partial [Clonorchis sinensis]
          Length = 1515

 Score = 56.6 bits (135), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 111/266 (41%), Gaps = 37/266 (13%)

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD--EDTFGSGGGFSARIESSHVINLRD 240
           V VDP   C  V +Y   + I+  +  G  L  D  E    +   ++ RIE  +++    
Sbjct: 101 VLVDPGANCVVVRLYHGLLRIIPLNGIGEKLTTDSLEVNQYAANTYNVRIEEGNIV---- 156

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
                  D  F+HGY  P   +++E EL          H            L+   L   
Sbjct: 157 -------DMAFLHGYTLPTFAMIYEDELVL--------HMKTYEISGREPALRNVQLTLD 201

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
           ++    D+  L+ VP P GGV++VG N I+YH++       ++ Y     +SQ L  ++ 
Sbjct: 202 SIE--PDSKLLIPVPKPFGGVILVGDNIIYYHTKDGP---HISQYIPQAKASQVLCYAAV 256

Query: 361 SVEL----DAAHATWLQNDVALLSTKTGDLVLLT----VVYDGRVVQ-RLDLSKTNPSVL 411
             +     D A   ++ + +A   T +G+ +L +     V   R+   R++L     +  
Sbjct: 257 DAQRYLLGDMAGRLYMVHLLAEDHTPSGNGLLGSTSSAAVPSARIGSIRIEL--LGETAT 314

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
              I  + N + F+G  LGDS L++ 
Sbjct: 315 PESIAYVDNGVVFIGCTLGDSQLIRL 340


>gi|340714589|ref|XP_003395809.1| PREDICTED: DNA damage-binding protein 1-like [Bombus terrestris]
          Length = 1141

 Score = 56.6 bits (135), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 88/420 (20%), Positives = 160/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSR GDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRFGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +++   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIIDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501


>gi|332030156|gb|EGI69950.1| DNA damage-binding protein 1 [Acromyrmex echinatior]
          Length = 1138

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    I+   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFAKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +  + +     VV+ L +       + 
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGSQVVKDLKVELLGEISIP 311

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + ++GSRLGDS L++ 
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336


>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName:
           Full=UV-damaged DNA-binding protein 1a; Short=DDB1a
 gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
 gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
 gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1088

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDP 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+ G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    +   WS  +L + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S SA  A+ +            + ++   V++D +          LL    G 
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGM 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+         
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA      + S  + L+  +N   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++  KSS     
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                   D+ +  +L++S  + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428


>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
 gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
          Length = 1148

 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 94/403 (23%), Positives = 155/403 (38%), Gaps = 95/403 (23%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
           V+D  F+HG   P +V+LH+         S   H   +    IS  LK    +   W   
Sbjct: 166 VQDMEFLHGCKTPTIVLLHQD--------SQARH---MKTYEIS--LKDKEFVKGPWKQD 212

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  +A  ++AVP P      +G  +I YH+      +           +  L R S  V
Sbjct: 213 HVESEATIVIAVPEPFCDARCIGQESITYHNGDQDVVI-----------TPHLIRQSTIV 261

Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
              ++DA  + +L  D+A      G L +L +  + ++     V+ L L       +   
Sbjct: 262 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 315

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           +T + N + ++GSRLGDS L++         L+S   E+   +E     T          
Sbjct: 316 MTYLDNGVVYVGSRLGDSQLIK---------LNSERNEQGSYVEVMEVFTN--------- 357

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L         +    F           G L+    G+ I+  AS     
Sbjct: 358 LGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS----- 403

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KGIW +        N DSSR           L++S   +T VL  + 
Sbjct: 404 -------IDLPGIKGIWPLR------VNTDSSR--------DNTLVLSFVGQTRVLMLSG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
              E TE   + +  +T   GN+    ++IQV     R++DG 
Sbjct: 443 EEVEETELAGFDISQQTFFCGNV-RNNQLIQVTAAAVRLVDGK 484


>gi|45184764|ref|NP_982482.1| AAL060Wp [Ashbya gossypii ATCC 10895]
 gi|74695871|sp|Q75EY8.1|CFT1_ASHGO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|44980110|gb|AAS50306.1| AAL060Wp [Ashbya gossypii ATCC 10895]
 gi|374105681|gb|AEY94592.1| FAAL060Wp [Ashbya gossypii FDAG1]
          Length = 1305

 Score = 55.8 bits (133), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 120/624 (19%), Positives = 248/624 (39%), Gaps = 124/624 (19%)

Query: 140 AKISVLEFDDSIHGLRITSMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
            ++S++ FD     L   S+H +++   E   L  G       P ++ +P  RC  +LV+
Sbjct: 82  GRVSIVRFDAENQTLETESLHYYDAKFEELSALTVGA-----APRLEQEPAARC--LLVH 134

Query: 198 GLQMIILKASQGGSGLV-------------GDEDTFGSGGGFSARIESSHVINLRDLDMK 244
               + +   +G                     D  G   G S  + +SH+ +    D+K
Sbjct: 135 NGDCLAVLPLRGHEEEGEEAEEEEEHPAKRARTDADGRLVGASTVMPASHLHS----DIK 190

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
           +VKD  F+ G  +  + +L++ +L+W G       T     LS+    ++  +I     L
Sbjct: 191 NVKDMRFLRGLNKSAVGVLYQPQLSWCGNEKLTRQTMKFIILSLDLDDEKSTVINMLQGL 250

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC--ALALNNYAVS-----------LDS 351
           P+  + ++ + +   G ++ G N + Y   + +   A++LN ++ S           L +
Sbjct: 251 PNTLHTIIPLSN---GCVLAGVNELLYVDNTGALQGAISLNAFSNSGLNTRIQDNSKLQA 307

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL---------D 402
             E P   F+ + +         D+ LL  +   +  + +  +GR++            +
Sbjct: 308 FFEQPLCYFATQSNG-------RDILLLMDEKARMYNVIIEAEGRLLTTFNCVQLPIVNE 360

Query: 403 LSKTN--PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
           + K N  P+ +  ++     SL F+G + GD++ V+       + L S L+         
Sbjct: 361 IFKRNMMPTSICGNMNLETGSL-FIGFQSGDAMHVRL------NNLKSSLEH-------- 405

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGP 514
                  + + S+ L+   + + + LYG   NN E  +K       F     D L+NIGP
Sbjct: 406 -------KGTVSETLE--TDEDYMELYG---NNAEKEKKNLETESPFDIECLDRLLNIGP 453

Query: 515 LKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADSSRMAA 570
           +   + G   + + +   ++  +  EL  +     G     T+   +       + +  +
Sbjct: 454 VTSLAVGKASSIEHTVAKLANPNKDELSIVATSGNGTGSHLTILENTIVPTVQQALKFIS 513

Query: 571 YDDEYH-------AYLIISLEARTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
               ++        YL+ +  ++T   + + D   +  ++ D+     T++     G +R
Sbjct: 514 VTQIWNLKIKGKDKYLVTTDSSQTRSDIYSIDRDFKPFKAADFRKNDTTVSTAVTGGGKR 573

Query: 623 VIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
           ++QV  +G  + D ++   MT +  F               V+ V I DP++LL  S G 
Sbjct: 574 IVQVTSKGVHLFDINFKRMMTMNFDF--------------EVVHVCIKDPFLLLTNSKGD 619

Query: 680 IRLLVGDPSTCTVSVQT--PAAIE 701
           I++   +P      V+T  P A++
Sbjct: 620 IKIYELEPKHKKKFVKTVLPDALK 643


>gi|320163506|gb|EFW40405.1| UV-damaged DNA binding protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1123

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 75/318 (23%), Positives = 127/318 (39%), Gaps = 69/318 (21%)

Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
           N+R L+   V D  F+ GY  P +++L++             H      L       + P
Sbjct: 208 NIR-LEELQVFDIKFLRGYDRPTILVLYQD-------TKETRHVKTYQVLLKEKEFAEGP 259

Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
             W+  N+   A  L+ V  P+GGVL+VG  TI YHS SA  ++A+    +         
Sbjct: 260 --WAQNNVEGGASLLIPVLMPLGGVLIVGEQTITYHSGSAFRSVAMRPAII--------- 308

Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
              +SV         +  +  LL+   G+L+ + + +D +  V  + + +   + + S +
Sbjct: 309 -KCYSV---------IDTNRFLLADSEGNLLSVLLTHDRQDKVTAIKIDRLGVTSILSCL 358

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
           T + N + F GS+ GDS L++              ++E G       S   L      A+
Sbjct: 359 TYLDNGVVFGGSQFGDSQLLRLATE----------RDETGSFVRVLESFSNLGPICDMAV 408

Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
            D+                +    T S A +D     G L+    G+         GI +
Sbjct: 409 VDL------------ERQGQCQVVTCSGAFKD-----GSLRVVRNGV---------GIEE 442

Query: 536 QSNYELVELPGCKGIWTV 553
           Q+    +ELPG KGIW++
Sbjct: 443 QAT---IELPGIKGIWSL 457


>gi|390342012|ref|XP_793599.3| PREDICTED: uncharacterized protein LOC588842 [Strongylocentrotus
           purpuratus]
          Length = 1161

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 68/281 (24%), Positives = 111/281 (39%), Gaps = 46/281 (16%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           GP+  +DP+ R  G+ +Y     I+   +    L            F+ R+E  +VI+++
Sbjct: 50  GPIGIIDPECRMIGLRLYDGLFKIIPLDRDNKEL----------KAFNIRLEELNVIDVQ 99

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
                      F++G  +P +V LH+      GR     H              + P  W
Sbjct: 100 -----------FLYGCHQPTIVFLHQDP---HGR-----HVKTYEVNLREKEFNRGP--W 138

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++AVP P GG L++G  +I YH      A+A             +  S+
Sbjct: 139 KQDNVETEATMVIAVPQPYGGALIIGQESITYHKGDNYVAIA----------PPTIKNST 188

Query: 360 FSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDIT 416
                 LD   + +L  D  L       L+      DG   V+ L L     + +   +T
Sbjct: 189 LVCYGRLDNNGSRYLLGD--LTGRLFLLLLDKEESMDGAATVKDLKLEFLGETSIAECLT 246

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
            + N + F+GSRLGDS LV+    S  S     + E F ++
Sbjct: 247 YLDNGVVFIGSRLGDSQLVRLNTESDESGSYVTMMETFTNL 287


>gi|401426989|ref|XP_003877978.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322494225|emb|CBZ29522.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 1542

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 108/246 (43%), Gaps = 28/246 (11%)

Query: 912  QRITIFKNISG-HQGFFLSGSRPCWCM--VFRERLRVHPQLCDGSIVAFTVLHNVNCNHG 968
            +R+  F+ +   H+G ++ G  P + +      +L         ++  F   H+ + + G
Sbjct: 1009 ERLVPFRGLQDRHKGMYVCGQTPVFLVYHAATNQLVCTRHHATNAVRGFAPFHSRHVHGG 1068

Query: 969  FIYVTSQGILKICQL-PSGSTYDNY-WPVQKVIPLKATPHQITYFAEKNLYPLIVS---- 1022
            F+Y   +G +    + P G    +  W +++V  L  TPHQI Y    +   ++ S    
Sbjct: 1069 FVYC-GEGFVHFATMQPFGELLGSSGWWLERV-RLGCTPHQIIYSPAAHGCFVVASRPQP 1126

Query: 1023 -VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSVDLHRTYTVEEYEVRILEPDRAG 1075
              P   P +  L ++ D+E   V H I+  +L   S+       T E YEV+      + 
Sbjct: 1127 FSPKRAPFDVQLRMVEDEEGNRVPHVIEAVSLPPLSAASGSPVPTNERYEVQFF----ST 1182

Query: 1076 GPWQTRATIPMQSSENALTVRVVTLFNTTTKE------NETLLAIGTAYVQGEDVAARGR 1129
              WQ    + + ++E  L+  ++ +   TT +         + A+ TAY  GEDV  RGR
Sbjct: 1183 LDWQCMGRLVLDANEKVLSATLMQVTRDTTMDAANRSTTAPVCALATAYPLGEDVTTRGR 1242

Query: 1130 VLLFST 1135
            +LL +T
Sbjct: 1243 ILLLTT 1248



 Score = 40.4 bits (93), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 84/185 (45%), Gaps = 37/185 (20%)

Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
           GGG S  +    V + R  D+K    +++D  FV    EP++  L E++ TWAGRV    
Sbjct: 283 GGGTSLLLRIGTVTHWRLQDVKTALRNIRDIQFVESAGEPLLAFLFEKQPTWAGRVKLLE 342

Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLA------VPSPIG 319
           W+         TC I    ++++ +   H L  S ++ LP+D   +        VPS + 
Sbjct: 343 WRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGLPYDVTSMTPLTAFQDVPSAVF 402

Query: 320 GV---LVVGANT-----IHYHSQSASCALALNNYAVSLD------SSQELPRSSFSVELD 365
            V   ++V  +T     ++ ++     A +L + AVS +      +SQ L      V L+
Sbjct: 403 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSFEAVQWRSASQALSTDLVKVNLN 462

Query: 366 AAHAT 370
            ++AT
Sbjct: 463 FSNAT 467


>gi|71413926|ref|XP_809084.1| cleavage and polyadenylation specificity factor-like protein
           [Trypanosoma cruzi strain CL Brener]
 gi|70873410|gb|EAN87233.1| cleavage and polyadenylation specificity factor-like protein,
           putative [Trypanosoma cruzi]
          Length = 499

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 54/260 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIDSSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
           NN  +                   D   E   + F V L  A+ T     + N+   + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFMVNLSLANCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S I  IG+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIE 458
           C +   M  S   + F  IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499


>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1181

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 92/420 (21%), Positives = 156/420 (37%), Gaps = 94/420 (22%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
           V+D  F+HG   P +V+LH+         S   H       +   +LK    +   W   
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHM-----KTYEVSLKDKEFVKGPWKQD 210

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  +A  ++AVP P  G L++G  +I YH+         + Y V    +  L R S  V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259

Query: 363 ---ELDAAHATWLQNDVA-------------------LLSTKTGDLVLLTVVYDGRV--- 397
              ++DA  + +L  D+A                   LL    G L +L +  + ++   
Sbjct: 260 CYGKVDANGSRYLLGDMAGRLFMLLLEREDKMDGTXYLLGDMAGRLFMLLLEREDKMDGT 319

Query: 398 --VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
             V+ L L       +   IT + N + ++GSRLGDS L++             + E F 
Sbjct: 320 TTVKDLKLEFLGEITIAECITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFT 379

Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
           ++                 + DM                +    T S A ++     G L
Sbjct: 380 NL---------------GPIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSL 412

Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
           +    G+ I+  AS            ++LPG KG+W +        +    R      E 
Sbjct: 413 RIIRNGIGIHEHAS------------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAER 460

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
              L++S   +T VL  +    E TE   +    +T   GN+   +++IQV     R++D
Sbjct: 461 DNTLVLSFVRQTRVLMLSGEEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 519


>gi|366994686|ref|XP_003677107.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
 gi|342302975|emb|CCC70752.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
          Length = 1340

 Score = 55.1 bits (131), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 83/388 (21%), Positives = 165/388 (42%), Gaps = 64/388 (16%)

Query: 99  ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
             L LV  + L+  +  +A++ Q  +  S     +++A   AKIS++ FD   + L   S
Sbjct: 46  GKLNLVEEFNLNAKITDIALIPQEKSPLS----CLVIASGVAKISIVRFDAVTNSLETLS 101

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS------- 211
           +H +E            + A+   ++VDP  R   +L++    I L     G+       
Sbjct: 102 LHYYEDKLS---DISLVTLAKTSKLRVDPMNR--ALLLFNNDSIALLPLFSGNHEDEDED 156

Query: 212 ------GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVIL 263
                  +   E T          +  S + ++++L  ++++V D  F++ + +P + +L
Sbjct: 157 DEEDDYDVTRGEVTTKRSKKNEKHVGQSKIFHVKELHQELQNVLDIQFLNDFTKPTLAVL 216

Query: 264 HERELTWAGRVSWKHH--TCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSP 317
           ++ +LTW G         + MI  L++ T    T     +I +  +L  D ++LL +   
Sbjct: 217 YQPKLTWVGNTELNPQPTSFMIFTLNLRTNELETAFDVVIIATLHDLSWDWFQLLPISR- 275

Query: 318 IGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSFSVELDA---AHATW- 371
             G +V+G N + Y   +      + LN++A   D S +  R     EL+       T+ 
Sbjct: 276 --GCVVMGNNEMAYIDNTGVLQSIIHLNSFA---DKSLQRARIIDETELEVFFNEKVTYF 330

Query: 372 ---------LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----------TNPS-V 410
                    + ++  L+   + +L  + +  +GR++ + DL K           +NP+ V
Sbjct: 331 WSASTDKKNIDDETLLIIDASANLYYVRLEAEGRLLTKFDLIKLPIVNDALKDTSNPTCV 390

Query: 411 LTSDITTIGNSL-FFLGSRLGDSLLVQF 437
              D  +  +S+  F+G   GDSL+V+ 
Sbjct: 391 ARVDPNSSNSSMDLFIGYLSGDSLVVRL 418


>gi|259155222|ref|NP_001158852.1| DNA damage-binding protein 1 [Salmo salar]
 gi|223647700|gb|ACN10608.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 1139

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 77/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG VV + L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S  S     + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+   G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479


>gi|223647932|gb|ACN10724.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 1139

 Score = 54.7 bits (130), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 77/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG VV + L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S  S     + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+   G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479


>gi|156389050|ref|XP_001634805.1| predicted protein [Nematostella vectensis]
 gi|156221892|gb|EDO42742.1| predicted protein [Nematostella vectensis]
          Length = 1157

 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 95/212 (44%), Gaps = 40/212 (18%)

Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
           N+R L+  HV D  F++G   P +V +++             H   +    I+  L+ H 
Sbjct: 155 NIR-LEELHVVDIQFLYGCANPTIVFIYQDP-----------HGRHVKTYEIN--LRDHE 200

Query: 297 LI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
                W   N+  +A +++AVP+P+GG L++G  +I YH  S   A+A            
Sbjct: 201 FAKGPWKQDNVEVEACRVIAVPNPLGGALIIGQESITYHKGSNYHAIA----------PP 250

Query: 354 ELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKT 406
            L +SS +   ++D   + +L  D+       G L +L +    + DG   V+ L L   
Sbjct: 251 ALKQSSLTCHGKIDTNGSRYLLGDM------NGRLYMLLLERQELIDGTYEVKDLKLEML 304

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
             + +   +  + N + F+GS LGDS L + +
Sbjct: 305 GETSIAHCLVYLDNGVVFIGSMLGDSQLAKLS 336


>gi|190345965|gb|EDK37945.2| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 1206

 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 123/611 (20%), Positives = 236/611 (38%), Gaps = 102/611 (16%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           +  ++ +CH ++ G ++++  + +GG++     D +++  +  ++S+LEFD         
Sbjct: 55  SGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD------- 103

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
            +  F   +  H K G      G  + VDPQ R   +       ++ KA    + L    
Sbjct: 104 ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL---- 159

Query: 218 DTFGSGGGFSARIES----SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                    S+ +ES    +  I +  LD           GY  P++  +   E  +A  
Sbjct: 160 -------ELSSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHY 199

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANT 328
            +   +    S L++     +  L + A     + +P  +  L+ +P+PIGGV+V G++ 
Sbjct: 200 DASLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSF 259

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I YH+ +    L L     S   S  +P   ++V     H     N   LL  + GD   
Sbjct: 260 IFYHNPTIDQQLYLP--IPSRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFR 312

Query: 389 LTVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSM 445
           + + Y  D   V  L +   +    ++ I        F      D +L Q    G   S 
Sbjct: 313 VLIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSY 372

Query: 446 LSSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
           +SS       D+ + +     + R   + AL  +++       G+    +ES +   +  
Sbjct: 373 ISSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMI 432

Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
              S      LK  ++G+ I+               LV  P      +V+          
Sbjct: 433 TGHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF---------- 463

Query: 565 SSRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
           ++R++A + +   Y++IS  A  +T+VL   +++ EV +S   FV  +        G + 
Sbjct: 464 TTRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKS 520

Query: 623 VIQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
           +IQ++  G R +     +G    +   + P            T++S S     VL+G+S+
Sbjct: 521 LIQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSN 572

Query: 678 GSIRLLVGDPS 688
             +     DP+
Sbjct: 573 RELCYFEIDPT 583


>gi|322787057|gb|EFZ13281.1| hypothetical protein SINV_13198 [Solenopsis invicta]
          Length = 986

 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 48/203 (23%), Positives = 92/203 (45%), Gaps = 31/203 (15%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    I+   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFSKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSP+ G +++G  +I YH          N+Y   +    +    +
Sbjct: 208 RQDNVEREAMMVIPVPSPMCGAIIIGQESILYHDG--------NSYVAVVPPIIKQSTIT 259

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV-VQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L +      DG + V+ L +       +   
Sbjct: 260 CYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTLSVKDLKVELLGEISIPEC 313

Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
           IT + N + ++GSRLGDS L++ 
Sbjct: 314 ITYLDNGVIYVGSRLGDSQLIKL 336


>gi|19114492|ref|NP_593580.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe 972h-]
 gi|46395602|sp|O13807.1|DDB1_SCHPO RecName: Full=DNA damage-binding protein 1; AltName:
           Full=Damage-specific DNA-binding protein 1
 gi|2330717|emb|CAB11219.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe]
          Length = 1072

 Score = 53.9 bits (128), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 95/492 (19%), Positives = 189/492 (38%), Gaps = 97/492 (19%)

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
           RES   GPL+ VDP  R   + VY   + I+   +     +   +       FS RI+  
Sbjct: 111 RES-QSGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQEL 169

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V+           D   ++    P + +L            +K    ++   +    ++
Sbjct: 170 NVV-----------DIAMLYNSSRPSLAVL------------YKDSKSIVHLSTYKINVR 206

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
           +  +    + + HD  +   +PS  GGV V G   ++Y S+    +  L  Y        
Sbjct: 207 EQEIDEDDV-VCHDIEEGKLIPSENGGVFVFGEMYVYYISKDIQVSKLLLTY-------- 257

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
             P ++FS  +     T L + + +++ ++G L     ++    V  ++L K   S + S
Sbjct: 258 --PITAFSPSISNDPETGLDSSIYIVADESGMLYKFKALFTDETVS-MELEKLGESSIAS 314

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +  + ++  F+GS   +S+L+Q                         PS  +      +
Sbjct: 315 CLIALPDNHLFVGSHFNNSVLLQL------------------------PSITK-NNHKLE 349

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            LQ+ VN   +S +    + T S+  T S A +D     G L+     + I         
Sbjct: 350 ILQNFVNIAPISDFIIDDDQTGSSIITCSGAYKD-----GTLRIIRNSINI--------- 395

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
               N  L+E+ G K  ++V            S  A YD+  + +L +  E R +++   
Sbjct: 396 ---ENVALIEMEGIKDFFSV------------SFRANYDN--YIFLSLICETRAIIVSPE 438

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
            +    + + D   +  TI    ++G  +++Q+  +  R+ DG  +   +S  P +   G
Sbjct: 439 GVF---SANHDLSCEESTIFVSTIYGNSQILQITTKEIRLFDGKKLHSWIS--PMSITCG 493

Query: 654 SGSENSTVLSVS 665
           S   ++  ++V+
Sbjct: 494 SSFADNVCVAVA 505


>gi|325189950|emb|CCA24429.1| splicing factor putative [Albugo laibachii Nc14]
          Length = 1644

 Score = 53.5 bits (127), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 72/315 (22%), Positives = 129/315 (40%), Gaps = 45/315 (14%)

Query: 299 WSAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHYHSQS---ASCALALNNYAVSLDSS 352
           WS + +P  A KL+AVP      GGVLV+    I Y +++    SC+  L +     +  
Sbjct: 660 WSQV-VPRSANKLVAVPGGNDGPGGVLVIAQGLIQYQNENHPPLSCSFPLRSTG-GPNPV 717

Query: 353 QELPRSSFSVELDAAHATWLQNDV--ALLSTKTGDLVLLTVVYDGRVVQRLDLS--KTNP 408
           Q+  +  + + +  + AT  Q D+   L+ ++ GDL  +++ Y G  VQ+L +    T P
Sbjct: 718 QDERKQGYPMMI-VSTATHKQRDLFFVLMQSEWGDLFKISLEYAGSSVQKLRIQYFDTIP 776

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
             L   IT  G  L F  S   +  L QF         +  +     D E + PS    +
Sbjct: 777 VALALCITKTG--LLFAASEFSNHYLFQFLSIGEDDDAAQCVSAAENDQEPEIPSFSVRK 834

Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA 528
             +   + ++ +   ++         E   + ++   +    N   L+   +GL +   A
Sbjct: 835 LKNLAMISNIPSISPITQLLVDDFANEQTPQLYALCGQG---NRSSLRILRHGLPVMEMA 891

Query: 529 SATGISKQSNYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           ++             LPG  K +W +                ++ D    Y+++S E  T
Sbjct: 892 ASA------------LPGVAKAVWCLKE--------------SFTDTCDKYIVVSFEDAT 925

Query: 588 MVLETADLLTEVTES 602
           +VLE  D + E+T+S
Sbjct: 926 LVLEIGDTVEEITDS 940


>gi|398020786|ref|XP_003863556.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania donovani]
 gi|322501789|emb|CBZ36871.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania donovani]
          Length = 1542

 Score = 53.1 bits (126), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 76/328 (23%), Positives = 140/328 (42%), Gaps = 34/328 (10%)

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
            SA  +   L  IL+ G ++ Y+    +  GP    K    +   +  + V     +R + 
Sbjct: 930  SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKR 989

Query: 890  LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCM--VFRERLRVH 946
            L+  R  L + T ++  H +  +R+  F+ +   H+G ++ G  P + +      +L   
Sbjct: 990  LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRHKGIYVCGQTPVFLVYHAATNQLVCT 1046

Query: 947  PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYDNY-WPVQKVIPLKAT 1004
                  ++  F   H+ + + GF+Y   +G +    + P G    +  W +++V  L  T
Sbjct: 1047 RHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGSSGWWLERV-RLGCT 1104

Query: 1005 PHQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSV 1053
            PHQ+ Y    +   ++ S      P   P +  L ++ D+E   V H I+  +L   S+ 
Sbjct: 1105 PHQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSAT 1164

Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------ 1107
                  T E YEV+      +   WQ    + +  +E  L+  ++ +   TT +      
Sbjct: 1165 SGSPVPTNERYEVQFF----STLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRST 1220

Query: 1108 NETLLAIGTAYVQGEDVAARGRVLLFST 1135
               + A+ TAY  GEDV  RGR+LL +T
Sbjct: 1221 TAPVCALATAYPLGEDVTTRGRILLLTT 1248



 Score = 41.2 bits (95), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)

Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
           +   A+  G G    +     GG  S  +    V + R  D+K    +++D  FV    E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321

Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
           P++  L E++ TWAGRV    W+         TC I    ++++ +   H L  S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381

Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
           P+D   +  +P+    P     V     +H  ++S             A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441

Query: 351 ------SSQELPRSSFSVELDAAHAT 370
                 +SQ L      V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467


>gi|146096490|ref|XP_001467824.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania infantum JPCM5]
 gi|134072190|emb|CAM70891.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania infantum JPCM5]
          Length = 1542

 Score = 53.1 bits (126), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 76/328 (23%), Positives = 140/328 (42%), Gaps = 34/328 (10%)

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
            SA  +   L  IL+ G ++ Y+    +  GP    K    +   +  + V     +R + 
Sbjct: 930  SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVVESIEARKKR 989

Query: 890  LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCM--VFRERLRVH 946
            L+  R  L + T ++  H +  +R+  F+ +   H+G ++ G  P + +      +L   
Sbjct: 990  LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRHKGIYVCGQTPVFLVYHAATNQLVCT 1046

Query: 947  PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYDNY-WPVQKVIPLKAT 1004
                  ++  F   H+ + + GF+Y   +G +    + P G    +  W +++V  L  T
Sbjct: 1047 RHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGSSGWWLERV-RLGCT 1104

Query: 1005 PHQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSV 1053
            PHQ+ Y    +   ++ S      P   P +  L ++ D+E   V H I+  +L   S+ 
Sbjct: 1105 PHQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSAT 1164

Query: 1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------ 1107
                  T E YEV+      +   WQ    + +  +E  L+  ++ +   TT +      
Sbjct: 1165 SGSPVPTNERYEVQFF----STLDWQCMGRLVLDVNEKVLSATLMQVTRDTTMDAANRST 1220

Query: 1108 NETLLAIGTAYVQGEDVAARGRVLLFST 1135
               + A+ TAY  GEDV  RGR+LL +T
Sbjct: 1221 TAPVCALATAYPLGEDVTTRGRILLLTT 1248



 Score = 41.2 bits (95), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)

Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
           +   A+  G G    +     GG  S  +    V + R  D+K    +++D  FV    E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321

Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
           P++  L E++ TWAGRV    W+         TC I    ++++ +   H L  S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381

Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
           P+D   +  +P+    P     V     +H  ++S             A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441

Query: 351 ------SSQELPRSSFSVELDAAHAT 370
                 +SQ L      V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467


>gi|242010743|ref|XP_002426118.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
 gi|212510165|gb|EEB13380.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
          Length = 1148

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/276 (21%), Positives = 116/276 (42%), Gaps = 61/276 (22%)

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
           G++S   G +  +DP+ R  G+ +Y   + I+   +  S L             S R+E 
Sbjct: 113 GKQS-ETGIIAVIDPEARVIGLRLYDGLLKIIPLGKDNSELKAS----------SIRMEE 161

Query: 233 SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
             V           +D  F+HG   P ++++H+        ++ +H    +    IS   
Sbjct: 162 VEV-----------QDLNFLHGCQNPTIILIHQD-------INGRH----VKTHEISLRD 199

Query: 293 KQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---LNNYAVS 348
           K+   + W   N+  DA  ++ VP P+ G +++G  +I YH+ +   A+A   +N   ++
Sbjct: 200 KEFVKMPWKQDNVEPDASIVIPVPEPLCGAIIIGQESILYHNGAGYVAVAPPVINQSTIT 259

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-------VQRL 401
             +           ++D+  + +L  D+A      G L +L +  + ++          L
Sbjct: 260 CYT-----------QVDSNGSRYLLGDMA------GHLFMLLLETEEKIDGTPCVKENGL 302

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
            +       +   IT + N + F+GSR GDS LV+ 
Sbjct: 303 KVELLGEISIPEAITYLDNGVLFIGSRCGDSQLVKL 338


>gi|302406266|ref|XP_003000969.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
 gi|261360227|gb|EEY22655.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
          Length = 1059

 Score = 53.1 bits (126), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 134/581 (23%), Positives = 217/581 (37%), Gaps = 147/581 (25%)

Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
           V  + + G + S+A     G++    +D +ILA +  +I+++E+        + + + F+
Sbjct: 59  VLSHDVFGIIRSMAAFRIAGSN----KDYLILATDSGRIAIIEY--------LPAQNRFQ 106

Query: 164 SPEWLHL----KRGRESFARGPLVKVDPQGRCGGVLVYGLQ-----MIILKASQGGSGLV 214
               LHL    K G      G  +  DP+GR    L+  L+      ++ + SQ      
Sbjct: 107 R---LHLETFGKSGIRRVVPGEFLACDPKGRA--CLIASLEKNKLVYVLNRNSQA----- 156

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
             E T  S     A     HV+++  LD+          GY  PV   L E + T A + 
Sbjct: 157 --ELTISSP--LEAHKPGVHVLSMVALDV----------GYANPVFAAL-ETDYTEADQD 201

Query: 275 SWKHHTCMISALSISTTLKQHPL---------IWSAMNLPHDAYKLLAVPSPIG-----G 320
                    +AL + T L  + L          WS    P D    L    P G     G
Sbjct: 202 PTGQ-----AALDVETQLVYYELDLGLNHVVRKWSE---PVDNTASLLFQVPGGNDGPSG 253

Query: 321 VLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
           VLV G   I Y HS   +  + +         + E P    S+     H   L+      
Sbjct: 254 VLVCGEENITYRHSNQEAFRVPVPRRR----GATEDPSRKRSIVAGVMHK--LKGSAGAF 307

Query: 378 --LLSTKTGDLVLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSR 428
             LL T+ GDL  +T+      DG     V+RL +   +   + S +  + +   ++ S+
Sbjct: 308 FFLLQTEDGDLFKITIDMIEDRDGNPTGEVKRLKIKYFDTIPVASSLCILKSGFLYVASQ 367

Query: 429 LGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
            G+    QF              E+ GD        + L  SS D   D     E   + 
Sbjct: 368 FGNYQFYQF--------------EKLGD------DDEELEFSSDDFPTDPKQSYEAVFF- 406

Query: 489 SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV-- 542
                    ++  + A+ +S+ ++ PL D         DA    +A G   +S + ++  
Sbjct: 407 -------HPRELENLALVESIDSMNPLIDCKVANLTGEDAPQIYTACGNGARSTFRILKH 459

Query: 543 ----------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE 591
                     ELPG    +WT+  K SRG            D+Y AY+++S    T+VL 
Sbjct: 460 GLEVNEIVASELPGIPSAVWTL--KLSRG------------DQYDAYIVLSFTNATLVLS 505

Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
             + + EV +S   F+      A  L G   +IQV  +G R
Sbjct: 506 IGETVEEVNDS--GFLTSVPTLAAQLLGGEGLIQVHPKGIR 544


>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
           occidentalis]
          Length = 1154

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 99/428 (23%), Positives = 165/428 (38%), Gaps = 82/428 (19%)

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           +PV+ I++E + T       +H    + AL     L + P  W   NL  +A  L+ V  
Sbjct: 178 DPVLAIVYEEQQT-------RHMKTHVIALR-DKELMKGP--WGQRNLDLEADMLIPVED 227

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
              GV++VG  TI YH           +Y     S     + S    +D          V
Sbjct: 228 TETGVIIVGGETIVYHYG--------QDYICIQPSFLRTTKISCYCRIDNNRL------V 273

Query: 377 ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
            +L    G L +LT+  + + V    L       +   ++ + N + F+GSRLGDS L++
Sbjct: 274 FILGGICGRLFILTLRRENKKVVSHSLDLLGSVSIPECLSYLDNGVVFVGSRLGDSQLIR 333

Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-DALQDMVNGEELSLYGSASNNTE 495
                               + A  P  + L   ++  A+ DM+   +L   G     T 
Sbjct: 334 --------------------MHAQEPFIEVLESYTNLGAILDMI-VVDLEKQGQDQLITC 372

Query: 496 SAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYH 555
           S Q              G L+    G+ I+  A             VEL G KGIW +  
Sbjct: 373 SGQGA-----------CGSLRIIRNGIGIHELAC------------VELSGIKGIWAL-- 407

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--TADLLTEVTESVDYFVQGRTIA 613
              R + A        DD     L++S   +T V    + + L +VT    + +  +T  
Sbjct: 408 ---RMNTAQLEEDTPTDDT----LVLSFVGQTRVFNCSSTEELEQVTLPAAFDIDSQTFC 460

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQ-DLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
           A N+ G  +VIQV ++   ++  +  T+ D  F P        + N   +++++ +  V 
Sbjct: 461 ARNVLG-NQVIQVTDKRVNLISVTSKTRVDQWFPPEGEIITQCACNDVQVALALKNVLVY 519

Query: 673 LGMSDGSI 680
           L + DGS+
Sbjct: 520 LEIRDGSL 527


>gi|261335516|emb|CBH18510.1| cleavage and polyadenylation specificity factor-like protein,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 1452

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
           +++V+D  F+    EP++ IL ER+ TWAGRV    W+      + LS       IS T 
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
              P   L+     +P++   +L V S    + GV+  G NTI +               
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373

Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
                 S+S++ +    N+    L+SS  L R + S+    A     ++D        +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
              G +++L  +  G  +  + ++        S IT I   L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487



 Score = 48.1 bits (113), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 66/291 (22%), Positives = 125/291 (42%), Gaps = 37/291 (12%)

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
            + ++ A ++R L+  RT ++  T +   H +  +RI  F  ++G  G ++ G  P + M 
Sbjct: 895  IESIEAKKMR-LQSERTMIENDT-QSVRHCS--RRIIPFAAVAGQSGAYVCGQHPLFLM- 949

Query: 939  FRERLR---VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS-GSTYDNYWP 994
            +  R R    +     G +  F    +++   GFIY   +G +    + +  S   N W 
Sbjct: 950  WDNRTRQLVAYRHQAPGPVRGFVPFTSMS--GGFIYCC-EGFVDFAVMNTYCSPGGNGW- 1005

Query: 995  VQKVIPLKATPHQITYFAEKNLYPLIVSVPV-LKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053
            +++ I + ATPH I Y        ++ S  V  +P  Q  S  +  ++ +  D++ + SV
Sbjct: 1006 LRRRIHIGATPHFIVYDPPGRSCFVVTSKKVPFRP--QRASFDVQLKIQYDEDSNTVQSV 1063

Query: 1054 DLH---------------RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVV 1098
                              R    E +EVR+    + G  W     + +  +E  L  ++V
Sbjct: 1064 TTEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFKKG--WDCTDKLMLDENEKVLGAQMV 1121

Query: 1099 TLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
             +    N        +  + TA+  GEDV  RGR++L ++ RN    +++V
Sbjct: 1122 EIHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIILLAS-RNIKGRRSIV 1171


>gi|74025892|ref|XP_829512.1| cleavage and polyadenylation specificity factor-like protein
           [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
 gi|70834898|gb|EAN80400.1| cleavage and polyadenylation specificity factor-like protein,
           putative [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 1452

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
           +++V+D  F+    EP++ IL ER+ TWAGRV    W+      + LS       IS T 
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
              P   L+     +P++   +L V S    + GV+  G NTI +               
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373

Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
                 S+S++ +    N+    L+SS  L R + S+    A     ++D        +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
              G +++L  +  G  +  + ++        S IT I   L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487



 Score = 48.1 bits (113), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 69/296 (23%), Positives = 124/296 (41%), Gaps = 47/296 (15%)

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
            + ++ A ++R L+  RT ++  T +   H +  +RI  F  ++G  G ++ G  P + M 
Sbjct: 895  IESIEAKKMR-LQSERTMIENDT-QSVRHCS--RRIIPFAAVAGQSGAYVCGQHPLFLM- 949

Query: 939  FRERLRV-------HPQLCDGSIVAFTVLHN--VNCNHGFIYVTSQGILKICQLPSGSTY 989
            +  R R         P L  G  V FT +    + C  GF+      ++     P G   
Sbjct: 950  WDNRTRQLVAYRHQAPGLVRG-FVPFTSMPGGFIYCCEGFVDF---AVMNTYCSPGG--- 1002

Query: 990  DNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV-LKPLNQVLSLLIDQEVGHQIDNH 1048
             N W +++ I + ATPH I Y        ++ S  V  +P  Q  S  +  ++ +  D++
Sbjct: 1003 -NGW-LRRRIHIGATPHFIVYDPPGRSCFVVTSKKVPFRP--QRASFDVQLKIQYDEDSN 1058

Query: 1049 NLSSVDLH---------------RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENAL 1093
             + SV                  R    E +EVR+    + G  W     + +  +E  L
Sbjct: 1059 TVQSVTTEAPVCNMPAIKPGTGVRVPLTERFEVRLHSTFKKG--WDCTDKLMLDENEKVL 1116

Query: 1094 TVRVVTLF---NTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLV 1146
              ++V +    N        +  + TA+  GEDV  RGR++L ++ RN    +++V
Sbjct: 1117 GAQMVEIHQDANADGSATAPVCVVCTAFPLGEDVTCRGRIILLAS-RNIKGRRSIV 1171


>gi|389602597|ref|XP_001567507.2| cleavage and polyadenylation specificity factor-like protein
            [Leishmania braziliensis MHOM/BR/75/M2904]
 gi|322505515|emb|CAM42945.2| cleavage and polyadenylation specificity factor-like protein
            [Leishmania braziliensis MHOM/BR/75/M2904]
          Length = 1536

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 61/250 (24%), Positives = 111/250 (44%), Gaps = 30/250 (12%)

Query: 912  QRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRVHPQLCDGSIVAFTVLHNVNCNH 967
            +R+  F  + G H+G ++ G  P + +V+     +L         ++  F   H+ + + 
Sbjct: 1003 ERLVPFCALQGRHKGIYVCGQTPVF-LVYHYATNQLVCTRHHATSAVRGFAPFHSRHVHG 1061

Query: 968  GFIYVTSQGILKICQL-PSGSTYDNY-WPVQKVIPLKATPHQITYFAEKNLYPLIVS--- 1022
            GF+Y   +G +    + P G    +  W +++V  L  TPHQ+ Y    +   ++VS   
Sbjct: 1062 GFVYC-GEGFVHFATMQPFGELLGSSGWWLERVR-LGCTPHQVIYSPAAHGCFVVVSRPQ 1119

Query: 1023 --VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SSVDLHRTYTVEEYEVRILEPDRA 1074
               P   P +  L ++ D+E   V H ++  +L   S+       T   YEV++     +
Sbjct: 1120 PFSPKRAPFDVQLRMVEDEEGNRVPHVVEPVSLPPLSATSGSPVPTNGRYEVQLF----S 1175

Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKE------NETLLAIGTAYVQGEDVAARG 1128
               WQ    + +  +E  L+  ++ +   TT +         + A+ TAY  GEDV  RG
Sbjct: 1176 TLDWQRVDCLALDVNEKVLSATLMQVSRDTTMDVAYRSATAPVCALATAYPLGEDVTTRG 1235

Query: 1129 RVLLFSTGRN 1138
            RVLL +T + 
Sbjct: 1236 RVLLLATSQQ 1245



 Score = 42.7 bits (99), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 68/138 (49%), Gaps = 20/138 (14%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSIST 290
           +++++D  FV    EP++  L E++ TWAGRV    W+         TC I    ++++ 
Sbjct: 298 LRNIRDVQFVASAGEPLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLAN 357

Query: 291 TLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSASCALALNNY 345
           T   H L  S ++ LP+DA  +  +P+    P   VL V  N + + S  +   + +N  
Sbjct: 358 TAAPHMLSLSEVDGLPYDATSMTPLPAFQDVP-SAVLCVSRNMMVHVSTKSGYGVYVN-- 414

Query: 346 AVSLDSSQELPRSSFSVE 363
           A+  + ++ L  S+ S E
Sbjct: 415 AMGEEQARSLKSSAVSCE 432


>gi|339235331|ref|XP_003379220.1| DNA damage-binding protein 1 [Trichinella spiralis]
 gi|316978142|gb|EFV61158.1| DNA damage-binding protein 1 [Trichinella spiralis]
          Length = 1329

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 152/355 (42%), Gaps = 66/355 (18%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R  +  +SA  L+ V   ++ G + +  + +  G + +     +++      ++++E+D+
Sbjct: 205 RFEVHSVSAEGLQYVTEGKMFGRIGAAKLFTPKGENKAL----MVIVTLKQDVAIVEYDN 260

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
                RI ++      E      GR + + G L+ V P G   G+ +       +  ++ 
Sbjct: 261 G----RIKTLASRNISE----NFGRPA-SNGILLSVHPDGEVIGLRIMSSTFKCITWNRA 311

Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
            S L                  S++ +N     + H+ DF+F+HG+  PV+ +++     
Sbjct: 312 TSKL------------------STYSLNY---SLTHLSDFVFLHGFQFPVIALIY----- 345

Query: 270 WAGRVSWKHH-TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
             G +  +H  TC IS        +  P  WS  ++  +A+ L+AVP P+ GV+VVG ++
Sbjct: 346 --GDLVGRHVITCRISL--DEQEFENGP--WSRGHIEWEAHTLIAVPPPLCGVIVVGCSS 399

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           + Y           +N  +S  S   L +S  +   DAA           L    G L L
Sbjct: 400 LLY---------IRDNSTISTVSPPFLSKSIVNC-YDAAP----DGLTYFLGQLDGTLSL 445

Query: 389 LTVVYDGRVVQRLDLSKTNPSVL--TSDITTIG----NSLFFLGSRLGDSLLVQF 437
           L +  +     ++ LS+   ++L  TS   ++      SL F+GSR+ DS L++ 
Sbjct: 446 LKLDIETDAEGKVTLSRMRATILGVTSPPDSLSYMHKESLLFVGSRIADSKLLRL 500


>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
          Length = 1286

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 127/604 (21%), Positives = 225/604 (37%), Gaps = 114/604 (18%)

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
           L+       L L+    + G +  L      G++    +D +++  +  ++ +L+F +  
Sbjct: 41  LLRADKQGKLNLIASKDVFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
           +      +HC       + K G      G  + VDP+GR   +        VY L     
Sbjct: 97  NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149

Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
            Q+ I              D  G   GF   I +S   N    D K V +   + G    
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPIFASIEQNYEMYD-KQVTNTNEIDGCTRK 208

Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            ++ L E +L                   ++  +++H        LP D+   L +P P 
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDSSAHLLIPIPG 243

Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           G     GV+V   N + Y         CA     Y   L++ QE    + S+   A H  
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSALHRI 295

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
             +    L+ ++ GDL  + + ++  +V+ +     +   + + I  + +   F+ +  G
Sbjct: 296 R-KFFFILIQSEFGDLYKIEMDHEDGIVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354

Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
           +    QF+  G   +      K   G     A  TK+L   ++  L D V      L + 
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411

Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
              + N  S Q         +L   GP   L+   +GL I   A              EL
Sbjct: 412 ILDAKNANSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN------------EL 452

Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG  K IWT+     +  NA          +Y  Y+I+S E  T++LE  + + EV +S+
Sbjct: 453 PGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDSL 498

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
              +   T    N+     +IQV + G R ++G  + + +   P N +  + + NST + 
Sbjct: 499 --LLTNVTTIHVNILYDNTLIQVHDTGIRHINGKVVHEWVP--PKNKQIKAATSNSTQIV 554

Query: 664 VSIA 667
           +S++
Sbjct: 555 ISLS 558


>gi|428180158|gb|EKX49026.1| hypothetical protein GUITHDRAFT_68305 [Guillardia theta CCMP2712]
          Length = 1202

 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 110/554 (19%), Positives = 202/554 (36%), Gaps = 97/554 (17%)

Query: 99  ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
             ++ +C     G + S+A     G++    +D ++L  +  +ISVLEF    +      
Sbjct: 47  GKIQSICQMECFGLIRSMASFRLPGSN----KDYLVLGADSGRISVLEFSKERNQFERVH 102

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
           +  +        K G      G  +  DP+GR   +     Q ++          V + D
Sbjct: 103 LETYG-------KSGCRRIVPGQFLASDPKGRAVMISAIEKQKLVY---------VFNRD 146

Query: 219 TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRV 274
                   S+++  S  +        H        G+  P+   L     + +    G+ 
Sbjct: 147 A-------SSKLTISSPLEAHKASTIHFSIVGVDVGFDNPIFAALEMDYSDADADETGQ- 198

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP-----IGGVLVVGANTI 329
           S +    +++   +   L     +    + P DA   + +P P       GVLV   N I
Sbjct: 199 SAEEFNKVLTFYELDLGLNH---VVRKASEPIDAASNMLIPVPGDTDGPSGVLVCAENKI 255

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
            Y        +AL      +   Q L  + +S      H         LL ++ GDL  L
Sbjct: 256 AYKKPDHEDVVALIPRRQGMPLDQPLLITGYS------HLKQKDGFFFLLQSEIGDLYRL 309

Query: 390 TVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---ML 446
           T+ Y+   V  ++++  +   +   IT +     F+ S  G+  L QF    G+    M+
Sbjct: 310 TLTYNDEEVSEINITYFDTVPVAQSITILKTGFLFVASEFGNHALYQFLSIKGSDESDMM 369

Query: 447 SSGLKEEFGDIE----ADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
              ++ E   IE    A  P    L     ++L  +++   L L G      E   + ++
Sbjct: 370 PVEVEIEGETIEIPHFAPRPLKNLLLVDEMESLSPILDMRVLDLAG------EETPQIYA 423

Query: 503 FAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHKSS 558
                 L   GP   L+   +GL +            +   + ELP     +WTV     
Sbjct: 424 ------LCGKGPRSTLRTLRHGLAV------------AEMAVSELPSNPLAVWTV----- 460

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
           +G + D++           Y++++    T+VL   D + EVT+S  +    +T++  +L 
Sbjct: 461 KGSSKDAA---------DKYIVVTFANATIVLSIGDTVEEVTDS-GFLATNKTLSV-SLL 509

Query: 619 GRRRVIQVFERGAR 632
           G   ++QV   G R
Sbjct: 510 GDDSLLQVHPNGLR 523


>gi|402595041|gb|EJW88967.1| hypothetical protein WUBG_00126 [Wuchereria bancrofti]
          Length = 621

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 87/361 (24%), Positives = 139/361 (38%), Gaps = 100/361 (27%)

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
           +W   NL  +A  +++VP P+GG L+ G + I YH +    AL    YA    S      
Sbjct: 201 LWKHDNLEGEANIVISVPEPVGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLT 412
            +    +D     +L  D+A      G+L +L           L+L K      N +V+ 
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLL----------LELGKDQEQDENSAVIV 300

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTC--GSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
            D+             LG++ + +  C   +G   + S     FGD         +L R 
Sbjct: 301 RDMKV---------ESLGETCIAECMCYLDNGVCFIGS----RFGD--------SQLIRL 339

Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA-- 528
           S++                       A  T   ++ DS  N+ P++D +  +R N     
Sbjct: 340 STEP---------------------RADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQQI 377

Query: 529 -SATGISKQSNY----------EL--VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
            + +G  K              EL  VEL G K ++T+    +RG            DE+
Sbjct: 378 LTCSGAYKDGTIRIIRNGIGIEELASVELKGIKNMFTL---RTRG------------DEF 422

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
             YLI+S ++ T VL       E TE   + V G T+ AG LF  + ++QV      ++D
Sbjct: 423 DDYLILSFDSETHVLFINGEELEDTEITGFAVDGATLWAGCLFHSKTILQVTHGEVILID 482

Query: 636 G 636
           G
Sbjct: 483 G 483


>gi|146420838|ref|XP_001486372.1| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 1206

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 120/610 (19%), Positives = 237/610 (38%), Gaps = 96/610 (15%)

Query: 96  ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
           + +  ++ +CH ++ G ++++  + +GG++     D +++  +  ++S+LEFD       
Sbjct: 53  LESGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD----- 103

Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
              +  F   +  H K G      G  + VDPQ R   +       ++ KA    + L  
Sbjct: 104 --ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL-- 159

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
                       +  +++  I +  LD           GY  P++  +   E  +A   +
Sbjct: 160 -----ELLSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHYDA 201

Query: 276 WKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
              +    S L++     +  L + A     + +P  +  L+ +P+PIGGV+V G++ I 
Sbjct: 202 SLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSFIF 261

Query: 331 YHSQSASCALALNNYAVSLDS-SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
           YH+ +    L L    + L + S  +P   ++V     H     N   LL  + GD   +
Sbjct: 262 YHNPTIDQQLYL---PIPLRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFRV 313

Query: 390 TVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSML 446
            + Y  D   V  L +   +    ++ I        F      D +L Q    G   S +
Sbjct: 314 LIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSYI 373

Query: 447 SSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
           SS       D+ + +     + R   + AL  +++       G+    +ES +   +   
Sbjct: 374 SSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMIT 433

Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADS 565
             S      LK  ++G+ I+               LV  P      +V+          +
Sbjct: 434 GHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF----------T 464

Query: 566 SRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
           +R++A + +   Y++IS  A  +T+VL   +++ EV +S   FV  +        G + +
Sbjct: 465 TRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKSL 521

Query: 624 IQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           IQ++  G R +     +G    +   + P            T++S S     VL+G+S+ 
Sbjct: 522 IQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSNR 573

Query: 679 SIRLLVGDPS 688
            +     DP+
Sbjct: 574 ELCYFEIDPT 583


>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
 gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
          Length = 1103

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 77/358 (21%), Positives = 151/358 (42%), Gaps = 60/358 (16%)

Query: 82  KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAK 141
           KN      R+  +G+S+     V  +   G ++++++    G     ++D + +  ED  
Sbjct: 35  KNQYLQVNRLSEEGVSS-----VVEFEAPGRIDTMSLFRPSG----EKQDLLFITIEDTF 85

Query: 142 ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQ 200
            ++   D  I  L   S+   + P       GR S   G +  +DP  R   + +Y GL 
Sbjct: 86  FTLGFIDGKIETLSSGSI---DDP------VGRRS-ESGSITTIDPLCRAVALSIYEGLL 135

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
            II            ++  F     F+ R+E  +VI++  L+    K          P  
Sbjct: 136 KII--------PFENNKHQFKEA--FNVRLEELNVIDIAFLESLGSK------SKSGPTF 179

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            +L++  +          H       ++   +++  L  + +N+ H A  L+ VP+P+GG
Sbjct: 180 ALLYQDHV-------GSRHVKTYEVKTLDKDMEESSL--NQLNVDHGANILIPVPAPLGG 230

Query: 321 VLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
           V+ VG   + Y ++S        N++V+  ++  +   S+  +LD  +  W   D     
Sbjct: 231 VICVGEAQVSYINESN------KNHSVASPANSRMAIRSYG-KLD--NTRWFLGD----- 276

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            ++G L LL++      V  L L +   + ++S I+ + N   F+GS  GDS +++ +
Sbjct: 277 -QSGQLYLLSLQVSDSEVTGLTLKELGVTSISSCISYLDNGYVFIGSNYGDSQVIRIS 333


>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
           DNA-binding protein 1) (UV-damaged DNA-binding factor)
           (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
           (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
           pigmentosum group E-co... isoform 1 [Ciona intestinalis]
          Length = 1150

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 92/438 (21%), Positives = 167/438 (38%), Gaps = 109/438 (24%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F+ RIE   VI+ +           F+HGY  P +VI+++           +H    I  
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS-------QGRHVKTYIVD 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---- 341
           +     +      W   N+  +A  ++ VP P+ G +++G  +I YH+      +A    
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIAPPQI 253

Query: 342 ---LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYD 394
              +N YA                 +D   + +L  D+A      G L +L +    + D
Sbjct: 254 KDTINCYA----------------PVDKDGSRYLLGDLA------GHLFILLLESDEMMD 291

Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
           G   V+ L +       +   I+ + N + ++GSRLGDS L++                 
Sbjct: 292 GTNTVRDLKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIR----------------- 334

Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVN 511
              +  D+    R + S    L    N G  + +     +     Q  T S A ++    
Sbjct: 335 ---LPTDSSMEGRPKPSLISVLDTYTNLGPIIDMCVVDLDRQGQGQVVTCSGAFKE---- 387

Query: 512 IGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            G L+    G+ I   AS            ++LPG KG+W +          D+SR  +Y
Sbjct: 388 -GSLRIIRNGIGIQEHAS------------IDLPGIKGLWPL-------RVFDTSR--SY 425

Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
           D      L+IS    + +L+ +    E T+   +  + +T    N+    +++Q+ E+  
Sbjct: 426 DT-----LVISFVGHSRILQLSGEEVEETDLPGFDDESQTFYCSNVC-HNQLVQITEKSI 479

Query: 632 RILDGSYMTQDLSFGPSN 649
           R++  +   Q   + P N
Sbjct: 480 RLISHTERRQVHEWKPKN 497


>gi|157873900|ref|XP_001685450.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania major strain Friedlin]
 gi|68128522|emb|CAJ08654.1| cleavage and polyadenylation specificity factor-like protein
            [Leishmania major strain Friedlin]
          Length = 1541

 Score = 51.6 bits (122), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 76/329 (23%), Positives = 141/329 (42%), Gaps = 36/329 (10%)

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFE--GPENTSKSDDPV-STSRSLSVSNVSASRLRN 889
            SA  +   L  IL+ G ++ Y+    +  GP    K    +   +  + V     +R + 
Sbjct: 929  SAAPTEATLVMILSSGELVTYRVVPADANGPRRCVKVIYHILDVAPEVDVMESIKARKKR 988

Query: 890  LRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR---ERLRV 945
            L+  R  L + T ++  H +  +R+  F+ +   ++G ++ G  P + +V+     +L  
Sbjct: 989  LQEERAHLASVT-QQMRHCS--ERLVPFRGLQDRYKGIYVCGQTPVF-LVYHAATNQLVC 1044

Query: 946  HPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL-PSGSTYD-NYWPVQKVIPLKA 1003
                   ++  F   H+ + + GF+Y   +G +    + P G     + W +++V  L  
Sbjct: 1045 TRHHATNAVRGFAPFHSRHVHGGFVYC-GEGFVHFATMQPFGELLGCSGWWLERV-RLGC 1102

Query: 1004 TPHQITYFAEKNLYPLIVS-----VPVLKPLNQVLSLLIDQE---VGHQIDNHNL---SS 1052
            TPHQ+ Y    +   ++ S      P   P +  L ++ D+E   V H I+  +L   S+
Sbjct: 1103 TPHQVIYSPAAHGCFVVASRPQPFSPKRAPFDVQLRMVEDEEGNRVPHVIEPVSLPPLSA 1162

Query: 1053 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKE----- 1107
                   T E YEV+          WQ    + +  +E  L+  ++ +   TT +     
Sbjct: 1163 TSGSPVPTNERYEVQFFSTLN----WQCMGRLVLDGNEKVLSATLMQVTRDTTMDAANRS 1218

Query: 1108 -NETLLAIGTAYVQGEDVAARGRVLLFST 1135
                + A+ TAY  GEDV  RGR+LL +T
Sbjct: 1219 TTAPVCALATAYPLGEDVTTRGRILLLTT 1247



 Score = 44.3 bits (103), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 81/185 (43%), Gaps = 37/185 (20%)

Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
           GGG S  +    V + R  D+K    +++D  FV    EP++  L E++ TWAGRV    
Sbjct: 282 GGGTSLLLRVGTVTHWRLQDVKSALRNIRDVQFVQSAGEPLLAFLFEKQPTWAGRVKLLE 341

Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGV 321
           W+         TC I    ++++ +   H L  S ++ LP+D   +  +P+    P    
Sbjct: 342 WRSKTVESHMLTCSIEWMKVTLANSATPHMLSLSEVDGLPYDVTSMTPLPAFQDLPSAVF 401

Query: 322 LVVGANTIHYHSQSA----------SCALALNNYAVSLD------SSQELPRSSFSVELD 365
            V     +H  ++S             A +L + AVSL+      +SQ L      V L+
Sbjct: 402 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLEAVQWRSASQALSTDLVKVNLN 461

Query: 366 AAHAT 370
            A+AT
Sbjct: 462 FANAT 466


>gi|401883281|gb|EJT47496.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 2479]
          Length = 1216

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 130/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)

Query: 85  GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
           G T+  +L    S   L+ +C     G V ++A     G      +D I+L+ +  ++S+
Sbjct: 34  GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89

Query: 145 LEFDDSIHGLRITSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
           +E       L I+    FES  + ++ K G      G  + VDP+GR    G V    L 
Sbjct: 90  IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
            I+ + ++G                  A    + V+N+   D           GY  P+ 
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183

Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
             L   EL +          A R + KH T     L ++  +++    WS    P D   
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233

Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
            L V  P             GGVLV   + +   H  +++    +      ++       
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289

Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
            +SS  + + AA    ++     LL ++ GDL   T+ ++G  V+ L +   +   + + 
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
           +  + +   F+ S  GD  L QF            S T     GL EE          P 
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
              L   +  +L  +++ + ++L G+AS+  +     ++   R      GP    +   +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458

Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
           GL IN               LVE  LPG    +WT+                + DDEY +
Sbjct: 459 GLDINV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           Y+++S    T+VL   + + EV ++  +   G T+A   L G   ++QV   G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543


>gi|348667612|gb|EGZ07437.1| hypothetical protein PHYSODRAFT_565381 [Phytophthora sojae]
          Length = 1197

 Score = 50.8 bits (120), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 68/301 (22%), Positives = 116/301 (38%), Gaps = 76/301 (25%)

Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV- 376
           VLV+G NT+ Y ++     +CA+            Q  PR    V    + AT  Q D+ 
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAIP---------RRQGEPRDIVIV----SAATHKQRDLF 293

Query: 377 -ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
             LL ++ GDL  +++ Y G  V+ + +   +   + S +      L F  S   +  L 
Sbjct: 294 FVLLQSELGDLYKISLDYSGNAVEEIKIQFFDTVPVASSMCITKTGLLFCASEFSNHYLF 353

Query: 436 QF-TCGSGT------------SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
           QF + G G             + LS+    +  +++  A S   L   +   + D+ N +
Sbjct: 354 QFLSIGEGDDTAKCSSLAMDPTELSTFPLRKLTNLQL-ASSMPSLSPVTQLLVDDLANEQ 412

Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
              +Y    N+  S+                 L+   +GL I   A++            
Sbjct: 413 TPQMYALCGNSNRSS-----------------LRVLRHGLPITEMAASA----------- 444

Query: 543 ELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
            LPG  K +W +                +Y D Y  Y+++S E  T+VLE  + + EVT+
Sbjct: 445 -LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETVEEVTQ 489

Query: 602 S 602
           S
Sbjct: 490 S 490


>gi|432851195|ref|XP_004066902.1| PREDICTED: DNA damage-binding protein 1-like [Oryzias latipes]
          Length = 1140

 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 88/426 (20%), Positives = 160/426 (37%), Gaps = 78/426 (18%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S        + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVTVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PILDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G  +D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRESDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
               E TE   +    +T   GN+   +++IQ+     R++  D   +  +    P    
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLVLQDSKALVSEWK-EPQGRN 497

Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSS 709
               + N T + +++      L +  G ++ +        V+    TP    SS+ P+ +
Sbjct: 498 ISVAACNHTQVVLAVGRALYYLQILAGELKQISTTEMEHEVACLDITPLGESSSESPLCA 557

Query: 710 CTLYHD 715
             L+ D
Sbjct: 558 VGLWTD 563


>gi|406698009|gb|EKD01256.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 8904]
          Length = 1216

 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 129/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)

Query: 85  GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
           G T+  +L    S   L+ +C     G V ++A     G      +D I+L+ +  ++S+
Sbjct: 34  GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89

Query: 145 LEFDDSIHGLRITSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
           +E       L I+    FES  + ++ K G      G  + VDP+GR    G V    L 
Sbjct: 90  IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
            I+ + ++G                  A    + V+N+   D           GY  P+ 
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183

Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
             L   EL +          A R + KH T     L ++  +++    WS    P D   
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233

Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
            L V  P             GGVLV   + +   H  +++    +      ++       
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289

Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
            +SS  + + AA    ++     LL ++ GDL   T+ ++G  V+ L +   +   + + 
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
           +  + +   F+ S  GD  L QF            S T     GL EE          P 
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
              L   +  +L  +++ + ++L G+AS+  +     ++   R      GP    +   +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458

Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
           GL +N               LVE  LPG    +WT+                + DDEY +
Sbjct: 459 GLDVNV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           Y+++S    T+VL   + + EV ++  +   G T+A   L G   ++QV   G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543


>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
           Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
 gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
           Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
          Length = 1159

 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 96/461 (20%), Positives = 172/461 (37%), Gaps = 116/461 (25%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP+ R  G+ +Y     ++   +    L            F+ R+E  HVI+++     
Sbjct: 143 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 187

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
                 F++G   P +  +++      GR     H              + P  W   N+
Sbjct: 188 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 231

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
             +A  ++AVPSP GG +++G  +I YH+     A+A             + + S  V  
Sbjct: 232 EAEASMVIAVPSPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 280

Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
             +D   + +L  D+       G L +L +      DG V ++ L +     + +   +T
Sbjct: 281 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 334

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            + N + F+GSRLGDS LV+    S   G+ +++       G I  D       R+    
Sbjct: 335 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 393

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            +                        T S A ++     G L+    G+ I+  AS    
Sbjct: 394 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 420

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +    +R              E    L++S   +T VL   
Sbjct: 421 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 458

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE + +    +T   GN+   +++IQ+     R++
Sbjct: 459 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 498


>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis
           niloticus]
          Length = 1140

 Score = 50.4 bits (119), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 74/341 (21%), Positives = 129/341 (37%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S        + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVAVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+     R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479


>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
 gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
          Length = 1230

 Score = 50.1 bits (118), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 127/607 (20%), Positives = 220/607 (36%), Gaps = 120/607 (19%)

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
           L+       L L+    + G +  L      G++    +D +++  +  ++++L+F +  
Sbjct: 41  LLRADKQGKLNLIASKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLTILQFSNEK 96

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
           +      +HC       + K G      G  + VDP+GR   +        VY L     
Sbjct: 97  NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149

Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
            Q+ I              D  G   GF   + +S   N   LD K V +   +  Y   
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEALD-KQVTNTSEIDSYTRK 208

Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            ++ L      W   +   H                   +      P DA   L +P P 
Sbjct: 209 TLLSL------WEMDLGLNH-------------------VIRKYTFPIDASAHLLIPIPG 243

Query: 319 G-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           G     GV+V   N + Y         CA     Y   L++ QE   S     L      
Sbjct: 244 GQQGPSGVIVCCDNFLVYKKVDHADVYCA-----YPRRLETGQEKNLSIVCSTLHRIRKF 298

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
           +      L+ ++ GDL  + + ++  VV+ +     +   + + I  + +   F+ +  G
Sbjct: 299 FF----ILIQSELGDLYKIEMEHEDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354

Query: 431 DSLLVQFTCGSG----TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--L 484
           +    QF+ G G     +M +S  K   G     A  TK+L   ++  L D V      L
Sbjct: 355 NHFFYQFS-GIGDEDNEAMCTS--KHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPIL 408

Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYEL 541
            +    + N  S Q         +L   GP   L+   +GL I   A             
Sbjct: 409 DMKVIDAKNASSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN----------- 450

Query: 542 VELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
            ELPG  K IWT+     +  NA          +Y  Y+I+S E  T++LE  + + EV 
Sbjct: 451 -ELPGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVV 495

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
           +S+   +   T    N+     +IQV + G R ++G  + + +   P N +  + + N  
Sbjct: 496 DSL--LLTNVTTIHVNILYDNSLIQVHDAGIRHINGKVIHEWVP--PKNKQIKAATSNCA 551

Query: 661 VLSVSIA 667
            + +S++
Sbjct: 552 QIVISLS 558


>gi|402222132|gb|EJU02199.1| hypothetical protein DACRYDRAFT_21931 [Dacryopinax sp. DJM-731 SS1]
          Length = 1209

 Score = 50.1 bits (118), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 87/376 (23%), Positives = 150/376 (39%), Gaps = 83/376 (22%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ ++   V+ + +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIDHEDEEVKTMKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 -TCGSGTSML--SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN-- 492
              G     +  SS    + G  +    +  R R   +  L D +N  +  +    +N  
Sbjct: 368 QKLGDDDDEIEYSSVSYPDNGMADPIPQAYFRPRPLENLVLADELNSFDPIVDAKVTNLL 427

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           NT++ Q  F+   R +  +   L+   +GL +    S+            ELPG    +W
Sbjct: 428 NTDTPQ-IFAACGRGARSSFRMLR---HGLDVEETVSS------------ELPGIPNAVW 471

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  K+              DD+Y AY+I+S    T+VL   + + EV+++  +     T
Sbjct: 472 TVKLKA--------------DDQYDAYIILSFVNGTLVLSIGETIEEVSDT-GFLSSSPT 516

Query: 612 IAAGNLFGRRRVIQVFERGAR------------------ILDGS---------------- 637
           IA   + G   ++QV+  G R                  I+  +                
Sbjct: 517 IAVQQI-GEDSLLQVYPHGIRHVLSDRRVNEWRCPQHTTIVAATTNSRQVAIALSSAQLV 575

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDPST 689
           Y   DL  G  N      S  S VL++SIA+        PY+ +G  D ++R++  DP T
Sbjct: 576 YFELDLE-GQLNEYQDRKSLGSGVLAMSIAEVPEGRQRTPYLAVGCEDQTVRIISLDPDT 634

Query: 690 C--TVSVQTPAAIESS 703
               +S+Q   A  SS
Sbjct: 635 TLENISLQALTAPPSS 650


>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1223

 Score = 49.7 bits (117), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 62/262 (23%), Positives = 113/262 (43%), Gaps = 35/262 (13%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           GP+  VDP+ R  G+ +Y     ++   Q G               FS R+E   V +++
Sbjct: 130 GPIGAVDPECRMYGLHLYDGLFKVIPMDQTGQ----------LREAFSVRLEELQVFDVK 179

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
                      F+ G  +P + +L++   T  GR    +  C+            +P  W
Sbjct: 180 -----------FLAGTPKPTIAVLYQD--TKEGRHIKTYEVCLKDK-------DFNPGPW 219

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
           +  ++   +  L+AVP+P+GGV+VVG   I Y ++  +  +              + ++ 
Sbjct: 220 AQNDVESGSRFLIAVPAPLGGVVVVGEKVIAYLNKETTHGVGDGGGGGGGGGGGMIVKA- 278

Query: 360 FSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
            +++ DA   T+   D      LLS   G L LL +++D   V+ L L     + + S +
Sbjct: 279 IAMQSDATIMTYGAVDKDGSRYLLSDSAGRLHLLVLMHDKTRVRALKLESLGQTSIASSL 338

Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
           + + N + ++GS  GDS LV+ 
Sbjct: 339 SYLDNGVVYVGSAYGDSQLVRL 360


>gi|68531971|ref|XP_723667.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23478038|gb|EAA15232.1| Drosophila melanogaster CG13900 gene product [Plasmodium yoelii
           yoelii]
          Length = 1235

 Score = 49.3 bits (116), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 62/295 (21%), Positives = 121/295 (41%), Gaps = 43/295 (14%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L+ ++ GDL  + V ++  +V+ +     +   + + I  + +   F+ +  G+    QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361

Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
           +     S  +M +S      G     A  T++L+     D +  +    ++ +  + ++N
Sbjct: 362 SGIGNDSNDAMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 419

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWT 552
                       R SL      +   +GL I   A+             ELPG  + IWT
Sbjct: 420 LPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 461

Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
           V   +S               EY  Y+I+S E  T++LE  + + EV +S+   +   T 
Sbjct: 462 VKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 505

Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
              NL      IQV++ G R ++G  + + +   P N +  + + N + + VS++
Sbjct: 506 IHINLLYDNSFIQVYDTGIRHINGKIVQEWIP--PKNKQINAATSNGSQIVVSLS 558


>gi|324502823|gb|ADY41238.1| DNA damage-binding protein 1, partial [Ascaris suum]
          Length = 1129

 Score = 49.3 bits (116), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 65/144 (45%), Gaps = 12/144 (8%)

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           P +W   N+  +A  ++ +P P GGV+VVG   I YH  +       N Y+         
Sbjct: 200 PPLWKQDNIEAEACMVIPIPQPYGGVIVVGHEAISYHKDA-------NAYSAIAPPLIHQ 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVL-LTVVYDGRV-VQRLDLSKTNPSVLTS 413
            + S   ++D     +L  D   LS +   L+L L V  DG   V+ L +     + +  
Sbjct: 253 SQISCYGKIDRDGQRYLLGD---LSGRIFMLLLDLDVATDGTASVKDLKVELLGETSIPE 309

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            +  + N + F+GSR GDS LV+ 
Sbjct: 310 CVVYLDNGVVFIGSRFGDSQLVRL 333


>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
 gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
          Length = 1135

 Score = 49.3 bits (116), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 128/622 (20%), Positives = 227/622 (36%), Gaps = 165/622 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ ++  +   L+ V  + L+G +  + +        + ++D + +  E     +LE+ D
Sbjct: 43  RIEVNLSTQTGLKPVTEFNLYGRIAVIEVFRY----KNEKKDCLFILTESCYACILEYVD 98

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
              G  IT         +  ++    S ++ G    VDP+ RC  + +Y   + I+  + 
Sbjct: 99  ---GKIITRA-------YGDMRDKNYSVSQSGMHACVDPEARCIALRLYDGVLKIINLNS 148

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
               L   E           RIE   V+           D  F+H   +P + +L++   
Sbjct: 149 SSKHLTSAEQ----------RIEEILVV-----------DMCFLHTANKPTLALLYDDN- 186

Query: 269 TWAGRVSWKH-HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
                 S +H  T  I+  +  +    H   +    +  D   ++AVP P+ G+L++G  
Sbjct: 187 ------SSRHLSTIAITLDNSGSGASIHKGPFRHTQVEQDTILIVAVPEPLAGILLLGHV 240

Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
            I YH        ++ N                 V+      T +     L     G+L 
Sbjct: 241 NITYHDSKNRSTCSIENI----------------VKRTIECVTPIDKHRYLCGDSNGELF 284

Query: 388 LLTVVYDGRVV----QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           LL + Y+   +     RL       + L + ++ I N + F+GS  GDS L++       
Sbjct: 285 LLLLDYNENRIPEERMRLATKYLGRTTLPNTLSYIDNYVVFVGSTFGDSELIRI------ 338

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
                    E  D                                   NN  S Q   S 
Sbjct: 339 ---------EVSD-----------------------------------NN--SGQHFTSL 352

Query: 504 AVRDSLVNIGPLKDFSY--------GLRINADASATGISKQ--------SNYELVELPGC 547
              D+L   GP+KD           G  + A    TG S +          Y  ++L G 
Sbjct: 353 HQYDNL---GPIKDMCIVDFEKQGQGQLVTASGVGTGGSLRIIRNGVGIHEYASIDLEGV 409

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMV--LETADLLTEVTESVDY 605
           KG+W + + SS      S++  +        L++S   +T+   LE  D +TEV E +  
Sbjct: 410 KGLWALKYLSS------STKQDS--------LLLSFVGQTIFLRLEGQD-VTEV-EEIPG 453

Query: 606 FVQG-RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
           F  G +T+ AGN+   ++ +Q+ E+  R++    +    S+ P         EN+ +   
Sbjct: 454 FTNGEQTMYAGNV-TDQQFLQITEKQVRLIADESLKG--SWEP--------EENTQINLC 502

Query: 665 SIADPYVLLGMSDGSIRLLVGD 686
           S+    VLLG+   +I L + D
Sbjct: 503 SVNKNQVLLGVGSTAIYLEIND 524


>gi|268536658|ref|XP_002633464.1| C. briggsae CBR-DDB-1 protein [Caenorhabditis briggsae]
          Length = 1134

 Score = 48.9 bits (115), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 71/137 (51%), Gaps = 19/137 (13%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           D+  L+ VP+P+GGV+V+GAN+  Y +   +  +    Y+ SL     L  + F+    +
Sbjct: 210 DSQVLIPVPAPVGGVIVLGANSALYKASDVNGDVV--PYSCSL-----LKNTIFTCHGIV 262

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV-VYDGR---VVQRLDLSKTNPSVLTSDITTIGN 420
           DA+       D  LL+   G L++L + + +GR    V+ + +     + +   +  + N
Sbjct: 263 DAS------GDRFLLADTDGRLLMLLLNIGEGRSGTTVKEMRIEYLGETSVADSVNYVDN 316

Query: 421 SLFFLGSRLGDSLLVQF 437
            + F+GSRLGDS L++ 
Sbjct: 317 GVVFVGSRLGDSQLIRL 333


>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium
           falciparum 3D7]
 gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium
           falciparum 3D7]
          Length = 1329

 Score = 48.5 bits (114), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 80/379 (21%), Positives = 147/379 (38%), Gaps = 64/379 (16%)

Query: 304 LPHDAYKLLAVPSPIG-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQEL 355
           LP D    L +P P G     GVL+   N + Y     +   CA     Y   L+  Q+ 
Sbjct: 261 LPIDITAHLLIPLPGGQQGPSGVLICCENFLVYKKVDHEDIYCA-----YPRRLEIGQDK 315

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
             S     +      +      L+ ++ GDL  + V ++  +V+ +     +   + + I
Sbjct: 316 NISIICWTMHRIKKFFF----ILIQSEYGDLYKIEVDHEDGIVKEIVCKYFDTVPIGNSI 371

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           + + +   F+ +  G+    QF+  G              G     A  T +L+      
Sbjct: 372 SVLKSGSLFVAAEFGNHYFYQFSGIGDDNKQFMCTSNHPLGKNAIIAFKTNKLKNL---Y 428

Query: 475 LQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADAS 529
           L D +      L +    + NT + Q  ++   R      GP   L+   +GL I   A 
Sbjct: 429 LVDQIYSLSPILDMKIIDAKNTHTPQ-IYTLCGR------GPRSSLRILQHGLSIEELAD 481

Query: 530 ATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
                        ELPG  K IWT+   +                EY  Y+++S E  T+
Sbjct: 482 N------------ELPGKPKYIWTIKKDNL--------------SEYDGYIVVSFEGNTL 515

Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPS 648
           +LE  + + EV++++   +   T    N+      IQV++ G R ++G  + + ++  P 
Sbjct: 516 ILEIGESVEEVSDTL--LLNNVTTLHINILYDNSFIQVYDTGIRHINGKVVQEWVA--PK 571

Query: 649 NSESGSGSENSTVLSVSIA 667
           N +  + S NS+ + +S++
Sbjct: 572 NKQIKAASSNSSQIVISLS 590


>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
           DNA-binding protein 1) (UV-damaged DNA-binding factor)
           (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
           (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
           pigmentosum group E-co... isoform 2 [Ciona intestinalis]
          Length = 1142

 Score = 48.5 bits (114), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 51/217 (23%), Positives = 91/217 (41%), Gaps = 40/217 (18%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F+ RIE   VI+ +           F+HGY  P +VI+++           +H    I  
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS-------QGRHVKTYIVD 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           +     +      W   N+  +A  ++ VP P+ G +++G  +I YH+      +A    
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIA---- 249

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDG-RVVQR 400
              L   Q+       V+ D +   +L  D+A      G L +L +    + DG   V+ 
Sbjct: 250 --PLCFFQDTINCYAPVDKDGSR--YLLGDLA------GHLFILLLESDEMMDGTNTVRD 299

Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L +       +   I+ + N + ++GSRLGDS L++ 
Sbjct: 300 LKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIRL 336


>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
          Length = 1156

 Score = 48.5 bits (114), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 58/267 (21%), Positives = 109/267 (40%), Gaps = 59/267 (22%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G +  +DP  R  G+ +Y GL  II              D  G    +  R+E    + +
Sbjct: 122 GAMAVIDPSARVIGLKLYDGLFKII------------PLDKEGELKAYCLRMEE---VEV 166

Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH-PL 297
           +D+D        F++G   P ++I+H+  +   GR         I A  +S   K+    
Sbjct: 167 QDID--------FLYGCANPTIIIIHQDTM---GR--------HIKAKELSIKDKEFVKT 207

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            W   N+  +A  ++ VP P+ G +++G  ++ YH+ S+  A+          S   + +
Sbjct: 208 PWKQENVETEASMIIPVPEPLCGAIIIGRESVLYHNGSSFIAI----------SPPVIKQ 257

Query: 358 SSFS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL---- 411
           S+      +D     +L  D+A      G L +L + Y+        +      +L    
Sbjct: 258 STIVCYARIDPEGTRYLLGDMA------GHLFMLLLNYEKNPDGTFKIKDPKVDLLGEIS 311

Query: 412 -TSDITTIGNSLFFLGSRLGDSLLVQF 437
               +T + N + ++ SR+GDS L++ 
Sbjct: 312 IPESLTYLDNKIIYVASRVGDSQLIKL 338


>gi|260790329|ref|XP_002590195.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
 gi|229275385|gb|EEN46206.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
          Length = 1152

 Score = 48.1 bits (113), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 59/268 (22%), Positives = 111/268 (41%), Gaps = 56/268 (20%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP+ R  G+ +Y     ++   +    L            F+ R+E  +VI+++     
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNREL----------KAFNIRLEELNVIDVK----- 168

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL-IWSAMN 303
                 F++G   P +V +++             H   +    IS   K+     W   N
Sbjct: 169 ------FLYGCQVPTVVFVYQ-----------DPHGRHVKTYEISVRDKEFSKGPWKQDN 211

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV- 362
           +  +A  ++AVP P  G L++G  +I YH+     A+A             + +S+    
Sbjct: 212 VETEASMVIAVPEPFCGSLIIGQESITYHNGDKYVAVA----------PPAIKQSTLICH 261

Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
             +DA  + +L  D++      G L +L +    + DG V V+ L +     + +   +T
Sbjct: 262 GRVDANGSRYLLGDMS------GRLFMLLLEKEELIDGSVTVKDLKVELLGETSIAECLT 315

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
            + N + +LGSRLGDS L++    +  S
Sbjct: 316 YLDNGVVYLGSRLGDSQLIKLNVDADDS 343


>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
 gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
           strain H]
          Length = 1276

 Score = 48.1 bits (113), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 123/604 (20%), Positives = 222/604 (36%), Gaps = 114/604 (18%)

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
           L+       L L+    + G +  L      G++    +D +++  +  ++ +L+F +  
Sbjct: 41  LLRADKQGKLNLIVSKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
           +      +HC       + K G      G  + VDP+GR   +        VY L     
Sbjct: 97  NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDNK 149

Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
            Q+ I              D  G   GF   + +S   N    D K V +   +      
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEMYD-KQVTNTTEIDACTRK 208

Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            ++ L E +L                   ++  +++H        LP D    L +P P 
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDMSAHLLIPIPG 243

Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           G     GV+V   N + Y         CA     Y   L++ QE    + S+     H  
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSTVHRI 295

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
             +    L+ ++ GDL  + + +   VV+ +     +   + + I  + +   F+ +  G
Sbjct: 296 R-KFFFILIQSEYGDLYKIEMDHQDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354

Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
           +    QF+  G   +      K   G     A  TK+L   ++  L D V      L + 
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411

Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
              + N  S Q  ++   R      GP   L+   +GL I   A              EL
Sbjct: 412 ILDAKNANSPQ-IYALCGR------GPRSSLRILQHGLSIEELADN------------EL 452

Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG  K IWT+     +  NA          +Y  Y+I+S E  T++LE  + + EV +++
Sbjct: 453 PGRPKYIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDTL 498

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
              +   T    N+     +IQV + G R ++G  + + +   P N +  + + N+T + 
Sbjct: 499 --LLTNVTTIHVNILYDNSLIQVHDTGIRHINGKVINEWVP--PKNKQVKAATSNATQIV 554

Query: 664 VSIA 667
           +S++
Sbjct: 555 ISLS 558


>gi|47230701|emb|CAF99894.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 953

 Score = 48.1 bits (113), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 71/341 (20%), Positives = 130/341 (38%), Gaps = 64/341 (18%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 68  WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 123

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 124 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 173

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+       S L         +++++   +      +  
Sbjct: 174 CLTYLDNGVVFVGSRLGDSQLVKVRVTHSLSEL---------NVDSNDQGSFVTVMETFT 224

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            L  +V+   + L         +    F           G L+    G+ I+  AS    
Sbjct: 225 NLGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS---- 271

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  ++ R              E    L++S   +T VL  +
Sbjct: 272 --------IDLPGIKGLWPLRSEAGR--------------ETDDMLVLSFVGQTRVLMLS 309

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+    ++IQ+     R++
Sbjct: 310 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 349


>gi|68075683|ref|XP_679761.1| splicing factor 3b, subunit 3, 130kD [Plasmodium berghei strain
           ANKA]
 gi|56500578|emb|CAH95367.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium berghei]
          Length = 1216

 Score = 48.1 bits (113), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 62/297 (20%), Positives = 120/297 (40%), Gaps = 48/297 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L+ ++ GDL  + V ++  +V+ +     +   + + I  + +   F+ +  G+    QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361

Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
           +     S  SM +S      G     A  T++L+      L D +    +          
Sbjct: 362 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNL---YLVDQIYSLPIVDMKILDAKN 416

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +  + ++   R      GP   L+   +GL I   A+             ELPG  + I
Sbjct: 417 SNIPQIYALCGR------GPRSSLRILQHGLSIEELANN------------ELPGKPRYI 458

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WT+   +S               EY  Y+I+S E  T++LE  + + EV +S+   +   
Sbjct: 459 WTIKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNV 502

Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           T    NL      IQV++ G R ++G  + + +   P N +  + + N + + +S++
Sbjct: 503 TTIHINLLYDNSFIQVYDTGIRHINGKIVQEWVP--PKNKQINAATSNGSQIVISLS 557


>gi|17541566|ref|NP_502299.1| Protein DDB-1 [Caenorhabditis elegans]
 gi|74965443|sp|Q21554.2|DDB1_CAEEL RecName: Full=DNA damage-binding protein 1; AltName:
           Full=Damage-specific DNA-binding protein 1
 gi|5824558|emb|CAA92824.2| Protein DDB-1 [Caenorhabditis elegans]
          Length = 1134

 Score = 48.1 bits (113), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 100/396 (25%), Positives = 166/396 (41%), Gaps = 96/396 (24%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           D+  L+ VP  IGGV+V+G+N++ Y        +    Y  SL     L  ++F+    +
Sbjct: 210 DSSVLIPVPHAIGGVIVLGSNSVLYKPNDNLGEVV--PYTCSL-----LENTTFTCHGIV 262

Query: 365 DAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
           DA+   +L      LS   G L++L    T    G  V+ + +     + +   I  I N
Sbjct: 263 DASGERFL------LSDTDGRLLMLLLNVTESQSGYTVKEMRIDYLGETSIADSINYIDN 316

Query: 421 SLFFLGSRLGDSLLVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
            + F+GSRLGDS L++  T  +G S   S + E + +I                 ++DMV
Sbjct: 317 GVVFVGSRLGDSQLIRLMTEPNGGSY--SVILETYSNI---------------GPIRDMV 359

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
             E         ++ +    T + A +D     G L+    G+ I+  AS          
Sbjct: 360 MVE---------SDGQPQLVTCTGADKD-----GSLRVIRNGIGIDELAS---------- 395

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
             V+L G  GI+ +   S    NAD+            Y+I+SL   T VL+      E 
Sbjct: 396 --VDLAGVVGIFPIRLDS----NADN------------YVIVSLSDETHVLQITGEELED 437

Query: 600 TESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
            + ++      TI A  LFG      ++Q  E+  R++  S +++   + P+N E  S  
Sbjct: 438 VKLLEINTDLPTIFASTLFGPNDSGIILQATEKQIRLMSSSGLSK--FWEPTNGEIISK- 494

Query: 657 ENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
                +SV+ A+  ++L   D ++ LL     TC V
Sbjct: 495 -----VSVNAANGQIVLAARD-TVYLL-----TCIV 519


>gi|195996153|ref|XP_002107945.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
 gi|190588721|gb|EDV28743.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
          Length = 1134

 Score = 47.8 bits (112), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 49/200 (24%), Positives = 91/200 (45%), Gaps = 29/200 (14%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
           L+   V D  F++G+ EP + +++E   +   R    +   + +A      + + P  W+
Sbjct: 158 LEELQVLDVKFLYGFTEPTIALIYE---SGQNRYLKTYEISLQNA-----DIHRQP--WN 207

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
              +  +A+ +L VP P  G++V+GA +I Y+    S    L+    SL       R + 
Sbjct: 208 IGKVEEEAFMILPVPPPSCGMVVIGAGSISYYKGQDS----LHITPASLKD-----RITC 258

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----GRVVQRLDLSKTNPSVLTSDIT 416
              +D+    +L  D +      G L +L +V +    G  V+ L L     + + S IT
Sbjct: 259 FGRVDSNGCRYLLGDYS------GRLFMLILVQEHSQSGIKVKDLCLEYLGETSIPSCIT 312

Query: 417 TIGNSLFFLGSRLGDSLLVQ 436
            + N+  ++GS  GDS L++
Sbjct: 313 YLDNAFAYIGSSCGDSQLIK 332


>gi|390357128|ref|XP_001198237.2| PREDICTED: splicing factor 3B subunit 3-like [Strongylocentrotus
           purpuratus]
          Length = 949

 Score = 47.4 bits (111), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 61/259 (23%), Positives = 100/259 (38%), Gaps = 39/259 (15%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + +   +   + + +  +     F+ S  G+  L Q 
Sbjct: 34  LAQTEQGDIFKITLETDDDMVTEIRMKYFDTVPVATSMNVLKTGFLFIASEYGNHYLYQI 93

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS    E GD    AP T R        L+++   E LS   S     
Sbjct: 94  AHLGDDDDEPEFSSATPLEEGDTFFFAPRTLR-------NLEEVDQLESLSPILSCQIAD 146

Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTV 553
            +++ T    V         ++   +GL +            S   + ELPG    +WTV
Sbjct: 147 LASEDTPQLYVACGRGPRSSMRVLRHGLEV------------SEMAVSELPGNPNAVWTV 194

Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
             KS              DDEY AY+I+S    T+VL   + + EVT+S   F+      
Sbjct: 195 KKKS--------------DDEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTPTL 238

Query: 614 AGNLFGRRRVIQVFERGAR 632
           + +L G   ++Q++  G R
Sbjct: 239 SSSLIGDDALLQIYPDGIR 257


>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
          Length = 1140

 Score = 47.4 bits (111), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 73/341 (21%), Positives = 127/341 (37%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGD  LV+    S        + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDPQLVKLNVDSNDQGSFVTVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+    ++IQ+     R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 479


>gi|312076588|ref|XP_003140928.1| xeroderma Pigmentosum Group E Complementing protein [Loa loa]
          Length = 516

 Score = 47.0 bits (110), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
           +W   NL  +A  ++ VP P GG L+ G + I YH +    AL    YA    S      
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
            +    +D     +L  D+A      G+L +L + + G+           V+ + +    
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
            + +   +  + N + F+GSR GDS L++ +      GT  +S  L + + ++   AP  
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362

Query: 465 KRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                     ++DM     NG++  L  S +    + +   +    + L +         
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
                                VEL G K ++T+  +               D E+  YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
           +S ++ T VL       E T+   + V G T+ AG LF    ++QV      ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483


>gi|68060004|ref|XP_671977.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56488645|emb|CAI04030.1| hypothetical protein PB301494.00.0 [Plasmodium berghei]
          Length = 346

 Score = 47.0 bits (110), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 57/269 (21%), Positives = 108/269 (40%), Gaps = 41/269 (15%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L+ ++ GDL  + V ++  +V+ +     +   + + I  + +   F+ +  G+    QF
Sbjct: 90  LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 149

Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
           +     S  SM +S      G     A  T++L+     D +  +    ++ +  + ++N
Sbjct: 150 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 207

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWT 552
                       R SL      +   +GL I   A+             ELPG  + IWT
Sbjct: 208 IPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 249

Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
           +   +S               EY  Y+I+S E  T++LE  + + EV +S+   +   T 
Sbjct: 250 IKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 293

Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQ 641
              NL      IQV++ G R ++G  + +
Sbjct: 294 IHINLLYDNSFIQVYDTGIRHINGKIVQE 322


>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
           nagariensis]
 gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
           nagariensis]
          Length = 1104

 Score = 47.0 bits (110), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 94/242 (38%), Gaps = 55/242 (22%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL  + G + LL + +DG  V  L       +   S +  + + L F+GSR GDS LV+ 
Sbjct: 281 LLGNRQGGMQLLVLAHDGSRVSGLRTEPLGYTCAPSCLAYLDSGLTFVGSRSGDSQLVRI 340

Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
           +                     + P T      S  +L  +V+   + L           
Sbjct: 341 SAQP-----------------VNQPPTYLELVDSFPSLAPIVDFVVMDL---------ER 374

Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
           Q      +   + + G L+    G+ IN  A+            VELPG KG+W++    
Sbjct: 375 QGQGQLVMCSGIDSDGSLRVVRNGIGINRQAT------------VELPGIKGVWSL---- 418

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
            R H         YDDEY  YL+++   E R + L T + L E  E   +    +T+  G
Sbjct: 419 -RSH---------YDDEYDKYLLLTFVGETRLLALNTEEELDE-AELPGFDSGSQTLWCG 467

Query: 616 NL 617
           N+
Sbjct: 468 NM 469


>gi|400597418|gb|EJP65151.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1212

 Score = 47.0 bits (110), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 143/614 (23%), Positives = 217/614 (35%), Gaps = 133/614 (21%)

Query: 64  NVIEIYVVRVQEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
           NV++   V  Q  G+KE      SG     +  D      + L+ H  + G + S+A+  
Sbjct: 19  NVVQ--AVLGQFAGTKEQLIITGSGSQLTILRPDPAQGKVIPLLSH-DIFGVLRSIAVFR 75

Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             G+     +D IILA +  +I+VLE+  S +      M  F        K G      G
Sbjct: 76  LAGSS----KDYIILATDSGRITVLEYLPSPNRFSRLHMETFG-------KTGIRRVVPG 124

Query: 181 PLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             +  DP+GR      V    L  ++ + SQ        E T  S     A      VI 
Sbjct: 125 EYLACDPKGRACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIA 175

Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMISALSISTTLK 293
           L  LD+          GY  PV   L     E +    G    +  T ++    +   L 
Sbjct: 176 LTALDV----------GYANPVFAALEIDYTEVDQDNTGEALSEVETHLVY-YELDLGLN 224

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAV 347
                WS    P D    L    P G     GVLV G   + Y HS   +  + +     
Sbjct: 225 HVVRKWSD---PVDPTASLLFQVPGGNDGPSGVLVCGEENVTYRHSNQDALRVPIPRRR- 280

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYDGR----- 396
               + E P    ++     H   L+        LL T  GDL  +T+  V D       
Sbjct: 281 ---GATEDPSRKRNIVAGVMHK--LKGSAGAFFFLLQTDDGDLFKITIDMVEDEEGAPTG 335

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
            VQR+ +   +   + + +  + +   ++ S+ G+    QF              E+ GD
Sbjct: 336 EVQRMKIKYFDTVPVATSLCILKSGFLYVASQFGNYAFYQF--------------EKLGD 381

Query: 457 IEADAPSTKRLRRSSSDALQD-MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
                     L  SS D   D +   E +  Y   + N          A+ DS+  + PL
Sbjct: 382 ------DDDELEFSSDDFPVDPLAAYEPVYFYPRPAEN---------LALVDSIPAMNPL 426

Query: 516 KDFSYGLRINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSS 558
            D         DA    S  G   +S +  +            ELPG    +WT+   S 
Sbjct: 427 LDCKVANLTGEDAPQIYSICGNGARSTFRTIKHGLEVNEIVASELPGVPSAVWTLKLNS- 485

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
                        D++Y  Y+++S    T+VL   + + EV++S  +     TIAA  L 
Sbjct: 486 -------------DEQYDTYIVLSFTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLL 530

Query: 619 GRRRVIQVFERGAR 632
           G   +IQV  RG R
Sbjct: 531 GTDGLIQVHPRGIR 544


>gi|328869269|gb|EGG17647.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
          Length = 1194

 Score = 47.0 bits (110), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 114/567 (20%), Positives = 211/567 (37%), Gaps = 111/567 (19%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           +  L+ V +    G + S+A     G      +D +I+  +  ++ +LE++ S +     
Sbjct: 46  SGRLDHVLYSEAFGVIRSIAPFRLTGGS----KDYLIVGSDSGRVVILEYNPSKNVFEKV 101

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLV 214
               F        + G      G  +  DP+GR    G +    L  I+ + SQ    + 
Sbjct: 102 HQETFG-------RSGCRRIVPGQYISTDPKGRAFMIGAIEKQKLVYILNRDSQAKLSI- 153

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM--VILHERELTWAG 272
                        A    + V ++  +D+          G+  P+   + +   E T   
Sbjct: 154 --------SSPLEAHKAHTIVFSMCGVDV----------GFENPIFATISVDYSEETNIE 195

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGGVLVVGANT 328
            V   H+T +++   +   L      WS   +   A  +++VP     P GGVLV     
Sbjct: 196 DVEETHNTKVLTFYELDLGLNNVVRKWSE-EVDRSANLVVSVPGGSDGP-GGVLVCAQGR 253

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQNDVA--LLSTK 382
           ++Y +   +            D S  +PR +   E    +  +HA+  Q D+   L+ ++
Sbjct: 254 VYYRNIGHA------------DISVSIPRRNGMTEEKSLMIVSHASHKQRDMFFFLVQSE 301

Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
            GDL  +T+ Y G +V  + ++  +     + IT + N   F+ S  GD  L  F     
Sbjct: 302 YGDLYKITLDYSGEMVSGMQIAYFDTFPTANCITMLKNGFLFVASEFGDHGLYLFK---- 357

Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
               S GL         DAP+      +     + +     L L  + S    S      
Sbjct: 358 ----SLGLD--------DAPTASSAGNTEMVFFEPVFEPRNLVLTATIS----SLSPIVD 401

Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------ELPGC-KG 549
           F V D L   G  + ++           +G+S+++N  ++            +LPG   G
Sbjct: 402 FKVAD-LAQEGTPQMYAL----------SGVSERANLRVLRHGLPITQMVDSQLPGTPAG 450

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE----SVDY 605
           IWT+    +   N     +   +     Y+++S    T+VL   + + EV +    S   
Sbjct: 451 IWTIPQSLTTMRNPQYQGIGTVESPADRYIVVSFVGSTLVLGVGETVEEVQDSGILSTTT 510

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGAR 632
            +  R++ A NL     ++Q+F +G R
Sbjct: 511 TILIRSMGA-NL---DSIVQIFAQGIR 533


>gi|341884150|gb|EGT40085.1| CBN-DDB-1 protein [Caenorhabditis brenneri]
          Length = 1134

 Score = 46.6 bits (109), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 87/354 (24%), Positives = 143/354 (40%), Gaps = 82/354 (23%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           D+  L+ VPSPI GV+V+G +++ Y S         N+  V   SS  L  + F+    +
Sbjct: 210 DSSMLIPVPSPISGVVVLGTHSLLYKSSE-------NDGEVVPYSSPLLENTIFTSHSIV 262

Query: 365 DAAHATWLQNDV--ALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGN 420
           D     ++ +D    LL      ++LL  V +  G  V+ + +     + +   I  I N
Sbjct: 263 DPTGERFIVSDTDGRLL------MLLLNAVENQSGLSVKEIRIDLLGDTSVAESINYIDN 316

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
            + F+GSR GDS L++       S   S L   +                +   ++DM+ 
Sbjct: 317 GVVFIGSRFGDSQLIRLLSEKTNSSYISVLDTYY----------------NIGPIRDMIM 360

Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
            E         ++ +    T S A +D     G L+    G+ I   A+           
Sbjct: 361 VE---------SDGQPQLVTCSGAEKD-----GSLRVIRNGIGIEELAT----------- 395

Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
            V+LPG  GI+ +   SS    AD+            Y+I+SL   T VL+      E  
Sbjct: 396 -VDLPGVVGIFPIRLDSS----ADN------------YVIVSLVEETHVLQITGEELEDV 438

Query: 601 ESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSE 651
           + +       T+ AG LFG      V+QV ER  R++    +++   + P+N E
Sbjct: 439 QFLQIDTALPTMFAGTLFGPNDSGLVVQVTERQVRLMSNGGLSK--FWEPANGE 490


>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
          Length = 1145

 Score = 46.6 bits (109), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
           +W   NL  +A  ++ VP P GG L+ G + I YH +    AL    YA    S      
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
            +    +D     +L  D+A      G+L +L + + G+           V+ + +    
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
            + +   +  + N + F+GSR GDS L++ +      GT  +S  L + + ++   AP  
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362

Query: 465 KRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                     ++DM     NG++  L  S +    + +   +    + L +         
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
                                VEL G K ++T+  +               D E+  YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
           +S ++ T VL       E T+   + V G T+ AG LF    ++QV      ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483


>gi|301110252|ref|XP_002904206.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
 gi|262096332|gb|EEY54384.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
          Length = 1197

 Score = 46.6 bits (109), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 68/306 (22%), Positives = 114/306 (37%), Gaps = 86/306 (28%)

Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVE--LDAAHATWLQND 375
           VLV+G NT+ Y ++     +CA+               PR        +  + AT  Q D
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAI---------------PRREGEHRDIIIVSAATHKQRD 291

Query: 376 V--ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
           +   LL ++ GDL  +++ Y G VV+ + +   +   + S +      L F  S   +  
Sbjct: 292 LFFVLLQSELGDLYKISLDYSGNVVEEIKIQFFDTIPVASSMCITKTGLLFCASEFSNHY 351

Query: 434 LVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA---------------LQD 477
           L QF + G G        K     ++    ST  LR+ ++ A               + D
Sbjct: 352 LFQFLSIGEG----DDAAKCSSLAMDPTEFSTFPLRKLTNLALASSSASLSPVTQLLVDD 407

Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
           + N +   +Y    NN  S+                 L+   +GL I   A++       
Sbjct: 408 LANEQTPQMYALCGNNNRSS-----------------LRVLRHGLPITEMAASA------ 444

Query: 538 NYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
                 LPG  K +W +                +Y D Y  Y+++S E  T+VLE  + +
Sbjct: 445 ------LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETV 484

Query: 597 TEVTES 602
            EV +S
Sbjct: 485 EEVAQS 490


>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
 gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
          Length = 1170

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 81/340 (23%), Positives = 136/340 (40%), Gaps = 48/340 (14%)

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
           +P+DA  L  VP  IGGVLV GAN I Y          L N ++ L   +   ++S  + 
Sbjct: 229 VPNDANYLAPVPGHIGGVLVCGANWIMYDK--------LGNESILLPLLRRKDQTSVIIS 280

Query: 364 LDAAHATWLQND--VALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSDITTIG 419
               HA   +N     LL    GDL  L + YD    +++ ++++  +   +  ++    
Sbjct: 281 -HVTHALKKKNYGFFILLQNDLGDLFRLIIDYDSNRELIKDIEITYFDTIPVCYNLNIFK 339

Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
           N L F        LL QF            L EE    E D    K ++  +    ++  
Sbjct: 340 NGLCFANCINRSQLLYQF----------EKLGEEIS--EEDIRINKTVQMDNIQLTKEKY 387

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
              E  L G  +       ++ S  + DS++N   L   S   ++      T  +     
Sbjct: 388 F--EFKLKGLDNLALIDVVESLS-PITDSILNDDTLVTLSTKSKLKTIVHGTPTTTLVES 444

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII--SLEARTMVLETADLLT 597
           +L   P    I+T             +   A DDE   YL+I  +L  +T+VL   +++ 
Sbjct: 445 QLPIKP--TNIFTT-----------KTSANAVDDE---YLVITSTLSFKTLVLSLGEVIE 488

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
           EV +S   FV  +   A    G+  ++Q++  G R ++G+
Sbjct: 489 EVNDS--EFVLDQPTVAVQQVGKSSIVQIYSNGLRHINGN 526


>gi|380490733|emb|CCF35810.1| pre-mRNA-splicing factor rse-1 [Colletotrichum higginsianum]
          Length = 1212

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 137/603 (22%), Positives = 226/603 (37%), Gaps = 129/603 (21%)

Query: 74  QEEGSKESK--NSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
           Q  G+KE     +  ++  +L    S   +  V  + + G + S+A     G++    +D
Sbjct: 27  QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITVLSHDIFGIIRSMAAFRLAGSN----KD 82

Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVDP 187
            +ILA +  +I+++E+        I + + F+    LHL    K G      G  +  DP
Sbjct: 83  YLILATDSGRITIIEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACDP 131

Query: 188 QGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +GR      V    L  ++ + SQ        E T  S     A      V+++  LD+ 
Sbjct: 132 KGRACLIASVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVLSMVALDV- 181

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLIW 299
                    GY  PV   L E E T A     G  + +  T ++    +   L      W
Sbjct: 182 ---------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRKW 230

Query: 300 SAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHY-HSQSASCALALNNY--AVSLDSSQ 353
           S    P  A  L  VP       GVLV G   I Y HS   +  + +     A    S +
Sbjct: 231 SESVDP-TASMLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRK 289

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDLSKT 406
               S    +L  +   +      LL T+ GDL   T+      DG     V+RL +   
Sbjct: 290 RHAVSGVMHKLKGSAGAFF----FLLQTEDGDLFKATLDMVEDTDGNPTGEVKRLKIKYF 345

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
           +   ++S +  + +   +  S+ G+    QF              E+ GD + +      
Sbjct: 346 DTIPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDDE------ 385

Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
           L  SS D   D   G +   +          +   + A+ +S+ ++ PL D         
Sbjct: 386 LEFSSDDFPTDPKAGYDAVYF--------HPRPLENLALVESIDSMNPLLDCKVANLTGE 437

Query: 527 DA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSSRMA 569
           DA    +A G   +S + ++            ELPG    +WT+  K +RG         
Sbjct: 438 DAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLNRG--------- 486

Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
              D+Y AY+++S    T+VL   + + EV++S   F+      A  L G   +IQV  +
Sbjct: 487 ---DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQVHPK 541

Query: 630 GAR 632
           G R
Sbjct: 542 GIR 544


>gi|346971485|gb|EGY14937.1| pre-mRNA-splicing factor RSE1 [Verticillium dahliae VdLs.17]
          Length = 1230

 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 122/545 (22%), Positives = 200/545 (36%), Gaps = 129/545 (23%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCG 192
           +ILA +  +I+++E+  + +  +   +  F        K G      G  +  DP+GR  
Sbjct: 102 LILATDSGRIAIIEYLPAQNRFQRLHLETFG-------KSGIRRVVPGEFLACDPKGRA- 153

Query: 193 GVLVYGLQ-----MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
             L+  L+      ++ + SQ        E T  S     A     HV+++  LD+    
Sbjct: 154 -CLIASLEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVHVLSMVALDV---- 199

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL------IWSA 301
                 GY  PV   L E + T A +          +AL + T L  + L      +   
Sbjct: 200 ------GYANPVFAAL-ETDYTEADQDPTGQ-----AALDVETQLVYYELDLGLNHVVRK 247

Query: 302 MNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQEL 355
            + P D    L    P G     GVLV G   I Y HS   +  + +         + E 
Sbjct: 248 WSEPVDNTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQEAFRVPVPRRR----GATED 303

Query: 356 PRSSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVY----DGRV---VQRLDLS 404
           P     +     H   L+        LL T+ GDL  +T+      DG     V+RL + 
Sbjct: 304 PSRKRCIVAGVMHK--LKGSAGAFFFLLQTEDGDLFKITIDMIEDRDGNPTGEVKRLKIK 361

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
             +   + S +  + +   ++ S+ G+    QF              E+ GD        
Sbjct: 362 YFDTIPVASSLCILKSGFLYVASQFGNYQFYQF--------------EKLGD------DD 401

Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI 524
           + L  SS D   D     E   +          ++  + A+ +S+ ++ PL D       
Sbjct: 402 EELEFSSDDFPTDPKQSYEAVFF--------HPRELENLALVESIDSMNPLIDCKVANLT 453

Query: 525 NADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSSR 567
             DA    +A G   +S + ++            ELPG    +WT+  K SRG       
Sbjct: 454 GEDAPQIYTACGNGARSTFRILKHGLEVNEIVASELPGIPSAVWTL--KLSRG------- 504

Query: 568 MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVF 627
                D+Y AY+++S    T+VL   + + EV +S   F+      A  L G   +IQV 
Sbjct: 505 -----DQYDAYIVLSFTNATLVLSIGETVEEVNDS--GFLTSVPTLAAQLLGGEGLIQVH 557

Query: 628 ERGAR 632
            +G R
Sbjct: 558 PKGIR 562


>gi|444313909|ref|XP_004177612.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
 gi|387510651|emb|CCH58093.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
          Length = 1459

 Score = 46.2 bits (108), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 29/116 (25%), Positives = 58/116 (50%), Gaps = 17/116 (14%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC----F 162
           ++  G +  + ++ Q G++     D ++L   +AKIS+++FD+ ++ L+  S+H     F
Sbjct: 54  FKFSGKITDIVLIPQRGSE----LDCLLLVTPNAKISIIKFDEELNTLKTISLHYYTDEF 109

Query: 163 ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
           E    L L       AR   ++V+P+ +C  VL++  + I +        +  DED
Sbjct: 110 EKLSMLQL-------ARTSQLRVEPKKKC--VLLFNTESIAILPFTQQFNIDNDED 156


>gi|448111975|ref|XP_004201977.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
 gi|359464966|emb|CCE88671.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
          Length = 1249

 Score = 45.8 bits (107), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 55/239 (23%), Positives = 98/239 (41%), Gaps = 43/239 (17%)

Query: 95  GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
            I    L+ +C + +   ++SL  +   G+    ++D +++  +  K+++L++D   + L
Sbjct: 52  NIDTGKLDKICVHNVFSVIQSLEKVRLTGS----QKDYLVVTSDSGKLAILQYDTGRNRL 107

Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ--MIILKASQGGSG 212
               +  F+ P   H K G      GP +  DPQ R   +L+  L+   +I K      G
Sbjct: 108 ----VTVFQEP---HSKTGFRRNTPGPYLLTDPQNR--AILIGALERNKLIYKVHSDDKG 158

Query: 213 LVGDEDTFGSGGGFSARIESS--HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW 270
                     G   S+ +ES   H I L    +          GY  PV V +       
Sbjct: 159 ----------GMQISSPLESQIRHTITLAMCALDT--------GYENPVFVAIEAEYGAL 200

Query: 271 AGRV----SWKHHTCMISALSISTTLKQHPLIWSAMN--LPHDAYKLLAVPSPIGGVLV 323
             +     S  H T + ++  +   L    ++   +N  LP  A  L+ +PSP+GGVL+
Sbjct: 201 DSKEYSIDSQAHQTLLFTSYELDQGLNH--VVRRVVNNKLPISATHLIPLPSPVGGVLI 257


>gi|392593521|gb|EIW82846.1| hypothetical protein CONPUDRAFT_81012 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 1213

 Score = 45.8 bits (107), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 87/384 (22%), Positives = 146/384 (38%), Gaps = 97/384 (25%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ +D   V+ L +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIDHDEDEVKSLKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
                       S TS  S G+ E F  +     +  R R   + AL D +   +  L  
Sbjct: 368 QKLGDDDDEPEFSSTSFPSFGMAESFIPLPH---AHFRPRGLDNLALADEIESLDPILDA 424

Query: 489 SASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
              N   N+++ Q  F+   R S      L+   +GL +    S+            ELP
Sbjct: 425 KVMNILPNSDTPQ-IFTACGRGSRSTFRMLR---HGLEVEESVSS------------ELP 468

Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           G    +WT                   DD Y +Y+I+S    T+VL   + + EV ++  
Sbjct: 469 GIPNAVWTTKRTE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT-G 513

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGA------------RILDGS--------------- 637
           +     T+A   + G   ++QV  +G             R+  G                
Sbjct: 514 FLSSAPTLAVQQI-GSDALLQVHPQGIRHVLSDRRVNEWRVPQGKTIVCATTNKRQVVVA 572

Query: 638 -------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
                  Y   DL  G  N      +  STVL++S+ +        PY+ +G  D ++R+
Sbjct: 573 LSSAELVYFELDLD-GQLNEYQDWKAMGSTVLALSVGEVPEGRQRTPYLAVGCEDQTVRI 631

Query: 683 LVGDPSTC--TVSVQT----PAAI 700
           +  DP +   T+S+Q     P+AI
Sbjct: 632 ISLDPESTLETISLQALTAPPSAI 655


>gi|430813298|emb|CCJ29330.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 1197

 Score = 45.8 bits (107), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 118/544 (21%), Positives = 206/544 (37%), Gaps = 92/544 (16%)

Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
           +HG + +L      G +    +D +I+  +  +I++LE+    +         +      
Sbjct: 65  VHGIIRTLVGFRLAGTN----KDHLIVGSDSGRITILEYKPDSNAFSKVHQETYG----- 115

Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
             K G      G  + VDP+GR   +       ++   ++  +                A
Sbjct: 116 --KSGVRRVVPGQYLAVDPKGRATMIASIEKNKLVYVLNRDSA------TNLTISSPLEA 167

Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMIS 284
               S V +L  +D+          GY  PV   L     E E   +G+ +++    +++
Sbjct: 168 HKSCSLVFHLIGMDV----------GYENPVFAALEVDYTEAESDPSGK-AYREIQKVLT 216

Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASC 338
              +   L      WS    P D    L V  P G     G LV    +I Y H    + 
Sbjct: 217 YYELDLGLNHVVRKWSD---PVDRKANLLVTVPGGSDGPSGALVCTEGSIFYKHKGKKTH 273

Query: 339 ALALNNYAVSLDSSQ--ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
            + +     SL++SQ  ++  SS   ++  A    LQN+        GDL  +T+  +  
Sbjct: 274 RIPIPTRIGSLENSQKKQIIVSSVVHKMRGAFFFLLQNE-------DGDLFKVTIDSNDG 326

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF-TCGSGTSMLS-SGLKEEF 454
            V+ L +   +   +++ ++ + +   F+ S  G+  L QF   G   + +  S +    
Sbjct: 327 EVESLKIKYFDTVPVSTGLSILKSGFLFVASEYGNHHLYQFEKLGDDNNEIEFSSVDFPV 386

Query: 455 GDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLVNI 512
            D+ E   PS  R R   +  L D +N     +     N T E A + ++   R      
Sbjct: 387 LDLNEGYEPSYFRPRSLENLLLVDDLNSMNPLMDSKILNLTDEDAPQIYALCGR------ 440

Query: 513 GPLKDFS---YGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRM 568
           GP   F    YGL +N +  A+G           LPG    +WT    SS          
Sbjct: 441 GPRSTFRTLRYGLEVN-EIVASG-----------LPGSPTAVWTTKLTSS---------- 478

Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
               D+Y AY+++S    T+VL   + + EV+++  +     T+A   L G   +IQV  
Sbjct: 479 ----DQYDAYIVLSFVNGTLVLSIGETVEEVSDT-GFLSSSPTLAVQQL-GDDALIQVHP 532

Query: 629 RGAR 632
           +G R
Sbjct: 533 KGIR 536


>gi|426192113|gb|EKV42051.1| hypothetical protein AGABI2DRAFT_229642 [Agaricus bisporus var.
           bisporus H97]
          Length = 1213

 Score = 45.8 bits (107), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ ++   V+ L +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
                       S TS  SSG+ E     +A  P    + R   + AL D +   +  + 
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423

Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
               N   N+++ Q  F+   R +  +   L+   +GL +    S+            +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467

Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG    +WT                   DD Y +Y+I+S    T+VL   + + EV ++ 
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
            +     T+A   + G   ++QV   G R                  I+  +        
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKIIVAATTNKRQVVV 571

Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
                   Y   DL  G  N      +  STVL++SI D        PY+ +G  D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630

Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
           ++  DP +   T+S+Q     P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655


>gi|409075182|gb|EKM75565.1| hypothetical protein AGABI1DRAFT_64324 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 1213

 Score = 45.4 bits (106), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ ++   V+ L +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
                       S TS  SSG+ E     +A  P    + R   + AL D +   +  + 
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423

Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
               N   N+++ Q  F+   R +  +   L+   +GL +    S+            +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467

Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG    +WT                   DD Y +Y+I+S    T+VL   + + EV ++ 
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
            +     T+A   + G   ++QV   G R                  I+  +        
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKTIVAATTNKRQVVV 571

Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
                   Y   DL  G  N      +  STVL++SI D        PY+ +G  D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630

Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
           ++  DP +   T+S+Q     P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655


>gi|353232348|emb|CCD79703.1| putative dna repair protein xp-E [Schistosoma mansoni]
          Length = 1329

 Score = 45.4 bits (106), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)

Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
           R  DS+ L    A ++++E    +DS+  + + S    +        R      +G  V 
Sbjct: 72  RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           +DP      V +Y GL  IIL    G        +  G+    + +IE  +++       
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
               D  F++GY  P   +++E EL          H            L+   L   ++ 
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
              D+  L+ VP P GGV++VG N I YH++       ++ Y     +SQ L  ++   +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271

Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
                D A   +    L  D+ A  +  T +   L+ V  G +   L      P      
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327

Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
           I  + N + F+GS LGDS L++ 
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350


>gi|239613967|gb|EEQ90954.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ER-3]
 gi|327353314|gb|EGE82171.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 1199

 Score = 45.4 bits (106), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
           W Q D    LL+   G L  L ++ D    VQ   L +       S +  +G  + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400

Query: 428 RLGDSLLVQFTCGS 441
             GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414


>gi|256088964|ref|XP_002580590.1| DNA repair protein xp-E [Schistosoma mansoni]
          Length = 1329

 Score = 45.4 bits (106), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 129/323 (39%), Gaps = 57/323 (17%)

Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
           R  DS+ L    A ++++E    +DS+  + + S    +        R      +G  V 
Sbjct: 72  RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           +DP      V +Y GL  IIL    G        +  G+    + +IE  +++       
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
               D  F++GY  P   +++E EL          H            L+   L   ++ 
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
              D+  L+ VP P GGV++VG N I YH++       ++ Y     +SQ L  ++   +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271

Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
                D A   +    L  D+ A  +  T +   L+ V  G +  R++L     +     
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSI--RIEL--LGETATPES 327

Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
           I  + N + F+GS LGDS L++ 
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350


>gi|195586770|ref|XP_002083143.1| GD13507 [Drosophila simulans]
 gi|194195152|gb|EDX08728.1| GD13507 [Drosophila simulans]
          Length = 1227

 Score = 45.4 bits (106), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G+SD ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLSDNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|195169735|ref|XP_002025674.1| GL20829 [Drosophila persimilis]
 gi|194109167|gb|EDW31210.1| GL20829 [Drosophila persimilis]
          Length = 1225

 Score = 45.4 bits (106), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     S +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP T          L+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|125977518|ref|XP_001352792.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
 gi|54641542|gb|EAL30292.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
          Length = 1228

 Score = 45.4 bits (106), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     S +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP T          L+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|308477185|ref|XP_003100807.1| CRE-DDB-1 protein [Caenorhabditis remanei]
 gi|308264619|gb|EFP08572.1| CRE-DDB-1 protein [Caenorhabditis remanei]
          Length = 1154

 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 87/358 (24%), Positives = 147/358 (41%), Gaps = 82/358 (22%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  DA  L+ VP+PI GVLV+ AN+I Y S          N  V   +S  L  + F+ 
Sbjct: 206 SIAADASVLIPVPAPISGVLVLAANSILYKSSDV-------NGDVVPYASPLLDNTVFTC 258

Query: 363 E--LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITT 417
              +D +   ++ +D     T+   L+L+  + +GR    V+ + +     + +   I  
Sbjct: 259 HGLVDPSGERFILSD-----TEGRLLMLILNIGEGRSGITVKDMRIEYLGETSIADSINY 313

Query: 418 IGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ 476
           I   + F+GSRLGDS L++     SG S   S + E + +I                 ++
Sbjct: 314 IDAGVVFVGSRLGDSQLIRLMPTPSGGSY--SVVLETYSNI---------------GPIR 356

Query: 477 DMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
           DM+  E         ++ ++   T S A +D     G L+    G+ I   AS       
Sbjct: 357 DMIMVE---------SDGQAQLVTCSGAEKD-----GSLRVIRNGIGIEELAS------- 395

Query: 537 SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
                VEL G  GI+ +   S+  +                Y+I+SL   T VL+     
Sbjct: 396 -----VELAGVIGIFPIRLNSTTDN----------------YVIVSLAEETHVLQINGEE 434

Query: 597 TEVTESVDYFVQGRTIAAGNLFG---RRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
            E  + +    +  TI A  +FG      ++QV E+  R +  S +++   + P N E
Sbjct: 435 LEDVQLLQICTEMPTIFASTIFGPDNSEVLLQVTEKHVRFMAFSGLSK--IWEPPNGE 490


>gi|261193401|ref|XP_002623106.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
 gi|239588711|gb|EEQ71354.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
          Length = 1168

 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
           W Q D    LL+   G L  L ++ D    VQ   L +       S +  +G  + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400

Query: 428 RLGDSLLVQFTCGS 441
             GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414


>gi|226291941|gb|EEH47369.1| DNA damage-binding protein 1a [Paracoccidioides brasiliensis Pb18]
          Length = 1209

 Score = 45.1 bits (105), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 74/174 (42%), Gaps = 38/174 (21%)

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
           +W+   C I+       LK+         L   A  L+ VP+P+GG+LV+G  +I Y   
Sbjct: 279 AWQDTGC-IAVFKALDLLKEE--------LEMGASFLIPVPAPLGGLLVLGETSIRYLD- 328

Query: 335 SASCALALNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLT 390
                          D++ E      S+ LD A     W Q D    LL+   G L  L 
Sbjct: 329 ---------------DATNE----CISLPLDEATIFVAWEQVDGQRWLLADDYGRLFFLM 369

Query: 391 VVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
           ++ D    VQ  +LDL    P    S +  +G  + F+GS  GDS L++ T GS
Sbjct: 370 LILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVTFIGSHQGDSQLIRITEGS 421


>gi|325094412|gb|EGC47722.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H88]
          Length = 1201

 Score = 45.1 bits (105), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            L   A  L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341

Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
                   W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +G  + F+GS  GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416


>gi|367001853|ref|XP_003685661.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
 gi|357523960|emb|CCE63227.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
          Length = 1357

 Score = 45.1 bits (105), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 70/365 (19%), Positives = 153/365 (41%), Gaps = 71/365 (19%)

Query: 97  SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
           S   L L   ++L+G V  +A++ Q  +    + D +I+    AK+S++ F+   + L  
Sbjct: 45  STNKLHLNYEFKLNGRVSDIALIKQVDS----KLDYLIILTATAKLSLVNFNVFTNSLET 100

Query: 157 TSMHCFESPEWLH--LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
            S+H +E     +  LK  +ES  R     +D    C  VL++    I +      +   
Sbjct: 101 ISLHYYEDKFRQNSILKLAKESKLR-----IDQAKNC--VLLFNNDNIAILPISSTTDEF 153

Query: 215 GDED-----------------TFGSGGGFSARIESSHVINLRDLDM----KHVKDFIFVH 253
            DED                  F S      +I +S +I L+  ++    +++ D  F+ 
Sbjct: 154 EDEDLGQESSAKTVKRGNMSIKFPSQSQKKNKITNSSII-LKSTELNSKIQNIIDIQFLS 212

Query: 254 GYIEPVMVILHERELTWAGR-----VSWKHHTCMISAL-------------SISTTLKQH 295
            + +P + +L++ +L W G      +  ++    ++ L             S++  L + 
Sbjct: 213 NFSKPTLSVLYQPKLAWIGNSNLVTLPTQYMILTLNILERENIKSQENGENSLNQDLIET 272

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--HSQSASCALALNNY--AVSLDS 351
            +I     LP++ + ++ + +   G  +VG+N I Y  H+      + +N +    +L  
Sbjct: 273 TIIGQVSELPYELHTIIPLNN---GSTLVGSNEIIYIDHTGVLQSLIIINQFQDKETLKK 329

Query: 352 SQELPRSSFSVELDA------AHATWLQNDV-----ALLSTKTGDLVLLTVVYDGRVVQR 400
            + + +S  ++ L+       A +    N+V      L+  +  ++ L+ +  +GR++  
Sbjct: 330 GRVIDKSKQNIILNKPIKFINAGSRVESNNVDDKNNVLIFDENNNIYLVNITLEGRLLIN 389

Query: 401 LDLSK 405
            D++K
Sbjct: 390 FDINK 394


>gi|240275059|gb|EER38574.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H143]
          Length = 1134

 Score = 45.1 bits (105), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            L   A  L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341

Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
                   W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +G  + F+GS  GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416


>gi|225558618|gb|EEH06902.1| DNA damage-binding protein 1a [Ajellomyces capsulatus G186AR]
          Length = 1201

 Score = 45.1 bits (105), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 45/144 (31%), Positives = 66/144 (45%), Gaps = 25/144 (17%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            L   A  L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V
Sbjct: 293 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 341

Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
                   W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  
Sbjct: 342 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 392

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +G  + F+GS  GDS L++ T GS
Sbjct: 393 MGGGITFIGSHQGDSELIRITEGS 416


>gi|170589359|ref|XP_001899441.1| Xeroderma Pigmentosum Group E Complementing protein [Brugia malayi]
 gi|158593654|gb|EDP32249.1| Xeroderma Pigmentosum Group E Complementing protein, putative
           [Brugia malayi]
          Length = 521

 Score = 45.1 bits (105), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 67/155 (43%), Gaps = 31/155 (20%)

Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADA---SATGISKQSNY----------EL-- 541
           A  T   ++ DS  N+ P++D +  +R N      + +G  K              EL  
Sbjct: 345 ADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS 403

Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
           VEL G K ++T+  +               DDE+  YLI+S ++ T VL       E TE
Sbjct: 404 VELKGIKNMFTLRTR---------------DDEFDDYLILSFDSETHVLLINGEELEDTE 448

Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
              + V G T+ AG LF  + ++QV      ++DG
Sbjct: 449 ITGFTVDGATLWAGCLFHSKTILQVTHGEVILIDG 483


>gi|225680146|gb|EEH18430.1| DNA damage-binding protein [Paracoccidioides brasiliensis Pb03]
          Length = 1138

 Score = 44.7 bits (104), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+LV+G  +I Y                  D++ E      S+ LD A   
Sbjct: 318 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 357

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
             W Q D    LL+   G L  L ++ D    VQ  +LDL    P    S +  +G  + 
Sbjct: 358 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 415

Query: 424 FLGSRLGDSLLVQFTCGS 441
           F+GS  GDS L++ T GS
Sbjct: 416 FIGSHQGDSQLIRITEGS 433


>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
           putative [Candida dubliniensis CD36]
 gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
          Length = 1187

 Score = 44.7 bits (104), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 93/395 (23%), Positives = 162/395 (41%), Gaps = 62/395 (15%)

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
           +K+ P   ++  LP D   ++ +P  IGG+LV G+N   Y          L+   + L  
Sbjct: 218 VKKKPASLNSDPLPDDVNYMIPLPGHIGGMLVCGSNWCFYD--------KLDGPRIYLPL 269

Query: 352 SQELPRSSFSVELD-AAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLS--KT 406
            +   ++  S+ ++   H    +N   LL    GDL  LTV Y  D   ++ + ++   T
Sbjct: 270 PRRDGQTQESIIVNHVTHVLKKKNFFILLQNTLGDLFKLTVDYDFDKETIKNISITYFDT 329

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
            P  L+ +I    N   F+     D LL QF              E+ GD  A+      
Sbjct: 330 IPPALSLNI--FKNGFLFVNVLNNDKLLYQF--------------EKLGDDLAE----NE 369

Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RI 524
           L  +SSD             Y S  N   +   TF     D+L  I  L+  S  +  RI
Sbjct: 370 LVINSSD-------------YDSLDNVRGTDTTTFKLKGLDNLALIDVLETLSPIIDSRI 416

Query: 525 NADASATGISKQSNYELVE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII- 581
           N D+    +S  S  + +   +P    + +    +          + + +DE   YL+I 
Sbjct: 417 N-DSKLVTLSSHSYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDE---YLVIS 472

Query: 582 -SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG---ARILDGS 637
            SL ++T+VL   +++ +V +S   FV  ++  +    G   V+QV+  G    R ++G 
Sbjct: 473 SSLSSKTLVLSIGEVVEDVEDS--EFVLDQSTISVQQVGIASVVQVYSNGIKHIRTVNGK 530

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
             T D  F P+       S N+  + +++++  V+
Sbjct: 531 KKTTDW-FPPAGITITHASTNNQQVLIALSNLNVV 564


>gi|295667673|ref|XP_002794386.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226286492|gb|EEH42058.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1195

 Score = 44.7 bits (104), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+LV+G  +I Y                  D++ E      S+ LD A   
Sbjct: 292 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 331

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
             W Q D    LL+   G L  L ++ D    VQ  +LDL    P    S +  +G  + 
Sbjct: 332 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 389

Query: 424 FLGSRLGDSLLVQFTCGS 441
           F+GS  GDS L++ T GS
Sbjct: 390 FIGSHQGDSQLIRITEGS 407


>gi|308808936|ref|XP_003081778.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
 gi|116060244|emb|CAL56303.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
          Length = 1282

 Score = 44.7 bits (104), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 55/209 (26%), Positives = 100/209 (47%), Gaps = 30/209 (14%)

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
              N+R L+   V+D  F+HG  +P + +L+ R++  A           I    I    K
Sbjct: 376 EAFNIR-LEELRVEDIQFLHGTAKPTIAVLY-RDMKEA---------VHIKTYEIGVREK 424

Query: 294 QH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS 352
           +     W+  +L   + K++ VP+P+GGV+V+G  TI Y ++++      ++  V L + 
Sbjct: 425 EFVSSPWAQNDLEGGSSKIIPVPAPVGGVVVLGEETIVYLNKTS------DDTDVFLKAI 478

Query: 353 QELPRSSF----SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
               RSS     +++ D +          LL    G L LL +V+DG+ V  L + +   
Sbjct: 479 NIPERSSIVCYGAIDPDGSRY--------LLGDHDGTLYLLVLVHDGKRVNELKIERLGE 530

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           + + S ++ + N + F+GS  GDS L++ 
Sbjct: 531 TSIPSTVSYLDNGVVFVGSAYGDSQLIKL 559


>gi|346327528|gb|EGX97124.1| pre-mRNA splicing factor RSE1 [Cordyceps militaris CM01]
          Length = 1206

 Score = 44.7 bits (104), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 144/639 (22%), Positives = 231/639 (36%), Gaps = 116/639 (18%)

Query: 74  QEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
           Q  G+KE      SG     +  D      + L+ H  + G + S+A+    G++    +
Sbjct: 21  QFAGTKEQLIITGSGSQLTLLRPDPAQGKVIALLSH-DIFGILRSIAVFRLAGSN----K 75

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
           D IILA +  +I++LE+    +      M  F        K G      G  +  DP+GR
Sbjct: 76  DYIILATDSGRITILEYLPGPNRFNRLHMETFG-------KSGIRRVVPGEYLACDPKGR 128

Query: 191 C---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
                 V    L  ++ + SQ        E T  S     A      VI +  LD+    
Sbjct: 129 ACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIAMVALDV---- 175

Query: 248 DFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
                 GY  PV   L     E +    G    +  T ++    +   L      WS   
Sbjct: 176 ------GYANPVFAALEIEYTEVDQDITGEALSEVETQLV-YYELDLGLNHVVRKWSD-- 226

Query: 304 LPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPR 357
            P D    L    P G     GVLV G   I Y HS   +  + +         + E P 
Sbjct: 227 -PVDPTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQDALRVPIPRRR----GATEDPS 281

Query: 358 SSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKT 406
              ++     H   L+        LL +  GDL  +T+  V D        VQR+ +   
Sbjct: 282 RKRNIVAGVMHK--LKGSAGAFFFLLQSDDGDLFKITIDMVEDEEGAPTGEVQRMKIKYF 339

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF--GDIEADAPST 464
           +   + + +  + +   ++ S+ G+    QF              E+F    + A  P  
Sbjct: 340 DTVPVATSLCILKSGFLYVASQFGNYAFYQFEKLGDDDDEVEFSSEDFPVDPLAAYEPVY 399

Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLVNIGPLKDFSYGLR 523
              R + + AL D +      L    +N T E A + F+     +      LK   +GL 
Sbjct: 400 FYPRLAENLALVDSIPAMNPLLDCKVANLTGEDAPQIFTICGNGARSTFRTLK---HGLE 456

Query: 524 INADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
           +N   ++            ELPG    +WT+   S              D++Y AY+++S
Sbjct: 457 VNEIVAS------------ELPGVPSAVWTLKLNS--------------DEQYDAYIVLS 490

Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQ 641
               T+VL   + + EV++S  +     TIAA  L G   +IQV  RG R I +G     
Sbjct: 491 FTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLLGTDGLIQVHPRGIRHIRNG----- 543

Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
                  N    S  ++ ++++ S     V + +S G I
Sbjct: 544 -------NVNEWSAPQHRSIVAASTNSHQVAIALSSGEI 575


>gi|358366432|dbj|GAA83053.1| UV-damaged DNA binding protein [Aspergillus kawachii IFO 4308]
          Length = 1643

 Score = 44.7 bits (104), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 105/264 (39%), Gaps = 29/264 (10%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP GR   + VY   + ++   Q  S   G +    SG       E       R +D  
Sbjct: 62  IDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTTAR-IDEL 120

Query: 245 HVKDFIFVHGYIEP--VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
            V+   F+H    P  + ++  + +     +V   H++   S+       ++  L   + 
Sbjct: 121 FVRSSAFLHVQSGPPRLALLYEDNQKKVRLKVRALHYSAATSSTGADAAFEES-LDGFSQ 179

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            L   A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R     
Sbjct: 180 ELDLGASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP---- 220

Query: 363 ELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITT 417
            LD A     W Q D    LL+   G L  L +V D    VQ   L     +   S +  
Sbjct: 221 -LDEATIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIY 279

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +G  + F+GS  GDS +++   GS
Sbjct: 280 LGGGVIFVGSHQGDSQVLRIGNGS 303


>gi|310793065|gb|EFQ28526.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1212

 Score = 44.3 bits (103), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 129/570 (22%), Positives = 212/570 (37%), Gaps = 131/570 (22%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + + G + S+A     G++    +D +ILA +  +I+++E+        I + + F+   
Sbjct: 62  HDIFGIIRSMAAFRLAGSN----KDYLILATDSGRITIIEY--------IPAQNRFQR-- 107

Query: 167 WLHL----KRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDT 219
            LHL    K G      G  +  DP+GR      V    L  ++ + SQ        E T
Sbjct: 108 -LHLETFGKSGVRRVIPGEYLACDPKGRACLIASVEKNKLVYVLNRNSQA-------ELT 159

Query: 220 FGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWA-----GRV 274
             S     A      V+++  LD+          GY  PV   L E E T A     G  
Sbjct: 160 ISSP--LEAHKPGVLVLSMVALDV----------GYANPVFAAL-EIEYTEADQDPTGEA 206

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTI 329
           + +  T ++    +   L      WS    P D    L    P G     GVLV G   I
Sbjct: 207 AREAETQLV-YYELDLGLNHVVRKWSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENI 262

Query: 330 HY-HSQSASCALALNNY--AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
            Y HS   +  + +     A    S +    S    +L  +   +      L+ T+ GDL
Sbjct: 263 TYRHSNQEAFRVPIPRRRGATEDPSRKRHVVSGVMHKLKGSAGAFF----FLIQTEDGDL 318

Query: 387 VLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
              T+      DG     V+RL +   +   ++S +  + +   +  S+ G+    QF  
Sbjct: 319 FKATIDMVEDADGNPTGEVKRLKIKYFDTIPVSSSLCILKSGFLYAASQFGNHQFYQF-- 376

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                       E+ GD        + L  SS D   D   G +   +          + 
Sbjct: 377 ------------EKLGD------DDEELEFSSDDFPTDPKAGYDAVYF--------HPRP 410

Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV------------E 543
             + A+ +S+ ++ PL D         DA    +A G   +S + ++            E
Sbjct: 411 LENLALVESIDSMNPLLDCKVANLTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASE 470

Query: 544 LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
           LPG    +WT+  K +RG            D+Y AY+++S    T+VL   + + EV++S
Sbjct: 471 LPGIPSAVWTL--KLNRG------------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS 516

Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
              F+      A  L G   +IQV  +G R
Sbjct: 517 --GFLTSVPTLAAQLLGEDGLIQVHPKGIR 544


>gi|60677959|gb|AAX33486.1| RE01065p [Drosophila melanogaster]
          Length = 1227

 Score = 44.3 bits (103), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|195490209|ref|XP_002093045.1| GE20993 [Drosophila yakuba]
 gi|194179146|gb|EDW92757.1| GE20993 [Drosophila yakuba]
          Length = 1227

 Score = 44.3 bits (103), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|429859776|gb|ELA34542.1| pre-mRNA-splicing factor rse1 [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 1212

 Score = 44.3 bits (103), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 138/607 (22%), Positives = 226/607 (37%), Gaps = 137/607 (22%)

Query: 74  QEEGSKESKNSGETKRRVLM---DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
           Q  G+KE      +  R+ +   D      + L+ H  + G + S+A     G++    +
Sbjct: 27  QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITLLSH-DIFGIIRSMAAFRLAGSN----K 81

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVD 186
           D +ILA +  +I+++E+        I + + F+    LHL    K G      G  +  D
Sbjct: 82  DYLILATDSGRITIVEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACD 130

Query: 187 PQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           P+GR      V    L  ++ + +Q        E T  S     A      V+++  LD+
Sbjct: 131 PKGRACLIASVEKNKLVYVLNRNAQA-------ELTISSP--LEAHKPGVLVLSMVALDV 181

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLI 298
                     GY  PV   L E E T A     G  + +  T ++    +   L      
Sbjct: 182 ----------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLVY-YELDLGLNHVVRK 229

Query: 299 WSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVSLD 350
           WS    P D    L    P G     GVLV G   I Y HS   +  + +     A    
Sbjct: 230 WSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDP 286

Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL--VLLTVVYDGR-----VVQRLDL 403
           S +    S    +L  +   +      LL T+ GDL   ++ +V D        V+RL +
Sbjct: 287 SRKRHIVSGVMHKLKGSAGAFF----FLLQTEDGDLFKAVIDMVEDADGNPTGEVKRLKI 342

Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
              +   ++S +  + +   +  S+ G+    QF              E+ GD + +   
Sbjct: 343 KYFDTVPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDEEK-- 386

Query: 464 TKRLRRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
                 SS D   D   G + +  Y     N          A+ +S+ ++ PL D     
Sbjct: 387 ----EFSSDDFPADPKAGYDAVYFYPRPLEN---------LALVESIDSMNPLLDCKVAN 433

Query: 523 RINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADS 565
               DA    +A G   +S + ++            ELPG    +WT+  K SRG     
Sbjct: 434 LTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLSRG----- 486

Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
                  D+Y AY+++S    T+VL   + + EV++S   F+      A  L G   +IQ
Sbjct: 487 -------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQ 537

Query: 626 VFERGAR 632
           V  +G R
Sbjct: 538 VHPKGIR 544


>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1144

 Score = 44.3 bits (103), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 61/142 (42%), Gaps = 21/142 (14%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           NL   A  L+ VP P+GG+L++G   I Y          ++N  ++L            +
Sbjct: 239 NLELGAEILVPVPLPLGGILILGEKCIKYVD-------TISNETITL-----------PL 280

Query: 363 ELDAAHATW--LQNDVALLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIG 419
           E +     W  L N   LL+   G L  L +V D    V+   +     +   S +  +G
Sbjct: 281 EYNTVFVAWEQLDNQRWLLADDYGRLFFLMLVLDSANAVRTWKVDLLGETSRASVLVHLG 340

Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
             + FLGS  GDS +++ T GS
Sbjct: 341 GGVVFLGSHQGDSHVIRITEGS 362


>gi|195336406|ref|XP_002034829.1| GM14250 [Drosophila sechellia]
 gi|194127922|gb|EDW49965.1| GM14250 [Drosophila sechellia]
          Length = 1227

 Score = 44.3 bits (103), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|194749950|ref|XP_001957397.1| GF24063 [Drosophila ananassae]
 gi|190624679|gb|EDV40203.1| GF24063 [Drosophila ananassae]
          Length = 1228

 Score = 44.3 bits (103), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|24654874|ref|NP_728546.1| CG13900, isoform A [Drosophila melanogaster]
 gi|23092721|gb|AAF47416.2| CG13900, isoform A [Drosophila melanogaster]
 gi|60678131|gb|AAX33572.1| LD01809p [Drosophila melanogaster]
 gi|220950356|gb|ACL87721.1| CG13900-PA [synthetic construct]
 gi|289803030|gb|ADD20765.1| FI04459p [Drosophila melanogaster]
          Length = 1227

 Score = 44.3 bits (103), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|449459948|ref|XP_004147708.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
 gi|449513493|ref|XP_004164340.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
          Length = 1214

 Score = 44.3 bits (103), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 73/324 (22%), Positives = 126/324 (38%), Gaps = 57/324 (17%)

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVLV   N + Y +Q      A+      +    +LP     + + AA          LL
Sbjct: 246 GVLVCAENFVIYKNQGHPDVRAV------IPRRADLPAERGVLIVSAAMHKQKTMFFFLL 299

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
            T+ GD+  +T+ ++   V+ L +   +   +T+ +  + +   F  S  G+  L QF  
Sbjct: 300 QTEYGDIFKVTLEHNNDSVKELKIKYFDTIPVTASMCVLKSGFLFAASEFGNHSLYQFQA 359

Query: 440 -GSGTSMLSSG-----LKEEFGDIEADAPSTKRLRR-SSSDALQDMVNGEELSLYGSASN 492
            G    + SS       +E F  +       K L R    ++L  +++ + ++L+     
Sbjct: 360 IGEDADVESSSATLMETEEGFQPVFFQPRRLKNLMRIDQVESLMPIMDMKIINLF----- 414

Query: 493 NTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-K 548
             E   + F+   R      GP   L+    GL I            S   + ELPG   
Sbjct: 415 -EEETPQIFTLCGR------GPRSSLRILRPGLAI------------SEMAVSELPGVPS 455

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            +WTV                  +DE+ AY+++S    T+VL   + + EV++S   F+ 
Sbjct: 456 AVWTVKKN--------------INDEFDAYIVVSFANATLVLSIGETVEEVSDS--GFLD 499

Query: 609 GRTIAAGNLFGRRRVIQVFERGAR 632
                A +L G   ++QV   G R
Sbjct: 500 TTPSLAVSLIGDDSLMQVHPNGIR 523


>gi|194864680|ref|XP_001971056.1| GG14635 [Drosophila erecta]
 gi|190652839|gb|EDV50082.1| GG14635 [Drosophila erecta]
          Length = 1227

 Score = 44.3 bits (103), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPSGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|384490729|gb|EIE81951.1| hypothetical protein RO3G_06656 [Rhizopus delemar RA 99-880]
          Length = 967

 Score = 43.9 bits (102), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 72/154 (46%), Gaps = 17/154 (11%)

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
           S + +    + L+ VP P+GG+LV+G   I Y        L   N  +S+D ++    ++
Sbjct: 198 STIKVEASTHALVPVPEPLGGLLVIGEYIITYFD-----PLTNTNRELSIDPAR---VTA 249

Query: 360 FSVELDAAHATWLQND-----VALLSTKTGDLVLLTVVYDGRVV---QRLDLSKTNPSV- 410
           +    D ++   L ++     V  + T    +V L+  + G+V    Q ++    +P V 
Sbjct: 250 WEFMKDESNRYLLGDEEGYLYVFSIETSHNKVVNLSSTFIGQVPSFNQNIESKANHPQVS 309

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
             S I  +GN +F++GS  GDS L+Q   G   S
Sbjct: 310 RPSCIVDLGNLMFYIGSTHGDSCLIQLIKGQEKS 343


>gi|66811906|ref|XP_640132.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|74854972|sp|Q54SA7.1|SF3B3_DICDI RecName: Full=Probable splicing factor 3B subunit 3
 gi|60468134|gb|EAL66144.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1256

 Score = 43.9 bits (102), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 124/338 (36%), Gaps = 75/338 (22%)

Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQN 374
           GGVLV   + I Y +Q  +            +    +PR   S      L  +H++  Q 
Sbjct: 256 GGVLVASEDYIVYRNQDHA------------EVRSRIPRRYGSDPNKGVLIISHSSHKQK 303

Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
            +   L+ ++ GDL  +T+ Y G  V  ++++  +  VL + +T + N   F  S  GD 
Sbjct: 304 GMFFFLVQSEHGDLYKITLDYQGDQVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDH 363

Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL----RRSSSDALQDMVNGEELSLYG 488
            L  F         S G +EE G  +        L    R S    ++++ N E  S   
Sbjct: 364 TLYFFK--------SIGDEEEEGQAKRLEDKDGHLWFTPRNSCGTKMEELKNLEPTSHLS 415

Query: 489 SASNNTESAQKTFSFAVRDSLVNIGP-------------LKDFSYGLRINADASATGISK 535
           S S           F V D +    P             LK   +GL +    +A     
Sbjct: 416 SLS-------PIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTAN---- 464

Query: 536 QSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                   LPG   GIWTV   +S   NA         D+   Y+++S    T VL   D
Sbjct: 465 --------LPGVPSGIWTVPKSTS--PNA--------IDQTDKYIVVSFVGTTSVLSVGD 506

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            + E  ES    ++  T       G   +IQVF  G R
Sbjct: 507 TIQENHES--GILETTTTLLVKSMGDDAIIQVFPTGFR 542


>gi|302423344|ref|XP_003009502.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
 gi|261352648|gb|EEY15076.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
          Length = 1119

 Score = 43.5 bits (101), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 61/136 (44%), Gaps = 13/136 (9%)

Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVSLDSS----QELPRSSFSV 362
           L +P P    L+    V ++   YH +  + A A L    V+ ++       L +   S 
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280

Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            LD      +W + DV   LL+   G+L LLT+  DG +V  L L     +   S +  +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340

Query: 419 GNSLFFLGSRLGDSLL 434
           GN + FLGS  GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356


>gi|402077250|gb|EJT72599.1| pre-mRNA-splicing factor RSE1 [Gaeumannomyces graminis var. tritici
           R3-111a-1]
          Length = 1216

 Score = 43.5 bits (101), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 60/279 (21%), Positives = 110/279 (39%), Gaps = 68/279 (24%)

Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
           LL T+ GDL  +T+  + D        VQRL +   +   ++S++  + +   F+ S  G
Sbjct: 310 LLQTEDGDLFKVTIDMLEDAEGNTTGEVQRLKIKYFDTIPVSSNLCILKSGFLFVASEFG 369

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
           +    QF              E+ GD        + L  SS +   D     E + +   
Sbjct: 370 NHHFYQF--------------EKLGD------DDEELEFSSENFPSDPAEPYEPAYF--- 406

Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV---- 542
                  + T + A+ +S+ ++ PL D       + DA    + +G   +S + ++    
Sbjct: 407 -----YPRPTENLALVESVESMNPLMDLKVANLTDEDAPQIYTVSGNGARSTFRMLKHGL 461

Query: 543 --------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   +LPG    +WT                 A DD+Y +Y+++S    T+VL   
Sbjct: 462 EVNEIVASQLPGTPSAVWTT--------------KIARDDQYDSYIVLSFTNGTLVLSIG 507

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           + + EV+++   F+   +  A    G   ++QV  RG R
Sbjct: 508 ETVEEVSDT--GFLSSVSTLAVQQLGEDGLVQVHPRGIR 544


>gi|226480826|emb|CAX73510.1| glyceraldehyde 3-phosphate dehydrogenase [Schistosoma japonicum]
          Length = 332

 Score = 43.5 bits (101), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 34/212 (16%)

Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
           R  DS+ L    A ++++E    +DS+  + + S     S E     R      +G  V 
Sbjct: 72  RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVAS----GSVE----DRSARIIDQGFDVL 123

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFG-SGGGFSARIESSHVINLRDLD 242
           +DP      V +Y GL  IIL    G        DT   +   +S RIE  +++      
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIGDKIGTDFLDTNQWTVNTYSVRIEEGNIV------ 177

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
                D  F++GY  P   +++E EL    + +++ +    +  ++  TL          
Sbjct: 178 -----DMAFIYGYSLPTFAMIYEDELVLHMK-TYEIYGREPALRNVQLTLD--------- 222

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
           ++  D+  L+ VP P GGV++VG N I YH++
Sbjct: 223 SIEPDSKLLIPVPKPYGGVILVGDNIICYHTK 254


>gi|195428692|ref|XP_002062402.1| GK16677 [Drosophila willistoni]
 gi|194158487|gb|EDW73388.1| GK16677 [Drosophila willistoni]
          Length = 1273

 Score = 43.5 bits (101), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 347 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 406

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 407 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIVTSQ 456

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 457 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 504

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 505 WTVKKR--------------VDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 549

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 550 TLCCAAL-GDDALVQVYPDGIR 570


>gi|213405251|ref|XP_002173397.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
           yFS275]
 gi|212001444|gb|EEB07104.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
           yFS275]
          Length = 1166

 Score = 43.5 bits (101), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 121/561 (21%), Positives = 202/561 (36%), Gaps = 99/561 (17%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           + L+     +G V ++A L   G     ++D ++L  +  + ++LE+D   + L      
Sbjct: 56  MNLMISQNCYGIVRNIAPLRLTGF----KKDYLVLTSDSGRFTILEYDIGKNKLVSVYQE 111

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGLQ------MII---L 204
            F        K G      G  + +D +GR   V       LVY L       + I   L
Sbjct: 112 AFG-------KSGIRRIVPGEYLALDAKGRAAMVASTEKNKLVYVLNRDSEANLTISSPL 164

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
           +A + G+      D  G   G+   I ++  +   DLD   + +                
Sbjct: 165 EAHKAGTICF---DLVGLDTGYENPIFAALEVEYSDLDHDPLGEL--------------- 206

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGG 320
                      +KH   +++   +   L      WS + +   AYKL+ VP     P  G
Sbjct: 207 -----------YKHSEKVLTYYELDLGLNHVVKRWSKV-VDRSAYKLIRVPGGNDGP-SG 253

Query: 321 VLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           V+V+    I Y H Q  S  + +        ++  LP     + + A       +   LL
Sbjct: 254 VIVISTGWISYRHLQRQSHFVPIPTRETKATTNTALP-----IIVSAVMHKMRDSFFYLL 308

Query: 380 STKTGDLVLLTVVYDGRV-VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
               GDL+ LT+  D    V+ L +   +     + +  + + L F G   G+  L QF 
Sbjct: 309 QNSDGDLLKLTMELDDHSQVKELRIKYFDTIPFAAILNILKSGLLFAGCEGGNHHLYQFE 368

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ 498
                S+     + EF         +K   +  +  L  + N   L    S    T++  
Sbjct: 369 -----SLAIDDDEPEFSSANFSEEQSKHSPKKLTYKLHPLQNISLLDEIPSLFPLTDAIV 423

Query: 499 KTFSFAVRDSLVNI-GPLKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHK 556
              S      L  + G  K+ S  L +    SAT +       L ELPG    IWTV  K
Sbjct: 424 TRTSTDANSQLYTLCGRHKEASLRL-LKRGVSATEVV------LSELPGAPIAIWTVKQK 476

Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
                          +D Y  Y+++S    T+VL   + + EV +S        T+    
Sbjct: 477 --------------LNDPYDKYMVLSFTNGTLVLSIGETVEEVLDS-GLLSSVSTLNVRQ 521

Query: 617 LFGRRRVIQVFERGARILDGS 637
           L GR  V+Q+  +G R +  +
Sbjct: 522 L-GRSSVVQIHSKGIRCISAN 541


>gi|350629921|gb|EHA18294.1| damage-specific DNA binding protein [Aspergillus niger ATCC 1015]
          Length = 1140

 Score = 43.5 bits (101), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R      LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 284

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
                W Q D    LL+   G L  L +V D    VQ   L     +   S +  +G  +
Sbjct: 285 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 344

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++   GS
Sbjct: 345 IFVGSHQGDSQVLRIGNGS 363


>gi|134077422|emb|CAK45676.1| unnamed protein product [Aspergillus niger]
          Length = 1133

 Score = 43.5 bits (101), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R      LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
                W Q D    LL+   G L  L +V D    VQ   L     +   S +  +G  +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++   GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347


>gi|317031116|ref|XP_001392900.2| UV-damaged DNA binding protein [Aspergillus niger CBS 513.88]
          Length = 1124

 Score = 43.5 bits (101), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R      LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
                W Q D    LL+   G L  L +V D    VQ   L     +   S +  +G  +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++   GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347


>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1158

 Score = 43.1 bits (100), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 44/144 (30%), Positives = 65/144 (45%), Gaps = 25/144 (17%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            L   A  L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V
Sbjct: 250 ELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV 298

Query: 363 ELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITT 417
                   W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  
Sbjct: 299 -------AWEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVY 349

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +G  + F+GS  GD  L++ T GS
Sbjct: 350 MGGGITFIGSHQGDPELIRITEGS 373


>gi|195126264|ref|XP_002007593.1| GI12293 [Drosophila mojavensis]
 gi|193919202|gb|EDW18069.1| GI12293 [Drosophila mojavensis]
          Length = 1227

 Score = 43.1 bits (100), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525


>gi|328700785|ref|XP_001945395.2| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
          Length = 1072

 Score = 43.1 bits (100), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 41/202 (20%), Positives = 88/202 (43%), Gaps = 38/202 (18%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
           ++  +++D  F++G+  P ++I++E                  +A+  +  +K+      
Sbjct: 188 MEETNIQDIGFLYGFTNPTIIIIYE------------------NAMGRTIKIKKIIDSKK 229

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
             ++  +A  ++ VPSP+ G +++G N+I YH  + SC +              LP    
Sbjct: 230 YKSIEKEASMVIPVPSPLCGAIIIGENSIFYH--NGSCNII------------RLPIRQ- 274

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSDI 415
            +E+       L+    LL   +G L++L + Y+  +     V  L L       +   +
Sbjct: 275 KIEIVCYTRVDLEGTRYLLGDHSGCLLMLFLKYEKTLNGKFKVTDLYLRYFGEISIPISL 334

Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
           T + N + ++ S+ GDS L++ 
Sbjct: 335 TYLDNKVIYVASKFGDSQLIKL 356


>gi|195376606|ref|XP_002047087.1| GJ13230 [Drosophila virilis]
 gi|194154245|gb|EDW69429.1| GJ13230 [Drosophila virilis]
          Length = 1229

 Score = 43.1 bits (100), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525


>gi|346970653|gb|EGY14105.1| hypothetical protein VDAG_00787 [Verticillium dahliae VdLs.17]
          Length = 1160

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 61/136 (44%), Gaps = 13/136 (9%)

Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVSLDSS----QELPRSSFSV 362
           L +P P    L+    V ++   YH +  + A A L    V+ ++       L +   S 
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280

Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            LD      +W + DV   LL+   G+L LLT+  DG +V  L L     +   S +  +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340

Query: 419 GNSLFFLGSRLGDSLL 434
           GN + FLGS  GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356


>gi|195012560|ref|XP_001983703.1| GH16029 [Drosophila grimshawi]
 gi|193897185|gb|EDV96051.1| GH16029 [Drosophila grimshawi]
          Length = 1228

 Score = 43.1 bits (100), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525


>gi|154320780|ref|XP_001559706.1| hypothetical protein BC1G_01862 [Botryotinia fuckeliana B05.10]
          Length = 238

 Score = 42.7 bits (99), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 45/182 (24%), Positives = 73/182 (40%), Gaps = 36/182 (19%)

Query: 57  NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
           NLVV  +++++I+  +        + E+ S  +K+      RV  D G+           
Sbjct: 28  NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87

Query: 98  -------AASLELVCHYRLHGNVESL----AILSQGGADNSRRRDSIILAFEDAKISVLE 146
                     L LV  Y L G V SL     I S+ G +      +I++ F+DAK+S++E
Sbjct: 88  QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           +D    G+   S+H +E  E                + VDP  RC  +      + IL  
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201

Query: 207 SQ 208
            Q
Sbjct: 202 KQ 203


>gi|198420618|ref|XP_002125906.1| PREDICTED: similar to Splicing factor 3B subunit 3
           (Spliceosome-associated protein 130) (SAP 130)
           (Pre-mRNA-splicing factor SF3b 130 kDa subunit)
           (SF3b130) (STAF130) [Ciona intestinalis]
          Length = 1216

 Score = 42.7 bits (99), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 96/463 (20%), Positives = 163/463 (35%), Gaps = 115/463 (24%)

Query: 304 LPHDAYKLLAVP---SPIGGVLVVGANTIHYHS--QSASCALALNNYAVSLDSSQELPRS 358
           L   A  L++VP      GGVLV   N I Y +          +      LD     P  
Sbjct: 228 LEERANHLISVPGGNDGPGGVLVCAENYITYKNFGDQPDIRTPIPRRRNDLDD----PER 283

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
              V   A H T       L+ T+ GD+  +T+  D  +V  + L   +   ++  +  +
Sbjct: 284 GMIVVCSATHKTKSMF-FFLIQTEQGDIFKVTLETDEDMVTEIRLKYFDTVPVSMAMCVL 342

Query: 419 GNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPSTKR--------- 466
                F+ + +G+  L Q          +  SS +  E GD    AP   R         
Sbjct: 343 RTGFLFVAAEMGNHCLYQIAHLGDDDDETEFSSAMPLEEGDTFFYAPRALRNLVLVDELD 402

Query: 467 -LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
            L    +  + D+ N +   LY +      S+                 L+   +GL + 
Sbjct: 403 SLSPIMTCLISDLANEDTPQLYVTCGRGPRSS-----------------LRVLRHGLEV- 444

Query: 526 ADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
                      S   + ELPG    +WTV  K               ++E+ +Y+I+S  
Sbjct: 445 -----------SEMAVSELPGNPNAVWTVKIKE--------------EEEFDSYIIVSFV 479

Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS------- 637
             T+VL   + + EVT+S   F+      + +L G   ++QV+  G R +          
Sbjct: 480 NATLVLSIGETVEEVTDS--GFLGTTPTLSCSLLGENALVQVYPDGIRHIRADKRVNEWK 537

Query: 638 ---------------------------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD-- 668
                                      Y   D S G  N  +     NS V+ + ++   
Sbjct: 538 TPGKKTILRCAVNQRQVVIALTGGELVYFEMDQS-GQLNEYTERKEMNSEVVCMDLSKVP 596

Query: 669 ------PYVLLGMSDGSIRLLVGDPSTC--TVSVQT-PAAIES 702
                  ++ +G++D ++R++  DP+ C   +S+Q  PA  ES
Sbjct: 597 PTEQRTRFLAVGLADNTVRIISLDPTDCLQPLSMQALPATPES 639


>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
          Length = 1116

 Score = 42.7 bits (99), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 44/196 (22%), Positives = 77/196 (39%), Gaps = 41/196 (20%)

Query: 504 AVRDSLVNIGPLKDF----------------SYGLR---INADASATGISKQSNYELVEL 544
            V D+  N+GP+ DF                S G +   +    +  GI++Q++   ++L
Sbjct: 335 TVLDTFANLGPIPDFCLVDIEKQGQNQIVACSGGFKEGSLRVIRNGIGITEQAS---IDL 391

Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           PG K IW++   S R                  YLI+S  + T VLE      E TE   
Sbjct: 392 PGIKAIWSLARGSDR------------------YLILSFISSTKVLEFQGEDIEETEIAG 433

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
           + +Q  T+  GN+   ++++Q+   G  ++D         + PS+      S     + +
Sbjct: 434 FDLQSPTLYCGNV-ADKQILQISTSGIYLVDHETNLNYDVWKPSSGSINLASHQGNQILI 492

Query: 665 SIADPYVLLGMSDGSI 680
           S     +   + D  I
Sbjct: 493 SFGKTLIYFEIKDQKI 508


>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
           FGSC A4]
          Length = 1132

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 65/265 (24%), Positives = 108/265 (40%), Gaps = 31/265 (11%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G    +DP GR   + +Y   ++++   Q  S   G +    +G       E    I  R
Sbjct: 117 GSRCMIDPSGRFMTLEIYDGMIVVIPIIQLPSKRRGRQVALPTGPDAPRIGELGEPIITR 176

Query: 240 DLDMKHVKDFIFVHGYI-EPVMVILHE-RELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            +D   V+   F+H     P + +L+E  +     +V    ++    A S  T++  +  
Sbjct: 177 -IDELFVRSSAFLHVQAGSPRLALLYEDNQKKVKLKVRELKYSTAAGAESEFTSIADY-- 233

Query: 298 IWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
              A  L   A  L+ VP+P+   GG+L++G  +I Y         A NN  VS    Q 
Sbjct: 234 ---AQELDLGASHLIPVPAPLAAAGGLLILGETSIKYVD-------ADNNEIVS----QP 279

Query: 355 LPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT 412
           L  ++  V        W Q D    LL+   G L  L +V     V+R +L     +   
Sbjct: 280 LEEATIFV-------AWEQVDSQRWLLADDYGRLFFLMLVLRNSEVERWELHSLGNTSRA 332

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
           S +  +G  + F+GS  GDS +++ 
Sbjct: 333 SVLVYLGGGVVFVGSHQGDSQVIRI 357


>gi|383847297|ref|XP_003699291.1| PREDICTED: splicing factor 3B subunit 3-like [Megachile rotundata]
          Length = 1217

 Score = 42.4 bits (98), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
                       SS +  E GD    AP   R  +     D+L  ++  +   L   A+ 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           +T     T     R +L      +   +GL +            S   + ELPG    +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  +               D+EY AY+I+S    T+VL   + + EVT+S   F+    
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504

Query: 612 IAAGNLFGRRRVIQVFERGAR 632
             + +  G   ++QV+  G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525


>gi|66553024|ref|XP_623333.1| PREDICTED: splicing factor 3B subunit 3 isoform 1 [Apis mellifera]
 gi|380015815|ref|XP_003691890.1| PREDICTED: splicing factor 3B subunit 3-like [Apis florea]
          Length = 1217

 Score = 42.4 bits (98), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
                       SS +  E GD    AP   R  +     D+L  ++  +   L   A+ 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           +T     T     R +L      +   +GL +            S   + ELPG    +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  +               D+EY AY+I+S    T+VL   + + EVT+S   F+    
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504

Query: 612 IAAGNLFGRRRVIQVFERGAR 632
             + +  G   ++QV+  G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525


>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
           ATCC 10500]
 gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
           ATCC 10500]
          Length = 1140

 Score = 42.4 bits (98), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 63/137 (45%), Gaps = 21/137 (15%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G   I Y   +       NN  +    S+ L  ++  V     
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKYIDDA-------NNETI----SRPLDEATIFV----- 288

Query: 368 HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSLFF 424
              W+Q D    LL+   G L  L +V D R  V+   +     +   S +  +G  + F
Sbjct: 289 --AWVQVDGQRWLLADDYGRLFFLMLVLDSRNEVEGWKIDYLGSASRASVLIYLGAGMTF 346

Query: 425 LGSRLGDSLLVQFTCGS 441
           +GS  GDS +++ + GS
Sbjct: 347 IGSHQGDSQVIRISEGS 363


>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
 gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
          Length = 1064

 Score = 42.0 bits (97), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 128/565 (22%), Positives = 214/565 (37%), Gaps = 117/565 (20%)

Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
           ++G +  L + S GG     ++D + ++ E  K  +L +D     +   +    E     
Sbjct: 14  IYGRISVLKLFSAGG-----KQDYLFISTESFKFCILAYDSEKKEIVTKASGNAED---- 64

Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
               GR + A G L  +DP GR          +I L   +G   L+  E       G + 
Sbjct: 65  --TIGRPTEA-GQLGIIDPDGR----------LIALHLYEGLLKLINIEK------GLNN 105

Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
            I+ +   N R L+   V D  F++G   P + +L      +      KH    I    +
Sbjct: 106 PIQKTAA-NTR-LEELQVMDMTFLYGCKIPTIAVL------FKDTKDEKH----IVTYEV 153

Query: 289 STTLKQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
           S   ++  P  WS  N+    Y  + V  P+GGVLVV  N I Y +   + ++A      
Sbjct: 154 SQKDQELCPGPWSQSNV--GVYSSMLVAVPLGGVLVVADNGITYMNGRTTRSIA------ 205

Query: 348 SLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                  +P + F     +D   + +L  D        G L +L ++   + V  L    
Sbjct: 206 -------IPYTKFLAYDRVDKDGSRYLFGD------HFGRLSVLVLLNHQQRVTELKFET 252

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + + + F+GS  GDS L++                   + E D P+T 
Sbjct: 253 LGRTSIPSSISYLDSGVVFIGSSSGDSQLIRL------------------NTEKD-PATD 293

Query: 466 RLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVNIGPLKDFSYGLR 523
               S    L++  N G  +      +     AQ  T S   RD     G L+    G+ 
Sbjct: 294 ----SYISHLENFTNIGPIVDFCLVDTEKQGQAQIVTCSGTYRD-----GTLRVIRNGI- 343

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
                   GI++++   L+EL G KG+W +        N  S  +   D     YLI+S 
Sbjct: 344 --------GIAEKA---LIELEGVKGLWPI------KENDPSDPLNPKD----QYLIVSF 382

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
              T VL+      E TE         TI   N+     ++QV  +   +++  ++   D
Sbjct: 383 IGYTKVLQFQGEEIEETEFEGLDSNSSTILCSNIDKENVIVQVTNQAINLINPITFKRVD 442

Query: 643 LSFGPSNSESGSGSENSTVLSVSIA 667
               PS S     S N + +++SI 
Sbjct: 443 QWKSPSGSPINLVSSNQSQIALSIG 467


>gi|340721347|ref|XP_003399083.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus terrestris]
 gi|350406701|ref|XP_003487854.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus impatiens]
          Length = 1217

 Score = 42.0 bits (97), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
                       SS +  E GD    AP   R  +     D+L  ++  +   L   A+ 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           +T     T     R +L      +   +GL +            S   + ELPG    +W
Sbjct: 419 DTPELYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  +               D+EY AY+I+S    T+VL   + + EVT+S   F+    
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504

Query: 612 IAAGNLFGRRRVIQVFERGAR 632
             + +  G   ++QV+  G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525


>gi|451818558|ref|YP_007454759.1| succinate dehydrogenase/fumarate reductase, flavoprotein subunit
            [Clostridium saccharoperbutylacetonicum N1-4(HMT)]
 gi|451784537|gb|AGF55505.1| succinate dehydrogenase/fumarate reductase, flavoprotein subunit
            [Clostridium saccharoperbutylacetonicum N1-4(HMT)]
          Length = 493

 Score = 42.0 bits (97), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 52/112 (46%), Gaps = 7/112 (6%)

Query: 1057 RTYTVEEYEVRILEPDRAGGPWQTRAT-IPMQSSENALTVRVVTLFNTTTK----ENETL 1111
            R Y+  E+   +++P+  G P    AT +  + +E A  + V  +F T       EN   
Sbjct: 122  RAYSDGEFTWHVVKPEGGGVPGPRAATTMTKRMTEKARELGVEIIFETPVNKIIMENGEA 181

Query: 1112 LAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQIDFA 1163
            + +      GE++ ARG+ ++ +TG   DNPQ  ++    G  F     +FA
Sbjct: 182  VGVIAKNKAGEEIEARGKAVILATGGFGDNPQ--MIKEETGYEFGKTIFNFA 231


>gi|121699866|ref|XP_001268198.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
           1]
 gi|119396340|gb|EAW06772.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
           1]
          Length = 1140

 Score = 42.0 bits (97), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 25/136 (18%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+L++G  +I Y               V  D+++ + R      LD A   
Sbjct: 248 LIPVPAPLGGLLILGETSIKY---------------VDADNNEIISRP-----LDEATIF 287

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFL 425
             W Q D    LL+   G L  L +V D    V+   L     +   S +  +G  + FL
Sbjct: 288 VAWEQVDSQRWLLADDYGRLFFLMLVLDSDNQVESWKLDLLGKTSRASVLVYLGGGVLFL 347

Query: 426 GSRLGDSLLVQFTCGS 441
           GS  GDS +++ + GS
Sbjct: 348 GSHQGDSQVLRISNGS 363


>gi|242018509|ref|XP_002429717.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
 gi|212514723|gb|EEB16979.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
          Length = 1218

 Score = 42.0 bits (97), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 68/323 (21%), Positives = 122/323 (37%), Gaps = 65/323 (20%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVATSMCVMKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E GD    AP           AL+++V  +E+      S + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPR----------ALRNLVQVDEMD-----SLSP 406

Query: 495 ESAQKTFSFAVRDS-----LVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG 546
             A +    A  D+     L   GP   L+   +GL +            S   + ELPG
Sbjct: 407 IMACQVADLANEDTPQLYMLCGRGPRSTLRVLRHGLEV------------SEMAVSELPG 454

Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
               +WTV  +               ++EY AY+I+S    T+VL   + + EVT+S   
Sbjct: 455 NPNAVWTVKRR--------------VEEEYDAYIIVSFVNATLVLSIGETVEEVTDS--G 498

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
           F+      + +  G   ++QV+  G R +              N     G +  T++  +
Sbjct: 499 FLGTTPTLSCSALGDDALVQVYPDGIRHIRADKRV--------NEWKAPGKK--TIMKCA 548

Query: 666 IADPYVLLGMSDGSIRLLVGDPS 688
           +    V++ ++ G +     DP+
Sbjct: 549 VNQRQVVIALTAGELVYFEMDPT 571


>gi|393212467|gb|EJC97967.1| hypothetical protein FOMMEDRAFT_162310 [Fomitiporia mediterranea
           MF3/22]
          Length = 1161

 Score = 42.0 bits (97), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 72/331 (21%), Positives = 128/331 (38%), Gaps = 73/331 (22%)

Query: 306 HDAYKLLAVPSPI-------GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
            D+  L+ VP  I       GGVLV+G +TI ++S         ++      S+ ++P++
Sbjct: 224 EDSNLLIPVPPQIKSSWNVNGGVLVLGGSTIAFYSIDRKQKKKNSSSQSKS-STSKIPQA 282

Query: 359 SFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
             +       A W Q D      LL    G L LL +      +  + L + +P    + 
Sbjct: 283 EVNWPYFDITA-WAQIDEDGLRYLLGDSFGRLALLAINPQYAYLDIVLLGEVSPP---TS 338

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           +T + +   ++GS  GDS L++ T    ++     + + F +I   AP    +   + D+
Sbjct: 339 LTPLASQYIYVGSHFGDSQLIRVTSERSSNGSYLEISDTFKNI---APIMDAVFEDTDDS 395

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
            Q  +    ++  G  S                     G L+    G   N DA   GI+
Sbjct: 396 GQPTI----ITCSGGEST--------------------GSLRVIRNGANFNEDARIEGIA 431

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--T 592
                         G+W +  +              YDD +H Y++++ +  T +LE   
Sbjct: 432 N-----------ITGMWPIRRQ--------------YDDTFHHYMLVTTDTNTHLLELPN 466

Query: 593 ADLLTEVTESVDY---FVQGRTIAAGNLFGR 620
           +   T V+ S D+    +  RT+ AGN+  R
Sbjct: 467 SQQETAVSRSNDFSDLTIDSRTLVAGNMLTR 497


>gi|302831461|ref|XP_002947296.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f.
           nagariensis]
 gi|300267703|gb|EFJ51886.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f.
           nagariensis]
          Length = 1221

 Score = 41.6 bits (96), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 71/353 (20%), Positives = 130/353 (36%), Gaps = 76/353 (21%)

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGAN 327
           A  ++ KH T     L ++  L++    W+   + + A  L+AVP      GGVLV   N
Sbjct: 199 AASMAQKHLTFYEMDLGLNNVLRK----WTE-PIDNGANLLVAVPGGADGPGGVLVCAEN 253

Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
            I Y +Q      A+      L   + +   S++     A++ +L      + ++ GD+ 
Sbjct: 254 FIIYKNQDHEEVRAVIPRRSDLPGDRGVLIVSYATHKKKAYSFFL------VQSEYGDIY 307

Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
            +T+ Y+G  V  L +   +     + I  +     F  S  G+  L QF          
Sbjct: 308 KVTLAYEGEAVTELKIKYFDTIPPCTSIAVLKTGFLFAASEYGNHALYQFV--------- 358

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
            G  E+  D+E           SSS AL     G +   +          +   +  + D
Sbjct: 359 -GTGEDDEDVE-----------SSSAALVQTEEGFQPVFF--------EPRPLKNLLLID 398

Query: 508 SLVNIGPLKDFS-----------------YGLRINADASATGISKQSNYELVELPGCK-G 549
            + ++ P+ D                   +G R +      G++  +   +  LPG    
Sbjct: 399 EMASLMPITDMKVANLLNEEIPQIYALCGHGPRASLSVLRPGLAV-TELAVSPLPGAPTA 457

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
           +WTV   ++              DE+ A++++S    T+V    + + E  ES
Sbjct: 458 VWTVRRNAT--------------DEFDAFIVVSFANATLVFSIGEEVKETNES 496


>gi|302820387|ref|XP_002991861.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
 gi|300140399|gb|EFJ07123.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
          Length = 1292

 Score = 41.6 bits (96), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 47/216 (21%), Positives = 85/216 (39%), Gaps = 38/216 (17%)

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFS----YGLRINADASATGISKQSNYELVE----- 543
             E  Q +F   V+    NI P+ DFS    YG + +   +  G  ++ +  ++      
Sbjct: 419 KVEDGQLSFQSFVQ----NIAPILDFSLVDYYGEKQDQMFACCGGDEEGSVRIIRNGNSV 474

Query: 544 ---------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                      G  GIWT+ ++              + D YHA+ +IS    T VL    
Sbjct: 475 EKLICTPPVYQGVSGIWTMRYR--------------FKDPYHAFFLISFVEETRVLSVGL 520

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              ++T++V +  Q  T+A G L     V QV+    ++   +          SN  S +
Sbjct: 521 NFVDITDAVGFESQVNTLACG-LVEDGWVAQVWRYEVKLCSPTKAAHPAGVSGSNPLSTT 579

Query: 655 GSENSTVLSV-SIADPYVLLGMSDGSIRLLVGDPST 689
             +    +SV ++    V+L ++   + L++G   T
Sbjct: 580 WRKPGYPISVGAVCRSRVILALARPGLLLMLGATQT 615


>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
          Length = 1128

 Score = 41.2 bits (95), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           I L +LD   V D  F++G+ +P + +L      +      +H      +L         
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           PL     NL   A  L+ VP+PIGGV+V+G  T+ Y S S      L  Y  S+  S  +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
            R+   ++ D            LL  + G L LL         T       +  L +   
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
             + L S I  + N   ++GS  GDS L++ 
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335


>gi|258570355|ref|XP_002543981.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
 gi|237904251|gb|EEP78652.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
          Length = 1209

 Score = 41.2 bits (95), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 65/265 (24%), Positives = 108/265 (40%), Gaps = 40/265 (15%)

Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
           LL T+ GDL  +T+  V D        V+RL L   +   + S +  + N   F+ S  G
Sbjct: 307 LLQTEDGDLFKVTIDMVEDDNGQPTGEVRRLKLKYFDTVPIASSLCILKNGFLFVASENG 366

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGD--IEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
           +    QF         +    ++F    +E  AP   R R + +  L + +N     +  
Sbjct: 367 NHHFYQFEKLGDDDEETEFTSDDFSSDPLEPLAPVYFRPRPAENLNLVESINSVNPLMSC 426

Query: 489 SASNNTES-AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
             +N TE  A + ++     +      LK   +GL +         S+    EL  +P  
Sbjct: 427 KVANLTEDDAPQLYTLCGTGARSTFRTLK---HGLEV---------SEIVESELPSVPSA 474

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
             +WT   K +R            +D+Y AY+I+S    T+VL   + + EVT++  +  
Sbjct: 475 --VWTT--KLTR------------NDQYDAYIILSFTNGTLVLSIGETVEEVTDT-GFLS 517

Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR 632
              T+A   L G   +IQV  +G R
Sbjct: 518 SAPTLAVQQL-GEDSLIQVHPKGIR 541


>gi|452824086|gb|EME31091.1| DNA damage-binding protein 1 isoform 2 [Galdieria sulphuraria]
          Length = 1150

 Score = 41.2 bits (95), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           I L +LD   V D  F++G+ +P + +L      +      +H      +L         
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           PL     NL   A  L+ VP+PIGGV+V+G  T+ Y S S      L  Y  S+  S  +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
            R+   ++ D            LL  + G L LL         T       +  L +   
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
             + L S I  + N   ++GS  GDS L++ 
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335


>gi|115397303|ref|XP_001214243.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114192434|gb|EAU34134.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 1140

 Score = 41.2 bits (95), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 60/146 (41%), Gaps = 25/146 (17%)

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS--LDSSQELPRS 358
           A  L   A  L+ VP+P+GG+L++G  +I Y           NN  VS  LD +      
Sbjct: 237 AQELDLGASHLIPVPAPLGGLLILGETSIKYVDDD-------NNEIVSRLLDEA------ 283

Query: 359 SFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
                       W Q D    LL+   G L  L +V D    VQ   L     +   S +
Sbjct: 284 -------TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSENQVQGWQLDHLGNTSRASTL 336

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGS 441
             +G  + F+GS  GDS +++   GS
Sbjct: 337 VYLGGGVIFVGSHQGDSQVLRVGDGS 362


>gi|171691144|ref|XP_001910497.1| hypothetical protein [Podospora anserina S mat+]
 gi|170945520|emb|CAP71632.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1158

 Score = 41.2 bits (95), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 43/188 (22%), Positives = 72/188 (38%), Gaps = 53/188 (28%)

Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA 341
           +I    +   +K+H    +    PH           +GGV+VVG   + Y          
Sbjct: 233 LIPVRKVEEEVKRHNFRNTGSAKPH-----------LGGVIVVGETRLLY---------- 271

Query: 342 LNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
                       ++ +++   +LD A     W + +V    L+   G L LLT+  DG  
Sbjct: 272 ----------IDDVTKATVESKLDKASIFVKWAEYNVQTYFLADDYGSLHLLTINTDGAE 321

Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
           V+ + L+K   +   S++  +GN + F+ S  GDS L Q                   D+
Sbjct: 322 VKGMVLTKIGVTSRASELVYLGNEMLFVASHHGDSRLFQL------------------DL 363

Query: 458 EADAPSTK 465
            AD P+ K
Sbjct: 364 SADKPADK 371


>gi|406602265|emb|CCH46158.1| Pre-mRNA-splicing factor [Wickerhamomyces ciferrii]
          Length = 1123

 Score = 40.8 bits (94), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 100/255 (39%), Gaps = 81/255 (31%)

Query: 493 NTESAQKTFSFA-VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE--LPG-CK 548
           N ++  K +S + V+DS      LK   YGL IN              E+VE  LPG   
Sbjct: 406 NDDAFTKIYSLSGVKDS----SSLKILQYGLSIN--------------EIVESDLPGIAN 447

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            +WT                   +DE+  YL+IS    T+VL   + + E+T+S    + 
Sbjct: 448 KVWTTKLNK--------------NDEFDKYLVISFMDTTLVLSIGENVEEITDS-GLALN 492

Query: 609 GRTIAAGNLFGRRRVIQVFERGAR------------------ILDGSYMTQDLSFGPSNS 650
             TI    + G   ++Q+   G R                  IL  S   + ++ G SN 
Sbjct: 493 EETIGIQQI-GINSLVQIHSNGIRNIKNGELINEWQPPAGIKILTTSTTNRQIAIGLSND 551

Query: 651 E---------------SGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDP 687
           E               +      S ++S+S+ D        P++++G  D +IR+L  DP
Sbjct: 552 ELVYFEVDDRDRLIEYNERKELTSRIVSLSLGDIPEGRLRSPFLIVGCQDSTIRVLSTDP 611

Query: 688 STC--TVSVQTPAAI 700
            +    +S+Q  ++I
Sbjct: 612 GSTLELLSLQALSSI 626


>gi|1399512|gb|AAC47162.1| repE [Dictyostelium discoideum]
          Length = 1139

 Score = 40.8 bits (94), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V N+R L+   V D  F++G   P + +L +           + H       S  T L 
Sbjct: 150 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 201

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             P  WS  N+    Y  L VP P+GGVLVV  N I Y +   + ++A+ +Y   L  ++
Sbjct: 202 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 256

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
                   V+ D +          L     G L +L +++  + V  L   +     + S
Sbjct: 257 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 300

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            I+ + + + ++GS  GDS L++ 
Sbjct: 301 SISYLDSGVVYIGSSSGDSQLIRL 324


>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
           phaseolina MS6]
          Length = 1129

 Score = 40.8 bits (94), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 62/261 (23%), Positives = 102/261 (39%), Gaps = 30/261 (11%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP GR   + +Y   + ++  ++ G    GD +    G    +RIE   V +   L  K
Sbjct: 121 LDPTGRFMTLELYEGIVTVVPLTEKGKR-KGDPEVSALGEPVPSRIEEMFVRSSAFLHRK 179

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
             +         +P++ +L+E +     R+  +      +     +     P+      L
Sbjct: 180 SPESE-------KPLVALLYEEDEDSKIRLRLRQLAFQTAGTEEQSVAALEPVEGLKEEL 232

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
              A  L+ VP P  GVLV+G   I Y           N+Y  +L   + L  S+  V  
Sbjct: 233 DLGASHLIPVPGPCYGVLVLGETCITY----------FNDYTKAL-VKKPLQDSTIFV-- 279

Query: 365 DAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGR--VVQRLDLSKTNPSVLTSDITTIGN 420
                 W Q  N   LL+   G L L  ++ D    VV+   L K   +   S +  +  
Sbjct: 280 -----AWEQIDNQRFLLADDFGGLYLFMLLLDDNSGVVEGWRLDKIGETSRASVLVYLDA 334

Query: 421 SLFFLGSRLGDSLLVQFTCGS 441
              F+GS  GDS +++ T GS
Sbjct: 335 GHVFVGSHEGDSQVIRITEGS 355


>gi|347829304|emb|CCD45001.1| similar to pre-mRNA-splicing factor rse1 [Botryotinia fuckeliana]
          Length = 1212

 Score = 40.8 bits (94), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 85/377 (22%), Positives = 151/377 (40%), Gaps = 52/377 (13%)

Query: 320 GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDV 376
           GVLV G + I Y HS   +  +A+     + +  Q        V  +L  A   +     
Sbjct: 253 GVLVCGEDNITYRHSNQEAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF---- 308

Query: 377 ALLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
            LL T  GDL  +T+  V D        V+RL +   +   + + +  + +   F+ S  
Sbjct: 309 FLLQTDDGDLFKITIEMVEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEF 368

Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLY 487
           G+    QF         +  + ++F  G  E+  P     R + + +L + ++     + 
Sbjct: 369 GNHQFYQFEKLGDDDEETEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMD 428

Query: 488 GSASNNT-ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
              +N T E A + +S     +      LK   +GL ++    +            ELPG
Sbjct: 429 CKVANLTDEDAPQIYSICGTGARSTFRTLK---HGLEVSEIVES------------ELPG 473

Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
               +WT   K +RG            D Y AY+I+S    T+VL   + + EVT++  +
Sbjct: 474 VPSAVWTT--KLTRG------------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GF 518

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
                T+A   L G   +IQV  +G R +   +   + +  P +    + + N   ++V+
Sbjct: 519 LSSAPTLAVQQL-GEDSLIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVA 576

Query: 666 IAD-PYVLLGM-SDGSI 680
           ++    V   M SDGS+
Sbjct: 577 LSSGEIVYFEMDSDGSL 593


>gi|313235544|emb|CBY10999.1| unnamed protein product [Oikopleura dioica]
          Length = 1185

 Score = 40.8 bits (94), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 87/428 (20%), Positives = 157/428 (36%), Gaps = 105/428 (24%)

Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE------LDAAHATWL 372
           GGV+V   N + Y            N+    D    +PR    ++      +  AHAT  
Sbjct: 246 GGVIVCAENYLIY-----------KNFGDQPDIRFPIPRRRNDLDDPERGMIIVAHATHK 294

Query: 373 QNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
              +   LL T+ GDL  +T+  +  +V  + L   +   ++S +  +     F+    G
Sbjct: 295 TRSMFFFLLQTEQGDLFKVTLETEEDIVTEIRLKYFDTVPVSSSLCVLRTGFLFVAGEFG 354

Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRR----SSSDALQDMVNGEELS 485
           +  L Q T  G           E   + E    + + LR        D+L  ++N E   
Sbjct: 355 NHNLYQITRLGEDDDEPEFSSAEPLEEGETFFFTPRGLRNLALTDEMDSLSPVLNCEVAD 414

Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
           L   A+ +T     T     R +L      +   +GL +            S   + ELP
Sbjct: 415 L---ANEDTPQLYVTCGRGPRSTL------RVLRHGLEV------------SEMAVSELP 453

Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           G    +WTV                + D ++ +Y+I+S    T+VL   + + E+T+S  
Sbjct: 454 GNPNAVWTV--------------KTSADADHDSYIIVSFVNATLVLSIGETVEEITDS-G 498

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGAR-------------------------------I 633
           +     T+++G L G   ++Q++  G R                                
Sbjct: 499 FLGTTPTLSSG-LMGEDALVQIYPEGIRHIRSDRRVNEWRAPDRKQIVRCACNRQQVVIA 557

Query: 634 LDGS---YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
           L G    Y   D + G  N  +      S ++++ + D         ++ +G+SDG++R+
Sbjct: 558 LTGGEIVYFEMDPT-GQLNEYTERREFGSEIIALDVGDVPAGEQRCRFLAVGLSDGTVRI 616

Query: 683 LVGDPSTC 690
           +  DP+ C
Sbjct: 617 ISLDPNDC 624


>gi|427798971|gb|JAA64937.1| Putative damage-specific dna binding complex subunit ddb1, partial
           [Rhipicephalus pulchellus]
          Length = 1259

 Score = 40.8 bits (94), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 60/268 (22%), Positives = 103/268 (38%), Gaps = 57/268 (21%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ +  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTIPVAASMCVLKTGFLFVAAEFGNHCLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E GD    AP           AL++++  EEL     A   T
Sbjct: 362 ARLGEEDEEPEFSSAIPLEEGDTFFFAPR----------ALRNLLPVEELDSLSPAMGCT 411

Query: 495 ------ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELP 545
                 E   + +    R      GP   ++   +GL +            S   + ELP
Sbjct: 412 IADLANEDTPQLYVACGR------GPRSCIRVLRHGLEV------------SEMAVSELP 453

Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           G    +WTV  K+              D++Y AY+I+S    T+VL   + + EVT+S  
Sbjct: 454 GNPNAVWTVKRKA--------------DEDYDAYIIVSFVNATLVLSIGETVEEVTDS-G 498

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           +     T++   + G   ++QV+  G R
Sbjct: 499 FLGTTPTLSCAQI-GDDALVQVYPEGIR 525


>gi|154295205|ref|XP_001548039.1| pre-mRNA splicing factor 3b [Botryotinia fuckeliana B05.10]
          Length = 1020

 Score = 40.4 bits (93), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 129/600 (21%), Positives = 226/600 (37%), Gaps = 94/600 (15%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + + G + ++A     G++    +D II+  +  +I+++EF  + +      +  F    
Sbjct: 62  HDVFGIIRAIAAFRLAGSN----KDYIIITSDSGRITIVEFVPAQNKFNRLHLETFG--- 114

Query: 167 WLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
               K G      G  + VDP+GR      V    L  ++ + SQ        E T  S 
Sbjct: 115 ----KSGVRRVVPGQYLAVDPKGRACLTASVEKNKLVYVLNRNSQA-------ELTISSP 163

Query: 224 GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHH 279
               A    + V  L  LD+          GY  PV   L     E +    G+ ++   
Sbjct: 164 --LEAHKAQTLVFALVALDV----------GYANPVFAALEIDYGESDQDPTGQ-AYDEI 210

Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHY-HSQS 335
              +    +   L      WS   +   A  L  VP       GVLV G + I Y HS  
Sbjct: 211 EKQLVYYELDLGLNHVVRKWSE-PVDRTANILFQVPGGTDGPSGVLVCGEDNITYRHSNQ 269

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV-- 391
            +  +A+     + +  Q        V  +L  A   +      LL T  GDL  +T+  
Sbjct: 270 EAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF----FLLQTDDGDLFKITIEM 325

Query: 392 VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
           V D        V+RL +   +   + + +  + +   F+ S  G+    QF         
Sbjct: 326 VEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEFGNHQFYQFEKLGDDDEE 385

Query: 447 SSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSF 503
           +  + ++F  G  E+  P     R + + +L + ++     +    +N T E A + +S 
Sbjct: 386 TEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMDCKVANLTDEDAPQIYSI 445

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHN 562
               +      LK   +GL ++    +            ELPG    +WT   K +RG  
Sbjct: 446 CGTGARSTFRTLK---HGLEVSEIVES------------ELPGVPSAVWTT--KLTRG-- 486

Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
                     D Y AY+I+S    T+VL   + + EVT++  +     T+A   L G   
Sbjct: 487 ----------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GFLSSAPTLAVQQL-GEDS 534

Query: 623 VIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD-PYVLLGM-SDGSI 680
           +IQV  +G R +   +   + +  P +    + + N   ++V+++    V   M SDGS+
Sbjct: 535 LIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSL 593


>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei
           ATCC 18224]
 gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei
           ATCC 18224]
          Length = 1139

 Score = 40.4 bits (93), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G   I Y                 +D ++     + S  LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKY-----------------IDDAK---NETISNPLDEA 284

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSL 422
                W+Q D    LL+   G L  L +V D +  V+   L     +   S +  +G  +
Sbjct: 285 TIFVAWVQVDGQRWLLADDYGRLFFLMLVLDSQNEVEGWKLDYLGEASRASVLIYLGAGM 344

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++ + GS
Sbjct: 345 TFIGSHQGDSQVIRISEGS 363


>gi|166240328|ref|XP_637896.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
 gi|238064940|sp|B0M0P5.1|DDB1_DICDI RecName: Full=DNA damage-binding protein 1; AltName: Full=DNA
           repair protein E; AltName: Full=UV-damaged DNA-binding
           protein 1
 gi|165988543|gb|EAL64385.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
          Length = 1181

 Score = 40.4 bits (93), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V N+R L+   V D  F++G   P + +L +           + H       S  T L 
Sbjct: 192 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 243

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             P  WS  N+    Y  L VP P+GGVLVV  N I Y +   + ++A+ +Y   L  ++
Sbjct: 244 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 298

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
                   V+ D +          L     G L +L +++  + V  L   +     + S
Sbjct: 299 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 342

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            I+ + + + ++GS  GDS L++ 
Sbjct: 343 SISYLDSGVVYIGSSSGDSQLIRL 366


>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
          Length = 1504

 Score = 40.4 bits (93), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 21/135 (15%)

Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD 365
           H+        + +GG+LVVG   + Y   +  C +             E+P  + S+   
Sbjct: 562 HNVRNTATATANLGGLLVVGETRLLYIDSTTKCTV-------------EVPLRAASI--- 605

Query: 366 AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN-SL 422
                W + D    LL+ + G L LLT++  G VV  LD+S    +   S +  + +  L
Sbjct: 606 --FVAWARYDATHYLLADEYGTLHLLTILVSGAVVDNLDVSPIGKTSRASCLVYLPDRRL 663

Query: 423 FFLGSRLGDSLLVQF 437
            F+GS  GDS L + 
Sbjct: 664 LFVGSHNGDSQLFRL 678


>gi|380481704|emb|CCF41690.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
          Length = 932

 Score = 40.4 bits (93), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 14/144 (9%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS-----QELPRSSFS 361
           D Y  + +P PI     V      YH +  + A A     + +  +       L R+   
Sbjct: 179 DPYARIVIPVPI-----VEDEVKRYHKRDTTGAKAQLGGLIVVGETLLVYVDTLTRTVVE 233

Query: 362 VELD--AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
             L+  A    W   D     LS   G+L LLT+  +G VV  L L     +   S +  
Sbjct: 234 SGLNSPAIFVAWAAYDDTNYFLSDDYGNLHLLTIETEGVVVTNLSLRLLGVTSRASCLVH 293

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +GN L FLGS  GDS L+Q    S
Sbjct: 294 MGNGLLFLGSHYGDSQLLQINMES 317


>gi|70992271|ref|XP_750984.1| UV-damaged DNA binding protein [Aspergillus fumigatus Af293]
 gi|66848617|gb|EAL88946.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
           Af293]
 gi|159124553|gb|EDP49671.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
           A1163]
          Length = 1140

 Score = 40.0 bits (92), Expect = 7.9,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 60/135 (44%), Gaps = 25/135 (18%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+L++G  +I Y               V  D+++ + R      LD A   
Sbjct: 248 LIPVPAPLGGLLILGEMSIKY---------------VDADNNEIISRP-----LDEATIF 287

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFL 425
             W Q D    LL+   G L  L +V D    V+   L     +   S +  +G  + FL
Sbjct: 288 VAWEQVDSQRWLLADDYGRLFFLMLVLDSDSQVESWKLDHLGNTSRASVLVYLGGGILFL 347

Query: 426 GSRLGDSLLVQFTCG 440
           GS  GDS +++ + G
Sbjct: 348 GSHQGDSQVLRISNG 362


>gi|238491136|ref|XP_002376805.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
           NRRL3357]
 gi|220697218|gb|EED53559.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
           NRRL3357]
          Length = 1117

 Score = 40.0 bits (92), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 62/143 (43%), Gaps = 19/143 (13%)

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
           A  L   A  L+ VP+P+GG+L++G  +I Y           NN  +    S+ L  ++ 
Sbjct: 215 AQELDLGASHLIPVPAPLGGLLILGETSIKYVDDD-------NNEII----SRPLDEATI 263

Query: 361 SVELDAAHAT-WLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTI 418
            V  +   +  W      LL+   G L  L +V D    VQ   L     +   S +  +
Sbjct: 264 FVAWEGVDSQRW------LLADDYGRLFFLMLVLDSDNQVQGWKLDHLGNTSRASALVYL 317

Query: 419 GNSLFFLGSRLGDSLLVQFTCGS 441
           G  + F+GS  GDS +++   GS
Sbjct: 318 GGGIVFVGSHQGDSQVLRIGNGS 340


>gi|169773185|ref|XP_001821061.1| UV-damaged DNA binding protein [Aspergillus oryzae RIB40]
 gi|83768922|dbj|BAE59059.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1139

 Score = 39.7 bits (91), Expect = 9.5,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 62/143 (43%), Gaps = 19/143 (13%)

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
           A  L   A  L+ VP+P+GG+L++G  +I Y           NN  +    S+ L  ++ 
Sbjct: 237 AQELDLGASHLIPVPAPLGGLLILGETSIKYVDDD-------NNEII----SRPLDEATI 285

Query: 361 SVELDAAHAT-WLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTI 418
            V  +   +  W      LL+   G L  L +V D    VQ   L     +   S +  +
Sbjct: 286 FVAWEGVDSQRW------LLADDYGRLFFLMLVLDSDNQVQGWKLDHLGNTSRASALVYL 339

Query: 419 GNSLFFLGSRLGDSLLVQFTCGS 441
           G  + F+GS  GDS +++   GS
Sbjct: 340 GGGIVFVGSHQGDSQVLRIGNGS 362


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.394 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 18,442,275,783
Number of Sequences: 23463169
Number of extensions: 787973245
Number of successful extensions: 1817905
Number of sequences better than 100.0: 634
Number of HSP's better than 100.0 without gapping: 337
Number of HSP's successfully gapped in prelim test: 297
Number of HSP's that attempted gapping in prelim test: 1813822
Number of HSP's gapped (non-prelim): 1826
length of query: 1192
length of database: 8,064,228,071
effective HSP length: 154
effective length of query: 1038
effective length of database: 8,745,867,341
effective search space: 9078210299958
effective search space used: 9078210299958
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)