BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 001853
         (1004 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
 gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
          Length = 1461

 Score = 1628 bits (4216), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 792/1028 (77%), Positives = 886/1028 (86%), Gaps = 30/1028 (2%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELP-SKRGIGPVPNLV 59
            MS+AAYKM+HWPTGI +C SG+ITHSRAD+VPQIP IQT+ LDSE P SKRGIGP+PNL+
Sbjct: 1    MSYAAYKMLHWPTGIESCASGYITHSRADFVPQIPPIQTDNLDSEWPPSKRGIGPMPNLI 60

Query: 60   VTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL 119
            VTA +V+E+YVVRVQE+GS+ES++S ETKR  LMDG+S ASLELVCHYRLHGNVES+ +L
Sbjct: 61   VTAGSVLEVYVVRVQEDGSRESRSSRETKRGGLMDGVSGASLELVCHYRLHGNVESMVVL 120

Query: 120  SQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR 179
               G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEWLHLKRGRESFAR
Sbjct: 121  PTEGGDSSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFAR 180

Query: 180  GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
            GPL+KVDPQGRCGG+LVY +QMIIL+A+Q  SGLVGD+D   SGG  SAR++SS+VINLR
Sbjct: 181  GPLLKVDPQGRCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINLR 240

Query: 240  DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
            D+DMKHVKDFIF+H YIEPV+VILHERELTWAGRVSWKHHTCMISALSISTTLKQ  LIW
Sbjct: 241  DMDMKHVKDFIFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLIW 300

Query: 300  SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
            S +NLPHDAYKLLAVP PIGGVLV+ ANTIHYHS+SA+ ALALNNYAVS+DSSQELPR+S
Sbjct: 301  SVVNLPHDAYKLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRAS 360

Query: 360  FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIG 419
            FSVELDA  A WL NDVALLS K G+L+LL++VYDGRVVQRLDLSK+  SVLTSDITTIG
Sbjct: 361  FSVELDAVKAAWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTIG 420

Query: 420  NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
            NSLFFLGSRLGDSLLVQFT G G S++SSGLKEE G+IE D PS KRL+RS+SD LQDMV
Sbjct: 421  NSLFFLGSRLGDSLLVQFTNGLGPSVVSSGLKEEVGEIEGDVPSAKRLKRSASDGLQDMV 480

Query: 480  NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
            +GEELSLYGS +NNTESAQK+FSFAVRDSL+N+GPLKDFSYGLR N DASATGI+KQSNY
Sbjct: 481  SGEELSLYGSTANNTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSNY 540

Query: 540  EL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 573
            +L                          V+LPGC+GIWTVYHK++RGHN D S+MAA  D
Sbjct: 541  DLVCCSGHGKNGTLCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAAD 600

Query: 574  EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
            EYHAYLIIS+EARTMVLETADLL+EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI
Sbjct: 601  EYHAYLIISMEARTMVLETADLLSEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 660

Query: 634  LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
            LDGS+MTQDLS G SNSES  GSE++TV SVSIADPYVL+ M+DGSIRLL+GD STC VS
Sbjct: 661  LDGSFMTQDLSIGSSNSESSPGSESATVSSVSIADPYVLIKMTDGSIRLLIGDSSTCMVS 720

Query: 694  VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG---ADGGPLDQG 750
            + TP+A E+S++ VS+CTLYHDKGPEPWLRK STDAWLSTGV EAIDG   ADGGP DQG
Sbjct: 721  INTPSAFENSERSVSACTLYHDKGPEPWLRKASTDAWLSTGVSEAIDGAESADGGPHDQG 780

Query: 751  DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
            DIY +VCYESGALEIFDVPNFN VF+VDKFVSG+TH+ D Y+RE  KDS+ + N  SEE 
Sbjct: 781  DIYCIVCYESGALEIFDVPNFNRVFSVDKFVSGKTHLADAYVREPPKDSQEKTNRISEEV 840

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
             G GRKEN H+MK VELAMQRWS HHSRPFLF +LTDGTILCY AYLFE P+ TSK++D 
Sbjct: 841  AGLGRKENAHNMKAVELAMQRWSGHHSRPFLFGVLTDGTILCYHAYLFEAPDATSKTEDS 900

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
            VS    + + ++SASRLRNLRF R PLD+Y +EET     CQRITIF NISGHQGFFL G
Sbjct: 901  VSAQNPVGLGSISASRLRNLRFVRVPLDSYIKEETSTENSCQRITIFNNISGHQGFFLLG 960

Query: 931  SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
            SRP W MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG LKICQLPS S YD
Sbjct: 961  SRPAWFMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGNLKICQLPSFSNYD 1020

Query: 991  NYWPVQKV 998
            NYWPVQK+
Sbjct: 1021 NYWPVQKI 1028


>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Vitis vinifera]
          Length = 1442

 Score = 1627 bits (4214), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 778/1024 (75%), Positives = 871/1024 (85%), Gaps = 41/1024 (4%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI  IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
            TAAN++E+Y+VRVQE+ S+ES+ S ETKR  +M GIS A+LELVC YRLHGNVE++ +L 
Sbjct: 61   TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121  SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
            PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+   SG   SAR+ESS+VI+LRD
Sbjct: 181  PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240

Query: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301  AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360

Query: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
            SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+  SVLTS I  IGN
Sbjct: 361  SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
            SLFFLGSRLGDSLLVQF     TS+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421  SLFFLGSRLGDSLLVQF-----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475

Query: 481  GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
            GEELSLYGSA N+TE++QKTFSF+VRDS +N+GPLKDF+YGLRINAD  ATGI+KQSNYE
Sbjct: 476  GEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIAKQSNYE 535

Query: 541  L--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
            L                          VELPGCKGIWTVYHK++RGHNADS++MA  DDE
Sbjct: 536  LVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKMATKDDE 595

Query: 575  YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
            YHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+ RGARIL
Sbjct: 596  YHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYARGARIL 655

Query: 635  DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 694
            DG++MTQDL            SE+STVLSVSIADPYVLL MSDG+I+LLVGDPSTCTVS+
Sbjct: 656  DGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPSTCTVSI 705

Query: 695  QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS 754
              PA  ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG   DQGDIY 
Sbjct: 706  NIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQDQGDIYC 765

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
            VV YESG LEIFDVPNFNCVF+VDKF+SG  H+VDT + E  +D++  ++ +SEE   QG
Sbjct: 766  VVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSEEEADQG 825

Query: 815  RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
            RKEN H++KVVELAMQRWS  HSRPFLF ILTDGTILCY AYL+EGPE+T K+++ VS  
Sbjct: 826  RKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTEEAVSAQ 885

Query: 875  RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
             SLS+SNVSASRLRNLRF R PLD YTREE   G    R+T+FKNI G QG FLSGSRP 
Sbjct: 886  NSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFLSGSRPL 945

Query: 935  WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
            W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+YDNYWP
Sbjct: 946  WFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSSYDNYWP 1005

Query: 995  VQKV 998
            VQK+
Sbjct: 1006 VQKI 1009


>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
          Length = 1448

 Score = 1621 bits (4198), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 778/1030 (75%), Positives = 871/1030 (84%), Gaps = 47/1030 (4%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS+AAYKMMHWPTGI NC SGF+THSRAD+ PQI  IQT++L+SE P+KR IGP+PNL+V
Sbjct: 1    MSYAAYKMMHWPTGIENCASGFVTHSRADFAPQIAPIQTDDLESEWPTKRQIGPLPNLIV 60

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
            TAAN++E+Y+VRVQE+ S+ES+ S ETKR  +M GIS A+LELVC YRLHGNVE++ +L 
Sbjct: 61   TAANILEVYMVRVQEDDSRESRASAETKRGGVMAGISGAALELVCQYRLHGNVETMTVLP 120

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             GG DNSRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE PEW HLKRG ESFARG
Sbjct: 121  SGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWFHLKRGHESFARG 180

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
            PLVKVDPQGRC GVLVYGLQMIILKASQ G GLVGDE+   SG   SAR+ESS+VI+LRD
Sbjct: 181  PLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSARVESSYVISLRD 240

Query: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            LDMKHVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
Sbjct: 241  LDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            A+NLPHDAYKLL VPSPIGGV+V+ AN+IHYHSQSASCALALNNYAVS D+SQE+PRSSF
Sbjct: 301  AVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVSADNSQEMPRSSF 360

Query: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
            SVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGRVV RLDLSK+  SVLTS I  IGN
Sbjct: 361  SVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRASVLTSGIAAIGN 420

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
            SLFFLGSRLGDSLLVQF     TS+LSS +KEE GDIE D PS KRLR+SSSDALQDMVN
Sbjct: 421  SLFFLGSRLGDSLLVQF-----TSILSSSVKEEVGDIEGDVPSAKRLRKSSSDALQDMVN 475

Query: 481  GEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
            GEELSLYGSA N+TE++Q      KTFSF+VRDS +N+GPLKDF+YGLRINAD  ATGI+
Sbjct: 476  GEELSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGIA 535

Query: 535  KQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRM 568
            KQSNYEL                          VELPGCKGIWTVYHK++RGHNADS++M
Sbjct: 536  KQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTKM 595

Query: 569  AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
            A  DDEYHAYLIISLE+RTMVLETADLL EVTESVDY+VQG TI+AGNLFGRRRV+QV+ 
Sbjct: 596  ATKDDEYHAYLIISLESRTMVLETADLLGEVTESVDYYVQGCTISAGNLFGRRRVVQVYA 655

Query: 629  RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
            RGARILDG++MTQDL            SE+STVLSVSIADPYVLL MSDG+I+LLVGDPS
Sbjct: 656  RGARILDGAFMTQDLPI----------SESSTVLSVSIADPYVLLRMSDGNIQLLVGDPS 705

Query: 689  TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
            TCTVS+  PA  ESSKK +S+CTLYHDKGPEPWLRKTSTDAWLSTG+GEAIDGADG   D
Sbjct: 706  TCTVSINIPAVFESSKKSISACTLYHDKGPEPWLRKTSTDAWLSTGIGEAIDGADGAAQD 765

Query: 749  QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
            QGDIY VV YESG LEIFDVPNFNCVF+VDKF+SG  H+VDT + E  +D++  ++ +SE
Sbjct: 766  QGDIYCVVSYESGDLEIFDVPNFNCVFSVDKFMSGNAHLVDTLILEPSEDTQKVMSKNSE 825

Query: 809  EGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSD 868
            E   QGRKEN H++KVVELAMQRWS  HSRPFLF ILTDGTILCY AYL+EGPE+T K++
Sbjct: 826  EEADQGRKENAHNIKVVELAMQRWSGQHSRPFLFGILTDGTILCYHAYLYEGPESTPKTE 885

Query: 869  DPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFL 928
            + VS   SLS+SNVSASRLRNLRF R PLD YTREE   G    R+T+FKNI G QG FL
Sbjct: 886  EAVSAQNSLSISNVSASRLRNLRFVRVPLDTYTREEALSGTTSPRMTVFKNIGGCQGLFL 945

Query: 929  SGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
            SGSRP W MVFRER+RVHPQLCDGSIVAFTVLHN+NCNHG IYVTSQG LKICQLP+ S+
Sbjct: 946  SGSRPLWFMVFRERIRVHPQLCDGSIVAFTVLHNINCNHGLIYVTSQGFLKICQLPAVSS 1005

Query: 989  YDNYWPVQKV 998
            YDNYWPVQK+
Sbjct: 1006 YDNYWPVQKI 1015


>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1447

 Score = 1609 bits (4167), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 775/1026 (75%), Positives = 880/1026 (85%), Gaps = 38/1026 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
            MSFAAYKMM  PTGI NC +GF+THSR+D+VP    +Q ++LD+E PS+    +G +PNL
Sbjct: 1    MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAEWPSRPRHHVGSLPNL 56

Query: 59   VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAI 118
            VVTAANV+E+Y VR+QE+  +  K + +++R  L+DGI+ ASLELVCHYRLHGNVE++A+
Sbjct: 57   VVTAANVLEVYAVRLQED--QPPKAAADSRRGALLDGIAGASLELVCHYRLHGNVETMAV 114

Query: 119  LSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA 178
            LS GG D SRRRDSI+L F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE FA
Sbjct: 115  LSIGGGDVSRRRDSIMLTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQFA 174

Query: 179  RGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
            RGP+VKVDPQGRCGGVL+Y LQMIILKA+Q GSGLVG++D  GS G  +ARIESS++INL
Sbjct: 175  RGPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMINL 234

Query: 239  RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
            RDLDM+HVKDF FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI
Sbjct: 235  RDLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 294

Query: 299  WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
            WSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALN+YAV+LDSSQE+PRS
Sbjct: 295  WSAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPRS 354

Query: 359  SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            SF+VELDAA+ATWL +DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+  SVL+S ITTI
Sbjct: 355  SFNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTI 414

Query: 419  GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
            GNSLFFL SRLGDS+LVQF+CGSG SMLSS LKEE GDIEADAPS KRLRRS SDALQDM
Sbjct: 415  GNSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPS-KRLRRSPSDALQDM 473

Query: 479  VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN 538
            V+GEELSLYGSA N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQSN
Sbjct: 474  VSGEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSN 533

Query: 539  YEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
            YEL                          VELPGCKGIWTVYHKS+R HNADSS+MA  D
Sbjct: 534  YELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDD 593

Query: 573  DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            DEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGR RVIQV+ERGAR
Sbjct: 594  DEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGAR 653

Query: 633  ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
            ILDGS+MTQD+SFG SN ESGS S+++  LSVSIADP+VLL MSDGSIRLL+GDPSTCT+
Sbjct: 654  ILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCTI 713

Query: 693  SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
            SV +PA+ ESSK  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE IDG DG   D GDI
Sbjct: 714  SVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGETIDGTDGAAQDHGDI 773

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            Y VVC+++G LEIFDVPNFNCVF+V+ F+SG++H+VD  M+E LKDS+       +    
Sbjct: 774  YCVVCFDNGNLEIFDVPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVIN 830

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QGRKENI  MKVVELAMQRWS  HSRPFLF IL+DGTILCY AYL+E P++TSK +D  S
Sbjct: 831  QGRKENIPDMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDSTSKVEDSAS 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
               S+ +S+ + SRLRNLRF R PLDAY RE+T +G PCQ+ITIFKNI  ++GFFLSGSR
Sbjct: 891  AGGSIGLSSTNVSRLRNLRFVRVPLDAYAREDTSNGPPCQQITIFKNIGSYEGFFLSGSR 950

Query: 933  PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
            P W MV RERLRVHPQLCDGSIVAFTVLHNVNCN G IYVTSQG+LKICQLPSGS YD+Y
Sbjct: 951  PAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNQGLIYVTSQGVLKICQLPSGSNYDSY 1010

Query: 993  WPVQKV 998
            WPVQK+
Sbjct: 1011 WPVQKI 1016


>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1449

 Score = 1593 bits (4125), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 777/1027 (75%), Positives = 884/1027 (86%), Gaps = 38/1027 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDS-ELPSK--RGIGPVPN 57
            MSFAAYKMM  PTGI NC +GF+THSR+D+VP    +Q ++LD+ E PS+    +GP+PN
Sbjct: 1    MSFAAYKMMQCPTGIDNCAAGFLTHSRSDFVP----LQPDDLDAAEWPSRPRHHVGPLPN 56

Query: 58   LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            LVVTAANV+E+Y VR+QE+  +    S +++R  L+DGI+ ASLEL CHYRLHGNVE++A
Sbjct: 57   LVVTAANVLEVYAVRLQED-QQPKDASDDSRRGTLLDGIAGASLELECHYRLHGNVETMA 115

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +LS GG D SR+RDSIIL F DAKISVLE+DDSIHGLR +S+HCFE PEWLHLKRGRE F
Sbjct: 116  VLSIGGGDVSRKRDSIILTFADAKISVLEYDDSIHGLRTSSLHCFEGPEWLHLKRGREQF 175

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
            ARGP+VK+DPQGRCGGVL+Y LQMIILKA+Q GSGLVGD+D FGS G  +ARIESS++IN
Sbjct: 176  ARGPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMIN 235

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDLDM+HVKDF FV+GYIEPVMVILHERELTWAGRVSW HHTCMISALSISTTLKQHPL
Sbjct: 236  LRDLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHPL 295

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+GANTIHYHSQSASCALALNNYAV+LDSSQE+PR
Sbjct: 296  IWSAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIPR 355

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            SSF+VELDAA+ATWL +DVALLSTKTG+L+LL +VYDGRVVQRLDLSK+  SVL+S ITT
Sbjct: 356  SSFNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGITT 415

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            IGNSLFFL SRLGDS+LVQF+CGSG SM+SS LKEE GDIE DAPS KRLRRS SDALQD
Sbjct: 416  IGNSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPS-KRLRRSPSDALQD 474

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
            MV+GEELSLYGSA+N TESAQK+FSFAVRDSL+N+GPLKDFSYGLRINADA+ATGI+KQS
Sbjct: 475  MVSGEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQS 534

Query: 538  NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYEL                          VELPGCKGIWTVYHKS+R HNADSS+MA  
Sbjct: 535  NYELVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADD 594

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            DDEYHAYLIISLEARTMVLETADLL+EVTESVDY+VQG+T+AAGNLFGRRRVIQV+ERGA
Sbjct: 595  DDEYHAYLIISLEARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRRRVIQVYERGA 654

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+MTQD+SFG SNSESGS SE++  LSVSIADP+VLL MSDGSIRLL+GDPSTCT
Sbjct: 655  RILDGSFMTQDVSFGASNSESGSASESAIALSVSIADPFVLLRMSDGSIRLLIGDPSTCT 714

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            +SV +PA+ ESSK  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG DG   D GD
Sbjct: 715  ISVTSPASFESSKGSVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGTDGAAQDHGD 774

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY VVC+++G LEIFD+PNFNCVF+V+ F+SG++H+VD  M+E LKDS+       +   
Sbjct: 775  IYCVVCFDNGNLEIFDIPNFNCVFSVENFMSGKSHLVDALMKEVLKDSK---QGDRDGVV 831

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
             QGRK+NI +MKVVELAMQRWS  HSRPFLF IL+DGTILCY AYL+E P+ TSK +D  
Sbjct: 832  NQGRKDNIPNMKVVELAMQRWSGQHSRPFLFGILSDGTILCYHAYLYESPDGTSKVEDSA 891

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S   S+ +S+ + SRLRNLRF R PLDAY RE+T +G+PCQ+ITIFKNI  +QGFFLSGS
Sbjct: 892  SAGGSIGLSSTNVSRLRNLRFVRVPLDAYPREDTSNGSPCQQITIFKNIGSYQGFFLSGS 951

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP W MV RERLRVHPQLCDGSIVAFTVLHNVNCNHG IYVTSQG+LKICQLPSGS YD+
Sbjct: 952  RPAWVMVLRERLRVHPQLCDGSIVAFTVLHNVNCNHGLIYVTSQGVLKICQLPSGSNYDS 1011

Query: 992  YWPVQKV 998
            YWPVQK+
Sbjct: 1012 YWPVQKI 1018


>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
 gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
          Length = 1455

 Score = 1581 bits (4093), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 777/1033 (75%), Positives = 862/1033 (83%), Gaps = 44/1033 (4%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKR----GIGPVP 56
            MS+AAYKMMHWPT I  C SGF+THSR++    +P + T++LDS+ PS+R    GIGP P
Sbjct: 1    MSYAAYKMMHWPTTIDTCVSGFVTHSRSESA-HLPQLHTDDLDSDWPSRRRHGGGIGPTP 59

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V + NV+E+YVVRVQEEG++   +SGE KR  +MDG++ ASLELVCHYRLHGNVES+
Sbjct: 60   NLIVASGNVLELYVVRVQEEGAR---SSGELKRGGVMDGVAGASLELVCHYRLHGNVESM 116

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +LS  G D+SRRRDSIILAF+DAKISVLEFDDSIHGLR +SMHCFE P+W HLKRGRES
Sbjct: 117  GVLSVEGGDDSRRRDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRES 176

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
            FARGPLVKVDPQGRCGGVLVY LQMIILKA+Q GS LV DED FGSG   SA I SS++I
Sbjct: 177  FARGPLVKVDPQGRCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYII 236

Query: 237  NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
            NLRDLDMKHVKDFIFVH YIEPV+V+LHERELTWAGRV WKHHTCMISALSISTTLKQ  
Sbjct: 237  NLRDLDMKHVKDFIFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQPT 296

Query: 297  LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
            LIWS  NLPHDAYKLLAVPSPIGGVLV+G NTIHYHS+SASCALALN+YA S+DSSQELP
Sbjct: 297  LIWSIGNLPHDAYKLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQELP 356

Query: 357  RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
            R++FSVELDAA+ATWL  DVALLSTKTG+L+LLT+VYDGRVVQRLDLSK+  SVLTSDIT
Sbjct: 357  RATFSVELDAANATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDIT 416

Query: 417  TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ 476
            T+GNS FFLGSRLGDSLLVQFT G G+SMLS GLKEE GDIE D PS KRL+ SSSDALQ
Sbjct: 417  TLGNSFFFLGSRLGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDLPSAKRLKVSSSDALQ 476

Query: 477  DMVNGEELSLYGSASNNTESAQ-----KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
            DMV+GEELSLY SA NN ES+Q     KTFSF VRDSL+N+GPLKDF+YGLRINADA+AT
Sbjct: 477  DMVSGEELSLYSSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANAT 536

Query: 532  GISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADS 565
            GISKQSNYEL                          VELPGCKGIWTVYHK++R H+ DS
Sbjct: 537  GISKQSNYELVCCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARIHSVDS 596

Query: 566  SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
             +MA+ DDEYHAYLIIS+EARTMVLETAD LTEVTESVDYFVQGRTIAAGNLFGRRRV+Q
Sbjct: 597  LKMAS-DDEYHAYLIISMEARTMVLETADHLTEVTESVDYFVQGRTIAAGNLFGRRRVVQ 655

Query: 626  VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
            VFERGARILDGS+MTQDLSFG SNSE+G  SE+STV+ VSI DPYVL+ M+DGSI++LVG
Sbjct: 656  VFERGARILDGSFMTQDLSFGGSNSETGR-SESSTVMHVSIVDPYVLVRMADGSIQILVG 714

Query: 686  DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
            DPS CTVSV TP+A +SS K VS+CTLYHDKGPEPWLRKTSTDAWLSTG+ EAIDGAD G
Sbjct: 715  DPSACTVSVNTPSAFQSSTKSVSACTLYHDKGPEPWLRKTSTDAWLSTGISEAIDGADSG 774

Query: 746  PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
              +QGDIY VVCYE+GALEIFDVPNFN VF VDKFVSG+TH++DT   E  KD    +  
Sbjct: 775  AHEQGDIYCVVCYETGALEIFDVPNFNSVFFVDKFVSGKTHLLDTCTGEPAKDM---MKG 831

Query: 806  SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
              EE  G GRKE+  +MKVVEL M RWS  HSRPFLF ILTDGTILCY AYLFEGP+ TS
Sbjct: 832  VKEEVAGAGRKESTQNMKVVELTMLRWSGRHSRPFLFGILTDGTILCYHAYLFEGPDGTS 891

Query: 866  KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
            K +D VS   S+  S +SASRLRNLRF R PLD YTREET     CQRIT FKNISG+QG
Sbjct: 892  KLEDSVSAQNSVGASTISASRLRNLRFVRVPLDTYTREETSSETSCQRITTFKNISGYQG 951

Query: 926  FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
            FFLSGSRP W MVFRERLRVHPQLCDGSIVAFTVLH VNCNHG IYVTSQG LKIC L S
Sbjct: 952  FFLSGSRPAWFMVFRERLRVHPQLCDGSIVAFTVLHTVNCNHGLIYVTSQGNLKICHLSS 1011

Query: 986  GSTYDNYWPVQKV 998
             S+YDNYWPVQK+
Sbjct: 1012 VSSYDNYWPVQKI 1024


>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
 gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
            kDa subunit
 gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
          Length = 1442

 Score = 1514 bits (3920), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 742/1027 (72%), Positives = 861/1027 (83%), Gaps = 38/1027 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
            MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TAAN++E+Y+VR QEEG ++E +N    KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR  +SD  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
             +  EELSL+GS  NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 538  NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYEL                          VELPGCKGIWTVYHKSSRGHNADSS+MAA 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600  EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D  DGGP DQGD
Sbjct: 720  VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D  + E     E E+N +SE+ T
Sbjct: 780  IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
                 + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836  S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S+    ++++  +S+LRNL+F R PLD  TRE T  G   QRIT+FKNISGHQGFFLSGS
Sbjct: 892  SSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011

Query: 992  YWPVQKV 998
            YWPVQK+
Sbjct: 1012 YWPVQKI 1018


>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
            subunit [Arabidopsis thaliana]
          Length = 1442

 Score = 1512 bits (3914), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 741/1027 (72%), Positives = 860/1027 (83%), Gaps = 38/1027 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
            MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TAAN++E+Y+VR QEEG ++E +N    KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR  +SD  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
             +  EELSL+GS  +N++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480  TIGNEELSLFGSTPDNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 538  NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYEL                          VELPGCKGIWTVYHKSSRGHNADSS+MAA 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600  EDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            VS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D  DGGP DQGD
Sbjct: 720  VSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGGPQDQGD 779

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D  + E     E E+N +SE+ T
Sbjct: 780  IYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSEDNT 835

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
                 + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T K+++ +
Sbjct: 836  S---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST-KAENSL 891

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S     ++++  +S+LRNL+F R PLD  TRE T  G   QRIT+FKNISGHQGFFLSGS
Sbjct: 892  SPENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 951

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS S YDN
Sbjct: 952  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPSASIYDN 1011

Query: 992  YWPVQKV 998
            YWPVQK+
Sbjct: 1012 YWPVQKI 1018


>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
            thaliana]
          Length = 1448

 Score = 1507 bits (3902), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 742/1033 (71%), Positives = 861/1033 (83%), Gaps = 44/1033 (4%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
            MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TAAN++E+Y+VR QEEG ++E +N    KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct: 61   VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR  +SD  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRM-TSDTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQ------KTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
             +  EELSL+GS  NN++SAQ      K+FSFAVRDSLVN+GP+KDF+YGLRINADA+AT
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQVTSSVLKSFSFAVRDSLVNVGPVKDFAYGLRINADANAT 539

Query: 532  GISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGHNADS 565
            G+SKQSNYEL                          VELPGCKGIWTVYHKSSRGHNADS
Sbjct: 540  GVSKQSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADS 599

Query: 566  SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
            S+MAA +DEYHAYLIISLEARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQ
Sbjct: 600  SKMAADEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQ 659

Query: 626  VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
            VFE GARILDGS+M Q+LSFG SNSES SGSE+STV SVSIADPYVLL M+D SIRLLVG
Sbjct: 660  VFEHGARILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVG 719

Query: 686  DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
            DPSTCTVS+ +P+ +E SK+ +S+CTLYHDKGPEPWLRK STDAWLS+GVGEA+D  DGG
Sbjct: 720  DPSTCTVSISSPSVLEGSKRKISACTLYHDKGPEPWLRKASTDAWLSSGVGEAVDSVDGG 779

Query: 746  PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
            P DQGDIY VVCYESGALEIFDVP+FNCVF+VDKF SGR H+ D  + E     E E+N 
Sbjct: 780  PQDQGDIYCVVCYESGALEIFDVPSFNCVFSVDKFASGRRHLSDMPIHEL----EYELNK 835

Query: 806  SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
            +SE+ T     + I + +VVELAMQRWS HH+RPFLFA+L DGTILCY AYLF+G ++T 
Sbjct: 836  NSEDNTS---SKEIKNTRVVELAMQRWSGHHTRPFLFAVLADGTILCYHAYLFDGVDST- 891

Query: 866  KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQG 925
            K+++ +S+    ++++  +S+LRNL+F R PLD  TRE T  G   QRIT+FKNISGHQG
Sbjct: 892  KAENSLSSENPAALNSSGSSKLRNLKFLRIPLDTSTREGTSDGVASQRITMFKNISGHQG 951

Query: 926  FFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
            FFLSGSRP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVT+QG+LKICQLPS
Sbjct: 952  FFLSGSRPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTAQGVLKICQLPS 1011

Query: 986  GSTYDNYWPVQKV 998
             S YDNYWPVQK+
Sbjct: 1012 ASIYDNYWPVQKI 1024


>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1444

 Score = 1507 bits (3902), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 741/1027 (72%), Positives = 857/1027 (83%), Gaps = 36/1027 (3%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQ-TEELDSELPS-KRGIGPVPNL 58
            MSFAA+KMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct: 1    MSFAAFKMMHWPTGVENCASGYITHSLSDSTLQIPIVSGDDDMEAEWPNHKRGIGPLPNV 60

Query: 59   VVTAANVIEIYVVRVQEEG-SKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
            V+TA N++E+Y+VR QEEG ++E +     KR  +MDG+S  SLELVCHYRLHGNVES+A
Sbjct: 61   VITAGNILEVYIVRAQEEGNTQELRIPKLVKRGGVMDGVSGVSLELVCHYRLHGNVESIA 120

Query: 118  ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121  VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query: 178  ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             RGPLVKVDPQGRCGGVLVYGLQMIILKASQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct: 181  PRGPLVKVDPQGRCGGVLVYGLQMIILKASQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query: 238  LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI+TTLKQHP+
Sbjct: 241  LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINTTLKQHPV 300

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct: 301  IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query: 358  SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            S+FSVELDAAH TW+ +DVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct: 361  SNFSVELDAAHGTWISSDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query: 418  IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
            +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRL R SSD  QD
Sbjct: 421  VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRL-RISSDTFQD 479

Query: 478  MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
             +  EELSL+GS  NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480  TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 538  NYEL--------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            NYEL                          VELPGCKGIWTVYHKSSRGHNADSS+MAA 
Sbjct: 540  NYELVCCSGHGKNGALCVLRQSVRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 572  DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +DEYHAYLIIS+EARTMVLETADLLTEVTESVDY+VQGRTIAAGNLFGRRRVIQVFE GA
Sbjct: 600  EDEYHAYLIISVEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFGRRRVIQVFEHGA 659

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
            RILDGS+M Q+LSFG  NSES SGSE+STV SVSIADPYVLL M+D SIRLLVGDPSTCT
Sbjct: 660  RILDGSFMNQELSFGAPNSESNSGSESSTVSSVSIADPYVLLRMTDDSIRLLVGDPSTCT 719

Query: 692  VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
            VS+ +P+ +E SKK +S+CTL+HDKGPEPWLRK STDAWLS+GVGEA+D ADGGP DQGD
Sbjct: 720  VSISSPSVLEGSKKKISACTLFHDKGPEPWLRKASTDAWLSSGVGEAVDSADGGPQDQGD 779

Query: 752  IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            IY V+CYESGALEIFDVP FNCVF+VDKF SGR H+ D  + E     E E+N +SE+  
Sbjct: 780  IYCVLCYESGALEIFDVPGFNCVFSVDKFASGRRHLSDMPIHEL----EYELNKNSED-N 834

Query: 812  GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
               R E I + KVVEL+MQRWS  H+RPFLFA+L DGTILCY AYLFEG ++T K+++ V
Sbjct: 835  ASSRNEEIKNTKVVELSMQRWSGPHTRPFLFAVLADGTILCYHAYLFEGVDST-KAENSV 893

Query: 872  STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            S+    ++++  +S+LRNL+F R P D  TRE T  G   QRIT+FKNISGHQGFFLSGS
Sbjct: 894  SSENPAALNSSGSSKLRNLKFLRIPFDTSTREGTSDGVASQRITMFKNISGHQGFFLSGS 953

Query: 932  RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            RP WCM+FRERLR H QLCDGSI AFTVLHNVNCNHGFIYVTSQ +LKICQLPS S YDN
Sbjct: 954  RPGWCMLFRERLRFHSQLCDGSIAAFTVLHNVNCNHGFIYVTSQVVLKICQLPSASIYDN 1013

Query: 992  YWPVQKV 998
            YWPVQK+
Sbjct: 1014 YWPVQKI 1020


>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Cucumis sativus]
          Length = 1504

 Score = 1477 bits (3825), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 747/1076 (69%), Positives = 844/1076 (78%), Gaps = 81/1076 (7%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MSFAAY+MMHWPTGI NC S +ITHSRAD+VP +    +++LDS+   +R IGPVPNLVV
Sbjct: 1    MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVT-SHSDDLDSDWHPRRDIGPVPNLVV 59

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
            TA NV+E+YVVRV EEG +ESK+SGE +R  +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60   TAGNVLEVYVVRVLEEGGRESKSSGEVRRGGIMDGVSGASLELVCHYRLHGNVESMAILS 119

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
              G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120  SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
            P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G  SAR+ESS++INLRD
Sbjct: 180  PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239

Query: 241  LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            LD+KHVKDF+FVHGYIEPVMVILHE+ELTWAGRVSWKHHTCM+SALSISTTLKQHPLIWS
Sbjct: 240  LDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALSISTTLKQHPLIWS 299

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            A NLPHDAYKLLAVPSPIGGVLV+ AN+IHY+SQSASC LALNNYAVS DSSQ++PRS+F
Sbjct: 300  ASNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAVSADSSQDMPRSNF 359

Query: 361  SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
            +VELDAA+ATWL NDVALLSTKTG+L+LL +VYDGRVVQRLDLSK+  SVLTS I +IGN
Sbjct: 360  NVELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSKASVLTSGIASIGN 419

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-------------------------- 454
            SLFFLGSRLGDSLLVQF+CG G+S L+S LK+E                           
Sbjct: 420  SLFFLGSRLGDSLLVQFSCGVGSSGLASNLKDEITYYTQNLQKEMVPPTLPSALVHESKP 479

Query: 455  ----GDIEADAPS----------------------TKRLRRSSSDALQDMVNGEELSLYG 488
                G IE +  +                        R+ R         V G+ELSLYG
Sbjct: 480  TQAKGTIELNNNNLCVENDIVDVVEVDITNMTILGENRIARRDETLTDTQVGGDELSLYG 539

Query: 489  SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------ 542
            SA+NNTESAQK FSFAVRDSL+NIGPLKDFSYGLRINAD +ATGI+KQSNYELV      
Sbjct: 540  SAANNTESAQKIFSFAVRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHG 599

Query: 543  --------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
                                ELPGCKGIWTVYHK++RG  ADSSRM   DDEYHAYLIIS
Sbjct: 600  KNGALCILRQSIRPEMITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIIS 659

Query: 583  LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
            LEARTMVL T +LLTEVTESVDYFV GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQD
Sbjct: 660  LEARTMVLVTGELLTEVTESVDYFVHGRTIAAGNLFGRRRVIQVYESGARILDGSFMTQD 719

Query: 643  LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
            L+   + +ESG+ SE  TVLS SI+DPYVLL M+DGSIRLLVGD S+C+VSV  PAA  S
Sbjct: 720  LNLVVNGNESGNASEGCTVLSASISDPYVLLTMTDGSIRLLVGDSSSCSVSVSAPAAFGS 779

Query: 703  SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
            SKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG   DQGDIY V CY++G 
Sbjct: 780  SKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVACYDNGD 839

Query: 763  LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
            LEIFDVPNF  VF VDKFVSG++H+VD  + +  K SE + N  S+E    GR E+  +M
Sbjct: 840  LEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQN--SQELISHGRNESSQNM 897

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
            KV+E+AMQRWS  HSRPFLF ILTDGTILCY AYLFE  ++ SK DD VS   S+S SN+
Sbjct: 898  KVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNSVSSSNM 957

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
            S+SRLRNLRF R PLD   RE+ P+G    R++IFKNISG+QG FL GSRP W MVFRER
Sbjct: 958  SSSRLRNLRFLRVPLDIQGREDMPNGTLSCRLSIFKNISGYQGLFLCGSRPAWFMVFRER 1017

Query: 943  LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            LRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQKV
Sbjct: 1018 LRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQKV 1073


>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
          Length = 1503

 Score = 1199 bits (3101), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 612/1039 (58%), Positives = 746/1039 (71%), Gaps = 64/1039 (6%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
            MS+AAYKMMHWPTG+ +C +GF+THS +D                ++DS   + R   +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 54   PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
            P PNLVV AANV+E+Y VR +        G++ S +SG      ++DGIS A LELVC+Y
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RLHGN+ES+ +LS G A+N  RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116  RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
             HLKRGRESFA GP++K DP GRCG  L YGLQMIILKA+Q G  LVG+++   +    +
Sbjct: 173  QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
             RIESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+  KHHTCMISA S
Sbjct: 233  VRIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++ 
Sbjct: 293  ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
              D S E+ +S+F VELDAA ATW  ND+ + S+K G+++LLTVVYDGRVVQRLDL K+ 
Sbjct: 353  HPDGSPEISKSNFQVELDAAKATWFSNDIVMFSSKAGEMLLLTVVYDGRVVQRLDLMKSK 412

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S +T+IGNS FFLGSRLGDSLLVQF+ G+  S+L     E   DIE D P +KRL
Sbjct: 413  ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYGASKSVLQDLTNERSADIEGDLPFSKRL 472

Query: 468  RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
            +R  SD LQD+ + EELS     A N+ ESAQK  S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473  KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531

Query: 527  DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
            D +A G +KQSNYEL                          VELP C+GIWTVY+KS RG
Sbjct: 532  DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591

Query: 561  HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              A+       D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ  TIAAGNLFGR
Sbjct: 592  QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
            RRVIQV+ +GAR+LDGS+MTQ+L+F  +++   S SE   V   SIADPYVLL M DGS+
Sbjct: 645  RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LL+GD  TCT+SV  P+   SS + +++CTLY D+GPEPWLRKT +DAWLSTG+ EAID
Sbjct: 704  QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLRKTRSDAWLSTGIAEAID 763

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
            G      DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG   +VD + +   +DS 
Sbjct: 764  GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             E    ++      +KE   S+++VELAM RWS   SRPFLF +L DGT+LCY A+ +E 
Sbjct: 824  KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
             E+  K   P+S   S    N S SRLRNLRF R  +D  +RE+ P  G P  RIT F N
Sbjct: 881  SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938  VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997

Query: 980  ICQLPSGSTYDNYWPVQKV 998
            ICQLPS   YDNYWPVQKV
Sbjct: 998  ICQLPSAYNYDNYWPVQKV 1016


>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
 gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
          Length = 1441

 Score = 1191 bits (3081), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 610/1039 (58%), Positives = 744/1039 (71%), Gaps = 64/1039 (6%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
            MS+AAYKMMHWPTG+ +C +GF+THS +D                ++DS   + R   +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 54   PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
            P PNLVV AANV+E+Y VR +        G++ S +SG      ++DGIS A LELVC+Y
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RLHGN+ES+ +LS G A+N  RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116  RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
             HLKRGRESFA GP++K DP GRCG  L YGLQMIILKA+Q G  LVG+++   +    +
Sbjct: 173  QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+  KHHTCMISA S
Sbjct: 233  VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++ 
Sbjct: 293  ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
              D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+ 
Sbjct: 353  HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S +T+IGNS FFLGSRLGDSLLVQF+  +  S+L     E   DIE D P +KRL
Sbjct: 413  ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472

Query: 468  RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
            +R  SD LQD+ + EELS     A N+ ESAQK  S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473  KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531

Query: 527  DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
            D +A G +KQSNYEL                          VELP C+GIWTVY+KS RG
Sbjct: 532  DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591

Query: 561  HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              A+       D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ  TIAAGNLFGR
Sbjct: 592  QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
            RRVIQV+ +GAR+LDGS+MTQ+L+F  +++   S SE   V   SIADPYVLL M DGS+
Sbjct: 645  RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LL+GD  TCT+SV  P+   SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704  QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
            G      DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG   +VD + +   +DS 
Sbjct: 764  GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             E    ++      +KE   S+++VELAM RWS   SRPFLF +L DGT+LCY A+ +E 
Sbjct: 824  KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
             E+  K   P+S   S    N S SRLRNLRF R  +D  +RE+ P  G P  RIT F N
Sbjct: 881  SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938  VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997

Query: 980  ICQLPSGSTYDNYWPVQKV 998
            ICQLPS   YD+YWPVQKV
Sbjct: 998  ICQLPSAYNYDSYWPVQKV 1016


>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
          Length = 1441

 Score = 1191 bits (3080), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 610/1039 (58%), Positives = 744/1039 (71%), Gaps = 64/1039 (6%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-----ELDSELPSKRG--IG 53
            MS+AAYKMMHWPTG+ +C +GF+THS +D                ++DS   + R   +G
Sbjct: 1    MSYAAYKMMHWPTGVDHCAAGFVTHSPSDAAAFFTAATVGPGPEGDIDSAAAASRPRRLG 60

Query: 54   PVPNLVVTAANVIEIYVVRVQEE------GSKESKNSGETKRRVLMDGISAASLELVCHY 107
            P PNLVV AANV+E+Y VR +        G++ S +SG      ++DGIS A LELVC+Y
Sbjct: 61   PSPNLVVAAANVLEVYAVRAETAAEDGGGGTQPSSSSG-----AVLDGISGARLELVCYY 115

Query: 108  RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            RLHGN+ES+ +LS G A+N  RR +I LAF+DAKI+ LEFDD+IHGLR +SMHCFE PEW
Sbjct: 116  RLHGNIESMTVLSDG-AEN--RRATIALAFKDAKITCLEFDDAIHGLRTSSMHCFEGPEW 172

Query: 168  LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
             HLKRGRESFA GP++K DP GRCG  L YGLQMIILKA+Q G  LVG+++   +    +
Sbjct: 173  QHLKRGRESFAWGPVIKADPLGRCGAALAYGLQMIILKAAQVGHSLVGEDEPTCALSSTA 232

Query: 228  ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              IESS++I+LR LDM HVKDF FVHGYIEPV+VILHE+E TWAGR+  KHHTCMISA S
Sbjct: 233  VCIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFS 292

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            IS TLKQHP+IWSA NLPHDAY+LLAVP PI GVLV+ AN+IHYHSQS SC+L LNN++ 
Sbjct: 293  ISMTLKQHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSS 352

Query: 348  SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
              D S E+ +S+F VELDAA ATWL ND+ + STK G+++LLTVVYDGRVVQRLDL K+ 
Sbjct: 353  HPDGSPEISKSNFQVELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSK 412

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVL+S +T+IGNS FFLGSRLGDSLLVQF+  +  S+L     E   DIE D P +KRL
Sbjct: 413  ASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRL 472

Query: 468  RRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
            +R  SD LQD+ + EELS     A N+ ESAQK  S+ VRD+L+N+GPLKDFSYGLR NA
Sbjct: 473  KRIPSDVLQDVTSVEELSFQNIIAPNSLESAQK-ISYIVRDALINVGPLKDFSYGLRANA 531

Query: 527  DASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRG 560
            D +A G +KQSNYEL                          VELP C+GIWTVY+KS RG
Sbjct: 532  DPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRG 591

Query: 561  HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              A+       D+EYHAYLIISLE RTMVLET D L EVTE+VDYFVQ  TIAAGNLFGR
Sbjct: 592  QMAE-------DNEYHAYLIISLENRTMVLETGDDLGEVTETVDYFVQASTIAAGNLFGR 644

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
            RRVIQV+ +GAR+LDGS+MTQ+L+F  +++   S SE   V   SIADPYVLL M DGS+
Sbjct: 645  RRVIQVYGKGARVLDGSFMTQELNF-TTHASESSSSEALGVACASIADPYVLLKMVDGSV 703

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LL+GD  TCT+SV  P+   SS + +++CTLY D+GPEPWL KT +DAWLSTG+ EAID
Sbjct: 704  QLLIGDYCTCTLSVNAPSIFISSSERIAACTLYRDRGPEPWLTKTRSDAWLSTGIAEAID 763

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
            G      DQ DIY ++CYESG LEIF+VP+F CVF+V+ F+SG   +VD + +   +DS 
Sbjct: 764  GNGTSSHDQSDIYCIICYESGKLEIFEVPSFRCVFSVENFISGEALLVDKFSQLIYEDST 823

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             E    ++      +KE   S+++VELAM RWS   SRPFLF +L DGT+LCY A+ +E 
Sbjct: 824  KERYDCTKASL---KKEAGDSIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAFSYEA 880

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-GAPCQRITIFKN 919
             E+  K   P+S   S    N S SRLRNLRF R  +D  +RE+ P  G P  RIT F N
Sbjct: 881  SESNVKR-VPLSPQGSADHHNASDSRLRNLRFHRVSIDITSREDIPTLGRP--RITTFNN 937

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            + G++G FLSG+RP W MV R+RLRVHPQLCDG I AFTVLHNVNC+HGFIYVTSQG LK
Sbjct: 938  VGGYEGLFLSGTRPAWVMVCRQRLRVHPQLCDGPIEAFTVLHNVNCSHGFIYVTSQGFLK 997

Query: 980  ICQLPSGSTYDNYWPVQKV 998
            ICQLPS   YD+YWPVQKV
Sbjct: 998  ICQLPSAYNYDSYWPVQKV 1016


>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
            subunit 1-like [Brachypodium distachyon]
          Length = 1442

 Score = 1181 bits (3055), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 611/1038 (58%), Positives = 744/1038 (71%), Gaps = 61/1038 (5%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--ELDSELPSK----RGIGP 54
            MS+AAYKMMHWPTGI +C +GFITH  +D             E D  L +     + +GP
Sbjct: 1    MSYAAYKMMHWPTGIDHCAAGFITHCPSDAAAFCSAAAASGPEGDVGLVAAARHPKRLGP 60

Query: 55   VPNLVVTAANVIEIYVVRVQEEGS------KESKNSGETKRRVLMDGISAASLELVCHYR 108
             PNLVV AANV+E+Y VR     +      + S +SG      + DGIS A LELVCHYR
Sbjct: 61   TPNLVVAAANVLEVYAVRADAAAADGAGGAQPSSSSG-----AVFDGISGARLELVCHYR 115

Query: 109  LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
            LHGN+ES+AILS G A+N  RRDSI LAF DAKI+ LEFDD+IHGLR +SMHCFE PEW 
Sbjct: 116  LHGNIESMAILSDG-AEN--RRDSIALAFRDAKITCLEFDDAIHGLRTSSMHCFEGPEWQ 172

Query: 169  HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
            HLKRGRESFA GP++K DP GRCG  LVYGLQMIILK++Q G  LVG+++   +    + 
Sbjct: 173  HLKRGRESFAWGPVIKSDPLGRCGAALVYGLQMIILKSAQVGQSLVGEDEPTRALSSAAV 232

Query: 229  RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
            RIESS++I+LR LD  HVKDF FVHGYIEPV+VILHERE TWAGR+S KHHTCMISA SI
Sbjct: 233  RIESSYLIDLRALDTNHVKDFTFVHGYIEPVLVILHEREPTWAGRISSKHHTCMISAFSI 292

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
            S TLKQHP+IWSA N+PHDAY++L+VP PI GVLV+ AN+IHYHSQS SC+LALNN+A  
Sbjct: 293  SMTLKQHPMIWSAANIPHDAYQILSVPPPISGVLVICANSIHYHSQSTSCSLALNNFASQ 352

Query: 349  LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
             D S E+ + +F VELDAA ATWL ND+ + S KTG+++LLTVVYDGR VQ+LDL K+  
Sbjct: 353  PDGSPEIHKVNFHVELDAAKATWLSNDIVMFSAKTGEMLLLTVVYDGRTVQKLDLMKSKA 412

Query: 409  SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
            SV++S +TTIG+S FFLGSR+GDSLLVQF+CG  TS++     E   DIE D P +KRL+
Sbjct: 413  SVISSGVTTIGSSFFFLGSRVGDSLLVQFSCGVPTSVIPDIADERSADIEGDLPFSKRLK 472

Query: 469  RSSSDALQDMVNGEELSLYGSA-SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
            R  SD LQD+ + EELS   +   N+ ESAQK  S+ VRD+LVN+GPLKDFSYGLR+NAD
Sbjct: 473  RVPSDILQDVTSVEELSFQNNMLPNSLESAQK-ISYVVRDALVNVGPLKDFSYGLRVNAD 531

Query: 528  ASATGISKQSNYEL--------------------------VELPGCKGIWTVYHKSSRGH 561
             +ATG +KQSNYEL                          VELP C+GIWTVY+KSSRGH
Sbjct: 532  PNATGNAKQSNYELVCCSGHGKNGALSVLQQSIRPDLITEVELPSCRGIWTVYYKSSRGH 591

Query: 562  NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
              +       D+EYHAYLIISLE+RTMVLET D L EVTE+VDY+VQG TI AGNLFGRR
Sbjct: 592  TTE-------DNEYHAYLIISLESRTMVLETGDDLGEVTETVDYYVQGATITAGNLFGRR 644

Query: 622  RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST-VLSVSIADPYVLLGMSDGSI 680
            RVIQV+  GAR+LDGS+MTQ+L+F   +SES S       V S SIADPYVLL M DG+I
Sbjct: 645  RVIQVYATGARVLDGSFMTQELNFTALSSESSSSGSEPLGVASASIADPYVLLKMVDGTI 704

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
            +LLVGD STC +S+  P+ + S  + +S+CTLYHD+GPEPWLRKT  DAWLS+GV  A+D
Sbjct: 705  QLLVGDHSTCALSINAPSTLTSRGERISACTLYHDRGPEPWLRKTRGDAWLSSGVTVAVD 764

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
             +     DQ DIY ++CYESG LEIF+VP+F  VF+V  F SG + +VD + +   +DS 
Sbjct: 765  VSGSSSQDQSDIYCIICYESGKLEIFEVPSFRQVFSVGSFFSGESLLVDAFAQGFTEDSA 824

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
                   +E     +KE  +++++VELAM RWS   SRPFLF +L DGT+LCYQAY +EG
Sbjct: 825  ---EGRQDETKVSLKKEVANNIRIVELAMHRWSGQFSRPFLFGLLNDGTLLCYQAYCYEG 881

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
             E+  K    +S   S+ + N S SRL+NLRF R  +D  +RE+    A   RITIF N+
Sbjct: 882  LESNIKGTS-LSPDGSVDLGNASDSRLKNLRFHRVSVDITSREDISSLAR-PRITIFNNV 939

Query: 921  SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
             G++G FLSG+RP W MV R+R RVHPQLCDG I AFTVLHNVNC+HG IYVTSQG LKI
Sbjct: 940  GGYEGLFLSGTRPVWVMVCRQRFRVHPQLCDGPIEAFTVLHNVNCSHGLIYVTSQGFLKI 999

Query: 981  CQLPSGSTYDNYWPVQKV 998
            CQLPS   YDNYWPVQK+
Sbjct: 1000 CQLPSAYNYDNYWPVQKI 1017


>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1452

 Score =  981 bits (2536), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 528/1061 (49%), Positives = 685/1061 (64%), Gaps = 93/1061 (8%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADY-VPQIPLIQTEELDSELPSKRGIGPVPNLV 59
            MS+AA+KM+H PTG+ NC + ++THS  +     IPL       ++L +  G G  PNLV
Sbjct: 1    MSYAAFKMVHCPTGVDNCVAAYVTHSAGETDSDSIPLP-----GADLIASGGSGFPPNLV 55

Query: 60   VTAANVIEIYVVRVQE------EGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
            +T ANV+E++ VR+ E       GS    N   T R  LM G+S   LEL CHYRLHGNV
Sbjct: 56   ITKANVLEVFHVRLLEGDDSAANGSNGVGNPETTPRGGLMAGLSYVKLELACHYRLHGNV 115

Query: 114  ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
            ESL +LS   A+  + RD+IIL F DAKISVLEFDDS HGLRI S+H FE PEW +LKRG
Sbjct: 116  ESLGVLSYRHAEGRKGRDAIILTFRDAKISVLEFDDSTHGLRIGSLHYFEGPEWQYLKRG 175

Query: 174  RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
            RE FA GP V+ DP GRC GVL+Y  Q+++LKA+Q G GL  ++++   GG   A + +S
Sbjct: 176  REQFASGPSVRADPVGRCAGVLIYNSQLVLLKAAQVGYGLGDEDESLIMGGKLCAHVATS 235

Query: 234  HVINLRDLDMKHVKDFIFVHG--------------YIEPVMVILHERELTWAGRVSWKHH 279
            ++++LRDLDMKH+KDF+F+HG              YIEPV+V+LHE++ TWAGRV+ + H
Sbjct: 236  YIVSLRDLDMKHIKDFVFLHGKLLFLIQYIFAFSSYIEPVLVVLHEKDPTWAGRVAVRRH 295

Query: 280  TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCA 339
            TC I+ALSI+TTLKQHP IWSA NLP+DAYKLLAVP+PIGGVLV  AN++HYHSQS SCA
Sbjct: 296  TCAITALSINTTLKQHPHIWSATNLPYDAYKLLAVPAPIGGVLVFCANSLHYHSQSGSCA 355

Query: 340  LALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
            L LN +AV+ + S E PRS  SVELD AHATW+ N+VAL+STK G L+ L +VY+GR VQ
Sbjct: 356  LGLNEFAVAPEGSAEYPRSKMSVELDCAHATWVANEVALISTKNGMLLFLNLVYEGRSVQ 415

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
            RL+L+K+  SVLTS + TIG + FFLGSRL DSLLVQ T GS +   SS +    GDIEA
Sbjct: 416  RLELTKSKASVLTSCMCTIGENFFFLGSRLADSLLVQHTLGSASGRTSSLM----GDIEA 471

Query: 460  D--APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVRDSLVNIGPLK 516
            D  AP+ KRL+R  S+  + +   EE+SLY S    ++ S +KTF+F VRDSLVNI PL+
Sbjct: 472  DLSAPAAKRLKREPSEEEEGVSA-EEMSLYYSTPTASDISQKKTFTFTVRDSLVNICPLR 530

Query: 517  DFSYGLRINADASATGISKQSNYEL--------------------------VELPGCKGI 550
            DF+YGLR NAD SATG+ KQSNYEL                          V LPGC GI
Sbjct: 531  DFAYGLRSNADQSATGLGKQSNYELVACSGHGKNGSLSVLHQSIRPDLINKVALPGCSGI 590

Query: 551  WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
            WTVYHK+ R  + +     + DDE+HAYLIISLE+RTMVLET D L EVTE+V+Y+ +G 
Sbjct: 591  WTVYHKTDRDDSNEFDFGTSEDDEFHAYLIISLESRTMVLETGDTLGEVTENVEYYTEGN 650

Query: 611  TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS-ENSTVLSVSIADP 669
            TIAAGNLFGRR V+QV++ G R+LDG+ M Q+L    S  E+ S    N+ V+   IADP
Sbjct: 651  TIAAGNLFGRRFVVQVYQNGLRLLDGAKMLQELLITNSELENNSSEVANNLVIEAVIADP 710

Query: 670  YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD- 728
            Y+LL M+DGS++L+VGD     +S+  P     +   +++ TLY DKGP  WLR+T ++ 
Sbjct: 711  YMLLKMTDGSLQLVVGDVENTKLSIPQPQGFGITTDAITAFTLYQDKGPHQWLRRTCSEM 770

Query: 729  -----AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSG 783
                  W ST              DQG +Y +VC  SG  EI+++P   CV+ VD F  G
Sbjct: 771  NSDRSQWSSTS-------------DQGYVYCIVCRISGRFEIYELPRMVCVYAVDNFNHG 817

Query: 784  RTHIVDTYMREALKDSETEINSSSEEGTGQGR---KENIHSMKVVELAMQRWSAHHSRPF 840
             + + D  + E   +S   +   +EE    G    ++   S+ V ++  + W     RPF
Sbjct: 818  MSVLWDQKVLERRANSNAALKEGAEEDKAPGDALLRDAGLSLHVSQICFESWGEKFGRPF 877

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            L A L+DGT+LCY A+ ++  E++   +      R  + S    SRL +LRF+R P+D  
Sbjct: 878  LLATLSDGTMLCYHAFSYDANESSDALE-----FRETATSLKDLSRLTHLRFARIPIDWV 932

Query: 901  TREETPHGAPC---QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
            + +E   GA      +   FKN+    G F++G RP W MV R RLR HPQ CDG+I+ F
Sbjct: 933  SGQED--GAKVLYETKFCSFKNVGSFPGVFVTGLRPTWLMVCRGRLRPHPQFCDGAILGF 990

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            T LHNVNC HGFIY+T+QG LKICQLPS   YDN WPVQK+
Sbjct: 991  TPLHNVNCAHGFIYITAQGQLKICQLPSLLFYDNDWPVQKI 1031


>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
 gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
          Length = 1413

 Score =  902 bits (2330), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 496/1051 (47%), Positives = 656/1051 (62%), Gaps = 116/1051 (11%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
           MS+AA K++H PTG++ C S FITHS  +              S   S      +PNLV+
Sbjct: 1   MSYAAIKLVHGPTGVSACASAFITHSPVNPASS----------SGWKSGNAKDSLPNLVL 50

Query: 61  TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
             ANV+EIY VR QE G ++S   GE              KR   M GI+AA LELVC Y
Sbjct: 51  VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109

Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
           RL G V+S+AIL +G  D  R RD+IILAF  AK SVL FDD+   L+ +SMH FE PEW
Sbjct: 110 RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168

Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
           +HLKRGRE F  GPLV+ D QGRC GVL+Y  Q++++KA+Q   GLV ++D   SG   S
Sbjct: 169 IHLKRGREKFPGGPLVRADSQGRCAGVLIYKSQLVMMKAAQEAYGLVEEDDP--SGNIVS 226

Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
           ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++  TC ++ALS
Sbjct: 227 ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
           I+T  K+HP +W    LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N  A 
Sbjct: 287 INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
               S E+PRS FS+ELDAAHATWL  D ALLSTKTG LV L +++DGR VQRL+LSK+ 
Sbjct: 347 PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVL+S + TIG+  FF+GSRLGDSLLVQF   S ++ LS       G+ +     +KR+
Sbjct: 407 GSVLSSSLCTIGDMFFFVGSRLGDSLLVQFGSASTSNSLSQSYD---GEDDIMVRPSKRM 463

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFS-------- 519
           R      L D  N + L  Y SA ++++     F F+VRDSL NIGP++D +        
Sbjct: 464 R------LDDDANEQSLYQYKSAVSDSQK-NMNFLFSVRDSLCNIGPIRDITGRSQNPSE 516

Query: 520 -------------YG----LRINADASATGISKQSNYEL------------VELPGCKGI 550
                        +G    L I + +       Q+N  L            V+LPGC G+
Sbjct: 517 QPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCVGV 576

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTVYH+        S ++ A  DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +G 
Sbjct: 577 WTVYHR--------SGQIPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTEGP 628

Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
           +I+AGNLFGRRR+ QV+++G RILDG+  TQDL  G    E G+  E     S S ADPY
Sbjct: 629 SISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFADPY 679

Query: 671 VLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
           VLL M DGS +L+VGD  T TVSV TP  +  S  P+S+CTLY+D+GP PWLR+ + D W
Sbjct: 680 VLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGDVW 739

Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT 790
            + GV +A         DQGD+Y +VC  SG +E  ++P+  C++ V++   G   + D 
Sbjct: 740 QTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLADN 794

Query: 791 YMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTI 850
             R A K      ++  EEG  + R E +  +KVV++ +  W   + RPF+F +L+DGT+
Sbjct: 795 --RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDGTL 851

Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG-- 908
           L Y+A+++EG ++ + + D  S               RNLRF R  LD    EE  +   
Sbjct: 852 LSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNADE 897

Query: 909 -APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
               Q+I  FK++ G QG FL+G +P W M+FRE++R+HPQ  DG IVAFT LHNVNC H
Sbjct: 898 VRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNCQH 957

Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           G IYVT++  LKIC+L +   YDN WPVQK+
Sbjct: 958 GLIYVTNEASLKICRLSNILNYDNDWPVQKI 988


>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
 gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
          Length = 1413

 Score =  899 bits (2324), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 498/1053 (47%), Positives = 659/1053 (62%), Gaps = 120/1053 (11%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
           MS+AA K++H PTG++ C S FITHS     P  P        S   S      +PNLV+
Sbjct: 1   MSYAAIKLVHGPTGVSACASAFITHS-----PVNP-----ASSSGWKSGNAKDSLPNLVL 50

Query: 61  TAANVIEIYVVRVQEEGSKESKNSGE-------------TKRRVLMDGISAASLELVCHY 107
             ANV+EIY VR QE G ++S   GE              KR   M GI+AA LELVC Y
Sbjct: 51  VKANVLEIYNVRFQE-GDEKSARGGEQLVGSACVAFPASAKRGGFMSGITAAWLELVCQY 109

Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
           RL G V+S+AIL +G  D  R RD+IILAF  AK SVL FDD+   L+ +SMH FE PEW
Sbjct: 110 RLFGIVDSMAILHRG-RDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEW 168

Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS 227
           +HLKRGRE F  GPLV+ D QGRC GVL+Y  Q++++KA+Q   GLV ++D   SG   S
Sbjct: 169 IHLKRGREKFPGGPLVRADSQGRCAGVLIYKCQLVMMKAAQEAYGLVEEDDP--SGNIVS 226

Query: 228 ARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
           ARIESS+V+NL++L M HVKDF+F++GYIEPV+ ILHERELTWAGRV+++  TC ++ALS
Sbjct: 227 ARIESSYVVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALS 286

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
           I+T  K+HP +W    LP+DAY LLAVPSPIGGVLV+ AN+I Y+SQ ++C +A+N  A 
Sbjct: 287 INTNTKKHPRLWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELAT 346

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
               S E+PRS FS+ELDAAHATWL  D ALLSTKTG LV L +++DGR VQRL+LSK+ 
Sbjct: 347 PPAGSLEMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSK 406

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF-GDIEADAPSTKR 466
            SVL+S + TIG+  FF+GSRLGDSLLVQF    G++  S+ L+  + G+ +     +KR
Sbjct: 407 GSVLSSSLCTIGDKFFFVGSRLGDSLLVQF----GSASTSNSLEHSYDGEDDIMVRPSKR 462

Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSY----- 520
           +R      L D  +  E SLY   S  ++S +   F F+VRDSL NIGP++D +      
Sbjct: 463 MR------LDD--DASEQSLYQYKSGVSDSQKNMNFLFSVRDSLCNIGPIRDITCRSQNP 514

Query: 521 --------------------GLRINADASATGISKQSNYEL------------VELPGCK 548
                                L I + +       Q+N  L            V+LPGC 
Sbjct: 515 SEQPGSAQDLIACCGHGKNGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCV 574

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
           G+WTVYH+        S ++ A  DEYHAYLIISLE+RTMVLET + L EVT+SV+Y+ +
Sbjct: 575 GVWTVYHR--------SGQIPAEKDEYHAYLIISLESRTMVLETGETLGEVTDSVEYYTE 626

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
           G +I+AGNLFGRRR+ QV+++G RILDG+  TQDL  G    E G+  E     S S AD
Sbjct: 627 GPSISAGNLFGRRRIAQVYQKGVRILDGARQTQDLQVG----EPGNAIE-----SASFAD 677

Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
           PYVLL M DGS +L+VGD  T TVSV TP  +  S  P+S+CTLY+D+GP PWLR+ + D
Sbjct: 678 PYVLLRMQDGSCQLVVGDSETLTVSVSTPPELGLSPDPISACTLYNDRGPSPWLRRATGD 737

Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
            W + GV +A         DQGD+Y +VC  SG +E  ++P+  C++ V++   G   + 
Sbjct: 738 VWQTLGVPDA-----NFAFDQGDMYCIVCRNSGTMEFLELPSMACLYRVERLPYGVQVLA 792

Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
           D+  R A K      ++  EEG  + R E +  +KVV++ +  W   + RPF+F +L+DG
Sbjct: 793 DS--RTASKVPVDTSSNKDEEGAEEIR-ERMSKIKVVDICVDTWGEKYGRPFVFVLLSDG 849

Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
           T+L Y+A+++EG ++ + + D  S               RNLRF R  LD    EE  + 
Sbjct: 850 TLLSYRAFIYEGQDSGAHASDGTS--------------FRNLRFLRLQLDLELGEEDSNA 895

Query: 909 ---APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
                 Q+I  FK++ G QG FL+G +P W M+FRE++R+HPQ  DG IVAFT LHNVNC
Sbjct: 896 DEVRSVQKIIPFKDVGGLQGLFLAGGKPTWLMIFREQIRLHPQASDGPIVAFTSLHNVNC 955

Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            HG IYVT++  LKIC+L +   YDN WPVQK+
Sbjct: 956 QHGLIYVTNEASLKICRLSNILNYDNDWPVQKI 988


>gi|414587801|tpg|DAA38372.1| TPA: hypothetical protein ZEAMMB73_993613 [Zea mays]
          Length = 573

 Score =  656 bits (1692), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/569 (58%), Positives = 411/569 (72%), Gaps = 34/569 (5%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE--------ELDSELP-SKRG 51
           MS+AAYKMMH PTGI +C +GFITHS AD                   ++DS    + R 
Sbjct: 1   MSYAAYKMMHLPTGIDHCAAGFITHSPADAAAFSTPAPAPTAAAGPDGDIDSTAARAPRR 60

Query: 52  IGPVPNLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYR-- 108
           +GP PNLVV+AANV+E+Y VR +   G++++ NS  T    ++DGIS A LELVCHYR  
Sbjct: 61  VGPTPNLVVSAANVLEVYAVRAEVATGAEDAGNSSSTG--TILDGISGARLELVCHYRCK 118

Query: 109 ----------------LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
                           LHGN+ES+A+LS G      RRDSI + F DAKI+ LEFDDSI+
Sbjct: 119 QMALASLHSLLAVNFRLHGNIESMAVLSDG---TENRRDSIAVTFNDAKITCLEFDDSIN 175

Query: 153 GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
           GLR +SMHCFE PEW HLKRGRESFA GP++K DPQGRCG VLVYGLQ+IILKA+Q G  
Sbjct: 176 GLRTSSMHCFEGPEWFHLKRGRESFAWGPIIKGDPQGRCGAVLVYGLQIIILKAAQVGQS 235

Query: 213 LVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
           LVG+++        + RIESS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAG
Sbjct: 236 LVGEDEPTRVLSSTAVRIESSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAG 295

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           R+S K  TCM+SA SIS  LKQHP+IWSA  LPHDAY+LLAVP PI G+LV+ AN+IHYH
Sbjct: 296 RISSKSQTCMLSAFSISMGLKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYH 355

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
           SQS SC+LALN+++   D S E+ ++SF VELD A ATWL +D+ + S+K G+++LLTVV
Sbjct: 356 SQSTSCSLALNSFSSQPDGSPEILKTSFHVELDVAKATWLSHDIVMFSSKNGEILLLTVV 415

Query: 393 YDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
           YDGR VQRLDL K+  SVL+S  TT+G+S  FLGSRL DSLLVQF+CG  TS+L   L +
Sbjct: 416 YDGRAVQRLDLMKSKASVLSSGATTLGSSFIFLGSRLADSLLVQFSCGMPTSVLPD-LTD 474

Query: 453 EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
           E  DIE+D P +KRL+R  SD LQD+ + EELS +  A  N   + +  SF VRD+L+N+
Sbjct: 475 EPADIESDLPFSKRLKRIPSDVLQDVTSVEELSFHNKAVPNIVDSAEKISFVVRDALINV 534

Query: 513 GPLKDFSYGLRINADASATGISKQSNYEL 541
           GPLKDF+YGLR N+D +A GI+KQSNYEL
Sbjct: 535 GPLKDFAYGLRTNSDPNAAGIAKQSNYEL 563


>gi|242075248|ref|XP_002447560.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
 gi|241938743|gb|EES11888.1| hypothetical protein SORBIDRAFT_06g003580 [Sorghum bicolor]
          Length = 374

 Score =  444 bits (1141), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 220/359 (61%), Positives = 267/359 (74%), Gaps = 14/359 (3%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTE-------ELDSELPSK-RGI 52
           MS+AAYKMMHWPT I +C +GFITHS AD                  ++DS   S  R +
Sbjct: 1   MSYAAYKMMHWPTSIDHCAAGFITHSPADAAAFSSAAPAAAASGPDGDIDSAAASAPRRV 60

Query: 53  GPVPNLVVTAANVIEIYVVRVQEE-GSKESKNSGETKRRVLMDGISAASLELVCHYRLHG 111
           GP PNLVV+AANV+E+Y VR     G+++  NS  T    ++DGIS A LELVCHYRLHG
Sbjct: 61  GPTPNLVVSAANVLEVYAVRADSATGAEDVGNSSSTG--AILDGISGARLELVCHYRLHG 118

Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
           N+ES+A+LS G      RRDSI + F+DAKI+ +EFDDS +GLR +SMHCFE PEW HLK
Sbjct: 119 NIESMAVLSDG---TENRRDSIAVTFKDAKIACMEFDDSTNGLRTSSMHCFEGPEWFHLK 175

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
           RGRESFA GP++K DPQGRCG VLVYGLQMIILKA++ G  LVG+++        + RIE
Sbjct: 176 RGRESFAWGPIIKADPQGRCGAVLVYGLQMIILKAAEVGQSLVGEDEPTRMLSSTAVRIE 235

Query: 232 SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
           SS+VI+LRDL+M H+KDF FVHGYIEPV+VILHERE TWAGR+S K  TCM+SA SIS  
Sbjct: 236 SSYVIDLRDLEMNHIKDFTFVHGYIEPVLVILHEREPTWAGRISSKSQTCMLSAFSISMG 295

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
           LKQHP+IWSA  LPHDAY+LLAVP PI G+LV+ AN+IHYHSQS SC+LALN+++   D
Sbjct: 296 LKQHPMIWSAAKLPHDAYQLLAVPPPISGILVICANSIHYHSQSTSCSLALNSFSSQPD 354


>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Cucumis sativus]
          Length = 741

 Score =  434 bits (1116), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 212/302 (70%), Positives = 237/302 (78%), Gaps = 2/302 (0%)

Query: 697 PAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVV 756
           PAA  SSKK VSSCTLY DKG EPWLR TSTDAWLSTGVGE IDG DG   DQGDIY V 
Sbjct: 11  PAAFGSSKKCVSSCTLYQDKGIEPWLRMTSTDAWLSTGVGETIDGTDGSLQDQGDIYCVA 70

Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
           CY++G LEIFDVPNF  VF VDKFVSG++H+VD  + +  K SE + NS  +E    GR 
Sbjct: 71  CYDNGDLEIFDVPNFTSVFYVDKFVSGKSHLVDHQISDLQKSSEVDQNS--QELISHGRN 128

Query: 817 ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
           E+  +MKV+E+AMQRWS  HSRPFLF ILTDGTILCY AYLFE  ++ SK DD VS   S
Sbjct: 129 ESSQNMKVIEVAMQRWSGQHSRPFLFGILTDGTILCYHAYLFESTDSASKIDDSVSIDNS 188

Query: 877 LSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC 936
           +S SN+S+SRLRNLRF R PLD   RE+ P+G   +R++IFKNISG+QG FL GSRP W 
Sbjct: 189 VSSSNMSSSRLRNLRFLRVPLDIQGREDMPNGTLSRRLSIFKNISGYQGLFLCGSRPAWF 248

Query: 937 MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
           MVFRERLRVHPQLCDG IVAF VLHNVNCNHG IYVTSQG+LKICQLPS S YDNYWPVQ
Sbjct: 249 MVFRERLRVHPQLCDGPIVAFAVLHNVNCNHGLIYVTSQGVLKICQLPSTSNYDNYWPVQ 308

Query: 997 KV 998
           KV
Sbjct: 309 KV 310


>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
 gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
          Length = 1423

 Score =  418 bits (1074), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 331/1073 (30%), Positives = 514/1073 (47%), Gaps = 156/1073 (14%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MSFA +K +H PTG+ +  + + TH   D  P                       PNLVV
Sbjct: 1    MSFAIHKQVHPPTGVDHAVAAYFTHPIGDGGP-----------------------PNLVV 37

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
              AN + ++ +R        +  SG+        G  A SLE+V  + L+G V S+A++ 
Sbjct: 38   MQANHLTVFAIRRD----PSADASGDAAL-----GAKAMSLEVVAEFDLNGTVGSIAVMR 88

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR- 179
            +       +RD++++A  ++K+SV+E+D S   +  +S+H +E+P       G  S  R 
Sbjct: 89   RRSGAPRNQRDALLIAVRESKLSVIEWDPSEMTVVPSSLHSWETPVG---TGGVPSALRV 145

Query: 180  ---GPLVKVDPQGRCGGVLVY--GLQMIIL----KASQGGSGLVGDEDTFGSGGGFSARI 230
                PL   DP+GRC  VL+   G   + L     A     G  G +D    G G +A +
Sbjct: 146  APLPPLAIADPEGRCAAVLLRAEGRSRLALCPAVDADADADGDGGGDDGDRRGQGPAASV 205

Query: 231  ESSHVINLR-DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
              S V++L  DL +  V+D  F+HGY EPV++ILHERE TWA R+   + TC+++A+SI+
Sbjct: 206  RKSFVVDLTADLALSGVRDAAFLHGYGEPVVLILHEREPTWAARMPLVNDTCVLTAVSIN 265

Query: 290  TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
               K+  +IW    LP   Y+L A+P P+GG +V+  N + + SQ +S ALALN  A   
Sbjct: 266  LDTKRCTVIWQREKLPCTCYRLCAMPDPLGGAIVLSNNFLLHESQESSKALALNPLAGGG 325

Query: 350  DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ---RLDLSKT 406
              S        SVELD+AHA  L     L++TK G L+LL++  +GR +     + L + 
Sbjct: 326  TESA----LGVSVELDSAHAAVLSERQVLVTTKQGALMLLSLRVEGRRLAAHGAMHLRRA 381

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--KEEFGDIEADA 461
              +VL+S +  I   L FLGSR+GDSLLV            ML +    K + G+ E   
Sbjct: 382  GGAVLSSGMCLITKRLLFLGSRVGDSLLVSLKKKEAAGAAQMLPAAAPKKRKAGEAEPPK 441

Query: 462  PSTKRLRRSSS----DALQDMVNGE-ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLK 516
            P     +  +S    D L+ M+ GE E +   + +   E     ++F VRDS++ I P+ 
Sbjct: 442  PPPPPQKVGTSQDDEDELEAMLYGEGEAAAKAANAGRKE--DPGYTFTVRDSVLGISPII 499

Query: 517  DFSYGLRINADA---------SATGISKQSNYELVE---------------LPGCKGIWT 552
            D + G   +            +A G  K     +++               LPG  G WT
Sbjct: 500  DLTAGASASVQGDTEERAELVAACGHGKNGALAILQRGIQPELVTEVEAGTLPGLMGTWT 559

Query: 553  VYHKS---SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
            VYH+S    R   + ++  AA  D +H+YL+ISLE+ TMVLET + L EV+E+V+     
Sbjct: 560  VYHESRDNERLRESGAAAAAANVDPFHSYLVISLESTTMVLETGEELREVSEAVELVTDA 619

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
             T+AAGN+ GR+R+ QV + G RI +G    QDLS       +G  S +  +++  + DP
Sbjct: 620  ATLAAGNMHGRKRIAQVHKGGVRICEGPVKIQDLSAA-EMPAAGDVSPDLEIIAAQVLDP 678

Query: 670  YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES--SKKPVSSCTLYHDKGP--------- 718
            YVL  MSDGS+R+L GD    +V   +P++  +  + + ++S  L  D  P         
Sbjct: 679  YVLCRMSDGSLRVLKGDEEKGSVEAMSPSSYANLPTGESIASAALVDDSVPAAERPGLTT 738

Query: 719  -EP-WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT 776
             EP +LR+T+T    STGV          P D+      V    G LE++ +P+   +++
Sbjct: 739  REPGFLRRTAT----STGV---------LPEDEEGTVLAVTRVGGTLELYALPSCERIWS 785

Query: 777  VDKFVSGRTHIVDTYMREALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
             D    G   +      + +  D + E+  +          ++  + ++VE  +  +   
Sbjct: 786  ADGLSEGLNVLAPGGAGDDVNVDGDGEVEPT----------DDYPAPEIVEFRLDAFPRA 835

Query: 836  HSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
            H RP L A+  DG++L Y+A+L   G  N      P                   LRF R
Sbjct: 836  HERPMLTALRGDGSVLVYRAFLCPPGAGNVGHEAKP------------------QLRFCR 877

Query: 895  TPLD------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
             P++           +   G+   R     +  G +G F+SG RP W +V R R+   P 
Sbjct: 878  VPIELEGGGGGMVDTKALSGSRLTRFERVGDRGGIRGVFVSGPRPLWLLVRRSRVLALPI 937

Query: 949  LCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVVF 1000
              +    V+FT  HNVNC +GF+  T+ G ++ICQ+P    Y+  WPV+K+  
Sbjct: 938  RGEAQRTVSFTPFHNVNCLNGFMLGTAAGGVRICQIPGRMHYEAAWPVRKLAL 990


>gi|449477808|ref|XP_004155129.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Cucumis sativus]
          Length = 643

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 188/255 (73%), Positives = 220/255 (86%), Gaps = 1/255 (0%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
           MSFAAY+MMHWPTGI NC S +ITHSRAD+VP +    +++LDS+   +R IGPVPNLVV
Sbjct: 1   MSFAAYRMMHWPTGIENCDSAYITHSRADFVPAVTS-HSDDLDSDWHPRRDIGPVPNLVV 59

Query: 61  TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
           TA NV+E+YVVRV EEG +ESK+SGE KR  +MDG+S ASLELVCHYRLHGNVES+AILS
Sbjct: 60  TAGNVLEVYVVRVLEEGGRESKSSGEVKRGGIMDGVSWASLELVCHYRLHGNVESMAILS 119

Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             G D S++RDSIIL F++AKISVLEFDDS H LR +SMHCF+ P+WLHLKRGRESFARG
Sbjct: 120 SRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWLHLKRGRESFARG 179

Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
           P+VKVDPQGRCGGVLVYGLQMIILKASQ GSGLV D++ FG+ G  SAR+ESS++INLRD
Sbjct: 180 PVVKVDPQGRCGGVLVYGLQMIILKASQAGSGLVVDDEAFGNTGAISARVESSYLINLRD 239

Query: 241 LDMKHVKDFIFVHGY 255
           LD+KHVKDF+FVH Y
Sbjct: 240 LDVKHVKDFVFVHVY 254



 Score =  367 bits (943), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 189/258 (73%), Positives = 208/258 (80%), Gaps = 26/258 (10%)

Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
           GDIE DA + KR+RRSSSDALQDMV G+ELSLYGSA+NNTESAQK FSFAVRDSL+NIGP
Sbjct: 342 GDIEVDAHTAKRMRRSSSDALQDMVGGDELSLYGSAANNTESAQKIFSFAVRDSLINIGP 401

Query: 515 LKDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCK 548
           LKDFSYGLRINAD +ATGI+KQSNYELV                          ELPGCK
Sbjct: 402 LKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEMITEVELPGCK 461

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
           GIWTVYHK++RG  ADSSRM   DDEYHAYLIISLEARTMVL T +LLTEVTESVDYFV 
Sbjct: 462 GIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIISLEARTMVLVTGELLTEVTESVDYFVH 521

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
           GRTIAAGNLFGRRRVIQV+E GARILDGS+MTQDL+   + +ESG+ SE  TVLS SI+D
Sbjct: 522 GRTIAAGNLFGRRRVIQVYESGARILDGSFMTQDLNLVVNGNESGNASEGCTVLSASISD 581

Query: 669 PYVLLGMSDGSIRLLVGD 686
           PYVLL M+DGSIRLLVG+
Sbjct: 582 PYVLLTMTDGSIRLLVGE 599


>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1386

 Score =  399 bits (1024), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 304/1076 (28%), Positives = 496/1076 (46%), Gaps = 196/1076 (18%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MS A ++ +H PTG+ +  + + T    D                       G  PNL+V
Sbjct: 1    MSHAVHREVHPPTGVDHAVTAYFTRPVGD-----------------------GGDPNLIV 37

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
             +AN I +Y V     G +ES                   L++   +   G + S+++L 
Sbjct: 38   ASANRITVYAV--NRRGDEES-------------------LDVCAEFDAQGAIGSMSVLR 76

Query: 121  QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES-----PEWLHLKRGRE 175
            +       +RD++++A  + K+SV+E+D +   +  +SMH FES     P    L+  RE
Sbjct: 77   RRFGAPRNQRDALLIAIRERKLSVVEYDAATGDVCCSSMHSFESALGCNPLGTTLRMSRE 136

Query: 176  SFARGPLVKVDPQGRCGGVLV----YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE 231
            +    PLV  DP+GRC  V++       ++ +L +  GG GLV ++D  G   G +A + 
Sbjct: 137  A----PLVVSDPEGRCAAVVLREDGVAGKVRVLPSVDGGLGLVANDDE-GRVRGPAASVR 191

Query: 232  SSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
             S  ++L  + +  ++D  F+HGY EP + +L+E+  TWAGR +    TC I ALS+   
Sbjct: 192  ESFPLHLPGVRL--IRDACFLHGYGEPALAVLYEKTPTWAGRYNLSKDTCEIVALSVDVD 249

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             ++  +IW   NLP  +YKL A+  P+GG LV   + + + SQ +S  L LN +      
Sbjct: 250  KQKGTVIWRRQNLPSSSYKLTALLPPLGGALVFSQDFLLHESQESSSVLGLNTFGHG--G 307

Query: 352  SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
             QE   +   + LD A A+ +  D  L++TKTG L+LL +  DGR ++R+ L +   +VL
Sbjct: 308  PQE--GNDAEITLDGAQASVVSEDRVLVTTKTGALLLLALHTDGRSLRRMMLQRAGGAVL 365

Query: 412  TSDITTIGNSLFFLGSRLGDSLLVQFTCG---SGTSML-----------SSGLKEEFGDI 457
            +S +  +   L FLGSR+GDSLLV+FT     +   ML           +   K++  ++
Sbjct: 366  SSGMCLLSRDLLFLGSRIGDSLLVKFTPKEEPTAPLMLPDAEDESEDEATEKSKDDDDEL 425

Query: 458  EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
            EA    T +     +DA+Q      E    G A          +   V+DSL+ + P+ D
Sbjct: 426  EALLYGTTKTETVQTDAVQ-----TEKKREGLAGIIPGLKVAGYDLKVKDSLLGVAPVVD 480

Query: 518  FSYGLRINADASATGISKQSNYELV-----------------------------ELPGCK 548
             + G      ++  G +K    EL+                              LP  +
Sbjct: 481  IAVGA-----SAPMGSNKNERTELITACGQGKNGALAILTRGVQPELVTEVESGTLPNLQ 535

Query: 549  GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            G+WT+++   R   +   R     + +H +L++S+++ TM++ET + L EV+ S+++   
Sbjct: 536  GLWTLHY---RKEGSKEER-----EPFHHHLLLSMKSSTMIMETGEELQEVSASLEFITN 587

Query: 609  GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
              T+AA N+FG    +QV   G R+L G    QD+     ++  G     + + S  I D
Sbjct: 588  QATLAASNIFGHYCSVQVTGTGIRVLKGGVKVQDVGLQDMDAPKG-----AAIASAQILD 642

Query: 669  PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK--------GPEP 720
            PY+++ +SDGSIRLL GD    +VS+    AI +S   V++  L  D         G E 
Sbjct: 643  PYIIVRLSDGSIRLLSGDEKQMSVSLMETGAIPTSS--VTAFALVDDSVEAADAAGGGER 700

Query: 721  ---WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV 777
               W+ + +T+  ++   G    GA     +  +    +  E G+LE+F +P+   ++  
Sbjct: 701  KSGWIHRAATNGTITGLEGNKKSGA----CNNSEAIVALTREGGSLELFSLPSCTRIWCA 756

Query: 778  DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
            D    G        MR  +   +T +N+ S               ++V++ +  +   H 
Sbjct: 757  DGLSEG--------MR--VLSPQTPVNAESS------------VPEIVDIRIDSFQDAHE 794

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
            RP L A+  DGT+L Y+ ++          D+P+  +               LRFSR  +
Sbjct: 795  RPLLTAVRGDGTLLLYKGFIVPAGTTYEGQDEPLEKN--------------ELRFSRVNV 840

Query: 898  D-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLR 944
            D             A    ++  GA   RI       G QG F++G  P W +V R R+ 
Sbjct: 841  DVEGSGLNVAGIGAAGQLRDSLAGARLTRIGNVGEGQGVQGIFVAGPNPLWLIVRRSRVL 900

Query: 945  VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVVF 1000
              P   +G +VAFTV HNVNC HGFI  T+ G ++ICQ+PS   Y+  WPV+KV  
Sbjct: 901  ALPTRGEGEVVAFTVFHNVNCPHGFILGTALGGVRICQMPSKMHYEAAWPVRKVAL 956


>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Takifugu rubripes]
          Length = 1444

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 310/1052 (29%), Positives = 501/1052 (47%), Gaps = 173/1052 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E + ++  S ++K R          LE V  + L GNV S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +   GA+    RD+++L+F+DAK+SV+E+D   H L+  S+H FE    L L+ G   
Sbjct: 82   ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEE---LELRDGFVQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                P+V+VDP+ RC  +L+YG ++++L   +     + DE   G G G  +    +++I
Sbjct: 135  NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDT---LTDEQEVGVGEGPKSSFLPTYII 191

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 252  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 311

Query: 354  ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
               +    + LD + A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312  LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413  SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA-----PSTKRL 467
            + + T+     FLGSRLG+SLL+++T       L  G  ++  + E D      P +K+ 
Sbjct: 372  TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQDMPLEEGKDQQDKEKEKDMDKQEEPPSKKK 431

Query: 468  RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
            R  SS    D V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + S G     
Sbjct: 432  RVESSSNWTDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAFL 487

Query: 522  ---LRINADAS-----ATGISKQSNYELV------------ELPGCKGIWTVY------- 554
                + N +        +G  K     ++            ELPGC  +WTV        
Sbjct: 488  SEEFQSNPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEPVQK 547

Query: 555  --HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
               ++ R     +   A  D + H +LI+S E  TM+L+T   + E+  S  +  QG T+
Sbjct: 548  EQEETEREGKEKTEPPAEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 606

Query: 613  AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
             AGN+   + +IQV   G R+L+G  +TQ L F P +         S ++  S+ADPYV+
Sbjct: 607  FAGNIGDNKYIIQVSPMGIRLLEG--VTQ-LHFIPVDL-------GSPIVHCSLADPYVV 656

Query: 673  LGMSDGSIRLLVGD-----PSTCTVSVQTPAAIESSK-------KPVS---------SCT 711
            +  ++G + + V         T  +++Q P     S+       + VS         SC+
Sbjct: 657  IMTAEGVVTMFVLKIDSYMGKTHRLALQKPQISTQSRVIALCAYRDVSGMFTTENKVSCS 716

Query: 712  LYHDKGPEPWLRKTSTDAWLSTGV---------GEAIDGADGGPLDQGDI---------- 752
            +  D          +    LST +         G++  G     +++             
Sbjct: 717  ITEDISIRSQSEAETIIQDLSTNIVDDEEEMLYGDSNTGPSKEEMNRSSFAGPSEGSYSK 776

Query: 753  -----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
                 + ++  +SG +EI+ +P++  VF V  F  G+  +VD+   ++    E E     
Sbjct: 777  AEPSHWCLITRDSGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSATQGEKE--GKK 834

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF--EGPENTS 865
            EE T QG    +  + +V L       +HSRP+L  +  D  +L Y+A+ +  + P+N  
Sbjct: 835  EEVTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVDQELLIYEAFPYDQQQPQNNL 888

Query: 866  KSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--------ETPHGAPCQ----- 912
            K                       +RF + P +   RE        +   G   +     
Sbjct: 889  K-----------------------VRFKKVPHNINFREKKSKLRKDKKAEGTAAEDSVAA 925

Query: 913  -----RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCN 966
                 R   F++ISG+ G F+ G  P W +V  R  LR+HP   DG I +F+  HN+NC 
Sbjct: 926  RGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMSIDGPIESFSPFHNINCP 985

Query: 967  HGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             GF+Y   QG L+I  LP+  +YD  WPV+K+
Sbjct: 986  KGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 1017


>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1469

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 318/1100 (28%), Positives = 521/1100 (47%), Gaps = 165/1100 (15%)

Query: 1    MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV 60
            MSFA +K +H PTG+ +  + + TH         P+              G G  PNLVV
Sbjct: 1    MSFAIHKQVHPPTGVDHACAAYFTH---------PI--------------GSGAPPNLVV 37

Query: 61   TAANVIEIYVVRVQEEGSKESKNSGETKRR---------VLMDGISAA------------ 99
              AN + IY +R   +G      SG   +          ++ D IS A            
Sbjct: 38   LQANRLTIYAIR--RDGDARDNPSGNATKEADDAAIAASLVADAISGAGATASATIDADD 95

Query: 100  ---SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
               SLE+V  + L+G V S+A L +       +RD+++LA  ++K+SV+EFD S   L  
Sbjct: 96   AEVSLEVVAEFDLNGTVGSIATLRRRFGAPREQRDALLLAVRESKLSVVEFDPSTLSLVC 155

Query: 157  TSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY---GLQMIILKASQG 209
            +S+H +E+P       G  S  R     P+V  DP+GRC  VL+    G ++ +L     
Sbjct: 156  SSLHSWETPPG---AGGVPSALRLAPTPPVVVADPEGRCAAVLLRAEGGTRLALLPTDND 212

Query: 210  GSGLVGDEDTFGSGG----GFSARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILH 264
               + G + + G G     G +A ++ S+V++L R++ +++V+D  F+HGY EPV+++LH
Sbjct: 213  AMDVDGGDGSEGKGRRTLRGTAAAVKKSYVVDLVREMGVRYVRDVCFLHGYGEPVLLVLH 272

Query: 265  ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
            E  LTWA R +    T  +SA+S++   ++H +IW    LPH  Y+L A+P+P+GG +V+
Sbjct: 273  EERLTWAARATLVKDTMRLSAISLNVDARKHTVIWRRSALPHSCYRLTAMPAPLGGAIVL 332

Query: 325  GANTIHYHSQSASCALALNNYA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
              N + + SQ +S ALALN  A      D + +   ++ +  LD A+A  +    AL++T
Sbjct: 333  SQNFLLHESQESSAALALNPLAGGGRGDDPAAKAAAAASAAALDGAYAAVISEKQALVTT 392

Query: 382  KTGDLVLLTVVYDGRVVQR---LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            K G L LL++  +GR +     + L +   +VL+S +  +   L FLGSR+GDSLLV   
Sbjct: 393  KAGALYLLSLRIEGRRLATRGGMHLKRAGGAVLSSGMCLVTRRLLFLGSRVGDSLLVS-R 451

Query: 439  CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA- 497
            C +  +  ++  +       A   +   +R        D V G   +   +A+    +  
Sbjct: 452  CSTARASTAAPGRRPRAAAAAATTAAAEVRLLPIRPQIDGVGGVSAASLRAAAAAHRAPD 511

Query: 498  QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATG----------------------ISK 535
               ++F VRDS++ I P+ D + G    A AS +G                      + +
Sbjct: 512  HPGYTFTVRDSVLGISPVIDLTVG----ASASVSGDTIERTELIAACGHGKNGALAVLQR 567

Query: 536  QSNYELVE------LPGCKGIWTVYHKSS---RGHNADSSRMAAYDDEYHAYLIISLEAR 586
                ELV       LPG KG WTV+H S+   R   + ++  A   D YHAYL+ISL + 
Sbjct: 568  GIQPELVTEVESGTLPGLKGTWTVHHDSADNERLRGSAAAAAAQAVDPYHAYLVISLASS 627

Query: 587  TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
            TM+LET + L EV+E V+      T+ AGN FGR R++QV+++G R+  G    QD++  
Sbjct: 628  TMILETGEELKEVSEHVELVTDAATLCAGNAFGRERIVQVYDKGVRVAAGPVKVQDIAST 687

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-------VQTP-- 697
               +++G G E   +++  I+ PYVL  +SDGS+ +L GD  + T+         + P  
Sbjct: 688  ELVADAGDG-EGIEIVAAEISFPYVLCRLSDGSLAVLKGDEESKTLVKLDVDALARLPPG 746

Query: 698  -----AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGADGGPLDQG 750
                 A +     P ++    HD+ P  +L++ +T    +T    +   +  D     + 
Sbjct: 747  GGIACATLVDDSTPAAAHGGLHDRSPG-FLKRATTATATTTTTTASASREDGDDDDDSRR 805

Query: 751  DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
             ++  V    GALE++ +P+ +  +T +    G   +                     + 
Sbjct: 806  PMFLAVTRTGGALELYSLPSCDKAWTANGLSEGVAVLSPA--------GSASAALVDRDA 857

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                      + ++VEL +  ++  H RP L A+  DG +L Y+A+              
Sbjct: 858  AAAADAGADRAPEIVELRVDAFARAHERPLLTALRADGAVLVYRAF-------------- 903

Query: 871  VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA------PCQRITIFKNI---S 921
                 + +V+      L  LRF+R P++    E    GA      P  R+T F+ +    
Sbjct: 904  -----TCAVAGPGGRALTQLRFARVPVEL---EGGGGGAVDLSALPGSRLTRFERVGDRG 955

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS-IVAFTVLHNVNCNHGFIYVTSQGILKI 980
            G +G F+SG +P W +  R R+   P   +   +V+FT  HNVNC+ GFI  T+ G ++I
Sbjct: 956  GIRGVFVSGPQPLWLLARRSRVLALPVRGEAQRVVSFTAFHNVNCHAGFILGTAAGGVRI 1015

Query: 981  CQLPSGSTYDNYWPVQKVVF 1000
            CQ+P    Y+  WPV+K+  
Sbjct: 1016 CQIPGRMHYEAAWPVRKLAL 1035


>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oreochromis niloticus]
          Length = 1456

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 315/1080 (29%), Positives = 504/1080 (46%), Gaps = 217/1080 (20%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E + ++  S ++K R          LE V  + L GN+ S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSKADKSSDSKSR-------KEKLEQVASFSLFGNIMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            A +   GA     RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82   ASVQLVGAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                P+V+VDP+ RC  +LVYG ++++L   +     + DE   G G G  +    S++I
Sbjct: 135  NVHIPVVRVDPENRCAVMLVYGTKLVVLPFRKDT---LTDEQESGVGEGPKSSFLPSYII 191

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS         Y VSL+S   
Sbjct: 252  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305

Query: 352  -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
             +   P   +    + LD   + ++  D  ++S K G++ +LT++ DG R V+     K 
Sbjct: 306  GTTAFPLRVQDEVKLTLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA---PS 463
              SVLT+ + T+     FLGSRLG+SLL+++T     +    G + +  + + D    PS
Sbjct: 366  AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQETPAEEGKERQDKEKDKDKQEPPS 425

Query: 464  TKRLRRSSSD----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
             K+   SS++           L D V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 426  KKKRVESSTNWTVCVILDFFVLSDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 481

Query: 513  GPLKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGC 547
            GP  + S G         + N +        +G  K     ++            ELPGC
Sbjct: 482  GPCANASMGEPAFLSEEFQSNPEPDLEVVVCSGYGKNGALSVLQRSIRPQVVTTFELPGC 541

Query: 548  KGIWTVYHKSSRGHNADSSRMAAY------------DDEYHAYLIISLEARTMVLETADL 595
              +WTV     +    D   +               D + H +LI+S E  TM+L+T   
Sbjct: 542  HDMWTVISSDVKEDKTDKEEVEKEEEEKKTEPPLEDDAKKHGFLILSREDSTMILQTGQE 601

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G    + L F P +      
Sbjct: 602  IMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGLRLLEG---VRQLHFIPVDL----- 652

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDP-----STCTVSVQTPAAIESSKKPVSSC 710
               S ++  S+ADPYV++  ++G + + V         T  +++Q P  I S  + ++ C
Sbjct: 653  --GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDSYMGKTHRLALQKP-QIPSQSRVITLC 709

Query: 711  -------------------------------TLYHDKG-----PEPWLRKTSTDAWLSTG 734
                                           T+ HD        E  L   S  +  +T 
Sbjct: 710  AYRDVSGMFTTENKVSCSIKEDTIRSQSEAETIIHDMSNTVDDEEEMLYGDSNAS--ATP 767

Query: 735  VGEAIDGADGGPLDQGDI----------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGR 784
              E I+ +   P   G            + ++  E+G +EI+ +P++  VF V  F  G+
Sbjct: 768  AKEDINRSFVAPTTSGSEATSSKAEPTHWCMIIRENGVMEIYQLPDWRLVFLVKNFPVGQ 827

Query: 785  THIVDTYMREALKDSETEINSSSEEGT-GQGRKENIHSMK----VVELAMQRWSAHHSRP 839
              +VD+              SS +  T G+G+KE +        V E+A+     +HS+P
Sbjct: 828  RVLVDS--------------SSGQSATQGEGKKEEVTRQGEIPLVKEVALVSLGNNHSKP 873

Query: 840  FLFAILTDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
            +L  +  +  +L Y+A+ +  + P+N  K                       +RF + P 
Sbjct: 874  YLL-VHVEQELLIYEAFQYDQQQPQNNLK-----------------------VRFKKVPH 909

Query: 898  DAYTRE----------------ETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCMVF 939
            +   RE                E   G   +  R   F++ISG+ G F+ G  P W +V 
Sbjct: 910  NINFREKKSKLKKDKKAESSATEESSGVKGRIARFRFFEDISGYSGVFICGPSPHWMLVT 969

Query: 940  -RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             R  LR+HP   DGSI +F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+K+
Sbjct: 970  SRGALRLHPMTIDGSIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 1029


>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oryzias latipes]
          Length = 1456

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 320/1074 (29%), Positives = 502/1074 (46%), Gaps = 205/1074 (19%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV   + + +Y +    E +  S  S + K R          LE V  + L GNV S+
Sbjct: 29   NLVVAGTSQLFVYRIIHDVESTSSSDKSSDAKTR-------KEKLEQVASFSLFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            A +   GA     +D+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82   ASVQLTGAS----KDALLLSFKDAKLSVIEYDPGTHDLKTLSLHYFEEPE---LRDGFFQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                P+V+VDP+ RC  +L+YG ++++L   +     + DE   G G G  +    S++I
Sbjct: 135  NVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDT---LSDEQEGGVGEGPKSSFLPSYII 191

Query: 237  NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192  DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295  HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
            HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS         Y VSL+S   
Sbjct: 252  HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVSLNSQTN 305

Query: 352  -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
             +   P   +    + LD   + ++  D  ++S K G++ +LT++ DG R V+     K 
Sbjct: 306  GTTSFPLRVQEEVKITLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 365

Query: 407  NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
              SVLT+ + T+     FLGSRLG+SLL+++T     +    G  ++  + E   P  K+
Sbjct: 366  AASVLTTCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPAEDGNDKQ--EKEKQEPPNKK 423

Query: 467  LRRSSSD-----------ALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGP 514
             R  SS             L D V+  E+ +YGS A + T+ A  TFSF V DS++NIGP
Sbjct: 424  KRVESSSNWTGCSASYFFVLSDEVD--EIEVYGSEAQSGTQLA--TFSFEVCDSILNIGP 479

Query: 515  LKDFSYG--------LRINADAS-----ATGISKQSNYELV------------ELPGCKG 549
              + S G         + N +        +G  K     ++            ELPGC  
Sbjct: 480  CANASMGEPAFLSEEFQSNPEPDLEIVVCSGYGKNGALSVLQRSIRPQVVTTFELPGCHD 539

Query: 550  IWTVY----HKSSRG--HNADSSRMAAYDD---------EYHAYLIISLEARTMVLETAD 594
            +WTV      K S G    AD+ +    D          + H +LI+S E  TM+L+T  
Sbjct: 540  MWTVISGEDKKESEGGEKEADAEKKEEQDKTEPPLEDDAKKHGFLILSREDSTMILQTGQ 599

Query: 595  LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
             + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G    + L F P +     
Sbjct: 600  EIMELDTS-GFATQGPTVFAGNIGDNQYIIQVSPMGLRLLEG---VKQLHFIPVDL---- 651

Query: 655  GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT-----VSVQTPAAIESSK----- 704
                S ++  S+ADPYV++  ++G + + V    T       +++Q P     S+     
Sbjct: 652  ---GSPIVHCSVADPYVVIMTAEGVVTMFVLKSDTYMGKTHRLALQKPQISTLSRVIALC 708

Query: 705  -----------KPVSSC---------------TLYHDKG----PEPWLRKTSTDAWLSTG 734
                       +  SSC               T+Y D       E  +    + A ++ G
Sbjct: 709  AYRDVSGMFTTENKSSCSSKEDLILRSNSETETVYQDLSNTVDDEEEMLYGESGASMAAG 768

Query: 735  V-----GEAIDGADGGPLDQGDI----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
                  G A     GG    G      + V+  E+G +EI+ +P++  VF V  F  G+ 
Sbjct: 769  KEEMSRGSAATAPPGGEGSAGKAEPSHWCVLIRENGVMEIYQLPDWRLVFLVKNFPVGQR 828

Query: 786  HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
             +VD+    +   S T+ +   EE T QG    +  + +V L   R     SRP+L  + 
Sbjct: 829  VLVDS----SSGQSATQGDGKKEEVTRQGEIPLVKEVALVALGNNR-----SRPYLL-VH 878

Query: 846  TDGTILCYQAYLF--EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
             +  +L Y+A+ +  + P+N  K                       +RF + P     RE
Sbjct: 879  VENELLVYEAFPYDQQQPQNNLK-----------------------VRFKKVPHSINFRE 915

Query: 904  ETPH---------GAPCQRITI---------FKNISGHQGFFLSGSRPCWCMVF-RERLR 944
            + P          G P + + +         F++ISG+ G F+ G  P W ++  R  LR
Sbjct: 916  KKPKLKKDKKAEGGGPEENVAVKSRISRFRYFEDISGYSGVFICGPSPHWMLITSRGGLR 975

Query: 945  VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            +HP   DG I +F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+K+
Sbjct: 976  LHPMTIDGPIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 1029


>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
            chinensis]
          Length = 1469

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 319/1086 (29%), Positives = 498/1086 (45%), Gaps = 216/1086 (19%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NLVV  A   ++YV R+  +    +KN   T+ +   +      LELV  +   GNV S+
Sbjct: 29   NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82   ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
                P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 135  NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232  SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
             S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187  PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290  TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
             T + HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247  ITQRVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTAG 306

Query: 349  LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
              +     +    + LD AHA ++  D  ++S K G++ +LT+V DG R V+     K  
Sbjct: 307  TTAFPLRTQDGVRLTLDCAHAAFISYDKMVISLKGGEIYVLTLVTDGMRSVRAFHFDKAA 366

Query: 408  PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
             SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367  ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424

Query: 468  RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFS-- 519
              +     SS   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + +  
Sbjct: 425  DPTGGWSGSSTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480

Query: 520  ---------------YGLRINAD--ASATGI---------SKQSNYELV----------- 542
                           YGL   A+     TG+         S + + E+V           
Sbjct: 481  EPAFLSEEVGTGVAEYGLIGQAEGWGRRTGLTPAPVQFQNSPEPDLEIVVCSGYGKNGAL 540

Query: 543  ---------------ELPGCKGIWTVY-------HKSSRGHNADSSRMAAYDD--EYHAY 578
                           ELPGC  +WTV         ++ +    +  R A  +D    H +
Sbjct: 541  SVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKDEEETPKAEGTEQPRAAEAEDGVRRHGF 600

Query: 579  LIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY 638
            LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G  
Sbjct: 601  LILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG-- 657

Query: 639  MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-----TVS 693
                L F P +         + ++  ++ADPYV++  ++G + + +    T       ++
Sbjct: 658  -VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFLLKSDTYGGRHHRLA 709

Query: 694  VQTPAAIESSKKPVSSCTLYHD-------------KGPEPWLRKTSTDAWLSTGVGEAID 740
            +  P     SK  V +  LY D                EP  R +     L       +D
Sbjct: 710  LHKPPLHHQSK--VITLCLYRDVSGMFTTESRLGGARDEPGARGSCEVEGLGAETSPTVD 767

Query: 741  G------ADGG----------------PLDQGDI--------YSVVCYESGALEIFDVPN 770
                    D G                P D+           + ++  E+G +E++ +P+
Sbjct: 768  DEEEMLYGDSGSLFSPSKEETRRSSQPPADRDPAPFRAEPTHWCLLVRENGTMEMYQLPD 827

Query: 771  FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ 830
            +  VF V  F  G+  +VD+    +     T+  +  EE T QG    +  + +V L   
Sbjct: 828  WRLVFLVKNFPVGQRVLVDS----SFGQPATQAEARKEEATRQGELPLVKEVLLVALG-- 881

Query: 831  RWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL 890
               +  SRP+L  +  D  +L Y+A+    P ++            L   N+       +
Sbjct: 882  ---SRQSRPYLL-VHVDQELLLYEAF----PHDS-----------QLGQGNL------KV 916

Query: 891  RFSRTPLDAYTRE-------------ETPHGAPCQ----RITIFKNISGHQGFFLSGSRP 933
            RF + P +   RE              T  GA  +    R   F++I G+ G F+ G  P
Sbjct: 917  RFKKVPHNINFREKKLKPSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPSP 976

Query: 934  CWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
             W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD  
Sbjct: 977  HWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAP 1036

Query: 993  WPVQKV 998
            WPV+K+
Sbjct: 1037 WPVRKI 1042


>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
           [Crassostrea gigas]
          Length = 1369

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 303/993 (30%), Positives = 468/993 (47%), Gaps = 151/993 (15%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           +E +  + L GN+ S+  +   GA     RDS++L+F +AK+SV+E+D   H L+ TS+H
Sbjct: 5   MECLATFTLFGNIMSMKYVKLPGA----LRDSLLLSFSEAKLSVVEYDPGTHDLQTTSLH 60

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE P    +K G  +    P V+VDP GRC  +LVYG  M+IL   +      GD    
Sbjct: 61  FFEEPS---MKGGFFTNYCIPEVRVDPDGRCAAMLVYGTHMVILPFRRDVMVEEGD---- 113

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
              G   + I SS++I+LR+ D K  +VKDF F+HGY EP + IL E   TWAGR + + 
Sbjct: 114 NLAGTSKSPILSSYIIDLRNFDEKIINVKDFQFLHGYYEPTVFILFEPLQTWAGRTAVRA 173

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
            TC I A+S++   K HP+IWS  +LP D  ++LAVP PIGGV+++  N++ Y +QS   
Sbjct: 174 DTCSIVAISLNLQEKVHPVIWSLGSLPFDCCQVLAVPRPIGGVIIIAVNSLLYLNQSVP- 232

Query: 339 ALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                 Y VSL+S    S   P   +    + LD   A ++  D  +LS K G+L +LT+
Sbjct: 233 -----PYGVSLNSISAQSTLFPLRVQEGVRIALDCCQAAFMSYDKIVLSLKGGELYVLTL 287

Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
           V DG R V+  +  K+  SVLTS +    +   FLGSRLG+SLL+++T  +   + +  L
Sbjct: 288 VVDGMRSVRSFNFDKSAASVLTSCMCICEDGFLFLGSRLGNSLLLKYTEKASECLENGDL 347

Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVR 506
            ++    + D P+ K+ +   S  +   V    N  +L +YGSA N T +   +++F V 
Sbjct: 348 DKK----KEDEPAAKKKKVEGSTEIASDVSQIENLYDLEVYGSAENPTSTTITSYTFEVC 403

Query: 507 DSLVNIGPL------------KDFSYGLRINADASAT-GISKQSNYELV----------- 542
           D++ NIGP             ++FS     + +   T G  K     ++           
Sbjct: 404 DNIWNIGPCGNIVMGEPAFLSEEFSSCEDPDIEMVMTSGYGKNGALSVLQRSIRPQVVTT 463

Query: 543 -ELPGCKGIWTVYH--KSSRGHNADSSRMAAYDDEY---HAYLIISLEARTMVLETADLL 596
            ELPGC  +WTV       +  + ++S     DD     H++LI+S    +M+LET   +
Sbjct: 464 FELPGCLDMWTVKSLVPKEKSEDKENSMEDDSDDNIEGGHSFLILSRSDSSMILETGQEM 523

Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
            E+  S  +  Q  TI AGN+ G R ++QV +   R+L+G    Q +       ++G   
Sbjct: 524 NELDHS-GFSTQTTTIFAGNIGGDRYIVQVSDTSLRLLEGVRQIQHIPL-----DTG--- 574

Query: 657 ENSTVLSVSIADPYVLLGMSDGSI--------------RLLVGDPSTCTVSVQTPAAIES 702
             S V+  S+ADPY++L   +G I              RL+VG PS   +S  +   + S
Sbjct: 575 --SPVVQCSLADPYIVLLTQEGQILMFTLRTESVGLGVRLVVGKPS---ISQHSKVEVIS 629

Query: 703 SKKPVSSCTLYHDK------GPEPWLRKTSTDAWLSTGV-----------GEAIDGADGG 745
           + K VS      ++       P+    KT T+   S              GE        
Sbjct: 630 AYKDVSGLFTCMNQMEDVQVTPDTKATKTVTERSFSIDAKTADEEDELLYGETESNVFNS 689

Query: 746 PLDQGDI-------------------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTH 786
             + G                     + ++C E+G LEI+ +P++  V+ V  F  G+  
Sbjct: 690 SFNMGQTAEMESPTKEKKQTEAKPTYWLLLCRENGVLEIYSIPDYKKVYYVKNFPMGQKL 749

Query: 787 IVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILT 846
           +VD+           ++      G  Q  K N     + EL M       SRP L A + 
Sbjct: 750 LVDS----------VQVTDKLSSGERQ-EKVNAECPALKELLMVGLGYKDSRPHLLARVE 798

Query: 847 DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
           D        Y++E       S D     R   + +    R +  +  +   + + +EE  
Sbjct: 799 D------DLYIYEAFSYPQSSIDNHLKLRFKKIQHDLILREKRSKSKKKDPEEFQKEEKK 852

Query: 907 HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
            G    ++  FK+++G+ G F+ G+ P W  V  R  LR+HP   DG +  F+  HN+NC
Sbjct: 853 VG----KMRYFKDVAGYSGVFVCGAYPHWIFVTSRGSLRIHPMGIDGPVWCFSEFHNINC 908

Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            HGF+Y    G L+I  LP+  TYD  WPV+KV
Sbjct: 909 PHGFLYFNKMGELRISVLPTHLTYDAPWPVRKV 941


>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
            rerio]
          Length = 1449

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 299/1024 (29%), Positives = 488/1024 (47%), Gaps = 178/1024 (17%)

Query: 94   DGIS-AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
            DG S    LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H
Sbjct: 58   DGKSRKEKLEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTH 113

Query: 153  GLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
             L+  S+H FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L   +    
Sbjct: 114  DLKTLSLHYFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT-- 168

Query: 213  LVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTW 270
             + DE     G G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW
Sbjct: 169  -LADEQEGIVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTW 227

Query: 271  AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
             GRV+ +  TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ 
Sbjct: 228  PGRVAVRQDTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLL 287

Query: 331  YHSQSAS-CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
            Y +QS     ++LN+      +    P+    + LD + A+++ +D  ++S K G++ +L
Sbjct: 288  YLNQSVPPFGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVL 347

Query: 390  TVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
            T++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T     + +  
Sbjct: 348  TLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEE 407

Query: 449  GLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEELSLYGS-ASNNTESAQKT 500
            G + E  + +   P  K+ R  S+ A       L D ++  E+ +YGS A + T+ A  T
Sbjct: 408  GKENEEKEKQ---PPNKKKRVDSNWAGCPGKGNLPDELD--EIEVYGSEAQSGTQLA--T 460

Query: 501  FSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV----- 542
            +SF V DS++NIGP    S G         + N +        +G  K     ++     
Sbjct: 461  YSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQKSIR 520

Query: 543  -------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLIISL 583
                   ELPGC  +WTV +   +         G + +  +       D + H +LI+S 
Sbjct: 521  PQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSR 580

Query: 584  EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
            E  TM+L+T   + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G      L
Sbjct: 581  EDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQL 636

Query: 644  SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------------RLLVGDPS 688
             F P +         S ++  S+ADPYV++  ++G +               RL +  P 
Sbjct: 637  HFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQKPQ 689

Query: 689  TCT------------------------------VSVQTPAAIESSKKPVSSCT------L 712
              T                              ++++T +  E+  + +S+        L
Sbjct: 690  IHTQSRVITLCAYRDVSGMFTTENKVSFLAKEEIAIRTNSETETIIQDISNTVDDEEEML 749

Query: 713  YHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN 772
            Y +  P     K  +    +           G    +   + ++  E+G +EI+ +P++ 
Sbjct: 750  YGESNPLTSPNKEESSRGSAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLPDWR 809

Query: 773  CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
             VF V  F  G+  +VD+    +   S T+     EE T QG   +I  +K  E+A+   
Sbjct: 810  LVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTRQG---DIPLVK--EVALVSL 860

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              +HSRP+L A + +  +L Y+A+ ++  +                    + S L+ +RF
Sbjct: 861  GYNHSRPYLLAHV-EQELLIYEAFPYDQQQ--------------------AQSNLK-VRF 898

Query: 893  SRTPLDAYTREET--------PHG---------APCQRITIFKNISGHQGFFLSGSRPCW 935
             + P +   RE+         P G             R   F++ISG+ G F+ G  P W
Sbjct: 899  KKMPHNINYREKKVKVRKDKKPEGQGEDTLGVKGRVARFRYFQDISGYSGVFICGPSPHW 958

Query: 936  CMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
             +V  R  +R+HP   DG+I +F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WP
Sbjct: 959  MLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWP 1018

Query: 995  VQKV 998
            V+K+
Sbjct: 1019 VRKI 1022


>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
 gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
          Length = 1420

 Score =  374 bits (961), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 310/1100 (28%), Positives = 498/1100 (45%), Gaps = 210/1100 (19%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +A YK  H PTG+  C +     +R                             NLVV  
Sbjct: 2   YAIYKETHPPTGVEFCVNCHFYSARES---------------------------NLVVAG 34

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
              + ++ +  Q+EGS  +++ G + +R          LELV  + L GN+ESL  +   
Sbjct: 35  TTEVRVFRLCYQQEGSSSAESGGSSLKR---------KLELVGQHSLFGNIESLHAIRLA 85

Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
           G      RDS++++F+DAK+S++++D   H ++  S+H FE  +   +K    +  R P+
Sbjct: 86  G----NTRDSLLMSFKDAKLSIVDYDPGKHDIKTRSLHFFEDEK---IKSHCLAQDRAPV 138

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           V++DP+ RC  +L YG  +++L   Q G      +D+  S       +  S++I+++++D
Sbjct: 139 VRIDPERRCAVMLAYGTHLVVLPFRQEGGIDDTAQDSIISSSD-RPPVLPSYIIDVKEID 197

Query: 243 MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            K  ++ D  F+HGY EP ++IL+E   TWAGR++ ++ TC + A+S++ + K HP++W 
Sbjct: 198 EKTCNILDIQFLHGYYEPTLLILYEPLKTWAGRLAMRNDTCALVAVSLNMSQKAHPVVWQ 257

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE------ 354
              LP D   ++ VP PIGGVLV   N + Y +QS         Y VS++S  E      
Sbjct: 258 LSCLPFDCIYVMPVPKPIGGVLVCCMNALLYLNQSVP------PYGVSVNSIGENSTVFP 311

Query: 355 -LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
             P+   ++ L+ ++A ++ ND  + S K G++ ++T++ DG R V+     KT  SVLT
Sbjct: 312 LKPQKGVTITLEGSNAIFIANDKLVFSLKGGEIYVVTLIADGVRSVRNFVFDKTAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           S +   G+   FLGSRLG+SLLV++T       +  G +     ++ D  + +R +  + 
Sbjct: 372 SCVCECGDGYLFLGSRLGNSLLVKYT--EKPQDIVYGTENNAQSMQCD--NIERWQILNG 427

Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----LRIN--- 525
             L  + + +EL +YG A         +++F V DSL+NIGP      G    L ++   
Sbjct: 428 SLLLIVDDLDELEVYG-AQQEAGVELTSYTFEVCDSLLNIGPCSCMDIGEPAFLSVSSYF 486

Query: 526 ADA--------SATGISKQSNYELV------------ELPGCKGIWTVYHKSSRG----- 560
           ADA        S +G  K     ++            ELPGC  +WTV+ K  +      
Sbjct: 487 ADAQELDLEVVSCSGYGKNGALTVLQRSIRPQVVTTFELPGCTDMWTVFSKDQKKGAQTN 546

Query: 561 --HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
             H   S      +++YH++LI+S E  +M+L+T   + EV +S  +  Q  TI AGN  
Sbjct: 547 AIHRYPSQPCTQGNEKYHSFLILSREDSSMILKTEQEIMEVDQS-GFSTQCATIYAGNFG 605

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
               ++QV   G R+L+G    Q +       +SG     S ++  S+ DPY +L M+DG
Sbjct: 606 NGSYILQVTPLGVRLLEGVNQLQHIPM-----DSGL----SNIVWCSVCDPYAVLLMADG 656

Query: 679 SIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------------KGPEP 720
           S+ L+  +   S   ++V  P+  +SSK  V +C  Y D                K P P
Sbjct: 657 SVILIEFIKSASGPKLTVSRPSLSQSSK--VCACCTYKDMSGLFTTENSNLEEVSKVPSP 714

Query: 721 WLRKTS----------------------TDAWLSTGVGEAIDGADGGPLD------QGDI 752
               T+                      T   L+    E        P++      Q   
Sbjct: 715 KPEMTAPPRQEKESLTIDEEDELLYGGDTSLTLTFEPPEPSKAESAAPVEVFEEPLQPSY 774

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + +VC E+G +EI+ +P F  VF V  F      IVD+       DS     SS  E   
Sbjct: 775 WCLVCRENGVMEIYSLPGFTRVFFVKNFSKAPRVIVDS------GDSGASTQSSVSEE-- 826

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                   S+ V E+ +      + R  L A++ D  +L Y+A+ +   E          
Sbjct: 827 -------ESLNVREVLLTGLGYKNRRATLVAVM-DQDLLIYEAFSYPTVEGH-------- 870

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-------------CQRITIFKN 919
                           NLRF +   +   RE+ P   P                + +F +
Sbjct: 871 ---------------LNLRFKKLQHNIQIREKKPKQEPKNDSETKSGLDPKVAMLRVFND 915

Query: 920 ISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
           IS + G F+ GS P W  V  R     HP   DG +  F   HNVNC  GF+Y  ++G L
Sbjct: 916 ISSYSGIFVCGSYPFWIFVTNRGAFHWHPMSIDGPVTCFAAFHNVNCPKGFLYFNTRGEL 975

Query: 979 KICQLPSGSTYDNYWPVQKV 998
           +I  LP+  +YD+ WPV+KV
Sbjct: 976 RISVLPTHLSYDSPWPVRKV 995


>gi|414587798|tpg|DAA38369.1| TPA: hypothetical protein ZEAMMB73_163106, partial [Zea mays]
          Length = 483

 Score =  363 bits (933), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 178/317 (56%), Positives = 212/317 (66%), Gaps = 10/317 (3%)

Query: 685 GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
            DPSTCT+S+  PA   SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID  D 
Sbjct: 33  ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92

Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
              D  DIY ++CYESG LEIF+VP+F  VF+VD FVSG   + D + R + KDS     
Sbjct: 93  SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152

Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
            +S+      +KE   ++K+VELAM RWS   SRPFLF +L DGT+LCY AY FEG E+ 
Sbjct: 153 DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209

Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
            +         S  + N + SRLRNLRF R  +D  +R++      C    RITIF N+ 
Sbjct: 210 VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDI----SCLVRPRITIFNNVG 265

Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
           G++G FL G RP W  V R+R RVHPQLCDG IVAFTVLHNVNC  G IYVTSQG LKIC
Sbjct: 266 GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325

Query: 982 QLPSGSTYDNYWPVQKV 998
           QLPS   YDNYWPVQKV
Sbjct: 326 QLPSAYNYDNYWPVQKV 342


>gi|414587799|tpg|DAA38370.1| TPA: hypothetical protein ZEAMMB73_163106 [Zea mays]
          Length = 461

 Score =  363 bits (931), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 178/317 (56%), Positives = 212/317 (66%), Gaps = 10/317 (3%)

Query: 685 GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
            DPSTCT+S+  PA   SS + +S+CTLY D+GPEPWLRKT TDAWLST VGEAID  D 
Sbjct: 33  ADPSTCTISINAPAIFASSSERISACTLYCDRGPEPWLRKTHTDAWLSTDVGEAIDDNDN 92

Query: 745 GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
              D  DIY ++CYESG LEIF+VP+F  VF+VD FVSG   + D + R + KDS     
Sbjct: 93  SSHDLSDIYCIICYESGKLEIFEVPSFKRVFSVDNFVSGPAILFDVFSRNSTKDSGIGDR 152

Query: 805 SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
            +S+      +KE   ++K+VELAM RWS   SRPFLF +L DGT+LCY AY FEG E+ 
Sbjct: 153 DASKVSV---KKEEAANIKIVELAMHRWSGQFSRPFLFGLLNDGTLLCYHAYYFEGSESN 209

Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPC---QRITIFKNIS 921
            +         S  + N + SRLRNLRF R  +D  +R++      C    RITIF N+ 
Sbjct: 210 VQCAPFSPHGGSPDIGNATDSRLRNLRFCRVSIDISSRDDI----SCLVRPRITIFNNVG 265

Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
           G++G FL G RP W  V R+R RVHPQLCDG IVAFTVLHNVNC  G IYVTSQG LKIC
Sbjct: 266 GYEGLFLGGPRPTWVFVCRQRFRVHPQLCDGPIVAFTVLHNVNCCRGLIYVTSQGFLKIC 325

Query: 982 QLPSGSTYDNYWPVQKV 998
           QLPS   YDNYWPVQKV
Sbjct: 326 QLPSAYNYDNYWPVQKV 342


>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Bombus terrestris]
          Length = 1417

 Score =  362 bits (929), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 292/1030 (28%), Positives = 472/1030 (45%), Gaps = 159/1030 (15%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           L V  AN+I I+ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30  LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82  AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
              P+V+VDP+GRC  +L+YG ++++L   +  S  + D D   +    S +  I SS++
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192

Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           I L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
            HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306

Query: 354 EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
           E        P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K
Sbjct: 307 ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPST 464
              SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   +    + E +    
Sbjct: 367 AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426

Query: 465 KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
           K++++       +SD L D+ + EEL +YGS      S Q T + F V DSL+NIGP  +
Sbjct: 427 KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485

Query: 518 FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
            S G         +  S+  + ELV                          ELPGC+ +W
Sbjct: 486 ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV      G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG T
Sbjct: 545 TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           I AGNL   R ++QV + G R+L G    Q +                 ++  S ADPYV
Sbjct: 599 IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648

Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
            L   DG + LL       T  +   AA    +  + +   Y                D+
Sbjct: 649 TLLSEDGQVMLLTLREGRGTAKLHAQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708

Query: 717 GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
            PE                           +   T +    S G+ +          +  
Sbjct: 709 APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGISKRTPWWQKHLQEIK 768

Query: 751 DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
             Y ++ Y +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E 
Sbjct: 769 PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827

Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                       M+V E+ M     H +RP L   L D  +  YQAY +  P+   K   
Sbjct: 828 ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLK--- 869

Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
                + L    +       LR    P+   TR        C  +  F NI+G+ G F+ 
Sbjct: 870 --LRFKKLDHGIIPGQLKPKLRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919

Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
              P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 920 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979

Query: 989 YDNYWPVQKV 998
           YD  WPV+KV
Sbjct: 980 YDAPWPVRKV 989


>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1
           [Camponotus floridanus]
          Length = 1418

 Score =  361 bits (927), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 299/1029 (29%), Positives = 478/1029 (46%), Gaps = 157/1029 (15%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV  ANVI ++ +    + ++  K +     ++         LE +  Y LHGN+ S+ 
Sbjct: 30  LVVAGANVIRVFRLIPDIDMTRREKYTENRPPKM--------KLECLAQYTLHGNIMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +   G+    +RDS++L+F DAK+SV+E+D  IH LR  S+H FE  E   +K G  + 
Sbjct: 82  AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
              P+V+VDP+GRC  +L+YG ++++L   +  S  + D D   S    S+    I SS+
Sbjct: 135 HHIPIVRVDPEGRCAIMLIYGRKLVVLPFRKDPS--LDDGDLLDSAKLTSSNKTPILSSY 192

Query: 235 VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
           +I L+ L+  M +V D  F++GY EP ++IL+E   T++GR++ +  TC + A+S++   
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLYGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
           + HP+IWS  NLP D Y+++ V  P+GG L++  N++ Y +QS     ++LN+ A +  +
Sbjct: 253 RVHPIIWSVSNLPFDCYQVVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTN 312

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
               P+    + L+ +   ++  D  ++S K+G+L +L++  D  R V+     K   SV
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISGDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRR 469
           LTS +    ++  FLGSRLG+SLL++FT     ++ +    E    + E++    K+ ++
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENESEETPAKKAKQ 432

Query: 470 S------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
                  +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S G  
Sbjct: 433 DFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEP 490

Query: 522 ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
                    N D      + +G  K     ++            ELPGC+ +WTV     
Sbjct: 491 AFLSEEFLHNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI---- 546

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
            G   +  ++ A  +  HA+LI+S E  TM+L+T   + EV +S  +  QG T+ AGNL 
Sbjct: 547 -GTLNNDEQVKAEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFAGNLG 604

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
             R ++QV + G R+L G    Q +                 ++  S ADPYV L   DG
Sbjct: 605 ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLSEDG 654

Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP--EPWLRKTSTDAWLST--G 734
            + LL       T  +    A    +  + +   Y D        L +T+ D  +     
Sbjct: 655 QVVLLTLREVRGTARLHAQPANLLFRPQIEALCTYRDVSGIFTTQLSETTDDEQVEEEHN 714

Query: 735 VGEA-----IDGADG-----------------GPLD----------------QGDIYSVV 756
           V E      ID  D                   PLD                +   + +V
Sbjct: 715 VEEPSLLSNIDNEDDLLYGDAPAFQMPAPSYQKPLDGVSKKAPWWQRHLQEIKPTYWLLV 774

Query: 757 CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK 816
             +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E        
Sbjct: 775 YRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQSAPVNEIPNPE-------- 826

Query: 817 ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
                M+V E+ M     H +RP L   L D  +  YQAY +  P+   K          
Sbjct: 827 -----MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGYLK---------- 868

Query: 877 LSVSNVSASRLRNLRFSRTP--LDAYTREE-TPHGAPCQRITI---FKNISGHQGFFLSG 930
                    R + L     P  L    +EE  P  A   RI +   F NI+G+ G F+  
Sbjct: 869 --------LRFKKLEHGIIPGRLSPKPKEEDMPMNASETRICMMRYFSNIAGYNGVFICC 920

Query: 931 SRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
             P W  +  R  LR HP   DG I +F   +NVNC  GF+Y   +  L+IC LP+  +Y
Sbjct: 921 DYPHWIFLTGRGELRTHPMGIDGPITSFAAFNNVNCPQGFLYFNRKEELRICVLPTHLSY 980

Query: 990 DNYWPVQKV 998
           D  WPV+KV
Sbjct: 981 DAPWPVRKV 989


>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
           putative [Pediculus humanus corporis]
 gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
           putative [Pediculus humanus corporis]
          Length = 1409

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 301/1025 (29%), Positives = 471/1025 (45%), Gaps = 155/1025 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           +LVV   N++ ++ +    +    +K    T+RR          LE +  + L  NV S+
Sbjct: 29  SLVVAGKNILRVFQLIPDID---PTKRDAYTERRP-----PKMKLECLSSFSLFANVMSM 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +S  G+     RD+++L+F +AK+ V+E+D   H LR  S+H FE  +   +K G  +
Sbjct: 81  QAVSLAGSS----RDALLLSFREAKLCVVEYDPDSHDLRTLSLHYFEEED---MKGGWTN 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSG-GGFSARIES 232
               P V+VDP+GRC  +LVYG +++IL   + S+     +   D   S      A + S
Sbjct: 134 HYDIPYVRVDPEGRCAAMLVYGRKLVILPFRRESKLDDPDIALLDPHSSSVATAKAPVLS 193

Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S+ I LR++D  +++V D  F++GY EP ++IL+E   T+AGR++ +  TC + A+S++ 
Sbjct: 194 SYTITLREIDEKLENVIDIQFLYGYYEPTLLILYEPLKTFAGRIAVRSDTCAMIAVSLNI 253

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSL 349
             + HP IWS  NLP +  + + VP P+GG L+   N + Y +QS     +++N+ A + 
Sbjct: 254 QQRVHPAIWSVGNLPFNCTQAIPVPKPLGGTLIFSVNALIYLNQSIPPFGVSVNSIAENS 313

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNP 408
            + Q   +    + L+ + AT++ +D  +LS KTG+L +L+++ D  R V+     K   
Sbjct: 314 TNFQLKIQEGVKITLEGSQATFISHDRLVLSLKTGELYVLSLLADNIRSVRGFHFDKAAA 373

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
           SVLT+ +    +   FLGSRLG+SLL++FT    +      L E     E   PS +R +
Sbjct: 374 SVLTTCLCVCEDKYLFLGSRLGNSLLLRFTEKESSEAPIITLDESIR--EVPVPSKRRRQ 431

Query: 469 RSSSDAL----QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI 524
            +  D +     D+ + +EL +YG+   ++     +F F V DSL+NIGP  + S G   
Sbjct: 432 DALGDWMASDVADIRDLDELEVYGTQEASSSVQITSFMFEVCDSLLNIGPCGNVSMGEPA 491

Query: 525 NADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSS 558
                 +  ++  + ELV                          ELPGC  +WTV     
Sbjct: 492 FLSEEFSN-NRDPDLELVTTSGHGKNGAICVLQRTIRPQVVTTFELPGCLDMWTVI---- 546

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
            G  +DS    A DD  HA+LI+S +  TM+L+T   + EV  S  +  QG TI AGNL 
Sbjct: 547 -GPQSDSGPTQAEDDISHAFLILSQKDSTMILQTGQEINEVDHS-GFNTQGPTIFAGNLA 604

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
             + ++QV + G R+L G    Q +               S+V+  S ADPYV L   DG
Sbjct: 605 SNKYIVQVSKAGVRLLRGLEQIQHIPL----------DLGSSVVHASTADPYVALLTEDG 654

Query: 679 SIRLLVGDPS--TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL----- 731
            + LL    S     +SV  P  I ++ +    CT     G    L   +T+  L     
Sbjct: 655 QVVLLTLRESRGQGRLSVFKP-TIPTNPRVSKICTYRDVSG----LFTLTTEEELQNATF 709

Query: 732 ---STGVGEAIDGAD----GG--------------------PLDQGDIYS---------V 755
              S  + +  D  D    GG                    P  +   YS          
Sbjct: 710 KSDSKNMKKEADDEDEMLYGGSEVKFQLLPITNTNEPSPPRPFVRWKKYSQEIKPNYWMF 769

Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
           V  E+G L+I+ +P+F   F + +   G   + D             ++ +   G     
Sbjct: 770 VLRETGTLDIYSLPDFRPSFQIRRIGQGHRVLYDV------------LDMAQTSGMDGSD 817

Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF-EGPENTSKSDDPVSTS 874
              IH + VV L       H  R  +  + T+  ++ YQA+ F +GP    +        
Sbjct: 818 DPEIHELLVVSL------GHLGRRPILLLRTENDLMIYQAFKFAKGPNLKIR-------F 864

Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
           R L  + +   R    +        Y  E     A   R+  F NISG+ G F+ G  P 
Sbjct: 865 RRLPQTLILKERKAKFKVK------YENEVESERA--TRLRYFSNISGYNGVFVCGPNPH 916

Query: 935 WC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
           W  +  R  LR HP L DG + +F   HNVNC  GF+Y TS+  L+IC LP+  +YD  W
Sbjct: 917 WLFLTARGELRSHPMLIDGRVTSFASFHNVNCPLGFLYFTSKCELRICILPTHLSYDAPW 976

Query: 994 PVQKV 998
           PV+KV
Sbjct: 977 PVRKV 981


>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Bombus impatiens]
          Length = 1417

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 292/1030 (28%), Positives = 471/1030 (45%), Gaps = 159/1030 (15%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           L V  AN+I I+ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30  LAVAGANIIRIFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82  AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR--IESSHV 235
              P+V+VDP+GRC  +L+YG ++++L   +  S  + D D   +    S +  I SS++
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPS--LDDGDLLDNSKALSNKTPILSSYM 192

Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           I L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
            HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLA 306

Query: 354 EL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
           E        P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K
Sbjct: 307 ETSTNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDK 366

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPST 464
              SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   +    + E +    
Sbjct: 367 AAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPA 426

Query: 465 KRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPLKD 517
           K++++       +SD L D+ + EEL +YGS      S Q T + F V DSL+NIGP  +
Sbjct: 427 KKIKQDFIGDWMASDVL-DIKDPEELEVYGSERETHTSIQITSYIFEVCDSLLNIGPCGN 485

Query: 518 FSYGLRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
            S G         +  S+  + ELV                          ELPGC+ +W
Sbjct: 486 ISMGEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMW 544

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV      G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG T
Sbjct: 545 TVI-----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGST 598

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           I AGNL   R ++QV + G R+L G    Q +                 ++  S ADPYV
Sbjct: 599 IFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYV 648

Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY---------------HDK 716
            L   DG + LL       T  +   AA    +  + +   Y                D+
Sbjct: 649 TLLSEDGQVMLLTLREGRGTAKLHVQAANLLFRPQIEALCAYRDVSGIFTTQLPENVEDE 708

Query: 717 GPE--------------------------PWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
            PE                           +   T +    S GV +          +  
Sbjct: 709 APEEEHNIEEPPIVGNIDNEDDLLYGDAPAFQMPTPSHTKTSEGVSKRTPWWQKHLQEIK 768

Query: 751 DIYSVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
             Y ++ Y +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E 
Sbjct: 769 PTYWLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE- 827

Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                       M+V E+ M     H +RP L   L D  +  YQAY +  P+   K   
Sbjct: 828 ------------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLK--- 869

Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLS 929
                + L    +        R    P+   TR        C  +  F NI+G+ G F+ 
Sbjct: 870 --LRFKKLDHGIIPGQLRPKPRDEDIPMMNETRH-------CM-MRYFSNIAGYNGVFIC 919

Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
              P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 920 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 979

Query: 989 YDNYWPVQKV 998
           YD  WPV+KV
Sbjct: 980 YDAPWPVRKV 989


>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity
           factor cpsf [Tribolium castaneum]
          Length = 1413

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 292/1038 (28%), Positives = 470/1038 (45%), Gaps = 179/1038 (17%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LV + ANVI+++ +    +         ET           + LE V  Y L GN+ S+ 
Sbjct: 30  LVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSMQ 82

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++   +     RD+++LAF+DAK+SV+E+D   H L+  S+H FE  +   +K G    
Sbjct: 83  SVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTHH 135

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESSH 234
              P+V+ DP+ RC  + V+G ++++L   +  +    D D     G   G  A I +S+
Sbjct: 136 YHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILASY 195

Query: 235 VINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
           +I L+D    + ++ D  F+HGY EP ++IL E   T+AGRV+ +  TC ++A+S++   
Sbjct: 196 MIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQQ 255

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS 352
           K HP+IWS  NLP D  K + +  P+GG L+   N + Y +QS      +  Y VSL+S 
Sbjct: 256 KVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNSI 309

Query: 353 QE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
            E        P+    + LD A AT+L++D  +LS K G+L +LT++ D  R V+     
Sbjct: 310 AENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHFE 369

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADAP 462
           K   SVLT+ I+   N+  FLGSRLG+SLL++FT  C    ++            E   P
Sbjct: 370 KAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIEP 418

Query: 463 STKRLRRSSS------DALQDMVNG------------EELSLYGSASNNTESAQKTFSFA 504
           S KRL+ S+S      D + D +N             EEL +YG+    +     ++ F 
Sbjct: 419 SAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVFE 477

Query: 505 VRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYELV 542
           V DSL+NIGP             ++FS  L ++ +   T           + K    ++V
Sbjct: 478 VCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQIV 537

Query: 543 ---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
               LPGC  +WTV+    +                HA+LI+S E  TM+L+T D + E+
Sbjct: 538 TTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINEI 581

Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENS 659
            ++  +     T+ AGNL   + ++QV     R+L G    Q +       E G     S
Sbjct: 582 -DNTGFATHIPTVYAGNLGNLKYIVQVTSSAVRLLQGINQLQHIPL-----ELG-----S 630

Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG-- 717
            ++ V+  DPY+ L  +DG +  L+   +     +    +  S+  PV++  +Y D    
Sbjct: 631 PIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSGL 690

Query: 718 ------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS--- 754
                       PE ++ ++ T   +     + + G D          P  +  +Y    
Sbjct: 691 FTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNWW 749

Query: 755 -------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
                         V  E+  LEI+ +P+F   + +     G   +VD    E++  S +
Sbjct: 750 KKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISAS 807

Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
              S++ E   Q R+ ++  + VV L       H SRP L   L +  +  Y+ + F  P
Sbjct: 808 TPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF--P 858

Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
               K          +   NVS       R      D +  +E        ++  F NI+
Sbjct: 859 RGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNIA 907

Query: 922 GHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
           G+ G F+ G+ P W  M  R  LR HP   DG +++F   +NVNC  GF+Y   +  L+I
Sbjct: 908 GYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELRI 967

Query: 981 CQLPSGSTYDNYWPVQKV 998
             LP+  +YD  WPV+KV
Sbjct: 968 GVLPTHLSYDAAWPVRKV 985


>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Megachile rotundata]
          Length = 1415

 Score =  353 bits (906), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 295/1028 (28%), Positives = 471/1028 (45%), Gaps = 157/1028 (15%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV   N+I ++ +    + +K  K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30  LVVAGGNIIRVFRLIPDVDITKREKYTESRPPKM--------KLECLAQYTLHGNVMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++  G+    +RDS++L+F DAK+SV+E+D  IH LR  S+H FE  E   ++ G  + 
Sbjct: 82  AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
              P+V+VDP+GRC  +L+YG ++++L   +  S   GD             I SS++I 
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194

Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + H
Sbjct: 195 LKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           P+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  E 
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308

Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
                  P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K  
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKR 466
            SVLTS +    ++  FLGSRLG+SLL++F    S  S   +  +    + E +    K+
Sbjct: 369 ASVLTSCVCMCDDNYLFLGSRLGNSLLLRFIEKESENSQNMNENEITIEENETEETPAKK 428

Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +++       +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S 
Sbjct: 429 VKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486

Query: 521 GLRINADASATGISKQSNYELV--------------------------ELPGCKGIWTVY 554
           G         +  S+  + ELV                          ELPGC+ +WTV 
Sbjct: 487 GEPAFLSEEFSH-SQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI 545

Query: 555 HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAA 614
                G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG T+ A
Sbjct: 546 -----GALNNDEQVRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTVFA 599

Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
           GNL   R ++QV + G R+L G    Q +                 ++  S ADPYV L 
Sbjct: 600 GNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVSLL 649

Query: 675 MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-------KGPE-------- 719
             DG + LL       T  +    A    +  + +   Y D       + PE        
Sbjct: 650 SEDGQVMLLTLREGRGTAKLHAQTANLLFRPQIEALCAYRDVSGIFTTQLPENVEDEVPE 709

Query: 720 --------PWLRKTSTDAWLSTGVGEAID--------GADG----GPLDQGDI------Y 753
                   P +     +  L  G G A           ++G     P  Q  +      Y
Sbjct: 710 EEHNTEEPPIVGNIDNEDDLLYGDGPAFQMPAPSQTKSSEGTSKRAPWWQKHLQEIKPTY 769

Query: 754 SVVCY-ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            ++ Y +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E    
Sbjct: 770 WLLVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTAPVNEIPNPE---- 825

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                    M+V E+ M     H +RP L   L D  +  YQ Y +  P+   K      
Sbjct: 826 ---------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQTYRY--PKGHLK------ 867

Query: 873 TSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
               L    +    +  NLR      D     ET H   C  +  F NI+G+ G F+   
Sbjct: 868 ----LRFKKLDHGIIPGNLRPKPKEEDMSAMNETRH---CM-MRYFSNIAGYNGVFICSD 919

Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
            P W  +  R  LR HP   DG I +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 920 YPHWIFLTGRGELRTHPMGIDGPITSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979

Query: 991 NYWPVQKV 998
             WPV+KV
Sbjct: 980 APWPVRKV 987


>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
          Length = 1410

 Score =  350 bits (899), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 307/1042 (29%), Positives = 465/1042 (44%), Gaps = 188/1042 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLV    N I +Y +  + +  ++  ++ ETK        +   LE V  Y L GNV S+
Sbjct: 29  NLVTAGVNQIRVYRLVAESKPVEKESHTTETKS-------AKQKLECVADYELCGNVSSI 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +S  GA     RD+++L FE+AK+S+ ++D     L+  S+H FE  +   L+ G   
Sbjct: 82  ESISLVGA----ARDALLLCFEEAKLSLCDYDPDTDDLKTISLHYFEDAD---LENG--C 132

Query: 177 FARG---PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
             RG     V+VDP+GRC  +L+YG  +I+L   +       D  +  S     + I S+
Sbjct: 133 CQRGLHHSEVRVDPEGRCAVMLIYGTHLIVLPFRKESPSDEIDATSCAS----KSPIMST 188

Query: 234 HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
           ++I+LR LD +  +V D  F+HGY EP ++IL+E   TW  RV+ +  TC I A+S++  
Sbjct: 189 YIIDLRTLDERVTNVVDIQFLHGYYEPTVLILYEPLPTWTCRVAVRKDTCSIVAISLNLQ 248

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
            K HP+IWS  NLP+D  +   VP PIGGV+V   N++ Y +QS         Y VSL+S
Sbjct: 249 DKTHPIIWSHSNLPYDCLRTFPVPKPIGGVIVFAVNSLLYLNQS------FPPYGVSLNS 302

Query: 352 SQEL-------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDL 403
                      P+    + LD A A ++ ND  ++S K G+L +LT+V D  R V+   L
Sbjct: 303 LTSFNTEFLLKPQEGVRMSLDCAQAEFIDNDKLVISLKGGELYVLTLVIDSMRAVRSFHL 362

Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
            K   SVLT+ +   G++  FLGSRLG+SLL+++         SS      G+ +     
Sbjct: 363 DKAAASVLTTCMCMCGDNYLFLGSRLGNSLLLRYQ--EKKPEASSSSDASPGEEQRKEKM 420

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-FSFAVRDSLVNIGPL------- 515
           T  +    S  +  + + +EL +YG  S   ES   T F F V DS++NIGP        
Sbjct: 421 TLAIGLVGSSDVSKLDDLDELEVYGRDSQAVESEDITQFMFEVCDSIINIGPCGQVEMGE 480

Query: 516 -----KDFSYGLRINADASATG----------ISKQSNYELV---ELPGCKGIWTVYHKS 557
                ++FS+    + +   T           + +Q   ++V   ELPGC  +WTV    
Sbjct: 481 PAFLSEEFSHQEDPDLELVTTSGYGKNGAISILQRQIRPQVVTTFELPGCTDVWTVLGSP 540

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
                +D     +     HA+L++S    +MVLET   + E+  S  +     T+ A N+
Sbjct: 541 DEQQGSDEKLAGS-----HAFLLLSRADSSMVLETGQEIMELDHS-GFCTDAPTVHAANI 594

Query: 618 FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
              R ++QV      +L G    Q L+   S          S V+S S+ADP+VLL   D
Sbjct: 595 GNGRYIVQVGPNAIWLLKGVERIQHLALDVS----------SPVVSCSLADPHVLLLCED 644

Query: 678 GSIRLLV-----GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG---------PEP--- 720
           G +  LV      DP   T+S+ T    + SK  V +  LY D            EP   
Sbjct: 645 GQLLHLVLSVQGDDP---TLSLLTTKLHQKSK--VIAINLYRDTSGLFVVASSESEPSAT 699

Query: 721 --------------------------------------WLRKTSTDAWLSTGVGEAIDGA 742
                                                 W ++ S          E  +GA
Sbjct: 700 TTTEATETTTPQQQTEEGVDDEDDLLYGDSDISAITSTWQKQESEKEEKKEEEEEEAEGA 759

Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
           D  P      ++V+   +G LE++ +P++   F V  F +G   ++D+     L  S   
Sbjct: 760 DIQP----TYWAVIIRATGNLELYSLPDWQLCFLVKNFATGNKLLIDSMQAADLSASFVA 815

Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
              S++E              V E+ +  +  + S+P L A + D  +  Y+ +   G +
Sbjct: 816 PERSTQEVPF-----------VHEVMLHGFGVNGSQPLLMARVHD-ELYIYKVFSHVGSK 863

Query: 863 NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTR-----EETPHGAPCQRITIF 917
                               +  RL+ +RF R       R     E+ P      R   F
Sbjct: 864 --------------------AKGRLQ-VRFKRRSHGLIIRPRDREEKIPENKKWLR--PF 900

Query: 918 KNISGHQGFFLSGSRPCW-CMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
            +ISG+ G F+ GS P W  M  R  LR HP   DG+I  FT  HNVNC  GF+Y +S  
Sbjct: 901 TDISGYSGVFICGSYPHWLIMTQRGTLRGHPMAIDGTIPCFTAFHNVNCPKGFLYFSSNE 960

Query: 977 ILKICQLPSGSTYDNYWPVQKV 998
            L+IC LP+  +YD  WPV+KV
Sbjct: 961 ELRICVLPTHLSYDAPWPVRKV 982


>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
          Length = 1477

 Score =  348 bits (893), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 256/898 (28%), Positives = 429/898 (47%), Gaps = 169/898 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGI----------------SAAS 100
           NL+V   NV+E+Y +   E+      +   T+     DG+                +  S
Sbjct: 30  NLIVAKTNVLEVYALHRHEDSKARPIDRQSTRP---TDGVISLRGEEPKDAPPYAGTQHS 86

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           + LV    L GN+ES+A +   G      +D+++L+F DAKISVLEFD + + LR  S+H
Sbjct: 87  MRLVLSSSLFGNIESMAAVRFPGTS----KDALLLSFRDAKISVLEFDIATNDLRTISLH 142

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE      +K G + +   P ++VDPQ RC  +L +  ++++L   Q  S +       
Sbjct: 143 YFED---YKVKEGHDHYIHVPELRVDPQQRCAAMLAFDRKLVVLPFRQHASLM-----EI 194

Query: 221 GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
            +GG     ++ S +++LR + + +VKDF+F+ GY EP ++IL+E   TW+GRV+   +T
Sbjct: 195 ENGGQEDQPVKPSFLLDLRAMGIINVKDFVFLQGYYEPTLLILYEPTQTWSGRVAVNRNT 254

Query: 281 CMISALSIST-----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI------ 329
           C+ +A+S++          HP++WSA  LP+D  +L+AVP PIGG L +  N++      
Sbjct: 255 CVAAAVSLNLWQHRGQTSAHPVVWSAEFLPYDTQRLIAVPGPIGGALALSTNSLLYLNQV 314

Query: 330 -------------------HYHSQSASCALALNNYA-VSLDSSQELP---RSSFSVELDA 366
                              H+ +Q+++  L LN +A + L      P   ++   + LDA
Sbjct: 315 SFPYRLILPAHGADVSITSHHDTQASASCLPLNVFADLYLSPQTPFPSAGKNRVGIALDA 374

Query: 367 AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNS----- 421
           A   +L +D  L+S K G+L +  ++ DGR V  + L+K   SV+TS + T+        
Sbjct: 375 ARDVFLADDQLLVSLKGGELYIFHLLSDGRTVNDIQLTKAGSSVITSCMATLSGEGADER 434

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE--FGDIEADAPSTKRLRRSSSDA----- 474
             FLGSR+GDSLL+Q+T    ++   +G  +   F DI+ +  +         +A     
Sbjct: 435 FLFLGSRVGDSLLLQYTTADASAPKQNGATKGSLFDDIKKEEDNDDDDEDEEEEASGEGE 494

Query: 475 LQDMVNGE-ELSLYGSASNNTESAQK-----TFSFAVRDSLVNIGPLKDFSYGLRIN-AD 527
           +++  +GE E+  +G      +  +K     T+ F V DSLVN+GP+ DF+ G   + A 
Sbjct: 495 VKEEPDGEGEVDEFGRRIREEDRRKKKGLLTTYKFKVCDSLVNVGPITDFAIGESFDPAS 554

Query: 528 ASATGISKQSNYELV---------------------------ELPGCKGIWTVYHKSSRG 560
            S      Q + E+V                           +L GCK  WT+YH+S   
Sbjct: 555 VSMAEQEGQRSVEIVTCSGQGKNGSLCVLQHGVRPELVHASADLAGCKAFWTLYHRSEER 614

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT-ESVDYFVQGRTIAAGNLFG 619
              ++        EYHAYL++S E +T V+   D L E++ E  D+ V   T+ AGNLF 
Sbjct: 615 QGEEA--------EYHAYLLLSEEEQTRVI-AGDGLDELSNEETDFNVAAPTVDAGNLFE 665

Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
           + R++QV + G  +LDG   TQ +            S    + + SIADPYVL+ M+DG+
Sbjct: 666 QTRIVQVHQHGLILLDGVKATQRI------------STPGQIAAASIADPYVLVLMADGA 713

Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
           +RL   DP++  + VQT        + + +  L++                     G A+
Sbjct: 714 LRLYFADPTSSKL-VQTSLQNIHEVRDIMAMHLFY---------------------GGAM 751

Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
            G      D+  I++ +  ++G L+I+ VP F+ VF+ ++  +G   I +  MR   + +
Sbjct: 752 RGKKARTNDE--IFAAIAKDNGRLDIYSVPEFDLVFSAERAANGPRLINNVLMRPPPQSA 809

Query: 800 ETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYL 857
             + ++ +             S ++ E+A+       S P LF  L +G +L Y+ +L
Sbjct: 810 AAQQSADTT------------SARIAEIALHSIGNIPSLPHLFLYLDNGELLLYRGFL 855



 Score = 77.4 bits (189), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 41/75 (54%)

Query: 912  QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
            +RI  F  +    G F+SGS P W    R   R++P   D  + AF   HN NC HGFIY
Sbjct: 967  RRIHYFGTVGKSNGVFISGSAPAWVFAQRGYARLYPMKLDTFVRAFAEFHNANCPHGFIY 1026

Query: 972  VTSQGILKICQLPSG 986
               +G LKICQLP+ 
Sbjct: 1027 FNHEGTLKICQLPAA 1041


>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Nasonia vitripennis]
          Length = 1415

 Score =  347 bits (891), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 293/1025 (28%), Positives = 461/1025 (44%), Gaps = 151/1025 (14%)

Query: 58  LVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           LVV  AN+I ++ ++   + G KE        +           LE +  Y LHGNV S+
Sbjct: 30  LVVAGANIIRVFRLIPDVDPGKKEKFTESRPPK---------MRLECLAQYTLHGNVMSM 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G+     RDS++L+F +AK+SV+E+D  IH LR  S+H FE  E   +K G  +
Sbjct: 81  QAVQLIGSP----RDSLLLSFREAKLSVVEYDPEIHSLRTVSLHYFEEEE---IKDGWTN 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P+V+VDP+GRC  +L+YG ++++L   +      GD             I SS++I
Sbjct: 134 HHHVPIVRVDPEGRCAVMLIYGRKLVVLPFRKDPILDEGDLIENPKSSSHKTPILSSYMI 193

Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            L+ L+  M ++ D  F+HGY EP ++IL+E   T+AGR++ +  TC + A+S++   K 
Sbjct: 194 VLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFAGRIAVRQDTCAMVAISLNIQQKV 253

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQ 353
           HP+IWS  NLP D Y+ +AV  P+GG L++  N++ Y +QS     ++LN+   +  +  
Sbjct: 254 HPIIWSVSNLPFDCYQAVAVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLTDNCTNFP 313

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
             P+    + L+++   ++  D  ++S KTG+L +L++  D  R V+     K   SVLT
Sbjct: 314 LKPQEGVKISLESSQVAFISPDRLVISLKTGELYVLSLFADSMRSVRGFHFDKAAASVLT 373

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEEFGDIEADAPSTKRLRRS- 470
           S +    ++  FLGSRLG+SLL++FT      +   S L+       +    TK+++   
Sbjct: 374 SCVCLCDDNYLFLGSRLGNSLLLRFTEKESEKINDISMLEMSLNSSNSQEQPTKKIKLDY 433

Query: 471 -----SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
                +SD L D+ + EEL +YGS +  T     ++ F V DSL+NIGP  + S G    
Sbjct: 434 LEDWMASDVL-DIKDPEELEVYGSET-QTSIQITSYIFEVCDSLLNIGPCGNISMGEPAF 491

Query: 526 ADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSR 559
                +  S + + ELV                          +LPG + IWTV   +  
Sbjct: 492 LSEEFSNNS-EPDVELVTTSGYGKNGALCVLQRSIRPQVITTFDLPGYENIWTVIDSTVS 550

Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFG 619
            + A +          H +LI++ +  TMVL+T   + EV +   +  QG TI AGNL  
Sbjct: 551 DNRAKTETEGT-----HGFLILTQDDSTMVLQTGQEINEVVDQSGFSTQGTTIFAGNLGS 605

Query: 620 RRRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
            R +IQV + G R+L G     +M  DL                 ++  S ADPYV L  
Sbjct: 606 NRYIIQVTQMGVRLLQGLEQIQHMPMDLG--------------CPIVHASCADPYVSLLS 651

Query: 676 SDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGV 735
            DG + LL       T  +   A     +  + +   Y D                +T +
Sbjct: 652 EDGQVVLLTLREGRGTARLHAQAVNLMFRPQIEAVCAYRD-----------VSGLFTTIL 700

Query: 736 GEAID--GADGGPLDQGDIYSVVCYES----GALEIFDVPNFNCVFTVDK---------- 779
            E +D    D    D+  I      E     G  + F +P    V   +           
Sbjct: 701 PEDVDEEAFDNDSSDEPQIIENPDNEDDLLYGDTQTFQMPAIPVVKPQETPTKKPPWWQQ 760

Query: 780 -----------FVSGRTHIVDTYMREALKDS---------ETEINSSSEEGTGQGRKENI 819
                      FV      ++ Y    L+ S         +  ++ S E  T QG ++N 
Sbjct: 761 YLQEIKPTYWLFVYRDNGTLEVYSLPELRLSYLIKNFGFGQNILHDSMEFTTIQGSQQNE 820

Query: 820 ---HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRS 876
                ++V E+A+     H +RP L   L D  +  YQ Y +  P+   K          
Sbjct: 821 PVNPEVQVREIAVVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGHLK---------- 867

Query: 877 LSVSNVSASRLRNLRFSRTPLDAYTREETP--HGAPCQRITIFKNISGHQGFFLSGSRPC 934
           L    +  + +  + FSR        E+ P  +      +  F NI+G+ G F+ G  P 
Sbjct: 868 LRFKKIDHNFI--VGFSRI---GPKEEDMPSMNDTRLCMMRYFSNIAGYNGVFIGGDYPH 922

Query: 935 WCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
           W  +  R  LR HP   DG + +F   +NVNC  GF+Y   +  L+IC LP+  +YD  W
Sbjct: 923 WIFLTGRGELRAHPMNIDGPVKSFAPFNNVNCPQGFLYFNRKDELRICVLPTHLSYDAPW 982

Query: 994 PVQKV 998
           PV+KV
Sbjct: 983 PVRKV 987


>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1446

 Score =  347 bits (890), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 294/1060 (27%), Positives = 474/1060 (44%), Gaps = 219/1060 (20%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E + ++  S ++K R          LE V  + L GNV S+
Sbjct: 29  NLVVAGTSQLFVYRIIHDVESTSKTDKSSDSKTR-------KEKLEQVAAFSLFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   GA+    RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE       R++
Sbjct: 82  ESVQLVGAN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPEL------RDT 131

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                                                  DE   G G G  +    +++I
Sbjct: 132 LT-------------------------------------DEQELGVGEGPKSSFLPTYII 154

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +   C I A+S++   K 
Sbjct: 155 DVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQAQCSIVAISLNIMQKV 214

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 215 HPVIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFP 274

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD + A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 275 LRLQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 334

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL----KEEFGDIEADAPSTKRLR 468
           + + T+     FLGSRLG+SLL+++T       L  G     KE+  D++        L 
Sbjct: 335 TCMVTMEPGYLFLGSRLGNSLLLKYTEKLQEMPLEEGKDKQEKEKDNDMDKQV-YVHTLN 393

Query: 469 RSSSDALQDMVNGE--ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
             S+ +  D    E  E+ +YGS A + T+ A  T+SF V DS++NIGP  + S G    
Sbjct: 394 SFSAHSQHDFFVDEVDEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANASMGEPAF 451

Query: 526 ADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSR 559
                 G + + + E+V                          ELPGC  +WTV     +
Sbjct: 452 LSEEFQG-NPEPDLEVVVCSGHGKNGALSVLQRSIRPQVVTTFELPGCHDMWTVISNEVK 510

Query: 560 GHNADSSRMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
                     ++         D + H +LI+S E  TM+L+T   + E+  S  +  QG 
Sbjct: 511 EDKKVPQSPGSFTATHYSLEEDTKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGP 569

Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
           T+ AGN+   + +IQV   G R+L+G    + L F P +         S ++  S+ADPY
Sbjct: 570 TVFAGNIGDNKYIIQVSPMGIRLLEG---VKQLHFIPVDL-------GSPIVHCSVADPY 619

Query: 671 VLLGMSDGSIRLLVGDP-----STCTVSVQTPAAIESSKKPVSSC--------------- 710
           V++  ++G + + V         T  +++Q P  I +  + ++ C               
Sbjct: 620 VVIMTAEGVVTMFVLKVDSYMGKTHRLALQKP-QISTQSRVIALCAYRDVSGMFTTENKV 678

Query: 711 -----------------TLYHD------KGPEPWLRKTST-------DAWLSTGVGEAID 740
                            T+ HD         E  L   S+       +  + + V     
Sbjct: 679 SCAIAEDFNIRSQSETETVIHDLSSNIVDDEEEMLYGDSSSNAGPSKEEMIRSFVAPGPS 738

Query: 741 GADGGPLD-QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
            ++GGP   +   + +V  ESG +EI+ +P++  VF V  F  G+  +VD+   ++    
Sbjct: 739 VSEGGPSKAEPSHWCLVTRESGVMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQSATQG 798

Query: 800 ETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF- 858
           + E     EE T QG    +  + +V L       +HSRP+L  +  +  +L Y+A+ + 
Sbjct: 799 DKE--GKKEEMTRQGEIPLVKEVTLVSLGY-----NHSRPYLL-VHVEQELLVYEAFPYD 850

Query: 859 -EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--------ETPHGA 909
            + P+N  K                       +RF + P +   RE        +   GA
Sbjct: 851 QQQPQNNLK-----------------------VRFKKVPHNINFREKKSKLRKDKKAEGA 887

Query: 910 PCQ----------RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFT 958
             +          R   F++ISG+ G F+ G  P W +V  R  LR+HP   DG I +F+
Sbjct: 888 AAEDGVAARGRISRFRYFEDISGYSGVFICGPSPHWMLVTSRGALRLHPMTIDGPIESFS 947

Query: 959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+K+
Sbjct: 948 PFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 987


>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
 gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
          Length = 1450

 Score =  344 bits (883), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 299/1053 (28%), Positives = 481/1053 (45%), Gaps = 163/1053 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E   +E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430  RNLQDEDQSLE-EILDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCLN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 649  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+   +G T + 
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGATVLT 828

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
            D    E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 829  DAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R L   N+   +  ++       D     E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKLDXXNLLDQQPTHIELDEN--DEQEEIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             +GF+Y  +   LKI  LPS  +YD+ WPV+KV
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSIWPVRKV 1029


>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
            1, partial [Ciona intestinalis]
          Length = 1370

 Score =  343 bits (880), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 300/1099 (27%), Positives = 480/1099 (43%), Gaps = 197/1099 (17%)

Query: 3    FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
            +A Y+ +H PTG+  C        +                             NL+VTA
Sbjct: 2    YAWYRQIHAPTGVEQCVYCNFASEKEK---------------------------NLLVTA 34

Query: 63   ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
            A+ + +Y +    E + +++N  E       + +    L+ +  ++L GNV  +  +   
Sbjct: 35   ASQLTVYRLERNYEVTTKTENGEE-------NTVVKEKLQQIGSWQLFGNVVRMRSVRLA 87

Query: 123  GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
            GA    + DS++L+F +AK+S++EFD + H ++ TS+H FE   +   K G       P 
Sbjct: 88   GA----KLDSVLLSFAEAKLSIIEFDQATHDIKTTSLHYFEDALY---KDGSYQRITLPK 140

Query: 183  VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF--SARIESSHVINLRD 240
            + VDP+ RC  + +    + ++      + L  D+           + R  +S+ I+L  
Sbjct: 141  IAVDPESRCVALQLTTKSVAVVPLRANTAALATDDGAAPQDNVSLQNKRSTTSYTIDLHA 200

Query: 241  LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
            +D  ++ + D  F+HGY EP +++L E   TWAGRV+ +  TC I A+S++   + HP++
Sbjct: 201  VDARLQRIIDIQFLHGYNEPTLLVLFESLRTWAGRVAMRQDTCNIVAISLNMAEQLHPVV 260

Query: 299  WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE---- 354
            WS   LP D      VP PIGGVL+   N+I Y +QS         Y  SL+S+ E    
Sbjct: 261  WSLNGLPFDCKYAYPVPKPIGGVLIFAVNSILYLNQSVP------PYGTSLNSTTENSTS 314

Query: 355  ---LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
                P+    + LD +HA ++  +  ++S K G+L +LT++ D  R V+     K+  SV
Sbjct: 315  FPLKPQEDVCMTLDCSHAMFISPESLVISLKNGELYVLTLLVDSMRSVRNFHFDKSASSV 374

Query: 411  LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
            LTS +T + +   FLGSRLG+SLL+++T        +  +       E  A   KRL  +
Sbjct: 375  LTSCLTVLDDGFLFLGSRLGNSLLLKYT-------EARPVFRNCYHTEEPAAKRKRLNTA 427

Query: 471  SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
            +  A  D  N  +L +YG  +  +E    ++ F V DSLVNIGP      G    A  S 
Sbjct: 428  ADWAASD-TNDIDLQMYGKDTVTSEPL-SSYKFEVCDSLVNIGPCGAAELGEP--AFLSE 483

Query: 531  TGIS-KQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNA 563
              +S ++S+ EL                           ELPGC  +WTV     +    
Sbjct: 484  EFVSQRESDLELAILSGHGKNGAISVLQRSVKPQVVTTFELPGCIDMWTVKSVCEKTELP 543

Query: 564  DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
              ++      + H+YLI+S E  T++LET   + EV E+  +  + +++  GN+ G + +
Sbjct: 544  TKTQ-----QQQHSYLILSREESTLILETGKEIMEV-ENSGFNTREQSVFVGNIGGDKEL 597

Query: 624  I-QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
            I QV   G  +L G  + Q +       E G     S +   SI DPY LL  SDG + +
Sbjct: 598  ILQVCASGVWLLAGVKLLQHIPL-----ELG-----SPITQCSICDPYALLLTSDGDLIM 647

Query: 683  LV----------GDPSTCTVSVQTPAAIE--------------------------SSKK- 705
            L                C  S+     IE                          SS K 
Sbjct: 648  LTLTNDLDSENGVKLECCNPSINQVPQIEHVCLYKDTSGLFKTASGPSDVFLPEDSSNKG 707

Query: 706  -------------PVSSCTLYHDKGPEPWLRKTSTDAWL----------STGVGEAIDGA 742
                         P+SS T   D+  E    ++  D             S    E +DG 
Sbjct: 708  VSDSEIPSSLPRTPLSSKTFTVDEEDELLYGESDPDVIFAPQFAPNVPKSPTQNEPLDGD 767

Query: 743  DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
              G  ++   ++++  E+  LEI+ +P+ + V+T+  F  G+  + ++    +   S+ +
Sbjct: 768  KEGN-EEFTFWAIIARENRNLEIYSMPSLDLVYTIKNFSFGQKLLTNSGPVHSYSVSKDD 826

Query: 803  INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
             ++S    T    K  I  + +V L  +     +S P L A + +  IL Y+ + F  PE
Sbjct: 827  KSTS----TRYSDKPRIFEILLVGLGYK-----NSSPHLIARIEE-EILIYEVFKFSAPE 876

Query: 863  NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG 922
               K +     S  +    V+ S    +   R P+   T+ +      C R   F NI G
Sbjct: 877  KFKKYN-----SLQIRFKKVNHS----MMIRRAPVTHETKTDQLEHRNCLR--TFSNIGG 925

Query: 923  HQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G FL G  P W  V  R  L  HP   DGS+  F   HNVNC +GF+Y  SQG L+IC
Sbjct: 926  YSGVFLCGPYPYWIFVTIRGALCCHPMSVDGSVSCFVPFHNVNCPNGFLYFNSQGELRIC 985

Query: 982  QLPSGSTYDNYWPVQKVVF 1000
             LP    YD  WP++K+  
Sbjct: 986  MLPPHMKYDTAWPMRKITL 1004


>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
 gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
          Length = 1455

 Score =  341 bits (875), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 296/1054 (28%), Positives = 484/1054 (45%), Gaps = 165/1054 (15%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E G ++  N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E  ++E +     +L  + + A    +  EEL +YG+ +  +    + F F V D
Sbjct: 430  RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGTGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPINYMCAGERVEFEEDGATLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 649  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGNAFKMNSMAD 768

Query: 741  GADGGPLDQGD------------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D             + +V  +SG LEI+ +P+   V+ V+   +G   + 
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLIVARQSGTLEIYSMPDMKLVYLVNDVGNGAMVLT 828

Query: 789  DTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
            D      +  +  E   +S+ G  Q    ++ +S   +EL++     +  RP L  + T 
Sbjct: 829  DAMEFVPISLTTQE---NSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTR 884

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
              +L YQ  +F  P+   K        R L   N+   +  ++       DA    E+  
Sbjct: 885  VELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DAQEEIESYQ 935

Query: 908  GAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVN 964
              P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN
Sbjct: 936  MQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVN 995

Query: 965  CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
              +GF+Y  +   LKI  LPS  +YD+ WPV+KV
Sbjct: 996  IPNGFLYFDTTYELKISVLPSYLSYDSTWPVRKV 1029


>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
 gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
          Length = 1455

 Score =  341 bits (874), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 301/1053 (28%), Positives = 481/1053 (45%), Gaps = 163/1053 (15%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E G ++  N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEAGQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQTEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E   I  +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430  RNLQDE-EQIMEEIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPVNYMCAGERVEFEEDGATLRPHAESLQDVKIELVAATGHSKNGALSVFVNCIN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++       E G  
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-----EVG-- 651

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 652  ---SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSGFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+    G   IV
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV--GNGAIV 826

Query: 789  DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
             T   E +  S T   +S          ++ +S   +EL++     +  RP L  + T  
Sbjct: 827  LTDAMEFVPISLTTQENSKAGIVQACMPQHANSPLPLELSLTGLGLNGERPLLM-VRTRV 885

Query: 849  TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
             +L YQ  +F  P+   K        R L   N+   +  ++       D     E+   
Sbjct: 886  ELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEDIESYQM 936

Query: 909  AP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNC 965
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN 
Sbjct: 937  QPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNI 996

Query: 966  NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             +GF+Y  +   LKI  LPS  +YD+ WPV+KV
Sbjct: 997  PNGFLYFDTTYELKISVLPSYLSYDSTWPVRKV 1029


>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
 gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
            Short=dCPSF 160
 gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
          Length = 1455

 Score =  340 bits (872), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 297/1054 (28%), Positives = 483/1054 (45%), Gaps = 165/1054 (15%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429

Query: 448  SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
              L++E  ++E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430  RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508  SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
            SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 489  SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 548

Query: 539  YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
             +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 549  PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 599

Query: 596  LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
            + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 600  INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 648

Query: 656  SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
               S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 649  DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 708

Query: 716  -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                   KG                       EP ++    +  L    G A       D
Sbjct: 709  LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 768

Query: 741  GADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
             A        D +             VV  +SG LEI+ +P+   V+ V+   +G   + 
Sbjct: 769  LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 828

Query: 789  DTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
            D      +  +  E   +S+ G  Q    ++ +S   +EL++     +  RP L  + T 
Sbjct: 829  DAMEFVPISLTTQE---NSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTR 884

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
              +L YQ  +F  P+   K        R +   N+   +  ++       D     E+  
Sbjct: 885  VELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQ 935

Query: 908  GAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVN 964
              P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN
Sbjct: 936  MQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVN 995

Query: 965  CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
              +GF+Y  +   LKI  LPS  +YD+ WPV+KV
Sbjct: 996  IPNGFLYFDTTYELKISVLPSYLSYDSVWPVRKV 1029


>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
 gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
          Length = 1455

 Score =  340 bits (871), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 297/1059 (28%), Positives = 480/1059 (45%), Gaps = 175/1059 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++        D+E 
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVIT------LDDVEQ 423

Query: 460  DAPSTKRLRRSSSDALQDM-----------------VNGEELSLYGSASNNTESAQKTFS 502
             +   +R  +     L+++                 +  EEL +YGS +  +    + F 
Sbjct: 424  QSEQQQRNLQDEDQNLEEIFDVDQVEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFI 483

Query: 503  FAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS----- 537
            F V DSL+N+ P+     G R+  +                     +ATG SK       
Sbjct: 484  FEVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVF 543

Query: 538  ----NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
                N +++   EL GC  +WTV+         D+++ ++ +D+ H ++ +S    T+VL
Sbjct: 544  VNCLNPQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMFLSQRNSTLVL 594

Query: 591  ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
            +T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++       
Sbjct: 595  QTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----- 648

Query: 651  ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSC 710
                    S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + 
Sbjct: 649  -----DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAI 703

Query: 711  TLYHD-------KG----------------------PEPWLRKTSTDAWLSTGVGEAI-- 739
            + Y D       KG                       EP ++    +  L    G A   
Sbjct: 704  SAYKDLSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKM 763

Query: 740  ----DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSG 783
                D A        D +             VV  +SG LEI+ +P+   V+ V+   +G
Sbjct: 764  NSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNG 823

Query: 784  RTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLF 842
               + D      +  +  E   +S+ G  Q    ++ +S   +EL++     +  RP L 
Sbjct: 824  AMVLTDAMEFVPISLTTQE---NSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL 880

Query: 843  AILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTR 902
             + T   +L YQ  +F  P+   K        R L   N+   +  ++       D    
Sbjct: 881  -VRTRVELLIYQ--VFRYPKGHLK-----IRFRKLDQLNLLDQQPTHIELDEN--DEQEE 930

Query: 903  EETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTV 959
             E+    P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F  
Sbjct: 931  IESYQMQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAA 990

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             +NVN  +GF+Y  +   LKI  LPS  +YD+ WPV+KV
Sbjct: 991  FNNVNIPNGFLYFDTTYELKISVLPSYLSYDSIWPVRKV 1029


>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
 gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
          Length = 1463

 Score =  332 bits (850), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 295/1065 (27%), Positives = 470/1065 (44%), Gaps = 179/1065 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E S+  K N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEASQRQKLNPSE------MRVAPKMRLECLATYSLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA--GAMRDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 137

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P V+VDP  RC  +L+YG ++++L   +  S     L   +    +      R  
Sbjct: 138  GRYYVPEVRVDPDARCAVMLIYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVTRTP 197

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T AGR+  +  TC++ A+S
Sbjct: 198  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDTCVLVAIS 257

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  NLP D  +LL +  PIGG LV+  N + Y +QS         Y V
Sbjct: 258  LNIQQRVHPIIWTVNNLPFDCLRLLPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 311

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 312  SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVVSLRTGDLYVLTLCVDSMRTVR 371

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++            
Sbjct: 372  NFHFHKAASSVLTSCICVCHMEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEQQQQQQA 431

Query: 448  ----SGLKEEFGDIEAD-------APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES 496
                S   E  G ++ D       APS  + RR         +  EEL +YG+ +  +  
Sbjct: 432  AEEPSEEAEIEGILDMDQLEAATSAPSQAKSRR---------IEDEELEVYGTGAKASVL 482

Query: 497  AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQ 536
              + F F V DSL+N+ P+     G R+  +                     +ATG SK 
Sbjct: 483  QLRKFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLQDVKIELVAATGHSKN 542

Query: 537  S---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
                      N +++   EL GC  +WTV+         D+++  +  D+ H ++++S +
Sbjct: 543  GALSVFVNCINPQIITSFELEGCLDVWTVFD--------DATKKTSRQDQ-HDFMLLSQK 593

Query: 585  ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
              T+VL+T   + E+ E+  + V   TI  GNL   R ++QV  R  R+L G+ + Q++ 
Sbjct: 594  NSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQNRFIVQVTTRHVRLLQGTRLVQNVP 652

Query: 645  FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
                          S V+ V+IADPYV L + +G +  L    S  T  +       SS 
Sbjct: 653  I----------DVGSPVVQVAIADPYVCLRVFNGQVITLALRESRGTPRLAINKHTISSS 702

Query: 705  KPVSSCTLYHD-------------------------------KGPEPWLRKTSTDAWLST 733
              V +   Y D                                  EP ++    +  L  
Sbjct: 703  PAVVAIAAYKDLSGLFTVKSDDILNLTGSGSNSAFGSTFGGYMKSEPHMKVEDEEDLLYG 762

Query: 734  GVGEAI------DGADGGPLDQGD-------------IYSVVCYESGALEIFDVPNFNCV 774
              G A       D A        D              + VV  +SG LEI+ +P+   V
Sbjct: 763  DAGNAFKMNTMADLAKQSKQKNSDWWRRMLVQAAKPTYWLVVARQSGTLEIYSMPDMKLV 822

Query: 775  FTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSA 834
            + V+   +G   + D    E +  S T   +S          ++ +S   +EL++     
Sbjct: 823  YLVNDVGNGAMVLTDAM--EFVPISLTSQENSKAGIVQSCMPQHANSPLPLELSLVGLGL 880

Query: 835  HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR 894
            +  RP L  + T   +L YQ  +F  P+   K        R +   N+   +  ++    
Sbjct: 881  NGERPLLL-VRTRLELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHVNLDD 932

Query: 895  TPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
               +             Q++  F N+ G  G  + G  PC+  +  R  LR+H  L +G 
Sbjct: 933  NEENEELESYNMQPKYVQKLRPFNNVGGMSGVMICGVNPCFLFLTSRGELRIHRLLGNGE 992

Query: 954  IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            + +F   +N+N  +GF++  +   LKI  LPS  +YD+ WPV+KV
Sbjct: 993  VRSFAAFNNINIPNGFLFFDTTFELKISVLPSYLSYDSTWPVRKV 1037


>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
 gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
          Length = 1455

 Score =  331 bits (849), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 293/1061 (27%), Positives = 473/1061 (44%), Gaps = 179/1061 (16%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    E G ++  N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVEAGQRQKLNPTE------MRVAPKMRLECLATYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  +     L   +    +      R  
Sbjct: 136  GRYFVPVVRVDPDSRCAVMLVYGKRLVVLPFRKDNTLDEIELADVKPIKKAPTAMVTRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196  IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
            ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256  LNIQQRVHPIIWTVNSLPFDCQQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310  SLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK-------- 451
                 K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370  NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVDQQADQQL 429

Query: 452  ----------EEFGDIEA--DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                      +E  D++    AP+  + RR         +  EEL +YGS +  +    +
Sbjct: 430  QRQQSEDQTLDEILDVDQLELAPTQAKSRR---------IEDEELEVYGSGAKASVLQLR 480

Query: 500  TFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS-- 537
             F F V DSL+N+ P+     G R+  +                     +ATG SK    
Sbjct: 481  KFVFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLNDLKIELVAATGHSKNGAL 540

Query: 538  -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                   N +++   EL GC  +WTV+         D+++  +  D+ H ++++S    T
Sbjct: 541  SVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLLSQRNST 591

Query: 588  MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
            +VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++    
Sbjct: 592  LVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 648

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                       S V+ VSIADPYV L + +G +  L    +  T  +       SS   V
Sbjct: 649  --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 700

Query: 708  SSCTLYHD-----------------------------KGPEPWLRKTSTDAWLSTGVGEA 738
             + + Y D                                EP ++    +  L    G A
Sbjct: 701  VAISAYKDLSGLFTVKADDVNLTGSSSSAFGHSFGGYMKAEPHMKVEDEEDLLYGDAGNA 760

Query: 739  I------DGADGGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKF 780
                   D A        D +             VV  +SG LEI+ +P+   V+ V+  
Sbjct: 761  FKMNSMADLAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDV 820

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
             +G   + D    E +  S T   +S          ++ +S   +EL +     +  RP 
Sbjct: 821  GNGAMVLTDAM--EFVPISLTTQENSKAGIVQACMPQHANSPLPLELTVLGLGLNGERPL 878

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            L  + T   +L YQ  +F  P+   K        R L   N+   +  ++       D  
Sbjct: 879  LL-VRTRVELLIYQ--VFRYPKGHLK-----IRFRKLEQLNLMDHQPSHIELDEN--DER 928

Query: 901  TREETPHGAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAF 957
               E+    P   Q++  F N+ G  G  + G  PC+  +  R  LR+H  L +G + +F
Sbjct: 929  EEMESYQMQPKYVQKLRPFANVGGLSGIMVCGVNPCFVFLTSRGELRIHRLLGNGDVRSF 988

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
               +NVN  +GF+Y  +   LKI  LPS  +YD+ WP++KV
Sbjct: 989  AAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSTWPIRKV 1029


>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
          Length = 1392

 Score =  329 bits (844), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 284/1038 (27%), Positives = 460/1038 (44%), Gaps = 200/1038 (19%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LV + ANVI+++ +    +         ET           + LE V  Y L GN+ S+ 
Sbjct: 30  LVTSGANVIKVFRLIPDIDTKTRIDKFNETNP-------PKSKLECVAQYTLFGNIMSMQ 82

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++   +     RD+++LAF+DAK+SV+E+D   H L+  S+H FE  +   +K G    
Sbjct: 83  SVNLANSP----RDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDD---MKDGWTHH 135

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED---TFGSGGGFSARIESSH 234
              P+V+ DP+ RC  + V+G ++++L   +  +    D D     G   G  A I +S+
Sbjct: 136 YHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILASY 195

Query: 235 VINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
           +I L+D    + ++ D  F+HGY EP ++IL E   T+AGRV+ +  TC ++A+S++   
Sbjct: 196 MIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNLQQ 255

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS 352
           K HP+IWS  NLP D  K + +  P+GG L+   N + Y +QS      +  Y VSL+S 
Sbjct: 256 KVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQS------IPPYGVSLNSI 309

Query: 353 QE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
            E        P+    + LD A AT+L++D  +LS K G+L +LT++ D  R V+     
Sbjct: 310 AENSTNFPLKPQDDLCISLDCAQATFLEDDTIVLSLKGGELYVLTLLADNMRYVRSFHFE 369

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--CGSGTSMLSSGLKEEFGDIEADAP 462
           K   SVLT+ I+   N+  FLGSRLG+SLL++FT  C    ++            E   P
Sbjct: 370 KAAASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITL-----------DETIEP 418

Query: 463 STKRLRRSSS------DALQDMVNG------------EELSLYGSASNNTESAQKTFSFA 504
           S KRL+ S+S      D + D +N             EEL +YG+    +     ++ F 
Sbjct: 419 SAKRLKASNSTSENEDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQIS-SYVFE 477

Query: 505 VRDSLVNIGPL------------KDFSYGLRINADASATG----------ISKQSNYELV 542
           V DSL+NIGP             ++FS  L ++ +   T           + K    ++V
Sbjct: 478 VCDSLLNIGPCGNISLGEPAFLSEEFSENLDLDLELVTTAGYGKNGALCVLQKSVRPQIV 537

Query: 543 ---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
               LPGC  +WTV+    +                HA+LI+S E  TM+L+T D + E+
Sbjct: 538 TTFTLPGCSNMWTVHAGEDK----------------HAFLILSQEDGTMILQTGDEINEI 581

Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENS 659
            ++  +     T+ AG                 I    ++  +L               S
Sbjct: 582 -DNTGFATHIPTVYAG-----------------INQLQHIPLELG--------------S 609

Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG-- 717
            ++ V+  DPY+ L  +DG +  L+   +     +    +  S+  PV++  +Y D    
Sbjct: 610 PIVHVTSVDPYISLLTTDGQVITLMLREARGVAKLVISKSTLSNSPPVTTICMYRDVSGL 669

Query: 718 ------------PEPWLRKTSTDAWLSTGVGEAIDGADGG--------PLDQGDIYS--- 754
                       PE ++ ++ T   +     + + G D          P  +  +Y    
Sbjct: 670 FTSKIPEDFTHIPEHFINESETKMEVENE-DDLLYGDDSDFKMPTLNPPQPKPKVYYNWW 728

Query: 755 -------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
                         V  E+  LEI+ +P+F   + +     G   +VD    E++  S +
Sbjct: 729 KKYLLDVRPSYWLFVVRENSNLEIYSIPDFKLCYYITNLCFGHKVLVDNL--ESVTISAS 786

Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
              S++ E   Q R+ ++  + VV L       H SRP L   L +  +  Y+ + F  P
Sbjct: 787 TPISAAHEANIQ-RQFDVKEILVVALG-----NHGSRPLLMVRL-ERDLYIYEVFRF--P 837

Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
               K          +   NVS       R      D +  +E        ++  F NI+
Sbjct: 838 RGNLKMRFRKIKHSLIYSPNVSG------RIDTEDSDFFAIQER-----IIKMRYFTNIA 886

Query: 922 GHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKI 980
           G+ G F+ G+ P W  M  R  LR HP   DG +++F   +NVNC  GF+Y   +  L+I
Sbjct: 887 GYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFLYFNRKSELRI 946

Query: 981 CQLPSGSTYDNYWPVQKV 998
             LP+  +YD  WPV+KV
Sbjct: 947 GVLPTHLSYDAAWPVRKV 964


>gi|215701517|dbj|BAG92941.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 265

 Score =  323 bits (829), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 161/246 (65%), Positives = 191/246 (77%), Gaps = 1/246 (0%)

Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
           GYIEPV+VILHE+E TWAGR+  KHHTCMISA SIS TLKQHP+IWSA NLPHDAY+LLA
Sbjct: 19  GYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLKQHPVIWSAANLPHDAYQLLA 78

Query: 314 VPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           VP PI GVLV+ AN+IHYHSQS SC+L LNN++   D S E+ +S+F VELDAA ATWL 
Sbjct: 79  VPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQVELDAAKATWLS 138

Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
           ND+ + STK G+++LLTVVYDGRVVQRLDL K+  SVL+S +T+IGNS FFLGSRLGDSL
Sbjct: 139 NDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNSFFFLGSRLGDSL 198

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASN 492
           LVQF+  +  S+L     E   DIE D P +KRL+R  SD LQD+ + EELS     A N
Sbjct: 199 LVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSVEELSFQNIIAPN 258

Query: 493 NTESAQ 498
           + ESAQ
Sbjct: 259 SLESAQ 264


>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
 gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
          Length = 1459

 Score =  321 bits (823), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 291/1067 (27%), Positives = 475/1067 (44%), Gaps = 187/1067 (17%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  AN++++Y +    E G ++  N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     +D+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
                 P+V+VDP  RC  +LVYG ++++L   +  S    DE        F         
Sbjct: 136  GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192

Query: 230  ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
               I +S++I L++LD K  +V D  F+HGY EP ++IL+E   T +GR+  +  TC++ 
Sbjct: 193  RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCSGRIKVRSDTCVLV 252

Query: 285  ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
            A+S++   + HP+IW+  +LP D +++  +  PIGG LV+  N + Y +QS         
Sbjct: 253  AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306

Query: 345  YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
            Y VSL+SS +        P+    + LD A+  ++  D  ++S +TG+L +LT+  D  R
Sbjct: 307  YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366

Query: 397  VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
             V+     K   SVLTS I    +   FLGSRLG+SLL+ FT    +++++  +      
Sbjct: 367  TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426

Query: 452  ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
                            EE  D++    AP   + RR         +  EEL +YGS +  
Sbjct: 427  EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477

Query: 494  TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
            +    + F F V DSL+N+ P+     G R+  +                     +ATG 
Sbjct: 478  SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537

Query: 534  SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
            SK           N +++   EL GC  +WTV+         D+++  +  D+ H ++++
Sbjct: 538  SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588

Query: 582  SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
            S    T+VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q
Sbjct: 589  SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
            ++               S V+ V+IADPYV L M +G     ++R   G P         
Sbjct: 648  NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697

Query: 694  VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
              +PA  AI + K      T+  D                       EP ++    +  L
Sbjct: 698  TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGTGSGFGHSFGGYMKAEPNMKVEDEEDLL 757

Query: 732  STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
                G A        L Q                     + VV  +SG LEI+ +P+   
Sbjct: 758  YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
            V+ ++   +G   + D     +L  S  E   +S+ G  Q    ++ +S   +EL++   
Sbjct: 818  VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              +  RP L  + T   +L YQ  +F  P+   K        R L   N+   +  ++  
Sbjct: 875  GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
                 +             Q++  F N+ G  G  + G  PC+  +  R  LR+H    +
Sbjct: 927  EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            G + +F   +NVN  +GF+Y  +   LKI  LPS  +YD+ WPV+KV
Sbjct: 987  GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKV 1033


>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
 gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
          Length = 1459

 Score =  321 bits (822), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 291/1067 (27%), Positives = 474/1067 (44%), Gaps = 187/1067 (17%)

Query: 57   NLVVTAANVIEIYVVRVQ-EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  AN++++Y +    E G ++  N  E      M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANMLKVYRISPNVEAGQRQKLNPNE------MRIAPKMRLECLATYFLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     +D+++++F+DAK+SVL+ D   + L+  S+H FE  +   ++ G  
Sbjct: 83   LQCVSLAGA----MQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEED---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR------ 229
                 P+V+VDP  RC  +LVYG ++++L   +  S    DE        F         
Sbjct: 136  GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSL---DEIELTDVKPFKKAPTAMVS 192

Query: 230  ---IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
               I +S++I L++LD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ 
Sbjct: 193  RTPIMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLV 252

Query: 285  ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
            A+S++   + HP+IW+  +LP D +++  +  PIGG LV+  N + Y +QS         
Sbjct: 253  AISLNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVP------P 306

Query: 345  YAVSLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
            Y VSL+SS +        P+    + LD A+  ++  D  ++S +TG+L +LT+  D  R
Sbjct: 307  YGVSLNSSADNSTSFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGELYVLTLCVDSMR 366

Query: 397  VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK----- 451
             V+     K   SVLTS I    +   FLGSRLG+SLL+ FT    +++++  +      
Sbjct: 367  TVRNFHFHKAAASVLTSCICVCHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDVDAEQQA 426

Query: 452  ----------------EEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
                            EE  D++    AP   + RR         +  EEL +YGS +  
Sbjct: 427  EQQQQKQQRVQEDQDIEEVYDVDQIELAPPQAKSRR---------IEDEELEVYGSGAKA 477

Query: 494  TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGI 533
            +    + F F V DSL+N+ P+     G R+  +                     +ATG 
Sbjct: 478  SVLQLRKFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGH 537

Query: 534  SKQS---------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII 581
            SK           N +++   EL GC  +WTV+         D+++  +  D+ H ++++
Sbjct: 538  SKNGALSVFVNCINPQIITSFELDGCLDVWTVFD--------DATKKTSRHDQ-HDFMLL 588

Query: 582  SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
            S    T+VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q
Sbjct: 589  SQSNSTLVLQTGQEINEI-ENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQ 647

Query: 642  DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG-----SIRLLVGDPSTCT---VS 693
            ++               S V+ V+IADPYV L M +G     ++R   G P         
Sbjct: 648  NVPI----------DVGSPVVQVAIADPYVCLRMLNGQVITLALRETRGSPRLAINKHTI 697

Query: 694  VQTPA--AIESSKKPVSSCTLYHDK--------------------GPEPWLRKTSTDAWL 731
              +PA  AI + K      T+  D                       EP ++    +  L
Sbjct: 698  TSSPAVVAIAAYKDLSGLFTVKSDDVLNLTGGSGSGFGHSFGGYMKAEPNMKVEDEEDLL 757

Query: 732  STGVGEAIDGADGGPLDQGD------------------IYSVVCYESGALEIFDVPNFNC 773
                G A        L Q                     + VV  +SG LEI+ +P+   
Sbjct: 758  YGDAGNAFKINSMAVLAQQSKQKNSDWWRRLLVQAKPSYWLVVSRKSGTLEIYSMPDMKL 817

Query: 774  VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRW 832
            V+ ++   +G   + D     +L  S  E   +S+ G  Q    ++ +S   +EL++   
Sbjct: 818  VYHINDVGNGAMVLSDALEFVSLSSSTQE---NSKVGIVQSCMPQHANSPLPLELSLVGL 874

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              +  RP L  + T   +L YQ  +F  P+   K        R L   N+   +  ++  
Sbjct: 875  GLNGERPVLM-VRTRVELLIYQ--VFRYPKGNLKI-----RFRKLEQLNLLDQQPSHIEL 926

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCD 951
                 +             Q++  F N+ G  G  + G  PC+  +  R  LR+H    +
Sbjct: 927  EENDEEEELESYNMQPKYVQKLRPFSNVGGLAGIMVCGVNPCFVFLTARGELRIHRLQGN 986

Query: 952  GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            G + +F   +NVN  +GF+Y  +   LKI  LPS  +YD+ WPV+KV
Sbjct: 987  GDVRSFAAFNNVNIPNGFLYFDTTFELKISVLPSYLSYDSVWPVRKV 1033


>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
 gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
          Length = 1373

 Score =  313 bits (803), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 268/1066 (25%), Positives = 456/1066 (42%), Gaps = 205/1066 (19%)

Query: 3    FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
            FA YK +H PT ++ C     T +  +                           NL++  
Sbjct: 2    FACYKQLHPPTAVSFCLKARFTSANDE---------------------------NLIIVK 34

Query: 63   ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL-AILSQ 121
             N++E+Y+++                        + +++ LV  + L G ++S+ A+  Q
Sbjct: 35   NNIMEVYLIKP-----------------------NTSNIVLVKVFELFGVIDSIIAVCLQ 71

Query: 122  GGADNSRRRDSIILAFED-AKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
            G      +++ +++ FED AK+SV+EFD+    L+  S+H  E      L+ G+  F   
Sbjct: 72   G-----MKKEMLLINFEDEAKVSVVEFDEKRSDLKTLSLHYLEDD---FLREGKARFFHN 123

Query: 181  PLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------GLVGDEDTFGSGGGFSA 228
              + +DPQ R   V++   +++IL   Q G              L GD++      G   
Sbjct: 124  QPIILDPQNRFATVIICDSKLVILPFRQSGEDVSLSTEDNFLFALSGDQEEANENVGDQK 183

Query: 229  R-----IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
            +     ++   +I+L DL +K+VKD+ F++GY EP ++ LHE E TW+GR++ K +T  +
Sbjct: 184  KHHQPEVQRQVIIDLNDLGIKNVKDYCFLNGYNEPTILFLHENEQTWSGRLAAKSNTSTV 243

Query: 284  SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-GGVLVVGANTIHYHSQSASCALAL 342
            +A+S     K +P IWS  +LPHD  KL+ +   + GG LV+G N+I + +Q A+  L+ 
Sbjct: 244  TAVSFDLFRKYYPKIWSVGSLPHDCNKLIPLQEDVAGGALVIGMNSIIHINQCATYGLSF 303

Query: 343  NNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
            N++AVS + +  +  ++F   ++  D    T++  D  L+S K G+L  + +   G  + 
Sbjct: 304  NDFAVS-NPNLSINFNTFDGPALFFDTVAYTFIARDKLLVSLKDGELYTMYLESGGSRIN 362

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
             +++ KT+ +   S + T+  +L FLGS++GDS+L ++         S    EE   + A
Sbjct: 363  NINIKKTSNTTPASCMCTLKGNLIFLGSKIGDSVLYEYQEKVEVETSSLDTDEEMSSVFA 422

Query: 460  -----DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
                 +    KR      D    +   EE ++  S S  ++         ++    NIGP
Sbjct: 423  AGENFEPEKKKRKLADDDDFFAALEKDEEPTVIESFSKVSKKETTKVELKIKHVFTNIGP 482

Query: 515  LKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTVYHKSSRGHNADSSRMA 569
            +   +  +  + D S  G   ++N   +    C GI      TV ++S +      + + 
Sbjct: 483  ISHLTAAVTSSFDMS--GFKSKTNDNQLSAIACSGIGRHGCLTVLNRSLQPDIQSEATLP 540

Query: 570  ---------AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
                     +   E+  YLI+SLE +T V E+   L EVT    +     T+  G +  R
Sbjct: 541  FLVKQVWTISQKTEHDLYLILSLEDKTKVFESKATLAEVTSKSMFVTNETTLNIGKI--R 598

Query: 621  RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
              ++QV  R + +L GS         P           S++    I DPYVLL   DGS+
Sbjct: 599  ESIVQV-TRKSVMLIGS--------EPKQVHHSKKEIRSSI----ILDPYVLLHFYDGSL 645

Query: 681  RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAID 740
             LL  D    T        IES+   +++  LY    PE          +   G+ E   
Sbjct: 646  VLLTHDNGRVT---SKQLDIESNHGKITAVCLYK-TNPE----------FEFFGINEK-- 689

Query: 741  GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE 800
                    +G     V +  GA EI  VP+  CVF+  +F    T + D           
Sbjct: 690  --------EGKYLCCVYWTDGAFEILSVPDMTCVFSFSQFYQFHTTLFD----------- 730

Query: 801  TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
                   E  +    +  +    V E+A++   +    P+L ++L+D T+  Y+++L   
Sbjct: 731  -------EGQSSNTTQSEVKYPYVTEMALRGIGSDSEMPYLVSVLSDNTVHIYRSFL--- 780

Query: 861  PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSR------TPLDAYTREETPHGAPCQ-- 912
             + T+KS D               +RL  LRFS+       P+    ++        +  
Sbjct: 781  -DRTTKSKD---------------NRLTRLRFSKFQHDDLLPISEIDKKSQTFTLNLKSK 824

Query: 913  -----------RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 961
                       ++  FKNI G+ G F +G +P W       LRVHP      +  FT  H
Sbjct: 825  YLFPKSDLGRSQLIPFKNIGGYGGLFKTGEKPFWLFTEHSNLRVHPTQSRDPVTTFTPYH 884

Query: 962  NVNCNHGFIYVT-------SQGILKICQLPSGSTYDNYWPVQKVVF 1000
            + NC HGFIY+T        Q  L I  L +   ++ YWP +K++ 
Sbjct: 885  HENCPHGFIYLTDKEQDNKKQSKLHISSLNANVKFNAYWPQRKILL 930


>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes
           aegypti]
 gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
          Length = 1417

 Score =  311 bits (797), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 286/1021 (28%), Positives = 467/1021 (45%), Gaps = 137/1021 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           +LV   ANV+++Y  R+  +    S++   T R   M       LE +  Y L GN+ S+
Sbjct: 29  SLVTGGANVLKVY--RLIPDADATSRDKFTTTRPPNM------KLECMATYTLFGNIMSM 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +S  G+    +RD+++++F+DAK+SV++FD     L+  S+H FE  +   +K G   
Sbjct: 81  QSVSLAGS----QRDALLISFQDAKLSVVQFDPDNFELKTLSLHYFEEED---IKGGWTG 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGG---FSARI 230
               P+V+VDP  RC  +LVYG ++++L   K S      V D                I
Sbjct: 134 HYHTPIVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEVQDVKPMKKAPTQLIAKTPI 193

Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
            +S+VI L++ +  + +V D  F+HGY EP ++IL+E   T+ GR++ +  TCM+ ALS+
Sbjct: 194 LASYVIELKESEERIDNVIDIQFLHGYYEPTLLILYEPVKTFPGRIAVRSDTCMMVALSL 253

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAV 347
           +   + HP+IW+   LP D  + +A+  PIGG L++  N + Y +QS     ++LN+ A 
Sbjct: 254 NIQQRVHPVIWTVNCLPFDCLQAIAISKPIGGCLILSVNALIYLNQSVPPYGVSLNSIAD 313

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
              +    P+    + LDAA   +++ +  +LS K G+L +LT+  D  R V+    SK 
Sbjct: 314 HCTNFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRSFHFSKA 373

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
             SVLT  I  +     FLGSRLG+SLL++F     + +++    EE  + E      KR
Sbjct: 374 ASSVLTCCICVVEEEYLFLGSRLGNSLLLRFKEKDESMVITIDDTEEVVEKEP-----KR 428

Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI-- 524
           LR          +  EEL +YGS    T     ++ F V DS++NIGP+   + G RI  
Sbjct: 429 LR----------LEQEELEVYGSG-QKTSVQLTSYIFEVCDSILNIGPIGHMAVGERISE 477

Query: 525 ---NADASATGISKQSNYELVELP--GCKGIWTVYHKSSRGHNADSSRMAAY--DDEYHA 577
              + +     +  + + E+V     G  G   V   S +     S  ++     D+ H+
Sbjct: 478 EEQDENKDVQFVPNKLDLEIVTSSGHGKNGALCVLQNSIKPQVITSFGLSGCLDVDDMHS 537

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
           ++I+S EA TMVL+T D + E+ E+  +     TI  GN+ G R ++QV  +  R+L G+
Sbjct: 538 FMILSQEAGTMVLQTGDEINEI-ENTGFATNVPTIHVGNIGGNRFIVQVTTKSIRLLQGT 596

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV-----GDPSTC-- 690
            + Q++                 + +VSIADPYV +  S+G +  L      G P     
Sbjct: 597 RLLQNIPI----------DLGCPLAAVSIADPYVCVRSSEGRVITLALREGKGTPRLAVN 646

Query: 691 --TVSVQTPAAIE-SSKKPVSS--CTLYHD------------------KGPEPWLRKTST 727
             T+S  TPA +  S  K VS    T Y D                    PEP ++    
Sbjct: 647 KNTIST-TPAVVAISVYKDVSGMFTTKYEDFYDGSKAGSSAYSSGFGYMKPEPHMKIEDE 705

Query: 728 DAWLSTGVGEAI------DGADGGPLDQGD--------------IYSVVCYESGALEIFD 767
           +  L    G +       D A        D              +Y+V   ++G LEI+ 
Sbjct: 706 EDLLYGESGRSFKMTSMADMAIETKKKNTDFWRKFMQPVKPTFWLYAV--RDNGNLEIYS 763

Query: 768 VPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN---SSSEEGTGQGRKENIHSMKV 824
           +P+   V+ +    +G   + D+     L+  +T  +   +S+   +  G   N+   ++
Sbjct: 764 MPDLKLVYLITNIGNGNKVLQDSMEFVPLQVGQTAADADVTSNAFTSPFGFNPNLLPKEI 823

Query: 825 VELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA 884
           + +A+     H +RP LF  L +  +L Y+ Y +      SK    +   R    S V+ 
Sbjct: 824 LMVAL---GHHGTRPMLFVRL-ENDLLVYRVYRY------SKGHLKLRFRR--VPSGVTG 871

Query: 885 SRLRNLRFSRTPLDAYTREETPHGAPCQ-----RITIFKNISGHQGFFLSGSRPCWCMVF 939
              +       P D    +   H           I  F N++G+ G  + G +P + M+ 
Sbjct: 872 PIFKIAPRQSAPTDQEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKP-YIMLL 930

Query: 940 RER--LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
             R  LR H       +  F   +NVNC +GF+Y   Q  LKI   P   +YD+ WPV+K
Sbjct: 931 TSRGELRAHRLYAKTIMKGFAPFNNVNCPNGFLYFDEQYELKIAVFPGYLSYDSIWPVRK 990

Query: 998 V 998
           +
Sbjct: 991 I 991


>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
           norvegicus]
 gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
           isoform CRA_a [Rattus norvegicus]
          Length = 1386

 Score =  309 bits (791), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 219/669 (32%), Positives = 345/669 (51%), Gaps = 75/669 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429

Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
            +    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G       
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485

Query: 522 -------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH---------- 555
                  L I        + + + + K    ++V   ELPGC  +WTV            
Sbjct: 486 ENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 545

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
           K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ AG
Sbjct: 546 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAG 604

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           N+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++  
Sbjct: 605 NIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMS 654

Query: 676 SDGSIRLLV 684
           ++G + + +
Sbjct: 655 AEGHVTMFL 663



 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 778  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 833

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 834  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 883

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 884  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 943

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 944  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1003

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1004 PWPVRKI 1010


>gi|74212803|dbj|BAE33365.1| unnamed protein product [Mus musculus]
          Length = 741

 Score =  308 bits (790), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667


>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
          Length = 1439

 Score =  308 bits (789), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 219/671 (32%), Positives = 345/671 (51%), Gaps = 77/671 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429

Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
            +    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G       
Sbjct: 430 WTGGKTQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSE 485

Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
                    L I        + + + + K    ++V   ELPGC  +WTV          
Sbjct: 486 EFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545

Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
             K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ 
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654

Query: 674 GMSDGSIRLLV 684
             ++G + + +
Sbjct: 655 MSAEGHVTMFL 665



 Score =  109 bits (272), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 780  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 836  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 886  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 946  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1006 PWPVRKI 1012


>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
           musculus]
          Length = 1388

 Score =  308 bits (789), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 219/671 (32%), Positives = 344/671 (51%), Gaps = 77/671 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH-------- 555
                    L I        + + + + K    ++V   ELPGC  +WTV          
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545

Query: 556 --KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
             K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ 
Sbjct: 546 TPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654

Query: 674 GMSDGSIRLLV 684
             ++G + + +
Sbjct: 655 MSAEGHVTMFL 665



 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 780  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 836  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 886  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 945

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 946  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1006 PWPVRKI 1012


>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
           [Mus musculus]
 gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 1; AltName: Full=Cleavage and polyadenylation
           specificity factor 160 kDa subunit; Short=CPSF 160 kDa
           subunit
 gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
 gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
          Length = 1441

 Score =  308 bits (788), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667



 Score =  109 bits (273), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1008 PWPVRKI 1014


>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
           [Mus musculus]
          Length = 1450

 Score =  307 bits (787), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667



 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1008 PWPVRKI 1014


>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
           musculus]
          Length = 1417

 Score =  307 bits (787), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 219/673 (32%), Positives = 344/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 56  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 108

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 109 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 161

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 162 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 218

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 219 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 278

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 279 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 338

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 339 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 398

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct: 399 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 456

Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 457 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 512

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH------ 555
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 513 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 572

Query: 556 ----KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
               K+       S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 573 EETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 631

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 632 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 681

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 682 VIMSAEGHVTMFL 694



 Score =  110 bits (274), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 809  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 864

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 865  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 914

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 915  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 974

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 975  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1034

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1035 PWPVRKI 1041


>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex
           subunit cft1 cpsf subunit [Rhipicephalus pulchellus]
          Length = 1237

 Score =  307 bits (786), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 257/862 (29%), Positives = 387/862 (44%), Gaps = 170/862 (19%)

Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
           F+HGY EP ++IL+E   TW GR++ +  TC I ALS++   + HP+IWS  NLP D  +
Sbjct: 3   FLHGYYEPTLLILYEPLRTWPGRIAIRQDTCCILALSLNLQQRVHPVIWSYTNLPFDCLR 62

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL-------PRSSFSVE 363
           LLAVP P+GGVL++  +++ Y +QS         Y VSL+S  +        P+    + 
Sbjct: 63  LLAVPRPLGGVLIMAVDSLLYLNQSVP------PYGVSLNSFTDFSTSFPLKPQEGLKIS 116

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSL 422
           LD A A +L  D  +LS K G+L +LT+  DG R V+     K   SVLT+ +T   +  
Sbjct: 117 LDCAQACFLSYDRLVLSLKGGELYVLTLFNDGMRSVRNFYFDKAAASVLTTSMTLCEDGY 176

Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----- 477
            FLGSRLG+SLL+ +T      +     ++E  + +A+ P +K+ R    DA+ D     
Sbjct: 177 LFLGSRLGNSLLLHYT-EKAAEVDDIAKRDEKTESDANDPPSKKKRM---DAIGDWMASD 232

Query: 478 --MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINAD 527
             +++ +EL +YGS +  T+    +++F V DSL+NIGP      G           N D
Sbjct: 233 VALIDPDELEVYGSETMATKQL-TSYTFEVCDSLINIGPCGKICMGEPAFLSEEFVQNTD 291

Query: 528 -----ASATGISKQSNYELV------------ELPGCKGIWTVY---------HKSSRGH 561
                 +  G  K     ++            ELPGC  +WTV           K+    
Sbjct: 292 PDLELVTTAGYGKNGALCVLQRSVRPQVVTTFELPGCVHMWTVMGPPAEKKPPEKTEESD 351

Query: 562 NADSSRMAAYDD--EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFG 619
           +  S   AA       HA+LI+S    +M+L+T   + E+  S  +  Q  T+ AGNL  
Sbjct: 352 DPASEDKAAEQPLTNTHAFLILSRADSSMILQTDQEINELDHS-GFSTQNPTVFAGNLGD 410

Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
            R V+QV   G R+LDG+   Q +               S++++ S+ADP+V++  ++G 
Sbjct: 411 GRYVLQVCPMGVRLLDGTRQLQHIPL----------DVGSSIVAGSLADPHVIIRSAEGL 460

Query: 680 I--RLLVGDPST-CTVSVQTPAAIESSKKPVSSCT------LYHDKGPEPWLRKTSTDAW 730
           +    L GDP+  C ++V  P       K +S C       L+  +  EP   + +    
Sbjct: 461 VIHLTLRGDPAAGCRLAVLRPQLTAVKAKILSICVYKDVSGLFTTQYREP--DEPAKPEK 518

Query: 731 LSTGVGEAIDGADGGPLDQGD--------------------------------------- 751
                 E+ID +  G LD  D                                       
Sbjct: 519 PLPPPKESIDMSSNGLLDDEDELLYGESEENPIQKEPVRMTSEEAPSVAESMFEIKEVAP 578

Query: 752 -IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEE 809
             +  V  E+G LEI+ +P +   F V  F  G+  +VD+    A   +++E ++  S E
Sbjct: 579 TYWLFVARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHE 638

Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                    +H + VV L ++     HSRP L A + D  +L Y+A+ F           
Sbjct: 639 SM-----PVVHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF----------- 676

Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIF 917
              T R   +          LRF +   D + RE     + P     ++       +  F
Sbjct: 677 -YETQREGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPF 726

Query: 918 KNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
            +ISG+ G FL G RP W  M  R  LR HP   DG I  F   HNVNC  GF++   QG
Sbjct: 727 SDISGYSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQG 786

Query: 977 ILKICQLPSGSTYDNYWPVQKV 998
            L+I  LP+  TYD  WPV+KV
Sbjct: 787 ELRISTLPTHLTYDAPWPVRKV 808


>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Acyrthosiphon pisum]
          Length = 1335

 Score =  306 bits (783), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 279/1005 (27%), Positives = 449/1005 (44%), Gaps = 192/1005 (19%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV   N++ +Y +   +   +  K                   E +  Y L GN+  L 
Sbjct: 30  LVVAGVNILRVYRLVPTDTTCQPPK----------------TKFECLAQYTLFGNIMCL- 72

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
              Q         D+++L+F +AK S++E+D  +H LR  S+H FE  ++   K G    
Sbjct: 73  ---QSVTLCPSSPDALLLSFSEAKFSLVEYDRDMHSLRTLSLHYFEDDKF---KNGHTQH 126

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
              PL++VDP GRC   LVYG   ++L       G   D++        SA++  S+ I 
Sbjct: 127 WSPPLIRVDPDGRCVVGLVYGSYFVVLPF-----GRTIDDN------AKSAQVMPSYTIP 175

Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           +  +D  M ++ DF F+HGY EP ++IL+E   T+AGR++ +  TC + A+S++     H
Sbjct: 176 ISKIDPKMNNIMDFDFLHGYYEPTLLILYEPVKTFAGRIAVRKDTCAMVAISLNIQQHVH 235

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQE 354
           P+IWS  +LP+D  K++AV  PIGGVL++  N++ Y +QS     +ALN+ A +L +   
Sbjct: 236 PVIWSLDSLPYDCQKVIAVSRPIGGVLIMAVNSLIYLNQSVPPFGVALNSIAKTLTNFPL 295

Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTS 413
             +   ++ LD A AT++ +D  + S   GDL ++T+  D  R V+     K   SVLT+
Sbjct: 296 GQQEDINLVLDRATATFISSDKLVTSLCNGDLYVITLYADSMRAVRSFHFEKCASSVLTT 355

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            IT   +S  FLGSRLG+SLL+++   S ++               D PS KR +   +D
Sbjct: 356 CITVCLDSYLFLGSRLGNSLLLRYYARSQSN--------------DDEPSIKRKKTDETD 401

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
             +D+V   EL +YGS    T    +++SF V DS++NIGP    S G    A  S    
Sbjct: 402 --EDLV---ELEVYGSEV-QTSICLESYSFEVCDSIINIGPCSQASIGE--PAYISDEFS 453

Query: 534 SKQSNYELVELP--GCKGIWTVYHKSSRGHNADSSRMAAYDD--------EYHAYLIISL 583
           S + + EL+     G  G  +V H+S +     +  +  Y D        ++H ++I++ 
Sbjct: 454 SDEHDVELLCTSGHGKNGALSVLHRSIKPQLVTTFHLDGYKDMWTVHGENDFHTFMILTN 513

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL 643
              T++L+T   + E+ +S  Y  +  T+   N+   + VIQV     R+L+GS   Q +
Sbjct: 514 VDSTLILQTGQEINEL-DSSGYATREHTVFVCNM--NKFVIQVLRYSVRLLNGSEQLQSV 570

Query: 644 S--FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------RLLVGDPSTCTV 692
           S  FG            S ++  S  +PY +L   DG +         R+L+  P+    
Sbjct: 571 SLDFG------------SPIIHGSSCNPYAVLLTEDGQVIVLTVKSTGRILLMRPTNFEQ 618

Query: 693 SVQTP--------AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
             QT         + + SS  P +   L    GP     K   D ++S  V +  +   G
Sbjct: 619 IPQTKTLAVYRDVSGLFSSTMPQAEIPLV---GP-----KLQHDHFVSDSVEDEEEMLYG 670

Query: 745 GPLDQGD--------------------------IYSVVCYESGALEIFDVPNFNCVFTVD 778
              D                              + V+  ++G +EI+ +P+F       
Sbjct: 671 DARDPSSRETPHNSVSNKNTMWWLKFLEVPTPTYWVVLTRDNGYMEIYTLPDFKI----- 725

Query: 779 KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK-ENIHSMKVVELAMQRWSAHHS 837
                       Y    + +S   +  S EEG    +K E I  + +V L  Q       
Sbjct: 726 -----------KYRAANIDESPMILKDSLEEGCYFPKKTEIIKEILIVPLGYQ-----DK 769

Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
           RP +F  L D  ++ Y  +    PE T K                       +RF +  +
Sbjct: 770 RPIMFVRL-DNEVVIYGIH--RHPEGTLK-----------------------MRFHK--M 801

Query: 898 DAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSRPCWCMV-FRERLRVHPQLCDGS 953
            +    ++  G P +  ++   F  ++GH G F+ G  P   ++  R  LR HP   DG 
Sbjct: 802 TSLLTFQSRSGNPLEGTSLLRYFSKVAGHNGVFICGQNPHLILLTVRGELRCHPLHIDGP 861

Query: 954 IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           I+ F   HNVNC+ GF+Y  S   L+I  LP+  +YD  WP++KV
Sbjct: 862 IMCFAPFHNVNCSQGFLYFNSDHKLRISILPTHLSYDEPWPLRKV 906


>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Monodelphis domestica]
          Length = 1449

 Score =  305 bits (782), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 224/687 (32%), Positives = 354/687 (51%), Gaps = 99/687 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E S +S  + E K    +       LELV  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
              K HP+IWS  NLP D  + LAVP PIGGV++   N++ Y +QS         Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303

Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
           +S    +   P   +    + LD A A ++  D  ++S K G++ +LT++ DG R V+  
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
              K   SVLT+ + T+     FLGSRLG+SLL+++T        S+  +    ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAAREAPSREVSDKD 423

Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
            P  K+ R  S+        A QD V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479

Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
           GP  + + G                L I        + + + + K    ++V   ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539

Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
             +WTV         ++++G  A+   SS     D + H +LI+S E  TM+L+T   + 
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGAEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
           E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +        
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
            S ++  ++ADPYV++  ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 78/264 (29%), Positives = 124/264 (46%), Gaps = 49/264 (18%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V+  ESGA+EI+ +P++  VF V  F  G+  +VD+    +     T+ ++  EE T 
Sbjct: 790  WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L  ++     +RP+L  +  D  +L Y+A+  +             
Sbjct: 846  QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
                   S +  S L+ +RF + P +   RE+ P  +                    R  
Sbjct: 887  -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGTEEGAGARGRVARFR 938

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 939  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998

Query: 975  QGILKICQLPSGSTYDNYWPVQKV 998
            QG L+I  LP+  +YD  WPV+K+
Sbjct: 999  QGELRISVLPAYLSYDAPWPVRKI 1022


>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Pan paniscus]
 gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
           troglodytes]
          Length = 1442

 Score =  305 bits (782), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                 +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668



 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/247 (27%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 783  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 839  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 889  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 949  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1009 PWPVRKI 1015


>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Pan troglodytes]
          Length = 1296

 Score =  305 bits (781), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 220/677 (32%), Positives = 348/677 (51%), Gaps = 86/677 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNEALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                 +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 543 RKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 601

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 602 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 651

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 652 DPYVVIMSAEGHVTMFL 668



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 49/141 (34%), Positives = 66/141 (46%), Gaps = 5/141 (3%)

Query: 860 GPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA-PCQRITIFK 918
           G E +   DD        S S  S S+    R S+ P D   R+  P  A P     + +
Sbjct: 732 GSETSPTVDDEEEMLYGDSGSLFSPSKEEARRSSQPPAD---RDPAPFRAEPTHWCLLVR 788

Query: 919 NISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
                   F+ G  P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG 
Sbjct: 789 ENGTMXXXFICGPSPPWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGE 848

Query: 978 LKICQLPSGSTYDNYWPVQKV 998
           L+I  LP+  +YD  WPV+K+
Sbjct: 849 LRISVLPAYLSYDAPWPVRKI 869


>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
           sapiens]
 gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 1; AltName: Full=Cleavage and polyadenylation
           specificity factor 160 kDa subunit; Short=CPSF 160 kDa
           subunit
 gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo
           sapiens]
 gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_c [Homo sapiens]
 gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
           construct]
 gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
           construct]
 gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
           construct]
          Length = 1443

 Score =  305 bits (781), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                 +    G   + S     DD+   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669



 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/247 (27%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 784  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 840  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 890  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 950  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1010 PWPVRKI 1016


>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like isoform 1 [Equus caballus]
          Length = 1444

 Score =  305 bits (781), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 219/675 (32%), Positives = 350/675 (51%), Gaps = 80/675 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           + + T+     FLGSRLG+SLL+++T        +S ++E     E + P +K+ R  S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430

Query: 473 -------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
                     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G   
Sbjct: 431 VGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486

Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
                        L I        + + + + K    ++V   ELPGC  +WTV      
Sbjct: 487 FLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 546

Query: 555 --HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
              ++ +G   +   S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG
Sbjct: 547 EQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655

Query: 670 YVLLGMSDGSIRLLV 684
           YV++  ++G + + +
Sbjct: 656 YVVIMSAEGHVTMFL 670



 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 785  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 841  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 891  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 950

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 951  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1010

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1011 PWPVRKI 1017


>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
          Length = 1442

 Score =  305 bits (780), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 219/678 (32%), Positives = 348/678 (51%), Gaps = 87/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 542

Query: 555 ------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                 +    G   + S     DD+   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 543 RKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 601

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 602 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 651

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 652 ADPYVVIMSAEGHVTMFL 669



 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/247 (27%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 784  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 839

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 840  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 889

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 890  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 950  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1010 PWPVRKI 1016


>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Sarcophilus harrisii]
          Length = 1449

 Score =  305 bits (780), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 224/687 (32%), Positives = 353/687 (51%), Gaps = 99/687 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E S +S  + E K    +       LELV  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNHDAETSTKSDRNAEGK----LHKEHKEKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQKSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
              K HP+IWS  NLP D  + LAVP PIGGV++   N++ Y +QS         Y VSL
Sbjct: 250 ILQKVHPVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSL 303

Query: 350 DS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRL 401
           +S    +   P   +    + LD A A ++  D  ++S K G++ +LT++ DG R V+  
Sbjct: 304 NSLTAGTTAFPLRMQDGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSF 363

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI-EAD 460
              K   SVLT+ + T+     FLGSRLG+SLL+++T        SS  +    ++ + D
Sbjct: 364 HFDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSAREAPSREVSDKD 423

Query: 461 APSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
            P  K+ R  S+        A QD V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 424 EPPVKKKRVESTLGWAGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 479

Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
           GP  + + G                L I        + + + + K    ++V   ELPGC
Sbjct: 480 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 539

Query: 548 KGIWTVY-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
             +WTV         ++++G   +   SS     D + H +LI+S E  TM+L+T   + 
Sbjct: 540 YDMWTVIAPLRKEEDETTKGEGPEQEPSSPETEDDGKRHGFLILSREDSTMILQTGQEIM 599

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
           E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +        
Sbjct: 600 ELDTS-GFATQGPTVYAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL------- 648

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLV 684
            S ++  ++ADPYV++  ++G + + +
Sbjct: 649 GSPIVQCAVADPYVVIMSAEGHVTMFL 675



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 78/264 (29%), Positives = 124/264 (46%), Gaps = 49/264 (18%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V+  ESGA+EI+ +P++  VF V  F  G+  +VD+    +     T+ ++  EE T 
Sbjct: 790  WCVLVRESGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPATQGDTKKEEVTR 845

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L  ++     +RP+L  +  D  +L Y+A+  +             
Sbjct: 846  QGELPLVKEVLLVALGNRQ-----TRPYLL-VHVDQELLIYEAFAHD------------- 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
                   S +  S L+ +RF + P +   RE+ P  +                    R  
Sbjct: 887  -------SQLGQSNLK-VRFKKVPHNINFREKKPKPSKKKPEGGGAEEGAGARGRVARFR 938

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 939  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 998

Query: 975  QGILKICQLPSGSTYDNYWPVQKV 998
            QG L+I  LP+  +YD  WPV+K+
Sbjct: 999  QGELRISVLPAYLSYDAPWPVRKI 1022


>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
           mulatta]
          Length = 1428

 Score =  305 bits (780), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 221/679 (32%), Positives = 347/679 (51%), Gaps = 90/679 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424

Query: 468 RRSSS------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
             ++S         QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + 
Sbjct: 425 DATASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAM 480

Query: 521 G----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY- 554
           G                L I        + + + + K    ++V   ELPGC  +WTV  
Sbjct: 481 GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA 540

Query: 555 --------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDY 605
                   +    G   ++    A DD   H +LI+S E  TM+L+T   + E+  S  +
Sbjct: 541 PVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GF 599

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
             QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  +
Sbjct: 600 ATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCA 649

Query: 666 IADPYVLLGMSDGSIRLLV 684
           +ADPYV++  ++G + + +
Sbjct: 650 VADPYVVIMSAEGHVTMFL 668



 Score =  107 bits (267), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 115/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 783  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 839  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 889  VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 949  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1009 PWPVRKI 1015


>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
           taurus]
 gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 1; AltName: Full=Cleavage and polyadenylation
           specificity factor 160 kDa subunit; Short=CPSF 160 kDa
           subunit
 gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit
           [Bos taurus]
 gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
           taurus]
          Length = 1444

 Score =  305 bits (780), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 221/678 (32%), Positives = 345/678 (50%), Gaps = 86/678 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T  +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 309

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 369

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  D E      KR+
Sbjct: 370 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRV 427

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 428 DATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 484 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 543

Query: 557 SSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
             +         G   +     A DD   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 544 VRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFA 602

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++
Sbjct: 603 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAV 652

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 653 ADPYVVIMSAEGHVTMFL 670



 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 116/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 785  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 841  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 890

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 891  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 950

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 951  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1010

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1011 PWPVRKI 1017


>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
           mulatta]
 gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
           mulatta]
 gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
           mulatta]
          Length = 1442

 Score =  304 bits (778), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 221/679 (32%), Positives = 347/679 (51%), Gaps = 90/679 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424

Query: 468 RRSSS------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
             ++S         QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + 
Sbjct: 425 DATASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAM 480

Query: 521 G----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY- 554
           G                L I        + + + + K    ++V   ELPGC  +WTV  
Sbjct: 481 GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA 540

Query: 555 --------HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDY 605
                   +    G   ++    A DD   H +LI+S E  TM+L+T   + E+  S  +
Sbjct: 541 PVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GF 599

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
             QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  +
Sbjct: 600 ATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCA 649

Query: 666 IADPYVLLGMSDGSIRLLV 684
           +ADPYV++  ++G + + +
Sbjct: 650 VADPYVVIMSAEGHVTMFL 668



 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 115/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 783  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 838

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 839  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 888

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 889  VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 948

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 949  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1008

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1009 PWPVRKI 1015


>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
          Length = 1260

 Score =  303 bits (777), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 281/1017 (27%), Positives = 447/1017 (43%), Gaps = 159/1017 (15%)

Query: 57  NLVVTAANVIEIY--VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
           NLVV  ANV+ ++  +    E+  ++    G+  +           LE +  Y L G V 
Sbjct: 29  NLVVAGANVLRVFRLIPNTDEKMLRKESADGQPPK---------MKLECLASYNLFGKVM 79

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
           S+A +S  G+     +D+I+++F  AK+S++E+D     L+  S+H FE    L    G 
Sbjct: 80  SIAAVSLPGSS----QDTILMSFAHAKLSLIEYDPVSDNLKTLSLHNFEVVSIL--DEGI 133

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE-SS 233
            S  + P ++VDP+GRC  +L++   + IL               F       + +  SS
Sbjct: 134 GSNHKIPEIRVDPEGRCAALLIFRNTLAILP--------------FRKDSAHDSNVTLSS 179

Query: 234 HVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
           ++I L DL+ +  +V D  F+HGY EP ++IL+E   T+ GR++ +  TC + A+S++T 
Sbjct: 180 YIIKLTDLEERVDNVIDVQFLHGYYEPTLIILYEPVGTFPGRIAVRQDTCNMVAVSLNTQ 239

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLD 350
            + HP+IWS  +LP D  +LL VP P+GG L++  N++ Y +QS     +++N+ A    
Sbjct: 240 QRVHPIIWSLNSLPFDCSQLLPVPKPLGGALIMAVNSVIYVNQSVPPYGVSVNSIADHCT 299

Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPS 409
           S    P     + LD A A +LQ D  +LS K G+L +LT+  D  R V++  L K   S
Sbjct: 300 SFPLKPYEGSRIGLDCARAAFLQYDRVVLSLKGGELYVLTLFADSMRSVRKFHLEKAAAS 359

Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
           VLT+ +    N LF LGSRLG+SLL+ F         +    +      A  P  ++   
Sbjct: 360 VLTTCLCICDNYLF-LGSRLGNSLLLAFQ--------TKDYNQYATPFAAKKPKMEQFSL 410

Query: 470 SSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
                L D ++ EE+   LYG    +T+S   ++ F V DSL+NIGP    + G      
Sbjct: 411 LFDQEL-DHLDEEEIDNYLYGEDHESTDSKAISYQFEVCDSLLNIGPCGQMAVG---EPA 466

Query: 528 ASATGISKQSNYELVELP-----GCKGIWTVYHKSSRGHNADSSRMAAYDDEY------- 575
           ++ T   K+S    VE+      G  G   V  ++ +     +  +    D +       
Sbjct: 467 STCTDFDKKSPDPDVEIVTTSGYGKNGAICVLQRTMKPQVVTTFELPEVSDMFTVFASRN 526

Query: 576 ------HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
                 H YL++S    TMVL+T   + E+ +S  + V   TI A NL   R ++QV   
Sbjct: 527 NEDAIMHTYLLLSRADSTMVLQTGQEINEMDQS-GFSVTSPTILAANLGNNRFIVQVCPT 585

Query: 630 GARILDGS-YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV----------LLGMSDG 678
             R+LD +  + Q+L              +  + S S +DPYV          LL   +G
Sbjct: 586 SVRLLDATATVIQELVM----------DSDFLITSASASDPYVAVLTENGRIGLLTFVEG 635

Query: 679 SIRLLV------GDPSTCTVSVQTPAAIESSKKPVS---SCTLYHDKGPEPWLRKTSTDA 729
           S   ++        P  C    +  + + ++  P +     T  H        +K   D 
Sbjct: 636 SQLEMIFPVLSKNSPVVCVCLYRDISGLFNTTIPETDSPETTKLHTANKSLNAKKEMDDE 695

Query: 730 ----WLSTGVGEAIDGADGGPLDQGDIYSVVCY--------------ESGALEIFDVPNF 771
               +  T   E+    D           V+ Y              ++G LEI+ +   
Sbjct: 696 EDYLYGDTNTEESRPTEDKTHTKFTPQQKVIDYFREIKPTFWLSIIRQNGTLEIYSLAGQ 755

Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
           +    V+ F +   H+    +   +K  ET + SS+                +VE+ +  
Sbjct: 756 S---VVETFQTVHVHLGHRLIFN-MKADETSLPSSTH-------------CNIVEMGIFG 798

Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRN-- 889
               H RP L    +D  +L Y+A              PV  S+  +   +   +L +  
Sbjct: 799 LGHLHRRPLLMIRTSDFGVLLYEAI----------PALPVYDSKQKNELKIRFRKLNHSL 848

Query: 890 -LRFSRTPLDAYTREE------TPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRE 941
            LR ++T    Y R+        P+     +   F NI+G+ G F+ G  P W  M  R 
Sbjct: 849 LLRETKT----YVRKGGQSVVLEPYAWKTNQFKYFSNIAGYTGVFIGGPYPHWLFMTSRG 904

Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            LR+HP   DGSI  F   HNVNC  GFIY+  +  L+IC LP+   YD  WPV+KV
Sbjct: 905 ELRLHPMSIDGSIKCFACFHNVNCAQGFIYLNRKDELRICLLPTLFNYDAPWPVRKV 961


>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 2 [Cricetulus griseus]
          Length = 1388

 Score =  303 bits (776), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 77/671 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  ++ 
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429

Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 ---------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------HK 556
                    L I        + + + + K    ++V   ELPGC  +WTV         +
Sbjct: 486 SEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEE 545

Query: 557 SSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
           + R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ 
Sbjct: 546 APRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVF 604

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV++
Sbjct: 605 AGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVI 654

Query: 674 GMSDGSIRLLV 684
             ++G + + +
Sbjct: 655 MSAEGHVTMFL 665



 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 780  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 835

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 836  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 885

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 886  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 945

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 946  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1005

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1006 PWPVRKI 1012


>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
           [Heterocephalus glaber]
          Length = 1440

 Score =  303 bits (776), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 220/678 (32%), Positives = 347/678 (51%), Gaps = 89/678 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEGLTKNDKTTEGKSHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 424

Query: 468 RRSSSDA-----LQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             ++  A      QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 425 DSAAGWAGNKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 480

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH- 555
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 481 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 540

Query: 556 --------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                     + G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 541 VRKEEEETPKAEGSEQEPSAPEAQDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 599

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 600 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 649

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 650 ADPYVVIMSAEGHVTMFL 667



 Score =  110 bits (276), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 70/246 (28%), Positives = 115/246 (46%), Gaps = 14/246 (5%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SSGQPTTQGEARKEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
            P W +V    LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD  
Sbjct: 948  PHWLLVTGRGLRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAP 1007

Query: 993  WPVQKV 998
            WPV+K+
Sbjct: 1008 WPVRKI 1013


>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Equus caballus]
          Length = 1450

 Score =  303 bits (775), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 219/681 (32%), Positives = 350/681 (51%), Gaps = 86/681 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRNAEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           + + T+     FLGSRLG+SLL+++T        +S ++E     E + P +K+ R  S+
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---AEKEEPPSKKKRVDST 430

Query: 473 -------------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
                           QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + 
Sbjct: 431 VGWSGSPRAAGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANA 486

Query: 519 SYG----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
           + G                L I        + + + + K    ++V   ELPGC  +WTV
Sbjct: 487 AMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 546

Query: 554 Y-------HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
                    ++ +G   +   S+  A  D   H +LI+S E  TM+L+T   + E+  S 
Sbjct: 547 IAPVRKEQEETPKGEGTEQEPSAPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS- 605

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
            +  QG T+ AGN+   R ++QV   G R+L+G      L F P +         S ++ 
Sbjct: 606 GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQ 655

Query: 664 VSIADPYVLLGMSDGSIRLLV 684
            ++ADPYV++  ++G + + +
Sbjct: 656 CAVADPYVVIMSAEGHVTMFL 676



 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 791  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 846

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 847  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 896

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 897  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGVGARGRVARFRYFEDIYGYSGVFICGPS 956

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 957  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1016

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1017 PWPVRKI 1023


>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 1 [Cricetulus griseus]
          Length = 1441

 Score =  303 bits (775), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  ++ 
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429

Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
            ++ R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667



 Score =  109 bits (272), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1008 PWPVRKI 1014


>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 3 [Cricetulus griseus]
          Length = 1449

 Score =  302 bits (774), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 217/673 (32%), Positives = 347/673 (51%), Gaps = 79/673 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS- 471
           + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  ++ 
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTAG 429

Query: 472 ----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                   QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G     
Sbjct: 430 WTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFL 485

Query: 522 -----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY------- 554
                      L I        + + + + K    ++V   ELPGC  +WTV        
Sbjct: 486 SEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEE 545

Query: 555 HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
            ++ R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T
Sbjct: 546 EEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPT 604

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           + AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV
Sbjct: 605 VFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYV 654

Query: 672 LLGMSDGSIRLLV 684
           ++  ++G + + +
Sbjct: 655 VIMSAEGHVTMFL 667



 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 887

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 888  VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 947

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 948  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1007

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1008 PWPVRKI 1014


>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Canis lupus familiaris]
          Length = 1460

 Score =  301 bits (770), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 212/630 (33%), Positives = 331/630 (52%), Gaps = 72/630 (11%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 84  KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 139

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L   +     + +E  
Sbjct: 140 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHE 193

Query: 220 FGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
              G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +
Sbjct: 194 GLMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVR 253

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
             TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS  
Sbjct: 254 QDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP 313

Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG- 395
              +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ DG 
Sbjct: 314 PYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGM 373

Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
           R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  
Sbjct: 374 RSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAA--REAA 431

Query: 456 DIEADAPSTKRLRRSS-----SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSL 509
           D E      KR+  ++         QD V+  E+ +YGS A + T+ A  T+SF V DS+
Sbjct: 432 DKEEPPSKKKRVDCAAGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSI 487

Query: 510 VNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV---EL 544
           +NIGP  + + G                L I        + + + + K    ++V   EL
Sbjct: 488 LNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFEL 547

Query: 545 PGCKGIWTVY-------HKSSRGHNA--DSSRMAAYDD-EYHAYLIISLEARTMVLETAD 594
           PGC  +WTV         ++S+G  A  +SS + A DD   H +LI+S E  TM+L+T  
Sbjct: 548 PGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQTGQ 607

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
            + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +     
Sbjct: 608 EIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL---- 659

Query: 655 GSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
               S ++  ++ADPYV++  ++G + + +
Sbjct: 660 ---GSPIVQCAVADPYVVIMSAEGHVTMFL 686



 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 801  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 856

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 857  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 906

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 907  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 966

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 967  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1026

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1027 PWPVRKI 1033


>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Felis catus]
          Length = 1432

 Score =  300 bits (768), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 211/632 (33%), Positives = 334/632 (52%), Gaps = 76/632 (12%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 56  KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 111

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L   +     + +E  
Sbjct: 112 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHE 165

Query: 220 FGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
              G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +
Sbjct: 166 GLMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVR 225

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
             TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS  
Sbjct: 226 QDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP 285

Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG- 395
              +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ DG 
Sbjct: 286 PYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGM 345

Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
           R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        +S ++E   
Sbjct: 346 RSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA-- 402

Query: 456 DIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
             + + P +K+ R  S+          QD V+  E+ +YGS A + T+ A  T+SF V D
Sbjct: 403 -ADKEEPPSKKKRVDSTVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCD 457

Query: 508 SLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV--- 542
           S++NIGP  + + G                L I        + + + + K    ++V   
Sbjct: 458 SILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTF 517

Query: 543 ELPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLET 592
           ELPGC  +WTV         ++S+G  A+   S + A DD   H +LI+S E  TM+L+T
Sbjct: 518 ELPGCYDMWTVIAPVRKEQEETSKGEGAEQEPSTLEAEDDGRRHGFLILSREDSTMILQT 577

Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
              + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +   
Sbjct: 578 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-- 631

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                 S ++  ++ADPYV++  ++G + + +
Sbjct: 632 -----GSPIVQCAVADPYVVIMSAEGHVTMFL 658



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 79/264 (29%), Positives = 122/264 (46%), Gaps = 49/264 (18%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G LEI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 773  WCLLVRENGTLEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 828

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 829  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 871

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
                L   N+       +RF + P +   RE               E   GA  +  R  
Sbjct: 872  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 921

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 922  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 981

Query: 975  QGILKICQLPSGSTYDNYWPVQKV 998
            QG L+I  LP+  +YD  WPV+K+
Sbjct: 982  QGELRISVLPAYLSYDAPWPVRKI 1005


>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
 gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
          Length = 1434

 Score =  299 bits (766), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 283/1050 (26%), Positives = 477/1050 (45%), Gaps = 179/1050 (17%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV   ANV+++Y  RV  +    +++     R   M       LE V  YRL+GN++S+
Sbjct: 29   SLVTGGANVLKVY--RVIPDADPATRDKYTAARPPNM------KLECVASYRLNGNIKSM 80

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  G+     RD+++++F DAK+SV++FD     L+  S+H FE  +   ++ G   
Sbjct: 81   QSVSLAGS----LRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 133

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR--I 230
                P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +     A+  I
Sbjct: 134  HYHIPMVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIELQDVKPIKKAPMQLVAKTPI 193

Query: 231  ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             +S++I L+DLD K  +V D  F+HGY EP ++IL+E   T+ GR++ +  TC + ALS+
Sbjct: 194  LASYIIELKDLDEKIDNVIDIQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 253

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
            +   + HP+IW+  +LP D  + + +  PIGG LV+  N++ Y +QS         Y VS
Sbjct: 254  NIQQRVHPVIWTVNSLPFDCIQAIPINKPIGGCLVMCVNSLIYLNQSVP------PYGVS 307

Query: 349  LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQR 400
            L+SS +        P+    + LDAA   +++ +  +LS K G+L +LT+  D  R V+ 
Sbjct: 308  LNSSADHSTSFPLKPQDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRN 367

Query: 401  LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
               +K   SVLTS I    +   FLGSRLG+SLL++F     + +++    ++ G +E +
Sbjct: 368  FHFNKAAASVLTSCICVCEDEYLFLGSRLGNSLLLRFKEKDESLVITI---DDSGAVEKE 424

Query: 461  APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                KR R    +            +YGS    T     ++ F V D+++NIGP+   + 
Sbjct: 425  P---KRPRLEEEEL----------EVYGSGY-KTSVQLTSYIFEVCDNVLNIGPIAHMAV 470

Query: 521  GLRINADASATG-----ISKQSNYELVE--------------------------LPGCKG 549
            G R+  + +        +  + + E+V                           L GC  
Sbjct: 471  GERVAEEDAENQPDVQIVQNKLDIEVVTSSGHGKNGALCVLQSSIKPQVITSFGLSGCVD 530

Query: 550  IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
            +WTV+ ++        +R A      HA++I+S E  TMVL+T + + E+ E+  +    
Sbjct: 531  VWTVFDEA-------VARRAEDGPSTHAFMILSQEGGTMVLQTGEEINEI-ENTGFATTV 582

Query: 610  RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
             TI  GN+   R ++QV  +  R+L G+ + Q++                 + SV+I DP
Sbjct: 583  PTIHVGNIGTNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVAIVDP 632

Query: 670  YVLLGMSDGSIRLLV-----GDPSTC----TVSVQTPAAIE-SSKKPVSSCTLYHDK--- 716
            YV +  S+G +  L      G P       T+S  TPA +  S+ + VS   L+  K   
Sbjct: 633  YVCVRSSEGRVITLALREGKGTPRLAVNKNTIS-PTPAVVAISAYRDVSG--LFTKKIED 689

Query: 717  --------------------GPEPWLRKTSTDAWLSTGVGE----------AIDGADGGP 746
                                 PEP ++    +  L    G           AI G  GG 
Sbjct: 690  VYDLSRGGAASAYSSGFGSMKPEPHMKIEDEEDLLYGESGRSFKMTSMADMAIAGKSGGS 749

Query: 747  LD---------QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDT--YMREA 795
             D         +   +     ++G LEI+ +P+   V+ +    +G   + D+  ++   
Sbjct: 750  ADFWMKYMQQVKPTYWLFAARDNGTLEIYSMPDLKLVYLITNVGNGNKVLSDSMEFVPLP 809

Query: 796  LKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQA 855
            +  S ++ ++SS  G   G   ++   +++ +A+    ++ SRP LF I  +  +L Y+ 
Sbjct: 810  MGKSASQEDASSAFGASFGVSASLLPKEILMVAL---GSYGSRPLLF-IRLEHDLLIYRV 865

Query: 856  YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP----- 910
            + +      SK    +   R LS S V+    R    S         E+    A      
Sbjct: 866  FRY------SKGHLKLRFKR-LSTS-VTCPVFRTPEPSGAGATEAANEQQQARATKVLYE 917

Query: 911  -CQRITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHG 968
                I  F N+SG+ G  + G +P +  +     LR H       + AF   +NVNC +G
Sbjct: 918  NISMIRYFANVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFNNVNCPNG 977

Query: 969  FIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            F+Y   Q  LKI   P+  +YD+ WPV+K+
Sbjct: 978  FLYFDEQYELKISIFPTYLSYDSVWPVRKI 1007


>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Otolemur garnettii]
          Length = 1441

 Score =  299 bits (766), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 217/678 (32%), Positives = 344/678 (50%), Gaps = 89/678 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEVLTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASATRESADKEEPPSKKKRV 424

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             S          QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 425 DPSVGWSGGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 480

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-- 554
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 481 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAS 540

Query: 555 -----HKSSRGHNADSSRMAAYDDE---YHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                 ++ +G   +        +E    H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 541 VRKEEEETPKGEGTEQESGVPEGEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 599

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 600 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 649

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 650 ADPYVVIMSAEGHVTMFL 667



 Score =  110 bits (276), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 78/264 (29%), Positives = 122/264 (46%), Gaps = 49/264 (18%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 782  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 837

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 838  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 880

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
                L   N+       +RF + P +   RE              T  GA  +    R  
Sbjct: 881  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGVRGRVARFR 930

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 931  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 990

Query: 975  QGILKICQLPSGSTYDNYWPVQKV 998
            QG L+I  LP+  +YD  WPV+K+
Sbjct: 991  QGELRISVLPAYLSYDAPWPVRKI 1014


>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex
           subunit cft1 cpsf subunit [Desmodus rotundus]
          Length = 1444

 Score =  299 bits (765), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 216/675 (32%), Positives = 347/675 (51%), Gaps = 80/675 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRSTEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 194

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTSGTTAFP 314

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +      LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 315 LRTQEGVRTTLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAAASVLT 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           S + T+     FLGSRLG+SLL+++T        +S ++E     + + PS+K+ R   +
Sbjct: 375 SSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEAPASAVREA---ADKEEPSSKKKRVDPT 430

Query: 473 -------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--- 521
                     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G   
Sbjct: 431 VGWSGGQSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGEPA 486

Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
                        L I        + + + + K    ++V   ELPGC  +WTV      
Sbjct: 487 FLSEEFQNSPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVVAPVRK 546

Query: 555 --HKSSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
              ++ +G   +   +      D   H +LI+S E  TM+L+T   + E+  S  +  QG
Sbjct: 547 EQEETPKGEGTEQEPITPETEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 605

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++ADP
Sbjct: 606 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVADP 655

Query: 670 YVLLGMSDGSIRLLV 684
            V++  ++G + + +
Sbjct: 656 CVVIMSAEGHVAMFL 670



 Score =  107 bits (266), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 75/264 (28%), Positives = 120/264 (45%), Gaps = 49/264 (18%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 785  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 840

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+  +             
Sbjct: 841  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAFAHD------------- 881

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP-----------------CQRIT 915
                   S +    L+ +RF + P +   RE+ P  +                    R  
Sbjct: 882  -------SQLGQGNLK-VRFKKVPHNINFREKKPKPSKKKADGGGAEEGAGARGRVARFR 933

Query: 916  IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
             F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   
Sbjct: 934  YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNR 993

Query: 975  QGILKICQLPSGSTYDNYWPVQKV 998
            QG L+I  LP+  +YD  WPV+K+
Sbjct: 994  QGELRISVLPAYLSYDAPWPVRKI 1017


>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1
           [Cricetulus griseus]
          Length = 1419

 Score =  298 bits (764), Expect = 7e-78,   Method: Compositional matrix adjust.
 Identities = 217/671 (32%), Positives = 347/671 (51%), Gaps = 82/671 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN G T+ +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE  E   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ 353
           HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA--DAPSTKRLRRS 470
           + + T+     FLGSRLG+SLL+++T        SS ++E      A  + P +K+ R  
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS-VREAADKASAHNEEPPSKKKRVD 430

Query: 471 SSDAL-------QDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
            +          QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 431 PTAGWTGGKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGE 486

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY--- 554
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 487 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 546

Query: 555 ----HKSSRGHNAD---SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
                ++ R  + +   ++  A  D   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 547 RKEEEEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFAT 605

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++A
Sbjct: 606 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVA 655

Query: 668 DPYVLLGMSDG 678
           DPYV++  ++G
Sbjct: 656 DPYVVIMSAEG 666



 Score =  109 bits (272), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 760 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 815

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 816 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 865

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                   N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 866 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPS 925

Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 926 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 985

Query: 992 YWPVQKV 998
            WPV+K+
Sbjct: 986 PWPVRKI 992


>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1-like [Ailuropoda
           melanoleuca]
          Length = 1469

 Score =  298 bits (762), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 210/632 (33%), Positives = 333/632 (52%), Gaps = 76/632 (12%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 103 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 158

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L   +     + +E  
Sbjct: 159 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHE 212

Query: 220 FGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
              G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +
Sbjct: 213 GLMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVR 272

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
             TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS  
Sbjct: 273 QDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP 332

Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG- 395
              +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ DG 
Sbjct: 333 PYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGM 392

Query: 396 RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
           R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        +S ++E   
Sbjct: 393 RSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA-- 449

Query: 456 DIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
             + + P +K+ R  S+          QD V+  E+ +YGS A + T+ A  T+SF V D
Sbjct: 450 -ADKEEPPSKKKRVDSTVGWSGGKSMPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCD 504

Query: 508 SLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV--- 542
           S++NIGP  + + G                L I        + + + + K    ++V   
Sbjct: 505 SILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTF 564

Query: 543 ELPGCKGIWTVY-------HKSSRGHNADS--SRMAAYDD-EYHAYLIISLEARTMVLET 592
           ELPGC  +WTV         ++ +G  A+   S + A DD   H +LI+S E  TM+L+T
Sbjct: 565 ELPGCYDMWTVIAPVRKEQEETPKGEGAEQEPSALEADDDGRRHGFLILSREDSTMILQT 624

Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
              + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +   
Sbjct: 625 GQEIMELDTS-GFATQGPTVFAGNIGDSRYIVQVSPLGIRLLEG---VNQLHFIPVDL-- 678

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                 S ++  ++ADPYV++  ++G + + +
Sbjct: 679 -----GSPIVQCAVADPYVVIMSAEGHVTMFL 705



 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 78/271 (28%), Positives = 122/271 (45%), Gaps = 56/271 (20%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 820  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 875

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 876  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 918

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
                L   N+       +RF + P +   RE               E   GA  +  R  
Sbjct: 919  ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKVEGGSAEEGAGARGRVARFR 968

Query: 916  IFKNISGHQG-------FFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNH 967
             F++I G+ G        F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  
Sbjct: 969  YFEDIYGYSGGGGACPQVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPR 1028

Query: 968  GFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            GF+Y   QG L+I  LP+  +YD  WPV+K+
Sbjct: 1029 GFLYFNRQGELRISVLPAYLSYDAPWPVRKI 1059


>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Apis florea]
          Length = 1583

 Score =  298 bits (762), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 210/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV  AN+I ++ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30  LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82  AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
              P+V+VDP+GRC  +L+YG ++++L   +  S   GD             I SS++I 
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194

Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           P+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  E 
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308

Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
                  P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K  
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKR 466
            SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   E    + E +    K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIVLEENETEETPAKK 428

Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +++       +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S 
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486

Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
           G           N D      + +G  K     ++            ELPGC+ +WTV  
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
               G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           NL   R ++QV + G R+L G    Q +                 ++  S ADPYV L  
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650

Query: 676 SDGSIRLL 683
            DG + LL
Sbjct: 651 EDGQVMLL 658



 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 69/250 (27%), Positives = 105/250 (42%), Gaps = 40/250 (16%)

Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
           +V  +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E      
Sbjct: 772 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825

Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                  M+V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 826 -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868

Query: 875 RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
                      R + L     P  L    R+E        R   +  F NI+G+ G F+ 
Sbjct: 869 -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917

Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
              P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 918 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977

Query: 989 YDNYWPVQKV 998
           YD  WPV+KV
Sbjct: 978 YDAPWPVRKV 987


>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
           putorius furo]
          Length = 1399

 Score =  298 bits (762), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 210/636 (33%), Positives = 329/636 (51%), Gaps = 81/636 (12%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 21  KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 76

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVL----VYGLQMIILKASQGGSGLVG 215
           H FE PE   L+ G       P V+VDP GRC  +L    +YG ++++L   +     + 
Sbjct: 77  HYFEEPE---LRDGFVQNVHAPRVRVDPDGRCAAMLTAMLIYGSRLVVLPFRRES---LA 130

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
           +E     G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GR
Sbjct: 131 EEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGR 190

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           V+ +  TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +
Sbjct: 191 VAVRQDTCCIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLN 250

Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
           QS     +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++
Sbjct: 251 QSVPPYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLI 310

Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++     T  L     
Sbjct: 311 TDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY-----TEKLQEAPA 365

Query: 452 EEFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSF 503
               + + D P +K+ R  S+        A QD V+  E+ +YGS A + T+ A  T+SF
Sbjct: 366 GAVRETDKDEPPSKKKRVESAVGWSGGKSAPQDEVD--EIEVYGSEAQSGTQLA--TYSF 421

Query: 504 AVRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYEL 541
            V DS++NIGP  + + G                L I        + + + + K    ++
Sbjct: 422 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 481

Query: 542 V---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTM 588
           V   ELPGC  +WTV   + +         G   + S + A DD   H +LI+S E  TM
Sbjct: 482 VTTFELPGCYDMWTVIAPARKEQEETPKGDGAEQEPSALEADDDGRRHGFLILSREDSTM 541

Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPS 648
           +L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P 
Sbjct: 542 ILQTGQEIMELDTS-GFATQGPTVFAGNIGDGRYIVQVSPLGIRLLEG---VSQLHFIPV 597

Query: 649 NSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
           +         S ++  ++ADPYV++  ++G + + +
Sbjct: 598 DL-------GSPIVQCAVADPYVVIMSAEGHVTMFL 626



 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 741 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 796

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 797 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 846

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                   N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 847 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAAARGRVARFRYFEDIYGYSGVFICGPS 906

Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 907 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 966

Query: 992 YWPVQKV 998
            WPV+K+
Sbjct: 967 PWPVRKI 973


>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
           [Drosophila melanogaster]
 gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
 gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
           [Drosophila melanogaster]
          Length = 1420

 Score =  296 bits (759), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 287/1054 (27%), Positives = 462/1054 (43%), Gaps = 200/1054 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29  NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
           L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83  LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
           I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+             
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRI------------- 242

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                 K+  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 243 ----------------------KVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274

Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
           SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 275 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334

Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 335 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 394

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
             L++E  ++E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 395 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 453

Query: 508 SLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------N 538
           SL+N+ P+     G R+  +                     +ATG SK           N
Sbjct: 454 SLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFVNCIN 513

Query: 539 YELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
            +++   EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   
Sbjct: 514 PQIITSFELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQE 564

Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
           + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++            
Sbjct: 565 INEI-ENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI---------- 613

Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
              S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D
Sbjct: 614 DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKD 673

Query: 716 -------KG----------------------PEPWLRKTSTDAWLSTGVGEAI------D 740
                  KG                       EP ++    +  L    G A       D
Sbjct: 674 LSGLFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMAD 733

Query: 741 GADGGPLDQGD------------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
            A        D             + VV  +SG LEI+ +P+   V+ V+   +G   + 
Sbjct: 734 LAKQSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLT 793

Query: 789 DTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
           D      +  +  E   +S+ G  Q    ++ +S   +EL++     +  RP L  + T 
Sbjct: 794 DAMEFVPISLTTQE---NSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTR 849

Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
             +L YQ  +F  P+   K        R +   N+   +  ++       D     E+  
Sbjct: 850 VELLIYQ--VFRYPKGHLK-----IRFRKMDQLNLLDQQPTHIDLDEN--DEQEEIESYQ 900

Query: 908 GAP--CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVN 964
             P   Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN
Sbjct: 901 MQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVN 960

Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             +GF+Y  +   LKI  LPS  +YD+ WPV+KV
Sbjct: 961 IPNGFLYFDTTYELKISVLPSYLSYDSVWPVRKV 994


>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Bos taurus]
          Length = 1490

 Score =  295 bits (756), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 209/635 (32%), Positives = 327/635 (51%), Gaps = 82/635 (12%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+
Sbjct: 114 KLELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 169

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLV 214
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L       ++   GLV
Sbjct: 170 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLV 226

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAG 272
           G+        G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW G
Sbjct: 227 GE--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPG 278

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           +V+ +  TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y 
Sbjct: 279 KVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYL 338

Query: 333 SQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
           +QS     +ALN+      +     +    + LD A A ++  D  ++S K G++ +LT+
Sbjct: 339 NQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTL 398

Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
           + DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        S+  
Sbjct: 399 ITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA- 457

Query: 451 KEEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
             E  D E      KR+  +     S    QD V+  E+ +YGS A + T+ A  T+SF 
Sbjct: 458 -REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 512

Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
           V DS++NIGP  + + G                L I        + + + + K    ++V
Sbjct: 513 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 572

Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
              ELPGC  +WTV     +         G   +     A DD   H +LI+S E  TM+
Sbjct: 573 TTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMI 632

Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
           L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +
Sbjct: 633 LQTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVD 688

Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                    S ++  ++ADPYV++  ++G + + +
Sbjct: 689 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 716



 Score =  112 bits (280), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 116/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 831  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 886

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 887  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 936

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 937  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 996

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 997  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1056

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1057 PWPVRKI 1063


>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
 gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
          Length = 1426

 Score =  295 bits (755), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 291/1065 (27%), Positives = 451/1065 (42%), Gaps = 216/1065 (20%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    +  +  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVDAVQRQKLNPSEMRLAPKM------RLECLASYTLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQSVSLAGA----MRDALLISFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +      R  
Sbjct: 136  GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I L DLD K  +V D  F+HGY EP ++IL+E   T AGR+  +  T       
Sbjct: 196  IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDT------- 248

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                      +  PIGG LV+  N + Y +QS         Y V
Sbjct: 249  -----------------------FFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 279

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+ +  + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 280  SLNSSADNSTAFPLKPQDNVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 339

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++            
Sbjct: 340  NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVEATVEQQT 399

Query: 448  -----SGLKEE--FGDIEA-DAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                   L EE    D+E  +AP   + RR         +  EEL +YGS +  +    +
Sbjct: 400  IEQSPEELAEESPVYDVEQHEAPPQSKSRR---------IEDEELEVYGSGAKASVLQLR 450

Query: 500  TFSFAVRDSLVNIGPLKDFSYG-----------LRINAD---------ASATGISKQS-- 537
             F F V DSL+N+ P+     G           LR +AD          +ATG SK    
Sbjct: 451  KFIFEVCDSLINVAPINYMCAGERVEFEEDGATLRPHADNLNDLKIELVAATGHSKNGAL 510

Query: 538  -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                   N +++   EL GC  +WTV+  ++R   A ++R      + H ++++S  + T
Sbjct: 511  SVFVNCINPQIITSFELEGCLDVWTVFDDATR--KATTAR-----QDQHDFMLLSQRSST 563

Query: 588  MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
            +VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++    
Sbjct: 564  LVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 620

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                       S V+ VSIADPYV L + +G +  L    +  T  +       SS   V
Sbjct: 621  --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAV 672

Query: 708  SSCTLYHD------------------------------KGPEPWLRKTSTDAWLSTGVGE 737
             +   Y D                                 EP ++    +  L    G 
Sbjct: 673  VAIAAYKDLSGLFTCKADDVLNLTGSSGAGFANSFGGYMKAEPHMKVEDEEDLLYGDAGS 732

Query: 738  AI------DGADGGPLDQGD------------IYSVVCYESGALEIFDVPNFNCVFTVDK 779
            A       D A        D             + VV  +SG LEI+ +P+   V+ V+ 
Sbjct: 733  AFKLNSMADLAKQSKQKNSDWWRRQLIQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVND 792

Query: 780  FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSR 838
              +G   + D    E +  S T+ NS +  G       ++ +S   +EL +     H  R
Sbjct: 793  IGNGALVLSDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGER 848

Query: 839  PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
            P L  + T   +L YQ + +                  +    +    L   + +   LD
Sbjct: 849  PLLL-VRTRLELLIYQVFRY------------AKGHLKIRFRKLEQLHLLEQQPTHIELD 895

Query: 899  AYTREETP----HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGS 953
                EE           Q++  F N+ G  G  + G  PC+  +  R  LR+H  L +G 
Sbjct: 896  GEDVEEAESYNMQAKYVQKLRYFANVGGLAGIMVCGVNPCFVFLTSRGELRIHRLLGNGD 955

Query: 954  IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            + +F   +NVN  HGF+Y  +   LKI  LPS  +YD  WPV+KV
Sbjct: 956  VRSFAAFNNVNIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKV 1000


>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Apis mellifera]
          Length = 1415

 Score =  293 bits (750), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 209/668 (31%), Positives = 343/668 (51%), Gaps = 81/668 (12%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV  AN+I ++ +    + +K+ K +     ++         LE +  Y LHGNV S+ 
Sbjct: 30  LVVAGANIIRVFRLIPDVDITKKEKYTESRPPKM--------KLECLSQYTLHGNVMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            ++  G+    +RDS++L+F DAK+SV+E+D   H LR  S+H FE  E   ++ G  + 
Sbjct: 82  AVTLVGS----QRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEE---IRDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
              P+V+VDP+GRC  +L+YG ++++L   +  S   GD             I SS++I 
Sbjct: 135 HHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYMIV 194

Query: 238 LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           L+ L+  M ++ D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + H
Sbjct: 195 LKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVH 254

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           P+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS      +  Y VSL+S  E 
Sbjct: 255 PIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQS------IPPYGVSLNSLAET 308

Query: 356 -------PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
                  P+    + L+ +   ++ +D  ++S K+G+L +L++  D  R V+     K  
Sbjct: 309 STNFPLKPQEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 368

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS-GLKEEFGDIEADAPSTKR 466
            SVLTS +    ++  FLGSRLG+SLL++FT     ++ ++   +    + E +    K+
Sbjct: 369 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPENLQNTNENEIILEENETEETPAKK 428

Query: 467 LRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +++       +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S 
Sbjct: 429 IKQDFIGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISM 486

Query: 521 G--------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYH 555
           G           N D      + +G  K     ++            ELPGC+ +WTV  
Sbjct: 487 GEPAFLSEEFSHNQDPDVELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVI- 545

Query: 556 KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
               G   +  ++    +  HA+LI+S E  TM+L+T   + EV +S  +  QG TI AG
Sbjct: 546 ----GTLNNDEQIRPEAEGSHAFLILSQEDSTMILQTGQEINEVDQS-GFSTQGSTIFAG 600

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           NL   R ++QV + G R+L G    Q +                 ++  S ADPYV L  
Sbjct: 601 NLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVTLLS 650

Query: 676 SDGSIRLL 683
            DG + LL
Sbjct: 651 EDGQVMLL 658



 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 69/250 (27%), Positives = 105/250 (42%), Gaps = 40/250 (16%)

Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
           +V  +SG LEI+ +P+    + +  F  G+  + D+     L+ +      + E      
Sbjct: 772 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQTTPVNEIPNPE------ 825

Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                  M+V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 826 -------MQVREILMVALGHHGNRPMLLVRL-DSELQIYQAYRY--PKGHLKL------- 868

Query: 875 RSLSVSNVSASRLRNLRFSRTP--LDAYTREETPHGAPCQR---ITIFKNISGHQGFFLS 929
                      R + L     P  L    R+E        R   +  F NI+G+ G F+ 
Sbjct: 869 -----------RFKKLDHGIIPGHLRPRPRDEDMPAMNDTRHCMMRYFSNIAGYNGVFIC 917

Query: 930 GSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
              P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +
Sbjct: 918 SDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLS 977

Query: 989 YDNYWPVQKV 998
           YD  WPV+KV
Sbjct: 978 YDAPWPVRKV 987


>gi|301628217|ref|XP_002943254.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Xenopus (Silurana) tropicalis]
          Length = 628

 Score =  293 bits (749), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 204/626 (32%), Positives = 318/626 (50%), Gaps = 74/626 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E S + +   E K            LEL+  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNPNCESSSKGEKGSEVKGH-------KEKLELMASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F++AK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG Q+++L       ++   GLVG+        G  +   
Sbjct: 135 NVHNPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHDGLVGE--------GQKSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
              K HP+IWS  NLP+D  + LAVP PIGGV++   N++ Y +QS     ++LN+    
Sbjct: 247 IMQKVHPVIWSLTNLPYDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVSLNSLTNG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             S    P+    V LD + AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTSFPLKPQEGLRVTLDCSQATFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ +T +     FLGSRLG+SLL+++T     S        +    + D P  K+ 
Sbjct: 367 ASVLTTSMTPMEPGYLFLGSRLGNSLLLRYTEKVQDSPAGPSKDPD----KQDEPPNKKK 422

Query: 468 RRSSSDALQ-----DMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           R  SS A       +MV+  +E+ +YGS    + +   T+SF V DS++NIGP    S G
Sbjct: 423 RVDSSLARPGGSKGNMVDEIDEIEVYGSEM-QSGTQLSTYSFEVCDSILNIGPCATASMG 481

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 482 EPAFLSEEFQESPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVISN 541

Query: 557 SSRGHNADSSRM------AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
             +               A  D   H +LI+S +  TM+L+T   + E+  S  +  Q  
Sbjct: 542 HKKEEQEGEKEGETPPVEAEEDTNRHGFLILSRDDSTMILQTGQEIMELDTS-GFATQDP 600

Query: 611 TIAAGNLFGRRRVIQVFERGARILDG 636
           T+ AGN+   + ++QV  RG R+L+G
Sbjct: 601 TVYAGNIGDNKYIVQVSPRGIRLLEG 626


>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
          Length = 1532

 Score =  293 bits (749), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 206/621 (33%), Positives = 328/621 (52%), Gaps = 63/621 (10%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LE +  Y LHGN+ S+  +   G+    +RDS++L+F DAK+SV+E+D  IH LR  S+
Sbjct: 32  KLECLAQYTLHGNIMSMQAVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSL 87

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-D 218
           H FE  E   +K G  +    P+V+VDP+GRC  +L++G ++++L   +  S   GD  D
Sbjct: 88  HYFEEEE---IKDGWTNHHHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPSLDDGDLLD 144

Query: 219 TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
           T        A I SS++I L+ L+  M +V D  F+HGY EP ++IL+E   T+AGR++ 
Sbjct: 145 TAKLTSSNKAPILSSYMIVLKSLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFAGRIAV 204

Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS- 335
           +  TC + A+S++   + HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS 
Sbjct: 205 RQDTCAMVAISLNIQQRVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAFNSLIYLNQSI 264

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
               ++LN+ A +  +    P+    + L+ A   ++  D  ++S K+G+L +L++  D 
Sbjct: 265 PPYGVSLNSLADTSTNFPLKPQEGVKMSLEGAQVAFISADRLVISLKSGELYVLSLFADS 324

Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS-SGLKEE 453
            R V+     K   SVLTS +    ++  FLGSRLG+SLL++FT     ++ + +G +  
Sbjct: 325 MRSVRGFHFDKAAASVLTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNGGEIT 384

Query: 454 FGDIEADAPSTKRLRRS------SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
             + E++    K+ ++       +SD L D+ + EEL +YGS + +T     ++ F V D
Sbjct: 385 IEENESEETPAKKAKQDFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCD 442

Query: 508 SLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV------------ 542
           SL+NIGP  + S G           N D      + +G  K     ++            
Sbjct: 443 SLLNIGPCGNISMGEPAFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTF 502

Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
           ELPGC+ +WTV        N D  +  A  +  HA+LI+S E  TM+L+T   + EV +S
Sbjct: 503 ELPGCEDMWTVIGTL----NNDEIKTEA--EGSHAFLILSQEDSTMILQTGQEINEVDQS 556

Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
             +  QG T+ AGNL   R ++QV + G R+L G    Q +                 ++
Sbjct: 557 -GFSTQGSTVFAGNLGANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIV 605

Query: 663 SVSIADPYVLLGMSDGSIRLL 683
             S ADPYV L   DG + LL
Sbjct: 606 HASCADPYVTLLSEDGQVMLL 626



 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 72/248 (29%), Positives = 107/248 (43%), Gaps = 35/248 (14%)

Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
           +V  +SG LEI+ +P+    + +  F  G+  + D+     L+   T IN          
Sbjct: 740 LVYRDSGTLEIYSLPDLRLSYLIRNFGFGQYVLHDSMESTTLQS--TPINEIPHP----- 792

Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                  M+V E+ M     H +RP L   L D  +  YQ Y +  P+   K        
Sbjct: 793 ------DMQVREILMVALGHHGNRPMLLVRL-DSELQIYQVYRY--PKGYLK-------- 835

Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
             L    +    +   R S  P      E+ P      RI +   F NI+G+ G F+   
Sbjct: 836 --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNTSDTRICVMRYFSNIAGYNGVFICSD 888

Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
            P W  +  R  LR HP   DGS+ +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 889 YPHWIFLTGRGELRTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYD 948

Query: 991 NYWPVQKV 998
             WPV+KV
Sbjct: 949 APWPVRKV 956


>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
 gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
          Length = 1420

 Score =  290 bits (741), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 284/1055 (26%), Positives = 447/1055 (42%), Gaps = 202/1055 (19%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV  ANV+++Y +    + ++  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29  NLVVAGANVLKVYRIAPNVDAAQRQKLNPTEMRLAPKM------RLECLASYSLYGNVMS 82

Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
           L  +S  G      RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83  LQSVSLAGG----MRDALLISFKDAKLSVLQLDADTQALKTLSLHYFEEED---IRGGWT 135

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +      R  
Sbjct: 136 GRYHVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
           I +S++I L DLD K  +V D  F+HGY EP ++IL+E   T AGR+             
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                 K+  +  PIGG LV+  N I Y +QS         Y V
Sbjct: 243 ----------------------KVFPIQKPIGGCLVMTVNAIIYLNQSVP------PYGV 274

Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
           SL+SS +        P+ +  + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 275 SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334

Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
                K   SVLTS I        FLGSRLG+SLL+ FT    +++++    E   + +A
Sbjct: 335 NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDMENAVEQQA 394

Query: 460 DAPSTKRL----------RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSL 509
              +  +L            + S A    +  EEL +YGS +  +    + F F V DSL
Sbjct: 395 VEQAPPQLDEEQVYDVDQHEAPSQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSL 454

Query: 510 VNIGPLKDFSYGLRINAD--------------------ASATGISKQS---------NYE 540
           +N+ P+     G R+  +                     +ATG SK           N +
Sbjct: 455 INVAPINYMCAGERVEFEEDGSTLRPHAESLNEVKIELVAATGHSKNGALSVFVNCINPQ 514

Query: 541 LV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT 597
           ++   EL GC  +WTV+  ++R     ++R      E H ++++S  + T+VL+T   + 
Sbjct: 515 IITSFELDGCLDVWTVFDDATR--KPTTAR-----QEQHDFMLLSQRSSTLVLQTGQEIN 567

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
           E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++              
Sbjct: 568 EI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI----------DV 616

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-- 715
            S V+ VSIADPYV L + +G +  L    +  T  +       SS   V +   Y D  
Sbjct: 617 GSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAIAAYKDLS 676

Query: 716 ----------------KGP------------EPWLRKTSTDAWLSTGVGEAI------DG 741
                            GP            EP ++    +  L    G A       D 
Sbjct: 677 GLFTCKADDVLNLTGSSGPGFVNSFGGYMKAEPHMKVEDEEDLLYGDAGNAFKLNSMADL 736

Query: 742 ADGGPLDQGD------------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
           A        D             + VV  +SG LEI+ +P+   V+ V+   +G   + D
Sbjct: 737 AKQSKQKNSDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGALVLND 796

Query: 790 TYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
               E +  S T+ NS +  G       ++ +S   +EL +     H  RP L  + T  
Sbjct: 797 AM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELCLVGLGQHGERPLLL-VRTRL 851

Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP-- 906
            +L YQ + +                  +    +    L + + +   LD    EE    
Sbjct: 852 ELLIYQVFRY------------AKGHLKIRFRKLEQLHLLDQQPTHIELDGDEAEEAESY 899

Query: 907 --HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNV 963
                  Q++  F N+ G  G  + G  P +  +  R  LR+H  L +  + +F   +NV
Sbjct: 900 NMQPKYVQKLRYFSNVGGLAGIMVCGMNPVFVFLTARGELRIHRLLGNADVRSFAAFNNV 959

Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           N  HGF+Y  +   LKI  LPS  +YD  WPV+KV
Sbjct: 960 NIPHGFLYFDTTYELKISVLPSYLSYDAAWPVRKV 994


>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
 gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
          Length = 1431

 Score =  289 bits (739), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 280/1068 (26%), Positives = 446/1068 (41%), Gaps = 217/1068 (20%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
            NLVV  ANV+++Y +    + ++  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29   NLVVAGANVLKVYRIAPNVDATQRQKLNPSEMRLAPKM------RLECLASYSLYGNVMS 82

Query: 116  LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
            L  +S  G      RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83   LQSVSLAGG----MRDALLVSFKDAKLSVLQLDADTQTLKTLSLHYFEEDD---IRGGWT 135

Query: 176  SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                 P+V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136  GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVSRTP 195

Query: 230  IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
            I +S++I L DLD K  +V D  F+HGY EP ++IL+E   T AGR+             
Sbjct: 196  IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEPVRTCAGRI------------- 242

Query: 288  ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
                                  K+  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 243  ----------------------KVFPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 274

Query: 348  SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
            SL+SS +        P+ +  + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 275  SLNSSADNSTSFPLKPQDNVRLSLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 334

Query: 400  RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                 K   SVLTS I        FLGSRLG+SLL+ FT    +++++            
Sbjct: 335  NFHFHKAAASVLTSCICVCHTEYIFLGSRLGNSLLLHFTEEDQSTVITLDDVESAATAAA 394

Query: 448  --------SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                      + +    ++ D         + S A    +  EEL +YGS +  +    +
Sbjct: 395  TGAGEQQQQAIDQSPPQMDEDQVYDVEQHEAPSQAKSRRIEDEELEVYGSGAKASVLQLR 454

Query: 500  TFSFAVRDSLVNIGPLKDFSYGLRINAD--------------------ASATGISKQS-- 537
             F F V DSL+N+ P+     G R+  +                     +ATG SK    
Sbjct: 455  KFIFEVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLTDLKIELVAATGHSKNGAL 514

Query: 538  -------NYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                   N +++   EL GC  +WTV+  ++R       + +    E H ++++S  + T
Sbjct: 515  SVFVNCINPQIITSFELDGCLDVWTVFDDATR-------KPSTARQEQHDFMLLSQRSST 567

Query: 588  MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
            +VL+T   + E+ E+  + V   TI  GNL  +R ++QV  R  R+L G+ + Q++    
Sbjct: 568  LVLQTGQEINEI-ENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPI-- 624

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                       S V+ VSIADPYV L + +G +  L    +  T  +       SS   V
Sbjct: 625  --------DVGSPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSAPAV 676

Query: 708  SSCTLYHD------------------------------KGPEPWLRKTSTDAWLSTGVGE 737
             +   Y D                                 EP ++    +  L    G 
Sbjct: 677  VAIAAYKDLSGLFTCKADDVLNLTGSTGAGFANSFGGYMKAEPHMKVEDEEDLLYGDAGN 736

Query: 738  AI------DGADGGPLDQGD------------IYSVVCYESGALEIFDVPNFNCVFTVDK 779
            A       D A        D             + VV  +SG LEI+ +P+   V+ V+ 
Sbjct: 737  AFKLNSMADLAKQSKQKNTDWWRRQLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVND 796

Query: 780  FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSR 838
              +G   + D    E +  S T+ NS +  G       ++ +S   +EL++     H  R
Sbjct: 797  VGNGALVLTDAM--EFVPISLTQENSKA--GILHACMPQHANSPLPLELSLVGLGQHGDR 852

Query: 839  PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
            P L  + T   +L YQ + +                  L +      +L  L    T ++
Sbjct: 853  PLLL-VRTRLELLIYQVFRY--------------AKGHLKIRFRKLEQLHLLDQQPTHIE 897

Query: 899  AYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLC 950
                EET             Q++  F N+ G  G  + G  PC+  +  R  LR+H  L 
Sbjct: 898  LINEEETDEAESYNMQPKYVQKLRYFNNVGGLAGIMVCGVNPCFIFLTARGELRIHRLLG 957

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            +  + +F   +NVN  HGF+Y  +   LKI  LP+  +YD  WPV+KV
Sbjct: 958  NAEVRSFAAFNNVNIPHGFLYFDTTYELKISVLPTYLSYDAAWPVRKV 1005


>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1
           [Acromyrmex echinatior]
          Length = 1412

 Score =  289 bits (739), Expect = 7e-75,   Method: Compositional matrix adjust.
 Identities = 214/665 (32%), Positives = 342/665 (51%), Gaps = 76/665 (11%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           LVV  ANVI ++ +    + ++  K + ET+            LE +  Y LHGN+ S+ 
Sbjct: 30  LVVAGANVIRVFRLIPDVDMTRREKYT-ETRP-------PKMKLECLTQYTLHGNIMSMQ 81

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +   G+    +RDS++L+F DAK+SV+E+D  IH LR  S+H FE  E   +K G  + 
Sbjct: 82  AVHLIGS----QRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEE---IKDGWTNH 134

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR---IESSH 234
              P+V+VDP+GRC  +L++G ++++L   +  S  + D D   S    S     I SS+
Sbjct: 135 HHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPS--LDDGDLLDSAKLTSTNKTPILSSY 192

Query: 235 VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
           +I L+ L+  M +V D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDS 351
           + HP+IWS  NLP D Y+ + V  P+GG L++  N++ Y +QS     ++LN+ A S  +
Sbjct: 253 RVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADSSTN 312

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSV 410
               P+    + L+ +   ++  D  ++S K+G+L +L++  D  R V+     K   SV
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM--LSSGLKEEFGDIEADAPSTKRLR 468
           LTS +    ++  FLGSRLG+SLL++FT     ++  L+        +   + P+ K  +
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRFTEKEPETLKNLNDNEITIEENENEETPAKKTKQ 432

Query: 469 R-----SSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-- 521
                  +SD L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S G  
Sbjct: 433 DFLGDWMASDVL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEP 490

Query: 522 ------LRINAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSS 558
                    N D      + +G  K     ++            +LPGC+ +WTV     
Sbjct: 491 AFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFQLPGCEDMWTVIGIV- 549

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
              N D  R    ++  HA+LI+S E  TMVL+T   + EV +S  +  QG T+ AGNL 
Sbjct: 550 ---NNDEIRT---EEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLG 602

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
             R ++QV + G R+L G    Q +                 ++  S ADPYV L   DG
Sbjct: 603 ANRYIVQVTQMGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVALLSEDG 652

Query: 679 SIRLL 683
            + LL
Sbjct: 653 QVMLL 657



 Score = 94.4 bits (233), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 72/248 (29%), Positives = 107/248 (43%), Gaps = 35/248 (14%)

Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
           +V  +SG LEI+ +P+    + +  F  G+  + D+     L+   T IN          
Sbjct: 771 LVYRDSGTLEIYSLPDLRLSYLIRNFGYGQYVLHDSMESTTLQ--STPINEIPHP----- 823

Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
                  M+V E+ M     H +RP L   L D  +  YQAY +  P+   K        
Sbjct: 824 ------DMQVREILMVALGHHGNRPMLLVRL-DSDLQIYQAYRY--PKGYLK-------- 866

Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGS 931
             L    +    +   R S  P      E+ P      RI +   F NI+G+ G F+   
Sbjct: 867 --LRFKKLDHGIIPG-RLSPRP----KEEDVPRNRNITRICVMRYFSNIAGYNGVFICSD 919

Query: 932 RPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
            P W  +  R  LR HP   DG + +F   +N+NC  GF+Y   +  L+IC LP+  +YD
Sbjct: 920 YPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQGFLYFNRKEELRICVLPTHLSYD 979

Query: 991 NYWPVQKV 998
             WPV+KV
Sbjct: 980 APWPVRKV 987


>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Gorilla gorilla gorilla]
          Length = 1440

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 210/675 (31%), Positives = 336/675 (49%), Gaps = 84/675 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP G C  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGTCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T        +S ++E     + + P +K+ 
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-EKLQEPPASAVREA---ADKEEPPSKKK 422

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR---DSLVNIGPLKDFSYG--- 521
           R  ++               G  +     A    + AV    DS++NIGP  + + G   
Sbjct: 423 RVDATAGWSGEGRSRAGQERGQVTQGWSGAGAPLTVAVPQVCDSILNIGPCANAAMGEPA 482

Query: 522 -------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY----- 554
                        L I        + + + + K    ++V   ELPGC  +WTV      
Sbjct: 483 FLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRK 542

Query: 555 ----HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
               +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  QG
Sbjct: 543 EEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQG 601

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADP
Sbjct: 602 PTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADP 651

Query: 670 YVLLGMSDGSIRLLV 684
           YV++  ++G + + +
Sbjct: 652 YVVIMSAEGHVTMFL 666



 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/247 (27%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 781  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 836

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 837  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 887  VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 946

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 947  PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1007 PWPVRKI 1013


>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
          Length = 1503

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 277/1057 (26%), Positives = 457/1057 (43%), Gaps = 185/1057 (17%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            +LV   ANV+++Y  R+  +    ++      R   M       LE +  YRL GN+ SL
Sbjct: 42   SLVTGGANVLKVY--RIIPDADPATREKYSATRPPNM------KLECMASYRLFGNIMSL 93

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +S  G+    +RD+++++F DAK+SV++FD     L+  S+H FE  +   ++ G   
Sbjct: 94   QSVSLAGS----QRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDED---IRGGWTG 146

Query: 177  FARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVGDEDTFGSGGGF---SARI 230
                PLV+VDP  RC  +LVYG ++++L   K S      + D                I
Sbjct: 147  HYHIPLVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEMQDVKPIKKTPTLLIAKTPI 206

Query: 231  ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             +S++I L+DLD K  +V D  F+HGY EP ++IL+E   T+ GR++ +  TC + ALS+
Sbjct: 207  LASYIIELKDLDEKIDNVIDVQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSL 266

Query: 289  STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS 348
            +   + HP+IW+  +LP D  + + +  PIGG LV+  N++ Y +QS         Y VS
Sbjct: 267  NIQQRVHPVIWTVNSLPFDCLQAVPISKPIGGCLVMCVNSLIYLNQSVP------PYGVS 320

Query: 349  LDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
            L+SS +        P+    + LDAA   +++++  +LS K G+L +LT+  D       
Sbjct: 321  LNSSADHSTNFPLKPQDGVRISLDAAQVCFIESEKLVLSLKGGELYVLTLCADS------ 374

Query: 402  DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
                         I        FLGSRLG+SLL++F     + +++    ++ G +E + 
Sbjct: 375  ----------MRSICVCETEYLFLGSRLGNSLLLRFREKDESLVITI---DDSGTVEKE- 420

Query: 462  PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
               KR R    +            +YGS    T     ++ F V DS++NIGP+   + G
Sbjct: 421  --QKRQRLEEEEL----------EVYGSGY-KTSVQLTSYIFEVCDSVLNIGPIAHMAVG 467

Query: 522  LRINAD-------------------ASATGISKQSNYELVE------------LPGCKGI 550
             RI  +                    +A+G  K     +++            L GC  +
Sbjct: 468  ERICEEEMEEGAEVQFVPNKLDVEVVTASGHGKNGALCVLQSSIKPQVITSFGLSGCLDV 527

Query: 551  WTVYHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
            WTV+ +++       +R    DD     HA++I+S E  TMVL+T + + E+ E+  +  
Sbjct: 528  WTVFDEAAGPGGVTGTRKP--DDAPPPNHAFMILSQEGATMVLQTGEEINEI-ENTGFAT 584

Query: 608  QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
               TI  GN+   R ++QV  +  R+L G+ + Q++                 + SVSI 
Sbjct: 585  DVPTIHVGNIGSNRFIVQVTTKSIRLLQGTRLLQNIPI----------DLGCPLASVSIV 634

Query: 668  DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD------------ 715
            DPYV +  S+G +  L       T  +       S+  PV + + Y D            
Sbjct: 635  DPYVCVRSSEGRVITLALREGKGTPRLAVNKNTISASPPVIAISAYRDVSGMFTRKLEDS 694

Query: 716  ----KG---------------PEPWLRKTSTDAWLSTGVGEAID---GADGGPLDQG--- 750
                KG               PEP ++    +  L    G +      AD    D+G   
Sbjct: 695  FDVSKGGGATSAYSSGFGSMKPEPNMKIEDEEDLLYGESGRSFKVTSMADMALADKGGGN 754

Query: 751  -------------DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL- 796
                           + +   ++G LEI+ +P+    + +    +G   + D+     L 
Sbjct: 755  ADFWLKYMQQIKPTYWLLAARDNGNLEIYSMPDLKLAYLISNVGNGNKVLSDSMEFVPLP 814

Query: 797  --KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
              K   ++  ++S  G   G      S+   E+ M    ++ SRP LF I  +  +L Y+
Sbjct: 815  MAKPGTSQEEATSAFGASFGSGGVPVSLLPKEILMVALGSYGSRPILF-IRLEQDLLIYR 873

Query: 855  AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR- 913
             + +       +     S+    +   V A RL NL   +    A T    P+G   Q  
Sbjct: 874  VFRYAKGHLKLRFKRLTSSVTCPAFRTVPA-RLANLP-DKPATGATTDATEPNGKDTQEH 931

Query: 914  -----------ITIFKNISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLH 961
                       I  F N+SG+ G  + G +P +  +     LR H       + AF   +
Sbjct: 932  ATKVQYENISMIRYFGNVSGYAGVAVCGEKPYFLFLTAHGELRSHRLYARTVMKAFAPFN 991

Query: 962  NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            NVNC +GF+Y   Q  LKI  LP+  +YD+ WPV+K+
Sbjct: 992  NVNCPNGFLYFDEQYQLKISILPTYLSYDSVWPVRKI 1028


>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_a [Homo sapiens]
 gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_a [Homo sapiens]
 gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_a [Homo sapiens]
          Length = 1365

 Score =  286 bits (733), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 202/620 (32%), Positives = 322/620 (51%), Gaps = 80/620 (12%)

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
           S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G 
Sbjct: 2   SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSAR 229
                 P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  + 
Sbjct: 55  VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYA 346
           ++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+  
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLT 226

Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSK 405
               +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K
Sbjct: 227 TGTTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDK 286

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADA 461
              SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +   
Sbjct: 287 AAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRV 346

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
            +T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + 
Sbjct: 347 DATAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAV 402

Query: 521 G----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY- 554
           G                L I        + + + + K    ++V   ELPGC  +WTV  
Sbjct: 403 GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA 462

Query: 555 --------HKSSRGHNADSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVD 604
                   +    G   + S     DD+   H +LI+S E  TM+L+T   + E+  S  
Sbjct: 463 PVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTS-G 521

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
           +  QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  
Sbjct: 522 FATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQC 571

Query: 665 SIADPYVLLGMSDGSIRLLV 684
           ++ADPYV++  ++G + + +
Sbjct: 572 AVADPYVVIMSAEGHVTMFL 591



 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/247 (27%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 706 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 761

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 762 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 811

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                   N++    +     +        E         R   F++I G+ G F+ G  
Sbjct: 812 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 871

Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 872 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 931

Query: 992 YWPVQKV 998
            WPV+K+
Sbjct: 932 PWPVRKI 938


>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Strongylocentrotus purpuratus]
          Length = 1439

 Score =  286 bits (731), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 219/726 (30%), Positives = 349/726 (48%), Gaps = 105/726 (14%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +A Y+ +H PTG+ +C      H  +                  P ++      NLVV  
Sbjct: 2   YAFYREIHPPTGVEHC---VYCHFFS------------------PDQQ------NLVVAK 34

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
            + + +Y + +  + +K +    + K +          LE    + + G V S+    Q 
Sbjct: 35  GSELTVYSM-ITVDSNKPTDKESKPKNK----------LEEAATFHIFGKVMSM----QS 79

Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
                  RD+++L+F +AK+S++E+D ++H L+  SMH FE  E    K G       P+
Sbjct: 80  AQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDE---TKEGVYRNIFHPV 136

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           VKVDP  RC  +L YG ++++L   +   GLV D D   S       +  S+VI L ++D
Sbjct: 137 VKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEMD 194

Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
             + +V D  F+HGY EP ++IL+E   TWAGRV+ +  TC I ALS++   K HP+IWS
Sbjct: 195 DPICNVLDIQFLHGYYEPTLLILYEPLRTWAGRVAVRQDTCSIVALSLNMAQKVHPIIWS 254

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS----SQELP 356
             +LP+D  ++ AVP PIGGVL++  N++ Y +QS      +  Y VSL+S    S   P
Sbjct: 255 QSSLPYDCMQVQAVPKPIGGVLILAVNSLLYLNQS------IPPYGVSLNSLTDWSTAFP 308

Query: 357 ---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
              +    + +D   AT++  D   LS K G++ +LT++ DG R V+   L K   SVLT
Sbjct: 309 LKTQEGVKLSMDCTQATFISYDRLALSLKDGEIYVLTLLVDGMRSVRGFHLDKAAASVLT 368

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           + I  +G+   FLGSRLG+SLL+++T     +  S   K E      + PS K     +S
Sbjct: 369 TCICPMGDGFLFLGSRLGNSLLLKYTEKVSETSPSDASKTEEPKPGEEPPSKKMRSDDAS 428

Query: 473 DALQD----MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
           D +      + + +EL +YG     T +   ++SF + DSL+NIGP  +   G       
Sbjct: 429 DWMASDTKFLDDPDELEVYGKQVQKTGTQLTSYSFEICDSLLNIGPCGNMIMGEPAFLSE 488

Query: 522 -LRINAD-----ASATGISKQSNYELVE------------LPGCKGIWTV--YHKSSRGH 561
             + N D      + +G  K     +++            LPGC  +WTV    K+    
Sbjct: 489 EFQGNVDPDLELVTTSGYGKNGALSVLQRTIRPQVVTTFNLPGCLDMWTVKSLKKAKADE 548

Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
            ++ S  +  D + HA+LI+S +  +MVL+T   +TEV     +  Q  TI A N+   R
Sbjct: 549 KSEESETSPEDKDRHAFLILSKQDSSMVLQTGQEITEVAAG-GFSTQAPTIFASNMGDDR 607

Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
            ++QV  +   +++G    Q +               S +   S+ADPY+LL   +G   
Sbjct: 608 YIVQVMNKSICLMEGVEQIQHMVL----------DVGSPIKQCSLADPYLLLLTENGDPI 657

Query: 682 LLVGDP 687
           L+   P
Sbjct: 658 LMTLKP 663



 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 72/257 (28%), Positives = 112/257 (43%), Gaps = 32/257 (12%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + V C E+G LE++ +P+    F V  F  G   +VD               S S   TG
Sbjct: 776  WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 820

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
               +++    +V E+ +        + ++ A++ D  I+ Y+A+    P NT   +  + 
Sbjct: 821  DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 875

Query: 873  TSRSLSVSNVSASRLRNLRFSRTP----------LDAYTREETPHGAPCQRITIFKNISG 922
              R   + +    + +  R S+ P                +         R+  F N+  
Sbjct: 876  V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKTETETEAESDTKTQTRRVNRLREFHNVQT 934

Query: 923  HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            + G F+SGS P W  V  R  LR HP   DG+I  F   HNVNC +GF+Y   +  L+IC
Sbjct: 935  YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 994

Query: 982  QLPSGSTYDNYWPVQKV 998
             LPS  +YD  WPV+KV
Sbjct: 995  VLPSHLSYDAPWPVRKV 1011


>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
           [Macaca mulatta]
          Length = 1436

 Score =  286 bits (731), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 216/704 (30%), Positives = 343/704 (48%), Gaps = 116/704 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASAVREAADKEEPPSKKKRV 424

Query: 468 RRSSS------DALQDMVNGEELSLYGSASNN--------------------TESAQKTF 501
             ++S         QD V+  E+ +YGS + +                    ++  Q+  
Sbjct: 425 DATASWSAGGKSVPQDEVD--EIEVYGSEAQSGTQLATYSFEVRLRQQGPHPSQCPQRPL 482

Query: 502 SFAVR---DSLVNIGPLKDFSYG--LRINADASATGISKQSNYELV-------------- 542
           +FAV    DS++NIGP  + + G    ++ +      S + + E+V              
Sbjct: 483 TFAVPQVCDSILNIGPCANAAMGEPAFLSEEVPRVVNSPEPDLEIVVCSGHGKNGALSVL 542

Query: 543 ------------ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYLI 580
                       ELPGC  +WTV          +    G   ++    A DD   H +LI
Sbjct: 543 QKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLI 602

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
           +S E  TM   T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G    
Sbjct: 603 LSREDSTM---TGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---V 655

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
             L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 656 NQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 692



 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 38/87 (43%), Positives = 52/87 (59%), Gaps = 1/87 (1%)

Query: 913  RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
            R   F++I G+ G F+ G  P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y
Sbjct: 923  RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLY 982

Query: 972  VTSQGILKICQLPSGSTYDNYWPVQKV 998
               QG L+I  LP+  +YD  WPV+K+
Sbjct: 983  FNRQGELRISVLPAYLSYDAPWPVRKI 1009


>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
            tauri]
 gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
            [Ostreococcus tauri]
          Length = 1473

 Score =  285 bits (730), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 226/848 (26%), Positives = 364/848 (42%), Gaps = 176/848 (20%)

Query: 260  MVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
            + IL+E+  TWAGR +    TC I ALS+    ++  +IW   NLP  +YKL A+  P+G
Sbjct: 6    LAILYEKTPTWAGRYNLAKDTCEIVALSVDVDKQKSTVIWRRQNLPSSSYKLTALLPPLG 65

Query: 320  GVLVVGANTIHYHSQSASCALALNNYA------------------VSLDSSQELPRS--- 358
            GVLV   + + + SQ +S AL LN +                   +  D+    P +   
Sbjct: 66   GVLVFSQDFLLHESQESSSALCLNTFGRGGPQEGNDAETVARLAGMGEDAVANPPPACAA 125

Query: 359  -----SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
                    + LD A A  +  D  L++TK G L LL +  DGR ++R+ L +   +VL+S
Sbjct: 126  RAVDCGLEITLDGAQAAVVSEDRVLVTTKMGALFLLALHTDGRSLRRMMLQRAGGAVLSS 185

Query: 414  DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---MLSSGLKEEFGDIEADAPSTKRLRRS 470
             +  +   L FLGSR+GDSLLV+FT  S  +   ML  G  +E    E +  S KR +  
Sbjct: 186  GMCLLSRDLLFLGSRIGDSLLVKFTPKSEPAAPLMLPKGEDDEETVDEVEKGSGKRSKSG 245

Query: 471  SSDALQDMV-----------------NGEELS--LYGSAS-------NNTESAQKT---- 500
               A++                    + +EL   LYG+           T++A+K     
Sbjct: 246  DGAAIRKRAKSTEDPPPAPSTPSPEDDDDELEALLYGTTKAESVIGDETTQTAEKKREGL 305

Query: 501  -----------FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQ---- 536
                       + F V+DSL+ + P+ D + G    +  D        +A G  K     
Sbjct: 306  AGVVPGLKVAGYDFKVKDSLLGVAPVVDITVGASAPVGTDTAERTELVTACGQGKNGALA 365

Query: 537  -----------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
                       +  E   LP  +G+W ++ +       + +R     + +H +L++ L+ 
Sbjct: 366  ILTRGVQPELVTEVEAGTLPTLQGLWALHDRK------EGTR--EVREPFHNHLLLKLQ- 416

Query: 586  RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
                        EV+ S+++     T+AA N FG    +Q+ E   RIL      QD++ 
Sbjct: 417  ------------EVSASLEFITDQATLAAANFFGHFCSLQITETSIRILKSGMKVQDVTL 464

Query: 646  GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
                +  GS      + S  I DPY+++ +SDG++RLL GD    TVS+    A+ +S +
Sbjct: 465  ADIKAPKGS-----VIASAEILDPYIMIRLSDGTLRLLAGDEKKMTVSLMESGAMPTSSR 519

Query: 706  PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
                       G   W+ +++T+  ++   G    GA     +Q +    +  E G+LE+
Sbjct: 520  RTRLVEALKKSG---WIHRSATNGTITGLEGSKKSGAS----NQKEAIVAIAREGGSLEL 572

Query: 766  FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
            F +P+   ++  D    G   +  T        SE  I                   ++V
Sbjct: 573  FSLPSCTRIWNADGLSEGSRVLSPTRPVH----SELRIP------------------EIV 610

Query: 826  ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
            ++ +  +   H RP L A+  DGT+L Y+ ++         S++P++             
Sbjct: 611  DIRIDSFEEAHERPLLTAVRGDGTLLLYRGFIVPAGTTCEGSEEPLARG----------- 659

Query: 886  RLRNLRFSRTPLD-------------AYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                LRFSR  +D             A    ++  G    RI+      G QG F++G  
Sbjct: 660  ---ELRFSRVNIDVEGSGLNVAGVGVAGQVRDSLAGTRLTRISNVGEGQGLQGIFVAGPN 716

Query: 933  PCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY 992
            P W +V R R+   P   +G IVAFT  HNVNC +GFI  T+ G ++ICQ+PS   Y+  
Sbjct: 717  PLWLIVRRSRVLALPTRGEGEIVAFTDFHNVNCPYGFILGTAVGGVRICQMPSKMHYEAA 776

Query: 993  WPVQKVVF 1000
            WPV+K+  
Sbjct: 777  WPVRKIAL 784


>gi|431908147|gb|ELK11750.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
           alecto]
          Length = 671

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 212/661 (32%), Positives = 328/661 (49%), Gaps = 114/661 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEAPTKNDRSAEGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 189

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 190 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 249

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 250 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 309

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT+V DG R V+     K  
Sbjct: 310 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLVTDGLRSVRAFHFDKAA 369

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC----GSGTSMLSSGLKEEFGDIEADAPS 463
            SVLTS + T+     FLGSRLG+SLL+++T        +++  +  KEE        P 
Sbjct: 370 ASVLTSSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPASTVREAADKEE--------PP 421

Query: 464 TKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPL 515
           +K+ R  S+          QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP 
Sbjct: 422 SKKKRVDSTVGWSGGKSVAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPC 477

Query: 516 K-------------------------DFSYGLRINADASATGISKQSNYELV-------- 542
                                     + + GL  +   +    S + + E+V        
Sbjct: 478 ANAAMGEPAFLSEEVPVWEVQGGGGVECTVGLWPHPSLAQFQNSPEPDLEIVMCSGYGKN 537

Query: 543 ------------------ELPGCKGIWTVY-------HKSSRGHNAD---SSRMAAYDDE 574
                             ELPGC  +WTV         ++ +G   +   S+  A  D  
Sbjct: 538 GALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEQEETPKGEAVEPEPSAPDADDDGR 597

Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
            H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L
Sbjct: 598 RHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLL 656

Query: 635 D 635
           +
Sbjct: 657 E 657


>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
           musculus]
          Length = 1311

 Score =  284 bits (726), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 196/601 (32%), Positives = 310/601 (51%), Gaps = 68/601 (11%)

Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
           +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G       P V+VDP 
Sbjct: 11  KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQNVHTPRVRVDPD 67

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK--HV 246
           GRC  +L+YG ++++L   +     + +E     G G  +    S++I++R LD K  ++
Sbjct: 68  GRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNI 124

Query: 247 KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
            D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K HP+IWS  +LP 
Sbjct: 125 IDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPF 184

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELD 365
           D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +     +    + LD
Sbjct: 185 DCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLD 244

Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFF 424
            A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT+ + T+     F
Sbjct: 245 CAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLF 304

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-----SSDALQDMV 479
           LGSRLG+SLL+++T        SS    E  D E      KR+  +          QD V
Sbjct: 305 LGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVGWTGGKTVPQDEV 362

Query: 480 NGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------L 522
           +  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G                L
Sbjct: 363 D--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDL 418

Query: 523 RI------NADASATGISKQSNYELV---ELPGCKGIWTVYH----------KSSRGHNA 563
            I        + + + + K    ++V   ELPGC  +WTV            K+      
Sbjct: 419 EIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAESTEQE 478

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
            S+  A  D   H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R +
Sbjct: 479 PSAPKAEEDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYI 537

Query: 624 IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
           +QV   G R+L+G      L F P +         + ++  ++ADPYV++  ++G + + 
Sbjct: 538 VQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMF 587

Query: 684 V 684
           +
Sbjct: 588 L 588



 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 703 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATR 758

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 759 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 808

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                   N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 809 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPS 868

Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 869 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 928

Query: 992 YWPVQKV 998
            WPV+K+
Sbjct: 929 PWPVRKI 935


>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Anolis carolinensis]
          Length = 1294

 Score =  283 bits (725), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 213/680 (31%), Positives = 336/680 (49%), Gaps = 89/680 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   + + +Y +    E + +S  S E K            LELV  +   GNV S+
Sbjct: 29  NLVVAGTSQLYVYRLNHDSESTTKSDRSSEGKSH-------KEKLELVAAFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P V+VDP GRC  +L+YG ++++L   +     + DE     G G  +    S++I
Sbjct: 135 NVHIPKVRVDPDGRCAVMLIYGTRLVVLPFRRD---TLTDEHEGVVGEGQKSSFLPSYII 191

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
           ++R+LD K  ++ D  F++GY EP ++IL E   TW GRV+ +  TC I A+S++   K 
Sbjct: 192 DIRELDEKLLNIIDMQFLYGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKV 251

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS--- 351
           HP+IWS  NLP D  + LAVP PIGGV++   N++ Y +QS         Y VSL+S   
Sbjct: 252 HPVIWSLSNLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVP------PYGVSLNSLTN 305

Query: 352 -SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
            +   P   +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K 
Sbjct: 306 GTTVFPLRIQEGVKITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKA 365

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
             SVLT+ + T+     FLGSRLG+SLL+++T       +++  K+     E      KR
Sbjct: 366 AASVLTTCMITMDPGYLFLGSRLGNSLLLRYTEKLQEPPVNAA-KDATEKTEEPPVKKKR 424

Query: 467 LRRSSS-----DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           + + ++      A QD V+  E+ +YGS + +  +   T+SF V DS++NIGP  + + G
Sbjct: 425 VEQQANWAGGKSAPQDEVD--EIEVYGSEAQSG-TQLSTYSFEVCDSILNIGPCANAAMG 481

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHK 556
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 482 EPAFLSEEFQNSLEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 541

Query: 557 SSRGHNADSSRMAAY----------DDEYHAYLIISLEARTMVLETADLLTEVTESVDY- 605
                  D+   +A           D + H +LI+S E  TMV        +    +D  
Sbjct: 542 QKAEQEEDAQGESAEKEPSPPEPPDDGKRHGFLILSREDSTMVNPANGPTGQEIMELDTS 601

Query: 606 -FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
                 T  AGN+   R ++QV   G R+L+G      L F P +         S ++  
Sbjct: 602 GLAPRSTQDAGNIGENRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQC 651

Query: 665 SIADPYVLLGMSDGSIRLLV 684
           ++ADPYV++  S+G + + V
Sbjct: 652 AVADPYVVIMSSEGQVTMFV 671



 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 66/227 (29%), Positives = 104/227 (45%), Gaps = 19/227 (8%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + V+  E+G +EI+ +P +  VF V  F  G+  +VD+   +    +E    +  EE   
Sbjct: 786 WCVLVRENGTMEIYQLPEWRLVFLVKNFPMGQRVLVDSSFGQPASQAE----AKKEEVIR 841

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           Q     +  + +V L  ++     SRP+L  +  D  +L Y+A+      + S+      
Sbjct: 842 QTEMPLVKEVLLVALGNRQ-----SRPYLL-VHVDQELLIYEAF-----NHDSQLGQTNL 890

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREE--TPHGAPCQRITIFKNISGHQGFFLSG 930
             R   V +    R +  R S+   ++   EE   P G    R   F++I G+ G F+ G
Sbjct: 891 KVRFKKVPHNINFREKKPRPSKKKTESAGGEEASVPRGR-VARFRYFEDIYGYSGVFICG 949

Query: 931 SRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
             P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG
Sbjct: 950 PSPHWLLVTSRGALRLHPMTIDGPIESFAPFHNVNCPKGFLYFNRQG 996


>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
           [Harpegnathos saltator]
          Length = 1214

 Score =  281 bits (720), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 232/835 (27%), Positives = 371/835 (44%), Gaps = 128/835 (15%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
           M +V D  F+HGY EP ++IL+E   T++GR++ +  TC + A+S++   + HP+IWS  
Sbjct: 1   MDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVHPIIWSVS 60

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQELPRSSFS 361
           NLP D Y+ + V  P+GG L++  N++ Y +QS     ++LN+ A +  +    P+    
Sbjct: 61  NLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTNFPLRPQDGVK 120

Query: 362 VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGN 420
           + L+ A   +L  D  ++S K+G+L +L++  D  R V+     K   SVLTS +    +
Sbjct: 121 ISLEGAQVAFLSADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASVLTSCVCMCED 180

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE-EFGDIEADAPSTKRLRRS------SSD 473
           +  FLGSRLG+SLL++FT     ++ S    E    D + + P  K+ ++       +SD
Sbjct: 181 NYLFLGSRLGNSLLLRFTEKEPETIKSLDDGEINIEDNDNEEPPAKKAKQDFLGDWMASD 240

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----------R 523
            L D+ + EEL +YGS + +T     ++ F V DSL+NIGP  + S G            
Sbjct: 241 VL-DIKDPEELEVYGSET-HTSIQITSYIFEVCDSLLNIGPCGNISMGEPAFLSEEFAHN 298

Query: 524 INAD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRM 568
            N D    + +G  K     ++            ELPGC+ +WTV      G   +  ++
Sbjct: 299 QNPDVELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVI-----GSLNNDEQV 353

Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
            +  +  HA+LI+S E  TMVL+T   + EV +S  +  QG T+ AGNL   R ++QV +
Sbjct: 354 KSETEGSHAFLILSQEDSTMVLQTGQEINEVDQS-GFSTQGSTVFAGNLGANRYIVQVTQ 412

Query: 629 RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
            G R+L G    Q +                 ++  S ADPYV+L   DG + LL     
Sbjct: 413 MGVRLLQGIEQIQHMPI----------DLGCPIVHASCADPYVILLSEDGQVMLLTLREV 462

Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHD--------------------------------- 715
             T  +    A    +  + +   Y D                                 
Sbjct: 463 RGTAKLHAQTANLLFRPQIEALCAYRDVSGIFTTQLPESAEDEQTEEEHNVEEPSIIGNI 522

Query: 716 -------KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY-ESGALEIFD 767
                   G  P  +  +     + G  + I        +    Y ++ Y +SG LEI+ 
Sbjct: 523 DNEDDLLYGDAPAFQMPTPSHPKTDGTTKKIPWWQKHLQEIKPTYWLLVYRDSGTLEIYS 582

Query: 768 VPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVEL 827
           +P+    + +  F  G+  + D+     L+ +      + E             ++V E+
Sbjct: 583 LPDLRLSYLIRNFGYGQYMLHDSMESTTLQSAPINETLNPE-------------LQVREV 629

Query: 828 AMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRL 887
            M     H +RP L   L D  +  YQAY +  P+   K          L    +    +
Sbjct: 630 LMVALGHHGNRPMLLVRL-DSELQIYQAYKY--PKGHLK----------LRFKKLDHGII 676

Query: 888 RNLRFSRTPLDAYTREETPHGAPCQRITI---FKNISGHQGFFLSGSRPCWCMVF-RERL 943
                SR P      E+ P  A   RI +   F NI+G+ G F+    P W  +  R  L
Sbjct: 677 PG-HLSRKP----KEEDVPVNANETRICMMRYFSNIAGYNGVFICSDYPHWIFLTGRGEL 731

Query: 944 RVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           R HP   DGS+ +F   +N+NC  GF+Y   +  L+IC LP+  +YD  WPV+KV
Sbjct: 732 RTHPMGIDGSVTSFAAFNNINCPQGFLYFNRKEELRICVLPTHLSYDAPWPVRKV 786


>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
           [Bos grunniens mutus]
          Length = 1417

 Score =  280 bits (717), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 206/635 (32%), Positives = 320/635 (50%), Gaps = 89/635 (14%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            LELV  +   GNV S+A +   GA    +RD+++L       SV+E+D   H L+  S+
Sbjct: 65  KLELVASFSFFGNVMSMASVQLAGA----KRDALLL-------SVVEYDPGTHDLKTLSL 113

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLV 214
           H FE PE   L+ G       P V+VDP GRC  +L+YG ++++L       ++   GLV
Sbjct: 114 HYFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLV 170

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAG 272
           G+        G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW G
Sbjct: 171 GE--------GQRSSFLPSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPG 222

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           RV+ +  TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y 
Sbjct: 223 RVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYL 282

Query: 333 SQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
           +QS     +ALN+      +     +    + LD A A ++  D  ++S K G++ +LT+
Sbjct: 283 NQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTL 342

Query: 392 VYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGL 450
           + DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        S+  
Sbjct: 343 ITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA- 401

Query: 451 KEEFGDIEADAPSTKRLRRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFA 504
             E  D E      KR+  +     S    QD V+  E+ +YGS A + T+ A  T+SF 
Sbjct: 402 -REAADKEEPPSKKKRVDATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFE 456

Query: 505 VRDSLVNIGPLKDFSYG----------------LRI------NADASATGISKQSNYELV 542
           V DS++NIGP  + + G                L I        + + + + K    ++V
Sbjct: 457 VCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVV 516

Query: 543 ---ELPGCKGIWTVYHKSSR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMV 589
              ELPGC  +WTV     +         G   +     A DD   H +LI+S E  TM+
Sbjct: 517 TTFELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMI 576

Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
           L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +
Sbjct: 577 LQTGQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVD 632

Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                    S ++  ++ADPYV++  ++G + + +
Sbjct: 633 L-------GSPIVQCAVADPYVVIMSAEGHVTMFL 660



 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 116/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 775  WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 830

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 831  QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 880

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 881  VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 940

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 941  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 1000

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1001 PWPVRKI 1007


>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Nomascus leucogenys]
          Length = 1177

 Score =  280 bits (716), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 202/651 (31%), Positives = 321/651 (49%), Gaps = 115/651 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++                          T++L
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKY--------------------------TEKL 400

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           +   + A+++  + EE            S +K                       R++A 
Sbjct: 401 QEPPASAVREAADKEE----------PPSKKK-----------------------RVDAT 427

Query: 528 ASATGISKQSNYELV----ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD- 573
              +G  ++S    V    ELPGC  +WTV          +    G   + S   A DD 
Sbjct: 428 VGWSGEGQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTPEADDDC 487

Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
             H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+
Sbjct: 488 RRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRL 546

Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
           L+G      L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 547 LEG---VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 587



 Score =  107 bits (268), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 702 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 757

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 758 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 807

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                   N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 808 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 867

Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 868 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 927

Query: 992 YWPVQKV 998
            WPV K+
Sbjct: 928 PWPVXKI 934


>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Saimiri boliviensis boliviensis]
          Length = 1390

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 203/645 (31%), Positives = 322/645 (49%), Gaps = 105/645 (16%)

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
           S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G 
Sbjct: 2   SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSAR 229
                 P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  + 
Sbjct: 55  VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--------------------------GGV 321
           ++ T K HP+IWS  +LP D  + LAVP PI                          GGV
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGENPGGAEGSAGRGAVSLPTSLCPPPGGV 226

Query: 322 LVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
           ++   N++ Y +QS     +ALN+      +     +    + LD A AT++  D  ++S
Sbjct: 227 VIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQATFISYDKMVIS 286

Query: 381 TKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
            K G++ +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T 
Sbjct: 287 LKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTE 346

Query: 440 G----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASNNT 494
                  +++  +G KEE    +    +T           QD V+  E+ +YGS A + T
Sbjct: 347 KLQEPPASAVREAGDKEEPPSKKKRVDATAGWSAGGKSVPQDEVD--EIEVYGSEAQSGT 404

Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASATG 532
           + A  T+SF V DS++NIGP  + + G                L I        + + + 
Sbjct: 405 QLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSV 462

Query: 533 ISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAYL 579
           + K    ++V   ELPGC  +WTV          +    G   + S   A DD   H +L
Sbjct: 463 LQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEENPKGEGTEQEPSTPEADDDSRRHGFL 522

Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
           I+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G   
Sbjct: 523 ILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNVGDDRYIVQVSPLGIRLLEG--- 578

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
              L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 579 VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616



 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 77/264 (29%), Positives = 123/264 (46%), Gaps = 49/264 (18%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 731 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 787 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE---------------ETPHGAPCQ--RIT 915
               LS  N+       +RF + P +   RE               E   GA  +  R  
Sbjct: 830 ----LSQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSAEEGAGARGRVARFR 879

Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
            F++I G+ G F+ G  P W +V  R  LR+HP   DG + +F   HN+NC  GF+Y   
Sbjct: 880 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNR 939

Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
           QG L+I  LP+  +YD  WPV+K+
Sbjct: 940 QGELRISVLPAYLSYDAPWPVRKI 963


>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 2 [Cavia porcellus]
          Length = 1387

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 210/676 (31%), Positives = 333/676 (49%), Gaps = 88/676 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   + + A     +      GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +              I      ++SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 83  ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+  + 
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  D E      KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPAST--VREAADKEEPPSKKKRV 423

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479

Query: 522 --------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH--- 555
                         L I        + + + + K    ++V   ELPGC  +WTV     
Sbjct: 480 EPAFLSEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVR 539

Query: 556 ------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
                   + G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  Q
Sbjct: 540 KEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQ 598

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
           G T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++AD
Sbjct: 599 GPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAVAD 648

Query: 669 PYVLLGMSDGSIRLLV 684
           PYV++  ++G + + +
Sbjct: 649 PYVVIMSAEGHVTMFL 664



 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 779  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 834

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 835  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 884

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 885  VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 944

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 945  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1004

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1005 PWPVRKI 1011


>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Papio anubis]
          Length = 1389

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 204/646 (31%), Positives = 321/646 (49%), Gaps = 108/646 (16%)

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
           S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G 
Sbjct: 2   SMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGF 54

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSAR 229
                 P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  + 
Sbjct: 55  VQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSS 106

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S
Sbjct: 107 FLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAIS 166

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI-------------------------GGVL 322
           ++ T K HP+IWS  +LP D  + LAVP PI                         GGV+
Sbjct: 167 LNITQKVHPVIWSLTSLPFDCTQALAVPKPIGEYPGSGWGCVEGALSLPTSLCPPPGGVV 226

Query: 323 VVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
           V   N++ Y +QS     +ALN+      +     +    + LD A AT++  D  ++S 
Sbjct: 227 VFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRIQEGVRITLDCAQATFISYDKMVISL 286

Query: 382 KTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
           K G++ +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T  
Sbjct: 287 KGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT-- 344

Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS------DALQDMVNGEELSLYGS-ASNN 493
                  +    E  D E      KR+  ++S         QD V+  E+ +YGS A + 
Sbjct: 345 EKLQEPPASAVREAADKEEPPSKKKRVDATASWSAGGKSVPQDEVD--EIEVYGSEAQSG 402

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASAT 531
           T+ A  T+SF V DS++NIGP  + + G                L I        + + +
Sbjct: 403 TQLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALS 460

Query: 532 GISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHAY 578
            + K    ++V   ELPGC  +WTV          +    G   ++    A DD   H +
Sbjct: 461 VLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGF 520

Query: 579 LIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY 638
           LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G  
Sbjct: 521 LILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG-- 577

Query: 639 MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
               L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 578 -VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 615



 Score =  108 bits (269), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 115/247 (46%), Gaps = 15/247 (6%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 730 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 785

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 786 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 835

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                   N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 836 VRFKKVPHNINFREKKPKPSKKKAEGGGTEEGAGXRGRVARFRYFEDIYGYSGVFICGPS 895

Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 896 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 955

Query: 992 YWPVQKV 998
            WPV+K+
Sbjct: 956 PWPVRKI 962


>gi|307107849|gb|EFN56091.1| hypothetical protein CHLNCDRAFT_145620 [Chlorella variabilis]
          Length = 1626

 Score =  271 bits (692), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 313/1247 (25%), Positives = 478/1247 (38%), Gaps = 314/1247 (25%)

Query: 4    AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
            A    +H PT + +C + ++TH++          Q               P+P+LVV  +
Sbjct: 7    AVCTQVHPPTAVTHCTAAWLTHAQRQ--------QGSGSADGDDGGGSGDPLPDLVVVRS 58

Query: 64   NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
              +E+Y VR  E G   + ++             A SL+ +   RL G  ES+A+L +G 
Sbjct: 59   TQLELYSVRGSEAGGPATTHT-------------AQSLDQLASCRLFGVAESVAVL-RGR 104

Query: 124  ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
            A    +RD ++L F DAK+SVL +D   H L  +S+H FE      LK GR  F   PL 
Sbjct: 105  APG--QRDVLLLTFRDAKLSVLHWDAGRHELAPSSLHYFEGDA--SLKLGRTVFPYPPLA 160

Query: 184  KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG-------------------- 223
              DP GRCG V+++  Q+ +L A         D + FG G                    
Sbjct: 161  VTDPLGRCGAVIIFRHQLAVLPAV--------DSELFGLGLSAAEEDEEEAAATAALGLA 212

Query: 224  --------------------------GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIE 257
                                         +A + +S+V NL    +K V+D  F+HGY E
Sbjct: 213  PPDGGGAADGEAGAPRGGAAAAAAGLPAAAAAVGNSYVDNLGKAGIKEVRDACFLHGYSE 272

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            PV+++LHE E TWAG +  K  TC+++ALS++ T K HP IW A  LP DAY+L A  +P
Sbjct: 273  PVLMVLHEAEPTWAGNLRQKKDTCVLTALSLNLTRKHHPKIWGAQELPSDAYRLSA--AP 330

Query: 318  IGGVLVVGANTIHYHSQSASCALALN---------------------------------- 343
             GGVLV+  + + ++ Q     + L+                                  
Sbjct: 331  CGGVLVLCQHLVLHYRQGQQSGVVLHPSALPPAAAPPPLLFDPQAMAEAGGPGPASAAYA 390

Query: 344  -NYAVSL------------DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT 390
              +AV +            D+SQ    ++  V  D A   WL  + ALL  ++G L+ L 
Sbjct: 391  RQHAVDVHPETVPAAVRFCDASQA---AALKVTADGASVCWLSPESALLCLRSGQLLQLA 447

Query: 391  VVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS----------------------LFFLG 426
            ++    G   + L +++   +   S   ++  +                      L FLG
Sbjct: 448  LLPQQAGGSARHLAVARAGAAPHPSCCCSLSGAHRAPHMPGSAAAAAAGQAPQPALVFLG 507

Query: 427  SRLGDSLLVQFT----CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--- 479
            S  GDSLLV+ T     G+     ++       D  AD P++KRLR    +         
Sbjct: 508  SAAGDSLLVRATPAAAAGTKRPAEAATGAAGEEDGTADEPASKRLRLEGIEVGSAAAAVE 567

Query: 480  --------------------------------NGEELSLYGSA--------------SNN 493
                                              EE  +YG+A              +  
Sbjct: 568  ATAAAAAAAQGAAAAAAEARAAAGGGPAGSDSEDEEALIYGTALYSSAAGVAPAAAAAVP 627

Query: 494  TESAQ-KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWT 552
            T S Q + +   V DSL NIGPL+DF+        A A G +           G  G  T
Sbjct: 628  TPSWQLQRYQLKVLDSLANIGPLRDFAVA---EPAAGAGGEAVPPALVGCSGEGKGGTLT 684

Query: 553  VYHKS----------------SRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
            V  +S                  G    +   A  +  +HAYL++S +  T VL T + L
Sbjct: 685  VLRRSVVPDVITEHRGAASASGGGSGQAAGEAAGQEGGHHAYLLLSFQGATKVLATGEEL 744

Query: 597  TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD-----LSFGPSNSE 651
             EVTESV++ V   T+AAG++   RR+ Q F +G R+LDG    QD     L+   + + 
Sbjct: 745  REVTESVEFAVDTPTLAAGSVCCGRRIAQAFPQGLRLLDGEESVQDVWASELAAPAAAAA 804

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQT--------------- 696
            +G       ++S  + DPYVLL ++DG+ R L  DP  C +S  +               
Sbjct: 805  AGGAPGGGAIVSADMCDPYVLLYLADGTARFLTADPVACRLSAASAAGAGPEAAAAAEAA 864

Query: 697  -----PAAIESSKKPVSSCTLYHD-------KGPEPWLRKTSTDAWLSTGVGEAIDGADG 744
                 P A E     +++C+L+ D       + P+   +            G     A  
Sbjct: 865  EAALRPVAAEER---ITACSLFADSCGWLAARLPQTQQQTQQQQQQQGQQDGGTTAQAAA 921

Query: 745  GPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN 804
                 G +Y+VVC  SGA +++ +P +  VF+    ++G   ++ T    A   +     
Sbjct: 922  SGGGCGAVYAVVCRASGACQLYALPAWQPVFSSSTSLAGGPALL-TGSGGAGGVAAAAAA 980

Query: 805  SSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-----------------PFLFAILTD 847
            +++         E     +VVE+ +  +    +                  P L A+  D
Sbjct: 981  AAAAAAAAGVEDEMDGPGEVVEVRLVSFGPAAAGRRDAAAARASPAPACEPPLLLALTAD 1040

Query: 848  GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-----NLRFSRTPLDAYTR 902
              +L YQA+        ++      T R   +       L       LR  R        
Sbjct: 1041 HQLLAYQAFSASPGSGGTRGSSGSGTPRFRRLRLDLPPLLPPAGGPQLRLRRLHCFEGLG 1100

Query: 903  EETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD---------GS 953
            EE P                + G F++G  P W +  R  L  HP               
Sbjct: 1101 EEAP----------------YSGVFVAGQHPHWLVASRGGLLPHPHFLPQPAGPGAAAVG 1144

Query: 954  IVAFTVLHNVNCNHGFIYVTS--QGILKICQLPSGSTYDNYWPVQKV 998
               FT  HNVNC HGFI  TS  +  ++I QLP  +  D  WP Q+V
Sbjct: 1145 AAGFTPFHNVNCPHGFIVATSGARSGIQISQLPPRTRLDAPWPRQRV 1191


>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 isoform 1 [Cavia porcellus]
          Length = 1440

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 209/678 (30%), Positives = 332/678 (48%), Gaps = 90/678 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   + + A     +      GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRPTEGKSHREKLGAGGPPSLSF----GNVMSM 82

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +              I      ++SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 83  ASVQLXXXXXX------IALISFPQLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +L+YG ++++L       ++   GLVG+        G  +   
Sbjct: 134 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 185

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 186 PSYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 245

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+  + 
Sbjct: 246 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLG 305

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 306 TTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 365

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ + T+     FLGSRLG+SLL+++T         +    E  D E      KR+
Sbjct: 366 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTVREAADKEEPPSKKKRV 423

Query: 468 RRS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct: 424 DSTAGWAGSKTVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 479

Query: 522 ----------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYH- 555
                           L I        + + + + K    ++V   ELPGC  +WTV   
Sbjct: 480 EPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAP 539

Query: 556 --------KSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
                     + G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  + 
Sbjct: 540 VRKEEEETPKAEGSEQEPSAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDTS-GFA 598

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI 666
            QG T+ AGN+   R ++QV   G R+L+G      L F P +         + ++  ++
Sbjct: 599 TQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GAPIVQCAV 648

Query: 667 ADPYVLLGMSDGSIRLLV 684
           ADPYV++  ++G + + +
Sbjct: 649 ADPYVVIMSAEGHVTMFL 666



 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 114/247 (46%), Gaps = 15/247 (6%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            + ++  E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T 
Sbjct: 781  WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSSGQPTTQGEVR----KEEATR 836

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
            QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 837  QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 886

Query: 873  TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                    N++    +     +      T E +       R   F++I G+ G F+ G  
Sbjct: 887  VRFKKVPHNINFREKKPKPSKKKAEGGSTDEGSGVRGRVARFRYFEDIYGYSGVFICGPS 946

Query: 933  PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 947  PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1006

Query: 992  YWPVQKV 998
             WPV+K+
Sbjct: 1007 PWPVRKI 1013


>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 1 [Callithrix jacchus]
          Length = 1394

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 201/672 (29%), Positives = 322/672 (47%), Gaps = 124/672 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN    + +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSAEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
            SVLT+ ++                    F C +G   +                     
Sbjct: 367 ASVLTTSVSGTEG----------------FLCAAGGKSVP-------------------- 390

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------ 521
                   QD    +E+ +YGS + +  +   T+SF V DS++NIGP  + + G      
Sbjct: 391 --------QD--EXDEIEVYGSETQSG-TQLATYSFEVCDSILNIGPCANAAMGEPAFLS 439

Query: 522 ----------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVY-------- 554
                     L I        + + + + K    ++V   ELPGC  +WTV         
Sbjct: 440 EEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEE 499

Query: 555 -HKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
            +    G   + S   A DD   H +LI+S E  TM+L+T   + E+  S  +  QG T+
Sbjct: 500 ENPKGEGTEQEPSTPEADDDSRRHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTV 558

Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
            AGN+   R ++QV   G R+L+G      L F P +         + ++  ++ADPYV+
Sbjct: 559 FAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFVPVDL-------GAPIVQCAVADPYVV 608

Query: 673 LGMSDGSIRLLV 684
           +  ++G + + +
Sbjct: 609 IMSAEGHVTMFL 620



 Score =  110 bits (276), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 70/247 (28%), Positives = 116/247 (46%), Gaps = 15/247 (6%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 735 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 790

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++  S   + 
Sbjct: 791 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQLSQGNLK 840

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                   N++    +     +      T E         R   F++I G+ G F+ G  
Sbjct: 841 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFRYFEDIYGYSGVFICGPS 900

Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           P W +V  R  LR+HP   DG + +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 901 PHWLLVTGRGALRLHPMGIDGPVDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 960

Query: 992 YWPVQKV 998
            WPV+K+
Sbjct: 961 PWPVRKI 967


>gi|55725165|emb|CAH89449.1| hypothetical protein [Pongo abelii]
          Length = 565

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 182/536 (33%), Positives = 282/536 (52%), Gaps = 65/536 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPS 463
            SVLT+ + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +
Sbjct: 367 ASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDA 426

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
           T     +     QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 427 TAGWSAAGKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 482

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTV 553
                          L I        + + + + K    ++V   ELPGC  +WTV
Sbjct: 483 PAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTV 538


>gi|281205270|gb|EFA79463.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
          Length = 1395

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 209/734 (28%), Positives = 340/734 (46%), Gaps = 109/734 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLV+   +++++Y +R      ++ +     +++   D +    LEL    +L   +ESL
Sbjct: 31  NLVIAKTSLLQVYTIRYDRIEQQQQQQQQTNEQQSQQDTLKPW-LELNLELQLFSIIESL 89

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G D     DS+IL+F DAK+S+++++ +   L I S+H FE      LK GR++
Sbjct: 90  NCVRLPGDD----IDSLILSFRDAKVSIVKYNKATEKLDIRSLHYFEGNS--ELKGGRKT 143

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG-------------------DE 217
           F   PL++VD Q RC  +L+Y   + +L   +  S L                     DE
Sbjct: 144 FRTPPLIRVDYQQRCAVMLLYDRHLAVLPFPRSFSILDDEEEEEEEEAAVVADQQQQHDE 203

Query: 218 D-----------TFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
           +              S      +   S+VI+L  L +++VKDF F+H Y EP ++ LHE 
Sbjct: 204 NEQQQPQDDQQQQQTSEKNKKKKQSESYVISLNSLGIENVKDFCFLHTYYEPTLLFLHEP 263

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
             TW  R+S K  T +++A+S++   +Q P+IWS  +LP++  +L+ VP P+GG +V+  
Sbjct: 264 SQTWTSRISSKKFTNVLTAVSLNIAQRQQPVIWSIEHLPYNCERLVPVPDPLGGAMVLTP 323

Query: 327 NTIHYHSQSASCALALNNYA-VSLDSSQELPRSSFSVE----LDAAHATWLQNDVALLST 381
           N + Y +QS+   L  N YA +      + P  S S      LD A+  +L  D  L S 
Sbjct: 324 NILFYFNQSSRYGLECNEYAQIDTGDQFQFPIDSSSTNLVFTLDCANFIFL-GDRLLGSL 382

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT--- 438
           K G+L++  ++ DGR VQR+ ++K   SVL+S    + ++L FLGSRLGDSLL+Q+T   
Sbjct: 383 KGGELLIFHLISDGRNVQRISITKAGASVLSSTSCVLTDNLLFLGSRLGDSLLLQYTEKI 442

Query: 439 ----CGSGTSMLSSGLKE----EFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
                      LS+  K+    E  D+  D     +   S +D     +  +E  ++   
Sbjct: 443 IDVDSSDNVENLSNPYKKKKTSEVFDLFDDEERNSKTGASDADGNGQSLFDDEDDIF--- 499

Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN-ADASATGISKQSNYELV------- 542
            N+ ++  K++   + D + NIGP+ D   G+  + A  S     +Q + ELV       
Sbjct: 500 -NDKKNQLKSYRLNICDHITNIGPVSDLITGVSYDHASVSNDESFEQRSLELVACSGHGK 558

Query: 543 -------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
                              ELPG +  WT+Y+      +   S  +       +      
Sbjct: 559 NGALTILQYGVRPELNTSFELPGVRQSWTLYYDDPLAASQSGSSASNAAASAASKKRQHE 618

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
           E  T+V +T   L EV +         TI   N+FGRRR+  V + G ++L G S +TQ+
Sbjct: 619 EDSTLVFQTGGQLKEVAK-----FDHATITVANMFGRRRIALVHQNGIKLLSGHSNITQE 673

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS-TCTVSVQTPAAIE 701
           +                +V    I DPYVL+   DG+I L  G+   T  +  + P    
Sbjct: 674 IKL-------------KSVKMAYIVDPYVLILHKDGTISLYQGNTGITQLLEYELPQP-- 718

Query: 702 SSKKPVSSCTLYHD 715
             K  V SC+++HD
Sbjct: 719 --KDGVMSCSMFHD 730



 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 54/184 (29%), Positives = 90/184 (48%), Gaps = 26/184 (14%)

Query: 823  KVVELAMQRW-SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
            K+VE+ +    ++ HS P+L  +   G IL Y+A          K  D +  ++ L    
Sbjct: 872  KIVEIVIHYLHNSPHSSPYLMILNEFGDILIYKAI---------KYKDSMDNTKEL---- 918

Query: 882  VSASRLRNLRFSRTPLDAYTREETPHGAP-------CQRITIFKNISGHQGFFLSGSRPC 934
                 +R ++ +   L +  RE +    P        ++I  F NI GH+G F+ G R  
Sbjct: 919  -----IRFIKHTDQNLHSKQREYSYGIDPSSESSFYIRKIVAFDNIGGHKGVFMCGKRSL 973

Query: 935  WCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
            W    +  LR HP      + +FT  HN+NC++GFIY T +G+L+I QL +   ++N W 
Sbjct: 974  WFFCEKNYLRAHPMNFKDPVTSFTCFHNINCSYGFIYFTEKGVLRINQLSNMMNFENEWA 1033

Query: 995  VQKV 998
            ++K+
Sbjct: 1034 IRKI 1037


>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
 gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
          Length = 1453

 Score =  248 bits (634), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 198/738 (26%), Positives = 332/738 (44%), Gaps = 119/738 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS-AASLELVCHYRLHGNVES 115
           NLV++  N +++Y +       K  KN   T ++  +  +    SLEL+   +L G +ES
Sbjct: 31  NLVLSKNNTLQVYKI-------KYVKNENTTTQQKQIKKVEIKPSLELLIELKLFGTIES 83

Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
           +A +   G +    +DS++L F DAKISVL+++  I    I S+H +E+ E+   K GR 
Sbjct: 84  MASVRYPGEN----KDSLLLTFRDAKISVLDYNIDIMDFEIRSLHFYENDEF---KNGRI 136

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
            F   P++K+D Q RC  +L+Y   +++L   Q  S L  +++             ++  
Sbjct: 137 HFKHPPILKIDTQQRCATMLLYDRNIVVLPFKQISSILDDEDEEEKDEEDEKENDNANQD 196

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
                 D     +F F++GY EP ++ LHE   TW  R++ K  T  ++A+SI+ + K  
Sbjct: 197 YTEEFDDDDDDNNFCFLYGYYEPTILFLHEPSQTWTSRIAVKRLTSQLTAISINFSTKLA 256

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA-VSLDSSQE 354
            +IW   N+P++  +L++VP P+ G LV+  N + + +Q++   LA+N YA + +    E
Sbjct: 257 SIIWHTSNMPYNCDQLVSVPEPLSGALVITPNIMFHVNQTSKYGLAVNEYANIDIGDKFE 316

Query: 355 LPRS---SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
            P     +    LD ++  +L+ D  + S K G+L++  ++ DGR VQR+ +SK   SVL
Sbjct: 317 FPLDETLNLVFTLDRSNFVFLEADKFIGSLKGGELLIFHLISDGRTVQRIHVSKAGGSVL 376

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
            + +  + ++L FLGSRLGDSLL+Q+T  S T        +E  + E  +   K+ + S 
Sbjct: 377 ATCMCVVSDNLLFLGSRLGDSLLLQYTEKSIT--------DESLEHENFSNPYKKQKTSE 428

Query: 472 SDAL-----------QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
            + L            D V  EE  L+    N  +S Q      + D ++N+GP+ D   
Sbjct: 429 QEKLLNQQQQQQKDEMDEVLDEEDELFKEKKNQLKSYQ----LGICDQILNVGPVGDMVI 484

Query: 521 GLRINADASATGISKQSNY--ELVELPGCKG----------------------------- 549
           G  +N       +     Y    +EL  C G                             
Sbjct: 485 GQALNPTYDLNTLPSDPAYMPRFLELVTCSGYGKNGSISILQNSVKPEIVGAFDSEGVVN 544

Query: 550 -IWTVYHKSSRGHNADSSR------------------------MAAYDDEYHAYLIISL- 583
             WTVY+K+S     D                               +++Y  YL IS+ 
Sbjct: 545 SFWTVYNKASSSIKEDEEEKLIGKKRTINEIIKEEQQYEQQQQKQPIEEDYLDYLYISMS 604

Query: 584 EARTMVLETADLLTEVTESVDYF---VQGRTIAAGNLFGRRRVIQVFERGARIL-DGSYM 639
              T +L+T    T   E    F    + RT+  GNLF +RR++ + E   ++L D + +
Sbjct: 605 NGTTNILDT----TSSEEGKLTFKGEFEYRTLDMGNLFNKRRIVLINENSIKLLNDYNNI 660

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
            Q++                 + S  I DPYVL+  SD SI+L   D     ++    + 
Sbjct: 661 VQEIKLS------------KPIKSTFIQDPYVLVHYSDNSIQLFKCDYKLLKLNQFNFSL 708

Query: 700 IESSKKPVSSCTLYHDKG 717
               +  V + +L+ DK 
Sbjct: 709 NHGDEGKVLTSSLFFDKN 726



 Score = 52.8 bits (125), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 43/182 (23%), Positives = 87/182 (47%), Gaps = 27/182 (14%)

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
             ++++VE++++    ++S+P+L      G ++ Y+++  E  +   K  +     R LS 
Sbjct: 876  ENLEIVEISLE--ILNNSQPYLLLKNRIGDLIVYKSFKKENGDLRFKKYNHNFILRDLSN 933

Query: 880  SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
            ++ S +            D Y +         + I   K  S + G F+ G +P W  +F
Sbjct: 934  NSKSINS-----------DGYRK---------KSIVNIKLSSKNNGVFIGGQKPVW--IF 971

Query: 940  RER--LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQ 996
             E+  +R+H    DG+IV+    HN +C +GF+Y T  +  +KI  L     ++N + ++
Sbjct: 972  NEKGYIRLHSMDFDGAIVSLKPFHNADCPNGFLYYTEDKQHIKIGYLNGLMNFENEYAIR 1031

Query: 997  KV 998
            +V
Sbjct: 1032 RV 1033


>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
          Length = 2290

 Score =  238 bits (607), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 142/412 (34%), Positives = 222/412 (53%), Gaps = 38/412 (9%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---GISAASLELVCHYRLHGNV 113
           NL V     +E+YV++ +E+   ++ N  +  ++   D   G   A+L+ V  Y L+GNV
Sbjct: 31  NLAVVKGTQLELYVLKEEEKKHSKTCNGKQNGQKAAGDSGHGHGGATLQCVGRYDLNGNV 90

Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
           ES+A +   G    R RD + L F DAK+S+LE+D+SI  +   S+H FE  E   +++G
Sbjct: 91  ESMAFVRLPG----RNRDHLFLVFRDAKLSILEYDNSIDDIVNVSLHLFEDDE---IRKG 143

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF------------- 220
           R SF R PL++VDP  RC  +LVY  +M+++     GS L  D++               
Sbjct: 144 RVSFGRAPLLRVDPLQRCAALLVYESKMVVIPFKHKGSDLEEDDEILTQPNKKFKSESAS 203

Query: 221 -------GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                  G+       I  ++V++L +  +KHV DF F+ GY EP +  LHE   TWAGR
Sbjct: 204 SNTVTRLGAPSDNKLGILPTYVVDLDEAGIKHVVDFTFLDGYYEPTISFLHENSRTWAGR 263

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           ++  + T MI+ +S++ + ++ P+IWSA  LPH++  ++A+P+P GGV+VV +N + Y +
Sbjct: 264 LAVSNFTGMITTVSLNISQRRQPIIWSASKLPHNSRHIVALPAPAGGVVVVSSNALIYRN 323

Query: 334 QSASCALALNNYAVSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
               CAL LN YA++  D       +   +  D  H   L+    L S  TG+  ++ V 
Sbjct: 324 HEQKCALKLNEYAIAAGDGGNRFDTAGDIICFDTVHPVRLEGYQMLFSLVTGESYIMGVQ 383

Query: 393 Y--DGRVVQRLDLS----KTNPS-VLTSDITTIGNSLFFLGSRLGDSLLVQF 437
              DG  ++ L L     K +PS    S +  +G+S  FLGSRLGDS LV+ 
Sbjct: 384 LDTDGNTIKALTLDLVDVKLSPSGGFASIMCRVGDSYLFLGSRLGDSSLVKM 435



 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 76/314 (24%), Positives = 135/314 (42%), Gaps = 77/314 (24%)

Query: 501 FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------------ 542
           + F + D+L NIGP+     G R++A     G  K+ + ELV                  
Sbjct: 587 YRFELCDTLTNIGPI-----GSRLDA-----GAVKKDSVELVTASGGLQYGKLGVLQRSL 636

Query: 543 --------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE----YHAYLIISL--EARTM 588
                    LP  + +WTV+  +++  + D       ++E     HAY++IS   +  T+
Sbjct: 637 NPVVMTAVPLPDAQAVWTVFGPTAKAADEDMEEDGNEEEEQSAGMHAYMVISQGNDKGTI 696

Query: 589 VLETADLLT-EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
           VL+  +L   +  E VD+ V  +T+  GN+FG +R++QV      +L+G    Q+L    
Sbjct: 697 VLKGRELEEFDEDEQVDFEVDAKTVCVGNIFGNQRIVQVTPWNVYVLNGPRKEQELPV-- 754

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
               +G+G +   +++  I DPY+ L + DG + LLVGD S+  V+      +      +
Sbjct: 755 ---VAGNGLQ---IVAAYIRDPYIALILQDGRLNLLVGDASSMQVNY-----VSHEIHNI 803

Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
           ++   + D  P+                GEA D        Q D+       +G  +++ 
Sbjct: 804 TAACFFLDPIPD----------------GEANDDP-----QQRDVMLAAAPRNGHFQLYT 842

Query: 768 VPNFNCVFTVDKFV 781
           +P+   V+    FV
Sbjct: 843 LPSLELVYDAADFV 856



 Score = 45.1 bits (105), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 8/92 (8%)

Query: 913  RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD--GSIVAFTVLHNVNCNHGFI 970
            R+       G +G  ++  +P   +  R   R+HP   D    + +    +N+ C  G +
Sbjct: 1068 RLMPLGGAGGLEGVLIAARQPAVVLFGRGLPRIHPWKLDRGEGVRSAARFNNLQCKDGIV 1127

Query: 971  YVT------SQGILKICQLPSGSTYDNYWPVQ 996
             +       ++G+LKIC +P G + D  WP++
Sbjct: 1128 CIADKGRDRAKGVLKICNIPEGISGDTPWPLR 1159


>gi|9794904|gb|AAF98386.1| cleavage and polyadenylation specificity factor [Drosophila
           melanogaster]
          Length = 507

 Score =  237 bits (605), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 162/497 (32%), Positives = 258/497 (51%), Gaps = 49/497 (9%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESK-NSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV  ANV+++Y +    E S+  K N  E +    M       LE +  Y L+GNV S
Sbjct: 29  NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLAPKM------RLECLATYTLYGNVMS 82

Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
           L  +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +   ++ G  
Sbjct: 83  LQCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDD---IRGGWT 135

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR-- 229
                P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 230 IESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
           I +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
           ++   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS         Y V
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVP------PYGV 309

Query: 348 SLDSSQE-------LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQ 399
           SL+SS +        P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+
Sbjct: 310 SLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVR 369

Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------ 447
                K   SVLTS I  + +   FLGSRLG+SLL+ FT    +++++            
Sbjct: 370 NFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQ 429

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
             L++E  ++E +     +L  + + A    +  EEL +YGS +  +    + F F V D
Sbjct: 430 RNLQDEDQNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCD 488

Query: 508 SLVNIGPLKDFSYGLRI 524
           SL+N+ P+     G R+
Sbjct: 489 SLMNVAPINYMCAGERV 505


>gi|58702050|gb|AAH90169.1| LOC564406 protein, partial [Danio rerio]
          Length = 416

 Score =  229 bits (584), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 129/342 (37%), Positives = 203/342 (59%), Gaps = 14/342 (4%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 66  LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L   +     + DE   
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT---LADEQEG 175

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
             G G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ + 
Sbjct: 176 IVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
            TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS   
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPP 295

Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
             ++LN+      +    P+    + LD + A+++ +D  ++S K G++ +LT++ DG R
Sbjct: 296 FGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMR 355

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            V+     K   SVLT+ + T+     FLGSRLG+SLL+++T
Sbjct: 356 SVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT 397


>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Hydra magnipapillata]
          Length = 1122

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 193/664 (29%), Positives = 305/664 (45%), Gaps = 124/664 (18%)

Query: 57  NLVVTAANVIEIYVV----RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           NLV      + +Y +     V  +G + SK         ++D +    LEL+  + L GN
Sbjct: 29  NLVTAGGQRLNVYRLCDADMVVSDGDQSSK---------IVDSVGKRRLELLASFTLFGN 79

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           + ++ ++  G    S  RDS++LAF+ AK+S++EFD   H L+  SMH FE+ E+   K 
Sbjct: 80  IINMQVVRLG----SNVRDSLLLAFKHAKLSIVEFDPLSHDLKTDSMHYFENDEF---KG 132

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
           G       PLV+VDP+ RC  +L+Y   +++L        +  DE    S G     +  
Sbjct: 133 GLSHNIYLPLVRVDPEQRCACMLIYNRHLVVLPFKHD---IKLDESEELSDGEHIKSVLP 189

Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S++I+L  L+  + ++ +  F+HGY +P ++ L E   T  GRV+ +  T  +SA+S++ 
Sbjct: 190 SYMIDLHSLEQPLLNITELQFLHGYHQPTLMFLFEPVQTSTGRVAVRQDTFCVSAISLNM 249

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
           T K HP+IWS  NLP D + L  +  PIGGVLV  +N++ Y +QS      +  Y VSL+
Sbjct: 250 TEKVHPVIWSVTNLPFDCHMLRPIEKPIGGVLVFASNSLIYLNQS------IPPYGVSLN 303

Query: 351 SSQE----LP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
           S  E     P   +    + L  +    +  D  +LS K G++ +L+++ DG R V+   
Sbjct: 304 SITEGSTMFPLKIQEDVVITLAESSCDAIATDQFILSLKGGEIYVLSLLSDGLRTVRSFH 363

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
             K   SVL S +  I +   FLGSRLG+SLL+++T                   E D+ 
Sbjct: 364 FEKAAGSVLASCVCWIEHGFVFLGSRLGNSLLLRYT-------------------EKDSA 404

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
           S     +S    ++ M  G                         DSL+NIGP+   + G 
Sbjct: 405 SIA--EKSKEAKVEKMYGGGVGGGIIVC----------------DSLLNIGPITKAALGE 446

Query: 523 RINADASATGISKQSNYELV--------------------------ELPGCKGIWTVYHK 556
                    G S+Q + E+V                          ELPGC  +WTV  K
Sbjct: 447 PAFLSEEFFG-SRQIDLEMVCCSGYGKNGTLTVLQRSIRPQVVTTFELPGCVNMWTVCGK 505

Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
           SS+             + YH+YLI+S +  TMVL+T   +TE+  S  + VQ  TI A N
Sbjct: 506 SSKESV----------ENYHSYLILSRDDSTMVLKTGAEITELDNS-GFNVQQPTIFACN 554

Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
               + ++QV  +   +L+ +     +S            +   +   SI+DPYV++  S
Sbjct: 555 HLSNKYILQVCPQSIHLLEDTVQINSISL----------QDTIKITQCSISDPYVVMVDS 604

Query: 677 DGSI 680
            G +
Sbjct: 605 TGQL 608


>gi|147799623|emb|CAN68460.1| hypothetical protein VITISV_027523 [Vitis vinifera]
          Length = 558

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 119/175 (68%), Positives = 142/175 (81%), Gaps = 5/175 (2%)

Query: 424 FLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
           F GS+LGDSLLVQFT     S+ SS +++  GDIE B PS KR RRSSSDALQDMVNG++
Sbjct: 331 FEGSQLGDSLLVQFT-----SIPSSSVEKRVGDIEGBVPSAKRSRRSSSDALQDMVNGDK 385

Query: 484 LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE 543
           L LYGSA N+TE++QKTFSF+V DSL+++GPLKDF+YGLRINAD  ATGI KQ     VE
Sbjct: 386 LPLYGSAPNSTETSQKTFSFSVNDSLIDVGPLKDFAYGLRINADLKATGIVKQKMITEVE 445

Query: 544 LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
           LPGC+ IWTVYHK++RGHNADS++M   DDEY AYLIIS E+RTMVLET +LL E
Sbjct: 446 LPGCERIWTVYHKNTRGHNADSTKMITKDDEYCAYLIISPESRTMVLETVELLGE 500


>gi|149512998|ref|XP_001514888.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Ornithorhynchus anatinus]
          Length = 831

 Score =  225 bits (573), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 215/738 (29%), Positives = 342/738 (46%), Gaps = 141/738 (19%)

Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S++I++R LD K  ++ D  F+HGY EP ++IL+E   TW GRV+ +  TC I A+S++ 
Sbjct: 8   SYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILYEPNQTWPGRVAVRQDTCSIVAISLNI 67

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
             K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS      +  Y VSL+
Sbjct: 68  LQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQS------VPPYGVSLN 121

Query: 351 S----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLD 402
           S    +   P   R    + LD A A ++  D  ++S K G++ +LT++ DG R V+   
Sbjct: 122 SLTAGTTAFPLRLREGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFH 181

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA- 461
             K   SVLT+ + T+     FLGSRLG+SLL+++T         S  +E   D  AD  
Sbjct: 182 FDKAAASVLTTCMITMEPGYLFLGSRLGNSLLLKYTEKLQEPPAGSA-REPARDSGADKQ 240

Query: 462 -PSTKRLRRSSS-------DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNI 512
            P  K+ R   +        A QD V+  E+ +YGS A + T+ A  T+SF V DS++NI
Sbjct: 241 EPPVKKKRVEQALSWAGGKSAAQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNI 296

Query: 513 GPLKDFSYG----------------LRI------NADASATGISKQSNYELV---ELPGC 547
           GP  + + G                L I        + + + + K    ++V   ELPGC
Sbjct: 297 GPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGC 356

Query: 548 KGIWTVYHK-------SSRGHNADSSRMAAY---DDEYHAYLIISLEARTMVLETADLLT 597
             +WTV          S +G  A+S         D + H +LI+S E  TM+L+T   + 
Sbjct: 357 YDMWTVIAPVRKEEGDSPKGEGAESEPTPPEPEDDGKRHGFLILSREDSTMILQTGQEIM 416

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSE 657
           E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P +        
Sbjct: 417 ELDTS-GFATQGPTVYAGNIGDDRYIVQVSPLGLRLLEG---VNQLHFIPVDL------- 465

Query: 658 NSTVLSVSIADPYVLLGMSDGSIR--LLVGDP---STCTVSVQTPAAIESSKKPVSSCTL 712
            S ++  ++ADPYV++  ++G +   LL  D     T  +++  P  + S  K ++ C +
Sbjct: 466 GSPIVQCAVADPYVVIMSAEGHVTMFLLKSDSYGGRTHRLALHKP-PLHSQSKVIALC-V 523

Query: 713 YHD-----------KGP--EPWLRKTSTDAWLSTGVGEAIDGADG--------------- 744
           Y D            GP  +P LR  S    L   +   +D  +                
Sbjct: 524 YRDVSGMFTTESRASGPRDDPSLRGQSEAEPLLQELSHTVDDEEEMLYGDSSSLFSPSRD 583

Query: 745 -------GPLDQGDI--------YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD 789
                   P D+           + V+  ++GA+EI+ +P +  VF V  F  G+  +VD
Sbjct: 584 EPRRSSLPPADRDAPQYRAEPTHWCVLVRDNGAMEIYQLPEWRLVFLVKNFPMGQRVLVD 643

Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAI----- 844
           +   +    S  +  +  EE   QG    +  + +V L  ++     +RP+L  +     
Sbjct: 644 SSFGQPAA-SAAQAEAKKEEPARQGELPLVKEVLLVALGNRQ-----TRPYLLRLKWAIR 697

Query: 845 ---LTDGTILCYQAYLFE 859
              LT  T +  Q Y+ +
Sbjct: 698 DSELTSITFIDMQLYIHQ 715


>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Amphimedon queenslandica]
          Length = 1408

 Score =  224 bits (571), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 235/880 (26%), Positives = 376/880 (42%), Gaps = 207/880 (23%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +A Y+ +H PTG+ +C S    HS  + V                            V +
Sbjct: 2   YAVYREVHPPTGVEHCTSCHFVHSEKEQV---------------------------AVAS 34

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
            +++ I+ V      ++  +N G+ K            L     +  HGN++SL  +   
Sbjct: 35  TSLLRIFDV------AQLQRNDGKAK------------LVQCLEFSFHGNIQSLDKVRLR 76

Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
            +D    RD ++L+F DAK+S++E++   +GL+  SMH FE  E   ++ G       P+
Sbjct: 77  HSD----RDCLLLSFNDAKLSIVEYNPETNGLKTVSMHQFEDEE---IRGGILHNDSRPV 129

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           VKVDP+GRC  +L++G  + +    Q    L  D     S    +  I  ++ I+LRDL 
Sbjct: 130 VKVDPEGRCAVMLLFGSHLAVCPFQQD---LSIDTPLSPSPSLDTHDILPTYTISLRDLP 186

Query: 243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
             +  +KD  F+ GY  P ++ L E   TWAGR+S +  + M+  LS++T+ K H +IW+
Sbjct: 187 EPLPVIKDMTFIEGYTSPTLLFLSEVSPTWAGRISLRQDSMMLLGLSLNTSDKSHTVIWT 246

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALN---NYAVSLDSSQELP 356
             NLP D+  L  VP P+GGVLV GANT+ Y +QS+    L+LN   +Y        E  
Sbjct: 247 LKNLPFDSSYLHPVPKPLGGVLVFGANTLIYLNQSSPPYGLSLNSITDYTTRFLLKNE-- 304

Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSD 414
             S  + LD + + ++ N+  L+S ++GD+ ++T+  D   R V+R+   K   S+L+S 
Sbjct: 305 -GSLGIRLDCSQSVFISNEQLLVSLQSGDIYIVTLFPDSGMRGVKRITFDKAASSILSSC 363

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           I +I     FLGSRL +SLL+++     ++ +   + E  G                  A
Sbjct: 364 ICSIKPHFLFLGSRLANSLLLRY-----STTVKQNIVEPIG-----------------GA 401

Query: 475 LQDMVNGEELSLYGSASNNTESAQK----TFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
           + D+   +++ +YG ++ +  ++       +S  V DSL+ IGP+   + G    A  S 
Sbjct: 402 ILDL---DDIEVYGESAVSQSTSSSSLLTNYSLEVCDSLLCIGPVVKATIGE--PAFLSE 456

Query: 531 TGISKQS-NYELV--------------------------ELPGCKGIWTV---------- 553
             + K   + ELV                          ELPGC  +WTV          
Sbjct: 457 EFVDKSDLDLELVLCSGHGKNGALSVLQRTIRPQVVTTFELPGCIDMWTVKSEGEEEEKG 516

Query: 554 ----YHKSSRGHNADSSRMAAYDDE-YHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
                   + G   D SR         H YLI+S    TMVL+T   +TE+ +S  +  Q
Sbjct: 517 EETKEEGQNEGGEKDQSREKEEKGSGQHDYLILSRSDSTMVLQTGQEITELDQS-GFATQ 575

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDG----SYMTQDLSFGPSNSESGSGSENSTVLSV 664
             T+ AGN+     ++Q      R+L G     Y+  D+  G              V  V
Sbjct: 576 SATVFAGNV--GSFIVQATRTDIRLLKGIKQLCYVALDMGGG--------------VKCV 619

Query: 665 SIADPYVLLGMSDGSIRLL--------VGDPST--------------------CTVSVQ- 695
            +  PYV++ + +G I LL        +  PS                      T S+Q 
Sbjct: 620 DVCSPYVIVLLMEGEIGLLKLVDESLVLSWPSLGNNTPVNHISAYTDTSGLFDVTSSLQF 679

Query: 696 ----------TPAAIESSKKPVSSCTLYHDK-----GPEPWLRKTSTDAWLSTGVGEAID 740
                      P A    K+P  S +L +D+     GP     K    + +   +     
Sbjct: 680 EGDGSEKEEEVPIAPPPVKRPHLSSSLLYDEDELLYGPVKTEVKEENASPMEASLAAE-- 737

Query: 741 GADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
             +  P      + ++C E GALEI+ VP F  VF V  F
Sbjct: 738 -PEAPPPITPTHWCLLCKEDGALEIYSVPEFQFVFAVRNF 776



 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 1/83 (1%)

Query: 917 FKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
           F NI+G+ G F+ G  P W  M  R  L +HP   DG + +F    NVNC  GF+Y   +
Sbjct: 894 FSNIAGYSGVFVCGPYPHWIFMAARGHLSIHPMYIDGPVQSFAPFDNVNCPSGFLYFNKE 953

Query: 976 GILKICQLPSGSTYDNYWPVQKV 998
             L+I  LP+  +YD+YWPV+KV
Sbjct: 954 SELRISVLPTQLSYDSYWPVRKV 976


>gi|33411762|emb|CAD58786.1| cleavage and polyadenylation specificity factor 1 [Bos taurus]
          Length = 880

 Score =  221 bits (564), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 161/497 (32%), Positives = 252/497 (50%), Gaps = 62/497 (12%)

Query: 233 SHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ 
Sbjct: 8   SYIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNI 67

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAVSL 349
           T K HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+     
Sbjct: 68  TQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGT 127

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNP 408
            +     +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   
Sbjct: 128 TAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAA 187

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
           SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  D E      KR+ 
Sbjct: 188 SVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRVD 245

Query: 469 RS-----SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG- 521
            +     S    QD V+  E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G 
Sbjct: 246 ATTGWSGSKSVPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMGE 301

Query: 522 ---------------LRI------NADASATGISKQSNYELV---ELPGCKGIWTVYHKS 557
                          L I        + + + + K    ++V   ELPGC  +WTV    
Sbjct: 302 PAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPV 361

Query: 558 SR---------GHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
            +         G   +     A DD   H +LI+S E  TM+L+T   + E+  S  +  
Sbjct: 362 RKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQTGQEIMELDAS-GFAT 420

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           QG T+ AGN+   R ++QV   G R+L+G      L F P +         S ++  ++A
Sbjct: 421 QGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDL-------GSPIVQCAVA 470

Query: 668 DPYVLLGMSDGSIRLLV 684
           DPYV++  ++G + + +
Sbjct: 471 DPYVVIMSAEGHVTMFL 487



 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 71/247 (28%), Positives = 116/247 (46%), Gaps = 15/247 (6%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 602 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARKEEATR 657

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +   RP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 658 QGELPLVKEVLLVALG-----SRQRRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLK 707

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                   N++    +     +      T E T       R   F++I G+ G F+ G  
Sbjct: 708 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVARFRYFEDIYGYSGVFICGPS 767

Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           P W +V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 768 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDA 827

Query: 992 YWPVQKV 998
            WPV+K+
Sbjct: 828 PWPVRKI 834


>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
            suum]
          Length = 1434

 Score =  219 bits (559), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 246/990 (24%), Positives = 412/990 (41%), Gaps = 182/990 (18%)

Query: 101  LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
            LE + H RL   V+SLA+        +    S++L F+ AK+SV+ F  +   L+  S+H
Sbjct: 107  LECIIHVRLLAPVKSLAV---ARIPQNPSCSSLLLGFDTAKLSVVGFSAAERSLKTISLH 163

Query: 161  CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            CFE      LK G  +    P+++VDP  RC  +L+YG  + +L        L       
Sbjct: 164  CFEEE---MLKDGYVTDLPSPVIRVDPAQRCAVMLIYGRYLAVLPFDDTSPHL------- 213

Query: 221  GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                        ++ + L  +D  + ++ D  F+ GY EP ++ L+E   T AGR   ++
Sbjct: 214  -----------HTYTVALSSIDPRLVNIIDIAFLDGYYEPTLLFLYEPAQTTAGRACVRY 262

Query: 279  HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS- 337
             T  +  +S++T  + H  +W   NLP D  ++L +P PIGG L++GAN + Y +QS   
Sbjct: 263  DTVCMLGVSLNTKEQVHASVWQLNNLPMDCNQVLMIPRPIGGALIIGANELIYLNQSVPP 322

Query: 338  CALALNNYAVSLDSSQELPRSS---FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD 394
            C   LN+    +D   + P  S    ++ LD   A  +  +  ++  ++G L +LT+V D
Sbjct: 323  CGSLLNS---CMDGFTKFPLKSEKEMALTLDGCAACVISTNKVVVCARSGALFILTLVVD 379

Query: 395  G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
                V+ ++        +   +T       F+GSR+GDSL +++          S L   
Sbjct: 380  STNSVKSIEFKHEFDVSIPHTVTACSPGYLFVGSRVGDSLFIEYV---------SEL--- 427

Query: 454  FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ---KTFSFAVRDSLV 510
               +  D P  K+L+    +  QD +  E+L LYG A  +  S     +   F V D ++
Sbjct: 428  ---VPVDDPIEKKLK---VEVPQDDLEDEDLELYGKALPSVISQDVSVEKMRFRVLDRML 481

Query: 511  NIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIW--------------TVYHK 556
            N+ P K  +           +G S+  N  L E P    ++               ++ +
Sbjct: 482  NVAPCKKMT-----------SGCSEGLNSYLQEQPRLDPVFDRVCACGHGKDSSICIFQQ 530

Query: 557  SSRGHNADSSRMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
            S R     SS +            +D+ H Y+I S E  ++ LET + L E+   V +  
Sbjct: 531  SIRPDIITSSSIEGVIQYWAVGRREDDTHMYIIASKELGSLALETDNDLVELEAPV-FIT 589

Query: 608  QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
               TIAAG L      +QV      ++      Q +                 VLS SI 
Sbjct: 590  SESTIAAGELADGGLSVQVTTSSIVVVAEGQQIQLIPL----------QLTFPVLSASIV 639

Query: 668  DPYVLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH----------- 714
            DP+V +   +G + L  L   P     +V  P  I  +K P+++  +Y            
Sbjct: 640  DPFVAICTQNGRLLLYELDNTPHVHLKAVDLPGNIIHNKSPITALCIYRDMSGTIRFCSS 699

Query: 715  -------------------DKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP--------- 746
                               D   +  L   S +          I G    P         
Sbjct: 700  SSAASHGANAINTKQHIDIDDFDDMLLYGDSKNKQKEAKKKRKIVGTRQNPGETPHLETD 759

Query: 747  -LDQGDI----YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREA--LKDS 799
             +D   I    + V+  E+G L I+ +P    V+ V K     +H+ D  + E   L D 
Sbjct: 760  VVDPNTIVPSHWIVMARENGNLYIYSIPEMQLVYMVKKL----SHLPDVAIDEMNYLGDE 815

Query: 800  E---TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
                ++I S++       + E I    +VE+ +     +  RP LF ++ D  +  Y+ +
Sbjct: 816  SVVASDIASNTLNEALVAKPEEI----IVEVLLTGMGMNQGRPMLFVVV-DDMVSVYEMF 870

Query: 857  LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS----RTPLDAYTREETPHGA--- 909
            +++   N       V   R L  + V+    R+ RF     R P++A  R+   +     
Sbjct: 871  MYD---NGVVEHLAVRFKR-LPYTTVT----RSCRFQGNDGRAPVEA-ARDTVRYRTALH 921

Query: 910  PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGF 969
            P +RI    N     G F+  S PC  ++    LR+HP   +G I++FT  +NV C +GF
Sbjct: 922  PFERIGNILN-----GVFICSSYPCVFLMDSGILRMHPLNLEGPILSFTAFNNVLCPNGF 976

Query: 970  IYVTS-QGILKICQLPSGSTYDNYWPVQKV 998
            IY+T  +  ++I +LP+    D+  PV+K+
Sbjct: 977  IYLTEREWAMRIAKLPTDVELDSSLPVRKI 1006


>gi|119602515|gb|EAW82109.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform
           CRA_b [Homo sapiens]
          Length = 377

 Score =  218 bits (555), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 133/369 (36%), Positives = 205/369 (55%), Gaps = 31/369 (8%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVS 348
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+    
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTG 306

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTN 407
             +     +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K  
Sbjct: 307 TTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAA 366

Query: 408 PSVLTSDIT 416
            SVLT+ ++
Sbjct: 367 ASVLTTSVS 375


>gi|402591342|gb|EJW85272.1| hypothetical protein WUBG_03818, partial [Wuchereria bancrofti]
          Length = 1025

 Score =  210 bits (535), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 245/975 (25%), Positives = 407/975 (41%), Gaps = 149/975 (15%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE +   RL   V+S AI        +   DS +L F+DAK+S++  + +   L+  S+H
Sbjct: 62  LECLLAVRLLAPVQSFAI---ARISQNPDCDSFLLGFDDAKLSIVAVNPADRCLKTISLH 118

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G       P+++VDP  RC  +LV+G  + +L  +   + L       
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSAQL------- 168

Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+ + L  +D  + +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
            T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337 S-CALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
             C ++LN+    +D   + P   F   ++ LD A  T +  +  LL  + G L  L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332

Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            D    V+ L+L     +V+   +T+      F+GSRL DS+ +   C    S L     
Sbjct: 333 TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388

Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS---ASNNTESAQKTFSFAVRDS 508
                      +TK+++ S+     +    E+  LYG         +  ++  +  V D 
Sbjct: 389 -----------ATKKIKLSTEPNANE--EDEDFELYGEMLPKVAKPDITEELLNIRVLDK 435

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
           L+N+GP K  + G    +        K   ++LV   G    G   +  +S R     SS
Sbjct: 436 LLNVGPCKKITGGCPSISAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495

Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
            +            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L
Sbjct: 496 SIEGVVQYWAIGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554

Query: 618 FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSES-----GSGSENSTVLSVS 665
                 +QV          G +I    Y+   L+F   N+          ++N  +L   
Sbjct: 555 ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTFPVRNASIVDPYIAICTQNGRLLMYE 611

Query: 666 IAD-PYVLLGMSDGSIRL------------------LVGDPSTCTVSVQTPAAIESSKKP 706
           + + P+V L   D S RL                  ++   S   +S Q  A   +   P
Sbjct: 612 LTNHPHVHLKEIDISKRLRHETSPITSLSVYRDMSGIIRFCSAANMSQQQQATGANMHIP 671

Query: 707 -------VSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGPLDQGDI----Y 753
                  V    LY D       RK +       G+   E     D   +D   I    +
Sbjct: 672 EQEDFEDVDDLLLYGDSKKS---RKETLSKRRIVGMKLTEQNTHFDTDVIDPNTIVPSHW 728

Query: 754 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG- 812
             +  E+G + I+ +P  + V+ V K     +H+ D    +   D E     ++ EGT  
Sbjct: 729 IAIARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDE----PATGEGTDA 780

Query: 813 -QGRKENIHSMK----VVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
             G   +  ++K    ++EL +     +  RP LF +L D T+  Y+ + +    N    
Sbjct: 781 MSGTMTDTFAVKPEEVIMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQ 835

Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQ 924
                  + L  + V+    R+ RF  T  D     E+   A   +  +  F+ I     
Sbjct: 836 GHLAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLN 889

Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQL 983
           G F+  S PC   +     R+HP   DG I++FT  +N  C +GFIY+T +  ++++ +L
Sbjct: 890 GVFICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERDRLMRVAKL 949

Query: 984 PSGSTYDNYWPVQKV 998
           PS    D  +PV+++
Sbjct: 950 PSDMILDASYPVKRI 964


>gi|147827332|emb|CAN62175.1| hypothetical protein VITISV_001516 [Vitis vinifera]
          Length = 1989

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 121/228 (53%), Positives = 148/228 (64%), Gaps = 49/228 (21%)

Query: 383  TGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
            +G+L+LLT+V DGRVV +L LSK+  SV TS I  IG+SL F GS+LGDSLLVQF     
Sbjct: 1657 SGELLLLTLVCDGRVVYKLGLSKSRASVFTSGIAAIGSSLSFPGSQLGDSLLVQF----- 1711

Query: 443  TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
            T++ SS ++++ GD E D PSTKR RRSSSDALQDM NG++L LY               
Sbjct: 1712 TAIPSSSVEKKVGDSEGDVPSTKRSRRSSSDALQDMDNGDKLPLY--------------- 1756

Query: 503  FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
              V DSL+N+GPLKDF+YGLRIN D  ATGI KQSNYEL                     
Sbjct: 1757 --VSDSLINVGPLKDFAYGLRINTDLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRP 1814

Query: 542  -----VELPGCKGIWTVYHKSSRGHNADSSRMA-AYDDEYHAYLIISL 583
                 VELPGCKGIWTVYHK++RGHNADS +M+  +D E+ A++  SL
Sbjct: 1815 ERITEVELPGCKGIWTVYHKNTRGHNADSIKMSHVFDLEFRAFIFFSL 1862


>gi|38014465|gb|AAH60475.1| LOC398931 protein, partial [Xenopus laevis]
          Length = 363

 Score =  200 bits (508), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 109/303 (35%), Positives = 178/303 (58%), Gaps = 23/303 (7%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LEL+  +   GN+ S+A +   GA    +RD+++L+F++AK+SV+E+D   H L+  S+H
Sbjct: 66  LELMASFSFFGNIMSMASVQLAGA----KRDALLLSFKEAKLSVVEYDPGTHDLKTLSLH 121

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVG 215
            FE PE   L+ G       P V+VDP GRC  +L+YG Q+++L       ++   GLVG
Sbjct: 122 YFEEPE---LRDGFVQNVHIPKVRVDPSGRCAVMLIYGTQLVVLPFRRDTLAEEHEGLVG 178

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
           +        G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW GR
Sbjct: 179 E--------GQKSSFLPSYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGR 230

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           V+ +  TC I A+S++   K HP+IWS  +LP+D  + LAVP P+GGV++   N++ Y +
Sbjct: 231 VAVRQDTCSIVAISLNIMQKVHPIIWSLNSLPYDCTQALAVPKPVGGVVIFAVNSLLYLN 290

Query: 334 QSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
           QS     ++LN+      S    P+    + LD + AT++  D  ++S K G++ ++T++
Sbjct: 291 QSVPPYGVSLNSLTNGTTSFPLKPQEEVRITLDCSQATFISYDKMVISLKGGEIYVVTLI 350

Query: 393 YDG 395
            DG
Sbjct: 351 TDG 353


>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 1673

 Score =  198 bits (503), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 181/744 (24%), Positives = 330/744 (44%), Gaps = 152/744 (20%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           AA LEL   +R+HGN+ SL ++     + S + D+++L+F++AK+S++E+      L   
Sbjct: 87  AACLELAAQFRVHGNITSLGVVPM---NYSGKADALLLSFKEAKMSLVEYSQFTQKLVTV 143

Query: 158 SMHCFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
           SMH FE  E+  L     S  R P  +KVDPQG C  + +YG ++ IL   Q G+ L+ D
Sbjct: 144 SMHYFEREEFKKLG----SIDRPPPEIKVDPQGYCAAMRIYGDRLAILPFKQDGADLLND 199

Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
            +   S   F   I    V+   DLD  ++++ DF F+ GY  P + I+++ E TW  R+
Sbjct: 200 LNDANSKYPFRPSI----VLPFLDLDKSIRNIIDFTFLFGYAVPTIAIMYQTEQTWTARL 255

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--- 331
             +  T  I+ +S+ T  + +P+++    LP++   L++VP+PIGG++V+  N I +   
Sbjct: 256 GIRKDTVSIAVISLDTAEESYPVLYKIEKLPYNCTMLVSVPTPIGGLIVLSHNAIIFTDQ 315

Query: 332 -HSQSASCA----------LALNNYAVSLDSSQELP---------------RSSFSVELD 365
            H+   +C           + L  Y + LD  Q  P                   ++ LD
Sbjct: 316 IHAPGIACIVNAYFDSETNIMLTPYELQLDMVQPRPPRPPSVFFAQNKYTDYKELAISLD 375

Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRV----------VQRLDLSK---------- 405
            +   ++  D+ LL  + G+++ + ++ +  V          V+   L++          
Sbjct: 376 GSRGMFISPDIFLLVLRDGEMIQVDLIGEEGVGRSWKRRKGGVKSFQLTRLGIRMTAPVH 435

Query: 406 -------TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE 458
                  +NP  L+   +++     FLGSR G  L   +   S  +  +  L  +F ++E
Sbjct: 436 LFPLADASNPLSLSGRNSSVPLGGSFLGSR-GSKLRYNYLFASSRTTDACLL--QFVEVE 492

Query: 459 ADAPSTKRLRRSSSDALQDMVNGE----ELSLYGSASNNTES------------------ 496
             A S+  +  +++  + +  NGE    +  LYG ++   ++                  
Sbjct: 493 EFAKSSVSMNGAAN--MNNTDNGEDDELDKDLYGDSTTAKQTDTDMSALLSSDEHGHGEI 550

Query: 497 -AQKTFSFAVRDSLVNIGPLKDFSYGLRINAD---------------ASATG-------- 532
            +++T  F + DS+  + PL+DF+ GL                     +ATG        
Sbjct: 551 VSEQTLRFRLCDSVTVVSPLRDFAVGLPAETSEHRFSPKIGGCDLEIVAATGHGPHGHLA 610

Query: 533 -ISKQSNYELV---ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
            +++    ++V   ELP  + +WT+        + D   ++   D +H Y+I+S  + T 
Sbjct: 611 ILNRSVRPQIVTTFELPQIEEMWTI---RCAKFDKDYRLVSEPTDAFHKYVILSHSSGTS 667

Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF--- 645
           +L+  +  TE+ ++  ++  G T+  G L     ++QV   G  + D  +   D +    
Sbjct: 668 ILKAGEAFTEMDDTT-FYQAGPTVGVGALLDETIIVQVHPNGVILFD--FSKYDFTIIDR 724

Query: 646 --------------GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
                         G    E   G ++  V+S S  DPY +L M+ G I LL  D +T  
Sbjct: 725 LNTNRMHALYIFVEGTKLQEMRVGDDDIWVISCSFMDPYAMLLMNTGHIVLLSLDETTHQ 784

Query: 692 VSVQTPAAIESSKKPVSSCTLYHD 715
           ++  +    E  K+ VS+ +LY D
Sbjct: 785 ITQIS----EYKKRLVSTFSLYCD 804



 Score = 84.0 bits (206), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 81/313 (25%), Positives = 124/313 (39%), Gaps = 85/313 (27%)

Query: 753  YSVVCYESGALEIFDVPNFN--CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
            +  V  ++G L ++ +P+F   C F +  F +     +D  +  +     T  N++ +E 
Sbjct: 930  WCFVYTDTGHLLVYTLPDFKECCAFPL--FSTLPVLAMDVPLWRSRSIDSTFANTTGDE- 986

Query: 811  TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                       + VV L     S     P+L  +  +G +  Y+  +F  P  TS +DD 
Sbjct: 987  --------FEEILVVNLGN---SKDRQTPYLVCLAANGDLAVYK--IFVCP--TSSNDDD 1031

Query: 871  VS--------TSRSLSVSNVSASRLRN---LRFSRTPLDAYTRE---------------E 904
             S         SR+ +   + A  L+    +R  R P D  TR+               +
Sbjct: 1032 TSFVNSGTFKQSRTPAELELDAQNLKKRLAIRLVRIPHDQITRDLQFYTDNEGDKIDLVQ 1091

Query: 905  TPHGAPC----QRITIFKNI--SG---HQGFFLSGSRPCWCMVFRER------------- 942
             P   P     Q +  F  I  SG   + G  ++GSRPCW MV  +              
Sbjct: 1092 EPQHQPTFLKRQHLKPFDAIGWSGGNMYSGVVVTGSRPCWIMVALQSRQQDLDVISFDNS 1151

Query: 943  -----------------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
                             LR HP   DG +  F  LHNVN  HGF+Y+  +G+ +ICQLP 
Sbjct: 1152 VACSTKLPPVPLLGTNMLRFHPMPVDGPMKCFAPLHNVNVAHGFLYINWKGLFRICQLPP 1211

Query: 986  GSTYDNYWPVQKV 998
               +D+ WPV KV
Sbjct: 1212 QFNFDHDWPVCKV 1224


>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1628

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 156/585 (26%), Positives = 266/585 (45%), Gaps = 128/585 (21%)

Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
           +++++++VKDF F+HGY EP ++ LHE   TW  R++ K  TC ++A+S++   K    I
Sbjct: 281 KNIEIENVKDFCFLHGYYEPTILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFI 340

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W+  N P++   L++VP P+GG LV+ AN + Y +Q++   LA+N YA S+D+S  +   
Sbjct: 341 WNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYA-SIDTSTIIGSQ 399

Query: 359 SFS----------VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
            F             LD ++  +L++D  + S K G+L++  ++ DGR VQR+ +SK   
Sbjct: 400 PFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQRIHVSKAGG 459

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT------SMLSSGLKEE-------FG 455
           SVLTS I  + N+L FLGSRLGDSLL+Q+T  S T         S+  K++         
Sbjct: 460 SVLTSCICVLSNNLIFLGSRLGDSLLLQYTEKSITDDQLEHENFSNPYKKQKTSEVFDLF 519

Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELS-------------LYGSASNNTESAQKTFS 502
           D  ++  +      +++   Q+  +   ++             L+    N  +S Q    
Sbjct: 520 DENSETNNNNNSNNNNNKENQEKSSSSSIASKLLEEIEDEEDQLFKEKKNQLKSYQ---- 575

Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY-----ELV--------------- 542
             + D ++NIGP+ D   G  I+     T    Q  Y     ELV               
Sbjct: 576 LGICDQIINIGPIGDIVVGQSIDPTYDETIQPNQPEYVPKTLELVTCSGYGKNGSISVLQ 635

Query: 543 -----------ELPGCKGIWTVY------------------HKSSRGHNADSSRMAAY-- 571
                      ELPG   +WTVY                   K SR  N ++ +      
Sbjct: 636 NNIKPELVMAFELPGILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQE 695

Query: 572 ----------------DDEYHAYLIISL-EARTMVLETADLLTEVTESVDYFVQGRTIAA 614
                           D  +H YL +SL +  T++ ET   L EV +        +++  
Sbjct: 696 DNEDNEEEEEEEKMQKDKNWHDYLYLSLKDGTTLIFETGRDLKEVGK-----FNFKSLDI 750

Query: 615 GNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           GNLFGR+R++ +++ G ++++G   + Q++              N  + S  I DP++LL
Sbjct: 751 GNLFGRKRIVVIYQGGIKLINGFDRVIQEIQI------------NEPIKSSYICDPFILL 798

Query: 674 GMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
              +G+I++  G D     +     +   +  + + S +L+ D+ 
Sbjct: 799 QFHNGTIQIFKGIDEENQLIQFSINSISNNLNQSIFSSSLFFDRN 843



 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/168 (34%), Positives = 89/168 (52%), Gaps = 18/168 (10%)

Query: 57  NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGISAA-------SLELVC 105
           NLV+   NV++IY +R ++    E   +S+   + ++      I+         SLEL+ 
Sbjct: 32  NLVLAKTNVLQIYKIRYEKIEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELII 91

Query: 106 HYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
             +L GN+ES+A +    ++    RDS+IL F DAKISVL++D  +    I S+H FE  
Sbjct: 92  EKKLFGNIESMASVRYPNSE----RDSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKD 147

Query: 166 EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           E+   K GR  F   PL+KVD Q RC  +L+Y   + +L   +  S L
Sbjct: 148 EF---KGGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSIL 192



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/104 (25%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 912  QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ----------------LCDGSIV 955
            +RI  F +ISG +G F+ G +P W    +  LR+H                      ++ 
Sbjct: 1122 KRIFEFSSISGKRGLFIGGKKPIWAFCEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVE 1181

Query: 956  AFTVLHNVNCNHGFIYVTSQG-ILKICQLPSGSTYDNYWPVQKV 998
             FT  +N++C  GFIY + +  ++KIC L +   ++N   ++++
Sbjct: 1182 TFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIRRI 1225


>gi|301093545|ref|XP_002997618.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110008|gb|EEY68060.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 1744

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 229/978 (23%), Positives = 387/978 (39%), Gaps = 249/978 (25%)

Query: 235  VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
            ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 278  LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 337

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
             + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A    +
Sbjct: 338  TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 397

Query: 352  SQELPRS------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
                P S              +V L      +LQ    LL+  +G + +L++ Y+     
Sbjct: 398  QSVFPLSEAVYETPEHETVQLNVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSR 457

Query: 395  ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
                      GR    L L     S+  S +         F+GSR GDS+L         
Sbjct: 458  GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 516

Query: 444  SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
            +      K+E   I      +  AP  K     S  A ++  + ++L LYG+A    E A
Sbjct: 517  TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 571

Query: 498  QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
              +                           + + +R  D L +IG +     G+  NAD 
Sbjct: 572  ATSSTECTNGVGVSSVKTEENGAPEQDTGSYDYELRQIDVLPSIGQITSIELGVENNAD- 630

Query: 529  SATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHN 562
                 S +   ELV                          EL GC+ +WTV         
Sbjct: 631  -----SNEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATR 685

Query: 563  ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
            +   R       Y+AYLI+S+  RTMVL T + +  + +   ++  G T+AA NLF ++R
Sbjct: 686  SSDGR------SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGSTLAAANLFNKQR 739

Query: 623  VIQVFERGARIL------------------DGSYM----------------TQDLSFGPS 648
            ++Q+F++GAR++                  +G+                  TQ+++    
Sbjct: 740  IVQIFKQGARVMMEVPEEETSNGQEKSAKTEGAEDEEEDDEDDGPRVKLVCTQEITLEGD 799

Query: 649  NSESGSGSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP--------- 697
                G   + S+V  +SV + DPY+LL ++DGS+RLL+GD     +SV  P         
Sbjct: 800  VECGGMNVDTSSVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDLELSVIDPEIDYAEGIS 859

Query: 698  ---AAIESSKKPVSSCTLYHD--------------------------------------- 715
                + + SK   SS  L++D                                       
Sbjct: 860  EANGSADMSKHGSSSACLFYDWAGMFVENAWVEEEQEERHEATQSRAKRAEDDDDMDALY 919

Query: 716  -KGPEPWLRKT-STDAWLSTGVGEAIDGADGGPLDQ---GDIYSVVCYESGALEIFDVPN 770
               P P +  T +T +  ST      DG+   PL Q     +   +C+  G+L +F +P+
Sbjct: 920  SSKPSPKVATTNATKSTPSTATPRNEDGSVSIPLLQQKDAKMMCGMCFGDGSLHVFSLPD 979

Query: 771  F--------------NCVFTVDKFVSGRTHIVDTYMREALKDSETEIN-SSSEEGTGQGR 815
            F              + V T++ +  GR   V       L      +N S+S    G+ +
Sbjct: 980  FKKRGVFPYLTFAPQSLVNTLEHYQVGRNKTVK------LSAPVLGLNASTSSANDGRIK 1033

Query: 816  KENIHSMKVVELAMQRW--------SAHHSRPFLFAILTDGTILCYQAY-LFEGPENTSK 866
            K +  +  V ++ + R         + + SR  +   L +G +L Y A   FE  +  + 
Sbjct: 1034 KSHTINSPVADIVIHRVGPSEGQHNAQYLSRMVMLVFLANGDLLMYSAAPKFESLKPRAN 1093

Query: 867  SD-DPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR---------ITI 916
             +  PV     +    ++   L     +    +A    E    A   +         +T 
Sbjct: 1094 GEIAPVFHFVRVGTELITRPFLPPKARTNAHNEAGNNPEVNTSAVLAKLRAGFRYPMLTC 1153

Query: 917  FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNHGFI 970
            F N++   G F  G+ P W +  R      P +C+ +      +++FT  H+ NC +GFI
Sbjct: 1154 FHNVNNMSGAFFRGAHPMWILGDRGHASFVP-MCNAAPRVSVPVLSFTSFHHWNCPNGFI 1212

Query: 971  YVTSQGILKICQLPSGST 988
            Y  S+G L++C+LPS  T
Sbjct: 1213 YFHSRGALRVCELPSSKT 1230


>gi|301103686|ref|XP_002900929.1| cleavage and polyadenylation specificity factor subunit, putative
            [Phytophthora infestans T30-4]
 gi|262101684|gb|EEY59736.1| cleavage and polyadenylation specificity factor subunit, putative
            [Phytophthora infestans T30-4]
          Length = 1561

 Score =  196 bits (498), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 227/964 (23%), Positives = 384/964 (39%), Gaps = 234/964 (24%)

Query: 235  VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
            ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 108  LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYYLTVISINMK 167

Query: 292  LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
             + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A     
Sbjct: 168  TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTVN 227

Query: 350  -------DSSQELPR---SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----- 394
                   D+  E P    +  +V L      +LQ+   LL+   G + +L++ Y+     
Sbjct: 228  QSVFPLSDAVYETPEHETAQLNVVLYDCQFEYLQDKELLLTMPCGQVYVLSLPYEDTSSR 287

Query: 395  ----------GRVVQRLDLSKTNPSVLTSDITTIG-NSLFFLGSRLGDSLLVQFTCGSGT 443
                      GR    L L     S+  S +         F+GSR GDS+L         
Sbjct: 288  GLYGFGGVSSGRNAS-LSLRMLRSSIQASCVCIDDEKQTLFIGSRSGDSVLFALDKKKLV 346

Query: 444  SMLSSGLKEEFGDI------EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
            +      K+E   I      +  AP  K     S  A ++  + ++L LYG+A    E A
Sbjct: 347  TATEEEQKDEEMPIKEVVIKQESAPEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPA 401

Query: 498  QKT---------------------------FSFAVR--DSLVNIGPLKDFSYGLRINADA 528
              +                           + + +R  D L +IG +     G+  NAD 
Sbjct: 402  ATSSTECTNGVGVSSVKTEENGAPEQDTGPYDYELRQIDVLPSIGQITSIELGVENNAD- 460

Query: 529  SATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHN 562
                 S +   ELV                          EL GC+ +WTV         
Sbjct: 461  -----SNEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATR 515

Query: 563  ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
            +   R       Y+AYLI+S+  RTMVL T + +  + +   ++  G T+AA NLF ++R
Sbjct: 516  SSDGR------SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQR 569

Query: 623  VIQVFERGARIL------------------DGSYM----------------TQDLSFGPS 648
            ++Q+F++GAR++                  +G+                  TQ+++    
Sbjct: 570  IVQIFKQGARVMMEVPEEETSNGQEKSGKAEGAEDEEEDDEDDGPRVKLVCTQEITLEGD 629

Query: 649  NSESGSGSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP--------- 697
                G   + S+V  +SV + DPY+LL ++D S+RLL+GD     +SV  P         
Sbjct: 630  VECGGMNVDTSSVGIVSVDVVDPYILLLLTDVSVRLLMGDEEDLELSVIDPEIDYAEGIS 689

Query: 698  ---AAIESSKKPVSSCTLYHD-------------KGPEPWLRK-TSTDAWLSTGVGEAID 740
                + + SK   SS  L++D               P P +    +T +  ST      D
Sbjct: 690  EANGSADMSKHGSSSACLFYDWAEDDDDMDALYSSKPSPKVATMNATKSMPSTATPRNED 749

Query: 741  GADGGPLDQ---GDIYSVVCYESGALEIFDVPNF--------------NCVFTVDKFVSG 783
            G+   PL Q     +   +C+  G+L +F +P+F              + V T++ +  G
Sbjct: 750  GSVSIPLLQQKDAKMMCSMCFGDGSLHVFSLPDFKKRGVFPYLTFAPQSLVNTLEHYQVG 809

Query: 784  RTHIVDTYMREALKDSETEIN-SSSEEGTGQGRKENIHSMKVVELAMQRW--------SA 834
            R   V       L      +N S+S    G+ +K +  +  V ++ + R         + 
Sbjct: 810  RNKTVK------LSAPALGLNASTSSANDGRIKKSHTINSPVADIVIHRVGPSEGQHNAQ 863

Query: 835  HHSRPFLFAILTDGTILCYQAY-LFEGPENTSKSD-DPVSTSRSLSVSNVSASRLRNLRF 892
            + SR  +   L +G +L Y A   FE  +  +  +  PV     +    ++   L     
Sbjct: 864  YLSRMVMLVFLANGDLLMYSAAPKFESLKPRANGEIAPVFHFVRVGTELITRPFLPPKAR 923

Query: 893  SRTPLDAYTREETPHGAPCQR---------ITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
            +    +A    E    A   +         +T F N++   G F  G+ P W +  R   
Sbjct: 924  TNAHNEAGNNPEVNTSAVLAKLRAGFRYPMLTCFYNVNNMSGAFFRGAHPMWILGDRGHA 983

Query: 944  RVHPQLCDGS-------------------IVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
               P     S                   +++FT  H+ +C +GFIY  S+G L++C+LP
Sbjct: 984  SFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSCPNGFIYFHSRGALRVCELP 1043

Query: 985  SGST 988
            S  T
Sbjct: 1044 SSKT 1047


>gi|194374339|dbj|BAG57065.1| unnamed protein product [Homo sapiens]
          Length = 330

 Score =  191 bits (486), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 114/302 (37%), Positives = 170/302 (56%), Gaps = 35/302 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
            T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS         Y V+L
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVP------PYGVAL 300

Query: 350 DS 351
           +S
Sbjct: 301 NS 302


>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
          Length = 1440

 Score =  191 bits (484), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 175/623 (28%), Positives = 297/623 (47%), Gaps = 93/623 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++AF+DAK+S++  ++    ++  S+H FE+    +L+ G   +   P+V+ DP+ 
Sbjct: 91  QDSILMAFDDAKLSIITINEKERNMQTISLHAFENE---YLRDGFVKYFHPPIVRTDPEN 147

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +                  S RI S ++I L+ +D  + +V 
Sbjct: 148 RCAASLVYGKHIAILPFHEN-----------------SKRIHS-YIIPLKQIDPRLDNVA 189

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 190 DIVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 249

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
              LL +P P+GG +V G+NTI Y +Q+   C + LN+     D   + P     S  + 
Sbjct: 250 CATLLPIPKPLGGAIVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKSMKMT 306

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + T+ G+L LL +V    G  V+ L+ SK   + +   +T     
Sbjct: 307 LDCSTSVYMEDGRIAVGTRDGELFLLRLVTSSGGATVKSLEFSKVWDTSIAYTLTVCAPG 366

Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
             FLGSRLGDS L++++    T  S+    +++E   +EA+                  +
Sbjct: 367 HLFLGSRLGDSQLLEYSLIKTTRESVKRHKMEQEQNHVEAE------------------L 408

Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + ++L LYG A     +++ E   ++  F+  D L NIGP+K    G R N  ++    +
Sbjct: 409 DEDDLELYGGAIEEQQNDDEEQITESLQFSELDRLRNIGPVKSMCVG-RPNYMSNDLVDA 467

Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
           K+ +  ++++     G  G   V+ +S R     SS +            ++E H YLI+
Sbjct: 468 KRRDPVFDVITASGHGKNGSLCVHQRSLRPEIVTSSLLEGAEQLWAVGRKENESHKYLIV 527

Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQVFERG-ARILDGSYM 639
           S    T+VLE  + L E+ E +  FV G+ T+AAG L      +QV     A + DG  +
Sbjct: 528 SRIRSTLVLELGEELIELEEPL--FVTGQPTVAAGELSQGAFAVQVTSTSIALVTDGQQL 585

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPSTCTVSV--- 694
            +                N  V+  SI DPYV +   +G + L  LV +P      +   
Sbjct: 586 AE-----------VKIDSNFPVVQASIVDPYVAVLTQNGRLLLYTLVSNPYMQLQEIDLA 634

Query: 695 QTPAA--IESSKKPVSSCTLYHD 715
           QTP +  I  S   ++S ++Y D
Sbjct: 635 QTPFSTFIAQSASQITSISMYAD 657



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 67/291 (23%), Positives = 118/291 (40%), Gaps = 32/291 (10%)

Query: 723  RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS 782
            ++   DA  S   GE  D  D         + V+ +E+G L +  +P    V+ + +F +
Sbjct: 736  KRLGHDAIQSGRGGEQSDAIDPSSYTSISHWLVLAHENGRLSVHSLPEMELVYQIGRFPN 795

Query: 783  GRTHIVD-----TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
                +VD           +K       +S EE      K+N    +++E  +     + S
Sbjct: 796  VPELLVDLTPEEEEKERRIKAQLAAKEASDEEQLNAEMKKNCE--RIMEAQIVGMGINQS 853

Query: 838  RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
             P L AI+ D  ++ Y+ +            +P S    L ++         LR S    
Sbjct: 854  HPILMAIV-DEQVIMYEMFA-----------NPNSQPGHLGIAFRKLPHFICLRSSPYLK 901

Query: 898  DAYTR------EETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQ 948
                R      EE     P   I  F+ +S  + G  + G+ P   +V+     ++ HP 
Sbjct: 902  SDGKRAAFQIVEEDGKRYPL--IHSFERVSTVNNGVIIGGAVPT-LLVYGAWGGMQTHPM 958

Query: 949  LCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQKV 998
              DGSI AFT  +  N  +GF+Y+T  +  L+I ++ +   Y+  +PV+K+
Sbjct: 959  TIDGSIKAFTPFNIDNVPYGFVYMTQKKSELRIAKMHADFDYEMPYPVKKI 1009


>gi|393245434|gb|EJD52944.1| hypothetical protein AURDEDRAFT_81080 [Auricularia delicata
           TFB-10046 SS5]
          Length = 1422

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 221/981 (22%), Positives = 410/981 (41%), Gaps = 164/981 (16%)

Query: 52  IGPVPNLVVTAANVIEIYVVRVQ------------EEGSKESKNSGETKRRVLMD----- 94
           +G   NLVV   N++ ++ VR++            +E  +  +     +  V MD     
Sbjct: 40  LGVATNLVVARQNLLRVFEVRIEAAPLPSQEKLLADEQGRGRRGMEAVEGEVEMDVGGEG 99

Query: 95  ----GI-------------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAF 137
               GI             +   L LV  +RLHG V  L  + Q  A    + D ++++F
Sbjct: 100 FVSAGIVKSAGQHARQRQRTVTRLYLVRQHRLHGIVTGLGRV-QTMASLEDKLDRLLVSF 158

Query: 138 EDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLV 196
           +DAKI++LE+ +  H L   S+H +E +P+ L     R        +++DP  RC  + +
Sbjct: 159 KDAKIALLEWSEVSHDLSTISIHTYERAPQMLAFDSARALTE----LRIDPNSRCAALTL 214

Query: 197 YGLQMIILKASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFV 252
            G  + IL   +  + L  D D     GG S  +    S +++L ++D  ++++ D  F+
Sbjct: 215 PGDAVAILPFYESQAELDMDVDQ----GGVSRDVPYSPSFILSLPEVDNDIRNIIDIAFL 270

Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
            G+  P + +L E + TW GR++    T  +  L++    + +P+I S   LP+D+ +L+
Sbjct: 271 PGFNNPTLAVLFETQRTWTGRLAEFKDTVRLRILTLDVVTRTYPIIGSVDGLPYDSMRLV 330

Query: 313 AVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
           A P+ +GGV+V+ AN + +  QS  + A+A N +A  + S    P       L+ + A +
Sbjct: 331 ACPAALGGVIVLTANAVLHIDQSGKNVAVAANGWAARV-SEFPTPAPERDETLEGSRAVF 389

Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLG 430
           + +   LL  + G +V + ++ +GR+V ++D+  +   + + + +  + + L  +GS  G
Sbjct: 390 VSDKTFLLVYRDGSIVPVELILEGRMVTKIDMGQRLAQTTIPTVVCAVQDDLVLVGSTAG 449

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-----DMVNGEELS 485
            S+L++ T              E  DI  DA S +    ++ ++       D ++ E+  
Sbjct: 450 PSILLKVT-------------HEEEDITPDAGSARENGAANGNSTNGATYDDPMDSEDED 496

Query: 486 LYGSASNNTESAQKTFS--------------FAVRDSLVNIGPLKDFSYGLRINAD---- 527
           LYG       S   T +               A+ DSL   GP+ D ++ L  N +    
Sbjct: 497 LYGGTDMMVTSTSGTLTVGGTAALEKRRILRLALADSLCGHGPISDMAFILGRNGERHVP 556

Query: 528 -------ASATG--------ISKQSNYELVELPGCKGIWTVYHKSS---RGHNADSSRMA 569
                     TG        +  +   +L  + G +G+W+   + +    G N +    A
Sbjct: 557 ELLAGVGVGHTGGLARFQRDLPARVKRKLHRISGNRGVWSFPVRRAVKVAGMNIERPTGA 616

Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ--GRTIAAGNLFGRRRVIQVF 627
           A  D     +I+S +A      +   + + +  VD   +    TI AG  F R  ++QV 
Sbjct: 617 ADWD----TVIVSTDATPSPGLSRVAVKDSSTDVDILTRLPAITIGAGPFFQRTAILQVV 672

Query: 628 ERGARIL--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
               R+L  DGS   + +DL            +  + + + SI+DP+V++   D ++ L 
Sbjct: 673 NNAIRVLEADGSERQVIKDLD---------GTTPRAKIRACSISDPFVVVVREDDTLGLF 723

Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK------GPEPWLR-KTSTDAWLSTGVG 736
           VG+     +  +  + +        + + Y D       G    L+ K   +A  ST + 
Sbjct: 724 VGETGKGKLRRKDMSMLGDKASRYLAASFYQDHSGLFQVGTARSLKGKEKANAPASTTIE 783

Query: 737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREAL 796
            A+D        +G  + V+C   G +EI+ +P    VF+          + DT+     
Sbjct: 784 AAMDEG------RGSQWLVLCRPQGVVEIWALPKLTLVFSCGGVSDIPPVMADTF----- 832

Query: 797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
              +    S  ++   Q    ++      E+ +        RP L  +L  GT+  Y   
Sbjct: 833 ---DLATPSPVQDPPRQAEDHDVE-----EILISPIGETTPRPHLLVLLRSGTVAVYDTA 884

Query: 857 LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
             E P        PV+T R   +  ++  R+ +      P++   +   P  AP   I  
Sbjct: 885 PVELP--------PVATGREAGL-QLAFVRIMSRAVDTAPIERAEKRGAP--APRHLIPF 933

Query: 917 FKNISGHQGFFLSGSRPCWCM 937
             ++S   G FL+G +P W +
Sbjct: 934 STSVS---GVFLTGGKPGWIL 951


>gi|134025022|gb|AAI35011.1| LOC564406 protein [Danio rerio]
          Length = 348

 Score =  189 bits (480), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 104/283 (36%), Positives = 165/283 (58%), Gaps = 13/283 (4%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 66  LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L   +     + DE   
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDT---LADEQEG 175

Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
             G G  +    S++I++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ + 
Sbjct: 176 IVGEGQKSSFLPSYIIDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-S 337
            TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS   
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPP 295

Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
             ++LN+      +    P+    + LD + A+++ +D  ++S
Sbjct: 296 FGVSLNSLTNGTTAFPLRPQEEVKITLDCSQASFITSDKMVIS 338


>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
          Length = 1451

 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 182/695 (26%), Positives = 313/695 (45%), Gaps = 77/695 (11%)

Query: 57  NLVVTAANVIEIYVVR--VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
           NL V A N++ +Y +R  V E G+   +                   EL   + L G V 
Sbjct: 45  NLAVAAGNMLSVYRIRSSVDEAGNHFDR------------------FELCDEFELWGIVV 86

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            +  L   G+     RDS++L+ E++K  ++E++     L   SMH F+  +   L+RG 
Sbjct: 87  CMTRLRLAGS----VRDSLLLSIEESKCVIVEYEPDTGSLSTISMHFFQDED---LRRGF 139

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS-GLVGD--EDTFGSGGGFSARIE 231
              +   L +VD   RC  VLVYG  + +L   +     L G   +  F    GF A   
Sbjct: 140 RKLSSMALARVDGFNRCAAVLVYGSYLAVLPFRRSTERDLSGQRHQAVFYENSGFIA--- 196

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
             ++I+L+ L +K   V DF F+ GY +P +++L+E   TW GRV+ +  TC + ALSI+
Sbjct: 197 --NMIDLQSLPVKIASVLDFQFLEGYNDPTILLLYEALPTWTGRVTERQDTCGMVALSIN 254

Query: 290 TTLKQHPLIWSAMNLPH-DAYK--LLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNY 345
              + HP+IW    LP  + Y   L  +P P+GG L+   N++ Y  QS     +ALN+ 
Sbjct: 255 LIDETHPVIWQMAGLPFPNPYSSALFPIPKPLGGSLLFATNSLIYLDQSVPPYGVALNSL 314

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
            +   +     +    + L    A  L +D   +S ++GD+ ++T+  D    V+R  L 
Sbjct: 315 PLGCTNFALKTQDVAPLNLQNCKACMLSDDSICVSLESGDVYIITLKKDSLNNVRRFYLD 374

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
           +   SV+ + ++ + ++L FLGSRLG+SLL+++ C   +   S+ L+    D        
Sbjct: 375 QVASSVIPTTLSKLSDNLIFLGSRLGNSLLLRYKCKENSKKSSTSLENGEKDGVEIENKE 434

Query: 465 KRLRRSSSDALQDMVNG------EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDF 518
           +     + +  +   NG      +++  YG    N +    ++ F   D+L NIGP    
Sbjct: 435 EEKNELNFEIEKSSENGSPENKRKKMRYYGDEIFNLD-VNTSYDFETMDNLSNIGPCGPV 493

Query: 519 SYGLRINADASATGI---SKQSNYELVELPGCK----GIWTVYHKSSRGHNA-------- 563
                 N + +   +   ++  N ++  L G      G  TV HKS R   A        
Sbjct: 494 ELIHTANHNDNYDHVGSDARDRNIDVCVLSGKDKTGFGSITVLHKSVRPSIASQFPFPMN 553

Query: 564 --DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV-TESVDYFVQGRTIAAGNLFGR 620
             D   +   ++E H+ L+++ + +TMV +T  +L E+  E        +TI    +   
Sbjct: 554 FSDMWTLRRSEEETHSLLVMTKKDQTMVFQTGAILEELKKEECGLATNAKTIFCATIGNG 613

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS- 679
           + ++QV  R   ++D    TQ+         SG       ++ V+  DPYV++  S G+ 
Sbjct: 614 KYIVQVLPRAVVLVDMD--TQETIQNKPFDLSGQ------IIQVA-CDPYVVILASKGTI 664

Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
           I L++ + S  T  ++T  A E   +      + H
Sbjct: 665 ISLVLFENSDGTAMLKTSTAPECKNQDDPEKKIMH 699



 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 59/242 (24%), Positives = 107/242 (44%), Gaps = 35/242 (14%)

Query: 760  SGALEIFDVPNFNCVFTV-DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
            +G+LEI+ +P+  C+    D+  +    I++T               S  EG+ +GR+ +
Sbjct: 814  NGSLEIYSLPD--CLLRFGDRNFANAPRILET---------------SRFEGS-EGRRVD 855

Query: 819  IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
            +  + V E+ +       S P++  ++ D  ++    Y F    N  +++ PV + R + 
Sbjct: 856  V--LDVQEMNVFNMGPS-SLPYIVVMIGDQLMI----YRFRATLNRFQTESPVLSGRFIK 908

Query: 879  VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
            + +    + + LR      D  ++ +  +    ++   F NIS H G FL G+ P W   
Sbjct: 909  LQD----KTKLLRRIPGVHDESSKTKNRNNKIMRQ---FMNISDHNGIFLGGAYPTWIFC 961

Query: 939  FRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT-SQGILKICQLPSGSTYDNYWPVQ 996
             +  RL +H    +G + AFT   N  C  GF+Y   S   L +  L     YD  WP +
Sbjct: 962  GQNGRLNIHSMWQEGFVNAFTPFDNEKCADGFLYFRHSTKTLTVANLQPFLKYDADWPFK 1021

Query: 997  KV 998
            K+
Sbjct: 1022 KI 1023


>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
 gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
           factor subunit 1; AltName: Full=Cleavage and
           polyadenylation specificity factor 160 kDa subunit;
           Short=CPSF 160 kDa subunit
          Length = 1454

 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 150/576 (26%), Positives = 266/576 (46%), Gaps = 81/576 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++ F+DAK+S++  ++    ++  S+H FE+    +L+ G  ++   P+V+ DP  
Sbjct: 92  QDSILMTFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFTTYFNPPIVRTDPAN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +    ++                  S++I L+ +D  + +V 
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLEGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
              LL++P P+GG +V G+NTI Y +Q+   C + LN+     D   + P        + 
Sbjct: 251 CNSLLSIPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNS---CYDGFTKFPLKDMKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + ++ GDL LL +V    G  V+ L+ SK   + +   +T     
Sbjct: 308 LDCSTSVYMEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNG 481
             F+GSRLGDS L+++T                  ++    S K+ R    +  +  ++ 
Sbjct: 368 HLFVGSRLGDSQLLEYTL-----------------LKVTKESAKKQRLEQQNPSEIELDE 410

Query: 482 EELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQ 536
           +++ LYG A     +++ E   ++  F   D L+N+GP+K   +G R N  ++    +K+
Sbjct: 411 DDIELYGGAIEMQQNDDDEQISESLQFRELDRLLNVGPVKSMCFG-RPNYMSNDLIDAKR 469

Query: 537 SN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLIISL 583
            +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+S 
Sbjct: 470 KDPVFDLVTASGHGKNGALCVHQRSMRPEIITSSLLEGAEQLWAVGRKENESHKYLIVS- 528

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMTQD 642
             R+ ++          E   +     T+AAG L      +QV     A + DG  M Q+
Sbjct: 529 RVRSTLILELGEELVELEEQLFVTNEPTVAAGELLQGALAVQVTSTCIALVTDGQQM-QE 587

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           +              N  V+  SI DPYV +   +G
Sbjct: 588 VHI----------DSNFPVVQASIVDPYVAVLTQNG 613



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 72/300 (24%), Positives = 132/300 (44%), Gaps = 38/300 (12%)

Query: 723  RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
            ++   DA +S+  GE  D      +D    YS      VV +++G + I  +P+   V+ 
Sbjct: 737  KRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLVVAHDNGRITIHSLPDLELVYQ 791

Query: 777  VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT--------GQGRKENIHSM------ 822
            + +F +    +VD  + E  K+ + +  ++ E+           +  ++ ++S       
Sbjct: 792  IGRFSNVPELLVDMTVEEEEKEKKAKQTAAQEKEKETEKKKDDAKNEEDQVNSEMKKLCE 851

Query: 823  KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
            KVVE  +     + + P L AI+ D  ++ Y+ +    P+        V+  +   +  +
Sbjct: 852  KVVEAQIVGMGINQAHPVLIAII-DEEVVLYEMFASYNPQPGHLG---VAFRKLPHLIGL 907

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE 941
              S   N+   R P +     E  HG     I  F+ IS  + G  + G+ P   +V+  
Sbjct: 908  RTSPYVNIDGKRAPFEM----EMEHGKRYTLIHPFERISSINNGVMIGGAVPT-LLVYGA 962

Query: 942  --RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKV 998
               ++ H    DGSI AFT  +N N  HGF+Y+T Q   L+I ++     YD  +PV+K+
Sbjct: 963  WGGMQTHQMTIDGSIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYDMPYPVKKI 1022


>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
           [Trichinella spiralis]
 gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
           [Trichinella spiralis]
          Length = 1376

 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 165/616 (26%), Positives = 286/616 (46%), Gaps = 74/616 (12%)

Query: 99  ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
           AS ELV   +++G + S+AI    G     + D I+LA +DAK+SV+ +D   H L   S
Sbjct: 65  ASFELVLSEQVYGRLASVAIARLTGF----QLDVILLAIDDAKLSVVGYDIETHSLVTLS 120

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---KASQGGSGLVG 215
           MH +E   +   K G   F   P++++DP+ RC  + +YG  +++L   + S   S  + 
Sbjct: 121 MHYYEDDLF---KLGFTRFEIPPMLRMDPERRCAAMTIYGAHLVVLPLVRESLYESMNIV 177

Query: 216 DEDTFGSGGGFSARIESSHV-INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
           D      G  FS R+ S  V  N  D  M +V D  F+HG+ EP +++L+E   T AGRV
Sbjct: 178 DPSQ-RPGWPFSLRLTSYTVAFNAIDAKMHNVTDMCFLHGFYEPTVLLLYEPTQTTAGRV 236

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
             +  T  I A+S++   K H +IW+  NLP DA+ LLA+P P+GGVL+   N+I Y +Q
Sbjct: 237 VVRQDTYQILAVSLNPKDKTHAVIWTLGNLPFDAFALLALPKPLGGVLLFSVNSIIYLNQ 296

Query: 335 SASCA-LALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
           S  C  + +N+      +     RS   V LD +HA  + +  A L  ++G + ++++++
Sbjct: 297 SVPCCGILINDNGRGFTNYPLRDRSELMVTLDGSHAALIDSANAALVLRSGLVFVVSLLF 356

Query: 394 DG-RVVQRLDLSKTN-----PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
           D   +V+ + L+ ++     PS +++    + ++  F+GS +G+S L  +       +++
Sbjct: 357 DRLNMVKEILLTASSVRGAAPSTVSA---CVSSNCLFVGSAIGNSALYAYEAIEQVDVVA 413

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN--NTES-AQKTFSFA 504
             L                 R +  + L DM       LYG       TE+  Q  F F 
Sbjct: 414 VTLPA---------------RDTGLNLLDDM------QLYGELIRPCTTETLVQTKFEFR 452

Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
             D L ++GP +  + G    A  +       S++ +   PG  G +TV  +S R     
Sbjct: 453 RLDQLASLGPCRAITVGESSVAMVNNFYEDYVSDWLVAGGPGTDGSFTVMQRSVRPRLLT 512

Query: 565 SSRMAAYDDEYHA-----------------YLIISLEARTMVLETADLLTEVTESVDYFV 607
            +R+    + +                   Y++++ + RT+V   +  +TE+ ++  + +
Sbjct: 513 QTRVEDVLNAWSVGAQLIGSVDRSASPRPQYMLLTTKQRTVVFTLSSGITEIFDT-GFEI 571

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           +  TIA G++     V+QV +    +L      Q ++                V   S+ 
Sbjct: 572 RFETIACGDMMNGAYVVQVTKENLVLLHRGQQVQCINL----------RVFEEVCQASVI 621

Query: 668 DPYVLLGMSDGSIRLL 683
           DPYV L +  G + L 
Sbjct: 622 DPYVALIVRHGHVLLF 637



 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 88/197 (44%), Gaps = 38/197 (19%)

Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSK----------------------------SDD 869
           RPFLFA++ +  +L Y+A+ +  P+   +                            +DD
Sbjct: 755 RPFLFAVVEE-QLLIYEAFHYPYPQQRYRLSVRFKKVRHTAILQRFRRIGRDDFKLLADD 813

Query: 870 -----PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISG 922
                     R  S  + + SR R  R S    +A+  E     + AP ++++ F+N++G
Sbjct: 814 FQFSEQYRRRRKRSKHDSNRSR-RGDRHSGRRQEAHEHEPYRLTYEAPARQLSPFENVAG 872

Query: 923 HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
           + G F+ G  P +C + ++  LR+HP   DG +VAF    +      F Y T+ G++++ 
Sbjct: 873 YAGLFIGGGYPYFCFLSKQGDLRLHPMHIDGPVVAFAPYCSPKQLRAFAYFTADGMMRVS 932

Query: 982 QLPSGSTYDNYWPVQKV 998
            LPS   +D   P  KV
Sbjct: 933 SLPSKFDFDRSIPSMKV 949


>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
          Length = 1506

 Score =  186 bits (473), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 180/690 (26%), Positives = 297/690 (43%), Gaps = 128/690 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLV    NV+ +Y + VQ +G+ + +   E      +DG+     + V   R  GN    
Sbjct: 29  NLVTVQGNVLSVYNL-VQAQGAADKRCHLEADISFTLDGVP----QDVATVRPRGN---- 79

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLK 171
                        RD +I  F+DA+++++ FD  +  L   S+H FE  +     W   +
Sbjct: 80  ------------SRDLLIFTFKDARVAIVRFDPKMRDLETVSLHAFEDTDTKLGGWHSEQ 127

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF-GSGGGFSARI 230
           R R        V VDP  RC  ++VYG ++I++  S G +    + DT   +   F++R 
Sbjct: 128 RLR--------VCVDPLHRCAALMVYGCKLIVISFSSGTATAAPEADTQEDTEQSFTSR- 178

Query: 231 ESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
               VI+L  L   +  V D  F+ GY  P + ILH+    W G ++    T  ++ALS+
Sbjct: 179 ----VIDLLSLPSTIGRVDDMAFLDGYDVPCLAILHQPRPAWVGHMAKTKDTAHVTALSL 234

Query: 289 S------------TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
           +                  P++W   NLP D + L  VP+P+GGV+V+G N + Y +QS 
Sbjct: 235 ALDEMTARRAPTAPPPPPPPVVWHQENLPSDTFALQPVPAPLGGVVVIGVNVLFYVTQSL 294

Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
             +LALN Y+ +  ++    ++  S++LD AH   L     L +  +GD+ LLT+V    
Sbjct: 295 VRSLALNGYSRASTNAPIQEQTGISLDLDGAHHALLTPTQILFALPSGDIHLLTIVCTDV 354

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGL--K 451
            V  L + K   SV+ SDI T+G    F+ SR   SLL+++      + T +  SG+  +
Sbjct: 355 TVDGLRMDKLATSVIGSDICTLGRRHIFIASRHATSLLLEWAPIPLSATTHIDVSGVSGR 414

Query: 452 EEFGDIEADAPSTKRLRRSSS------------DALQDMVNGEELSLYGSASNNTESAQK 499
           ++ G     + ST  L  S+S            D   D+V+G     +G  S        
Sbjct: 415 DDAGLYGTSSDSTAALNTSASRDGSSTGGDDLDDVYGDVVDGGTTGAHGIGSGGR---VM 471

Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE---------------- 543
           T     RD+L  + P+K  + G   +A         +S YELV                 
Sbjct: 472 TVKLMARDALPTVAPIKSTAVG--TSAQGVVPHADPRSQYELVSCIGHDKNGALANISYS 529

Query: 544 -----------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
                      L   K  W V+  +S+               +H +++ S   +TMV   
Sbjct: 530 LKPQVLLTEDALSSVKDCWAVHSNNSK---------------HHTHVVFSKPKKTMVFRV 574

Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES 652
           A    ++     +  +  T+ AGN+ GR+ V+QV  +   +LD     +D  F       
Sbjct: 575 AGDFEQLRHPRGFDTEASTVFAGNVMGRQLVLQVTAKHVMLLDD----RDCVF------D 624

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
               +   +  VS+ADPY+ L ++D + ++
Sbjct: 625 ERMKKGVRITKVSVADPYIALLLNDATTKV 654



 Score = 48.9 bits (115), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 58/270 (21%), Positives = 95/270 (35%), Gaps = 44/270 (16%)

Query: 757  CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEIN------------ 804
            C ++G L IF VP+   VF    F        D+  R+ +   E E              
Sbjct: 807  CDKNGVLSIFQVPDMREVFCCTVFSVLPNVAWDSVYRKEIGPVELEPEMPLKRAKTMDEK 866

Query: 805  -----------SSSEEGTGQGRKENIHSMKVVELAMQRWSA----HHSRPFLFAILTDGT 849
                       +  E G+ Q  ++    ++  E+ +    A      SRP LF       
Sbjct: 867  GQSVFVEADEEADDESGSAQAEEDEQDRLQRKEMTIVELLAIGLGRGSRPHLFLRNETQH 926

Query: 850  ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
            ++ Y+ +       TS      S  R          RLR      T +D    + +    
Sbjct: 927  VIVYEIF-------TS------SYKRHEKYEGRLQIRLRKRHQHPTWIDERLAQSS--SI 971

Query: 910  PCQRITIFKNISGHQGFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
            P      F +ISG  G F+   RP W M     + +R H    DG++  FT L +     
Sbjct: 972  PPAAFRPFADISGCDGVFVCARRPSWFMCDHTHKVVRHHAMRFDGAVQCFTQLKHAMHTS 1031

Query: 968  GFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
             F+Y T +G++++    +G       P ++
Sbjct: 1032 CFLYFTGKGVMRMATTAAGQVLSTPLPSRR 1061


>gi|213407244|ref|XP_002174393.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
 gi|212002440|gb|EEB08100.1| cleavage factor one Cft1 [Schizosaccharomyces japonicus yFS275]
          Length = 1431

 Score =  186 bits (471), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 255/1069 (23%), Positives = 430/1069 (40%), Gaps = 213/1069 (19%)

Query: 57   NLVVTAANVIEIY-VVRVQEEGS--------------KESKNSGETKRRVL-MDGISAAS 100
            NL+V   + ++++ +VRVQ +                +E+    ET  +++     +  +
Sbjct: 29   NLIVAKDDFLQVFDIVRVQRDSDDVEDAFGSSMNLRMEENDAFMETNMQLIRTHEHTVYT 88

Query: 101  LELVCHYRLHGNVESLAILSQ--GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
            L LV   R+ G ++ LA++    GG       D ++L    AK+S+L +D     L   S
Sbjct: 89   LRLVYQTRVFGTIKDLAVVKPKLGGFTT----DLLVLLTNYAKVSILVWDSLTQQLSTVS 144

Query: 159  MHCFES---PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
            MH +ES   P+ +      E+ A+   + VDP+  C  +  YG  M I+   +     + 
Sbjct: 145  MHYYESVVPPKPI----AEETPAQ---LIVDPESTCCVLRFYGDMMAIIPFRKPEDLEME 197

Query: 216  DEDTFGSGG-GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
            D +               S V+    LD  +  V D  F+ GY E  + +L+  E T   
Sbjct: 198  DANAQSEKPVDVQCVYLPSFVLTASQLDYSIARVLDSKFLEGYREATLALLYCPEQTSTV 257

Query: 273  RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY- 331
             +  +  T  ++ +++    +   +I S  NLP+D Y +L VP+P+GG L++G N I Y 
Sbjct: 258  FLPVRKDTVSLAVITLDIEQRASAVITSIHNLPYDIYCILPVPAPLGGSLLLGGNEIIYV 317

Query: 332  HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDL 386
             S  ++  +A+N +  +  + Q   RSSF +EL+      L     ++   LL   TG L
Sbjct: 318  DSAGSTVGIAVNPFYRNATNFQLEDRSSFQLELEGTIGVPLSSPRTESVSVLLIHPTGQL 377

Query: 387  VLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTC 439
              L  + DG+ V+ LDL     + N ++L S +T    + +   FLGS+ GDS LVQ++ 
Sbjct: 378  FYLDFLMDGKNVKNLDLHPASDELNNALLQSGVTCALPVADHELFLGSQTGDSYLVQWSR 437

Query: 440  GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
             S  +         + D E DA           D L D        +Y + S       K
Sbjct: 438  RSINNQTQEEGTLTYKDEENDA-------DEEVDELDD--------IYDTGSKEKAKRNK 482

Query: 500  -----TFSFAVRDSLVNIGPLKDFSYG---------------LRINADASATGIS----- 534
                      V D L N+GP+ +F  G               L +    + TG S     
Sbjct: 483  FVELGPLRLEVHDVLSNVGPIIEFCTGKAGSLAYFPQDNHGPLEVTC-VTGTGKSGSLVV 541

Query: 535  -KQSNYELVE----LPGCKGIWTVYHKSSRGHNADSSRMAAY-DDEYHAYLIISLEARTM 588
             ++S   +VE      GC+ +WT+ H + R  N  S     Y DDEY  YL++S E  + 
Sbjct: 542  FRRSISPVVEGKFNFEGCQSLWTI-HVTGRLKNPRSHGSERYLDDEYDTYLVVSKEKESF 600

Query: 589  VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGP 647
            V    +   EV +S D+  +G TI  G L G  R++Q+     R+ D + ++ Q ++ G 
Sbjct: 601  VFTAGETFDEVEDS-DFNTKGSTINVGGLLGGMRIVQICTTSLRVYDPNIHLVQRINLG- 658

Query: 648  SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
                     +   V++ S+ DPYV+L +  G I L   D  T  +       +    K V
Sbjct: 659  ---------KKQNVVAASVCDPYVVLVLLGGRILLYSMDAETQRL---IKMDLHKQLKNV 706

Query: 708  SSCTLYHDKGPEPWLRKTSTDAWL-----STGVGE-AIDGADGGP--------------- 746
             + +LY     +P +++  ++  L     S G  +  +DG D  P               
Sbjct: 707  KAASLYSTN--DPVMQELFSELDLGRNNSSPGKSDIQMDGVDTQPDRPSMPAGNQVTETN 764

Query: 747  ---LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS 799
               LD+    +     V ++ G L++  +P ++CV   D F       + T + + L   
Sbjct: 765  VSTLDEQSFAAHFVLFVLHDDGRLKVLHLPTYSCVLECDVF------DLPTVLYDGLSSE 818

Query: 800  E-TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF 858
              TE++ SS+E              +VE+             L        I  Y+ ++ 
Sbjct: 819  RVTEMHESSQE--------------LVEVLATDLGDEAKEAHLLIRSRMNEITVYKPFVC 864

Query: 859  EGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET------------- 905
              P  T K++                     LRFS+ P +  TRE T             
Sbjct: 865  SNPV-THKTE---------------------LRFSKIPQEGMTRESTECSLQDLVAETEQ 902

Query: 906  ---PHGAPCQ------------RITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQL 949
               P  A  Q            R+   + I  H   F++G++P + +     + + HP L
Sbjct: 903  ENAPKDASEQKPQKSSSTVDKPRMVALQRIGNHSAVFITGAKPFFLLKTAHSVAKFHPLL 962

Query: 950  CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             +  I++    H  +   G+I+V     + IC+      YD+ W  +KV
Sbjct: 963  SECRILSLASFHTEHAPKGYIFVDENYDINICRFQDDINYDHRWGYKKV 1011


>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
 gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
           factor subunit 1; AltName: Full=Cleavage and
           polyadenylation specificity factor 160 kDa subunit;
           Short=CPSF 160 kDa subunit
 gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
          Length = 1454

 Score =  186 bits (471), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 162/589 (27%), Positives = 273/589 (46%), Gaps = 84/589 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++ F+DAK+S++  ++    ++  S+H FE+    +L+ G  +  + PLV+ DP  
Sbjct: 92  QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +                  S RI S +VI L+ +D  + ++ 
Sbjct: 149 RCAACLVYGKHIAILPFHEN-----------------SKRIHS-YVIPLKQIDPRLDNIA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSS---FSVE 363
             +LL +P P+GG LV G+NT+ Y +Q+   C L LN+     D   + P        + 
Sbjct: 251 CSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS---CYDGFTKFPLKDLKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD + + ++++    + ++ GDL LL ++    G  V+ L+ SK   + +   +T     
Sbjct: 308 LDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD--ALQDMV 479
             F+GSRLGDS L+++T    T                   + KRL+  + D  A +  +
Sbjct: 368 HLFVGSRLGDSQLLEYTLLKTTRDC----------------AVKRLKIDNKDPAAAEIEL 411

Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + +++ LYG A     +++ E   ++  F   D L N+GP+K    G R N  ++    +
Sbjct: 412 DEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVGPVKSMCVG-RPNYMSNDLVDA 470

Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
           K+ +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+
Sbjct: 471 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLEGAEQLWAVGRKENESHKYLIV 530

Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
           S   R+ ++          E   +     T+AAG L      +QV     A + DG  M 
Sbjct: 531 S-RVRSTLILELGEELVELEEQLFVTGEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 588

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDP 687
           Q++              N  V+  SI DPYV L   +G + L  LV +P
Sbjct: 589 QEVHI----------DSNFPVIQASIVDPYVALLTQNGRLLLYELVMEP 627



 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 57/262 (21%), Positives = 111/262 (42%), Gaps = 34/262 (12%)

Query: 755  VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVD-TYMREALKDSETEINSSSEEGTGQ 813
            +V +E+G L I  +P    V+ + +F +    +VD T   E  +       ++ E     
Sbjct: 777  IVSHENGRLSIHSLPEMEVVYQIGRFSNVPELLVDLTVEEEEKERKAKAQQAAKEASVPT 836

Query: 814  GRKENIHSM------KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
               E +++       +V+E  +     + + P L AI+ D  ++ Y+ +          S
Sbjct: 837  DEAEQLNTEMKQLCERVLEAQIVGMGINQAHPILMAIV-DEQVVLYEMF---------SS 886

Query: 868  DDPVSTSRSLSVSNV-------SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI 920
             +P+     +S   +       ++S L N    R P +     +  +G     I  F+ +
Sbjct: 887  SNPIPGHLGISFRKLPHFICLRTSSHL-NSDGKRAPFEM----KINNGKRFSLIHPFERV 941

Query: 921  SG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QG 976
            S  + G  + G+ P   +V+     ++ H    DG I AFT  +N N  HG +Y+T  + 
Sbjct: 942  SSVNNGVMIVGAVPTL-LVYGAWGGMQTHQMTVDGPIKAFTPFNNENVLHGIVYMTQHKS 1000

Query: 977  ILKICQLPSGSTYDNYWPVQKV 998
             L+I ++     Y+  +PV+K+
Sbjct: 1001 ELRIARMHPDFDYEMPYPVKKI 1022


>gi|260835073|ref|XP_002612534.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
 gi|229297911|gb|EEN68543.1| hypothetical protein BRAFLDRAFT_58262 [Branchiostoma floridae]
          Length = 318

 Score =  184 bits (468), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 107/297 (36%), Positives = 168/297 (56%), Gaps = 38/297 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           +LVV     + +Y ++   E S++ K                  +ELV  + ++GN+ S+
Sbjct: 29  SLVVAGTTQLHVYRLKGDMEKSRKQK------------------MELVASFSMYGNIMSV 70

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G+D    RD+++L+F DAK+S++E+D   H L+  SMH FE  E   +K G  S
Sbjct: 71  ESVQLAGSD----RDALLLSFMDAKLSIVEYDPGTHDLKTASMHYFEEEE---VKDGYVS 123

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
               P+V+VDP+GRC  +L+YG ++++L   + G+    DE    +G   S  I  +++I
Sbjct: 124 NYHAPMVRVDPEGRCAVMLIYGKRLVVLPFRKEGAV---DEAEMSAGSKSS--ILPTYMI 178

Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            L+DLD +  +V D  F+HGY +P ++IL+E   TW GRV+ +  TC I A+S++   + 
Sbjct: 179 KLQDLDERLINVVDLQFLHGYFDPTLLILYEPLQTWPGRVAVRQDTCCIVAVSLNIAQRV 238

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
           HP+IWS  NLP D  + +AVP PIGGVLV   N++ Y +QS         Y VSL+S
Sbjct: 239 HPIIWSVGNLPFDCKQAVAVPKPIGGVLVFAVNSLLYLNQSVP------PYGVSLNS 289


>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Metaseiulus occidentalis]
          Length = 1500

 Score =  183 bits (465), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 122/404 (30%), Positives = 199/404 (49%), Gaps = 61/404 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS----AASLELVCHYRLHGN 112
           NLVV    VI++Y                    R++ DG++     A LE    +   GN
Sbjct: 29  NLVVAGGTVIKVY--------------------RLVCDGLNETDDKAKLEHQQTFNCFGN 68

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           +  +  +    +     RDS++  F++ KIS++E+D + H L+  ++   E  E+   K 
Sbjct: 69  ISGMEKIRLNAS-----RDSLLFVFKETKISLVEYDPATHELQTLAIRSLEKEEY---KE 120

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFS 227
           G  +F    L+KVDP  RC  VL+YG  + I+      A+     +   + T  +  GF 
Sbjct: 121 GFYNFVGNTLIKVDPLNRCAAVLIYGKHLAIIPFVKKDATDLSDPIASSKSTQTNTSGFL 180

Query: 228 ARIESSHVINLRDLD----MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
                 + I L DLD    + ++ D  F++GY EP +++L+E   TW GRV+ +  TC I
Sbjct: 181 ----EYYTIRLIDLDEEKGVNNIHDMTFLNGYYEPTLLLLYEPIRTWTGRVAIRQDTCSI 236

Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALN 343
            ALS++   + HP +WS   LP +++K+L VP PIGGVL++  N + Y +QS        
Sbjct: 237 MALSLNVYQRVHPPVWSFSGLPFNSFKVLPVPKPIGGVLILSVNALLYLNQSVPA----- 291

Query: 344 NYAVSLDSSQELPRSSFSVE--------LDAAHATWLQNDVALLSTKTGDLVLLTVVYDG 395
            Y VSL+   E   +SF ++        LD     +L     LLS   GDL +L++  DG
Sbjct: 292 -YGVSLNCFTEC-STSFPLKDQAGPPLTLDCCRCEFLSETKILLSVANGDLYVLSLFTDG 349

Query: 396 -RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            R + + +  K   + + + I+       F+GSR+G+SLL+++T
Sbjct: 350 MRSINQFEFKKIATTTVATCISLCEPGYLFVGSRIGNSLLLRYT 393



 Score =  151 bits (382), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 140/522 (26%), Positives = 217/522 (41%), Gaps = 114/522 (21%)

Query: 543  ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
            ELPGC  +WTV   S+R  + D        ++ H +LI+S    TM+L+T   + E+  S
Sbjct: 603  ELPGCTDLWTVRSSSTRSPDVD--------EDSHQFLILSRPDSTMILQTGQEINELDHS 654

Query: 603  VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVL 662
              +  Q  TI AGNL   R +IQV     R+L+G    Q +               S ++
Sbjct: 655  -GFCTQSPTIFAGNLADGRYIIQVCPNSVRLLEGVKQLQQVPI----------DVGSPLV 703

Query: 663  SVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTPAAIESSKKPVSSCTLYHD---- 715
            S SIAD +VL+   DG +  L   GD +T   +SV  P     +K  +++  +Y D    
Sbjct: 704  SASIADLHVLVMSQDGLVIQLTLRGDDTTGYKLSVLKPQ-FPGAKSKITALCIYKDVSGL 762

Query: 716  -----KGPEPWLR-KTSTDAWLSTGV------------------GEAIDGAD--GGPLDQ 749
                 + PE   + KT     + T V                  G ++D  D   G L+ 
Sbjct: 763  FVTKIQKPEDIAKPKTEAKTKVKTEVAKKVLRSADFDDEDELLYGSSVDIKDLVAGGLNA 822

Query: 750  GDI-----------------------------YSVVCYESGALEIFDVPNFNCVFTVDKF 780
             +I                             +  +  E+GALEI+  P++   + V  F
Sbjct: 823  ANIVPTTQTKDTAEEEDYEENVRKIAPVEPTFWVFLARENGALEIYSFPDYKLRYFVKNF 882

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                  + +  ++ A    +T   S+SE              KV+E+ +     H SRP 
Sbjct: 883  -----PLCNKILQNAAATGQTTSASTSEAQLP----------KVMEIFVCALGMHQSRPL 927

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            LFA + D  +  Y+AY F               ++      +   RL++   +  P   Y
Sbjct: 928  LFARV-DSELHIYEAYPF--------------VNQKEGHLKLQFRRLQH-AVTMEPRRVY 971

Query: 901  TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTV 959
             ++E       + I  F+++ G+ G F+ G RP W  +  R  LR HP L DG I +F  
Sbjct: 972  KQKEGDPTLSLRWIRAFQDVCGYNGVFVCGRRPHWIFLTARGELRAHPMLNDGRIYSFAT 1031

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVVFF 1001
             HNVNC  GF++    G L+IC LPS   YD  WP++K+  +
Sbjct: 1032 FHNVNCEKGFLFFNKYGELRICALPSYLNYDAPWPMRKIPIY 1073


>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Strongylocentrotus purpuratus]
          Length = 1223

 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 141/456 (30%), Positives = 221/456 (48%), Gaps = 60/456 (13%)

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           RV+ +  TC I ALS++   K HP+IWS  +LP+D  ++ AVP PIGGVL++  N++ Y 
Sbjct: 11  RVAVRQDTCSIVALSLNMAQKVHPIIWSQSSLPYDCMQVQAVPKPIGGVLILAVNSLLYL 70

Query: 333 SQSASCALALNNYAVSLDS----SQELP---RSSFSVELDAAHATWLQNDVALLSTKTGD 385
           +QS      +  Y VSL+S    S   P   +    + +D   AT++  D   LS K G+
Sbjct: 71  NQS------IPPYGVSLNSLTDWSTAFPLKTQEGVKLSMDCTQATFISYDRLALSLKDGE 124

Query: 386 LVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
           + +LT++ DG R V+   L K   SVLT+ I  +G+   FLGSRLG+SLL+++T     +
Sbjct: 125 IYVLTLLVDGMRSVRGFHLDKAAASVLTTCICPMGDGFLFLGSRLGNSLLLKYTEKVSET 184

Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD----MVNGEELSLYGSASNNTESAQKT 500
             +   K E      + P+ K     +SD +      + + +EL +YG     T +   +
Sbjct: 185 SPTDASKTEEPKPGEEPPTKKMRSDDASDWMASDTKFLDDPDELEVYGKQVQKTGTQLTS 244

Query: 501 FSFAVRDSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELVE---- 543
           +SF + DSL+NIGP  +   G         + N D      + +G  K     +++    
Sbjct: 245 YSFEICDSLLNIGPCGNMIMGEPAFLSEEFQGNVDPDLELVTTSGYGKNGALSVLQRTIR 304

Query: 544 --------LPGCKGIWTVYHKSSRGHNAD----SSRMAAYDDEYHAYLIISLEARTMVLE 591
                   LPGC  +WTV  KS +   AD     S  +  D + HA+LI+S +  +MVL+
Sbjct: 305 PQVVTTFNLPGCLDMWTV--KSLKEAKADEKSEESEASPEDKDRHAFLILSKQDSSMVLQ 362

Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
           T   +TEV     +  Q  TI A N+   R ++QV  +   +++G    Q +        
Sbjct: 363 TGQEITEVAAG-GFSTQAPTIFASNMGDDRYIVQVMNKSICLMEGVEQIQHMVL------ 415

Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
                  S +   S+ADPY+LL   +G   L+   P
Sbjct: 416 ----DVGSPIKQCSLADPYLLLLTENGDPILMTLKP 447



 Score =  106 bits (265), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 76/257 (29%), Positives = 117/257 (45%), Gaps = 32/257 (12%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + V C E+G LE++ +P+    F V  F  G   +VD               S S   TG
Sbjct: 560 WCVFCRENGQLEMYSLPDMVLAFLVKNFPMGSKVLVD---------------SGSAFMTG 604

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
              +++    +V E+ +        + ++ A++ D  I+ Y+A+    P NT   +  + 
Sbjct: 605 DQSQQHEMLQQVQEVLLVGLGHDRKKIYMLALVEDD-IMIYEAF----PYNTVTQEHHLR 659

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPL-DAYTREETPHGAP---------CQRITIFKNISG 922
             R   + +    + +  R S+ P  +  T+ ET   A            R+  F N+  
Sbjct: 660 V-RFRKIPHKILMKPKKTRTSKKPTAEGGTKPETETEAESDTKTTSRRVNRLREFHNVQT 718

Query: 923 HQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
           + G F+SGS P W  V  R  LR HP   DG+I  F   HNVNC +GF+Y   +  L+IC
Sbjct: 719 YSGVFISGSHPYWLFVTSRGALRTHPMPVDGAISCFASFHNVNCPNGFLYFNRKEELRIC 778

Query: 982 QLPSGSTYDNYWPVQKV 998
            LPS  +YD  WPV+KV
Sbjct: 779 VLPSHLSYDAPWPVRKV 795


>gi|393907593|gb|EJD74705.1| CPSF A subunit region family protein [Loa loa]
          Length = 990

 Score =  182 bits (461), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 167/640 (26%), Positives = 276/640 (43%), Gaps = 83/640 (12%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE +   RL   V+S AI        +   DS++L F+DAK+S++  + +   L+  S+H
Sbjct: 62  LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G       P+++VDP  RC  +LV+G  + +L  +  G+ L       
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168

Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+ + L  +D  + +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
            T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337 -SCALALNNYAVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
             C ++LN+    +D   + P   F    + LD    T +  +  LL  + G L  L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLV 332

Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            D    V+ L+L     +V+   +T+      F+GSRL DS+ +       T        
Sbjct: 333 TDATNSVKSLELKFQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------- 384

Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT---ESAQKTFSFAVRDS 508
                ++  AP   +L  +  +A +D    E+  LYG         +SA++  +  V D 
Sbjct: 385 -----LDESAPKKIKL-NTELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDK 434

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
           L+N+GP K  + G    +        K   ++LV   G    G   ++ +S R     SS
Sbjct: 435 LLNVGPCKKITGGCPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSS 494

Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
            +            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L
Sbjct: 495 SIEGVVQYWAVGRREDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGEL 553

Query: 618 FGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
                 +QV      ++      Q +                 V S SI DPY+ +   +
Sbjct: 554 ADGGLAVQVTTSSLVMVAEGQQIQHIPL----------QLTFPVRSASIVDPYIAICTQN 603

Query: 678 GSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
           G + +  L   P      +     +     P++S ++Y D
Sbjct: 604 GRLLMYELTSHPHVHLKEIDISKRLRHETSPITSLSIYRD 643



 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 62/250 (24%), Positives = 111/250 (44%), Gaps = 29/250 (11%)

Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEEGTG 812
           E+G + I+ +P  + V+ V K     +H+ D    +   D E       +  S++   T 
Sbjct: 733 ENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMTDTF 788

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
             + E +    ++EL M     +  RP LF +L D T+  Y+ + +    N         
Sbjct: 789 AAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGHLAV 839

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGFFLS 929
             + L  + V+    R+ RF    LD     E+   A   +  +  F+ I     G F+ 
Sbjct: 840 RFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGVFIC 893

Query: 930 GSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGST 988
            S PC   +     R+HP   DG I++FT  +N  C +GFIY+T  + ++++ +LP+   
Sbjct: 894 SSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLPNDMI 953

Query: 989 YDNYWPVQKV 998
            D  +PV+++
Sbjct: 954 LDTSYPVKRI 963


>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
          Length = 1105

 Score =  181 bits (460), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 187/727 (25%), Positives = 314/727 (43%), Gaps = 166/727 (22%)

Query: 388 LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
           +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++T      + 
Sbjct: 2   VLTLITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYT----EKLQ 57

Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSD--------ALQDMVNGEELSLYGS-ASNNTESA 497
            + ++E   + E +     + +R  S+         L D ++  E+ +YGS A + T+ A
Sbjct: 58  ETPMEEGKENEEKEKEPPNKKKRVDSNWAGCPKKGNLPDELD--EIEVYGSEAQSGTQLA 115

Query: 498 QKTFSFAVRDSLVNIGPLKDFSYG--------LRINADAS-----ATGISKQSNYELV-- 542
             T+SF V DS++NIGP    S G         + N +        +G  K     ++  
Sbjct: 116 --TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVVVCSGYGKNGALSVLQK 173

Query: 543 ----------ELPGCKGIWTVYHKSSR---------GHNADSSRMAAY---DDEYHAYLI 580
                     ELPGC  +WTV +   +         G + +  +       D + H +LI
Sbjct: 174 SIRPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLI 233

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMT 640
           +S E  TM+L+T   + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G    
Sbjct: 234 LSREDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---V 289

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI---------------RLLVG 685
             L F P +         S ++  S+ADPYV++  ++G +               RL + 
Sbjct: 290 NQLHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQ 342

Query: 686 DPSTCT------------------------------VSVQTPAAIESSKKPVSSCT---- 711
            P   T                              ++++T +  E+  + +S+      
Sbjct: 343 KPQIHTQSRVITLCAYRDVSGMFTTENKVSFLAKEEIAIRTNSETETIIQDISNTVDDEE 402

Query: 712 --LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVP 769
             LY +  P     K  +    +           G    +   + ++  E+G +EI+ +P
Sbjct: 403 EMLYGESNPLTSPNKEESSRGSAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLP 462

Query: 770 NFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAM 829
           ++  VF V  F  G+  +VD+    +   S T+     EE T QG   +I  +K  E+A+
Sbjct: 463 DWRLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTRQG---DIPLVK--EVAL 513

Query: 830 QRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRN 889
                 HSRP+L A + +  +L Y+A+ ++               +  + SN+       
Sbjct: 514 VSLGYSHSRPYLLAHV-EQELLIYEAFPYD---------------QQQAQSNL------K 551

Query: 890 LRFSRTPLDAYTREET--------PHG---------APCQRITIFKNISGHQGFFLSGSR 932
           +RF + P +   RE+         P G             R   F++ISG+ G F+ G  
Sbjct: 552 VRFKKMPHNINYREKKVKVRKDKKPEGQGEDSLGVKGRVARFRYFQDISGYSGVFICGPS 611

Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           P W +V  R  +R+HP   DG+I +F+  HN+NC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 612 PHWMLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDA 671

Query: 992 YWPVQKV 998
            WPV+K+
Sbjct: 672 PWPVRKI 678


>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
 gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
          Length = 1448

 Score =  181 bits (458), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 155/578 (26%), Positives = 263/578 (45%), Gaps = 86/578 (14%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +DSI++AF+DAK+S++  ++    ++  S+H FE+    +L+ G  ++   P+V+ DP  
Sbjct: 92  QDSILMAFDDAKLSIVAVNEKERNMQTISLHAFENE---YLRDGFINYFHPPIVRTDPSN 148

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVK 247
           RC   LVYG  + IL   +    ++                  S++I L+ +D  + +V 
Sbjct: 149 RCAASLVYGKHIAILPFHENSKRIL------------------SYIIPLKQIDPRLDNVA 190

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           D +F+ GY EP ++ L+E   T  GR   ++ T  I  +S++   +Q  ++W   NLP D
Sbjct: 191 DMVFLDGYYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVVWQTANLPMD 250

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELP---RSSFSVE 363
              LL +P P+GG LV G+NTI Y +Q+   C + LN+     D   + P        + 
Sbjct: 251 CTSLLPIPKPLGGALVFGSNTIVYLNQAVPPCGVVLNS---CYDGFTKFPLKDMKHLKMT 307

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGNS 421
           LD A + ++++    +  + G L LL +V    G  V+ ++ S+   + +   +T     
Sbjct: 308 LDCATSVYMEDGRIAVGGRDGVLYLLRLVTSSGGATVKSMEFSRVWETSIAYCLTVCAPG 367

Query: 422 LFFLGSRLGDSLLVQFTCGSGT--SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
             F+GSRLGDS LV++T    T  S     ++++ G+IE D                   
Sbjct: 368 HLFIGSRLGDSQLVEYTLLKMTKESAKRQKIEKDPGEIELDE------------------ 409

Query: 480 NGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
             +++ LYG A     +++ E   ++  F   D L N+GP+K   +G R N  +S     
Sbjct: 410 --DDMELYGGAIEMQLNDDEEQILESLEFRELDRLRNVGPVKSMCFG-RPNYMSSDLAEM 466

Query: 535 KQSN--YELVELP--GCKGIWTVYHKSSRGHNADSSRMAA---------YDDEYHAYLII 581
           K+ +  ++LV     G  G   V+ +S R     SS +            ++E H YLI+
Sbjct: 467 KRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSILEGAEQLWAVGRKENESHKYLIV 526

Query: 582 SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMT 640
           S   R+ ++          E   +     T+AAG L      +QV     A + DG  M 
Sbjct: 527 S-RVRSTLVLELGEELVELEEQLFVTNEPTVAAGELSQGALAVQVTSTCIALVTDGQQM- 584

Query: 641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           Q++              N  V+  SI DPYV +   +G
Sbjct: 585 QEVHI----------DSNFPVVQASIQDPYVAVLTQNG 612



 Score = 59.3 bits (142), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 64/289 (22%), Positives = 121/289 (41%), Gaps = 26/289 (8%)

Query: 723  RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYS------VVCYESGALEIFDVPNFNCVFT 776
            R+   DA +S+  GE  D      +D    YS      +V +++G L I  +P+   V+ 
Sbjct: 741  RRLGHDAIMSSRGGEQSDA-----IDPTRTYSSITHWLMVAHDNGRLSIHSLPDMELVYQ 795

Query: 777  VDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR----W 832
            + +F +    ++D    E  K+ + +   ++++      +      K+ E  M+      
Sbjct: 796  IGRFSNVPELLMDMTTDEEEKERKAKAQQAAKDTAADEDQLTTEMKKLCERVMEAQIVGM 855

Query: 833  SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
              + S P L AI+ D  ++ Y+ +    P+        ++  +      +  S   N   
Sbjct: 856  GINQSHPVLMAIV-DEQVVMYEMFSHYNPQAGHLG---IAFRKLPHFICLRTSSHLNSDG 911

Query: 893  SRTPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFR-ERLRVHPQLC 950
             R P +     E  +G     I  F+ IS  + G  + G+ P   +      ++ H    
Sbjct: 912  KRAPFEM----EVENGKRYTLIHPFERISSINNGVMIGGAVPTLVVYGAWGGMQTHQMTI 967

Query: 951  DGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKV 998
            DG I AFT  +N N  HGF+Y+T Q   L+I ++     Y+  +P++K+
Sbjct: 968  DGPIKAFTPFNNENVLHGFVYMTQQKSELRIARMHPDFDYEMPYPMKKI 1016


>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
          Length = 1468

 Score =  179 bits (454), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 163/650 (25%), Positives = 277/650 (42%), Gaps = 97/650 (14%)

Query: 88  KRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEF 147
           K+  ++   +   LELV  ++++G + ++  +           DS++L F DAK+S+LE+
Sbjct: 87  KKGGMISDTTLGRLELVAQFKMNGIITTMGTVRTNSPRGREGCDSLLLGFSDAKMSLLEW 146

Query: 148 DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL---VKVDPQGRCGGVLVYGLQMIIL 204
             S + +   S+H +E  E+      ++ F   P    + +DPQ RC     Y  ++ +L
Sbjct: 147 SSSTNSIITVSIHYYERDEF------KKEFLTNPYPSAIHIDPQQRCAVFNFYDNKLAVL 200

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
              Q  S  + +    G           S +I+L  LD  +K+V D  F+  Y EP + I
Sbjct: 201 PFRQ--SDKLDERQGEGEEDEEKWPYYPSFIIDLATLDSRIKNVIDMTFLSDYYEPTLAI 258

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
           L + E TW GR+     T  +  +S+  T K +P+I+S   LP+D +KL+A+P P+ G+L
Sbjct: 259 LFQPEQTWTGRLGNNKDTVSLVVISLDITAKIYPIIYSIDKLPYDCFKLVAMPKPVTGML 318

Query: 323 VVGANTIHYHSQ-SASCALALNNYAVSLDSSQELPRSSFS-------VELDAAHATWLQN 374
           V+ AN+I + SQ S    +A+N Y      + + P   +        + L+ A A     
Sbjct: 319 VIAANSILHVSQGSPGMGVAVNGYT---KKTTDFPGMIYEPSLIELGLSLEGAKALAFGG 375

Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN---------------PSVLTSDITTIG 419
           D  L+  + G   L+ V  DG  V  + +S+                 P +L S  + + 
Sbjct: 376 DRCLIFMQNGHWALVEVRRDGNKVVGMAISEIKHDLPVMEKKPPRFDTPPLLASVPSCVT 435

Query: 420 N----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
           N      FFLGSR+GDSLL+++          +       D   +      +     D +
Sbjct: 436 NVKAGEYFFLGSRVGDSLLIKYDANRVNHQSVAPPVFRVCDTMLNTGPIVDMAVGDVDTV 495

Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
           +   +  +L L  S+ +    A   F         +I P   F++      D+ A     
Sbjct: 496 EQQEDWPQLELVSSSGHGKNGALCVFQ-------RHIYPQTSFAFH---QFDSQA----- 540

Query: 536 QSNYELVELPGCKGIWTVY-HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                         IW++   K+ +  N         DD++   L IS    T+VL   D
Sbjct: 541 --------------IWSIKCRKNDQQQNE--------DDDFDKLLFISKSKSTLVLSAGD 578

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSES 652
            L EV     ++ +G TIA   LF   R++QV+  G  +L  +G  +           ++
Sbjct: 579 ELQEV--KTGFYTRGSTIAVSTLFDATRIVQVYATGVMVLTPEGKRI-----------QT 625

Query: 653 GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC-TVSVQTPAAIE 701
                 + ++  SI DPY+LL + +  I  L GD ST   + +Q P  I+
Sbjct: 626 VPIPRGAKIVEASIHDPYILLTLDNNKILALQGDASTKDIIHIQLPNHIK 675



 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 67/259 (25%), Positives = 107/259 (41%), Gaps = 40/259 (15%)

Query: 760  SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
            +G L I+ +P+F   F   +F      IVD        DS              G K  I
Sbjct: 811  TGILRIYSLPDFKEHFACPQFSIAPDLIVD--------DS--------------GVKSRI 848

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
             +  + E+ M         P L        I+ Y+A+ +    +  +     S  +   V
Sbjct: 849  PTNNIQEILMTHIGKERKDPHLVVRTDTNDIIIYKAFTYLDESSPDRLALRFSRVQHEYV 908

Query: 880  SNVSASRLRNLRFSRTPLDAYTREET--------------PHGAPCQR--ITIFKNISGH 923
            S  S+S     +  R  +D +   +T                    QR  +  F +++G+
Sbjct: 909  SRKSSSHESKPKKKRGIIDEFEIPDTDLNEEEEDLKLSTKKMDKKIQRKLLIPFTDVAGY 968

Query: 924  QGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQ 982
             G F++G++P W M   +  +RVHP   +  IV FT  HNVNC HGFI V S+  +++ +
Sbjct: 969  AGVFVAGAQPAWLMCSCKSFVRVHPMKTEHEIVGFTQFHNVNCQHGFITVDSKSTIQLSR 1028

Query: 983  LPS-GSTYDNYWPVQKVVF 1000
            L + G  YD  W +QKV+ 
Sbjct: 1029 LRTEGINYDLDWVIQKVLL 1047


>gi|49619061|gb|AAT68115.1| cleavage and polyadenylation specificity factor 1 [Danio rerio]
          Length = 312

 Score =  179 bits (453), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 99/253 (39%), Positives = 149/253 (58%), Gaps = 18/253 (7%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE V  + L GNV S+A +   G +    RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 66  LEQVASFSLFGNVMSMASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 121

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            FE PE   L+ G       P+V+VDP+ RC  +LVYG  +++L      +  + DE   
Sbjct: 122 YFEEPE---LRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPFR---NDTLADEQEG 175

Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
             G G       S++I++R+LD  + ++ D  F+HGY EP ++IL E   TW GRV+ + 
Sbjct: 176 IVGEGQKFSFLPSYIIDVRELDETLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQ 235

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
            TC I A+S++   K HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS   
Sbjct: 236 DTCSIVAISLNIMQKVHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVP- 294

Query: 339 ALALNNYAVSLDS 351
                 + VSL+S
Sbjct: 295 -----PFGVSLNS 302


>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
 gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
          Length = 1323

 Score =  178 bits (451), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 169/647 (26%), Positives = 278/647 (42%), Gaps = 96/647 (14%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE +   RL   V+S AI       +    DS++L F+DAK+S++  + +   L+  S+H
Sbjct: 62  LECLLAVRLLAPVQSFAIARISQNPDC---DSLLLGFDDAKLSIVAVNPADRCLKTISLH 118

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G       P+++VDP  RC  +LV+G  + +L  +   + L       
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCASMLVFGRYLAVLPFNDSSTQL------- 168

Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+ + L  +D  + +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
            T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337 -SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
             C ++LN+    +D   + P   F   ++ LD A  T +  +  LL  + G L  L +V
Sbjct: 276 PPCGISLNS---CMDGFTKFPLKDFKHMALTLDGAVVTVVSTNKILLCDRNGRLFTLILV 332

Query: 393 YDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            D    V+ L+L     +V+   +T+      F+GSRL DS+ +   C    S L     
Sbjct: 333 TDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFLH--CIFEQSTLEES-- 388

Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT---ESAQKTFSFAVRDS 508
                      +TK+++ S+     +    E+  LYG         +  ++  +  V D 
Sbjct: 389 -----------ATKKMKLSTEPNANE--EDEDFELYGEVLPKVAKPDVTEELLNIRVLDK 435

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSS 566
           L+N+GP K  + G    +        K   ++LV   G    G   +  +S R     SS
Sbjct: 436 LLNVGPCKKITGGCPSVSAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSS 495

Query: 567 RMAAY---------DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
            +            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L
Sbjct: 496 SIEGVVQYWAVGRREDDTHMYFIASRELGTLALETDNDLVEL-EAPIFSTSESTIAAGEL 554

Query: 618 FGRRRVIQV-------FERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPY 670
                 +QV          G +I    Y+   L+F               V S SI DPY
Sbjct: 555 ADGGLAVQVTTSSLVMVAEGQQI---QYIPLQLTF--------------PVRSASIVDPY 597

Query: 671 VLLGMSDGSIRL--LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
           + +   +G + +  L   P      +     +     P++S ++Y D
Sbjct: 598 IAICTQNGRLLMYELTNQPHVSLKEIDISKRLRHETSPITSLSIYRD 644



 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/253 (24%), Positives = 110/253 (43%), Gaps = 29/253 (11%)

Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
           +  E+G + I+ +P  + V+ V K     +H+ D    +   D E       +  S +  
Sbjct: 731 IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPVTGEGIDAMSGTMT 786

Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
            T   + E +    ++EL +     +  RP LF +L D T+  Y+ + +    N      
Sbjct: 787 DTFAVKPEEV----IMELLLVGMGMNQGRPLLF-LLIDDTVSAYEMFTY----NNGIQGH 837

Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
                + L  + V+    R+ RF  T  D     E+   A   +  +  F+ I     G 
Sbjct: 838 LAIRFKRLPYTTVT----RSCRFQGT--DGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 891

Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQLPS 985
           F+  S PC   +     R+HP   DG I++FT  +N  C +GFIY+T +   +++ +LPS
Sbjct: 892 FICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAVCPNGFIYLTERDRFMRVAKLPS 951

Query: 986 GSTYDNYWPVQKV 998
               D  +PV+++
Sbjct: 952 DMILDASYPVKRI 964


>gi|407929511|gb|EKG22329.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
           phaseolina MS6]
          Length = 1418

 Score =  175 bits (443), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 224/969 (23%), Positives = 380/969 (39%), Gaps = 145/969 (14%)

Query: 99  ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
           + L LV  Y L G V SLA +     D     D++++AF DAK+S++E+D + H L   S
Sbjct: 81  SKLVLVAEYPLEGTVLSLARIK--ALDTKSGGDALLIAFRDAKMSLVEWDPANHALSTIS 138

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-GDE 217
           +H +E  E        +       +  DP  RC  +      + IL   Q G  LV GD+
Sbjct: 139 IHYYEGEELHGAPWDADLGHYHNFLAADPSSRCAALKFGARHLAILPFRQLGDDLVEGDD 198

Query: 218 ---------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
                          +   +G       +SS  ++L  +D  + H     F+H Y EP  
Sbjct: 199 YDPDFDEPMDAPAAKEKATNGDVAQTPYKSSFALSLPQIDPALTHPVHLDFLHEYREPTF 258

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            I+   +   A  +  +      +  ++    K    + S   LP+D +K++ +P P+GG
Sbjct: 259 GIISANKAAAASLLYERRDLLTYTVFTLDLEEKASTALLSVAGLPYDTHKVIPLPLPVGG 318

Query: 321 VLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVA 377
            L++GAN  IH      + A+A+N++A    S     +S  ++ L+ A    L  +N   
Sbjct: 319 ALLLGANQFIHVDQAGKTSAVAVNDFAKQCSSFPMSDQSELAMRLEGASIELLSPENGDL 378

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLG 430
           L+  K G L +++   DGR V  L + K +            S  T++G +  F+GS  G
Sbjct: 379 LVVLKDGSLAVISFKLDGRSVSGLSIRKISEEKGGHVVPTAASCTTSLGRNRMFIGSEDG 438

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
           DS+L+ +T     + LS   K    ++ AD            D      +G   +   SA
Sbjct: 439 DSVLLGWT--KKAAQLSR--KRSHAEMLADDAELSFDEEDLEDDDDLYGDGPSTAKTASA 494

Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN-YELV------- 542
           S+   S    ++F + D ++++ P+KD +       D +   + + ++  +LV       
Sbjct: 495 SSEA-SDPSNYTFRIHDIMLSLAPIKDVALASHKVTDTAIGTLERAADQLDLVVSTGRGA 553

Query: 543 -------------------ELPGCKGIWTVYHK--SSRGHNADSSRMA----AYDDEYHA 577
                              E    + +W+V+ K  + +G  A  S+ A    A D +Y  
Sbjct: 554 AGGLALMRREIDPVILRKGEFSNARAVWSVHAKKPAPKGMVAAGSQDAEAKLAADVDYDQ 613

Query: 578 YLIISL------EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
           +LI+S       E   +   TA    E  +         TI  G + G  R++QV +   
Sbjct: 614 FLIVSRSNGDGGEESAIFNITATGFEETNKGDFEREDAATINVGTIAGGTRIVQVLKAEI 673

Query: 632 RILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC 690
           R  D    + Q L   P   E+GS      ++S S ADPY+L+   D S+ +L  D +  
Sbjct: 674 RSYDSELGLDQIL---PMEDENGS---ELRIISASFADPYILVIRDDSSVIVLQADANGE 727

Query: 691 TVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
              +     + S+K                         WLS  + ++    +       
Sbjct: 728 MEEIDRGDTLLSTK-------------------------WLSGCIHQSQSTGEKA----- 757

Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
              + +    G L IF++P+ +    V   +         ++   L    T   SSS   
Sbjct: 758 --LAYLLSAEGGLHIFELPDLSKPVYVAASLG--------FLPPTLTADFTPRRSSS--- 804

Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                K  +  + V EL      + +  P+L    +   ++ YQ Y F   E        
Sbjct: 805 -----KAALTEVIVAELG----DSTYKTPYLIVRTSSNDLVIYQPYHFPAHEVVKP---- 851

Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
                +L    +   RL    FS  P  A   E+T  G      TI  N+ G+   F++G
Sbjct: 852 --FFENLRWLKIPQPRLPE--FSEEP--ALESEDTGIGKESILTTI-ANVGGYSAVFMAG 904

Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY- 989
           + P + +     L    ++   S+   +  H   C+ GF Y+ + G L++CQLP G  Y 
Sbjct: 905 TSPSFILKESSSLPRVIKMRTKSVKNLSSFHRAECDRGFAYINADGNLRVCQLPRGYRYG 964

Query: 990 DNYWPVQKV 998
           D  W V+K+
Sbjct: 965 DAGWAVKKI 973


>gi|291232724|ref|XP_002736306.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
           [Saccoglossus kowalevskii]
          Length = 304

 Score =  173 bits (439), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 118/354 (33%), Positives = 175/354 (49%), Gaps = 62/354 (17%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +A Y+ +H PTGI +C  G                  EE               NL++  
Sbjct: 2   YALYRQIHPPTGIEHCVYGH-------------FFSKEE--------------KNLIIAG 34

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQG 122
           A  + +Y + + +  SK+ K+  E  R                 + L GN+ SL      
Sbjct: 35  ATDLHVYRL-LSDVDSKQKKSKLEHLRS----------------FSLFGNIMSLQTTRLA 77

Query: 123 GADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL 182
           GA     RD+++L+F+DAK+SV+E+D   H L+  S+H FE      LK G  S    P 
Sbjct: 78  GAS----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEEA---LKEGYVSNYYIPQ 130

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           V VDP  RC  +L+YG ++++L   + G+    D+D    G   S+ +  S++INL+D+D
Sbjct: 131 VVVDPDNRCAVMLMYGSKLVVLPFRREGAA--EDQDGVLPGSSKSSFL-PSYIINLQDID 187

Query: 243 MK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
            K  ++ D  F+HGY EP + IL E   TW GRV+ +  TC I A+S++   + HP+IWS
Sbjct: 188 QKLINIIDIKFLHGYYEPTLFILFEPLRTWPGRVAVRKDTCCIVAISLNIEQRVHPVIWS 247

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
             NLP D  K + VP PIGGVLV   +++ Y +QS         Y VSL+   E
Sbjct: 248 LNNLPFDCIKAIPVPKPIGGVLVFAVDSLLYLNQSVP------PYGVSLNGLTE 295


>gi|296414526|ref|XP_002836950.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295632796|emb|CAZ81141.1| unnamed protein product [Tuber melanosporum]
          Length = 1468

 Score =  173 bits (438), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 234/1077 (21%), Positives = 412/1077 (38%), Gaps = 227/1077 (21%)

Query: 57   NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVL----------------MDGI 96
            N++V   ++++I+     E        ++K  G+  RR+L                    
Sbjct: 29   NVLVAKTSLLQIFTTTTYETELNSALADAKQPGDIDRRILDADEEQTFAADIALQRSQVE 88

Query: 97   SAASLELVCHYRLHGNV---ESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
            S   L LV  Y L G+V   + + +LS   GG       ++++ +F+DAK S++E+D   
Sbjct: 89   SVTKLVLVAEYPLSGSVTGLQRIKLLSTRSGG-------EAVLASFKDAKCSLMEWDPET 141

Query: 152  HGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGVLVYGLQMII 203
            + +   S+H +E          RE F   P+V          DP  RC  +   G  + I
Sbjct: 142  NSITTISLHYYE----------REEFC-SPVVSDGLPTELVADPGSRCAALRFSGDMLAI 190

Query: 204  LKASQ------------------------------------GGSGLVGDED--TFGSGGG 225
            +   Q                                    G   ++G+ D  T  +  G
Sbjct: 191  IPFRQREDEELSLGRGDADEVMGDEDGDNDDWDPEMAGTARGEDTIMGEGDVKTTDATEG 250

Query: 226  FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
                   S V+++  LD  + HV    F+H Y EP   IL+    TW G ++ +     I
Sbjct: 251  KDRPYHPSFVLSVSQLDDAISHVISLTFLHEYREPTFGILYSPRRTWTGLLAAEGRKDTI 310

Query: 284  SALSISTTLKQHP--LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
            S + I+  L+Q     I S   LP+D +K++ +  P GG L+VG N  IH      +  +
Sbjct: 311  SYIVITLDLEQKASTPILSVSGLPYDIFKVVPLAPPTGGSLLVGGNELIHVDQAGKTTGV 370

Query: 341  ALNNYAVSLDSSQELP-RSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
            A+N +         L  +S   +EL+ +    L+++    LL TK G+ V++    DGR 
Sbjct: 371  AVNPFCRRSTGFAGLADQSDLCLELEGSQVVELESEGGDMLLFTKRGEGVIVGFRMDGRN 430

Query: 398  VQRLDLSKTNP---SVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
            V  + ++K N    S++   ++T   +G    F+G   GD+ ++++           G+K
Sbjct: 431  VSGVKITKLNNHPGSIVGGRVSTAVGLGGRRLFVGCIEGDARVLKWRRKGERKKAGEGIK 490

Query: 452  EE----------FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
            EE          +G +E            SS     + NG          N+   +Q  +
Sbjct: 491  EEVLENEDEDDVYGALEDMDDDLYGGGGDSSFRKDSLTNGRR--------NSEAKSQGEY 542

Query: 502  SFAVRDSLVNIGPLKDFSYG------------------LRINADASATGISKQSNYELV- 542
             F   D L N+GP +D + G                  L +   +  +  S+ S   ++ 
Sbjct: 543  IFQTHDRLTNLGPFRDITLGKPTFPEESRERQKGVSPELELVTTSGPSNTSEDSGISIIR 602

Query: 543  -----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE-----YHAYLIISLEAR 586
                       + P C+ +WTV  +S+   NA        DD      +  +L ++    
Sbjct: 603  KSISPTIVGRFDFPQCQALWTVRARSANTSNAAVGLGGEEDDRSVEESFDRFLFVTKNDE 662

Query: 587  TMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
            + V    D   EV    D+  +G TI  G +    R++QV     R+ D       +   
Sbjct: 663  SQVFRVGDTFEEV-RGTDFESEGETIEVGVVGNGMRIVQVVSEQVRVYDCDLQLSQII-- 719

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
            P   E  +G E   V    + DPY+LL   DGS  +   D +   ++ +   AI+  K  
Sbjct: 720  PMFDEE-TGEEGPNVHRARVCDPYILLIKVDGSPAVYKMDSTNLELAEERADAIKFDKYQ 778

Query: 707  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ-GDIYSVVCYESGALEI 765
             S C     K                 G+   +D     P++   D    +    G L+I
Sbjct: 779  -SGCIYASTK-----------------GIFIPLD----APVENVKDYLLFLLTVEGGLQI 816

Query: 766  FDVPN-FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKV 824
            +D+ N    +F+ + F        +T       D+ T   ++ E+      K+ I  + V
Sbjct: 817  YDLSNPVTPLFSAESF--------NTLYPLLRTDNPTSPTANREK---HRSKQLIIEILV 865

Query: 825  VELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS--KSDDPVSTSRSLSVSNV 882
             ++      +    P+L A  ++  +  Y+ ++   P      KS +P   S  LS+S  
Sbjct: 866  ADMG----DSIFKEPYLIARSSNNDLTFYKPFISSSPSTLRFIKSPNPHIASNELSLSAG 921

Query: 883  SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRE 941
            + +  R L                        T   N++G+   FL G+ P + +   + 
Sbjct: 922  TKNIFRPL------------------------TAVYNLAGYSAVFLPGADPSFVIKTAKS 957

Query: 942  RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
              R+H +L    + + +  H+   + GF+YV S GI+++  +P+  T+D  W  +KV
Sbjct: 958  SPRIH-KLAGTGVRSLSSFHSAGADRGFVYVDSLGIVRVALMPAEFTFDGNWGYKKV 1013


>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Pongo abelii]
          Length = 1388

 Score =  173 bits (438), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 102/269 (37%), Positives = 152/269 (56%), Gaps = 29/269 (10%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T+ +   +      LEL   +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct: 82  ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK-----ASQGGSGLVGDEDTFGSGGGFSARIE 231
               P V+VDP GRC  +LVYG ++++L       ++   GLVG+        G  +   
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGE--------GQRSSFL 186

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
            S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++
Sbjct: 187 PSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLN 246

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            T K HP+IWS  +LP D  + LAVP PI
Sbjct: 247 ITQKVHPVIWSLTSLPFDCTQALAVPKPI 275



 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 101/347 (29%), Positives = 167/347 (48%), Gaps = 56/347 (16%)

Query: 379 LSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           +S K G++ +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL+++
Sbjct: 285 ISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKY 344

Query: 438 TCG----SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS-ASN 492
           T        +++  +  KEE    +    +T     +     QD V+  E+ +YGS A +
Sbjct: 345 TEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVD--EIEVYGSEAQS 402

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG----------------LRI------NADASA 530
            T+ A  T+SF V DS++NIGP  + + G                L I        + + 
Sbjct: 403 GTQLA--TYSFEVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGAL 460

Query: 531 TGISKQSNYELV---ELPGCKGIWTVY---------HKSSRGHNADSSRMAAYDD-EYHA 577
           + + K    ++V   ELPGC  +WTV          +    G   + S   A DD   H 
Sbjct: 461 SVLQKSIRPQVVTTFELPGCYDMWTVIAPLRKEEEDNPKGEGTEQEPSTPEADDDGRRHG 520

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
           +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G 
Sbjct: 521 FLILSREDSTMILQTGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG- 578

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
                L F P +         + ++  ++ADPYV++  ++G + + +
Sbjct: 579 --VNQLHFIPVDL-------GAPIVQCAVADPYVVIMSAEGHVTMFL 616



 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 68/251 (27%), Positives = 110/251 (43%), Gaps = 49/251 (19%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 731 WCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQGEARREEATR 786

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++        
Sbjct: 787 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLIYEAF----PHDSQ------- 829

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTRE-------------ETPHGAPCQ----RIT 915
               L   N+       +RF + P +   RE              T  GA  +    R  
Sbjct: 830 ----LGQGNL------KVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGAGARGRVARFR 879

Query: 916 IFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
            F++I G+ G F+ G  P W +V  R  LR+HP   DG + +F   HNVNC  GF+Y   
Sbjct: 880 YFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNR 939

Query: 975 QGILKICQLPS 985
           Q   ++   PS
Sbjct: 940 QEPQRLSGSPS 950


>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
           C-169]
          Length = 1137

 Score =  172 bits (436), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 148/477 (31%), Positives = 227/477 (47%), Gaps = 68/477 (14%)

Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMIS 284
             S  + +S+++ L  L +  V+D +F+H Y EPV+++LHE + +W G++     T  ++
Sbjct: 18  ALSTTVGNSYMLKLAKLGISEVRDAVFLHRYSEPVLLVLHETKPSWGGQLRNSKDTMEVT 77

Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
           A S++   K+H  +WS  NLP DA+KL+ VP   GG LV+  N + Y SQ A+ A A   
Sbjct: 78  AFSLNVAHKRHTRLWSIGNLPSDAFKLIEVPG--GGGLVICQNLLIYVSQEAAAAAASGA 135

Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
                          F ++L      WL ++  LL   +G L+L+ V  +G   +RL +S
Sbjct: 136 PRA----------EGFELDLTDCSGAWLADNSLLLGLASGQLILVNVQLEGS--KRLKVS 183

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
           K   +   S +  +G  L FLGS + +SLL++     G ++L  G +E+    EADA   
Sbjct: 184 KAQGAPPPSCMCRLGPELLFLGSWVANSLLIR-AVPEGQTLLLGGPEEQAS--EADATHA 240

Query: 465 KRLRRSSSDALQDMVN--GEELSLY-----GSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
            +  R   DA  D+ N   +E+SL        A  +T  A K +S  V DSLV+IG ++D
Sbjct: 241 SKRPRLDPDA-ADLGNEDEDEVSLIYRTDAQPALPSTTGASK-YSLQVVDSLVSIGIVQD 298

Query: 518 FSYGLRINADASATGISKQSN-------------------------YEL---VELPGCKG 549
              G   +  A    ++K                             EL   V LPG   
Sbjct: 299 LVTG-EASTSAPQEWVAKTERGPPKLLAAVGSDKFGAVAVLRSSLVPELVTEVPLPGVDQ 357

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES-VDYFVQ 608
           +W V H    G   D S +      YHA+L ++ ++ T VL T + L E   S VD+ + 
Sbjct: 358 MWAV-HFQPEGLPVDDSLL------YHAFLFLNEKSGTKVLRTGEELDETDSSQVDFILS 410

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
            RT+ AGNL G  R++QV  RG  +L GS   QDL       +   G  N+T+++ S
Sbjct: 411 SRTVFAGNLLGNSRIVQVHARGVVLLSGSSRVQDLPV-----QDLIGVSNTTIVAAS 462



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 52/169 (30%), Positives = 67/169 (39%), Gaps = 31/169 (18%)

Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
           PFL  +L DGT L Y+A  F  P                    V   RL     +  P  
Sbjct: 553 PFLLLLLADGTFLAYRA--FHTPRG-----------------RVCFKRLSLPAHAHCPPQ 593

Query: 899 AYTREETPHGAPCQRITIFKNISGHQ-----GFFLSGSRPCWCMVFRERLRVHPQLCDGS 953
               + T   AP   +T F  +   +     G F+SG RP W +  R  L  H    +G 
Sbjct: 594 DRRSKTT---APSSSMTRFDGLGESKEHVNSGMFVSGERPLWLVASRGTLVAHAMDVEGR 650

Query: 954 IVAFTVLHNVNCNHGFIYV----TSQGILKICQLPSGSTYDNYWPVQKV 998
           +   T  HN+NC  GFI           LKICQLP  +  D  WP+QK+
Sbjct: 651 VSGMTPFHNINCPLGFITACMAENDGETLKICQLPMRTRLDTPWPLQKI 699


>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1
           [Clonorchis sinensis]
          Length = 1741

 Score =  172 bits (435), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 141/482 (29%), Positives = 225/482 (46%), Gaps = 67/482 (13%)

Query: 129 RRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQ 188
           R DS++L+F +AK++V+ FD   + L+  S+H +E   + +LK GR  F+  P+++VDP 
Sbjct: 27  RLDSLLLSFTEAKVAVMGFDPVQYELKTLSLHNYE---FENLKSGRTHFSHLPILRVDPL 83

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDE--------DTFGSGGGFS------ARIESSH 234
            RC  VLVY   + +L   +  +   GD+        +T    G  S      A + ++ 
Sbjct: 84  QRCAVVLVYDRHLAVLPFRRSEALAAGDKYLAKPVTNNTARGAGSLSWERRATAPLLATF 143

Query: 235 VINLRD---LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
              L       + +V D  F++G+ EP +++L+E   TWAGRVS +  TC I ALS +  
Sbjct: 144 TTCLSSSTGEKINNVLDMQFLNGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQ 203

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---V 347
            + +P+IW   +LP+D   + +VP PIGGVL++  N+I Y  Q+  SC L LN YA    
Sbjct: 204 KRTNPVIWFQESLPYDCTYVHSVPEPIGGVLILATNSIIYMKQTLPSCGLPLNCYAQVTT 263

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLT--VVYDGRVVQRLDLSK 405
           +    Q++P+    + LD      + +   L+ T+TG + LL+  V +  + V  L L +
Sbjct: 264 NFPMRQDVPQCG-PLTLDGCRIVTMTDSQFLIVTRTGKMCLLSLWVEHTTQTVSSLLLHE 322

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS------------GTSMLSSGLKEE 453
              SV    +  +     F+GSRL DS+L+  T  +              +  +   + +
Sbjct: 323 IGCSVPPYSVALLDKGYVFVGSRLCDSVLLHLTASTMFVNTLGRIVDLDETTTADNFRTD 382

Query: 454 FGDIEADA---------PSTKRLRRSSSDALQD----MVNGE------ELSLYGSASNNT 494
              IE DA         P+ K     SS         +V+G       ++ LYG    N 
Sbjct: 383 IPMIERDAESIPVDKNNPTEKEAENVSSGTPSKPSGSIVHGPYVFDEVDVELYGDTILNP 442

Query: 495 ESAQK---TFSFAVRDSLVNIGPL-----KDFSYGLRINADASATG-ISKQSNYELVELP 545
            S  +   T+ F V D LVN GP+      +  Y    N D +    I+ Q+    VEL 
Sbjct: 443 PSDVRELNTYKFEVADRLVNFGPMGLLTSGEVPYLAPGNTDPTDEALIAAQAEMHHVELL 502

Query: 546 GC 547
            C
Sbjct: 503 AC 504



 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 65/272 (23%), Positives = 114/272 (41%), Gaps = 30/272 (11%)

Query: 748  DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
            D+   ++ + + +G LEI+ +P+F  ++ V  F      +VD     A + ++ E+N  +
Sbjct: 1007 DKSRYFAFIVFTNGVLEIYSLPDFTLLYEVHHFSDLPAMLVDC---RAGQGNKVEVNLEN 1063

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
                    ++NI    V+E+ +     +  RP L  + T   I  ++A         S +
Sbjct: 1064 IPNCPAAEEDNIPP-TVLEITVFPIGRNRDRPVLL-VRTSQEIAFFEALC------PSHN 1115

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET-PHGAPCQRITI--------FK 918
            +     S S S   +   R R L     PL A  R  T P  A  Q   +        F+
Sbjct: 1116 EAHPFASESWSQEGL---RWRRLPIP-CPLVAPRRVRTDPKIADVQSTMLTRKNLLRPFE 1171

Query: 919  NISGHQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
            +I GH G F+ G+ P W        +RV     DG + +F  L+   C  GF+Y T    
Sbjct: 1172 DIDGHCGVFVCGATPIWLFSSDTGHIRVFNHSIDGIMGSFAPLNTDICPSGFVYFTYSNE 1231

Query: 978  LKICQLPSGSTYDNY----W-PVQKVVFFLYF 1004
            +++  L  G ++  +    W P++   +FL +
Sbjct: 1232 MRLATLLPGYSFKEHLGMRWVPLELTPYFLQY 1263


>gi|171695066|ref|XP_001912457.1| hypothetical protein [Podospora anserina S mat+]
 gi|170947775|emb|CAP59938.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1441

 Score =  170 bits (430), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 238/1041 (22%), Positives = 402/1041 (38%), Gaps = 191/1041 (18%)

Query: 57  NLVVTAANVIEIYVVRV--------QEEGSKESKNSGETKRRVLMD--GISAA------- 99
           NLVV  +++++I+  ++        Q+     ++N+G  + R+  D  G+ A+       
Sbjct: 28  NLVVAKSSLLQIFRTKIVSTEIDASQQGSGARTRNAGRYESRLANDDDGLEASFLGGDSL 87

Query: 100 ----------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFD 148
                      L LV    L G +  LA +    + N R   D++++AF+DA++S++E+D
Sbjct: 88  AFKTDRTNNTKLVLVSEISLSGTITGLAKIK---SQNLRSGGDALLVAFKDARLSLVEWD 144

Query: 149 DSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII 203
              H L   S+H +E  E     W        +F     +  DP GRC  +   G+ + I
Sbjct: 145 AERHDLSTVSIHYYEQDELQGSPWAPPLSNFTNF-----LAADPGGRCAALKFGGMNLAI 199

Query: 204 LKASQG---------------GSGLVGDEDTFGSGGGF--SARIESSHVINLRDLD--MK 244
           L   Q                G   V  E    +GG          S V+ L +LD  + 
Sbjct: 200 LPFKQADEDIDMDDDWDEDLDGPRPVKQEAAVVNGGSSIKETPYSPSFVLRLSNLDPSLL 259

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
           H     F+H Y EP   IL    +  +  +  K H   +   ++    +    I S   L
Sbjct: 260 HPVHLAFLHEYREPTFGILAST-VNASNSLGRKDHLAYM-VFTLDLQQRASTTILSVPGL 317

Query: 305 PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
           P D +++  +P+P+GG L+VGAN  IH         +A+N       S     +S  ++ 
Sbjct: 318 PQDLFRVQPLPAPVGGALLVGANELIHIDQSGKPNGVAVNPLTKQCTSFGLSDQSDLNLR 377

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT-- 417
           L+      L  +  L+    G + L+T   DGR V  LD+    S+T  S++   ++T  
Sbjct: 378 LEECTIDVLSAEELLVILSDGRMALVTFRIDGRTVSGLDVKLLPSETGGSLIPGRVSTLS 437

Query: 418 -IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDAL 475
            IG S+ F GS  GDSL+  +T     S       ++ G DI+              D  
Sbjct: 438 RIGKSVMFAGSEEGDSLVFGWTKKQNQSGRKKSRLQDVGLDIDMADEEDLDEDEDEDDLY 497

Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ 529
            +    ++ ++  +ASN  E      +F + D L++I P++  +YG  ++A  S      
Sbjct: 498 AEEPTPKQQAV-ATASNVKEG---DLTFRIHDRLLSIAPIQSMTYGQPVDAPGSEEEQNS 553

Query: 530 -----------ATGISKQSNYELV------------ELPGCKGIWTVYHKS--SRGHNAD 564
                        G +K S   ++            E P  +G WTV  K    +    D
Sbjct: 554 AGVRSELQLVCGVGRNKSSAMAIMNLAIPPKVIGRFEFPEARGFWTVCAKKPVPKSLQGD 613

Query: 565 SSRMAAYDD-----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIA 613
               A  +D     +Y  ++I++       E   +   TA     +T +      G TI 
Sbjct: 614 KGPGAIGNDYGTSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLTGTEFDPAAGFTIE 673

Query: 614 AGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           AG +    R+IQV +   R  DG      +   P   E       +T  S SIADPY+L+
Sbjct: 674 AGTMGKDNRIIQVLKSEVRCYDGDLGLSQIV--PMMDEETGAEPRAT--SASIADPYLLI 729

Query: 674 GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
             +D S+ +          S+     +E  +K         DK         +T  WL+ 
Sbjct: 730 IRNDQSVFI---------ASIHDDNELEEVEK--------EDK-------TLATTKWLTG 765

Query: 734 GVGEAIDGADGGPLDQGD--------IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
            +    +G  G   + GD        I   +   SGAL I+ +P+               
Sbjct: 766 CLYTDTNGVFGE--ESGDKKAKLPESILMFLLSASGALYIYRLPDL-------------- 809

Query: 786 HIVDTYMREALKDSETEINS--SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA 843
                Y+ E L    T +++  ++ +GT    KE +  + V +L           P+L  
Sbjct: 810 -CKPVYVAEGLSYIPTGLSADYAARKGTA---KETVSEILVADLG----DTTAKSPYLIL 861

Query: 844 ILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE 903
              +  +  Y+ Y ++           +   ++L    +  S L     +++P +    E
Sbjct: 862 RHANDDLTMYEPYRYQLGAG-------LEFPKTLFFQKIPNSVL-----AKSPAEETDDE 909

Query: 904 ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
           E  H A C  +    NI G+   FL G  P + +   + +     L   ++ A +  H  
Sbjct: 910 EVTHQAKCLALRRCNNIGGYSTVFLPGPSPSFIIKSSKSMPKVLPLQGAAVTAISSFHTE 969

Query: 964 NCNHGFIYVTSQGILKICQLP 984
            C HGFIY  S  I+++ QLP
Sbjct: 970 GCEHGFIYADSHNIVRVSQLP 990


>gi|148886831|sp|Q7SEY2.2|CFT1_NEUCR RecName: Full=Protein cft-1; AltName: Full=Cleavage factor two
           protein 1
          Length = 1456

 Score =  168 bits (426), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 221/972 (22%), Positives = 388/972 (39%), Gaps = 144/972 (14%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96  DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + ++   + L+      L 
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      ++    ++              D   D + GEE 
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
           +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567

Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
              A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
           +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+ +
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSVFI 742

Query: 683 LVGDPSTCTVSVQTPA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG 741
               P    +        I +S K ++ C LY D          ++  +    VG+    
Sbjct: 743 AEMSPKLLELEEVEKEDQILTSTKWLAGC-LYTD----------TSGVFADETVGKGT-- 789

Query: 742 ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
                  + +I   +   SG L I+ +P+      V + +S        Y+   L     
Sbjct: 790 -------KDNILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPPGLS---- 830

Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
             + ++ +GT    KE++  + V +L        H  P+L     +  +  YQ Y  +  
Sbjct: 831 -ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-- 880

Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIF 917
              + +  P S S       +   ++ N  F++ P +    ++ PH A    P +R +  
Sbjct: 881 ---ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS-- 928

Query: 918 KNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
            NISG+   FL GS P + +   +       L    + A +  H   C HGFIY  + GI
Sbjct: 929 -NISGYSTVFLPGSSPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGI 987

Query: 978 LKICQLPSGSTY 989
            ++ Q+P+ S+Y
Sbjct: 988 ARVTQIPTDSSY 999


>gi|147864212|emb|CAN80950.1| hypothetical protein VITISV_016701 [Vitis vinifera]
          Length = 262

 Score =  166 bits (421), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 89/148 (60%), Positives = 105/148 (70%), Gaps = 26/148 (17%)

Query: 451 KEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLV 510
           +++ GDIE D PS KR RRSSSDALQDM N ++L LYG A N+TE++QKTFSF+V DSL+
Sbjct: 53  RKKVGDIEGDVPSAKRSRRSSSDALQDMFNSDKLPLYGLAPNSTETSQKTFSFSVSDSLI 112

Query: 511 NIGPLKDFSYGLRINADASATGISKQSNYEL--------------------------VEL 544
           N+GPLKDF+YGLRINAD  ATGI KQSNYEL                          VEL
Sbjct: 113 NVGPLKDFAYGLRINADLKATGIVKQSNYELMCCSGHGKNGALCILQQSIRPERITEVEL 172

Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYD 572
           PGCKGIWTVYHK++RGHNADS +M + D
Sbjct: 173 PGCKGIWTVYHKNTRGHNADSIKMVSAD 200


>gi|325189779|emb|CCA24259.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1911

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 152/568 (26%), Positives = 253/568 (44%), Gaps = 123/568 (21%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+LD+K  + DF F+ GY+EP ++ILHE    +  +GR +  + T  ++ LSI+  
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
            + HP+IW   NLP D ++++    PIGG L++  N I Y +Q+    + LN +A     
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501

Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
                    + +  + LP +S               S+ L   H  +L +   LLS    
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561

Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
            L +L +     RV    + + T   +L    S  +T         N   F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621

Query: 434 LVQF--------TCGSGTSMLS----SGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVN 480
           L              SGT ++S    SG     G D + +A   ++L+   S    D  +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681

Query: 481 GEELSLYG-----SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGLR 523
            ++  LYG      ++     +   FSF+              D +  IG +     G++
Sbjct: 682 EDDAFLYGPTLSQESTGGAMPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGVQ 741

Query: 524 INADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNA 563
            N+D++        + G SK  +  ++            EL GC+ +WTV   SS    +
Sbjct: 742 SNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPES 801

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
             +R       Y +YLI+S+  RTM+L T + +  + +   ++  G T+ A NLF +RR+
Sbjct: 802 QVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRRI 855

Query: 624 IQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSEN 658
           +QVF++G R++  + +                      TQ++ F   + ESG     + N
Sbjct: 856 VQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTAN 914

Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGD 686
             ++SV   DPY+LL ++DGSIRLL GD
Sbjct: 915 VGIVSVDTIDPYILLLLTDGSIRLLEGD 942



 Score = 53.1 bits (126), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 66/305 (21%), Positives = 123/305 (40%), Gaps = 64/305 (20%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
            +CY  G+L ++ VP+F  +            +++T   E+ + S    +T  +  S+ G 
Sbjct: 1085 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1144

Query: 811  ----------TGQGRKENIHSMK--VVELAMQRWSA----HHSRPF----LFAILTDGTI 850
                      T +GR    H++   V ++A+ R       H+S+ F    L   L +G +
Sbjct: 1145 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1204

Query: 851  LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
            + Y+      P   S  D    +   + V+    +R    ++  +   +A T +E   G+
Sbjct: 1205 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1260

Query: 910  P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
                                +T F N++ + G F  G+ P W +  + +     L +   
Sbjct: 1261 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1320

Query: 949  LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
              D +        +++FT  H+ NC +GF+Y  S G L++C+LPS          N + +
Sbjct: 1321 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1380

Query: 996  QKVVF 1000
            QKV F
Sbjct: 1381 QKVRF 1385


>gi|325187036|emb|CCA21579.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1912

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 153/569 (26%), Positives = 252/569 (44%), Gaps = 124/569 (21%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+LD+K  + DF F+ GY+EP ++ILHE    +  +GR +  + T  ++ LSI+  
Sbjct: 382 LLRLRELDIKGRIADFAFLDGYLEPTLMILHEENERIASSGRFAIGYDTMCLTVLSITLN 441

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA----- 346
            + HP+IW   NLP D ++++    PIGG L++  N I Y +Q+    + LN +A     
Sbjct: 442 SRLHPVIWCVKNLPADCFRIIPCKVPIGGALLLSTNAILYFNQTQFYGIKLNVFADKTVN 501

Query: 347 --------VSLDSSQELPRSS--------------FSVELDAAHATWLQNDVALLSTKTG 384
                    + +  + LP +S               S+ L   H  +L +   LLS    
Sbjct: 502 QSLFPCQDATYEVLEPLPDASEPPAQGRLAFIEKPLSILLYDCHYDYLGSSDILLSLPDD 561

Query: 385 DLVLLTVVY-DGRVVQRLDLSKTNPSVL---TSDITTIG-------NSLFFLGSRLGDSL 433
            L +L +     RV    + + T   +L    S  +T         N   F+GSR GDS+
Sbjct: 562 SLYVLKMPQTSNRVFSVEEYNHTGKFILRKVASPASTASCLLVNRENDSIFIGSRCGDSV 621

Query: 434 LVQF--------TCGSGTSMLS----SGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVN 480
           L              SGT ++S    SG     G D + +A   ++L+   S    D  +
Sbjct: 622 LYSAHRQKINARKTLSGTVVMSDGSISGTSNVRGADTDNEAALAEKLQAFGSTIALDATD 681

Query: 481 GEELSLYG------SASNNTESAQKTFSFA------------VRDSLVNIGPLKDFSYGL 522
            ++  LYG      S       +   FSF+              D +  IG +     G+
Sbjct: 682 EDDAFLYGPTLSQESTGGGKLPSSDCFSFSSMKQEDHSLHLQAIDFIPGIGQITSMDLGV 741

Query: 523 RINADAS--------ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHN 562
           + N+D++        + G SK  +  ++            EL GC+ +WTV   SS    
Sbjct: 742 QSNSDSNEQHEELVVSGGSSKDGSISVIHHGLRPIVSTAAELSGCRAMWTVVGMSSDVPE 801

Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
           +  +R       Y +YLI+S+  RTM+L T + +  + +   ++  G T+ A NLF +RR
Sbjct: 802 SQVTR------RYDSYLILSVAQRTMILRTGEEMEPLEDDSGFYTCGPTLCATNLFSQRR 855

Query: 623 VIQVFERGARILDGSYM----------------------TQDLSFGPSNSESGS---GSE 657
           ++QVF++G R++  + +                      TQ++ F   + ESG     + 
Sbjct: 856 IVQVFKQGVRVMQQASIPASEAKEDDEGTQDVPLTRLVCTQEIPFA-GDIESGGMNVDTA 914

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
           N  ++SV   DPY+LL ++DGSIRLL GD
Sbjct: 915 NVGIVSVDTIDPYILLLLTDGSIRLLEGD 943



 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 66/305 (21%), Positives = 123/305 (40%), Gaps = 64/305 (20%)

Query: 756  VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDS----ETEINSSSEEG- 810
            +CY  G+L ++ VP+F  +            +++T   E+ + S    +T  +  S+ G 
Sbjct: 1086 LCYGDGSLHVYSVPDFGKMGIFPYVTFAPKFLLNTMTPESRRASYGYGDTARHRISKGGP 1145

Query: 811  ----------TGQGRKENIHSMK--VVELAMQRWSA----HHSRPF----LFAILTDGTI 850
                      T +GR    H++   V ++A+ R       H+S+ F    L   L +G +
Sbjct: 1146 RLGFSAIPADTNEGRIRKAHAINSPVADIAIHRIGPSEGQHNSQLFSHMVLLVFLANGDL 1205

Query: 851  LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR-NLRFSRTPLDAYTREETPHGA 909
            + Y+      P   S  D    +   + V+    +R    ++  +   +A T +E   G+
Sbjct: 1206 IMYKLL----PSIPSPRDSKQPSFHFVRVNENLITRPNLPMKAIKDSGNAGTHDENSLGS 1261

Query: 910  P----------------CQRITIFKNISGHQGFFLSGSRPCWCMVFRER-----LRVHPQ 948
                                +T F N++ + G F  G+ P W +  + +     L +   
Sbjct: 1262 TEASTSAIIAKLRANFRYPMLTRFFNVNNNSGMFFRGAYPVWILPNQGQPVFVPLNIAAA 1321

Query: 949  LCDGS--------IVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-----NYWPV 995
              D +        +++FT  H+ NC +GF+Y  S G L++C+LPS          N + +
Sbjct: 1322 PSDPTRRTTFKVPVLSFTPFHHWNCPNGFVYFHSSGSLRVCELPSSQNSTLLPSGNGFVL 1381

Query: 996  QKVVF 1000
            QKV F
Sbjct: 1382 QKVRF 1386


>gi|336276223|ref|XP_003352865.1| hypothetical protein SMAC_04980 [Sordaria macrospora k-hell]
 gi|380092984|emb|CCC09221.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1486

 Score =  165 bits (417), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 233/979 (23%), Positives = 391/979 (39%), Gaps = 159/979 (16%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 95  DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVDRNT 154

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 155 LETISIHYYEKEELVGSPWVAPLHHYPTLLLADPASRCAALKFSERNLAILPFKQPDEDM 214

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 215 DMDNWDEELDGPRPKKDLSGAIANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 274

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 275 YRDPTIGVLSSTKTASNSLGHRDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 332

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + +S   + L+      L 
Sbjct: 333 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 392

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---IGNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T+   +G S  F
Sbjct: 393 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRVGRSTVF 452

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      K    DI+ D                        
Sbjct: 453 VGSEEGDSVLLGWTRRQGQTQKR---KSRIQDIDLDLDLDDEDLEDDD-----------D 498

Query: 485 SLYGSASNNTE---SAQKT-----FSFAVRDSLVNIGPLKDFSYGLRINADAS------- 529
            LYG  S + E   SA K       +F + D L++I P++  +YG  +    S       
Sbjct: 499 DLYGEESTSPEQAISAAKAVKSGELNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSE 558

Query: 530 ----------ATGISKQSNYELV------------ELPGCKGIWTVYHKS--SRGHNADS 565
                     A G  K S   ++            E P  +G WTV  K    +    D 
Sbjct: 559 GVRSDLQLVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDK 618

Query: 566 SRMAA-YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
             M+  YD   ++H ++I++       E   +   TA     +T +      G T+ AG 
Sbjct: 619 GPMSNDYDTSGQHHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGT 678

Query: 617 LFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           +    R++QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL  
Sbjct: 679 MGKDCRILQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIR 733

Query: 676 SDGSIRLLVGDPSTCTV-SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
            D S+ +    P    +  V+    + +  K ++ C LY D                +TG
Sbjct: 734 DDFSVFVAEMSPKLLELDEVEKEDQMLTGTKWLAGC-LYTD----------------TTG 776

Query: 735 VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMRE 794
           V  A + A  G  D  +I   +   SG L I+ +P+      V + +S        Y+  
Sbjct: 777 V-FADEAAGKGTKD--NILMFLLSTSGVLYIYRLPDLTKPVYVAEGLS--------YIPP 825

Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
            L       + ++ +GT    KE++  + V +L        H  P+L     +  +  YQ
Sbjct: 826 GLS-----ADYAARKGTA---KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQ 873

Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----P 910
            Y  +     + +  P S S       +   ++ N  F++ P +    ++  H A    P
Sbjct: 874 PYRVK-----ATAGQPFSKS-------LFFQKVPNSTFAKAPEEKPVEDDELHNAQRFLP 921

Query: 911 CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFI 970
            +R T   NISG+   FL GS P + +   +       L    + A +  H   C HGFI
Sbjct: 922 MRRCT---NISGYSTVFLPGSSPSFILKTAKSSPRVLGLQGSGVQAMSSFHTEGCEHGFI 978

Query: 971 YVTSQGILKICQLPSGSTY 989
           Y  + GI ++ Q+P+ S++
Sbjct: 979 YADTNGIARVTQIPTDSSF 997


>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
           scapularis]
 gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
           scapularis]
          Length = 1241

 Score =  165 bits (417), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 173/646 (26%), Positives = 268/646 (41%), Gaps = 113/646 (17%)

Query: 388 LLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
           +LT+  DG R V+  +  K   SVLT+ +T       FLGSRLG+SLL+ +T      M 
Sbjct: 198 VLTLFNDGMRSVRNFNFDKAAASVLTTSMTLCEEGYLFLGSRLGNSLLLHYT-EKAAEME 256

Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
            +G KE+               ++  D    +++ +EL +YGS +  T+    +++F V 
Sbjct: 257 EAGKKED---------------KAEGDVNVALIDPDELEVYGSETLATKQL-TSYTFEVC 300

Query: 507 DSLVNIGPLKDFSYG--------LRINAD-----ASATGISKQSNYELV----------- 542
           DSL+NIGP      G           N+D      +  G  K     ++           
Sbjct: 301 DSLINIGPCGKICMGEPAFLSEEFTQNSDPDLELVTTAGYGKNGALCVLQRSVRPQVVTT 360

Query: 543 -ELPGCKGIWTVYHKSSRGHNA------DSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
            ELPGC  +WTV    +           D     A     HA+LI+S    +M+L+T   
Sbjct: 361 FELPGCVHMWTVMGPPTEKKKKEASEESDEQAADATLTNTHAFLILSRADSSMILQTDQE 420

Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
           + E+  S  +  Q  T+ AGNL   R V+QV   G R+LDG+   Q +            
Sbjct: 421 INELDHS-GFSTQNPTVFAGNLGDGRYVLQVCPMGVRLLDGTRQLQHIPL---------- 469

Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLV--GDPST-CTVSVQTP--AAIESSKKPVSSC 710
              S ++  S+ADP+VL+    G +  L   GDP++ C ++V  P   A+ S +    +C
Sbjct: 470 DVGSPIVGGSLADPHVLIRSEGGLVVHLTLRGDPASGCRLAVLRPQLTAVVSHRANALTC 529

Query: 711 --------------TLYHD----KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
                          LY D    +  +  +R T+ +    +      +  +  P      
Sbjct: 530 HCIAVSGVLDDEDELLYGDSEDTRATKEPVRVTAMET--ESETANVFELKEVKP----TF 583

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGT 811
           +  V  E+G LEI+ +P++   F V  F  G+  +VD+    A   +++E ++  S E  
Sbjct: 584 WVFVARENGVLEIYSLPDYKLCFLVKNFPMGQRVLVDSVQMTAPSGTKSEKLSDMSHE-- 641

Query: 812 GQGRKENIHSMKVV-ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
                     M VV E+ M       SRP L A + D  +L Y+A+ F   +        
Sbjct: 642 ---------CMPVVHEILMVGLGVRQSRPLLLARV-DEDLLIYEAFPFYETQREGH---- 687

Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
                 L    ++   +   R  +T       EE    +    +  F +ISG+ G FL G
Sbjct: 688 ----LKLRFKKLNHDIILRSRKYKTQKPENEEEEKAFQSRLW-LQPFSDISGYSGVFLCG 742

Query: 931 SRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
            RP W  M  R  LR HP   DG +  F   HNVNC  GF++   Q
Sbjct: 743 HRPHWLFMSSRGELRYHPMFVDGPVYCFAPFHNVNCPKGFLHFNKQ 788



 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 5/95 (5%)

Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
           P+++VDP  RC  +LV+   + ++   +  +    +E   G   G    +   + + L +
Sbjct: 58  PMIRVDPCNRCAAMLVFSRTIAVVPFRKDTAA---EEQETGPTFGNKPPLLDWYPVALTE 114

Query: 241 LDMK--HVKDFIFVHGYIEPVMVILHERELTWAGR 273
           LD K  +V D  F+HGY EP ++IL+E   TW G+
Sbjct: 115 LDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGK 149



 Score = 40.0 bits (92), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 9/103 (8%)

Query: 236 INLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           + L +LD K  +V D  F+HGY EP ++IL+E   TW G V    +  M S  + +    
Sbjct: 158 VALTELDEKINNVIDMQFLHGYYEPTLLILYEPLRTWPGYVLTLFNDGMRSVRNFNFDKA 217

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
              ++ ++M L  + Y  L   S +G  L+     +HY  ++A
Sbjct: 218 AASVLTTSMTLCEEGYLFLG--SRLGNSLL-----LHYTEKAA 253


>gi|389740693|gb|EIM81883.1| hypothetical protein STEHIDRAFT_65512 [Stereum hirsutum FP-91666
           SS1]
          Length = 1438

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 233/1006 (23%), Positives = 416/1006 (41%), Gaps = 167/1006 (16%)

Query: 47  PSKRGIGPVPNLVVTAANVIEIYVVRV------------QEEGSKESKNSGETKRRVLMD 94
           P  +   P+ NLVV  +N++ I  VR             +E      K +   +  V MD
Sbjct: 31  PDSQKALPLFNLVVARSNLLRILEVREVPTLRPIHLDDERERRGNVRKGTEPVEGEVEMD 90

Query: 95  ---------GISAAS----------LELVCHYRLHGNV---ESLAILSQGGADNSRRRDS 132
                    G S AS             V  YRLHG V   E++ I+S          D 
Sbjct: 91  EQGEGYVNMGASTASNGAPRPTVLRFYFVRDYRLHGTVTGLETVRIMSS----LEDEMDR 146

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRC 191
           ++++F+DAKI++LE+    H L   S+H +E +P+ L L    +S      ++ DP  +C
Sbjct: 147 LLVSFKDAKIALLEWSTDTHSLSTVSIHTYERAPQLLSL----DSNMFTAQLRTDPLSQC 202

Query: 192 GGVLVYGLQMIILKASQGGSGL-VGDED-TFGSGGGFSARIESSHVINLR---DLDMKHV 246
             + +      IL   Q    L V D+D T      +S     S +++L    D  +++V
Sbjct: 203 AALSLPKDAFAILPFYQTQVDLDVMDQDQTRARDVPYSP----SFILDLAAEVDERIRNV 258

Query: 247 KDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPH 306
            DF+F+ G+  P + +L + + TW GR+     T  +   +++   + +P+I S   LP+
Sbjct: 259 VDFVFLPGFSHPTVAVLFQAQQTWTGRLKEYKDTMRLFIFTLNVVTRSYPIITSVEGLPY 318

Query: 307 DAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS---- 361
           D   ++  P+ +GGV+V+ +N+ IH    S   ALA+N +   +    ++P ++ +    
Sbjct: 319 DCLSVVPCPAALGGVVVLTSNSVIHIDQASRRVALAVNGW---MPRVSDMPVTALAQGDQ 375

Query: 362 --VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSD 414
             +EL+ +  T++ +    +  K G +  +    DG+VV +L +S      T PSV    
Sbjct: 376 GRLELEGSRMTFVDDKTLFIVLKDGTIHPVEFFVDGKVVSKLSISPPLAQTTTPSV---- 431

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           I  I N  FF+GS  G S L++           SG++E+  D + +    K    +  D+
Sbjct: 432 IRKITNEHFFVGSTAGPSALLKV----------SGVEEDIQD-DVEEIDGKTAPAAVVDS 480

Query: 475 LQDMVNGEELSLYGSA--------------SNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +  M   ++  LYGS+              + +T   +     ++ DSL   GP+ D ++
Sbjct: 481 VDGMDIDDDDDLYGSSKADPTPTANGNAVETTSTTRKRTVIHLSLCDSLPAHGPISDMTF 540

Query: 521 GLRINAD------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGH 561
            +  N D       +ATG      + L +  LP           G +G+W++  + +   
Sbjct: 541 SMTKNGDRAVPELVAATGSGLLGGFTLFQRDLPIRTKRKLHAIGGARGVWSLPVRQAVRV 600

Query: 562 NADSSRMAAYD-DEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQG-RTIAAGNL 617
           N  S +         +  +IIS +A     +   A   ++   ++   + G  T+ A   
Sbjct: 601 NGVSYQTPQNPLRSDNDTIIISTDATPSPGISRIATRSSKTDLNITTRIPGVTTVGAAPF 660

Query: 618 FGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
           F    ++ V     R+L  DGS         P     G+ +  + + + SI DPYV +  
Sbjct: 661 FQGTAILHVLSNAIRVLEPDGSERQ------PIKDMDGN-NYRAKIKNCSICDPYVFVLR 713

Query: 676 SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKGP-----EPWLRKTSTD 728
            D +I L +G+     +  +  +P   ++S+  ++ C      G         L  ++T 
Sbjct: 714 EDETIGLFIGETERGKIRRKDMSPMGDKTSRY-IAGCFFSDTTGTFQAHVNSSLNGSNTT 772

Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIV 788
              +T   +++  A      Q   + ++    G +EI+ +P    VF+     + +  +V
Sbjct: 773 KQNATSTLQSVMNA-----GQKTQWLLLVRPQGVMEIWTLPKLTLVFSTTALATLQPLLV 827

Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
           D+    AL          S     Q RK     + + ++ +        RP LF +L  G
Sbjct: 828 DSLDPPAL----------SSLPQDQPRKP--QELDIDQILVAPLGETSPRPHLFVLLRSG 875

Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--P 906
            +  Y+A  FE P     + DP   SR  S+  V   ++ +  F     D   +E++   
Sbjct: 876 QLAIYEAVSFELP-----TGDPEPASRP-SILPVKLVKVLSRAFDIQHPDEQPQEKSVLA 929

Query: 907 HGAPCQRITI-FKNISGHQ----GFFLSGSRPCWCM-VFRERLRVH 946
                QR+ I F      +    G F +G RPCW +   +  +RVH
Sbjct: 930 ELKKIQRLFIPFVTSPAPEKTFTGVFFTGDRPCWILGTDKGGIRVH 975


>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
 gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
           [Aspergillus oryzae 3.042]
          Length = 1389

 Score =  163 bits (412), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 221/947 (23%), Positives = 372/947 (39%), Gaps = 153/947 (16%)

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
           ++I+LAF +AK++++E+D   +G+   S+H +E  +        +  + G ++ VDP  R
Sbjct: 88  EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147

Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
           C  V  +G++ + IL   Q G  LV D+      +  GS G                 A 
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206

Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
             SS V+ L  LD  + H     F++ Y EP   IL+ +  T    +  +      +  +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
           +    +    + S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326

Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
             + S     +S  ++ L+      L   N   LL   TG++VL+    DGR V  + + 
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386

Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
              P           S    +G+   FLGS   DS+L+      G S+ SSG K+   + 
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440

Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
              E D+       +S  D  +D +     E+ + G   +        ++F   D L+NI
Sbjct: 441 RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500

Query: 513 GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
           GPLKD ++G    +          S  ELV                            +L
Sbjct: 501 GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560

Query: 545 PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
             C  +WT    +S  H  ++   +   A + E H Y+++S +A +   E +++     +
Sbjct: 561 ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613

Query: 602 SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
            +  F        +  TI  G L G+ RV+Q+     R  DG     DL      P   E
Sbjct: 614 ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668

Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
               SE    +S S+ DPYV +   D ++ LL  D S     V+    I +SK   +SC 
Sbjct: 669 --DTSEERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724

Query: 712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF 771
           LY DK                TG+  +I  A    L Q  +   +  +   L I+ +P+ 
Sbjct: 725 LYFDK----------------TGIFSSI-SATSDELAQNSMTLFLMTQDCRLFIYRLPDQ 767

Query: 772 NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR 831
             +      + G   +      E  K S T              +E +  + V +L    
Sbjct: 768 KLL----AIIEGVDCLPPVLSSEPPKRSTT--------------REVLTEIVVADLG-DS 808

Query: 832 WSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLR 891
           WS   S P+L        +  Y+ ++      T    +P +    L  +N+   R+    
Sbjct: 809 WS---SFPYLIIRSRHDDLAVYRPFI----SITKSVGEPHADLNFLKETNLVLPRI---- 857

Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
            +    D  + EE     P   + I  NISG    F  G  P + +           L  
Sbjct: 858 -TSGVEDQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKG 913

Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           G   + +      C  GFI + S+G++ +CQ+P G   D  W +Q++
Sbjct: 914 GYAQSLSKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQI 960


>gi|336388105|gb|EGO29249.1| hypothetical protein SERLADRAFT_445076 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 1424

 Score =  163 bits (412), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 204/968 (21%), Positives = 383/968 (39%), Gaps = 149/968 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
           N+VV  +NV+ I+ VR +E     ++   E  RR             V MDG        
Sbjct: 47  NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105

Query: 96  ----------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKI 142
                      + +    V  + LHG V   E++ I+S     N    D ++++F+DAKI
Sbjct: 106 GTVKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFKDAKI 161

Query: 143 SVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQM 201
           ++LE+ D IH L   S+H +E +P+ + L    +S      ++VDP  RC  + +    +
Sbjct: 162 ALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLPKDAI 217

Query: 202 IILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIE 257
            IL   Q  + L V ++D              S +++L    D +++HV DF+F+ G+  
Sbjct: 218 AILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLPGFNN 274

Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
           P + +L + E TW+GR+     T  +   ++      +P+I +   LP D   L+   + 
Sbjct: 275 PTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVPCVAS 334

Query: 318 IGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAAHATW 371
           +GGV+++ +NTI Y    S   AL +N ++ S  S   +P      +S ++ L+  HA  
Sbjct: 335 LGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGCHAVL 393

Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLG 430
           + +    +  K G +  + +V DG+ V +L ++     + + S +  I     FLGS +G
Sbjct: 394 VDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLGSIVG 453

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
            S+L++             L      ++A           +  ++  + +         +
Sbjct: 454 ASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH---------S 504

Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG------------ 532
           +N     +     ++ DSL   GP+ D ++ L    D       +ATG            
Sbjct: 505 NNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFTLFQR 564

Query: 533 -ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM-- 588
            +  ++  +L  + G +GIW+    +  R +     R     +  +  +IIS +A     
Sbjct: 565 DLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDANPSPG 624

Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLSFGP 647
           V   A   ++   ++   + G TI AG+ F R  ++ V     R+L+ G  + +DL    
Sbjct: 625 VSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDLD--- 681

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIESSKK 705
                        + + SI DP+VL+   D +I L +G+     +  +  +P   +SS+ 
Sbjct: 682 ------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDKSSRY 735

Query: 706 PV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
                   +SC         P       D  +++ +   ++       +    + ++   
Sbjct: 736 LAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLILVRP 789

Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
            G +EI+ +P     F+          + D+Y   AL   +              RK N 
Sbjct: 790 QGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPRKSN- 837

Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
             + V ++ +         P+L   L  G I+ Y+A     P +            S+  
Sbjct: 838 -DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------SIPP 884

Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLS 929
           S VS  +++ ++ +    +    EET      ++  I +            S   G F +
Sbjct: 885 SRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSGVFFT 944

Query: 930 GSRPCWCM 937
           G RP W +
Sbjct: 945 GDRPSWIV 952


>gi|393220097|gb|EJD05583.1| cleavage factor protein [Fomitiporia mediterranea MF3/22]
          Length = 1450

 Score =  162 bits (411), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 225/962 (23%), Positives = 390/962 (40%), Gaps = 157/962 (16%)

Query: 76  EGSKESKNSGETKRRVLMDGISAASLEL--VCHYRLHGNV---ESLAILSQGGADNSRRR 130
           EG  E    GE    +    +S  + +   +  +RLHG V   E + ILS          
Sbjct: 89  EGEVEMDTQGEGFVNMASKPLSMTTYQFHFIREHRLHGIVTGLEPVKILSS----TEDSL 144

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQG 189
           D ++++F+DAK+++LE+   +H L   S+H +E +P+   L   + +   G L +VDP  
Sbjct: 145 DRLLVSFKDAKLALLEWSPELHDLVTVSIHTYERAPQMTFLDPSKFT---GQL-RVDPLS 200

Query: 190 RCGGVLVYG--LQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
           RC  + +    L ++    SQ    LV  + T      +S       + N  D  +++V 
Sbjct: 201 RCAALSLPCDCLAILPFYHSQVDLDLVDADQTVSRDIPYSPSF-ILDLFNQVDHRIRNVI 259

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHD 307
           DF F+ G+  P + +L + + TW GR+     TC +   ++      +P+I S  NLPHD
Sbjct: 260 DFAFLPGFNNPTLAVLFQTQHTWTGRLKEFKDTCNLFIFTLDLVTHMYPIITSVENLPHD 319

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQ-SASCALALNNYA--VSLDSSQELPRSSFS--V 362
            + +L   S +GGV+++  N++ Y  Q S    L +N +A  VS    Q+L     +  +
Sbjct: 320 CFAMLPCDSSLGGVVIISCNSLIYVDQASRKTVLPVNGWAARVSDMPMQQLRPEEMNRDL 379

Query: 363 ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI 418
            L+ AHAT++ +    + T+ G ++ + +V DGR   RL L    S+T    L  ++   
Sbjct: 380 HLEGAHATFVDSRTFFIITRDGLVLPVEIVMDGRTALRLALHPAMSQTTTPALVRNVAFR 439

Query: 419 GNS--------LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEF------GDIEAD-APS 463
             S        + F+GS +G S+L++ T           ++EE       GDI A  A +
Sbjct: 440 SASGDQAPRSQILFVGSTVGPSVLLRVTW----------VEEEIQKDKQQGDIPAAVADN 489

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS---FAVRDSLVNIGPLKDFSY 520
              +     D +   V  E  + +G  +  +++A +T S    ++ DSL   GP+   ++
Sbjct: 490 PMAVDFDDEDDIYGDVAKETQTTHGQPTAASQAAVETKSVIHLSLCDSLSAYGPINSMAF 549

Query: 521 GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTVYHKSSRGH 561
            L  N D       +ATG ++   + L +             + G +GIW +  + S   
Sbjct: 550 ALTRNGDRPTAELVAATGYARLGGFTLFQRDVPTRSKRKLHAVGGARGIWCIPVRQSLKV 609

Query: 562 NAD--SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TIAAG 615
           N    S  +     E    +I+S +A      T       +   D  +  R    TI A 
Sbjct: 610 NGSERSRNLLPGSSEVVDTVIVSTDANPSPGLTR--FAAKSSRNDIAITARRTETTIGAA 667

Query: 616 NLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGM 675
             F R  +I V     R+L+      D S      +    ++   +    I+DP++L+  
Sbjct: 668 PFFQRTAIIHVTTDLIRVLE-----PDCSERQCIRDMDGSNKRPKIRFCCISDPFILVIR 722

Query: 676 SDGSIRLLVGDPSTCTVSVQ--TPAAIESSKKPVSSCTLYHDKG------PEPWLRKTST 727
            D S+ L VGD     +  +  TP   E   +  + C      G       E      ++
Sbjct: 723 EDESLGLFVGDAERGRIRRKDMTPMG-EKVSRYSAGCFFLDQSGIFELHMSESSPTTGTS 781

Query: 728 DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
           D     G G      D     +G  + V+C   G +EI+ +P    VF+        + +
Sbjct: 782 DDKQRMGTGSLESAVDA---QRGTQWLVLCRPQGVVEIWTLPKLALVFSTSSLKDLPSVV 838

Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
            D++   AL        S  E+   + ++ +I  ++  ++        +  P L  +L  
Sbjct: 839 SDSFDPPAL--------SLPEDPPRKPQEADIELLQFAQIG-----ELYPHPHLIVMLRC 885

Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSV----------------------SNVSAS 885
           G +  YQA   +      K D P ST R+ ++                      S+V A 
Sbjct: 886 GQLAIYQAVAVD------KDDFPESTVRTSTLKIKFIKMGTRSFEPRQLEPAEKSSVIAE 939

Query: 886 RLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLR 944
           + R LR S  P       E             K +S   G F++G  PCW +   ++ L+
Sbjct: 940 QRRALR-SLVPFIVSPNSE-------------KRVS---GVFVTGDEPCWIVATDKDGLK 982

Query: 945 VH 946
           +H
Sbjct: 983 IH 984


>gi|336375160|gb|EGO03496.1| hypothetical protein SERLA73DRAFT_165174 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 1428

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 204/972 (20%), Positives = 383/972 (39%), Gaps = 153/972 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDG-------- 95
           N+VV  +NV+ I+ VR +E     ++   E  RR             V MDG        
Sbjct: 47  NVVVARSNVLRIFEVR-EERPPMSTQTEDERDRRSHVRKGTEAVEGEVEMDGQGEGYVNM 105

Query: 96  --------------ISAASLELVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFE 138
                          + +    V  + LHG V   E++ I+S     N    D ++++F+
Sbjct: 106 GTVKSTGKKGAVHLPTVSRFYFVREHMLHGTVTGLETVRIMSS----NDDNLDRLLVSFK 161

Query: 139 DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
           DAKI++LE+ D IH L   S+H +E +P+ + L    +S      ++VDP  RC  + + 
Sbjct: 162 DAKIALLEWSDDIHDLITVSIHTYERAPQLMAL----DSSLFHTKLRVDPSSRCAALSLP 217

Query: 198 GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
              + IL   Q  + L V ++D              S +++L    D +++HV DF+F+ 
Sbjct: 218 KDAIAILPFFQSQAELDVMEQD---QNQARDVPYSPSFILDLASDVDENIRHVIDFVFLP 274

Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
           G+  P + +L + E TW+GR+     T  +   ++      +P+I +   LP D   L+ 
Sbjct: 275 GFNNPTIAVLFQTEQTWSGRLKEFKDTAKLIIFTLDLLSHTYPVITAVDGLPFDCISLVP 334

Query: 314 VPSPIGGVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELP-----RSSFSVELDAA 367
             + +GGV+++ +NTI Y    S   AL +N ++ S  S   +P      +S ++ L+  
Sbjct: 335 CVASLGGVVIMSSNTIIYVDPASRRVALPVNGWS-SRVSDMPMPALSGDEASRNISLEGC 393

Query: 368 HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLG 426
           HA  + +    +  K G +  + +V DG+ V +L ++     + + S +  I     FLG
Sbjct: 394 HAVLVDDRTMFVFLKDGTVYPVELVADGKTVSKLSMAPALAQTTIPSMVRKINEDHLFLG 453

Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
           S +G S+L++             L      ++A           +  ++  + +      
Sbjct: 454 SIVGASVLLKTVRVEEEVEDEEKLPAHAAVVDAPTTMDLDDDDDTMPSMNGVTH------ 507

Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------- 532
              ++N     +     ++ DSL   GP+ D ++ L    D       +ATG        
Sbjct: 508 ---SNNIIHRTRSVVHLSLCDSLPAYGPISDVTFSLAKLGDRYVPELVAATGSGFLGGFT 564

Query: 533 -----ISKQSNYELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR 586
                +  ++  +L  + G +GIW+    +  R +     R     +  +  +IIS +A 
Sbjct: 565 LFQRDLPSRTKRKLHAIGGARGIWSFPVRQQVRVNGLSYERPVNSFESENDTVIISTDAN 624

Query: 587 TM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDL 643
               V   A   ++   ++   + G TI AG+ F R  ++ V     R+L+ G  + +DL
Sbjct: 625 PSPGVSRIATRTSKSDIAIPTRIPGTTIGAGSFFQRTAILHVMTNAIRVLESGKQIIKDL 684

Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQ--TPAAIE 701
                            + + SI DP+VL+   D +I L +G+     +  +  +P   +
Sbjct: 685 D---------GNIPRPRIKACSICDPFVLIIREDDTIGLFIGEAERGKIRRKDMSPMGDK 735

Query: 702 SSKKPV------SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSV 755
           SS+         +SC         P       D  +++ +   ++       +    + +
Sbjct: 736 SSRYLAGCFFTDNSCIFETHANDLPSSASNGVDKNVTSTMQAVVNS------NSRSQWLI 789

Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGR 815
           +    G +EI+ +P     F+          + D+Y   AL   +              R
Sbjct: 790 LVRPQGVMEIWTLPKLTLAFSTSSLAMLEHILSDSYDTPALSPPQ-----------DHPR 838

Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
           K N   + V ++ +         P+L   L  G I+ Y+A     P +            
Sbjct: 839 KSN--DLDVEQIILAPLGETAPLPYLLVFLRSGQIVIYEAVPTPAPAD------------ 884

Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNI----------SGHQG 925
           S+  S VS  +++ ++ +    +    EET      ++  I +            S   G
Sbjct: 885 SIPPSRVSVLKVKFIKTATKIFELPKHEETEKSILAEQKRISRQFVPFVTSPTPGSVLSG 944

Query: 926 FFLSGSRPCWCM 937
            F +G RP W +
Sbjct: 945 VFFTGDRPSWIV 956


>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
 gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
          Length = 1396

 Score =  160 bits (404), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 211/970 (21%), Positives = 378/970 (38%), Gaps = 172/970 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGIS------------------- 97
           N+V    N + IY VR +   SK    +   K+   M+G+                    
Sbjct: 38  NVVTARGNTLSIYEVREETATSKSPTEAKSQKKDDAMEGVKEERQTPVVQVRSLSKKTYP 97

Query: 98  ---------AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFD 148
                    +    LV  +RLHG V  L  +    +      D ++++F+DAKI++LE+ 
Sbjct: 98  DSDSHSQPLSTKFHLVREHRLHGVVTGLQAVKIISSLEDHL-DRLLVSFKDAKIALLEWS 156

Query: 149 DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
            +   L   S+H +E    + +     +F     ++VDPQ RC  + +      IL   Q
Sbjct: 157 TATQDLLTVSIHTYERAIQM-VATDISAFTSE--LRVDPQSRCAALSLPKDAFAILPPCQ 213

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHE 265
               +  D                S ++NL    +  +++V DF F+ G+  P + +L+E
Sbjct: 214 VSDSVCRD-----------VPYSPSFILNLPSEVESGIRNVIDFTFLPGFSNPTVAVLYE 262

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
              TW GR++ +  T  ++  ++    +++P+I  A  LP D   +LA PS  GGV+VV 
Sbjct: 263 TYQTWTGRLNEQKDTVKMAFFTLDIVNRRYPVIGLATGLPCDCLSVLACPS-TGGVMVVA 321

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELP------RSSFSVELDAAHATWLQNDVALL 379
           +N+I Y  QS    +   N  +   S   LP        + ++EL+ + + ++ +  A +
Sbjct: 322 SNSIIYVDQSGRKVVLPVNAWIPRMSDIALPTNLTPEEQARTLELEGSRSIFVDDKTAFI 381

Query: 380 STKTGDLVLLTVVYDGRVVQRLDL-----SKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
             K G +  + +V  GRVV +L L       T PS+L      I N    +GS  GDS  
Sbjct: 382 ILKDGTIYPVELVTAGRVVSKLALGTPLAKTTIPSILRR----INNDYLLVGSASGDS-- 435

Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL--QDM-VNGEELSLYGSAS 491
                    ++LS+   EE  D + D  +      +S  AL  QD+ ++ ++  +YG + 
Sbjct: 436 ---------ALLSTSWVEEVIDDDVDMEAN-----TSVAALEQQDIEMDDDDDDIYGPSI 481

Query: 492 NNTESAQK------------TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGI 533
             T ++QK                +  D+L   GP+ D ++ +  N D       +ATG 
Sbjct: 482 IKTGTSQKESAAPMSKKTRSVLRLSFCDALPAYGPIADLTFTVGKNGDRPVAELVTATGS 541

Query: 534 SKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
                + L +  LP           G +G+W++  + S      S+ +A +D      LI
Sbjct: 542 GHLGGFTLFQKDLPLRKKKKLPIISGARGVWSLPIRRS-----SSAAVAEHDT-----LI 591

Query: 581 ISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
           IS +A       R  V  T   L+ V+      V G TI AG  F R  ++ V     R+
Sbjct: 592 ISTDANPSPGFSRLAVRATKGDLSVVSR-----VNGMTIGAGPFFQRTAILHVMTNAIRV 646

Query: 634 L--DGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPST 689
           L  DG+   + +D+               + + S SI DPYVL+   D +I L +G+ + 
Sbjct: 647 LEPDGNERQIIKDME---------GNVPRAKIKSCSICDPYVLIFREDDTIGLFIGETTR 697

Query: 690 CTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG--EAIDGADGGPL 747
             +  +  + +       ++   + D      +   + DA   T +     +D +     
Sbjct: 698 GKIRRKDMSPMGEKSSRYTAGGFFTDTASVFRVYHQNADANTETTIPMHSVVDASSKSQ- 756

Query: 748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
                + V+    G +EI+ +P    VF+     + +  + D+    AL   +       
Sbjct: 757 -----WLVLVRPQGVVEIWTLPKLTLVFSTTLLATLQNVLTDSQEPPALSPPQDPPRKPQ 811

Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
           E             + + ++ +        +P L  +L  G +  Y+A+           
Sbjct: 812 E-------------LDIEQILLTNLGQSDPKPHLLVLLRSGHLAIYEAFATNQAPIVEPP 858

Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
             P ++S  +    +++      R   T       ++       +    F       G F
Sbjct: 859 LKPRASSLQIQFVKIASKAFEMQRTDETEKGILAEQK----KALRTFVPFACAGAPAGVF 914

Query: 928 LSGSRPCWCM 937
            +G RP W +
Sbjct: 915 FTGDRPHWIV 924


>gi|348679545|gb|EGZ19361.1| putative cleavage and polyadenylation specificity factor CPSF
           [Phytophthora sojae]
          Length = 1752

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 162/634 (25%), Positives = 260/634 (41%), Gaps = 155/634 (24%)

Query: 235 VINLRDLD-MKHVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+L+ M  V D  F+ GY+EP +++LHE   + +  GR++    T  I+ +SI+  
Sbjct: 277 LLRLRELEIMGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 336

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL-- 349
            + HP IW+  NLP D +KL    +P+GGV+V+ AN   Y +Q+    LA N +A     
Sbjct: 337 TRLHPKIWTVKNLPSDCFKLFPCRAPLGGVVVLSANAFLYFNQTQFHGLATNVFASKTVN 396

Query: 350 -------DSSQELP---RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
                  D+  E P    +   + L      +L     LL+   GD  +L++ Y+    +
Sbjct: 397 QSVFPLSDAVYETPDHEMAQLHIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDTSSR 456

Query: 400 RL----DLSKTNPSVLTSDITTIG-----------NSLFFLGSRLGDSLLVQFTCGSGTS 444
            L      S +  + L+  +   G               F+GSR GDS+L        TS
Sbjct: 457 GLYGFGGASSSRNASLSLRMLRSGIQAHCLCVNEEKKTLFVGSRSGDSVLYALDQKKLTS 516

Query: 445 MLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA--------SN 492
                 K    EE    E            +  A ++  + ++L LYG+A        S 
Sbjct: 517 AGGEASKQQEDEEMLIKEEVVKEEVTAEVKAEPAEEEEEDEDDLFLYGAAPTKEEPTTSG 576

Query: 493 NTESAQKTFSFAVR----------------------DSLVNIGPLKDFSYGLRINADASA 530
           +TE+   T   AV+                      D L +IG +     G+  NAD   
Sbjct: 577 STEAVNGTNGSAVKKEENGHAVEEESGPYDYVLHQIDVLPSIGQITSIELGIENNAD--- 633

Query: 531 TGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNAD 564
              S +   ELV                          EL GC+ +WTV         + 
Sbjct: 634 ---SNEKREELVISGGYERSGAISVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSS 690

Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
             R       Y+AYLI+S+  RTMVL T + +  + +   ++  G T+AA NLF ++R++
Sbjct: 691 DGR------SYNAYLILSVAHRTMVLRTGEGMEPLEDDSGFYTSGPTLAAANLFNKQRIV 744

Query: 625 QVFERGARIL------------DGS----------------------YMTQDLSFGPSNS 650
           Q+F++GAR++            DG+                        TQ+++      
Sbjct: 745 QIFKQGARVMMEVPDEETSNGNDGAEKTAKPEDEEVDDEDDGPKVKLVCTQEITLEGDVE 804

Query: 651 ESGSGSENSTV--LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP----------- 697
             G   + +TV  +SV + DPY+LL ++DGS+RLL+GD     ++V  P           
Sbjct: 805 CGGMNVDTATVGIVSVDVVDPYILLLLTDGSVRLLMGDEEDMELTVIDPEIDYLDGVTES 864

Query: 698 -AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
               ++SK   SS  L++D     W      +AW
Sbjct: 865 NGTADASKHGSSSACLFYD-----WAGMFRENAW 893



 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 28/81 (34%), Positives = 45/81 (55%), Gaps = 7/81 (8%)

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGS------IVAFTVLHNVNCNH 967
            +T F N++   G F  G+ P W +  R +    P +C  +      +++FT  H+ NC +
Sbjct: 1159 LTTFYNVNNMSGAFFRGAHPMWILGDRGQPTFIP-MCSAAPKVSVPVLSFTPFHHWNCPN 1217

Query: 968  GFIYVTSQGILKICQLPSGST 988
            GFIY  S+G L++C+LPS  T
Sbjct: 1218 GFIYFHSRGALRVCELPSSKT 1238


>gi|449543656|gb|EMD34631.1| hypothetical protein CERSUDRAFT_116804 [Ceriporiopsis subvermispora
           B]
          Length = 1440

 Score =  156 bits (395), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 218/990 (22%), Positives = 403/990 (40%), Gaps = 178/990 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRR-------------VLMDGISAASLE- 102
           N+VV  ++++ I+ VR +E     S+   E +RR             V MDG     L  
Sbjct: 49  NVVVARSSLLRIFEVR-EEPAPISSQKEDERERRASVRKGTEAVEGEVEMDGSGEGFLNM 107

Query: 103 ---------------------LVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFE 138
                                L+  +RLHG    +E + I++        R D ++++F+
Sbjct: 108 GSVKSTAQNGSVQPPTINRFYLIREHRLHGIVTGIEGVRIVTS----LEDRLDRLLVSFK 163

Query: 139 DAKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
           DAKI++LE+ D++H L   S+H +E +P+ + L    +S    P ++ DP  RC  +L+ 
Sbjct: 164 DAKIALLEWSDAVHDLVTVSIHTYERAPQLMAL----DSSLFRPTLRADPLSRCAALLLP 219

Query: 198 GLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVH 253
              + IL   Q  + L V ++DT             S +++L    D  +++V DF+F+ 
Sbjct: 220 RDSIAILPFYQSQAELDVVEQDT---SQLRDVPYSPSFIVDLSAEVDDRIRNVIDFVFLP 276

Query: 254 GYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
           G+  P + +L +++ TW GR+     T  +   ++    + +P+I S   LP+D + + A
Sbjct: 277 GFNNPTIAVLFQKQQTWTGRLREYKDTVSLYIFTLDLVTRNYPVITSTEGLPYDCFAVAA 336

Query: 314 VPSPIGGVLVVGANTIHYHSQSA-SCALALNNY-------AVSLDSSQELPRSSFSVELD 365
             + +GGV+++ +N I Y  QS+   AL +N +        V   S+QE  R    + L+
Sbjct: 337 CSTALGGVVILASNAIIYVDQSSRRVALPVNGWPPRVSDMPVQALSAQEQLR---DLRLE 393

Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGN 420
            +H  ++ +    +  K G +  + +V DG+ V +L +S      T P+V    +  +  
Sbjct: 394 GSHFVFVDDRTLFIILKDGTVYPVELVLDGKSVSKLTMSSAVARTTIPTV----VRRVQT 449

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
              F+GS +G S+L++      T+ +   + +E  D+E     T     +  D+   M  
Sbjct: 450 DHLFIGSTVGPSVLLK------TARVEEDIADE--DVEMSVAPT-----AVVDSTDTMDL 496

Query: 481 GEELSLYGSASNNTE------------SAQKT-FSFAVRDSLVNIGPLKDFSYGLRINAD 527
            +E  LYGS    T             S ++T    ++ DSL   GP+ D ++ L  N D
Sbjct: 497 DDEDDLYGSTKETTHRVDGLVNGAADASKKRTVVHLSLCDSLPAHGPIADMTFALAKNGD 556

Query: 528 ------ASATGISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRM 568
                  +ATG      + L +  LP           G +G+W++  + +   N  +   
Sbjct: 557 RAVPELVAATGSGTLGGFTLFQRDLPTRVKRKLHAIGGGRGMWSLPVRQAVKVNGSTYEK 616

Query: 569 AAYDDEYHAY---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
            A  + +H+    +IIS +A     +   A        ++   + G TI A   F    +
Sbjct: 617 PA--NPFHSVNDSVIISTDANPSPGLSRIASRNQNGDITITTRIPGTTIGAAPFFQGTAI 674

Query: 624 IQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
           + V      ++    +  D S      +         + + SI DP+VL+   D +I L 
Sbjct: 675 LHVMYNVTNVI--RVLEPDGSERQIIKDVDGNVARPKIRACSICDPFVLIIREDDTIGLF 732

Query: 684 VGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGP-EPWLRKTSTDAWLSTGVGEAIDG 741
           +G+P    +  +  + + + + + ++ C      G  +  L   +     +T   ++   
Sbjct: 733 IGEPERGKIRRKDMSPMGDKTSRYLTGCFFTDTTGTFQTHLNPLAAGTEAATSTLQS--A 790

Query: 742 ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSET 801
            + G   Q   + ++C   G LEI+ +      F+     S  + +VDTY    L     
Sbjct: 791 INAGSRSQ---WLILCRPQGTLEIWTLSKLTLAFSTTLIPSLESVVVDTYDVPHL----- 842

Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
              S  ++   + ++ +I  + V  L          RP+L   L  G +  Y+      P
Sbjct: 843 ---SLPQDPPRKPQELDIEQIVVAPLG-----ESSPRPYLTVFLRSGQLAVYETIPVAPP 894

Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
                  DP+  SRS ++          +RF +    A+  ++         +   K IS
Sbjct: 895 A------DPLPNSRSCTIL---------VRFRKVLSKAFDIQQQNEEVEKSVLAEQKRIS 939

Query: 922 --------------GHQGFFLSGSRPCWCM 937
                            G F +G RPCW +
Sbjct: 940 RLLIPFVTSPNPGQTLSGVFFTGDRPCWIL 969


>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
           [Schistosoma mansoni]
          Length = 1825

 Score =  155 bits (393), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 146/586 (24%), Positives = 252/586 (43%), Gaps = 123/586 (20%)

Query: 4   AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
           A +K +  PT + NC    +TH +                           + NLV+T  
Sbjct: 15  AVFKHISPPTAVDNCLYCHLTHPK---------------------------LKNLVITRG 47

Query: 64  NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
             IEIY V+        S  SGET+               V    ++ N+  +  +   G
Sbjct: 48  GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 87

Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
                   S++L+F +AK++V+ F+     LR  S+H +E   + +LK GR +F + P++
Sbjct: 88  DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 140

Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
           ++DP  RC  +LVY   + +L   +    +  + D    +  +   +  R  +  +    
Sbjct: 141 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 200

Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
                     + +V D  F++G+ EP +++L+E   TWAGRVS +  TC I ALS +   
Sbjct: 201 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 260

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
           + +P+IW   +LP D   +++VP PIGGV+V+ AN+I Y  Q+  SC L LN YA    +
Sbjct: 261 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 320

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
               Q++P S   + +D      L     L+ T++G+L LL++  +   + V  L   K 
Sbjct: 321 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 379

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
             +V    +  + +   F+GSR  DS+L++                      + G+  ++
Sbjct: 380 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 439

Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
             S L +    +E D                        + STKR     +D + D    
Sbjct: 440 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 499

Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
            +  ++ LYG +  +  S  +    +SF V D L+N+GP+   + G
Sbjct: 500 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 545



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/251 (22%), Positives = 101/251 (40%), Gaps = 27/251 (10%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            ++ + + +G LEI+ +P+F  ++ V  F      ++D      +   +     ++ +   
Sbjct: 1086 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1142

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                ++I    ++E+ +        RP L  + T   I  ++A L   P+          
Sbjct: 1143 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1190

Query: 873  TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
             S  L        RLR  R     PL A  R  T P     Q   +        F+NI  
Sbjct: 1191 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1249

Query: 923  HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            H+G F+ G  P W       +LRV P   DG + +F  L+   C+ GF+Y T    +++ 
Sbjct: 1250 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1309

Query: 982  QLPSGSTYDNY 992
             LP G +++ +
Sbjct: 1310 TLPPGYSFNEH 1320


>gi|426194401|gb|EKV44332.1| hypothetical protein AGABI2DRAFT_187183 [Agaricus bisporus var.
            bisporus H97]
          Length = 1413

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 218/1011 (21%), Positives = 400/1011 (39%), Gaps = 128/1011 (12%)

Query: 57   NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD---------- 94
            N+VV  +N++ I+ VR +      +   E +  G+T+R        V MD          
Sbjct: 49   NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108

Query: 95   --GISAASLELVCHY------RLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
               I    L  V  +      RLHG V   E + I+    A    + D ++++F+DAKI+
Sbjct: 109  KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164

Query: 144  VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            +LE+ D+IH L   S+H +E +P+ + L        R  L +VDP  RC  + +    + 
Sbjct: 165  LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220

Query: 203  ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
            IL   Q  + L V ++D   S          S +++L    + ++++V DF+F+ G+  P
Sbjct: 221  ILPFYQTQAELDVMEQDQSQSK---DVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277

Query: 259  VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
             + IL + + TW GR+     T  +   ++    +   +I S   LP+DA+ LL   + I
Sbjct: 278  TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337

Query: 319  GGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
            GGV+V+  N++ Y  QS+   +L +N +A  +      P    ++  + L+   +  + +
Sbjct: 338  GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDAALKLHLEGCRSAMVDD 397

Query: 375  DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
                L  K G +  + ++ DG+ V +L ++   P++  + I T+   +     F+GS +G
Sbjct: 398  KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
             S+L++            G K     +  D          + D   D + G+        
Sbjct: 455  PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505

Query: 491  SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
            +N     +KT      ++RD L   GP+   ++ L +N +       +ATG      + L
Sbjct: 506  ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565

Query: 542  VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
             +  LP  K    +Y   SRG  +   R     +  H +    LI+S +       +   
Sbjct: 566  FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625

Query: 596  LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
            +  +    S++    G TI A   F R  ++ V     R+L  DG+          +  +
Sbjct: 626  VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
                     +   SIADP+VL+   D SI + V  D         +P   +SS+  ++ C
Sbjct: 679  KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737

Query: 711  TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
                  G    L + + D  +  +T   +   GA          + ++    G LEI+ +
Sbjct: 738  FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786

Query: 769  PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
            P  +  F+     S ++ + DT+   A                 Q        + + ++ 
Sbjct: 787  PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833

Query: 829  MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
            +         P L   L  G +  Y+A +    +N    D P +TS  +    ++A    
Sbjct: 834  LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
              R            +  +      +T  +    + G F +G RP W +   R  ++V+P
Sbjct: 892  IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951

Query: 948  QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
                  + AFT          F+  T  G + +  +P    +D   P++ +
Sbjct: 952  S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPDFQ-FDGPLPMRSI 1000


>gi|367052335|ref|XP_003656546.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
 gi|347003811|gb|AEO70210.1| hypothetical protein THITE_2121311 [Thielavia terrestris NRRL 8126]
          Length = 1460

 Score =  155 bits (392), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 242/1057 (22%), Positives = 409/1057 (38%), Gaps = 211/1057 (19%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
           NLVV  +++++++  +V     + S +SG   R             DG+ A+        
Sbjct: 28  NLVVAKSSLLQVFRTKVVSTELEASPDSGHRSRNAARYESRLANDDDGLEASFLGGDSLA 87

Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDS 150
                     L LV    L G V  LA +    A +    DS+++A +DA++S++E+D  
Sbjct: 88  LRTDRANVTKLVLVAETPLAGTVTGLARIKTPHARHGC--DSLLIALKDARLSLVEWDAE 145

Query: 151 IHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMIIL 204
            H L   S+H +E  E       + S    PL      ++ DP  RC  +      + IL
Sbjct: 146 RHALATVSIHYYEQEEL------QGSPWAAPLSHYVNFLEADPGSRCAALKFGARNLAIL 199

Query: 205 KASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN-----------------LRDLD--MK 244
              Q    + +GD D     G   A+ +SS VI+                 L +LD  + 
Sbjct: 200 PFRQADEDIDMGDWDG-ELDGPRPAKDQSSAVIDGASNIEDTPYSPSFVLRLSNLDPSLL 258

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
           H     F+H Y EP   IL              H T M+  L +    K    I S   L
Sbjct: 259 HPVHLAFLHEYREPTFGILASTASASNSLGRKDHFTYMVFTLDLQQ--KASTTILSVGGL 316

Query: 305 PHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
           P D ++++ +P+P+GG L+VG+N  IH      +  +A+N       S   + +S  ++ 
Sbjct: 317 PQDLFRVVPLPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTRQCTSFGLVDQSELNLR 376

Query: 364 LDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT 417
           L+      L  D+   L+    G + L+T   DGR V  L+L    + +  S++   +TT
Sbjct: 377 LEGCVVDVLTADLGELLVILNDGRMALVTFRIDGRTVSGLELRMLPASSGGSIIPGRVTT 436

Query: 418 ---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
              +G +  F G   GDS+L                   FG  +  + + +R  R+    
Sbjct: 437 LSRVGRNAMFAGLEEGDSVL-------------------FGWAKKQSQAGRRRPRAKDAV 477

Query: 475 LQ------------DMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGPLK 516
           LQ            +  + ++L     A+    S+  +       +F + D LV+I P++
Sbjct: 478 LQMDEEAGEEEEEEEDEDEDDLYGEEPAARQQPSSTASSLMTGDLTFRIHDRLVSIAPIQ 537

Query: 517 DFSYGLRINADAS-----------------ATGISKQSNYELV------------ELPGC 547
             +YG  +    S                 A G  K ++   +            E P  
Sbjct: 538 AMTYGQPVWLPGSEEERNSAGVHSDLQLVCAVGRDKSASLATINLAIAPKVIGRFEFPEA 597

Query: 548 KGIWTV-----YHKSSRGHNADSSRMAAYDD--EYHAYLII------SLEARTMVLETAD 594
           +G WT+       KS +G  A +S    YD   +Y  ++I+        E   +   TA 
Sbjct: 598 RGFWTMCAKKPIPKSLQGDKAGASLGNGYDTSGQYDKFMIVGKVDLDGYEKSDVYALTAA 657

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
               +  +      G TI AG +    R+IQV +   R  DG + ++Q L   P   E  
Sbjct: 658 GFESLGGTEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIL---PMQDEE- 713

Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
           +G+E   V S S+ADP++L+   D S+ +   D S     +     + S+ K ++ C LY
Sbjct: 714 TGAEPRAV-SASVADPFLLIIRDDSSVFIARIDSSNELEELDKDDPVLSTTKWLTGC-LY 771

Query: 714 HDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
            D          S   +    +G+    A         +   +   SGAL I+ +P+   
Sbjct: 772 AD----------SAGVFAEESMGKPASTAQC-------VLMFLLSASGALYIYRLPDLAR 814

Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
              V + +S        Y+   L       + +  +GT    KE +  + V +L      
Sbjct: 815 PIYVAEGLS--------YIPPGLS-----ADYAGRKGTA---KETLAEILVADLG----D 854

Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
           + H  P+L     +  +  YQ +        S+     + S +L    V      N   +
Sbjct: 855 STHKSPYLILRHANDDLTLYQPF-------RSRKATEQAFSETLFFQKVP-----NTALA 902

Query: 894 RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDG 952
           ++P +A   +E  H      +    N+ G+   F+ G+ P + +   + + RV P L   
Sbjct: 903 KSPQEA-DEDEASHQPRFLSMRRCDNVGGYSTVFVPGASPSFIIASSKSMPRVMP-LQGS 960

Query: 953 SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            ++A +  H   C HGFIY  S+ I ++CQ P G  Y
Sbjct: 961 GVIAMSPFHTEGCEHGFIYADSRRIARVCQFPDGCIY 997


>gi|224135031|ref|XP_002321966.1| predicted protein [Populus trichocarpa]
 gi|222868962|gb|EEF06093.1| predicted protein [Populus trichocarpa]
          Length = 180

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/152 (55%), Positives = 100/152 (65%), Gaps = 22/152 (14%)

Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
           TG+ EAIDGADGG  DQGDIY V+CYE+GALEIFDVPNFN VF VDKFVSG+TH+VD++M
Sbjct: 4   TGISEAIDGADGGAHDQGDIYRVICYETGALEIFDVPNFNSVFIVDKFVSGKTHLVDSFM 63

Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
            E  +D    +N   EE  G GRKE      +V L              F ILT GTILC
Sbjct: 64  GEPPRDLTKGMN---EEVAGAGRKE------IVLL-------------FFGILTYGTILC 101

Query: 853 YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA 884
           Y A LFEGP+  SK +DPVS   S+  S++SA
Sbjct: 102 YHACLFEGPDGNSKLEDPVSAQNSVGDSSISA 133


>gi|409076059|gb|EKM76433.1| hypothetical protein AGABI1DRAFT_108759 [Agaricus bisporus var.
            burnettii JB137-S8]
          Length = 1413

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 217/1011 (21%), Positives = 399/1011 (39%), Gaps = 128/1011 (12%)

Query: 57   NLVVTAANVIEIYVVRVQE-----EGSKESKNSGETKR-------RVLMD---------- 94
            N+VV  +N++ I+ VR +      +   E +  G+T+R        V MD          
Sbjct: 49   NVVVARSNLLRIFEVREEPAPFPTQADDERERKGKTRRGTEAVEGEVEMDEEGEGFVNIA 108

Query: 95   --GISAASLELVCHY------RLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKIS 143
               I    L  V  +      RLHG V   E + I+    A    + D ++++F+DAKI+
Sbjct: 109  KSAIQKTKLPTVTKFYFIREHRLHGIVTGLEGVRIM----ASLEDKLDRLLVSFKDAKIA 164

Query: 144  VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            +LE+ D+IH L   S+H +E +P+ + L        R  L +VDP  RC  + +    + 
Sbjct: 165  LLEWSDTIHDLVTVSIHTYERAPQLISLD---SPLFRSDL-RVDPISRCAALSLPKHAIA 220

Query: 203  ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
            IL   Q  + L V ++D              S +++L    + ++++V DF+F+ G+  P
Sbjct: 221  ILPFYQTQAELDVMEQD---QSQAKDVPYSPSFILDLPIQVEENIRNVIDFVFLPGFNNP 277

Query: 259  VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
             + IL + + TW GR+     T  +   ++    +   +I S   LP+DA+ LL   + I
Sbjct: 278  TIAILFQTQQTWTGRLRESKDTARLIIFTLDILTQNSTIITSVEGLPYDAFSLLPCSTAI 337

Query: 319  GGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPR---SSFSVELDAAHATWLQN 374
            GGV+V+  N++ Y  QS+   +L +N +A  +      P    ++  + L+   +  + +
Sbjct: 338  GGVIVITGNSVIYVDQSSRRVSLQVNGWATRISDLPYPPMEEDATLKLHLEGCRSAMVDD 397

Query: 375  DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLGSRLG 430
                L  K G +  + ++ DG+ V +L ++   P++  + I T+   +     F+GS +G
Sbjct: 398  KTVFLIYKDGTVYPVELIADGKTVSKLIMA---PALAQTTIPTVVKRVDEDHLFIGSAVG 454

Query: 431  DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
             S+L++            G K     +  D          + D   D + G+        
Sbjct: 455  PSILLKTAHVEQEVEEEHGSKSGPAVVTQDV---------TMDDDDDDIYGDSTMETEPT 505

Query: 491  SNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYEL 541
            +N     +KT      ++RD L   GP+   ++ L +N +       +ATG      + L
Sbjct: 506  ANGVTHVRKTKTVIHLSLRDYLPAYGPISSMTFSLAMNGEKAVPELVAATGAGSLGGFTL 565

Query: 542  VE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----LIISLEARTMVLETADL 595
             +  LP  K    +Y   SRG  +   R     +  H +    LI+S +       +   
Sbjct: 566  FQRDLPTVKKRKILYISGSRGIWSLPIRQPLRSNTSHGHDYDTLILSTDINPSPGSSRIA 625

Query: 596  LTEVTE--SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQDLSFGPSNSE 651
            +  +    S++    G TI A   F R  ++ V     R+L  DG+          +  +
Sbjct: 626  VRSMNRDVSINSRTPGLTIGAAPFFQRTAILHVMTNAIRVLHPDGTERQ-------TIPD 678

Query: 652  SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG-DPSTCTVSVQTPAAIESSKKPVSSC 710
                     +   SIADP+VL+   D SI + V  D         +P   +SS+  ++ C
Sbjct: 679  KDGNMPRPKIRFCSIADPFVLVMREDDSIGMFVATDREKIRRKDMSPMGDKSSRY-LAGC 737

Query: 711  TLYHDKGPEPWLRKTSTD--AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDV 768
                  G    L + + D  +  +T   +   GA          + ++    G LEI+ +
Sbjct: 738  FFTDTTG----LFEANFDNKSPATTSTLQITSGAKSQ-------WLLLVRPQGVLEIWTL 786

Query: 769  PNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
            P  +  F+     S ++ + DT+   A                 Q        + + ++ 
Sbjct: 787  PKLSLAFSTPAIASLQSVLTDTHDPPA-------------PSLPQDPPRKPQDLDIEQIL 833

Query: 829  MQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR 888
            +         P L   L  G +  Y+A +    +N    D P +TS  +    ++A    
Sbjct: 834  LAPIGESSPTPHLCVFLRSGQLAIYEAVVLG--QNPEVPDTPRATSLQIQFVKIAAKSFE 891

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHP 947
              R            +  +      +T  +    + G F +G RP W +   R  ++V+P
Sbjct: 892  IQRPEENEKGILAEHKKINRMFIPFVTSPRPSVTYSGVFFTGDRPHWILSTDRSGVQVYP 951

Query: 948  QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
                  + AFT          F+  T  G + +  +P    +D   P++ +
Sbjct: 952  S-GHNVVHAFTPCSLWESKGEFLMYTEDGPILVEWVPDFQ-FDGPLPMRSI 1000


>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
           mansoni]
          Length = 1958

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 147/586 (25%), Positives = 255/586 (43%), Gaps = 106/586 (18%)

Query: 4   AAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAA 63
           A +K +  PT + NC    + H          +     +D+ L        + NLV+T  
Sbjct: 15  AVFKHISPPTAVDNCLYCHLKH----------ISPPTAVDNCLYCHLTHPKLKNLVITRG 64

Query: 64  NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGG 123
             IEIY V+        S  SGET+               V    ++ N+  +  +   G
Sbjct: 65  GFIEIYNVK--------SSASGETR------------FNWVYGTSVYENIADIVTVRFTG 104

Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV 183
                   S++L+F +AK++V+ F+     LR  S+H +E   + +LK GR +F + P++
Sbjct: 105 DLLD----SLLLSFPEAKVAVMNFNPVTFELRTLSLHNYE---FENLKSGRMNFTKLPIL 157

Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED----TFGSGGGFSARIESSHVINLR 239
           ++DP  RC  +LVY   + +L   +    +  + D    +  +   +  R  +  +    
Sbjct: 158 RLDPHQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHISVRNSLLWQQRATAPLLATFT 217

Query: 240 DL-------DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
                     + +V D  F++G+ EP +++L+E   TWAGRVS +  TC I ALS +   
Sbjct: 218 TCLSTSTGEKINNVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQK 277

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VS 348
           + +P+IW   +LP D   +++VP PIGGV+V+ AN+I Y  Q+  SC L LN YA    +
Sbjct: 278 RTNPVIWFQESLPFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTN 337

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLSKT 406
               Q++P S   + +D      L     L+ T++G+L LL++  +   + V  L   K 
Sbjct: 338 FPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKV 396

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF---------------------TCGSGTSM 445
             +V    +  + +   F+GSR  DS+L++                      + G+  ++
Sbjct: 397 GHAVPPHCMVLLESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVDHQLLNQSSGTNNTL 456

Query: 446 LSSGLKEEFGDIEAD------------------------APSTKRLRRSSSDALQD---M 478
             S L +    +E D                        + STKR     +D + D    
Sbjct: 457 KDSELVDGKSIVEDDSDEIPNKCPRIEEGENDKTISKSLSQSTKRNTLDENDIISDNHYK 516

Query: 479 VNGEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYG 521
            +  ++ LYG +  +  S  +    +SF V D L+N+GP+   + G
Sbjct: 517 FDEVDVELYGESILSPPSIYREIVNYSFKVVDRLINLGPMGQLTSG 562



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/259 (22%), Positives = 104/259 (40%), Gaps = 27/259 (10%)

Query: 753  YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
            ++ + + +G LEI+ +P+F  ++ V  F      ++D      +   +     ++ +   
Sbjct: 1103 FAFIVFTNGVLEIYSLPDFTLLYEVHHFTDLPQMLID---HRGVSSEQLHKQYTNSQNVS 1159

Query: 813  QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
                ++I    ++E+ +        RP L  + T   I  ++A L   P+          
Sbjct: 1160 YTEDDSIPP-PILEILVYPIGIDKDRPVLM-VRTSQEIAFFEA-LCPSPDE--------- 1207

Query: 873  TSRSLSVSNVSASRLRNLRFS-RTPLDAYTREET-PHGAPCQRITI--------FKNISG 922
             S  L        RLR  R     PL A  R  T P     Q   +        F+NI  
Sbjct: 1208 -SYPLISGTFYEGRLRWRRLPLPCPLVAPRRVRTDPKIMDVQSTLLTRTHMLRSFENIGD 1266

Query: 923  HQGFFLSGSRPCWCMVFRE-RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
            H+G F+ G  P W       +LRV P   DG + +F  L+   C+ GF+Y T    +++ 
Sbjct: 1267 HRGVFVCGGNPIWLFATDSGQLRVFPHSIDGIMGSFAPLNAKICHSGFVYFTFSNEMRLA 1326

Query: 982  QLPSGSTYDNYWPVQKVVF 1000
             LP G +++ +  ++ +  
Sbjct: 1327 TLPPGYSFNEHLGIKWITL 1345


>gi|358056450|dbj|GAA97624.1| hypothetical protein E5Q_04302 [Mixia osmundae IAM 14324]
          Length = 1305

 Score =  152 bits (385), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 169/673 (25%), Positives = 300/673 (44%), Gaps = 98/673 (14%)

Query: 55  VPNLVVTAANVIEIY-----VVRVQE---EGSKESKNSGETKRRVLMDGISAASLELVCH 106
           V NLVV  +N +++Y      V VQ    +GS  S    +T+            L+L+  
Sbjct: 37  VRNLVVARSNFLQVYEVLEEPVPVQSSVTDGSSASMREDQTR------------LQLLAE 84

Query: 107 YRLHGNVESLAILSQGGADNSRR--RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
           +  HG V  LA LS     ++R+  R  ++++F DAK++V+E+ D +H L   SMH FE 
Sbjct: 85  HVCHGIVTGLARLS---TLDTRQDGRHRLVISFRDAKMTVMEWSDQLHDLAPVSMHSFE- 140

Query: 165 PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGG 224
                L +G +  A   +++VD   RC  +L+    + IL   Q  S L   ED     G
Sbjct: 141 -RLPQLSQG-DLGAFQAVLRVDQASRCVALLLPDNTLGILPFFQDLSEL---ED-MTREG 194

Query: 225 GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCM 282
             S     S  I+L ++   +++V DF F+ G+ EP + IL +R+ TW GR+ +      
Sbjct: 195 LQSLPYAPSLTIDLSEIGPGIRNVVDFAFLPGFSEPTIAILFQRKPTWTGRIDFAKDITS 254

Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALA 341
           +  +++    + +P+I+ A  LP+DA  L   P  +GGV+++ AN+ +H    S    +A
Sbjct: 255 LVMVTLDIGSRNYPVIFEADGLPYDALSLSVCPRELGGVVILCANSLVHIDQSSKMTGIA 314

Query: 342 LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
           +N +  +L  ++   R +  + L+ A   ++   VA+L T+TG+   L +  DGR V  +
Sbjct: 315 VNGWTSTLTDARLDSRPTLRLVLEGAQCAFVGQQVAVLCTRTGETFSLHLEKDGRNVSSM 374

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
           D      + + + I T+G +  F+GS  G S+L+++   SG             DI    
Sbjct: 375 DCRPRAVTCIPACIETVGAAYVFVGSAQGQSVLLRWASQSGAG----------ADILDIT 424

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSA-SNNTESAQ-----KTFSFAVRDSLVNIGPL 515
            S   L +  SDA+ D        LY +A ++N    Q     K     + D+L   G +
Sbjct: 425 ESGTGLVQ--SDAMDD-------DLYATAGAHNGNGHQIAPTGKDVQLELCDTLPGYGTI 475

Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
           +  +       D ++  + + S      +    G+ T++         D     A D  +
Sbjct: 476 RHIAV-----LDHTSASLDEPSLVACTGVQAMAGLTTIHRHVPSVRQVDLDLPTARDIRH 530

Query: 576 HAYLIISLEAR----------TMVLETADLLTEVTESVDYFVQGRT---------IAAGN 616
                + LE R           ++  T    + +  ++D   Q  T         +AAG+
Sbjct: 531 --IWTVGLEQRQKMGRGPITHQIICSTGS--SSMVYTLDQDTQAATLARKSAEVPLAAGS 586

Query: 617 LFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
            F R +V++V E   R+            G   +E+  G  ++  + V+++DP+V +  +
Sbjct: 587 FFSRSQVLEVTEDMLRLYSPD--------GQITTEAPHGQADA--IDVTVSDPFVAVLSA 636

Query: 677 DGSIRLLVGDPST 689
             ++ +  GDP+T
Sbjct: 637 ARNVTVFFGDPTT 649


>gi|440637976|gb|ELR07895.1| hypothetical protein GMDG_02777 [Geomyces destructans 20631-21]
          Length = 1495

 Score =  152 bits (384), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 235/986 (23%), Positives = 386/986 (39%), Gaps = 153/986 (15%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
            S   L LV  Y L G V SLA +    +++    +S++L+F+DAK+S++E+D   HGL  
Sbjct: 147  STTKLVLVGEYALAGTVTSLARIKI--SESKSGGESLLLSFKDAKLSLVEWDPERHGLST 204

Query: 157  TSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS 211
             S+H +E  E     W        ++     +  DP+GRC  +      + IL   QG  
Sbjct: 205  VSIHYYEQEEIGGSPWDPYLSNCFNY-----LTADPRGRCAALKFGARNLAILPFRQGDE 259

Query: 212  GLVGDE--------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                D+               T  + G        S V+ L  LD  + H     F++ Y
Sbjct: 260  DTTMDDWDEELDGPRPTTAIITSENKGHEDTPYAPSFVLRLSSLDPTLIHTVHLAFLYEY 319

Query: 256  IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
             EP   IL       +  +  +         ++    K    I     LP+D +K++ +P
Sbjct: 320  REPTFGILSSTLSPSSSLLDERKDQLSYMVFTLDLNQKASTTILVVTGLPYDLFKVIPLP 379

Query: 316  SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
            SPIGG L+VG N  IH      +  +A+N  A S  S   + +SS  + L+      L  
Sbjct: 380  SPIGGALLVGGNELIHIDQSGKANGVAVNALAKSCTSFGLVDQSSLQMRLEGCAVEQLSA 439

Query: 375  DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFL 425
            D    L+   TG+L +L+   DGR V  L+L +  PS          S  + I ++  F+
Sbjct: 440  DNGEMLIILNTGELAVLSFRMDGRSVSGLNLRRV-PSESGICMGAQASCTSLINHNSMFI 498

Query: 426  GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE-- 483
            GS   DS+++ ++  S  +      +      +          +   D  +D + GE   
Sbjct: 499  GSEDTDSIVLGWSRKSKQAGRRRS-QPTIDAGDDADVDGTDEDQEDEDEDEDDLYGESTA 557

Query: 484  -LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----RINADASATGISKQSN 538
             + L G  + +  S    ++F + DSLVNI PL+D +       R + D  AT IS +SN
Sbjct: 558  AIPLKGEVAADANSKAGDYAFRIHDSLVNIAPLRDVTLSKPETPREDEDEEAT-ISTRSN 616

Query: 539  YELV--------------------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
            +ELV                          E P  +GIWT+  K       +  +  A  
Sbjct: 617  FELVGVTGRNTSGSLAFLRREIEPNVIGRFEFPEARGIWTLCAKRPLIKGLEPEKSEAIL 676

Query: 573  D-------EYHAYLIISL-------EARTMVLETADLLTEVTESVDYF-VQGRTIAAGNL 617
            D       ++   +I+S        E+   VL +A    E     ++    G TI  G +
Sbjct: 677  DPESELGAQFDRLMIVSKSTEDTPEESSVYVLTSAGF--EALADTEFEPAAGATIKCGTV 734

Query: 618  FGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS 676
                RV+Q+ +   R  DG   + Q L     + E+G+      +++ SI DPYVLL   
Sbjct: 735  GNGMRVVQILKSEVRSYDGDLGLAQILPM--FDDETGA---EPKIVAASIVDPYVLLIRD 789

Query: 677  DGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG 736
            D SI +   D       ++       + K +S C LY+D                STG+ 
Sbjct: 790  DASIFVASCDSDNDLEEIERGDDSLLTNKWLSGC-LYND----------------STGMF 832

Query: 737  EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--R 793
                 ++G    +  I S++  E GAL ++ +P+ +  ++  +      T I   Y   R
Sbjct: 833  AETALSNGTVSKKSVIMSLLNSE-GALFMYALPDLSKPIYQANGVSFIPTTISPDYATRR 891

Query: 794  EALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCY 853
              + ++ TE+                       L      A    P+L    ++  +  Y
Sbjct: 892  STVAETLTEV-----------------------LLADLGDATSKSPYLIFRASNDDLTIY 928

Query: 854  QAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR 913
            +   F+ P     S+ P   S+SL    +    +       T + A   E    G+P + 
Sbjct: 929  EP--FQVP-----SEAPRPLSKSLHFQKIHNPHVAKTANPETEV-AADAESAKRGSPMRA 980

Query: 914  ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVT 973
            I    N+ G    FL G  P + +   +       L    + + +  H   C+ GFIYV 
Sbjct: 981  IA---NVGGLSSVFLPGDSPSFVVKSSKSTPRVVGLRGHGVRSLSGFHTEGCDRGFIYVD 1037

Query: 974  SQGILKICQL-PSGSTYDNYWPVQKV 998
            S+GI ++ QL P  +  D    ++KV
Sbjct: 1038 SKGIARVSQLEPETNVTDIGLTLRKV 1063


>gi|257215708|emb|CAX83006.1| Cleavage and polyadenylation specificity factor subunit 1
           [Schistosoma japonicum]
          Length = 462

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 112/397 (28%), Positives = 195/397 (49%), Gaps = 45/397 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLV+T +  IEIY ++        S  SGET+     + +   S+        + N+  +
Sbjct: 41  NLVITRSGFIEIYNIK--------SSVSGETR----FNWVYGTSV--------YENIADI 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +   G            +F +AK++V+ F+     LR  S+H +E   + +LK GR +
Sbjct: 81  VSVRFAGDLLDSLLL----SFSEAKVAVMNFNPITFELRTLSLHNYE---FENLKSGRMN 133

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILK--------ASQGGSGLVGDEDTFGSGGGFSA 228
           F + P++++DP  RC  +LVY   + +L         +++     +G  +        +A
Sbjct: 134 FTKLPILRLDPYQRCAVMLVYDRHLAVLPFRRTEVLVSAETDPKHIGVRNFLLWQQRATA 193

Query: 229 RIESSHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
            + ++    L       + +V D  F+HG+ EP +++L+E   TWAGRVS +  TC I A
Sbjct: 194 PLLATFTTCLSTSTGEKINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVA 253

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNN 344
           LS +   + +P+IW   +LP D   ++ VP PIGGV+++ AN+I Y  Q+  SC+L LN 
Sbjct: 254 LSFNLQKRTNPVIWFQESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNC 313

Query: 345 YA---VSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQ 399
           YA    +    Q++P S   + +D      L     L+ T++G+L LL++  +   + V 
Sbjct: 314 YAQISTNFPMRQDVP-SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVT 372

Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
            L   K   +V    +  + +   F+GSR  DS+L++
Sbjct: 373 SLLFHKVGHAVPPHCMVLLESKYLFIGSRFCDSVLMK 409


>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
 gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
          Length = 1003

 Score =  150 bits (378), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 143/529 (27%), Positives = 215/529 (40%), Gaps = 104/529 (19%)

Query: 543 ELPGCKGIWTVY------HKSSRGHNADSSRMAAYDDEY-----------------HAYL 579
           +LPGC  +WTV            G  A+S+      +                   H +L
Sbjct: 107 DLPGCLDMWTVIGIPPESKPQEEGEKAESAGSEEKPEGEKEETKEEGPPDVDLTNSHGFL 166

Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYM 639
           I+S E  TMVL+T   + E+  S  +  QG T+ AGN+   + +IQV   G R+L G   
Sbjct: 167 ILSREDSTMVLQTGKEIMELDHS-GFSTQGPTVYAGNIGNNKYIIQVSPYGIRLLQGVKQ 225

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTP 697
            Q L F          S+    +  S+ADPY L+   DG I LL  V DP      +   
Sbjct: 226 LQHLPFD---------SKGPAFVLASVADPYALVMSEDGQILLLTLVNDPYGSGHRLSAK 276

Query: 698 AAIESSKKPVSSCTLYHDKG------------PEPWLRKTSTDAW------LSTGV---- 735
               + K    +   Y D              P P + K + +        + TGV    
Sbjct: 277 KIDMAGKSQAITVCAYRDTSGLFTVSSPSTTTPAPEVEKDAAEPAAEDAVAMETGVDDED 336

Query: 736 ----GEAIDGADGGPLDQGDI-------------------YSVVCYESGALEIFDVPNFN 772
               GE      G  + + ++                   + V+C E+G+LEI+++P+F+
Sbjct: 337 EMLYGEPSAKPSGPAVVREEVKPSTSTVQEPVVKEVEPTHWCVICRENGSLEIYNLPDFS 396

Query: 773 CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
            V+ V  F +G   +VD++   +   +                K+      V E+ M   
Sbjct: 397 LVYLVKNFPTGMKLLVDSFQSTSSASTSQS------------DKQGDQLASVKEILMVGL 444

Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
               SRP L A + D  +L Y+A+    P + S    P  T   +    V  + +   R 
Sbjct: 445 GHKGSRPHLLARV-DEDLLIYEAF----PYHLS----PSYTMLKIRFKKVQHNLILRERK 495

Query: 893 SRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQL 949
                 A  +EE+    G+  Q    F +ISG+ G F+ GS P W  M  R  LR+HP  
Sbjct: 496 GGKTKKAGDQEESDGQTGSRIQHFRTFTDISGYSGLFICGSSPHWLFMTSRGALRIHPMS 555

Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            DG++  F+  HNVNC  GF+Y    G L+I  LP+  +YD  WPV+KV
Sbjct: 556 IDGAVTCFSPFHNVNCPKGFLYFNRGGELRISVLPTHLSYDAPWPVRKV 604


>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
          Length = 1393

 Score =  149 bits (377), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 215/1019 (21%), Positives = 408/1019 (40%), Gaps = 159/1019 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           +L+V   ++++IY +  +     E  ++ +   ++L++            Y L G V  L
Sbjct: 28  DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76

Query: 117 ----AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                + S+ G +      ++++AF +AK+S++E+D    G+   S+H +E  +      
Sbjct: 77  CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
             +    G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS         
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187

Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
              GGG           +      S V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
             T +  +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307

Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
           N  +H      + A+ +N ++  + S     +S  ++ L+      L +     LL   T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367

Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
           G++ ++    DGR V  + +         + I       T IG+   FLGS  GDS+L+ 
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427

Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
           ++  S ++       ++  D  AD        +S  D  +D +     + +L G   +  
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEED---QSDDDVYEDDLYSTSPDTTLTGRRPSGE 484

Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
            SA   + F + D L+NIGPL+D + G R++ +   TG    S    +++   +G     
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544

Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
              V  +    H   S  + + D  + A L    EA             R  V+ T    
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604

Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
           ++  ES+ + V G                 TI+ G    R+RV+QV +   R  D    +
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
           TQ       ++     ++    +S S+AD  + +   D ++  L  D S     V     
Sbjct: 665 TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719

Query: 700 IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
           + S K    SC LY DK                TG+  +ID     P+ + D++  +   
Sbjct: 720 VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760

Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
              L ++ V +   + ++ +   G + ++                 SSE     G +EN+
Sbjct: 761 DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802

Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
               V +L  + WSA    P+L        ++ Y+ ++             VST     +
Sbjct: 803 IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845

Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
            ++  S+  N    R P    + + +      + + I  +ISG    F+ G+   + +  
Sbjct: 846 HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFIIRT 905

Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
                   +L   +  + + L    C+ GFIY+ SQ  ++ C+LP  + +D  W +++V
Sbjct: 906 SASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLPPMTRFDYQWTLKRV 964


>gi|170102106|ref|XP_001882269.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164642641|gb|EDR06896.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1406

 Score =  149 bits (377), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 203/975 (20%), Positives = 381/975 (39%), Gaps = 188/975 (19%)

Query: 57  NLVVTAANVIEIYVVR-----VQEEGSKESKNSGETKR-------RVLMD---------- 94
           N+VV  +N++ I+ VR     +Q +   E +   + +R        V MD          
Sbjct: 51  NVVVARSNLLRIFEVREEPCPIQNQADDERERRSKVRRGTEAVEGEVAMDEQGDGFINIA 110

Query: 95  --------GISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFEDAKIS 143
                     +      V  + LHG    +E + I+S        R D ++++F+DAKI+
Sbjct: 111 KSQKCPTHTPTVTRFYFVREHHLHGIVTGIEGVKIMSS----LEDRLDRLLISFKDAKIA 166

Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
           +LE+ D++H L   S+H +E +P+ + +     S  R  L + DP  RC  + +    + 
Sbjct: 167 LLEWSDAVHDLITVSIHTYERAPQLMSID---SSLFRTEL-RTDPISRCAALSLPRHALA 222

Query: 203 ILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEP 258
           IL   Q  + L V D+D              S +++L    D ++++V DF F+ G+  P
Sbjct: 223 ILPFYQSQAELEVMDQD---QSQAKDVPYSPSFILDLPAQVDQNIRNVIDFAFLPGFNNP 279

Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            + +L + + TW GR+     T  +   ++    + +P+I S   LPH+   LL   + +
Sbjct: 280 TIAVLFQTQQTWTGRLREFKDTVRLVIFTLDIVTQNYPIITSVEGLPHECLALLPCGTSL 339

Query: 319 GGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQELPRSSFSVE-------LDAAHAT 370
           GGV+++ +N I Y  QS+    L +N +   +    ++P  S + E       L+ + A 
Sbjct: 340 GGVVIITSNAIIYTDQSSKRVVLPVNGWVSRI---SDIPLPSLTPEEQLRNICLEGSRAV 396

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL----FFLG 426
           ++ +    +  K G +  L +V DG+ V +L +S   P +  + I ++   L    F +G
Sbjct: 397 FVDDRNLFVILKDGTVYPLEIVVDGKTVSKLTMS---PPLAQTSIPSVLRKLDDDHFLVG 453

Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFG---DIEADAPSTKRLRRSSSDALQDMVNGEE 483
           S +G S+L++          ++ ++EE     D+EA AP+T        +   D  N   
Sbjct: 454 SSVGPSVLLK----------AAHIEEEVAEDHDMEA-APATVVYDADDMEFDDDDGNLPR 502

Query: 484 LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQS 537
           ++          +       ++RDSL   GP+ D ++ L  N D       +ATG     
Sbjct: 503 VA-------QPMAKPTVIHLSLRDSLPAYGPISDMTFSLAKNGDRPVPELVAATGSGFLG 555

Query: 538 NYELVE--LP-----------GCKGIWTV------------YHKSSRGHNADSSRMAAYD 572
            + L +  LP           G +G+W++            Y K+     A++  +    
Sbjct: 556 GFTLFQRDLPVRTKRKLHVIGGARGLWSLPIRQPVKASGISYEKAVNPFQAENDSLIIST 615

Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           D  +    +S   +  V+ TA             + G TI A   F R  V+ V     R
Sbjct: 616 D-INPSPGLSRAGKNDVMITAR------------IPGTTIGAAPFFQRTTVLHVMTNALR 662

Query: 633 ILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
           +L+ G  + +D+                 + + SI+DP+VL+   D SI L +G+     
Sbjct: 663 VLEPGMQIIKDMD---------GNMPRPRIRACSISDPFVLILREDDSIGLFIGETERGK 713

Query: 692 VSVQTPAAIESSKKPVSSCTLYHDKG-PEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
           +  +  + +        SC      G  E     ++T   +++ +  A++    G     
Sbjct: 714 IRRKDMSPMGDK----VSCFYTDTTGLLESNFENSTTPVGVTSTLSAAVNAGSKGQ---- 765

Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEG 810
             + ++    G +E++ +P     F+ D   S +  +VD++   A               
Sbjct: 766 --WLILVRPQGIVELWTLPKLTLGFSADGLTSLQNVLVDSHDPPA-------------PS 810

Query: 811 TGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDP 870
             Q          V ++ +        RP L   L  G +  Y+                
Sbjct: 811 LPQDPPRKPQEFDVEQILVAPIGESSPRPHLCVFLRSGQLTIYEVLPLG----------- 859

Query: 871 VSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISG-------- 922
             T+ +L     +  +++ ++ S    +    EE   G   ++  I++            
Sbjct: 860 -RTTEALPKVRPAHVKIKFVKISSMAFEIQRPEEGEKGIIAEQKRIYRMFVPFVTSASPG 918

Query: 923 --HQGFFLSGSRPCW 935
               G F +G RP W
Sbjct: 919 VTFSGVFFTGDRPNW 933


>gi|116182170|ref|XP_001220934.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
 gi|88186010|gb|EAQ93478.1| hypothetical protein CHGG_01713 [Chaetomium globosum CBS 148.51]
          Length = 1394

 Score =  148 bits (374), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 244/1050 (23%), Positives = 401/1050 (38%), Gaps = 204/1050 (19%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLM---------DGISAA-------- 99
            NL V  +++++I+  +V       S+N+G   R             DG+ A+        
Sbjct: 41   NLAVAKSSLLQIFRTKVIATELDTSQNNGHRTRNANRYESRLANDDDGLEASFLGGDSLA 100

Query: 100  ---------SLELVCHYRLHGNVESLAILSQGGADNSRR-RDSIILAFEDAKISVLEFDD 149
                      L LV  + L G V  L  +      N+R   DS++LAF+DAK+S++E+D 
Sbjct: 101  QRTDRANYTKLVLVAEFPLAGTVTGLVRIK---TPNARLGLDSLLLAFKDAKLSLVEWDT 157

Query: 150  SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
              H L   S+H +E  E                +  DP  RC  +      + IL   Q 
Sbjct: 158  EHHTLSTVSIHYYEQEELQGSPWAAPLSHYANFLAADPGSRCAALKFGARNLAILPFKQA 217

Query: 210  GSGL-VGDEDTFGSGGGFSARIES----------------SHVINLRDLD--MKHVKDFI 250
               + +GD D    G   +  + S                S V+ L +LD  + H     
Sbjct: 218  DEDIDMGDWDEELDGPRPAKDLSSAVINGASNIEDTPYSPSFVLRLSNLDPSLLHPVHLA 277

Query: 251  FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
            F+H Y EP   IL              H   M+  L +    K    I S   LP D ++
Sbjct: 278  FLHEYREPTFGILASTAAASNSLGRKDHFVYMVFTLDLQQ--KASTTILSVTGLPQDLFR 335

Query: 311  LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
            ++ +P+P+GG L+VG+N  IH         +A+N       S   + +S  ++ L+    
Sbjct: 336  VVPLPAPVGGALLVGSNELIHIDQSGKPNGVAVNPMTKHCTSFGLVDQSDLNLRLEGCVI 395

Query: 370  TWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGN 420
              L  D+   L+    G + ++T+  DGR V  L+L    + +  S++   ++T   IG 
Sbjct: 396  DVLAADLGELLIILNDGQMAVMTLRIDGRTVSGLELKILPASSGGSIVPGRVSTLSRIGR 455

Query: 421  SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA------ 474
            +  F G   GDS+L                   FG  +      +R  R+  +A      
Sbjct: 456  NAMFAGLEEGDSVL-------------------FGWAKKQTQVGRRKPRTKDNAGDVDVE 496

Query: 475  ---LQDMVNGEELSLYGSASNNTESAQKTF--------SFAVRDSLVNIGPLKDFSYGLR 523
                 +    +E  LYG AS                  S  V D L+N+GP++  +Y   
Sbjct: 497  EDEDIEEEEEDEDDLYGEASAPQHQPVSAVSGLLSGEASLRVHDRLINLGPIQAMTYSQP 556

Query: 524  INADAS-----------------ATGISKQS-----NYEL-------VELPGCKGIWTVY 554
            +    S                 A G  K +     N E+        E P  +G WT+ 
Sbjct: 557  VWLPGSEEERNSAGVHSDLQLVCAVGREKSASLVTMNLEIQPKVIGRFEFPEARGFWTMC 616

Query: 555  HKSSRGHNADSSRMAAY-------DDEYHAYLIIS------LEARTMVLETADLLTEVTE 601
             K        S +   +         +Y  ++I++       E   +   TA     +  
Sbjct: 617  AKKPIPKTLQSDKGGNFLGKDYDVSGQYDKFMIVAKVDLDGYEKSDVYALTAAGFESLGG 676

Query: 602  SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
            +      G TI AG +    R+IQV +   R  DG + ++Q +     + E+G+      
Sbjct: 677  TEFDPAAGITIEAGTMGKGSRIIQVLKSEVRCYDGDFGLSQIVPM--LDEETGA---EPR 731

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
             +S SIADP +L+   D S+ +   D S     ++      ++ K ++ C LY D     
Sbjct: 732  AISASIADPLLLIIRDDSSVFVAQMDSSNELEELEKEDQTLATTKWLTGC-LYAD----- 785

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
                 +T A+      E + G  G P     I   +   SG+L I+ +P+ +    V + 
Sbjct: 786  -----TTGAF-----AEEVAGKGGKPAQA--ILVFLLSASGSLYIYRLPDLSKPVYVAEG 833

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
            +S        Y+   L       + S+ +GT    KE +  + V +LA  R    H+   
Sbjct: 834  LS--------YIPPGLS-----ADYSARKGTA---KETVAEILVADLA-NRSQLRHAN-- 874

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
                  D TI  YQ + +    +TS   D    S++L        +L N  F+++P +A 
Sbjct: 875  -----DDLTI--YQPFRY----STSAGAD---FSKTLFF-----QKLPNAAFAKSPEEAD 915

Query: 901  TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTV 959
              E T H      +    NI+G+   FL G+ P + +   +   RV P L    ++A + 
Sbjct: 916  EDEAT-HQPRMLSMRRCSNIAGYSTVFLPGASPSFIIKSSKSAPRVLP-LQGAGVIAMSP 973

Query: 960  LHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
             H   C +GFIY  SQ + ++ QLP    Y
Sbjct: 974  FHTEGCENGFIYADSQHMARVTQLPQDWNY 1003


>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
           24927]
          Length = 1407

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 226/1077 (20%), Positives = 407/1077 (37%), Gaps = 258/1077 (23%)

Query: 57  NLVVTAANVIEIYVVRVQEEGS-----KESKNSGETKRRVLM-----DGISAAS------ 100
           NLVV   ++++I+ +   E+        E+K+ G + RRV       D  +  S      
Sbjct: 31  NLVVAKTSLLQIFRLVEYEDAEGEFALDEAKDEGGSDRRVFEGRDHEDSFTVESGMHLQR 90

Query: 101 --------LELVCHYRLHGNVESLAIL----SQGGADNSRRRDSIILAFEDAKISVLEFD 148
                   L+LV  Y L+G+V S+  +    S+ G D       ++++F+ AKIS+LE+D
Sbjct: 91  ETIEKTTKLDLVAQYHLYGSVTSMVKIRIPTSKSGGD------CLLVSFDSAKISLLEWD 144

Query: 149 DSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK--------VDPQGRCGGV-LVYGL 199
            + H +   S+H +E  E+           R PL           DP+ RC      + L
Sbjct: 145 PAAHSISTISLHYYEGDEF-----------RSPLTPEFPINYLISDPKSRCAAFKFNHDL 193

Query: 200 QMII-LKASQGGSGLVGDEDTF-----------------------GSGGGFSARIESSHV 235
             I+  + ++     + D D+F                       G G         S V
Sbjct: 194 VAILPFRQTEDEDLEIPDNDSFTYDLEDDDDAEKPKKDVEMKDNTGEGKPSDTPYHPSFV 253

Query: 236 INLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           ++   LD  ++ + D +F+H Y EP   I+++ +    G +  +        +++    +
Sbjct: 254 LSASQLDESVERIIDIVFLHEYREPTFGIVYQPQQGSVGMLERRKDPTHFIVVTLDLDQR 313

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSS 352
               I SA NLP D +K +A+P PIGG L++G + I H        A+A+N+YA    + 
Sbjct: 314 ASTSIMSAKNLPFDIWKAVALPPPIGGTLLLGEHEIVHVDQAGKMSAVAVNSYAQQYSAF 373

Query: 353 QELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTN--- 407
               +S   + L++  A  L N+    L+ T  GD  +L+   +GR +  L + +     
Sbjct: 374 NMTDQSDLELNLESCSAISLPNENGDVLIVTIAGDFAILSFKAEGRSISSLSVRRIQSKD 433

Query: 408 ----PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
                S     +  +GN  FFLGS   D++L  +      + LS               S
Sbjct: 434 GYPFTSAPCETLVEVGNRRFFLGSLDSDAMLWGYKRKGEKTSLSQK-------------S 480

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------------FSFAVRDSLVN 511
             +L R  +       + +E  LYG ++    + +K             + F   D L N
Sbjct: 481 EVKLERDDA-EDNVEDDDDEDDLYGESTVTPITPRKASSGNIGRGSSGEYVFRRHDRLQN 539

Query: 512 IGPLKDFSYG--------LRINADA-------SATGISKQSNYEL------------VEL 544
           +GP +  ++G        L+++          + TG   +    +             + 
Sbjct: 540 VGPCRQMAFGRPAMLPEKLKLHQGVLPELELMATTGRGVEGAVTVFNTSICPRVSATFDF 599

Query: 545 PGCKGIWTVYHKSSRGHNA--DSSRMAAYDDE------YHAYLIISLEARTMV------- 589
             C+ +W V+ K  +   +   SS    Y+++      Y  YL  S  + T+V       
Sbjct: 600 KDCQRLWAVHSKQVKKGQSMIPSSVSKGYEEQIGATEDYSTYLFASNTSETLVYKVGTKF 659

Query: 590 --LETADL-LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
             LE  D+  TEV  ++++         G      R+ QV E   ++ D     Q +   
Sbjct: 660 EPLEGTDIETTEVCPTLEF---------GTFQDGLRIAQVCETNVKVYDSEL--QLIQII 708

Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKK 705
            +N E   G  +  ++S S ADPY+LL   D SI        T  +  ++ PA I+ +K 
Sbjct: 709 STNDEDPDGGPH--IVSASFADPYMLLICGDSSILACQCHERTLELDRIELPATIKDTK- 765

Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY---ESGA 762
                                T+  L T   E            G    V+C+   E G 
Sbjct: 766 --------------------YTNGCLYTSSSEV--------FGLGTKSQVLCFLLTEEGT 797

Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG-QGRKENIHS 821
           L++F +PNF    T++ F                 D   ++ S  E        ++ I  
Sbjct: 798 LQVFTLPNFELKATLEHF-----------------DMSLQLVSPDETALRFHTARDEIEE 840

Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
           + V +L      A    P+L        I+ Y+ ++  G                     
Sbjct: 841 IIVADLGDNISKA----PYLIVKTKRDDIIIYEPFISNG--------------------- 875

Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRE 941
           +   ++ N   +  P  + + +++P G P  +I    ++ G+   F++G  P +     +
Sbjct: 876 ICFKKIYN---TVLPTVSLSEQKSPSG-PLVKI---DDLGGYSVAFMAGDTPTFITKSSK 928

Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            L    +L  G + + +  +      GF+Y+ S+G  ++C  P  S  ++ W  Q++
Sbjct: 929 TLPKLYKLQGGMVRSLSPFNTKETERGFLYIDSKGTARVCHFPEVSM-EHTWLSQRI 984


>gi|406865186|gb|EKD18229.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
           MB_m1]
          Length = 1443

 Score =  147 bits (371), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 234/1046 (22%), Positives = 414/1046 (39%), Gaps = 188/1046 (17%)

Query: 57  NLVVTAANVIEIYVVRVQ------EEGSKESKNSGETKRRVLMDGISAA----------- 99
           NL+V   ++++++  ++       EEG+  SK + +    +  DG+ A+           
Sbjct: 28  NLIVAKTSLLQVFTTKITSIELGIEEGA--SKQNDKWDPSLDNDGLDASFIGADSLLRPD 85

Query: 100 -----SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
                 L LV  Y L G + SLA +    + +    +++++ F DAK+S++E+D +  G+
Sbjct: 86  RARRTKLVLVAEYTLSGTITSLARIKTLSSKSGG--EALLVGFRDAKLSLVEWDPARPGI 143

Query: 155 RITSMHCFESPEWLHLKRG------RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
              S+H +E  E   L+R       +ES      +  DP  RC  +   G  + I+   Q
Sbjct: 144 STISIHYYEQDE---LQRSPWAPNLKESVN---YLIADPGSRCAALKFGGRNLGIIPFKQ 197

Query: 209 GGSGLVGDE----------------DTFGSGGGFSARIESSHVINLRDLDMKHVKD--FI 250
               +  D+                    S          S V+ L  LD   +      
Sbjct: 198 DDEDVNMDDWDEEIDGPRPADKVITKATNSSNDKETPYGPSFVLRLATLDPNLINPIHLA 257

Query: 251 FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
           F++ Y EP   IL   ++  +  +  +  H T M+  L +    +    I S   LP+D 
Sbjct: 258 FLYEYREPTFGILSSSQMPASSLLFERRDHLTYMVFTLDLQQ--RASTTIMSVTGLPYDL 315

Query: 309 YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           ++++ + +P+GG L++G N  IH      +  +A+N +A    S   + +S   + L+ +
Sbjct: 316 FEVVPLDAPVGGALLIGTNELIHIDQAGKANGVAVNVFAKQCTSFGLVDQSGLDMRLEGS 375

Query: 368 HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITT---I 418
               L  Q+   ++  +TG++ +L+   DGR V  L + + +     SV+ + ++T   I
Sbjct: 376 KIEQLSIQSGEMIIFLQTGEIAILSFHMDGRSVSSLSVRRVSAEAGGSVIPARVSTLSHI 435

Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
           G +  F+GS   DS+++ +   S  S   S  K     IE    ++        D   D 
Sbjct: 436 GQNTLFVGSACADSMVLGW---SRKSNQVSRRKPRVEVIEDADDASLDELDDEDDDADDD 492

Query: 479 VNGEELSLYGSASN------NTESAQKTFSFAVRDSLVNIGPLKDFSYG---LRINADAS 529
           + GE  S+   A+N         S    + F V DSLVNI P+ + ++G   L  N D  
Sbjct: 493 LYGEGPSIIQDATNGVAKSDTVNSKAGDYVFQVHDSLVNIAPIVNITFGNASLSQNEDEK 552

Query: 530 ATGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGH 561
              +  +   ELV                          E P  +GIWT+  K  + +G 
Sbjct: 553 LDSVGVRGYLELVASVGKQRAGALAVIHQNIQPKVIGRFEFPEARGIWTMSAKRPAEKGL 612

Query: 562 NADSSRMA-----AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQG 609
            A   + +     A D +Y   +I+S +A +   ET+D+    + + +           G
Sbjct: 613 EAKKEKSSTSGDYAIDAQYDRLMIVS-KALSDGTETSDVYALTSANFEALTGTEFEPAAG 671

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
            TI AG L    RVIQV +   R  DG+  + Q L       +  +G+E   ++S S AD
Sbjct: 672 STIEAGTLGNGNRVIQVLKSEVRSYDGNLGLAQILPM----YDDDTGAE-PKIVSASFAD 726

Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
           PY+LL   D SI +   D +     ++       + K ++ C LY D             
Sbjct: 727 PYLLLFRDDSSIFVAQSDENNELEEIEREDDALLATKWLTGC-LYAD------------- 772

Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHI 787
              S GV   +    G  +++ ++   +    GAL I+ +P+  N V+  +         
Sbjct: 773 ---SRGVFAPVQSDKGQKVEE-NVMMFLLSAGGALHIYALPDLSNAVYVAEGLC------ 822

Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR-PFLFAILT 846
              ++   L  +     S++ E              + EL +       +R P+L    +
Sbjct: 823 ---FVPPVLSAAYAARRSAARE-------------TITELVVADLGDETARSPYLILRPS 866

Query: 847 DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP 906
              +  Y+      P +TS      S S     S +   ++ N   +R P    +  ET 
Sbjct: 867 TDDLTIYE------PFHTS------SESSGGLASTLQFLKIHNPHLARNP--DVSAAETA 912

Query: 907 HGAPCQR---ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNV 963
            G    R   + +  N+ G+   FL G  P + M   +       L    +   +  H  
Sbjct: 913 DGIQETRDEPMRVISNLGGYCTVFLPGGSPSFIMKSAKSTPKVISLQGLGVRGMSSFHTE 972

Query: 964 NCNHGFIYVTSQGILKICQLPSGSTY 989
            C+ GFIY    G+ ++ QLP  +T+
Sbjct: 973 GCDRGFIYTDVDGLARVSQLPKDTTF 998


>gi|340924328|gb|EGS19231.1| hypothetical protein CTHT_0058560 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 1460

 Score =  146 bits (368), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 237/1042 (22%), Positives = 400/1042 (38%), Gaps = 183/1042 (17%)

Query: 57  NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD--GISAA------- 99
           NLVV  +++++++  +        +Q  G+ + +++   + R+  D  G+ A+       
Sbjct: 28  NLVVAKSSLLQVFRTKTVTTEIDTLQTNGASKGRSAARYENRLANDDDGLEASFLGGDSL 87

Query: 100 ----------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFD 148
                      L LV    L G V  L+ +       SR   +S++LAF DAK+S++E+D
Sbjct: 88  GFRADRTTNTKLVLVYETPLAGTVIGLSKIK---TSTSRSGCESLLLAFRDAKLSLVEWD 144

Query: 149 DSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMI 202
              + L   S+H +E  E       + S    PL      +  DP  RC  +      + 
Sbjct: 145 AERNALGTVSIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALKFAARNLA 198

Query: 203 ILKASQGGSGL-VGDEDTFGSG------------GGFSARIES-----SHVINLRDLD-- 242
           IL   Q    + +GD D    G               ++ IE      S V+ L +LD  
Sbjct: 199 ILPFRQVDEDIDMGDWDEELDGPRPQKDVSNAAVSNGASNIEDTPYSPSFVLRLSNLDPS 258

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
           + H     F+H Y EP   IL              H+T M+  L +    K    I S  
Sbjct: 259 LLHPVHLAFLHEYREPTFGILASTSSASNALGRKDHYTYMVFTLDLQQ--KASTTILSVS 316

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS 361
            LP D Y+++ +P+P+GG L+VG N  IH         +A+N       S     +S  +
Sbjct: 317 GLPQDLYRVVPLPAPVGGALLVGCNELIHIDQSGKPNGVAVNPMTKQCTSFGLADQSDLN 376

Query: 362 VELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDI 415
           + L+      L  D+   L+    G +VL+T   DGR V  L+L    P    +++   I
Sbjct: 377 IRLEGCIIDVLTPDLGEFLMILNDGRMVLITFRIDGRTVSGLELRLVPPASGGTIIPGRI 436

Query: 416 TT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           +T   IG ++ F GS  GDSL+  +     T   +   + +    + D            
Sbjct: 437 STLSRIGKNVMFAGSEEGDSLVFGW-----TKKQTQAGRRKSKPRDDDFYMDDYEEEEEE 491

Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADAS 529
               D+   E  S +   S  +       SF + D L++I P++  +YG  +    ++  
Sbjct: 492 VDEDDLYGEETTSHHQPVSAASSLLSGDLSFRIHDRLISIAPIQSMTYGQPVWMPGSEEE 551

Query: 530 ATGISKQSNYELV--------------------------ELPGCKGIWTVYHKSSRGHNA 563
              I   ++ +LV                          E    +G WT+  K     + 
Sbjct: 552 RNSIGVHADLQLVCAVGRDKSSCLATMNLAIQPKVIGQFEFSEARGFWTMCAKKPIPKSL 611

Query: 564 DSSRMAA------YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQG 609
            S +  +      YD   +Y  ++I++       E   +   TA     +  +      G
Sbjct: 612 QSDKGVSVLGGNDYDTGGQYDRFMIVAKVDLDGYEKSDVYALTAAGFEGLCGTEFDPAAG 671

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            TI AG +    R++Q+ +   R  DG +    +   P   E  +G+E   V + SIADP
Sbjct: 672 ITIEAGTMGKGSRIVQILKSEVRSYDGDFGLSQIV--PMMDEE-TGAEPRAV-TASIADP 727

Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDA 729
           Y+L+   D S  +   D S     ++    +  S K +S C LY+D              
Sbjct: 728 YLLIIRDDSSAFIAGIDSSNELEELRKEDKVLVSSKWLSGC-LYND-------------- 772

Query: 730 WLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIV 788
             ST +          P     I   +   SGAL I+ +P+ +  ++  D          
Sbjct: 773 --STAIFAEETAKSSKPTQS--ILLFLLSSSGALYIYRLPDLSKPIYVTDGLA------- 821

Query: 789 DTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDG 848
             Y+  AL    T       +GT    KE I  + V +L        H  P+L    ++ 
Sbjct: 822 --YIPPALSSDFT-----VRKGT---PKEAITEIMVADLG----DTTHKSPYLILRHSND 867

Query: 849 TILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHG 908
            +  YQ Y ++           + T +  S   +   +L N  F+R P +   +++ P  
Sbjct: 868 DLTIYQPYRYK-----------LGTGQVFS-KTLFFQKLPNPSFARAP-EETEQDDVPPQ 914

Query: 909 APCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNVNCNH 967
                +    NI+G+   FL G  P + +   + + RV P L    ++A +  H   C+H
Sbjct: 915 PRLLSMRRCNNIAGYSTVFLPGHSPSFILKSAKSMPRVVP-LQGAGVIAMSPFHTEGCDH 973

Query: 968 GFIYVTSQGILKICQLPSGSTY 989
           GFIY  S  I ++ Q+P   +Y
Sbjct: 974 GFIYADSHNIARVTQIPEDWSY 995


>gi|452001482|gb|EMD93941.1| hypothetical protein COCHEDRAFT_1129958 [Cochliobolus
           heterostrophus C5]
          Length = 1385

 Score =  146 bits (368), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 224/1025 (21%), Positives = 402/1025 (39%), Gaps = 180/1025 (17%)

Query: 57  NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NLVV   ++++++ ++     V   G  E++N+      E     L    S A L LV  
Sbjct: 28  NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + L G V SLA +         R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88  FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYSLHTISIHYYENPD 145

Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ------------ 208
                 W    +   +F     +  DP  RC  +      + IL   Q            
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200

Query: 209 -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
            G      ++ T  + G  +    SS V+ L +LD  + H     F+H Y EP   I+  
Sbjct: 201 DGPKESKPEQQT--ASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 258

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
              T    ++ +      S  ++    K    + S   LP+D  +++ +PSPIGG L+VG
Sbjct: 259 SRDTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVG 318

Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
           +N  IH      +  +A+N +A +  S     +S  ++ L+      L ++    L+   
Sbjct: 319 SNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALRLEGCSVELLSHEAGDVLVVLN 378

Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
            G L++LT   DGR V  + +                S  + +G    F+GS  GDS+++
Sbjct: 379 NGRLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVML 438

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
            +T        +S L+ +  +   D            D   D+ N    ++   +A+ + 
Sbjct: 439 GWTS------TASHLRRKQSNANIDTDEDMSDEEDMEDMEDDLYNDTAPAVQKITAAASE 492

Query: 495 ESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATGISKQ 536
            +A  T++F + D L +I P+K+                   S G    A AS T ++++
Sbjct: 493 PTAPGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEVMLSTGR--GAAASITALNRE 550

Query: 537 SNYELV---ELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR- 586
            +   V   +LP  +G W V+ +       +     D     A + +Y  YL++S     
Sbjct: 551 LHPVTVATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGED 610

Query: 587 ----TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
               T+V E   + LTE  +      +G T+  G L    +V+QV     R  D S +  
Sbjct: 611 GTESTVVYEVNGNELTETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEIRTYD-SELNM 669

Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
           D      + ESG+      V++ S ADPY+L+   D S+++               A  +
Sbjct: 670 DQILPMEDEESGN---EVNVINASFADPYLLVLREDSSVKIFR-------------ATGD 713

Query: 702 SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
              + V +  L             S   WLS  + ++            ++++ +    G
Sbjct: 714 GELEDVEATGL-------------SNSQWLSASLFKSASFT--------EVFAFLLTPEG 752

Query: 762 ALEIFDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKEN 818
            L +F V +      V + +S    ++   Y+  R A+K + TEI               
Sbjct: 753 GLRVFAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI--------------- 797

Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
                   LA     A    P L    +   ++ Y+A+                 S S S
Sbjct: 798 --------LAADLGDATTKSPHLIVRTSSDNLVIYKAF----------------HSPSRS 833

Query: 879 VSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCM 937
            +++    LR ++ S+  +  YT +     +  +  +    +I G+   F  G+ P +  
Sbjct: 834 AADLWTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLTLSDIGGYSTVFQRGTTPAF-- 891

Query: 938 VFRERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-W 993
           +F+E     P+   L    + + T  H  +C  GF Y+ S   L+I QLP  + Y +  W
Sbjct: 892 IFKESSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGW 950

Query: 994 PVQKV 998
             +++
Sbjct: 951 ATRRM 955


>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
 gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
          Length = 1403

 Score =  146 bits (368), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 226/1032 (21%), Positives = 408/1032 (39%), Gaps = 176/1032 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V  + +++++ +     GS   ++  +T+ +          L LV  Y L G +  L
Sbjct: 28  NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80

Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                + S+ G +      ++++   +AK+S++E+D   H +  TS+H +E  + +H+  
Sbjct: 81  GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133

Query: 173 GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
              + A  P  + VDP  RC  VL +G + + IL   Q G  LV                
Sbjct: 134 WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192

Query: 215 -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
                 DE    +G  F     SS V+ +  L+  M H     F++ Y EP   IL+ + 
Sbjct: 193 HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252

Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
            T +  +  +      S  ++    +    + S   LP+D +K++A+P P+GG L++G N
Sbjct: 253 ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312

Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
             +H      + A+ +N +A    S     +S   + L+ +    L  +N   LL    G
Sbjct: 313 ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372

Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
            + +L+   DGR V  + L            K  PS        +G    F GS   DS+
Sbjct: 373 KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
           L+      G S  S+  K+       D         SS +  +D  +  E  LY      
Sbjct: 429 LI------GWSRPSTRPKDPPVQGAGD---DNIAELSSDEEEEDDEDIYEDDLYATPVPT 479

Query: 488 GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
           G+ +  + S + T    ++F + D L N+GP++D + G      + D      S  +N E
Sbjct: 480 GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 539

Query: 541 LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
           LV   G                            G W+V+ K  +  +   S        
Sbjct: 540 LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 599

Query: 575 YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
           Y  YL++S      + +++V   +    E T++ ++   + RTI  G L G  RV+QV +
Sbjct: 600 YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 659

Query: 629 RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
              R  D G  + Q       +      SE   V+  S ADPYVL+   D S+ LL  D 
Sbjct: 660 GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 714

Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
           S     ++    I S+     S +LY DK       +T+    LS  V            
Sbjct: 715 SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 760

Query: 748 DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
               +  ++  ES  L IF +PN    VFT +                   D   +I S+
Sbjct: 761 ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 798

Query: 807 SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
                    +E++  + V ++      +    P+L    ++  ++ Y+ Y      +T+ 
Sbjct: 799 EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 848

Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
           S +  S       S++   +  N  F +    +   + +  GA  + + +  ++ G++  
Sbjct: 849 STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 901

Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
           F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++++C+ P  
Sbjct: 902 FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 961

Query: 987 STYDNYWPVQKV 998
           + +D  W  +K+
Sbjct: 962 THFDGSWATRKI 973


>gi|451849663|gb|EMD62966.1| hypothetical protein COCSADRAFT_92785 [Cochliobolus sativus ND90Pr]
          Length = 1405

 Score =  145 bits (367), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 223/1021 (21%), Positives = 400/1021 (39%), Gaps = 172/1021 (16%)

Query: 57  NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NLVV   ++++++ ++     V   G  E++N+      E     L    S A L LV  
Sbjct: 28  NLVVAKNSLLQVFELKSTTTEVTPGGGDEAENAAANLDTEAADVPLQRTESTAKLVLVGE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + L G V SLA +         R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88  FPLAGTVVSLARVK--ALSTKSRGEALLVAFRDAKLSLVEWDPESYNLHTISIHYYENPD 145

Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ-------GGSGL 213
                 W    +   +F     +  DP  RC  +      + IL   Q         S  
Sbjct: 146 LPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGSHNLAILPFRQRDLVDDDYDSDA 200

Query: 214 VGDEDTF----GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
            G +++      + G  +    SS V+ L +LD  + H     F+H Y EP   I+    
Sbjct: 201 DGPKESKLEQQAASGSHTTPYTSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAASR 260

Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
            T    ++ +      S  ++    K    + S   LP+D  +++ +PSPIGG L+VG+N
Sbjct: 261 DTAPSLLAHRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITRVVPLPSPIGGALLVGSN 320

Query: 328 -TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTG 384
             IH      +  +A+N +A +  S     +S  ++ L+      L ++    L+    G
Sbjct: 321 EIIHVDQGGKTSGVAVNEFAKTCTSFPLSDQSDMALRLEGCSVELLSHEAGDVLIVLNNG 380

Query: 385 DLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
            L++LT   DGR V  + +                S  + +G    F+GS  GDS+++ +
Sbjct: 381 RLLVLTFTLDGRTVSGMTVHPVAADHGGHLIKAAASCTSNLGRGRLFVGSEDGDSVMLGW 440

Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTES 496
           T        +S L+ +  +   D            D   D+ N    ++   +A+ +  +
Sbjct: 441 TS------TASHLRRKQSNANIDTDEDMSDEEDMDDMEDDLYNDTAPAVQKITAAASEPT 494

Query: 497 AQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL--- 541
           A  T++F + D L +I P+K+            + G + ++    A       N EL   
Sbjct: 495 APGTYTFRIHDVLPSIAPIKNAVLHPGKDTESLNRGEIMLSTGRGAAAAITALNRELHPV 554

Query: 542 ----VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISLEAR----- 586
                +LP  +G W V+ +       +     D     A + +Y  YL++S         
Sbjct: 555 TAATRQLPSARGTWAVHARKQAPGDVTAAFGEDMEANMATNVDYDQYLVVSKTGEDGTES 614

Query: 587 TMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
           T+V E   + LTE  +      +G T+  G L    +V+QV     R  D S +  D   
Sbjct: 615 TVVYEVNGNELTETDKGDFEREEGSTLFVGILAAGTKVVQVMRTEIRTYD-SELNMDQIL 673

Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
              + ESG+      V++ S ADPY+L+   D S+++               A  +   +
Sbjct: 674 PMEDEESGN---ELNVINASFADPYLLVLREDSSVKIFR-------------ATGDGELE 717

Query: 706 PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
            V +  L             S   WLS  + ++            ++++ +    G L +
Sbjct: 718 DVEATGL-------------SNSQWLSASLFKSASFT--------EVFAFLLTPEGGLRV 756

Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDT-YM--REALKDSETEINSSSEEGTGQGRKENIHSM 822
           F V +      V + +S    ++   Y+  R A+K + TEI                   
Sbjct: 757 FAVSDMEKPCYVAEALSFLPPVLGMDYVPKRSAIKATITEI------------------- 797

Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
               LA     A    P L    +   I+ Y+A+                 S S S +++
Sbjct: 798 ----LAADLGDATTKSPHLIIRTSSDNIVIYKAF----------------HSPSRSAADL 837

Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVFRE 941
               LR ++ S+  +  YT +     +  +   +   +I G+   F  G+ P +  +F+E
Sbjct: 838 WTKNLRWVKLSQQHIPRYTEDGGAEDSGFESTLLALSDIGGYSTVFQRGTTPAF--IFKE 895

Query: 942 RLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY-WPVQK 997
                P+   L    + + T  H  +C  GF Y+ S   L+I QLP  + Y +  W  ++
Sbjct: 896 SSSA-PRVIGLSGKPVKSLTSFHTSSCQRGFAYLDSTDTLRISQLPPQTHYGHLGWATRR 954

Query: 998 V 998
           +
Sbjct: 955 M 955


>gi|367018592|ref|XP_003658581.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
            42464]
 gi|347005848|gb|AEO53336.1| hypothetical protein MYCTH_2294503 [Myceliophthora thermophila ATCC
            42464]
          Length = 1547

 Score =  145 bits (366), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 233/1003 (23%), Positives = 382/1003 (38%), Gaps = 174/1003 (17%)

Query: 94   DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR------------DSIILAFEDAK 141
            D  +   L LV  + L G V  LA +    A+ +               DS+++AF DA+
Sbjct: 93   DRANTTKLVLVAEFPLAGTVTGLARIRTPKANRNHDGGAGHAGHAGHGCDSLLIAFRDAR 152

Query: 142  ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVL 195
            +S++E+D   H L   S+H +E  E       + S    PL      +  DP  RC  + 
Sbjct: 153  LSLVEWDAEQHTLSTISIHYYEQEEL------QGSPWAAPLSHYVNFLVADPGSRCAALK 206

Query: 196  VYGLQMIILKASQGGSGL-VGDEDTFGSGGGFSARIESSHVIN----------------- 237
                 + IL   Q    + +GD D    G   +    S+ V+N                 
Sbjct: 207  FGARNLAILPFRQADEDIDMGDWDEELDGPRPAKDPSSNAVVNGASNIEDTPYSPSFVLR 266

Query: 238  LRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
            L +LD  + H     F+H Y EP   IL              H   M+  L +    K  
Sbjct: 267  LSNLDPSLLHPVHLAFLHEYREPTFGILASATAPSNALGRKDHLVYMVFTLDLQQ--KAS 324

Query: 296  PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQE 354
              I S   LP D ++++ +P+P+GG L+VG+N  IH         +A+N       +   
Sbjct: 325  TTILSVSGLPQDLFRVVPLPAPVGGALLVGSNELIHVDQSGKPNGVAVNPMTRQCTNFGL 384

Query: 355  LPRSSFSVELDAAHATWLQNDVALLST--KTGDLVLLTVVYDGRVVQRLDLSKTNPSV-- 410
            + +S  ++ L+      L  D+  L      G   ++T   DGR V  L++     S   
Sbjct: 385  VDQSDLNLRLEGCAIDVLTPDLGELFVVLNDGRAAVVTFRIDGRTVSGLEIKMLPESAGG 444

Query: 411  -----LTSDITTIGNSLFFLGSRLGDSLLVQFT---CGSGTSMLSSGLKEEFGDIEADAP 462
                   S ++ IG +  F G   GDSLL+ +      +G   L +      GD++A+  
Sbjct: 445  SLIPGRVSTLSRIGRNAVFAGREEGDSLLLGWAKRQAQTGRRRLRARDAAGSGDVDAEG- 503

Query: 463  STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT-------------FSFAVRDSL 509
                L     D + +  + +E           ESA +               SF V D L
Sbjct: 504  --AELAEGDEDVVAEGEDEDEDEEDEDDLYGEESAPRQQPVSAASSFLSGDVSFRVHDRL 561

Query: 510  VNIGPLKDFSY----------------GLRINADASAT-GISKQSNYELV---------- 542
            +++ P++  +Y                G+R + +   T G  K +    V          
Sbjct: 562  LSVAPIQALTYSQPVYLAGSEEERNSAGVRSDLNLVCTVGRDKSAALATVNLAIQPRVIG 621

Query: 543  --ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYDD--EYHAYLIISLEARTMVLETA 593
              E P  +G WTV       KS +G  A +S    YD   +Y  ++I++ +      E +
Sbjct: 622  RFEFPEARGFWTVCAKKPVPKSLQGDKAGNSLSKDYDTAGQYDRFMIVA-KVDLDGYEKS 680

Query: 594  DLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
            D+        +           G TI AG +    R+IQ+ +   R  DG +    +   
Sbjct: 681  DVYALTAAGFEGLGGTEFDPAAGITIEAGTMGKGSRIIQILKSEVRCYDGDFGLSQIV-- 738

Query: 647  PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
            P   E  +G+E   V S SI DP++L+   D S  +   D S     +       +S K 
Sbjct: 739  PMLDEE-TGAEPRAV-SASIVDPFLLIIRDDSSAFIAQVDSSNELEELDKEDPTLASTKW 796

Query: 707  VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIF 766
            ++ C LY D          +T A+     G+      GG L Q  +   +   SGAL I+
Sbjct: 797  LTGC-LYAD----------TTGAFAEEAPGK------GGKLSQ-SVLMFLLSASGALHIY 838

Query: 767  DVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVE 826
             +P+ +    V + +S        Y+   L       + S+ +GT    KE I  + V +
Sbjct: 839  RLPDLSKPVYVAEGLS--------YIPPGLS-----ADYSARKGTA---KETIAEILVAD 882

Query: 827  LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASR 886
            L        H  P L    T+  +  YQ + +    NT      +  S++L        +
Sbjct: 883  LG----DMTHKSPHLILRHTNDDLTLYQPFRY----NTGAG---LEFSKTLFF-----QK 926

Query: 887  LRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
            L N  F+++P +A   E T H      +    N+ G+   FL G+ P + +   + +   
Sbjct: 927  LPNTVFAKSPEEADDDEAT-HQPRFLSMRRCANVGGYSTVFLPGASPSFIIKSSKSVPKV 985

Query: 947  PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
              L    ++A +  H   C HGFIY  S+ + ++ QLP   +Y
Sbjct: 986  LPLQGTGVIAMSPFHTEGCEHGFIYADSRDMARVAQLPQDWSY 1028


>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Talaromyces stipitatus ATCC 10500]
 gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Talaromyces stipitatus ATCC 10500]
          Length = 1382

 Score =  145 bits (365), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 242/1027 (23%), Positives = 399/1027 (38%), Gaps = 186/1027 (18%)

Query: 57  NLVVTAANVIEIYVV-------RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
           NLVV   ++++IY +        V E G + + N    KR           L+L   Y L
Sbjct: 28  NLVVIKTSLLQIYNLVTETVTPSVLENGQRANDNE---KRN------ETTKLQLFAEYDL 78

Query: 110 HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
           HG V  +   S+    NSR   D+++L+F +AK+S++E++  I  +   S+H +E  +  
Sbjct: 79  HGTVTDI---SRINILNSRSGGDALLLSFRNAKLSLIEWNPEIQNISTVSIHYYEKEDIT 135

Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-------DTF 220
                 +       + VDP  RC  VL +G++ + IL   Q G  LV DE       D F
Sbjct: 136 LSPWAPDLSQCDSHLTVDPSSRCA-VLNFGVRNLAILPFHQAGDDLVMDEYDPDLDMDDF 194

Query: 221 GSGGGFSARIES----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
                 ++  +S                S V+ L  LD  + H     F+H Y EP   I
Sbjct: 195 TGQDKNTSHTDSKKGTEKDHTHQTPYAASFVLPLTALDPTLIHPIGLTFLHEYREPTFGI 254

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
           L+    T A  +  +    + S  ++    +    + S   LP D   ++A+P+P+GG L
Sbjct: 255 LYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALPAPVGGAL 314

Query: 323 VVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
           ++G+N  IH      + A+A+N +A  + S   + +S   + L+ +    +  +    LL
Sbjct: 315 LIGSNELIHVDQSGKASAVAVNEFAKQVSSFPMIDQSDLGLRLENSVVEVINKECGDILL 374

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT--------TIGNSLFFLGSRLGD 431
           +  TG+LVL+    DGR V    +    P+    D+         ++G+   F+GS   D
Sbjct: 375 TLSTGELVLVHFKIDGRSVSGPVVCPV-PTNSGGDVVGATASCSISLGSGKVFIGSEDTD 433

Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGEELSLYGS 489
           SLL+     S  S  S    E+  D + +      +      S A ++ VN        +
Sbjct: 434 SLLLDCYVSSAVSKKSKDHGEDQFDEDMNDEDDDDMYEDDLYSSAPKEAVNK-------A 486

Query: 490 ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
            SN   SA + +SF V D L ++  L+  + G   + D+ A  +S QS +EL EL    G
Sbjct: 487 VSNG--SASEDYSFRVLDKLPSLASLRSVTVGKPASRDSDAGNVS-QSVHEL-ELAAAYG 542

Query: 550 ---------IWTVYH----KSSRGHNADS------SRMAAYDDEYHAYLIISLEARTMVL 590
                    +    H     +  G  ADS      S  +  +D          E+ + V 
Sbjct: 543 SGRNGGVALLQRALHLDGISTMNGETADSVWNINTSTKSGRNDPSEG------ESPSYVF 596

Query: 591 ETADLLTEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARIL 634
            T    T+  E++ Y V G                 T+  G L G  RV+QV     R+ 
Sbjct: 597 LTKSNSTDNEETLVYAVNGSNLEPFSAPDVNPNGDPTVDIGTLAGNSRVVQVLTGEVRVY 656

Query: 635 DGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
           D +  M Q     P   E   G E   V S S ADPY+L+   D S+ LL  D S     
Sbjct: 657 DTNLGMAQ---IYPVWDED-EGDERFAV-STSFADPYLLIIRDDSSVLLLHSDESGDLDE 711

Query: 694 VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIY 753
           +  P  I SS+  +  C LY DK                  V E  D A       G+ Y
Sbjct: 712 LSKPETI-SSQSWLCGC-LYTDK----------------HNVFE--DNA------TGNTY 745

Query: 754 SVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ 813
             +  +   L +F +P    V   +                   D  + I SS +     
Sbjct: 746 MFLLNQECKLFMFRLPTRELVSVTEGV-----------------DYVSSILSSDQPAKRL 788

Query: 814 GRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVST 873
             +E I  + V +L         + P+L        ++ Y+               PV  
Sbjct: 789 NSRETIAELLVADLG----EISTASPYLIIRSATDDLIIYK---------------PVRE 829

Query: 874 SRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
           +     + V+   ++  N    + P++A   +        +R+    +I G+    +SG+
Sbjct: 830 NSKDEKTGVTLKYIKESNHFLPKVPIEAAATDTQQRMPGLRRLA---DIGGYAAVLMSGA 886

Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
            P   +   + L     +   SI   +   +  C  G IYV ++ +++ C+L   +  D 
Sbjct: 887 SPSLVVRTSKSLPRVFSIQSDSIRGISGFDSAGCEKGLIYVDNEHVVRTCRLHDNTQLDF 946

Query: 992 YWPVQKV 998
            WP++K+
Sbjct: 947 SWPIRKI 953


>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
 gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
          Length = 1187

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 95/299 (31%), Positives = 144/299 (48%), Gaps = 19/299 (6%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+      I +Y +   +E       S      +  D      LE +  Y  +G +  +
Sbjct: 29  NLLTAGPTCIRVYDIIKDQEDIDLDNRSDNADNHLNKDNKLHPELEFLASYSFYGKIYGI 88

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
               +        RDS+ + F DAK+S++E+D     L   S+H FE  E   LK G   
Sbjct: 89  ----ESVRFRHHHRDSLFICFADAKLSLVEYDADNSNLTTLSLHTFEDDE---LKNGFSR 141

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-DTFGSGGGFSARIESSHV 235
               P+++VDP  RC  ++V  + + IL     G      + D   + G +   +  S+V
Sbjct: 142 NLSIPIIRVDPDNRCAAMVVSNVHLAILPFRHRGPAEQQVQIDPKNTSGKYP--LMPSYV 199

Query: 236 INLRDLDMKHVKDFI---FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
           +++RDL  + V   I   F+ GY EP ++IL E   TW+GRV+ +  TC I A+S++T  
Sbjct: 200 VDVRDLGNEKVSRLIDIRFLEGYYEPTILILCEILRTWSGRVAVRQDTCSILAVSLNTID 259

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
           K HP+IWS  NLP D    + VP PIGGVL+  AN + + +QS         YA SL+S
Sbjct: 260 KVHPVIWSLNNLPFDCLGAITVPRPIGGVLIFAANCLLHLNQSKP------PYAESLNS 312



 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 88/391 (22%), Positives = 159/391 (40%), Gaps = 100/391 (25%)

Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT--ESAQKTFSFAVRDSLVNIGPL 515
           + D P++K+LR       +++       LY + ++ T  ES  ++++F V D ++++GP 
Sbjct: 336 DTDEPTSKKLRTDDEKEDEELE-----KLYSAHTSCTAKESYLRSYTFEVCDRILHVGPC 390

Query: 516 KDFSYGLRINADASATGISKQSNYELV--------------------------ELPGCKG 549
              + G        +T + ++S+ E+V                          +LPGC  
Sbjct: 391 ASIAIG------QISTFVQEESDVEVVICSGHDKNGALSVLNKGIKPQVVASYDLPGCVD 444

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
           +WTV  K  R ++ +        +  H +LIIS +  TM+L T   +TEV E + +  Q 
Sbjct: 445 MWTV--KDIRLNDENDGDFET--ENTHKFLIISRDNLTMILRTGKEITEV-EQLGFLTQT 499

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
           +T+ AGNL     +IQV      ++      Q L               S ++  S+ DP
Sbjct: 500 KTVFAGNLDNGNCIIQVTPYEVILVSKGEKIQQLEL----------ENESPIVFCSLQDP 549

Query: 670 YVLLGMSDGSIRLL---VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD----------- 715
           Y+ L +  GSI +L   + D     V +     +  S+  +++C L+ D           
Sbjct: 550 YISLLLEGGSIMMLAFELSDNGEKQVKLVNTTPLNHSR--IAACCLFQDNNGRMSVSDGI 607

Query: 716 --KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI--------------------- 752
             + P P    T+  A L       ID  +   LD  D                      
Sbjct: 608 SIRTPSP----TNEPAELMEDEKFTIDDDELLYLDVNDTNLQTNDVPVASTSYTDNLERK 663

Query: 753 ---YSVVCYESGALEIFDVPNFNCVFTVDKF 780
              +  +C ++G LE++ +P+++ V+TV+ F
Sbjct: 664 VSYWLFLCLDNGKLEVYSIPSYDKVYTVNGF 694



 Score = 48.1 bits (113), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 28/50 (56%)

Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           L DG +  F   +  NC +GF+Y  S+  L+IC L    TYD  WPV KV
Sbjct: 767 LVDGYVKCFAPFNIANCPNGFLYFNSEEDLRICVLDQRFTYDCPWPVHKV 816


>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
 gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
          Length = 1402

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 227/1032 (21%), Positives = 408/1032 (39%), Gaps = 177/1032 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V  + +++++ +     GS   ++  +T+ +          L LV  Y L G +  L
Sbjct: 28  NLIVAKSTLLQVFNLVNVVYGSAPGQSDEKTRSQY-------TKLVLVAEYALSGTITDL 80

Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                + S+ G +      ++++   +AK+S++E+D   H +  TS+H +E  + +H+  
Sbjct: 81  GRVKILNSKSGGE------AVLVGTRNAKLSLIEWDPERHKIATTSIHYYERDD-VHISP 133

Query: 173 GRESFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
              + A  P  + VDP  RC  VL +G + + IL   Q G  LV                
Sbjct: 134 WTPNLANCPSHLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDFDSDVEEPPRDTN 192

Query: 215 -----GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERE 267
                 DE    +G  F     SS V+ +  L+  M H     F++ Y EP   IL+ + 
Sbjct: 193 HTAEGQDEAKKSNGLAFHTPYASSFVLPIAALEPAMLHPISLAFLYEYREPTFGILYSQV 252

Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
            T +  +  +      S  ++    +    + S   LP+D +K++A+P P+GG L++G N
Sbjct: 253 ATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVALPPPVGGALLIGTN 312

Query: 328 T-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTG 384
             +H      + A+ +N +A    S     +S   + L+ +    L  +N   LL    G
Sbjct: 313 ELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEGSIVEQLGTENGDMLLVLLNG 372

Query: 385 DLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
            + +L+   DGR V  + L            K  PS        +G    F GS   DS+
Sbjct: 373 KMAVLSFKLDGRSVSGISLRLVPDLAGGSLLKARPSC----SVPLGRGKIFFGSEESDSV 428

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY------ 487
           L+      G S  S+  K    D          +   SSD  +D  +  E  LY      
Sbjct: 429 LI------GWSRPSTRPK----DPPVQGAGDDNIAELSSDEEEDDEDIYEDDLYATPVPT 478

Query: 488 GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYE 540
           G+ +  + S + T    ++F + D L N+GP++D + G      + D      S  +N E
Sbjct: 479 GAKARGSLSVKGTNLNDYTFRIHDRLWNLGPMRDLTLGRPAGSRDKDKRQPVSSLSTNLE 538

Query: 541 LVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDE 574
           LV   G                            G W+V+ K  +  +   S        
Sbjct: 539 LVATQGYGKAGGLTILRREIDPYVIDSLMIKDTDGAWSVHVKDPKLPSQSGSLPLNASSN 598

Query: 575 YHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFE 628
           Y  YL++S      + +++V   +    E T++ ++   + RTI  G L G  RV+QV +
Sbjct: 599 YDHYLLLSKSKGSDKEKSVVYTMSSGGLEETKASEFNPNEDRTIDIGTLAGGTRVVQVLK 658

Query: 629 RGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
              R  D G  + Q       +      SE   V+  S ADPYVL+   D S+ LL  D 
Sbjct: 659 GEVRSYDSGLGLAQIFPVWDEDM-----SEEKYVVHASFADPYVLIIRDDQSVLLLQADG 713

Query: 688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
           S     ++    I S+     S +LY DK       +T+    LS  V            
Sbjct: 714 SGDLDEIEADGIINSTT--WISGSLYQDKYRSFMSYETAPSRKLSDNV------------ 759

Query: 748 DQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSS 806
               +  ++  ES  L IF +PN    VFT +                   D   +I S+
Sbjct: 760 ----LLFLLSSES-KLHIFHLPNAKEPVFTAECV-----------------DLLPQILST 797

Query: 807 SEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSK 866
                    +E++  + V ++      +    P+L    ++  ++ Y+ Y      +T+ 
Sbjct: 798 EPPPKRATYRESLTEILVADIG----DSVSRTPYLILRSSNNDLILYEPY------HTTH 847

Query: 867 SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGF 926
           S +  S       S++   +  N  F +    +   + +  GA  + + +  ++ G++  
Sbjct: 848 STEKKS-------SDLRFLKTINHHFPKFHAGSNVEDSSHIGALPKPLRVLGDVCGYRTV 900

Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG 986
           F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++++C+ P  
Sbjct: 901 FMPGNSPCFVIKSSTSIPHVLNLRGKTVHSLSSFNIPACERGFVYVDADNVVRMCRFPRN 960

Query: 987 STYDNYWPVQKV 998
           + +D  W  +K+
Sbjct: 961 THFDGSWATRKI 972


>gi|46120520|ref|XP_385083.1| hypothetical protein FG04907.1 [Gibberella zeae PH-1]
          Length = 1436

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 228/1002 (22%), Positives = 380/1002 (37%), Gaps = 165/1002 (16%)

Query: 72  RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
           R  ++   ES   G     V  D  +   L LV    L G V  LA +       GG   
Sbjct: 68  RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124

Query: 127 SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
               +++++A++ AK+ +  +D     L   S+H +E  E LH      SF      ++ 
Sbjct: 125 ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEKEE-LHGAPWEVSFDEYANYLEA 179

Query: 186 DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIESS 233
           DP  RC         + IL   Q    L  D+            +T     G S  +E  
Sbjct: 180 DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETAAVANGDSDTVEPP 239

Query: 234 HV------INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
           +       + L D  + H   F F+H Y EP   IL   +          H T  +  L 
Sbjct: 240 YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLD 299

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
           +    +    I S  +LP D +K+LA+P+P+GG L++G N  IH      +  +A+N+ A
Sbjct: 300 LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357

Query: 347 VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
             + S     ++  ++ L+        ++N   LL    G + +++ + DGR V  L + 
Sbjct: 358 RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSIK 417

Query: 405 ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
               +   +VL S  +T   +G + FF+GS +GDS+++ +T   G        K    D 
Sbjct: 418 MVTDENGGNVLKSRASTASKLGKNTFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474

Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
           +      +       D   D+   E  +   + + N        SF + D+L++I P+KD
Sbjct: 475 DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534

Query: 518 FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
            + G                + +D   A   G  K  +  ++            E P  +
Sbjct: 535 LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETADLLT----- 597
           G WT+  K         S  A   DEY A      Y+I++ +      ET+D+       
Sbjct: 595 GFWTMSVKKPLPKALGGS--AGVGDEYEAFGQHDKYMIVA-KVDLDGYETSDVYALTGAG 651

Query: 598 -EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGS 654
            E  +  ++    G T+ AG +  + R+IQV +   R  DG   +TQ L     + E+G+
Sbjct: 652 FETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA 709

Query: 655 GSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYH 714
                 V S SI DPY+LL   D S+ L   D +     V+   A   + K  + C LY 
Sbjct: 710 ---EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYA 765

Query: 715 DKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-C 773
           D                 T      +  D G  ++  I   +   +GAL ++ +P+ +  
Sbjct: 766 D-----------------TEGAFQFNANDKGETEK--IMMFLLSSTGALHVYALPDLSKP 806

Query: 774 VFTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
           V+  +       H+  D  +R  L                   KE +  + V +L     
Sbjct: 807 VYVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG---- 844

Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLR 888
                 P+L        +  Y+               P+   R    SN+SA+    ++ 
Sbjct: 845 DTISQSPYLILRNQTDDLTIYE---------------PIHHVRPGGESNLSAALSFKKMS 889

Query: 889 NLRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP 947
           N+  + TP      + E P   P +R     NI+G+   FL GS P + +   + +    
Sbjct: 890 NVTLATTPAQTEDDDVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVI 946

Query: 948 QLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            L    I   +  H   C+ GFIY   +GI ++ Q PS + +
Sbjct: 947 GLQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNF 988


>gi|408396642|gb|EKJ75797.1| hypothetical protein FPSE_03977 [Fusarium pseudograminearum CS3096]
          Length = 1427

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 228/1001 (22%), Positives = 377/1001 (37%), Gaps = 163/1001 (16%)

Query: 72  RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GGADN 126
           R  ++   ES   G     V  D  +   L LV    L G V  LA +       GG   
Sbjct: 68  RANDDDGLESSFLGGETMIVKTDRTNNTKLVLVAELPLSGAVTGLAKVKTKHSKCGG--- 124

Query: 127 SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKV 185
               +++++A++ AK+ +  +D     L   S+H +E  E LH      SF      ++ 
Sbjct: 125 ----EALLIAYKAAKLCMAVWDPEKSTLETISIHYYEK-EELHGAPWEVSFDEYANYLEA 179

Query: 186 DPQGRCGGVLVYGLQMIILKASQGGSGLVGDE------------DTFGSGGGFSARIESS 233
           DP  RC         + IL   Q    L  D+            +T     G S  +E  
Sbjct: 180 DPGSRCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKETATVANGDSDTVEPP 239

Query: 234 HV------INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
           +       + L D  + H   F F+H Y EP   IL   +          H T  +  L 
Sbjct: 240 YTPSFVLRLPLLDPSLLHPVHFAFLHEYREPTFGILSSSQEPAHSLGQKDHLTYKVFTLD 299

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
           +    +    I S  +LP D +K+LA+P+P+GG L++G N  IH      +  +A+N+ A
Sbjct: 300 LQQ--RASTTILSVTDLPRDLFKILALPAPVGGALLIGENELIHVDQSGKANGVAVNSMA 357

Query: 347 VSLDSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
             + S     ++  ++ L+        ++N   LL    G + +++ + DGR V  L + 
Sbjct: 358 RQITSFSLTDQADLNLRLEHCVVEQLHIENGELLLVLNDGQIGIVSFLIDGRTVSGLSVK 417

Query: 405 ----KTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
               +   +VL S  +T   +G + FF+GS +GDS+++ +T   G        K    D 
Sbjct: 418 MVTDENGGNVLKSRASTASKLGKNAFFVGSEMGDSVVLGWTRKMGQEKRR---KPRLIDT 474

Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
           +      +       D   D+   E  +   + + N        SF + D+L++I P+KD
Sbjct: 475 DIALDVDELDLEDDDDEDDDLYGTESAAAKPAQALNGSGRSGELSFRIHDTLLSIAPIKD 534

Query: 518 FSYGLR--------------INAD---ASATGISKQSNYELV------------ELPGCK 548
            + G                + +D   A   G  K  +  ++            E P  +
Sbjct: 535 LTPGKTSFLPDSEEMTLSDGVVSDLHLACIVGRGKAGSLAILNRNIQPKIIGRFEFPEAR 594

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEY-----HAYLIISLEARTMVLETADLLT------ 597
           G WT+  K         S  A   DEY     H   +I  +      ET+D+        
Sbjct: 595 GFWTMSVKKPLPKALGGS--AGVGDEYETFGQHDKYMIVAKVDLDGYETSDVYALTGAGF 652

Query: 598 EVTESVDY-FVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
           E  +  ++    G T+ AG +  + R+IQV +   R  DG   +TQ L     + E+G+ 
Sbjct: 653 ETLKETEFDPAAGFTVEAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA- 709

Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
                V S SI DPY+LL   D S+ L   D +     V+   A   + K  + C LY D
Sbjct: 710 --EPRVTSASIVDPYLLLIRDDSSLLLAQIDSNNELEEVEKMDATLQNTKWHAGC-LYAD 766

Query: 716 KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CV 774
                            T     +  +D G  ++  I   +   +GAL ++ +P+ +  V
Sbjct: 767 -----------------TKGAFQLSASDKGETEK--IMMFLLSSTGALHVYALPDLSKPV 807

Query: 775 FTVDKFVSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
           +  +       H+  D  +R  L                   KE +  + V +L      
Sbjct: 808 YVAEGLSYVPPHLSADYTLRRGLA------------------KETLREILVADLG----D 845

Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS----RLRN 889
                P+L        +  Y+               P+   R    SN+SA+    +  N
Sbjct: 846 TISQSPYLILRNQTDDLTIYE---------------PIRHVRPGGESNLSAALSFKKTSN 890

Query: 890 LRFSRTPLDAYTRE-ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ 948
           +  + TP      E E P   P +R     NI+G+   FL GS P + +   + +     
Sbjct: 891 VTLATTPAQTEDDEVEQPRFMPMRRCA---NINGYSTVFLPGSSPSFVLKSSKSIPRVIG 947

Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
           L    I   +  H   C+ GFIY   +GI ++ Q PS + +
Sbjct: 948 LQGLGIRGMSSFHTEGCDRGFIYADDKGIARVTQFPSDTNF 988


>gi|322694449|gb|EFY86278.1| Cleavage factor two protein 1 [Metarhizium acridum CQMa 102]
          Length = 1431

 Score =  142 bits (358), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 230/993 (23%), Positives = 377/993 (37%), Gaps = 144/993 (14%)

Query: 72  RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
           R  ++   ES   G     V  D      L L+    L G V  LA +     +     +
Sbjct: 70  RANDDDGLESSFLGVESLIVRADPSHNTKLVLISEIPLAGTVIGLARVKI--KNTPSGGE 127

Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
           +++LA++ AK+ + E+D   H L  TS+H +E  E   L+        G  V   + DP 
Sbjct: 128 ALLLAYKAAKMCLTEWDPQRHTLETTSIHYYEKDE---LQGAPWEMPFGDYVNYLEADPG 184

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
            RC         + IL  +Q    L  D   ED             T G G G      +
Sbjct: 185 SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPCPVKEEPPTTNGDGPGDHDLVKS 244

Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
           R   S V+ L  LD  + H     F+H Y EP   IL   +          H T  +  L
Sbjct: 245 RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSPALGIKDHLTYKVFTL 304

Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
            +    +    I S   LP D ++++A+P+P+GG L+VG N  IH         +A+N+ 
Sbjct: 305 DLQQ--RASTTILSVTGLPQDLFRVIALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
           A  + S   + +S   + L+      L ND+   LL    G L ++    DGR V ++ +
Sbjct: 363 AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIICFHIDGRTVSKISI 422

Query: 404 ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
               ++   +++ S ++ I   G++  FLGS   DS+++ ++   G        K +   
Sbjct: 423 RLVSAECGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476

Query: 457 IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
           +     +         D   D + G + SL   S + N  S     SF V+D+L++I P+
Sbjct: 477 LLDPDLALDVDDLDLDDDEDDDLYGNDSSLAKPSQTINGSSKPGEVSFRVQDTLLSIAPI 536

Query: 516 KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
           +D + G      D+    +SK    EL                             + P 
Sbjct: 537 RDVACGAPAFVPDSEEATLSKGVTAELELACAVGRGFSGSVAILNREIQPKVIGRFDFPE 596

Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
            +G WT+  K      A  +       +Y  Y+I++       E   +   TA     + 
Sbjct: 597 ARGFWTMCVKKPLSKGAAVASDYDTTAQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
           ++      G T+ AG +  + R+IQV +   R  DG   ++Q L   P   E       +
Sbjct: 657 DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713

Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
           T  S SI DPY+LL   D SI +     +     V  P     S K  S C LY+D    
Sbjct: 714 T--SASIVDPYLLLNRDDSSIFIAQIHSNNELEEVFKPDGTLKSTKWASGC-LYND---- 766

Query: 720 PWLRKTSTDAWLSTGVG-EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD 778
                  T     + V  +  D AD        I   +   +GAL ++ +P+      V 
Sbjct: 767 -------TQGIFQSNVNKQKADAAD-------RIMMFLLSSAGALHVYALPD------VS 806

Query: 779 KFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSR 838
           K +         ++ EAL      ++++     G   KE+I  + V +L      A    
Sbjct: 807 KPI---------FVAEALTSIPPFLSAAFVARKG-ASKESITEILVADLG----DAISQT 852

Query: 839 PFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
           P+L        +  Y+      P    +  D   ++  L    V+ S  +       P  
Sbjct: 853 PYLIVRHASDDLTIYE------PVRCQEEGDAELSASLLFKKCVNTSLAKT-----APEV 901

Query: 899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFT 958
           +    E P   P +R     N++G+   FL G+ P + +           L    +   +
Sbjct: 902 SEDDAEPPRFVPLRRCA---NVNGYGAVFLPGASPSFVLKSSHSEPRVIGLQGLGVRGMS 958

Query: 959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
             H   C+ GFIYV  +GI ++ QLPS +++ +
Sbjct: 959 TFHTEGCDRGFIYVDVEGIARVTQLPSNASFTD 991


>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor
           FP-101664 SS1]
          Length = 1431

 Score =  142 bits (358), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 183/813 (22%), Positives = 344/813 (42%), Gaps = 117/813 (14%)

Query: 103 LVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC 161
           LV  +RLHG V  L A+ +    ++  + D ++++F+DAKI++LE+ D+IH +   S+H 
Sbjct: 123 LVREHRLHGTVTGLEAVRTVHSLED--KLDRLLVSFKDAKIALLEWSDAIHDVMTVSIHT 180

Query: 162 FE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILK--ASQGGSGLVGDED 218
           +E +P+ + L        RG L +VDP  RC  + +    + IL    SQ    L+  E 
Sbjct: 181 YERAPQLMALD---SPLFRGEL-RVDPLSRCAALSLPKDSLAILPFYQSQAELDLMEQES 236

Query: 219 TFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
           +      +S     S V++L  D+D  +++V DF F+ G+  P + +L + + TW GR+ 
Sbjct: 237 SQARDVPYSP----SFVLDLANDVDQRIRNVIDFAFLPGFNNPTVAVLCQYQQTWTGRLK 292

Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ- 334
               T  +   ++      +PLI +   LP+D   L    + IGGV ++ +N I +  Q 
Sbjct: 293 EYKDTVGLFIFTLDLVTNNYPLITAVDGLPYDCLSLTPCSTAIGGVFILASNAIIFVDQA 352

Query: 335 SASCALALNNY---AVSLDSSQELPRSSF-SVELDAAHATWLQNDVALLSTKTGDLVLLT 390
           S    L +N +      L      P+    +++L+ A  T++ +    +  K G +  + 
Sbjct: 353 SRRVILPVNGWPPRTSDLTMPSLTPQEQLRNLQLEGARFTFVDDKTLFVILKDGTVHPVE 412

Query: 391 VVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           +V DG+ V RL ++      T P+V    +  + +   F+GS +G S+L++      T+ 
Sbjct: 413 LVLDGKTVSRLSMADALARTTIPAV----VARVRDDYLFVGSMVGPSVLLR------TAH 462

Query: 446 LSSGLKEEFGDIEAD-----APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK- 499
           +   +KEE  D++A      AP+         D      NGE+ S  G+ +   +S +K 
Sbjct: 463 VEEVIKEEDVDMDAGPATVVAPADTMDLDDDDDLYGPSGNGEQPSANGATNGTVDSVKKR 522

Query: 500 -TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG-------------ISKQSNY 539
                ++ D+L   G + D ++GL  N D       +ATG             +  +S  
Sbjct: 523 TVVRLSLCDALPAHGAISDMAFGLARNGDRVVPELIAATGSGELGGFHLFQRDMPTRSKR 582

Query: 540 ELVELPGCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLL 596
           +L  + G +G+W++   ++ +       R ++ +D     +IIS +A     +   A   
Sbjct: 583 KLHAIGGARGMWSLAVRQAMKVSGGTLERPSSQNDS----VIISTDANPSPGLSRIATRS 638

Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVF---ERGARIL--DGS--YMTQDLSFGPSN 649
                ++   + G T+ A   F    ++ +        R+L  DG+   + +DL      
Sbjct: 639 AHSDIAITTRIPGTTLGAAPFFQGTAILHILFNVTNAIRVLEPDGTERQIIKDLE----- 693

Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
                 +    + S SI DP+VL+   D +I L +G+     +  +  + +        +
Sbjct: 694 ----GTAPRPKIKSCSICDPFVLIIREDDTIGLFIGELERGKIRRKDMSPMGDKTSRYVA 749

Query: 710 CTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI-------YSVVCYESGA 762
              + D           T   L T V E     +     QG +       + ++    G 
Sbjct: 750 GGFFTD-----------TSGLLQTFVNEQAPAENVTSTLQGAMNAGNKSQWLILVRPQGV 798

Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
           +E++ +P     F+     +    + D+Y   AL        S  ++   + ++ +I  +
Sbjct: 799 VELWTLPKLTLAFSTTLLATLDPILTDSYDGPAL--------SLPQDPPRKPQELDIDQI 850

Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQA 855
            +  L   R      RP L  +L  G +  Y+A
Sbjct: 851 VIAPLGESR-----PRPHLIVLLRSGQLAVYEA 878


>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
           protein 1
 gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1393

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 166/665 (24%), Positives = 276/665 (41%), Gaps = 102/665 (15%)

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
           ++I+LAF +AK++++E+D   +G+   S+H +E  +        +  + G ++ VDP  R
Sbjct: 88  EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147

Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
           C  V  +G++ + IL   Q G  LV D+      +  GS G                 A 
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206

Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
             SS V+ L  LD  + H     F++ Y EP   IL+ +  T    +  +      +  +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
           +    +    + S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326

Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
             + S     +S  ++ L+      L   N   LL   TG++VL+    DGR V  + + 
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386

Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE---EF 454
              P           S    +G+   FLGS   DS+L+      G S+ SSG K+   + 
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKKPRPQA 440

Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNG--EELSLYGSASNNTESAQKTFSFAVRDSLVNI 512
              E D+       +S  D  +D +     E+ + G   +        ++F   D L+NI
Sbjct: 441 RHTEEDSGGFSDEDQSEDDVYEDDLYATVPEVVVDGRRPSAESFGSSLYNFREYDRLLNI 500

Query: 513 GPLKDFSYGLRINADASATGISKQSNYELV----------------------------EL 544
           GPLKD ++G    +          S  ELV                            +L
Sbjct: 501 GPLKDIAFGRSFTSLGGEENAGNDSGLELVASQGWDRSGGLAVMKRGLELQVLNSMRTDL 560

Query: 545 PGCKGIWTVYHKSSRGHNADS---SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
             C  +WT    +S  H  ++   +   A + E H Y+++S +A +   E +++     +
Sbjct: 561 ASC--VWT----ASVAHMEEAVSKTTTQAENRECHQYVVVS-KATSAEREQSEVFRVEGQ 613

Query: 602 SVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---PSNSE 651
            +  F        +  TI  G L G+ RV+Q+     R  DG     DL      P   E
Sbjct: 614 ELRPFRAPEFNPNEDVTIDIGTLIGKNRVVQILRSEVRSYDG-----DLGLAQIYPVWDE 668

Query: 652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCT 711
               SE    +S S+ DPYV +   D ++ LL  D S     V+    I +SK   +SC 
Sbjct: 669 --DTSEERMAISSSLVDPYVAILRDDSTLLLLQADDSGDLDEVELNEQIANSKW--TSCC 724

Query: 712 LYHDK 716
           LY DK
Sbjct: 725 LYFDK 729


>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
          Length = 1595

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 100/325 (30%), Positives = 163/325 (50%), Gaps = 72/325 (22%)

Query: 181 PLV-KVDPQGRCGGVLVYGLQMIILK----------------ASQGGSGLVGDEDTFGSG 223
           P++ + DP+GRC  VL+   +   +K                +S G   +   ++  G G
Sbjct: 208 PIIGRADPEGRCAAVLLRNEEKAKVKIMPASETSTSSNYIKESSNGSKKMTTKKE--GEG 265

Query: 224 GGF-SARIESSHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKH 278
             +  A I SS  +++R +       V+D  F+HGY EPV++IL+E    TW+GR+S + 
Sbjct: 266 TVYVPATIGSSFDLDVRKILGPSAAFVRDCCFLHGYGEPVLMILYESNPPTWSGRLSLRM 325

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC 338
            TC + A+SI  T K++ ++W+   LP  AY L  VP+P+GGVLV+ +  I Y SQS+S 
Sbjct: 326 DTCKLVAVSIDCTKKKYTIVWTREKLPSAAYSLFPVPNPLGGVLVLSSGHILYESQSSSA 385

Query: 339 ALALN----------NYAVSLD------------------------SSQELPRSSFSVEL 364
               +          N+A  +                         SS E  ++ F V+L
Sbjct: 386 TYISDFLGKGGPQEGNFAEEIARNNGVEGQAAHANPVPHVNSNKNVSSYETTQNEFQVQL 445

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTVVYD------------GRVVQRLDLSKTNPSVLT 412
           DAA    ++ +VA++S+KTG L+  TV+ +            GR  +R+ + K+  +VL+
Sbjct: 446 DAAKIEMIRENVAIISSKTGQLI--TVILETVGGAASVGSKVGRRCRRIRVLKSGNAVLS 503

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
           S +  +G  L F+GSR+GDSLL+ +
Sbjct: 504 SGLAAVGKDLLFIGSRVGDSLLIGY 528



 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 141/560 (25%), Positives = 230/560 (41%), Gaps = 118/560 (21%)

Query: 501  FSFAVRDSLVNIGPLKDFSYGLR--INAD-------ASATGISKQSNY---------ELV 542
            + F+V+DSL+ I P+ D + G    +  D        +A G  K             ELV
Sbjct: 668  YKFSVKDSLLCISPVVDLTVGASAPVGTDLDPRTELVAACGHGKNGALAILTRGITPELV 727

Query: 543  ------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA--RTMVLETAD 594
                   LPG +  W      +   N  + R    D+ +  +LI+SL +   TMVLET +
Sbjct: 728  TEVESGALPGLRACWAT---RTEDDNDGTVRPKRKDELFDEHLILSLSSTKTTMVLETGE 784

Query: 595  LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG- 653
             L EV++ VD+ V   T+A   +F  R + QV +   R     +  +   F   + +   
Sbjct: 785  ELREVSKEVDFIVDEETLACERIFNGRAIAQVTKTKIR-----FTRKGKKFAVDDIDLAF 839

Query: 654  -SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-------ESSKK 705
              G E + +    I +  + L +SDGSIR+++GD  T T ++              ++  
Sbjct: 840  LKGGEGAQITLAIIQNDAIALRLSDGSIRIILGDSKTNTFTLLEKVGELFASDNHSNTGS 899

Query: 706  PVSSCTLYHD----------------KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ 749
             V++ TLY D                + P  WL +T      + G  E  D +     + 
Sbjct: 900  DVTAFTLYDDSVACTDSFGGGGGGLNRAP-GWLERT------ACGDREEKDESK----EN 948

Query: 750  GDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEE 809
             ++        G L ++ +P+   +++      G         RE L  + T I+S    
Sbjct: 949  NNVVFATISRDGTLALYSLPSLKKLWSSGGVSDG---------REILAPNSTGIDSIDFN 999

Query: 810  GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
               +  K  +  +++   A    +A + RP L     DG++L YQA  F+ P +      
Sbjct: 1000 DECEVEKYTVSDIRLDAFA----NAAYERPLLTCFRADGSVLAYQA--FKSPSSN----- 1048

Query: 870  PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT--REETPHGAPCQ---RITIFKNIS--- 921
                                LRF+R P++  T   E T +    Q   R+T  +NI    
Sbjct: 1049 -------------------ELRFARVPIEIETAGSELTNNDVSVQGGSRLTRIENIGDGR 1089

Query: 922  GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI-VAFTVLHNVNCNHGFIYVTSQGILKI 980
            G  G F+SG  P W +V R R+   P   +G   +AF   HNVNC  GFI  T++G +++
Sbjct: 1090 GIAGVFVSGLNPIWLIVRRGRVLALPTRGEGGARIAFAPFHNVNCPKGFILATNEGGIRV 1149

Query: 981  CQLPSGSTYDNYWPVQKVVF 1000
            C+LP     +  WPV+K+  
Sbjct: 1150 CRLPGKMHIEAQWPVRKLAL 1169


>gi|347838999|emb|CCD53571.1| similar to Cleavage and polyadenylation specificity factor subunit
           1 [Botryotinia fuckeliana]
          Length = 1447

 Score =  140 bits (354), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 231/1050 (22%), Positives = 405/1050 (38%), Gaps = 197/1050 (18%)

Query: 57  NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
           NLVV  +++++I+  +        + E+ S  +K+      RV  D G+           
Sbjct: 28  NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87

Query: 98  -------AASLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
                     L LV  Y L G V SL     I S+ G +      +I++ F+DAK+S++E
Sbjct: 88  QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           +D    G+   S+H +E  E                + VDP  RC  +      + IL  
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201

Query: 207 SQGGSGLVGDEDTFGSG--------------GGFSARIESSHVINLRDLDMK-----HVK 247
            Q     + D D    G              G       SS V+ L  LD       H++
Sbjct: 202 KQDEDVNMDDWDEELDGPRPAKISQKAAAEDGQLDTPYGSSFVLRLSSLDPSIIFPIHLE 261

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLP 305
              F++ Y EP   IL       +  +  +  H T M+  L +    K    I S   LP
Sbjct: 262 ---FLYEYREPTFGILSSTMAPSSALLQERRDHLTYMVFTLDMHQ--KASTTILSVGGLP 316

Query: 306 HDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
           +D ++++ +  P+GG L+VG N  IH      +  +A+N +A        L ++   + L
Sbjct: 317 YDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTGFSLLDQADLDLRL 376

Query: 365 DAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT---SDI 415
           +      L  +N   L+   +GD+ +L+   DGR V  L + + +     ++LT   S +
Sbjct: 377 EGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGAILTGAASCV 436

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
           +++G    F+GS + DS+++ +   SG +       +     E D            +  
Sbjct: 437 SSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVDE---AMFDEEDLEDD 493

Query: 476 QDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSYG---LRINADASA 530
            D + G+  ++  + +N T S  KT  ++F + DS+VNI P+ + ++G   L +  D   
Sbjct: 494 DDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAFGEAALSLGKDEEL 553

Query: 531 TGISKQSNYELV--------------------------ELPGCKGIWTVYHK--SSRGHN 562
                QS  +LV                          +LP  +GIWT+  K  + +G  
Sbjct: 554 KSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIWTMSAKRPAPKGLQ 613

Query: 563 ADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLTEVTESVDYF-VQG 609
            +  +         D +Y   +I+S   +A   + E+A     D   E     ++    G
Sbjct: 614 VNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGFEALTGTEFEPAAG 673

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
            TI AG L    RV+Q+ +   R  DG   + Q L     + E+G+      ++S S AD
Sbjct: 674 STIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA---EPKIISASFAD 728

Query: 669 PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD 728
           P++LL   D SI +   D       ++    I  S K ++ C LY D           +D
Sbjct: 729 PFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD------YSGAFSD 781

Query: 729 AWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK---FVSGRT 785
           +  S   GE             ++   +    GAL I+ +P+ +    V +   FV    
Sbjct: 782 SK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPVYVAEGICFVPPVL 827

Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
                  + A +++ TEI                       L      +    P+L    
Sbjct: 828 SADYAARKSAARETLTEI-----------------------LVANLGDSVSQSPYLILRP 864

Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET 905
           ++  +  Y+ +  +            S S  L  S +   +++N   ++ P    + EE 
Sbjct: 865 SNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHLTQAP--DVSAEEQ 910

Query: 906 PHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTV 959
             GA      P + I+   N+ G+   F+ G  P + +   +       L    + + + 
Sbjct: 911 VDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKVLSLQGTGVRSLSS 967

Query: 960 LHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            H   C+ GFIY +++GI ++ Q P  +T+
Sbjct: 968 FHTEGCDRGFIYASTEGIARVAQFPPNTTF 997


>gi|189203597|ref|XP_001938134.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187985233|gb|EDU50721.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 1407

 Score =  140 bits (352), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 168/697 (24%), Positives = 292/697 (41%), Gaps = 86/697 (12%)

Query: 57  NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NLVV   ++++I+ ++     V     + S+N+      E     L    + A L LV  
Sbjct: 28  NLVVAKNSLLQIFELKSTTTEVTPGAGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
           + L G V SLA +    A N++ + +++++AF DAK+S++E+D   + L   S+H +E+P
Sbjct: 88  FPLAGTVISLARVK---ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144

Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ----------- 208
           +      W    +   +F     +  DP  RC  +      + IL   Q           
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLVEDDYDSD 199

Query: 209 -GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
             G      +   G+ G       SS V+ L +LD  + H     F+H Y EP   I+  
Sbjct: 200 ADGPKETKADQANGTNGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIVAA 259

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
              T    ++ +      S  ++    K    + S   LP+D  K++ +PSPIGG L+VG
Sbjct: 260 SRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALLVG 319

Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTK 382
            N  IH      +  +A+N +A +  S     +S  ++ L+      L  +    L+   
Sbjct: 320 GNEIIHVDQGGKTNGVAVNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIVLN 379

Query: 383 TGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLV 435
            G L++LT   DGR V  + +                S  + +G    F+GS  G+S+++
Sbjct: 380 NGRLLILTFTLDGRTVSGMTIQTVAADHGGHLLKSAASCTSNLGRGRLFIGSEDGESVML 439

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNT 494
            +T       L++ L+ +  + + D            D   D+ N    +++  +A+ + 
Sbjct: 440 GWTG------LTNQLRRKLSNADLDG-EDDSEEEEIDDMEDDLYNDTAPTMHKITAAVSE 492

Query: 495 ESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYEL- 541
            +A  T++F + D L +I P+KD            + G + ++    A G     + EL 
Sbjct: 493 PTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDRELH 552

Query: 542 ------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EART 587
                  ELP   G+W V+ +       +     D+    A D +Y  YL++S   E  T
Sbjct: 553 PISVATKELPSAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGEDGT 612

Query: 588 MVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG 646
           +V E   D LTE  +      +G T+  G L    +V+QV     RI D       +   
Sbjct: 613 VVYEVNGDKLTETDKGDFEREEGTTLLVGILAAGTKVVQVMRTEVRIYDSELNLVHIQSM 672

Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
               E GS  E + +++ S ADPY+L+   D S+++ 
Sbjct: 673 EEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708


>gi|19112233|ref|NP_595441.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe
            972h-]
 gi|74582544|sp|O74733.1|CFT1_SCHPO RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|3738146|emb|CAA21247.1| cleavage factor one Cft1 (predicted) [Schizosaccharomyces pombe]
          Length = 1441

 Score =  139 bits (351), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 233/1061 (21%), Positives = 412/1061 (38%), Gaps = 187/1061 (17%)

Query: 57   NLVVTAANVIEIYVV-RVQEEGS-----------------KESKNSGETKRRVL-MDGIS 97
            NLVV+  N + ++ + ++Q++ S                  ES+   ET   ++  +  +
Sbjct: 29   NLVVSKVNSLHLFEIEKIQKDESSFPLDDSLQNEFSTSIIDESQAFMETNMHLIRTNEQT 88

Query: 98   AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
               L LV   ++ G +  ++ L   G++     D +I+  + AK+S LE+D         
Sbjct: 89   TYVLRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTN 145

Query: 158  SMHCFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
            S+H +E      +K      +  P  + VDP   C  +L +   M+ +        L  +
Sbjct: 146  SLHYYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDME 199

Query: 217  EDTF-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
            E     S    S   + S V+    LD  +  + D  F++GY EP + IL+  E T    
Sbjct: 200  EAAIENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVT 259

Query: 274  VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-H 332
            +  +  T + S +++    +   +I +  +LP+D Y  +++P+P+GG L++G N + Y  
Sbjct: 260  LPLRKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVD 319

Query: 333  SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-----QNDVALLSTKTGDLV 387
            S   +  + +N+Y           +S F++EL+   A  L     +    +L   +G   
Sbjct: 320  SAGRTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFF 379

Query: 388  LLTVVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCG 440
             L  + DG+ V+ L L     + N   L S IT     G +L FLGS+  DS L++++  
Sbjct: 380  YLDFLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWSRR 439

Query: 441  SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
            +          EE    E D      L  ++   + DM++  E      +          
Sbjct: 440  TT--------NEEVRLDEGD----DTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGP 487

Query: 501  FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY---ELV--------------- 542
                + D L NIGP+ DF+ G      A +     Q N+   ELV               
Sbjct: 488  LRLEICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAGADGAGGLVVFR 542

Query: 543  -----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVL 590
                       +  GC+ +WTV   S +  N  S   A Y + E   YL++S E  + + 
Sbjct: 543  RNIFPLIAGEFQFDGCEALWTV-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIF 601

Query: 591  ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSN 649
               +   EV  S D+    +T+  G+L    R++Q+     R+ D +  +TQ  +F    
Sbjct: 602  LAGETFDEVQHS-DFSKDSKTLNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNF---- 656

Query: 650  SESGSGSENSTVLSVSIADPYVLLGMSDGSI----------RLLVGDPSTCTVSVQTPAA 699
                  S+   V+S SI DP +++    G I          RL+  D       V+T A+
Sbjct: 657  ------SKKQIVVSTSICDPCIIVVFLGGGIALYKMDLKSQRLIKTDLQNRLSDVKT-AS 709

Query: 700  IESSKKPVSSCTLY----------------HDKGPEPWL-----RKTSTDAWLSTGVGEA 738
            + S         L+                +D   E  L      KTS +  +  G  ++
Sbjct: 710  LVSPDSSALFAKLFTYNETLNAKGQIANGMNDSASETDLDIQPNHKTSNNDQM--GYDQS 767

Query: 739  IDGADGGP--------------LDQGDIYS----VVCYESGALEIFDVPNFNCVFTVDKF 780
            +  AD  P              LDQ  +          + G L+++++ +F+ +   D F
Sbjct: 768  V-SADDVPEVDNTIVTEKNVSNLDQESLEKHPILFALTDEGKLKVYNLADFSLLMECDVF 826

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                T      +   ++   T  N  S             S ++VEL +         P 
Sbjct: 827  DLPPT------LFNGMESERTYFNKES-------------SQELVELLVADLGDDFKEPH 867

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            LF       I  Y+A+L+    NT K  + ++ ++   V   + +R        TP DA 
Sbjct: 868  LFLRSRLNEITVYKAFLYS---NTDKHKNLLAFAK---VPQETMTREFQANVG-TPRDAE 920

Query: 901  TREETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAF 957
            +  E    +     ++T  + +  H   F++G +P   +       +  P   +  I++ 
Sbjct: 921  STMEKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSV 980

Query: 958  TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
               H  +   G+IYV     ++IC+      YDN WP +KV
Sbjct: 981  APFHAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKKV 1021


>gi|302831157|ref|XP_002947144.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
 gi|300267551|gb|EFJ51734.1| hypothetical protein VOLCADRAFT_87503 [Volvox carteri f. nagariensis]
          Length = 2830

 Score =  139 bits (351), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 123/476 (25%), Positives = 205/476 (43%), Gaps = 81/476 (17%)

Query: 575  YHAYLIISL-EARTMVLETADLLTEVTES--VDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
            +HAYL+I++   RTMVL   D L +VT S   ++ V   T+AAGNLF    ++Q    G 
Sbjct: 1889 FHAYLLITMGRVRTMVLRCTDGLDDVTNSPECEFLVNQPTLAAGNLFHNAVIVQACPMGL 1948

Query: 632  RILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS-------------IADPYVLLGMSDG 678
            R+L+G  + Q+L      +     ++ S                    ADPYVL+G+SDG
Sbjct: 1949 RVLEGMTLVQELRVSDFQASRPKTAQYSFCCRTKHPIAHRAMGPIPQAADPYVLVGLSDG 2008

Query: 679  SIRLLVGDPSTCTVSVQTPAA-------IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
            +  LL GDP + T+ V T AA         S ++ +++  L+ D+            +W+
Sbjct: 2009 TAVLLEGDPLSLTLGVATAAAEQLMAVPARSRQQRLAAACLHRDE-----------TSWM 2057

Query: 732  STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS-------GR 784
            ++        +         I+  +C  SG LE + +P+   VF      +       G 
Sbjct: 2058 ASATAAEAASS----GSSFSIFLWICRLSGRLECYSLPSMRLVFHSSGLAAAEEVLRMGP 2113

Query: 785  THIVDTY--MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS----- 837
              + D Y         +E E++     G G G  E+     VVEL ++ +    S     
Sbjct: 2114 AVMYDVYDLFGGGGGGAEAELDG----GGGSGIMED----PVVELRVESFLGGGSPAVPD 2165

Query: 838  --RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSA----------- 884
              RP L  +   G ++ YQ  L   P ++   + P +   +   S               
Sbjct: 2166 CERPVLLVMAASGNLVAYQIALRRLPLDSLSHEAPAAMGAAAGSSGGGGGIGGGAALGPR 2225

Query: 885  -SRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
             +R  +L ++     +++R +       ++  +    + + G F++GSRP W +  R  L
Sbjct: 2226 MARFDHLAYTDPSSKSHSRTDI------RKYPVASQGTSYSGVFVAGSRPLWLVASRGGL 2279

Query: 944  RVHPQLCDGSIVAFTVLHNVNCNHGFIYV-TSQGILKICQLPSGSTYDNYWPVQKV 998
              HP   +G++ A T  HN NC  GFI   +S+G+LK+CQLP  +  D  W  ++V
Sbjct: 2280 VPHPMFAEGAVAAMTPFHNANCPLGFISACSSRGLLKVCQLPPHTRLDTPWVTRRV 2335



 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 67/215 (31%), Positives = 120/215 (55%), Gaps = 25/215 (11%)

Query: 228  ARIESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
            A + + +++NL + + ++ V+D +F+HGY EPV+++LHE + TW G +  +  TC ++A+
Sbjct: 1305 ATLGNGYLLNLNKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWVGMLRERKDTCCLAAI 1364

Query: 287  SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYA 346
            SIS  LK+H ++W   +LP+D +KLLAVP     VLV+  N +   SQ++  A ALN+ A
Sbjct: 1365 SISLRLKRHTILWKLASLPYDCFKLLAVPY-RPAVLVISPNLLLLCSQASQHAAALNSNA 1423

Query: 347  VS--------LDSSQELP---------RSSFSVELDAA-----HATWLQN-DVALLSTKT 383
            +         LD S+E P         + + +V  D A     +AT + + +V     ++
Sbjct: 1424 LPGEVPPPLILDPSREPPAATAARLAAQYALNVHPDCAPAAGRNATLMADLEVVAAGLQS 1483

Query: 384  GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            G L+ + + ++G   QR+ + +T    + S +  I
Sbjct: 1484 GTLLAVHLQFEGPADQRITVVRTGGGPIASAMVGI 1518



 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 8/188 (4%)

Query: 56   PNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD-GISAASLELVCHYRLHGNVE 114
            PNL+V   N +E++ +R     +  +  +           G   A LELV  Y LHG VE
Sbjct: 1078 PNLIVVRTNRLEVHSLRSSAVATNAAAATATAAATASAAVGSGGARLELVVSYHLHGVVE 1137

Query: 115  SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            SLA+LS G   +S RRD+++LAF + K+SV+E++   H LR +S+H FE    +  + GR
Sbjct: 1138 SLAVLSGG---SSSRRDALLLAFREGKLSVVEWNPRTHSLRTSSLHYFEGDPGVQ-REGR 1193

Query: 175  ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG---LVGDEDTFGSGGGFSARIE 231
             +    P V  DP GRC  +     Q+ +L A +  +G    V D    G G G   RI 
Sbjct: 1194 IAVPLPPRVVTDPAGRCAAMSFCFSQLALLPALEVKAGAWQCVDDGGVMGVGRGERERIG 1253

Query: 232  SSHVINLR 239
              H+   R
Sbjct: 1254 GVHINERR 1261


>gi|322704830|gb|EFY96421.1| Cleavage factor two protein 1 [Metarhizium anisopliae ARSEF 23]
          Length = 1433

 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 234/995 (23%), Positives = 376/995 (37%), Gaps = 156/995 (15%)

Query: 72  RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
           R  ++   ES   G     V  D  +   L L+    L G V  LA +     +     +
Sbjct: 70  RANDDDGLESSFLGGESLIVRADPSNITKLVLITEIPLAGTVIGLARVKV--KNTPSGGE 127

Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
           +++LA++ AK+ + E+    H L  TS+H +E  E   L+      + G  V   + DP 
Sbjct: 128 ALLLAYKAAKMCLTEWHPQRHTLETTSIHYYEKDE---LQGAPWEMSFGDYVNYLEADPG 184

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGD---ED-------------TFGSGGG----FSA 228
            RC         + IL  +Q    L  D   ED             T G G G      +
Sbjct: 185 SRCVAFKFGSRNLAILPFTQSEEDLEMDDWDEDLDGPRPVKEELPLTNGDGPGDHDLVKS 244

Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
           R   S V+ L  LD  + H     F+H Y EP   IL   +   A      H T  +  L
Sbjct: 245 RYTPSFVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSMQSPSAALGIKDHLTYKVFTL 304

Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY 345
            +    +    I S   LP D ++++A+P+P+GG L+VG N  IH         +A+N+ 
Sbjct: 305 DLQQ--RASTTILSVTGLPQDLFRVMALPAPMGGALLVGENELIHIDQSGKPNGVAVNDM 362

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL 403
           A  + S   + +S   + L+      L ND+   LL    G L ++    DGR V ++ +
Sbjct: 363 AKQMTSFSLVDQSELGLRLEGCAVELLANDIGELLLILNDGRLAIVCFHIDGRTVSKISI 422

Query: 404 ----SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
               ++   +++ S ++ I   G++  FLGS   DS+++ ++   G        K +   
Sbjct: 423 RLVSAEYGGNLIKSQVSCISKLGSNTLFLGSESNDSIVLGWSRKQGQE------KRKKSR 476

Query: 457 IEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASNNTESAQKTFSFAVRDSLVNIGPL 515
           +     +         D   D + G + SL   S + N  S     SF ++DSL++I P+
Sbjct: 477 LLDPDLALDVDDLDLDDDEDDDLYGNDASLAKPSQTINGGSKPGEVSFRIQDSLLSIAPI 536

Query: 516 KDFSYGL-RINADASATGISKQSNYEL----------------------------VELPG 546
           +D + G   +  D+    +SK    EL                             + P 
Sbjct: 537 RDVACGAPALVPDSEEATLSKGVTAELELACAVGRGSSGSVAILNREIQPKVIGRFDFPE 596

Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVT 600
            +G WT+  K      A  +       +Y  Y+I++       E   +   TA     + 
Sbjct: 597 ARGFWTMCAKKPLSKGAAVASDFDTTGQYDKYMIVAKVDLDGYETSDVYALTAAGFETLK 656

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENS 659
           ++      G T+ AG +  + R+IQV +   R  DG   ++Q L   P   E       +
Sbjct: 657 DTEFEPAAGFTVEAGTMGKQMRIIQVLKSEVRCYDGDLGLSQIL---PMLDEDTGAEPRA 713

Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPE 719
           T  S SI DPY+LL   D SI +     +     V  P     S K  S C LY+D    
Sbjct: 714 T--SASIVDPYLLLIRDDSSIFIAQIHSNNELEEVLKPDGTLKSTKWASGC-LYND---- 766

Query: 720 PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD-IYSVVCYESGALEIFDVPNFN-CVFTV 777
                  T       V E          D+ D I   +    GAL ++ +P+ +  VF  
Sbjct: 767 -------TQGIFQNNVNEQ-------QADETDRIMMFLLSSVGALHVYALPDVSRPVFVA 812

Query: 778 DKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
           +   S     +  ++  A           + +G     KE+I  + V +L      A   
Sbjct: 813 EALTS-----IPPFLSAAF---------VARKGAS---KESITEILVADLG----DAISQ 851

Query: 838 RPFLFAILTDGTILCYQA--YLFEGPENTSKS---DDPVSTSRSLSVSNVSASRLRNLRF 892
            P+L        +  Y+   Y  EG    S S      V+TS + +   VS         
Sbjct: 852 TPYLIVRHASDDLTIYEPVRYQAEGDAELSASLLFKKCVNTSLAKTAPEVSED------- 904

Query: 893 SRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDG 952
                DA    E P   P +R     N++G+   FL  + P + +           L   
Sbjct: 905 -----DA----EPPRFVPLRRCA---NVNGYGAVFLPNASPSFVLKSSHSEPRVMGLQGL 952

Query: 953 SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGS 987
            +   +  H   C+ GFIYV  +GI ++ QLPS +
Sbjct: 953 GVRGMSTFHTEGCDRGFIYVDMEGIARVTQLPSNA 987


>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
          Length = 1377

 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 212/984 (21%), Positives = 384/984 (39%), Gaps = 156/984 (15%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  Y L G +  L  +     D+    +++++A  +AK+S++E+D   H +  TS+H
Sbjct: 65  LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122

Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
            +E  + +++     + A  P  + VDP  RC  VL +G + + IL   Q G  LV    
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180

Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                             DE    +G  F     SS V+ +  L+  M H     F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240

Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
            EP   IL+ +  T +  +  +      S  ++    +    + S   LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300

Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
            P+GG L++G+N  +H      + A+ +N +A    S     +S   + L+ +    L  
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360

Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFL 425
           +N   LL    G + +L+   DGR V  + L     +   S+L +  +    +     F 
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420

Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ--------D 477
           GS  GDS+L+ ++  S  +      +   G+I   +           D            
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480

Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
           M  G    + +S+ G+  N+       + F + D L N+GP++D + G      + D   
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533

Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
              S  +N ELV   G                            G  +VY K  +  +  
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593

Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
            S        Y  YL++S      + +++V   +    E T++ ++   + RTI  G L 
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653

Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
              RV+QV +   R  D G  + Q       +      SE  +V+  S ADPYVL+   D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708

Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
            SI LL  D S      +T   I S+     S +LY DK                     
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDK-------------------YR 747

Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
           + +  +G P + Q D +   +      L +F +PN    VFT +                
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793

Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
              D   +I S+         +E I  + V +L      +    P+L    ++  +  Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846

Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
            Y +     TS ++   S  R + ++N    +      S + ++ +    T    P +  
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895

Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
            +  ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +
Sbjct: 896 -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954

Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
             ++++C+ P  + +D  W  +K+
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARKI 978


>gi|226290902|gb|EEH46330.1| cleavage and polyadenylation specificity factor subunit A
           [Paracoccidioides brasiliensis Pb18]
          Length = 1343

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 231/1035 (22%), Positives = 398/1035 (38%), Gaps = 174/1035 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V    ++++Y +     GS   ++  +T+ +        + L LV  Y L G V  L
Sbjct: 28  NLIVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGTVTDL 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    ++I++A  +AK+S++E+D   H +  TS+H +E  + +H+     +
Sbjct: 81  GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-------------------- 214
            A  P  + VDP  RC  VL +G + + IL   Q G  LV                    
Sbjct: 138 LAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGDFDSDHDEERQIDTNHTA 196

Query: 215 --GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
              DE     G  +     SS V+ +  L+  M H     F++ Y EP   IL+ +    
Sbjct: 197 EERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQVAAS 256

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-I 329
           +  +  +      S  ++    +    + S   LP+D +K++ +P P+GG L+VG+N  +
Sbjct: 257 SALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSNELV 316

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLV 387
           H      + A+ +N +A    S     +S   + L+      L  +N   LL    G + 
Sbjct: 317 HVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNGVMA 376

Query: 388 LLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
           +++   DGR V  + L            +T PS        +G    F GS  GDS+L+ 
Sbjct: 377 VVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEEGDSMLIG 432

Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES 496
           ++  S  + +     E   D  A+    +       DA +D +    ++  G  S NT S
Sbjct: 433 WSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGINSRNTTS 490

Query: 497 AQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELVELPG--- 546
              T    + F + D L N+GP++D + G      + D   +  S  +  ELV   G   
Sbjct: 491 VNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELVTTQGYGR 550

Query: 547 -----------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
                                    G+ +V+ K  +      S        Y  YL++S 
Sbjct: 551 AGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYDHYLLLSK 610

Query: 584 -----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGS 637
                + +++V + +    E T + ++   + RTI  G L G  RV+QV +   R  D +
Sbjct: 611 SKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGEVRSYDSA 670

Query: 638 YMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSV 694
            +   L      P   E    SE  +V+  S ADPYVL+   D SI LL  D S     +
Sbjct: 671 NLHLGLGLAQIYPVWDE--DTSEERSVVHASFADPYVLIIRDDSSILLLQADESGDLDEI 728

Query: 695 QTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ--GDI 752
           +T   IES+     S +LY DK            ++LS          +G P  +   ++
Sbjct: 729 ETDGIIESTT--WISGSLYQDK----------YRSFLS---------YEGTPNRKPSDNV 767

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM---REALKDSETEINSSSEE 809
              +      L IF +PN        + V     I+ T +   R   ++  TEI      
Sbjct: 768 LLFLLNSESKLYIFHLPNAKEPVYTAESVDLLPQILPTELPPRRTTYRECLTEI------ 821

Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
                            L      +    P+L        ++ Y+ Y          ++ 
Sbjct: 822 -----------------LVADLGDSVSRTPYLILRSNSNELILYEPY-----HTVQSTEK 859

Query: 870 PVSTSRSLSVSN------VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGH 923
            +S  R L ++N      +  S L NL  S   L    R     G  C   T+F      
Sbjct: 860 RLSDLRFLKIANHHFPKFLPESNLGNLSDSDRQL---ARPLRALGDVCGYRTVF------ 910

Query: 924 QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
               + G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++++C+ 
Sbjct: 911 ----MPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRF 966

Query: 984 PSGSTYDNYWPVQKV 998
           P  + +D  W  +K+
Sbjct: 967 PRNTHFDGSWAARKI 981


>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 1408

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 212/984 (21%), Positives = 385/984 (39%), Gaps = 156/984 (15%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  Y L G +  L  +     D+    +++++A  +AK+S++E+D   H +  TS+H
Sbjct: 65  LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQICTTSIH 122

Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
            +E  + +++     + A  P  + VDP  RC  VL +G + + IL   Q G  LV    
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180

Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                             DE    +G  F     SS V+ +  L+  M H     F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240

Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
            EP   IL+ +  T +  +  +      S  ++    +    + S   LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300

Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
            P+GG L++G+N  +H      + A+ +N +A    S     +S   + L+ +    L  
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360

Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFL 425
           +N   LL    G + +L+   DGR V  + L     +   S+L +  +    +     F 
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420

Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ--------D 477
           GS  GDS+L+ ++  S  +      +   G+I   +           D            
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480

Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
           M  G    + +S+ G+  N+       + F + D L N+GP++D + G      + D   
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533

Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
              S  +N ELV   G                            G  +VY K  +  +  
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593

Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
            S        Y  YL++S      + +++V   +    E T++ ++   + RTI  G L 
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653

Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
              RV+QV +   R  D G  + Q       +      SE  +V+  S ADPYVL+   D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708

Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
            SI LL  D S      +T   I S+     S +LY DK                     
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDK-------------------YR 747

Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
           + +  +G P + Q D +   +      L +F +PN    VFT +                
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793

Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
              D   +I S+         +E I  + V +L      +    P+L    ++  ++ Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLILYE 846

Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
            Y +     TS ++   S  R + ++N    +      S + ++ +    T    P +  
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTTLSKPLR-- 895

Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
            +  ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +
Sbjct: 896 -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954

Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
             ++++C+ P  + +D  W  +K+
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARKI 978


>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
          Length = 1408

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 212/984 (21%), Positives = 384/984 (39%), Gaps = 156/984 (15%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  Y L G +  L  +     D+    +++++A  +AK+S++E+D   H +  TS+H
Sbjct: 65  LVLVAEYALSGTITDLGRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIH 122

Query: 161 CFESPEWLHLKRGRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---- 214
            +E  + +++     + A  P  + VDP  RC  VL +G + + IL   Q G  LV    
Sbjct: 123 YYERDD-VNISPWTPNLASCPSYLTVDPSSRCA-VLNFGKKNLAILPFHQVGDDLVMDDF 180

Query: 215 -----------------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                             DE    +G  F     SS V+ +  L+  M H     F++ Y
Sbjct: 181 DSDVEEPHRNMNQTAEETDEANKSNGPVFQTPYASSFVLPIAALEPSMLHPISLAFLYEY 240

Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
            EP   IL+ +  T +  +  +      S  ++    +    + S   LP+D +K++ +P
Sbjct: 241 REPTFGILYSQVATSSALLHDRKDVVFYSVFTLDLEQRASTTLLSVSRLPNDLFKVVPLP 300

Query: 316 SPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL-- 372
            P+GG L++G+N  +H      + A+ +N +A    S     +S   + L+ +    L  
Sbjct: 301 PPVGGALLIGSNELVHIDQAGKTNAVGVNEFAREASSFSMADQSDLEMRLEDSIVEQLGA 360

Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFL 425
           +N   LL    G + +L+   DGR V  + L     +   S+L +  +    +     F 
Sbjct: 361 ENGDMLLVLLNGKMAVLSFKLDGRSVSGISLRPVPDQAGSSLLKAKPSCSVPVSRGKIFF 420

Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ--------D 477
           GS  GDS+L+ ++  S  +      +   G+I   +           D            
Sbjct: 421 GSEEGDSVLMGWSRPSARTKDPRAQRTGEGNIAQLSDEDDDDEEEDDDDDAYEDDLYATP 480

Query: 478 MVNG----EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI---NADASA 530
           M  G    + +S+ G+  N+       + F + D L N+GP++D + G      + D   
Sbjct: 481 MTTGIKARDYVSVNGTGFND-------YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQ 533

Query: 531 TGISKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNAD 564
              S  +N ELV   G                            G  +VY K  +  +  
Sbjct: 534 PVSSILTNLELVTTQGYGKAGGLAILRREIDPFVIDSLMIKDTDGARSVYVKDPKLPSQS 593

Query: 565 SSRMAAYDDEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLF 618
            S        Y  YL++S      + +++V   +    E T++ ++   + RTI  G L 
Sbjct: 594 GSLPLNPGSNYDHYLLLSKSKGLDKEKSVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLA 653

Query: 619 GRRRVIQVFERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
              RV+QV +   R  D G  + Q       +      SE  +V+  S ADPYVL+   D
Sbjct: 654 SGTRVVQVLKGEVRSYDSGLGLAQIFPVWDEDM-----SEEKSVVHTSFADPYVLIIRDD 708

Query: 678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE 737
            SI LL  D S      +T   I S+     S +LY DK                     
Sbjct: 709 QSILLLQADESGDLDEAETDGIINSTT--WISGSLYQDK-------------------YR 747

Query: 738 AIDGADGGP-LDQGD-IYSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMRE 794
           + +  +G P + Q D +   +      L +F +PN    VFT +                
Sbjct: 748 SFNSYEGPPNMKQSDNVLLFLLSSESKLYVFHLPNAREPVFTTESI-------------- 793

Query: 795 ALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
              D   +I S+         +E I  + V +L      +    P+L    ++  +  Y+
Sbjct: 794 ---DLLPQILSTEPPPRRVTYRETITELLVADLG----DSVSRSPYLILRSSNSDLTLYE 846

Query: 855 AYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRI 914
            Y +     TS ++   S  R + ++N    +      S + ++ +    T    P +  
Sbjct: 847 PYHY-----TSSTEKQFSDLRFVKIANHHFPKFH----SESNVEKHPANCTALSKPLR-- 895

Query: 915 TIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
            +  ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +
Sbjct: 896 -VLGDVCGYRTVFMPGNSPCFIIKSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDT 954

Query: 975 QGILKICQLPSGSTYDNYWPVQKV 998
             ++++C+ P  + +D  W  +K+
Sbjct: 955 DNVVRMCRFPRNTHFDGSWAARKI 978


>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
           SS1]
          Length = 1433

 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 217/996 (21%), Positives = 396/996 (39%), Gaps = 192/996 (19%)

Query: 57  NLVVTAANVIEIYVVR-------VQEEGSKES-----KNSGETKRRVLMD---------- 94
           N+VV  ++++ I+ VR        Q+E  KE      K +   +  V MD          
Sbjct: 42  NVVVARSSLLRIFEVREEPAPVSTQKEVEKERRAAVRKGTEAVEGEVEMDTSGEGFVNMG 101

Query: 95  ---GISAAS-------LELVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKIS 143
              G++ A+         LV  +RLHG V  L A+ +    D+  + D ++++F+DAKI+
Sbjct: 102 TSAGLNGAAHPPTVNRFYLVREHRLHGTVTGLEAVRTVHSLDD--KLDRLLVSFKDAKIA 159

Query: 144 VLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
           +LE+  S+H +   S+H +E +P+ + +        R  L + DP  RC  + +    + 
Sbjct: 160 LLEWSLSLHDVITVSIHTYERAPQLIAID---SPLFRSEL-RADPLSRCAALSLPKDSLA 215

Query: 203 ILK--ASQGGSGLVGDEDTFGSGGGFSARIESSHVINL-RDLD--MKHVKDFIFVHGYIE 257
           IL    SQ    ++  E +      +S     S +++L  D+D  +++V DF F+ G+  
Sbjct: 216 ILPFYQSQAELDILEQEASQARDVPYSP----SFILDLANDVDKRIRNVIDFTFLPGFHN 271

Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
           P + +L + + TW GR+     T  +   ++      +P+I +   LP+D + L    + 
Sbjct: 272 PTVAVLCQYQQTWTGRLKEYKDTVGLYIFTLDFVTNNYPVITAVDGLPYDCFALTPCSTA 331

Query: 318 IGGVLVVGANTIHYHSQSA-SCALALNNYAVSLD-------SSQELPRSSFSVELDAAHA 369
           IGGV+++ +N + +  QS     L +N +   +        ++QE  R    ++L+ A  
Sbjct: 332 IGGVVILASNAVLFVDQSGRRVILPVNGWPPRVSDLPMPPLTAQEQTR---DLQLEGARF 388

Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFF 424
            ++ +    L  K G +  + ++ DGR V +L +S      T P+V    +  IG+   F
Sbjct: 389 VFVDDKKLFLILKDGTVYPIELIQDGRTVSKLTMSDALARTTIPAV----VKRIGDDHIF 444

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE---ADAPST--------------KRL 467
           +GS +G S+L++          ++ ++EE  D +   A+ P+T                L
Sbjct: 445 IGSIVGPSVLLK----------TARVEEEIHDEDVAMAEGPATVVDTSKTVDMMDDDDDL 494

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
              S+ A Q   NG        A +N  + +     ++ D++   GP+ D ++GL  N D
Sbjct: 495 YGPSTIADQPAANGTA----NGAVDNVRT-RTVVHLSLCDAIPAHGPISDMTFGLSRNGD 549

Query: 528 ------ASATGISKQSNYELVE--LP-----------GCKGIW------------TVYHK 556
                  +ATG     ++ L +  +P           G +G+W            T + +
Sbjct: 550 RLVPELVAATGSGHLGSFSLFQRDMPTRFKRKLHAIGGGRGMWSLPVRQQVKTGGTTFER 609

Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEART--MVLETADLLTEVTESVDYFVQGRTIAA 614
            S   +AD+  +    D   +  +  +  R+    +     +  VT     F QG  I  
Sbjct: 610 PSNPFHADNDTVIISTDANPSPGLSRIATRSSHSDITITTRIPGVTLGAAPFFQGTAILH 669

Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
             +F    VI+V E      DG+          S  +    +    + S SI DP++L+ 
Sbjct: 670 -VMFNVTNVIRVLEP-----DGTERQ-------SIKDLDGNAARPRIKSCSICDPFILII 716

Query: 675 MSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG 734
             D +I L +G+     +  +  + +        +   Y D      L +T  +A     
Sbjct: 717 REDDTIGLFIGEIERGKIRRKDMSPMGEKTSKYLAGYFYTDTS---GLFQTFLNA---EA 770

Query: 735 VGEAIDGADGGPLDQGDI--YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
            GEA      G ++ G+   +  +    G +EI+ +P     F+     +    I D+  
Sbjct: 771 PGEAATSTLQGAMNAGNKTHWLTLVRPQGVVEIWTLPKLTLAFSTTTLATLDPVISDSLE 830

Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
             AL                Q        + V +L +      H RP L  +L  G +  
Sbjct: 831 PPAL-------------SLPQDPPRKPQELDVDQLVIAPLGESHPRPHLIVLLRSGQLAI 877

Query: 853 YQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNL-RFSRTPLDAYTREETPHGAPC 911
           Y+A     P       DP+  +RSL++       L NL +      D    EE       
Sbjct: 878 YEAVAASPPA------DPLPPTRSLTL-------LVNLVKVKSKAFDIQHTEEEQKSVLA 924

Query: 912 QRITIFKNI----------SGHQGFFLSGSRPCWCM 937
           ++  I + +            + G F +G RP W +
Sbjct: 925 EQKRISRLLLPFVTSPAPGQTYSGVFFTGDRPSWIV 960


>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
          Length = 1259

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 168/675 (24%), Positives = 284/675 (42%), Gaps = 121/675 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV--- 113
           N+V TA N ++IY + +                        +A L L   Y+LHG +   
Sbjct: 30  NIVTTANNTLKIYEIDIDS-------------------NTPSAKLILRREYQLHGEIIGI 70

Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
           +S+ ILS         +D +++AF DAKI++LE+ D I+ +   S+H +E  + + + + 
Sbjct: 71  QSIKILST----TEDGKDRLLIAFRDAKIALLEWSDEINDIVTVSIHTYERSQQV-ISQD 125

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
              F    +++ DP+ RC  +L+    + IL      + L  D D   S          S
Sbjct: 126 MSRFK--AILRSDPENRCSALLLPDDSLAILPVHSAHAEL-EDLDQDVSNAIKDVPYAPS 182

Query: 234 HVINLR--DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
            ++ L+  D D+ +V D+ F+ G+  P + +L E   TW GR+S    TC +  L++   
Sbjct: 183 FILPLKSIDSDICNVIDYTFLPGFHNPTLAVLCEPRQTWTGRLSDSQDTCQVFFLTLDLV 242

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
            + +P+I +  NLP+D+  L A P  IGGV ++ AN  IH          A N +A +L 
Sbjct: 243 TQVYPIIATVDNLPYDSMSLKAAPKEIGGVAILSANAIIHVDQNGRPVGRATNGWA-TLT 301

Query: 351 SSQEL--PRSSFSVELDAAHATWLQ------NDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
           S++    P     V L+ A   +LQ      +  ALL    G++  +    +GR + R+D
Sbjct: 302 SARNFDAPPKDLFVRLEGASIEFLQPKSKQTHPQALLFLPNGEIHAVQFYREGRTISRID 361

Query: 403 LSK-----TNPS-VLTSDITTIGNS---LFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
           +SK     + PS     DI   G S     F+ S +G S L++   G   + L    K+E
Sbjct: 362 ISKPFAKGSIPSGAYRLDIDGQGLSGGQFVFIPSMVGTSFLIR--VGKSLNDLELFPKQE 419

Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS------FAVRD 507
                          +  + A  DM    +  LYGS+    +  ++         F + D
Sbjct: 420 ---------------KVGTTAYDDMDVDVDEELYGSSDKKADEKEEEEEISSEPPFTICD 464

Query: 508 SLVNIGPLKDFSYG---------LRINADASA------TGISKQSNYE---LVELPGCKG 549
            + + GP++D + G         L+I A   A      T   ++  +E    +++ G  G
Sbjct: 465 YIESYGPIQDITIGRYMQTRNSPLQILAATGAGHVGGITAFHQEVPFESKHKLDVQGNHG 524

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
           +WT ++ +  G+      + A D +    +   L +  + L   D   EV          
Sbjct: 525 LWT-FNVTGVGN-----VLVATDSKSKTKISKLLPSNEVALIAED--NEV---------- 566

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            TIAA       R++ +     ++L    + Q               EN  V   SI+DP
Sbjct: 567 -TIAADTAANSTRILMITSNAIKVLKEDGIEQ----------QSLQIENGEVQRASISDP 615

Query: 670 YVLLGMSDGSIRLLV 684
           Y+L   S+GSI L +
Sbjct: 616 YILTLQSNGSISLFI 630


>gi|302924728|ref|XP_003053954.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256734895|gb|EEU48241.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1429

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 224/994 (22%), Positives = 376/994 (37%), Gaps = 150/994 (15%)

Query: 75  EEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSII 134
           ++G + S   G     V  D  +   L LV    L G V  LA +      +    ++++
Sbjct: 72  DDGLESSFLGGGESMLVRTDRTNNTKLVLVAELPLTGTVIGLAKIKTKYTKSGG--EALL 129

Query: 135 LAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQG 189
           LA++ AK+ + E+D   + L   S+H +E  E     W   +   + +     ++ DP  
Sbjct: 130 LAYKAAKMCLCEWDPKKNTLETLSIHYYEKDELQGAPW---EVAFDEYVN--FLEADPGS 184

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGD---EDTFGS---------GGGFSARIESSHV-- 235
           RC         + IL   Q    L  D   ED  G            G S  +E+++   
Sbjct: 185 RCAAFQFGSRNIAILPFRQAEEDLEMDDWDEDLDGPRPVKESTAVANGDSDTLEAAYTPS 244

Query: 236 ----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT 291
               + L D  + H     F+H Y EP   IL   +          H T  +  L +   
Sbjct: 245 FVLRLPLLDPSLLHPVHLAFLHEYREPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ- 303

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLD 350
            +    I S  +LP D YK++A+P+P+GG L++G N  IH      +  +A+N+ A  + 
Sbjct: 304 -RASTTILSVTDLPRDLYKMIALPAPVGGALLIGENEFIHIDQSGKANGVAVNSMARQMT 362

Query: 351 SSQELPRSSFSVELDAA--HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS---K 405
           S     ++  ++ L+       +++N   LL    G L +++   DGR V  + +    +
Sbjct: 363 SFSLSDQADLNLRLEGCIIEQLYIENGELLLILNDGRLGIVSFRIDGRTVSGISIKMIPE 422

Query: 406 TNPSVL----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADA 461
            N   L     S  + +G + FF+GS  GDS+++    G    M     ++         
Sbjct: 423 ENGGRLIKSRASTASKLGKNTFFIGSETGDSVVL----GWSRKMSQEKRRKTRLVDADLG 478

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
                L     D   D + G E +   + + N        SF + D+L++I P++D + G
Sbjct: 479 LDVDDLDLEDDDDEDDDLYGTETAAKPTQALNGAGKSGELSFRIHDTLLSIAPIRDLTSG 538

Query: 522 -LRINADASATGISKQ--SNYELV--------------------------ELPGCKGIWT 552
                 D+    +SK   S+ +L                           E P  +G WT
Sbjct: 539 KAAFLPDSEEATLSKGVVSDLQLACVVGRGNSGSLAILNRHIQPKIIGRFEFPEARGFWT 598

Query: 553 VYHK----SSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVTES 602
           +  K     S G N           ++  Y+I++       E   +   TA     + E+
Sbjct: 599 MCVKKPVPKSLGGNVTVGNDYETFGQHDKYMIVAKVDLDGYETSDVYALTAAGFETLKET 658

Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTV 661
                 G T+ AG +  + RVIQV +   R  DG   +TQ L     + E+G+      V
Sbjct: 659 EFDPAAGFTVEAGTMGKQMRVIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRV 713

Query: 662 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 721
           +S SIADPY+LL   D S+ +   D +     V+   +   S K  + C LY D      
Sbjct: 714 ISASIADPYLLLIRDDSSVLIAQIDSNNELEEVEKTDSTLQSTKWHAGC-LYTD------ 766

Query: 722 LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKF 780
                     + GV +   G  G   D   I   +   +GAL ++ +P+ +  V+  +  
Sbjct: 767 ----------TKGVFQPSVGDKGA--DTSKIMMFLLSSTGALHVYALPDLSKPVYVAEGL 814

Query: 781 VSGRTHI-VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
                H+  D  +R  L                   KEN+  + V +L           P
Sbjct: 815 CYVPPHLSADYTLRRGLA------------------KENLRELLVADLG----DTVSQSP 852

Query: 840 FLFAILTDGTILCYQA--YLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL 897
           +L        +  Y+   Y  EG E T         S +L+    S + L       +  
Sbjct: 853 YLILRNQTDDLTIYEPLRYQPEGAEPT--------LSATLTFKKTSNAALATSPVETSQE 904

Query: 898 DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
           DA    + P   P +      N++G+   FL G  P + +   + +     L    I   
Sbjct: 905 DAV---QQPRFVPLRTCA---NVNGYSTVFLPGPSPSFILKSSKSIPRVIGLQGLGIRGM 958

Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           +  H   C+ GFIY   +GI ++ QLPS + + +
Sbjct: 959 STFHTEGCDRGFIYADDEGIARVTQLPSETNFTD 992


>gi|426235955|ref|XP_004011942.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Ovis aries]
          Length = 819

 Score =  137 bits (345), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 112/384 (29%), Positives = 176/384 (45%), Gaps = 54/384 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV  A   ++YV R+  +    +KN   T  +   +      LELV  +   GNV S+
Sbjct: 29  NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAK--------------ISVLEFDDSIHGLRITSMHCF 162
           A +   GA    +RD+++L+F+DAK              +    FD +   +  TSM   
Sbjct: 85  ASVQLAGA----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTM 140

Query: 163 ESPEWL----------------HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           E P +L                 L+    S  R    K +P  +   V         ++ 
Sbjct: 141 E-PGYLFLGSRLGNSLLLKYTEKLQEPPASTTREAADKEEPPSKKKRVDATTGWAGRVRE 199

Query: 207 SQGGSGLVGDEDTFGSGG---------GFSARIESSH---VINLRDLDMK--HVKDFIFV 252
            +     V + + +GS            F  R  S     +I++R LD K  ++ D  F+
Sbjct: 200 GELPQDEVDEIEVYGSEAQSGTQLATYSFEVRWGSEWLPGIIDVRALDEKLLNIVDLQFL 259

Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
           HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K HP+IWS  +LP D  + L
Sbjct: 260 HGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCTQAL 319

Query: 313 AVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATW 371
           AVP PIGGV++   N++ Y +QS     +ALN+      +     +    + LD A A +
Sbjct: 320 AVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAF 379

Query: 372 LQNDVALLSTKTGDLVLLTVVYDG 395
           +  D  ++S K G++ +LT++ DG
Sbjct: 380 ISYDKMVISLKGGEIYVLTLITDG 403



 Score = 73.2 bits (178), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 37/93 (39%), Positives = 51/93 (54%), Gaps = 3/93 (3%)

Query: 907 HGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNC 965
           +G    R+ + K    H   F+ G  P W +V  R  LR+HP   DG I +F   HN+NC
Sbjct: 486 YGGRHHRLALHKPPLHH--VFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINC 543

Query: 966 NHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             GF+Y   QG L+I  LP+  +YD  WPV+K+
Sbjct: 544 PRGFLYFNRQGELRISVLPAYLSYDAPWPVRKI 576



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/160 (33%), Positives = 76/160 (47%), Gaps = 22/160 (13%)

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDL-VLLTVVYDG-RVVQRLDLSKTNPSVLTSDI 415
           S  SV+L  A     + D  LLS K      +LT++ DG R V+     K   SVLT+ +
Sbjct: 83  SMASVQLAGA-----KRDALLLSFKDAKGGYVLTLITDGMRSVRAFHFDKAAASVLTTSM 137

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL-------- 467
            T+     FLGSRLG+SLL+++T         +    E  D E      KR+        
Sbjct: 138 VTMEPGYLFLGSRLGNSLLLKYT--EKLQEPPASTTREAADKEEPPSKKKRVDATTGWAG 195

Query: 468 RRSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVR 506
           R    +  QD V+  E+ +YGS A + T+ A  T+SF VR
Sbjct: 196 RVREGELPQDEVD--EIEVYGSEAQSGTQLA--TYSFEVR 231


>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus fumigatus A1163]
          Length = 1401

 Score =  136 bits (343), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)

Query: 57  NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV   +V++I+ +++VQ     E+  +   +     D +    L L   Y L G V  
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83

Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
           +     + S+ G +      +++LAF +AK+S++E+D   HG+   S+H +E  +     
Sbjct: 84  ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
              +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D+  F          
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196

Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
               G G  ++          SS V+ L  LD  + H     F++ Y EP   IL+ +  
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
           T    +S +  +   +  ++    +    + S   LP D +K++A+P P+GG L++G+N 
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316

Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
            +H      + A+ +N +A  + +   + +S  ++ L+      + +     LL   +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376

Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
           +VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
                       +    ++  D        +S  DA + D+   E E    G   +   +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492

Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
               + F   D L NIGPL+D + G   +   +   + K +  EL EL   +G       
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551

Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
                                 +WT       G     D  ++   + EY  Y+I+S + 
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607

Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
            T   ET+++    T+ +  F           TI  G L  ++RV+QV     R    SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663

Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
              D+  G     P   E    S+    +S S+ADPY+ +   D ++ +L  D S     
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718

Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
           V+   A  + K    SC LY DK 
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740



 Score = 44.3 bits (103), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 4/88 (4%)

Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
           + I  NIS     F+ G RP   ++   +   H     G  V   +   L + + + GFI
Sbjct: 884 LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942

Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKV 998
           YV S+ +L+IC+ PS + +D  W ++K+
Sbjct: 943 YVDSKDVLRICRFPSDTLFDYTWALRKI 970


>gi|429851266|gb|ELA26469.1| protein cft1 [Colletotrichum gloeosporioides Nara gc5]
          Length = 1411

 Score =  136 bits (343), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 231/1015 (22%), Positives = 393/1015 (38%), Gaps = 160/1015 (15%)

Query: 69   YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSR 128
            Y  R+ ++   ES   G     V  D  +   L LV  Y + G V  LA +      NS+
Sbjct: 66   YDRRLNDDDGLESSFLGGDGMLVRADRTNNTKLVLVAEYPIFGVVAGLARIK---IQNSK 122

Query: 129  RR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL----- 182
               +++++A   A++S++++D   H L   S+H +E  E         S   GPL     
Sbjct: 123  SGGEALLIATRVARLSLVQWDPEKHALEDVSIHFYEKEEL------EGSPFDGPLSNYPT 176

Query: 183  -VKVDPQGRCGGV---------LVYGLQMIILKASQGGSGLVGDED---------TFGSG 223
             +  DP  RC  +         L + L    +        + G            T G+ 
Sbjct: 177  HLAADPGSRCAALRFGSRYIAFLPFKLNDEDIDMDDWDEDVDGPRPAKEPSATAATNGTS 236

Query: 224  GGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTC 281
                    +S+V+ L  LD  + H     F+H Y EP   I+   +          H + 
Sbjct: 237  NLADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSMQRRSNTLPRKDHFSY 296

Query: 282  MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCAL 340
             +  L +    +    I S  NLP D +K++A+P PIGG L+VG N  IH         +
Sbjct: 297  KVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPIGGALLVGTNELIHIDQSGKPNGV 354

Query: 341  ALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVV 398
            A+N +     S     +S   + L+  +   +  +N   L+    G L ++    DGR V
Sbjct: 355  AVNAFTKETTSFPLADQSELDLRLEHCYIEQMSPENGELLMVLSDGRLAIIAFKIDGRTV 414

Query: 399  QRLDL----SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
              L +    ++   +V+    S I+ +  + FF+GS   DSL+V      G +   +   
Sbjct: 415  SGLSVRIVPAEAGGNVVQCGASSISRLSKNAFFIGSTGSDSLVV------GVTRKQTQNA 468

Query: 452  EEFGDIEADAPSTKRLRRSSSDALQDMVNGE-ELSLYGSASNNTESAQKTFSFAVRDSLV 510
             +   +  D+ +         D   D + GE   ++  S + N        SF V DSL+
Sbjct: 469  RKKTRLVDDSFADDLEDEDIDDDDDDDLYGETTTTVQSSTAANGVPKGGEISFRVHDSLL 528

Query: 511  NIGPLKDFSYG--------------------LRINA-----DASATGISKQSNYELV--- 542
            ++ P+KD + G                    L++ A     +A+A  I  Q+    V   
Sbjct: 529  SLAPVKDMTTGKQAFIPESEDEKNSVGVVADLQLAAAVGKGNAAAIAIMNQNIQPKVIGK 588

Query: 543  -ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHA------YLIISLEARTMVLETA 593
             E P  +G WT  V     +    D    AA   E+ A      ++I+S +      ET+
Sbjct: 589  FEFPEARGFWTMCVQKPIPKSLQGDKGANAAVGSEFDASSIYDKFMIVS-KVDLDGYETS 647

Query: 594  DLLTEVTESVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSF 645
            D+        + F         G T+ AG +    R+IQV +   R  DG   ++Q L  
Sbjct: 648  DVYALTGAGFEAFTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQILPM 707

Query: 646  GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
               + E+G+      V+S SIADPY+LL   D SI +   D +     ++       S +
Sbjct: 708  --LDEETGA---EPRVVSASIADPYLLLVRDDASIMVAQIDNNNELEEMEKQDDTILSTQ 762

Query: 706  PVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI 765
             ++ C LY D                +TGV   I    G P  Q  I+  +    GAL I
Sbjct: 763  WLAGC-LYTD----------------TTGVFAPIQTDKGTPESQ-SIFMFLLSAVGALYI 804

Query: 766  FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
            + +P+ +    V    +G T+ V  ++           + +   GT Q   E +  + V 
Sbjct: 805  YALPDLSKPVYV---AAGMTY-VPPFL---------SADYAVRRGTVQ---ETLTEVLVA 848

Query: 826  ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSAS 885
            +L      A  S P+L     +  I  Y+    E        D     +++L    ++  
Sbjct: 849  KLG----DATESSPYLILRHANDDITIYEPIRLE------SQDKSEGLAKTLHFQKIT-- 896

Query: 886  RLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERL 943
               N   +++P++    +  E P   P +      NI+G+   FL G+ P + +   +  
Sbjct: 897  ---NPALAKSPVEVADDDANEQPRFVPLRPCA---NINGYSTVFLPGASPSFIIKSAKSA 950

Query: 944  RVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
                 L    +   +  H   C  GFIY  S+G  ++ QLP+ ++++    ++K+
Sbjct: 951  PKVLGLQGIGVRGMSSFHTEGCERGFIYADSEGHTRVTQLPADTSFELGVSIRKI 1005


>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus fumigatus Af293]
 gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
           protein 1
 gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus fumigatus Af293]
          Length = 1401

 Score =  136 bits (343), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 173/744 (23%), Positives = 307/744 (41%), Gaps = 114/744 (15%)

Query: 57  NLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVES 115
           NLVV   +V++I+ +++VQ     E+  +   +     D +    L L   Y L G V  
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHSRGETIETKSARP----DQVETTKLVLEREYPLSGTVVD 83

Query: 116 LA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLK 171
           +     + S+ G +      +++LAF +AK+S++E+D   HG+   S+H +E  +     
Sbjct: 84  ICRVKILNSKSGGE------ALLLAFRNAKLSLVEWDPERHGISTISIHYYERDDLTRSP 137

Query: 172 RGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFG--------- 221
              +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D+  F          
Sbjct: 138 WVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDDLNQV 196

Query: 222 ---SGGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL 268
               G G  ++          SS V+ L  LD  + H     F++ Y EP   IL+ +  
Sbjct: 197 SDHVGNGLKSKDSTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGILYSQIA 256

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
           T    +S +  +   +  ++    +    + S   LP D +K++A+P P+GG L++G+N 
Sbjct: 257 TSHALLSERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLIGSNE 316

Query: 329 -IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGD 385
            +H      + A+ +N +A  + +   + +S  ++ L+      + +     LL   +G+
Sbjct: 317 LVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHISDSTGDLLLVLSSGN 376

Query: 386 LVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFT 438
           +VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L+ ++
Sbjct: 377 MVLVHFQLDGRSVSGISLRPLPTQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVLLSWS 436

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASNNTES 496
                       +    ++  D        +S  DA + D+   E E    G   +   +
Sbjct: 437 SMPN----PKKSRPRMSNVAEDREEASDDSQSEEDAYEDDLYTAEPETPALGRRPSAETT 492

Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG------- 549
               + F   D L NIGPL+D + G   +   +   + K +  EL EL   +G       
Sbjct: 493 GVGAYIFQTLDRLPNIGPLRDITLGKPASTVENTGRLIKNACSEL-ELVAAQGSGRNGGL 551

Query: 550 ----------------------IWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIISLEA 585
                                 +WT       G     D  ++   + EY  Y+I+S + 
Sbjct: 552 VLMKREIEPDVTASFDAQSVQEVWTAVVALGSGAPLVLDEQQI---NQEYRQYVILS-KP 607

Query: 586 RTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFERGARILDGSY 638
            T   ET+++    T+ +  F           TI  G L  ++RV+QV     R    SY
Sbjct: 608 ETPDKETSEVFIADTQDLKPFRAPEFNPNNDVTIEIGTLSCKKRVVQVLRNEVR----SY 663

Query: 639 MTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
              D+  G     P   E    S+    +S S+ADPY+ +   D ++ +L  D S     
Sbjct: 664 ---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMILQADDSGDLDE 718

Query: 694 VQTPAAIESSKKPVSSCTLYHDKG 717
           V+   A  + K    SC LY DK 
Sbjct: 719 VELNEAARAGK--WRSCCLYWDKA 740



 Score = 44.3 bits (103), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 44/88 (50%), Gaps = 4/88 (4%)

Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV---AFTVLHNVNCNHGFI 970
           + I  NIS     F+ G RP   ++   +   H     G  V   +   L + + + GFI
Sbjct: 884 LRILPNISNFSAVFMPG-RPASFILKTAKSCPHVFRLRGEFVRSLSIFDLASPSLDTGFI 942

Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKV 998
           YV S+ +L+IC+ PS + +D  W ++K+
Sbjct: 943 YVDSKDVLRICRFPSETLFDYTWALRKI 970


>gi|396471273|ref|XP_003838832.1| similar to cleavage and polyadenylation specificity factor subunit
           A [Leptosphaeria maculans JN3]
 gi|312215401|emb|CBX95353.1| similar to cleavage and polyadenylation specificity factor subunit
           A [Leptosphaeria maculans JN3]
          Length = 1402

 Score =  136 bits (342), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 170/706 (24%), Positives = 295/706 (41%), Gaps = 99/706 (14%)

Query: 57  NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NLVV   ++++I+ ++     V  EG  E  N+      E          + A L LV  
Sbjct: 28  NLVVAKNSLLQIFEIKSTTTEVTPEGGDEVDNAAANLDTEAADVQFQRTENTAKLVLVAE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
           + L G V SLA +    A N++ R +++++AF DAK+S++E+D   + L   S+H +E+P
Sbjct: 88  FPLAGTVISLARIK---ALNTKSRGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144

Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG-------GSG 212
           +      W    +   +F     +  DP  RC  +      + IL   Q         S 
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQSDLVEDDYDSD 199

Query: 213 LVGDEDT------FGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVI 262
           L G  DT        SGG  + +    SS V+ L +LD  + H     F+H Y EP   I
Sbjct: 200 LDGPRDTKPDQAEAPSGGETTHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHQYREPTFGI 259

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVL 322
           +          ++ +      S  ++    K    + S   LP+D  +++ +P PIGG L
Sbjct: 260 IAASRAAAPSLLANRKDILTYSVFTLDLEQKASTTLLSVTGLPYDISRVVPLPHPIGGAL 319

Query: 323 VVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LL 379
           ++G N  IH      +  +A+N +A +  S     +S  ++ L+  +   L  D    +L
Sbjct: 320 LLGNNEIIHVDQGGKTNGVAVNEFAKACTSFPLSDQSDLALHLEGCNVELLSQDTGDVVL 379

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLTSDITTIGNSL---FFLGSRLGDS 432
               G L+++T   +GR V  + +          VL +  +   N +    F+GS  G+S
Sbjct: 380 VLNNGRLLIMTFTLEGRTVSGMTIQTVAADHGGHVLKAGSSCTSNLVRGRLFIGSEDGES 439

Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG--SA 490
           +L+      G S  ++ L+    ++  D   T        D L D +  +        +A
Sbjct: 440 VLL------GWSSATASLRRRHSNVGLDGDGTSEEEEEDIDDLDDDLYNDTAPAVQKITA 493

Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKD------------------FSYGLRINADASATG 532
           + +  +   T+SF + D+L +I P++D                   S G    A  + TG
Sbjct: 494 AASEPTPPGTYSFRIHDTLPSIAPIRDAVLHPGKVTDSLNRGEIMLSTGR--GAAGAITG 551

Query: 533 ISKQ---SNYELVELPGCKGIWTVYHKSS------RGHNADSSRMAAYDDEYHAYLIISL 583
           + ++    +    ELP   GIW V+ +             D+    + D +Y  YL++S 
Sbjct: 552 LDRELHPVSLAASELPSTHGIWAVHARKQAPGGVVTAFGEDTEANMSTDVDYDQYLVVSK 611

Query: 584 EAR-----TMVLET-ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
            +      T+V E   + L+E  +      +G T+  G L    +V+QV     R  D S
Sbjct: 612 TSEDGSESTVVYEVHGNELSETDKGDFEREEGSTLFVGVLAAGTKVVQVMRTEVRTYD-S 670

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
            +  D      + E+G+      V++ S ADPY+L+   D S+++L
Sbjct: 671 ELNMDQILPMEDEETGN---ELRVINASFADPYLLVLREDSSVKIL 713


>gi|330919204|ref|XP_003298516.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
 gi|311328242|gb|EFQ93393.1| hypothetical protein PTT_09264 [Pyrenophora teres f. teres 0-1]
          Length = 1388

 Score =  136 bits (342), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 170/699 (24%), Positives = 293/699 (41%), Gaps = 90/699 (12%)

Query: 57  NLVVTAANVIEIYVVR-----VQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NLVV   ++++I+ ++     V     + S+N+      E     L    + A L LV  
Sbjct: 28  NLVVAKNSLLQIFELKSTTTEVTPGSGENSENAAANLDTEAADVPLQRTENTAKLVLVAE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
           + L G V SLA +    A N++ + +++++AF DAK+S++E+D   + L   S+H +E+P
Sbjct: 88  FPLAGTVISLARVK---ALNTKSKGEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENP 144

Query: 166 E------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-- 217
           +      W    +   +F     +  DP  RC  +      + IL   Q    LV D+  
Sbjct: 145 DLPGIAPWSADLKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQ--RDLVEDDYD 197

Query: 218 ------------DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVIL 263
                           + G       SS V+ L +LD  + H     F+H Y EP   I+
Sbjct: 198 SDAEVPKETKADQANDTSGEHKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTFGIV 257

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
                T    ++ +      S  ++    K    + S   LP+D  K++ +PSPIGG L+
Sbjct: 258 AASRATAPSLLAQRKDILTYSVFTLDLEQKASTTLLSVSGLPYDITKVVPLPSPIGGALL 317

Query: 324 VGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLS 380
           VG N  IH      +  +ALN +A +  S     +S  ++ L+      L  +    L+ 
Sbjct: 318 VGRNEIIHVDQGGKTNGVALNEFAKACTSFSLSDQSDLALHLEGCSIELLSQETGDVLIV 377

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSL 433
              G L++LT   DGR V  + +                S  + +G    F+GS  G+S+
Sbjct: 378 LNNGRLLILTFTLDGRTVSGMTIQTVAADHGGHLVKSAASCTSNLGRGRLFIGSEDGESV 437

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG-SASN 492
           ++ +T       L++ L+ +  + + D            D   D+ N    +++  +A+ 
Sbjct: 438 MLGWTG------LTNQLRRKLSNADLDG-EDDSDEEEIDDMEDDLYNDTAPTMHKITAAV 490

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKD-----------FSYG-LRINADASATGISKQSNYE 540
           +  +A  T++F + D L +I P+KD            + G + ++    A G     + E
Sbjct: 491 SEPTAPGTYTFRIHDVLPSIAPIKDAVLHPGKVTESLNRGEIMLSTGRGAAGAITALDRE 550

Query: 541 L-------VELPGCKGIWTVYHKS------SRGHNADSSRMAAYDDEYHAYLIISL--EA 585
           L        ELP   G+W V+ +       +     D+    A D +Y  YL++S   E 
Sbjct: 551 LHPISVATKELPLAHGVWAVHARKQAPGDVTAAFGEDTEANMATDVDYDQYLVMSKNGED 610

Query: 586 RTMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
            T+V E   D LTE  +      +G T+  G L    +V+QV     RI D       + 
Sbjct: 611 GTVVYEVNGDQLTETDKGDFEREEGTTLLVGVLAAGTKVVQVMRTEVRIYDSELNLVHIQ 670

Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
                 E GS  E + +++ S ADPY+L+   D S+++ 
Sbjct: 671 SMEEEEEGGSTKELN-IINASFADPYLLILREDSSVKIF 708


>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
           [Saccoglossus kowalevskii]
          Length = 984

 Score =  135 bits (340), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 163/668 (24%), Positives = 264/668 (39%), Gaps = 169/668 (25%)

Query: 423 FFLGSRLGDSLLVQFT--CGSGTSMLSSGLKE---EFGDIEADAPSTKRLRRSSSDALQD 477
            FLGSRLG+SLL+++       T  +++G K+   +    + + P+ K+    +SD +  
Sbjct: 6   LFLGSRLGNSLLLKYVEKAQESTDSVTNGAKKTEEDEETNKEEPPNKKKRTDDASDWIAS 65

Query: 478 MV-----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRI 524
            V     + +EL +YGS +    +   +++F V DS++NIGP      G         + 
Sbjct: 66  DVALLAEDVDELEVYGSQTQ-AGTQLTSYTFEVCDSIMNIGPCTKAVMGEPVFLSEEFQT 124

Query: 525 NAD-----ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSR 567
           N D      + +G SK     ++            ELPGC  +WTV     +  N D  +
Sbjct: 125 NPDPDMELVALSGYSKNGALSVLQRSIRPQVVTTFELPGCIDMWTVVGPPEK-ENKDQPK 183

Query: 568 MAAYDD---------EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
               ++           HA+LI+S +  +M+L T   + E+  S  +  QG T+ AGNL 
Sbjct: 184 EKTEEEGDKKPDALTNGHAFLILSRDDSSMILSTGQEIMELDHS-GFSTQGPTVYAGNLG 242

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
               ++QV   G R+L+G    Q +               S ++  S++DPY LL    G
Sbjct: 243 NNAYILQVSPMGVRLLEGVNQLQHIPL----------DLGSPIVLCSVSDPYALLMSEKG 292

Query: 679 SI--------------RLLVGDPSTCTVS-VQTPAAIE--------SSKKPVSSCTLYHD 715
            +              RL +  P    +S + T  A +        SSK   +S      
Sbjct: 293 ELVLLTLKPDGFAGGHRLAISRPQIPQISRILTLCAYKDTSGMFTTSSKMESTSDETEEK 352

Query: 716 KGPEPWLRKTSTDAWLSTG-------VGEAIDGADGGPLDQGD----------------- 751
           K  +P +   S  + +S          GE+ D +   P  + +                 
Sbjct: 353 KITKPSVADISMTSEISNVDDEDEMLYGES-DASLFSPTKKEEKSSFLQTREVLSETKPT 411

Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            +  +  E+G LEI+ +P+F   F V  F  G   +VD+Y          ++ +S+  G+
Sbjct: 412 YWCAMSRENGVLEIYSLPDFKLAFLVKNFPMGFKVMVDSY----------QMTASAPGGS 461

Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
            +  +++     V EL +      + +  L A + D  +  Y+A+               
Sbjct: 462 SKSDQQHDMMPIVKELLLIGLGHKNKKTHLLARV-DEDLYIYEAF--------------- 505

Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
            T    S+ N     LR LRF +                                F+ G 
Sbjct: 506 -THDQSSLDN----HLR-LRFRKV-------------------------------FVCGP 528

Query: 932 RPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYD 990
            P W  M  R  LR HP   DGS+  F   HN+NC  GF+Y    G L+IC LP+  +YD
Sbjct: 529 YPHWLFMTSRGALRSHPMHIDGSVTCFAPFHNINCPKGFLYFNKHGELRICVLPTHLSYD 588

Query: 991 NYWPVQKV 998
             WPV+KV
Sbjct: 589 ALWPVRKV 596


>gi|147772179|emb|CAN73417.1| hypothetical protein VITISV_017053 [Vitis vinifera]
          Length = 609

 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 69/122 (56%), Positives = 80/122 (65%), Gaps = 26/122 (21%)

Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYEL--------------------- 541
           F V DSL+N+GPLK F+Y LRINAD  ATGI KQSN+EL                     
Sbjct: 430 FEVNDSLINVGPLKVFAYALRINADLKATGIVKQSNFELMCCSGHGKNGALCILQQSIRP 489

Query: 542 -----VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
                VEL GC+ IWTVYHK++RGHNADS++M   DDEY AYLIIS E+RTMVLET +LL
Sbjct: 490 EMITEVELSGCERIWTVYHKNTRGHNADSTKMVTKDDEYCAYLIISPESRTMVLETVELL 549

Query: 597 TE 598
            E
Sbjct: 550 GE 551


>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
 gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
          Length = 1397

 Score =  132 bits (332), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 228/1030 (22%), Positives = 397/1030 (38%), Gaps = 178/1030 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  +   G+  ++   D +  A L L   Y + G +  L
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSAPA---GQPDQKGRHDRLQHAKLVLAAEYEVPGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
             +      NS+   D+I+++  +AK+S++E+D   HG+   S+H +E  E  H+     
Sbjct: 85  ERVR---ISNSKSGGDAILVSSRNAKLSLIEWDPQKHGITTISIHYYEGEES-HMSPWVP 140

Query: 176 SFAR-GPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE-DTFGSGGGFSARIES 232
                   + VDP G C  +  +G+  + IL   Q G  LV D+ D   +G   +  +  
Sbjct: 141 DLGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDAIPNGDDTTDAVND 199

Query: 233 -----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                            S V+ +  LD  + H     F+H Y EP   IL+ +       
Sbjct: 200 AQKPAPGNAVHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSMSL 259

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
              +      S  ++    K    + +   LP D +K++ +P PIGG L++G N  +H  
Sbjct: 260 TIDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDIFKVVPLPPPIGGALLIGTNELVHVD 319

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +     +S   + L+      L +     LL    G + +LT
Sbjct: 320 QAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCMVEQLGSGAGDVLLILSDGRMAILT 379

Query: 391 VVYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS-- 441
              DGR V  + L     ++  S++ S  +   ++G +  F GS  GDS+L+ ++  S  
Sbjct: 380 FKVDGRSVAGISLHFVAEQSGGSIIKSRPSCSASLGRNKLFYGSEEGDSILLGWSKHSSA 439

Query: 442 --------------GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY 487
                         GT+ LS   +++  D +           +++   + +VNG+     
Sbjct: 440 TKKPSKAAGGGNEDGTANLSDEEEQDDDDDDMYEDDLYSANPTTTQQEKQVVNGD----- 494

Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
           G+A+         F+    D L ++GP +D + G    + +     S       +EL   
Sbjct: 495 GAAN---------FTLRAHDRLWSLGPYRDITLGRPPKSKSKDRQDSVPEISAPLELVAA 545

Query: 548 KGI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA------------RTMVL 590
           +G       TV  +       DS +M   DD Y  + I  ++             R ++L
Sbjct: 546 RGFGKAGGLTVLKREIDPFTIDSLKM---DDVYGVWSIRVIDPKSKDAGLSRSYDRYLLL 602

Query: 591 ETADLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARIL 634
             A    +  ESV Y V                +  TI  G L    RV+QV     R  
Sbjct: 603 AKAK-GDDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGSRVVQVLRTEIRSY 661

Query: 635 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTV 692
           D +     +   P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V
Sbjct: 662 DCNLGLAQIY--PVWDE--DTSEERTVIQASFAEPYLLTIRDDNSLLILQADKNGDLDEV 717

Query: 693 SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
            +Q  AA   S K VS C LY DK      +  S+D              D       +I
Sbjct: 718 EIQGSAA---SAKWVSGC-LYEDK-----TKIFSSD-------------LDTEHAATPNI 755

Query: 753 YSVVCYESGALEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
              +    G L IF +PN    +  VD                 L  S     SSS    
Sbjct: 756 LLFLLDSDGNLSIFRLPNITEPLCRVDNL--------------NLLPSNLPYESSSRRPV 801

Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
               +E +  + V +L      A H  P++        ++ Y+ Y   G    SK     
Sbjct: 802 ---NRETLTELLVADLG----DAIHKSPYMILRTKHDDLVLYEPYRITGENGRSKLQ--F 852

Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGS 931
             + +  V     ++  N   +R+P            +P + +    ++ G++  F+SG 
Sbjct: 853 IKAVNHVVMGPRTNQPMNKDINRSP------------SPSKLLRALSDVCGYKTVFMSGQ 900

Query: 932 RPCWCM---VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGST 988
            PC+ +   + R  +    +L   ++ + T  H   C  GF YV    ++++ +LPS + 
Sbjct: 901 NPCFILKSAIARPNVL---RLRGKAVQSLTGFHIAACERGFAYVDEDNVIRMSRLPSNTR 957

Query: 989 YDNYWPVQKV 998
           +D+ W  +K+
Sbjct: 958 FDSAWATRKI 967


>gi|409046890|gb|EKM56369.1| hypothetical protein PHACADRAFT_93103 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 1417

 Score =  132 bits (332), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 193/892 (21%), Positives = 368/892 (41%), Gaps = 124/892 (13%)

Query: 103 LVCHYRLHGNV---ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            V  +RLHG V   ES+ I+S          D ++++F+DAKI++LE+ D+++ L   S+
Sbjct: 120 FVREHRLHGTVTGMESIRIVSS----QEDGLDRLLVSFKDAKIALLEWSDAVNDLLTVSI 175

Query: 160 HCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGVLVYGLQMIILKASQGGSGL- 213
           H +E +P+ + L+         PL    ++ DP  RC  +++    + IL   Q  + L 
Sbjct: 176 HTYERAPQMMALE--------APLFHSQLRTDPLSRCAALMLPKDSLAILPFYQSQADLD 227

Query: 214 VGDEDTFGSGGGFSARIESSHVINLR-DLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
           + ++DT  S          S V+++  D+D  +KHV D +F+ G+  P + +L +   TW
Sbjct: 228 IMEQDTQTSCRDIP--YSPSFVLDMTTDVDERIKHVIDLVFLPGFNSPTIAVLFQNTQTW 285

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
             R+     T  +   ++    +  P++ +  NLP+D   L+   + +GGV++V AN + 
Sbjct: 286 TSRLREYKDTVGLIIFTLDLVTRNCPVLTAVDNLPYDCLYLVPCSAQLGGVVIVSANALI 345

Query: 331 YHSQ-SASCALALNNYAVSLDSSQELPR-----SSFSVELDAAHATWLQNDVALLSTKTG 384
           Y +Q S    L +N +   + S   LP+      S +++L+ ++A ++ ++   +    G
Sbjct: 346 YVAQTSRRVILPVNGWQARV-SDHPLPQLTEEEKSRNLKLEGSYAVFVDDNKLFVLLSDG 404

Query: 385 DLVLLTVVYDGRVVQRLDL-SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
            +  + V  DGR V RL + S    + + + +  + +   F+GS  G S+L++      T
Sbjct: 405 TVYPMEVHADGRTVSRLTMGSALAQTTIPAIVRRVTDENLFIGSTAGPSVLLK------T 458

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
           S +   +KEE  +++  AP+      +  D   D  +GE       A   T         
Sbjct: 459 SHVEEDVKEEDVEMDT-APAAVVDEANEMDLDDD--DGELCHWVHFAKKRT-----VVHL 510

Query: 504 AVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP---------- 545
           ++ DS+   GP+ D ++ L    D       +ATG      + L +  LP          
Sbjct: 511 SLCDSIPAYGPVSDMTFSLTRVGDRPVAELVAATGSGGLGGFTLFQRDLPSRVKRKLHAV 570

Query: 546 -GCKGIWTV-YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTE 601
            G +G+W++   ++ R + +   R +      +  +IIS +A     +   A   ++   
Sbjct: 571 GGGRGMWSLAVRQAVRVNGSTYERPSNPHHGGNDAVIISTDANPSPGLSRIASRSSKSDI 630

Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQDLSFGPSNSESGSGSE 657
            +   + G T+ A + F    ++ V     R+L  DG+   + +DL              
Sbjct: 631 QITTRIPGTTVGAASFFQGTAILHVMSNAIRVLEPDGTERQIIKDLD---------GSVP 681

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDK 716
              +   S+ DP++++   D S+ L +G+P    +  +  + + E + K ++ C  + D 
Sbjct: 682 RPKIRYCSMCDPFIMVIREDDSLGLFIGEPERGKIRRKDMSPMGEKTSKYIAGC-FFMDT 740

Query: 717 GPEPWLRKTSTDAWLSTGVGEAIDGA-DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
                 R  +  A     V   +    + G   Q   + ++    G LE++ +P    VF
Sbjct: 741 TGIFQSRVNAAAAAADKNVTSTLQTVMNAGTRTQ---WLLLVRPQGVLEVWSLPKLALVF 797

Query: 776 TVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAH 835
           +     +  + +VD+    AL                Q        + V ++A+      
Sbjct: 798 STSHVSALESVLVDSGDSPAL-------------SLPQDPPRKPQDLDVEQIAIAPLGES 844

Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
            S+ +L   L  G    Y+A     P   S    P + + +L V  V        +    
Sbjct: 845 SSKLYLLVFLRCGLFAVYEAL----PAPASTDPPPPTRTSTLCVKFV--------KVVTR 892

Query: 896 PLDAYTREETPHGAPCQRITIFKNI----------SGHQGFFLSGSRPCWCM 937
             D    EE       ++  I + +              G FL+G RPCW +
Sbjct: 893 AFDIQQSEEVEKSVLAEQKRISRQLIPFVTSPTPGRAFSGVFLTGDRPCWIL 944


>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
          Length = 1406

 Score =  132 bits (332), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 213/1034 (20%), Positives = 407/1034 (39%), Gaps = 179/1034 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           +L+V   ++++IY +  +     E  ++ +   ++L++            Y L G V  L
Sbjct: 28  DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76

Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                + S+ G +      ++++AF +AK+S++E+D    G+   S+H +E  +      
Sbjct: 77  CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
             +    G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS         
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187

Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
              GGG           +      S V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
             T +  +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307

Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
           N  +H      + A+ +N ++  + S     +S  ++ L+      L +     LL   T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367

Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
           G++ ++    DGR V  + +         + I       T IG+   FLGS  GDS+L+ 
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427

Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
           ++  S ++       ++  D  AD        +S  D  +D +     + +L G   +  
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEED---QSDDDVYEDDLYSTSPDTTLTGRRPSGE 484

Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
            SA   + F + D L+NIGPL+D + G R++ +   TG    S    +++   +G     
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNPEKTGDRTNSTSPELQIVASQGSHKSG 544

Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
              V  +    H   S  + + D  + A L    EA             R  V+ T    
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604

Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSYMT 640
           ++  ES+ + V G                 TI+ G    R+RV+QV +   R  D    T
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISIGTQESRKRVVQVLKNEVRSYDFGKFT 664

Query: 641 -----QDLSFGPSNSES-------GSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
                ++ + G   S +          ++    +S S+AD  + +   D ++  L  D S
Sbjct: 665 PSRCRRNFADGTDLSLTQIYPIWDDDTNDERMAVSASLADSCLAILRDDSTLLFLQADDS 724

Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
                V     + S K    SC LY DK                TG+  +ID     P+ 
Sbjct: 725 GDLDEVVFGEDVASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV- 765

Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE 808
           + D++  +      L +       C+      + G   ++      +   S+  I++   
Sbjct: 766 KNDMFLFLLSHDCKLFV------KCLLWSSFALRGWHLMLSKSSGLSRPRSKAAIDN--- 816

Query: 809 EGTGQGRKENIHSMKVVELAM----QRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENT 864
               +G +  + S+ ++E  +    + WSA    P+L        I+C+     EG    
Sbjct: 817 ----RGDRRFVASVNLIEAIVADLGETWSAS---PYL--------IVCHH---IEG---- 854

Query: 865 SKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
                         + ++  S+  N    R P    + + +      + + I  +ISG  
Sbjct: 855 --------------IHSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLS 900

Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLP 984
             F+ G+   + +          +L   +  + + L    C+ GFIY+ SQ  ++ C+LP
Sbjct: 901 AVFMPGASAGFIIRTSASAPHFLRLRGENSRSVSSLDTPECSKGFIYLDSQSTVRFCKLP 960

Query: 985 SGSTYDNYWPVQKV 998
             + +D  W +++V
Sbjct: 961 PMTRFDYQWTLKRV 974


>gi|115490949|ref|XP_001210102.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114196962|gb|EAU38662.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 908

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 176/731 (24%), Positives = 297/731 (40%), Gaps = 148/731 (20%)

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
           ++++LAF +AK+S++E+D   HG+   S+H +E  +        +  + G ++ V+P  R
Sbjct: 62  EAVLLAFRNAKLSLIEWDPERHGISTISIHYYERDDLTCSPWVPDLSSCGSILDVEPSSR 121

Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGD-------------EDTFGSGGGFSARIESSHVI 236
           C  V  +G++ + I+   Q G  LV D             ++T      ++    SS V+
Sbjct: 122 CA-VFNFGIRNLAIIPFHQPGDDLVMDDYDSDLDERKHVDQETTRESPAYATPYASSFVL 180

Query: 237 NLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
            L   D  + H     F+H Y EP   IL+ +  T    +  +      S  ++    + 
Sbjct: 181 PLTAFDPSILHPISLAFLHEYREPTFGILYSQVATSNALLHERKDVVFYSVFTLDLEQRA 240

Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQ 353
              + S   LP D + ++A+P P+GG L++G+N  +H      + A+ +N ++  + +  
Sbjct: 241 STTLLSVARLPSDLFHVVALPPPVGGSLLIGSNELVHVDQAGKTNAVGVNEFSRQVSAFS 300

Query: 354 ELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRL---------- 401
              +S  ++ L+      L ++    +L   TG++VL+    DGR V  +          
Sbjct: 301 MTDQSDLALRLEGCRVERLADNSGDMILILSTGNMVLIKFKLDGRSVSGISVHPVPVHAG 360

Query: 402 -DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
            DL K+      S    +GN   FLGS   DSLL+      G S LSSG           
Sbjct: 361 GDLMKS----AASSSAFLGNGEVFLGSEDADSLLL------GWSDLSSG----------- 399

Query: 461 APSTKRLR------RSSSDALQDMVNGEEL---SLYGSASNNTESAQKT---------FS 502
              TKRLR        S D   D ++ +++    LY ++ + T   ++          ++
Sbjct: 400 ---TKRLRSHKNDANDSGDVSDDNMSDDDVYEDDLYSTSPDATADGRRVSADPSSFGLYN 456

Query: 503 FAVRDSLVNIGPLKDFSYG--------LRINADASATGISKQS---NYELVEL------- 544
           F + D L+NI PL+D + G         + N  A    ++ Q    N  L+ +       
Sbjct: 457 FRINDRLLNIAPLRDITLGKPSTFDKDRKDNVSAELELVASQGSDRNGGLIAMRREIDPE 516

Query: 545 -------PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA---- 593
                       +WT   +SS G ++            H Y+I+S +      ET     
Sbjct: 517 VLASFTIDSANCVWTACVESSGGKDS------------HQYVIVSKQTNIDKEETEIFRV 564

Query: 594 ---DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFG---P 647
              DL       V+   +  TI  G L  + RV+QV +   R  D      DL      P
Sbjct: 565 DGLDLKPIKAPEVNPN-EEVTIDVGTLAKQSRVVQVLKNEVRCYDA-----DLGLAQIYP 618

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV 707
              E    S+    +S S+ DPYV +   D ++ LL  D S     V+ P  + ++ K +
Sbjct: 619 VWDE--DTSDEHPAVSASVTDPYVAILRDDSTLLLLHVDDSGDVDEVEMPDNM-AAHKWL 675

Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD 767
           SSC LY DK                TGV  +     G    Q D++  +  +   L I+ 
Sbjct: 676 SSC-LYLDK----------------TGVFASNTDTKGS--RQNDMFLFLLGQDCRLFIYR 716

Query: 768 VPNFNCVFTVD 778
           +P+   V T+D
Sbjct: 717 LPDLLLVSTID 727


>gi|169603229|ref|XP_001795036.1| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
 gi|160706354|gb|EAT88382.2| hypothetical protein SNOG_04622 [Phaeosphaeria nodorum SN15]
          Length = 1338

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 168/728 (23%), Positives = 294/728 (40%), Gaps = 144/728 (19%)

Query: 57  NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NL+V   ++++++     V  V   G  E+ N+      E     L    + A L LV  
Sbjct: 28  NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + L G V SLA +     +   R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88  FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145

Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
                 W    +   +F     +  DP  RC  +      + IL                
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
           +A+Q G      E   G+ G  + +    SS V+ L +LD  + H     F+H Y EP  
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            ++   + T A  ++ +      +  ++    K    + S   LP+D  +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316

Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
            L+VG+N  IH      +  +A+N  A +  S     ++  ++ L+      L  D    
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRL 429
           ++    G + +LT   DGR V  + +    P+    +I        T +G    F+GS  
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPV-PADNGGNILKTRASCSTNLGRGRLFIGSED 435

Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
           G+S+L+ +T                        ++ +LRR  S+  Q   + E++S    
Sbjct: 436 GESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEE 471

Query: 490 AS---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD---------- 517
                      N+T +  K             T++F V D L +I P++D          
Sbjct: 472 EEVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTE 531

Query: 518 -FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------G 560
             + G + ++    A G     N EL        ELP   G+W V+ K           G
Sbjct: 532 SLTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFG 591

Query: 561 HNADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAA 614
            +A+++  A+ D +Y  YL++S         T+V E        TE  D+   +G T++ 
Sbjct: 592 QDAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSV 649

Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
           G L    +V+QV     R  D     + +   P   E      N  +++ S ADPY+L+ 
Sbjct: 650 GVLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQ 705

Query: 675 MSDGSIRL 682
             D S+++
Sbjct: 706 REDSSVKI 713


>gi|414587800|tpg|DAA38371.1| TPA: hypothetical protein ZEAMMB73_571351 [Zea mays]
          Length = 108

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 68/98 (69%), Positives = 77/98 (78%)

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
           MVL+T D L EVTE+VDY VQG TIAAGNLFGR RVIQV+ +GAR+LDGS+MTQ+L+F  
Sbjct: 1   MVLQTGDDLGEVTETVDYNVQGSTIAAGNLFGRCRVIQVYAKGARVLDGSFMTQELNFSM 60

Query: 648 SNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
             SES   SE     S SIADPYVLL MSDGSIRLL+G
Sbjct: 61  HTSESSLNSEPLAAASASIADPYVLLKMSDGSIRLLIG 98


>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 1377

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 230/1037 (22%), Positives = 397/1037 (38%), Gaps = 175/1037 (16%)

Query: 53  GPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           G V    V    ++++Y +     GS   ++  +T+ +        + L LV  Y L G 
Sbjct: 4   GAVAAFRVAKTTLLQVYNLVNVVYGSGPGQSDEKTRSQY-------SKLVLVAEYALSGT 56

Query: 113 VESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           V  L  +     D+    ++I++A  +AK+S++E+D   H +  TS+H +E  + +H+  
Sbjct: 57  VTDLGRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISP 113

Query: 173 GRESFARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------------- 214
              + A  P  + VDP  RC  VL +G + + IL   Q G  LV                
Sbjct: 114 WTPNLAACPSQLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMGDFDSDHDEERQIDT 172

Query: 215 ------GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
                  DE     G  +     SS V+ +  L+  M H     F++ Y EP   IL+ +
Sbjct: 173 NHTAEERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQ 232

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
               +  +  +      S  ++    +    + S   LP+D +K++ +P P+GG L+VG+
Sbjct: 233 VAASSALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGS 292

Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKT 383
           N  +H      + A+ +N +A    S     +S   + L+      L  +N   LL    
Sbjct: 293 NELVHVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLN 352

Query: 384 GDLVLLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDS 432
           G + +++   DGR V  + L            +T PS        +G    F GS  GDS
Sbjct: 353 GVMAVVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SALVGRGKIFFGSEEGDS 408

Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN 492
           +L+ ++  S  + +     E   D  A+    +       DA +D +    ++  G  S 
Sbjct: 409 MLIGWSRPSAGATVPPA-PETGEDNVAELSEDEEEEDDDEDAYEDDLYATPVT-PGINSR 466

Query: 493 NTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNYELVELP 545
           NT S   T    + F + D L N+GP++D + G      + D   +  S  +  ELV   
Sbjct: 467 NTASVNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYLELVTTQ 526

Query: 546 G--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYL 579
           G                            G+ +V+ K  +      S        Y  YL
Sbjct: 527 GYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPTQSGSLPVNAGSNYDHYL 586

Query: 580 IISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARI 633
           ++S      + +++V + +    E T + ++   + RTI  G L G  RV+QV +   R 
Sbjct: 587 LLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVLKGEVRS 646

Query: 634 LD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
            D G  + Q       ++     SE  +V+  S ADPYVL+   D SI LL  D S    
Sbjct: 647 YDSGLGLAQIYPVWDEDT-----SEERSVVHASFADPYVLIIRDDSSILLLQADESGDLD 701

Query: 693 SVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ--G 750
            ++T   IES+     S +LY DK            ++LS          +G P  +   
Sbjct: 702 EIETDGIIESTT--WISGSLYQDK----------YRSFLSY---------EGTPNRKPSD 740

Query: 751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM---REALKDSETEINSSS 807
           ++   +      L IF +PN        + V     I+ T +   R   ++  TEI    
Sbjct: 741 NVLLFLLNSESKLYIFHLPNAKEPVYTAESVDLLPQILPTELPPRRTTYRECLTEI---- 796

Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
                              L      +    P+L        ++ Y+ Y          +
Sbjct: 797 -------------------LVADLGDSVSRTPYLILRSNSNELILYEPYHI-----VQST 832

Query: 868 DDPVSTSRSLSVSN------VSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
           +  +S  R L ++N      +  S L NL  S   L    R     G  C   T+F    
Sbjct: 833 EKRLSDLRFLKIANHHFPKFLPESNLGNLSDSDRQL---ARPLRALGDVCGYRTVF---- 885

Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
                 + G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++++C
Sbjct: 886 ------MPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMC 939

Query: 982 QLPSGSTYDNYWPVQKV 998
           + P  + +D  W  +K+
Sbjct: 940 RFPRNTHFDGSWAARKI 956


>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Neosartorya fischeri NRRL 181]
 gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
           protein 1
 gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Neosartorya fischeri NRRL 181]
          Length = 1400

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 181/751 (24%), Positives = 310/751 (41%), Gaps = 127/751 (16%)

Query: 57  NLVVTAANVIEIY-VVRVQEE---GSKESKNSGETKRRVLMDGISAASLELVCHYRLHGN 112
           NLVV   +V++I+ +++VQ     G+ E K++         D +    L L   Y L G 
Sbjct: 28  NLVVVKTSVLQIFSLLKVQHHLRGGTIEGKSARP-------DRVETTKLVLEREYPLSGT 80

Query: 113 VESLA---ILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
           V  +    IL+   GG       ++++LAF +AK+S++E+D   HG+   S+H +E  + 
Sbjct: 81  VVDICRVKILNPKSGG-------EALLLAFRNAKLSLVEWDPERHGISTLSIHYYERDDL 133

Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGD-------EDT 219
                  +  + G ++ VDP  RC  V  +G++ + IL   Q G  L  D       +D 
Sbjct: 134 TRSPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLAMDDYEFHLHQDD 192

Query: 220 FGS-----GGGFSAR--------IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
           F       G    ++          SS V+ L  LD  + H     F++ Y EP   +L+
Sbjct: 193 FNQVSDHVGNDLKSKDRTVYQTPYASSFVLPLTALDPSILHPVSLAFLYEYREPTFGVLY 252

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
            +  T    +  +  +   +  ++    +    + S   LP D +K++A+P P+GG L++
Sbjct: 253 SQIATSHALLPERKDSIFYTVFTLDLEQRASTTLLSVPKLPSDLFKVVALPPPVGGALLI 312

Query: 325 GANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
           G+N  +H      + A+ +N +A  + +   + +S  ++ L+      L +     LL  
Sbjct: 313 GSNELVHVDQAGKTNAVGVNEFARQVSAFSMVDQSDLALRLEGCVVEHLSDSTGDLLLVL 372

Query: 382 KTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLL 434
            +G++VL+    DGR V  + L    ++   +++ S  ++   +G+   F GS   DS+L
Sbjct: 373 SSGNMVLVHFQLDGRSVSGISLRPLPAQAGGTIMKSAASSSAFLGSGRVFFGSEDADSVL 432

Query: 435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ-DMVNGE-ELSLYGSASN 492
           + ++  S         +    ++  D        +S  D  + D+   E E    G   +
Sbjct: 433 LSWSSMSSN---PKKPRPRMSNVAEDREEASVDSQSEEDVYEDDLYTAEPETPALGRRPS 489

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYG-----------LRINADASATGISKQS---N 538
              S    + F + D L NIGPL+D + G           L  NA +    I+ Q    N
Sbjct: 490 AETSGVGVYIFQILDRLPNIGPLRDITLGKPASTVENTGRLIENACSELELIAAQGSGRN 549

Query: 539 YELV--------------ELPGCKGIWTVYHKSSRGHN--ADSSRMAAYDDEYHAYLIIS 582
             LV              +    +G+WT       G     D  R+   + EY  Y+I+S
Sbjct: 550 GGLVLMKREIEPDVAASFDAQSVQGVWTAVVALGSGAPLVPDEQRI---NQEYRQYVILS 606

Query: 583 L-------EARTMVLETADL----LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
                   ++   + +  DL      E   + D      TI  G L  +RRV+QV     
Sbjct: 607 KPEAPDKEQSEVFIADKQDLKPFKAPEFNPNNDV-----TIEIGTLSCKRRVVQVLRNEV 661

Query: 632 RILDGSYMTQDLSFG-----PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
           R    SY   D+  G     P   E    S+    +S S+ADPY+ +   D ++ LL  D
Sbjct: 662 R----SY---DIDLGLAQIYPVWDE--DTSDERMAVSASLADPYIAILRDDSTLMLLQAD 712

Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDKG 717
            S     V+   +  + K    SC LY DK 
Sbjct: 713 DSGDLDEVELDDSTRAGK--WRSCCLYWDKA 741


>gi|121925707|sp|Q0UUE2.1|CFT1_PHANO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
          Length = 1375

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 168/728 (23%), Positives = 294/728 (40%), Gaps = 144/728 (19%)

Query: 57  NLVVTAANVIEIY-----VVRVQEEGSKESKNSG-----ETKRRVLMDGISAASLELVCH 106
           NL+V   ++++++     V  V   G  E+ N+      E     L    + A L LV  
Sbjct: 28  NLIVAKNSLLQVFELKSTVTEVASGGEGEADNAAANFDTEAADVPLQRIENTAKLVLVGE 87

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + L G V SLA +     +   R +++++AF DAK+S++E+D   + L   S+H +E+P+
Sbjct: 88  FPLAGTVISLARVK--ALNTKSRAEALLVAFRDAKLSLVEWDPETYNLHTISIHYYENPD 145

Query: 167 ------WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
                 W    +   +F     +  DP  RC  +      + IL                
Sbjct: 146 VPGLAPWDAELKDTYNF-----LTADPSSRCAALKFGTHNLAILPFRQRDLAEDEYDSDN 200

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
           +A+Q G      E   G+ G  + +    SS V+ L +LD  + H     F+H Y EP  
Sbjct: 201 EAAQEGKA----ERANGANGDDAVKTPYSSSFVLPLTNLDPTLTHPVHLAFLHEYREPTF 256

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            ++   + T A  ++ +      +  ++    K    + S   LP+D  +++ +P PIGG
Sbjct: 257 GVISSSKATAASLLTHRKDILTYTVFTLDLEQKASTTLLSVPGLPYDLTQVVPLPHPIGG 316

Query: 321 VLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-- 377
            L+VG+N  IH      +  +A+N  A +  S     ++  ++ L+      L  D    
Sbjct: 317 ALLVGSNEIIHVDQAGKTNGVAVNELAKACTSFALSDQADLALRLEGCTLELLSQDTGDV 376

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRL 429
           ++    G + +LT   DGR V  + +    P+    +I        T +G    F+GS  
Sbjct: 377 MIVLNDGSIFILTFSLDGRNVSAMTIQPV-PADNGGNILKTRASCSTNLGRGRLFIGSED 435

Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
           G+S+L+ +T                        ++ +LRR  S+  Q   + E++S    
Sbjct: 436 GESVLMGWTS-----------------------TSNQLRRKQSNTAQSG-DDEDMSDVEE 471

Query: 490 AS---------NNTESAQK-------------TFSFAVRDSLVNIGPLKD---------- 517
                      N+T +  K             T++F V D L +I P++D          
Sbjct: 472 EEVDDLDDDLYNDTATTVKKITAAAAEPTAPGTYTFRVHDVLPSIAPIRDTVLHPGKDTE 531

Query: 518 -FSYG-LRINADASATGISKQSNYEL-------VELPGCKGIWTVYHKSSR--------G 560
             + G + ++    A G     N EL        ELP   G+W V+ K           G
Sbjct: 532 SLTKGEIMLSTGRGAAGAITALNRELHPTMLAQTELPSSNGVWAVHAKKQAPAGIVADFG 591

Query: 561 HNADSSRMAAYDDEYHAYLIISLE-----ARTMVLETADLLTEVTESVDYFV-QGRTIAA 614
            +A+++  A+ D +Y  YL++S         T+V E        TE  D+   +G T++ 
Sbjct: 592 QDAEAN--ASSDVDYDQYLVVSKAWEDGTESTVVYEVHGNELSETEKGDFERDEGLTLSV 649

Query: 615 GNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
           G L    +V+QV     R  D     + +   P   E      N  +++ S ADPY+L+ 
Sbjct: 650 GVLARGTKVVQVLRSEVRTYDSELGMEQII--PMEDEETGNELN--IINASFADPYLLIQ 705

Query: 675 MSDGSIRL 682
             D S+++
Sbjct: 706 REDSSVKI 713


>gi|310789917|gb|EFQ25450.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1439

 Score =  130 bits (328), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 229/1013 (22%), Positives = 381/1013 (37%), Gaps = 173/1013 (17%)

Query: 69  YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
           Y  R+ ++   ES   G     V  D      L LV  Y + G V  LA +    S+ G 
Sbjct: 66  YDRRLNDDDGLESSFLGGDGMLVRADRAVNTKLVLVAEYPIFGVVTGLARIKIQHSKSGG 125

Query: 125 DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
           +      ++++A   A++S+++++   H L   S+H +E  E       + S   GPL  
Sbjct: 126 E------ALLIATRVARLSLVQWNSEKHALEDISIHYYEKEEL------QGSPFDGPLAN 173

Query: 183 ----VKVDPQGRCGGVLVYGLQMI-ILKASQG--------------GSGLVGDEDTFGSG 223
               +  DP  RC   L +G + I  L   Q               G     +  T  + 
Sbjct: 174 YRTHLAADPGSRCAA-LSFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPPTTAAT 232

Query: 224 GGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
            G S        +S+V+ L  LD  + H     F+H Y EP   I+   +          
Sbjct: 233 NGTSNIADVPYSTSYVLPLPQLDPSLLHPVYLAFLHEYREPTFGIISSTQRRSNTLPRKD 292

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
           H +  +  L +    +    I S  NLP D +K++A+P P+GG L+VG N  IH      
Sbjct: 293 HFSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVVALPGPVGGALLVGTNELIHIDQSGK 350

Query: 337 SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYD 394
              +A+N +     +     +S   + L+  H   +  +N   L+    G L ++T   D
Sbjct: 351 PNGVAVNAFTKETTNFPLADQSDLDLRLEHCHIELMSAENGELLMVLSDGRLAIITFKID 410

Query: 395 GRVVQRLDLSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
           GR V  + +      V         S I+ +  ++FF+GS   DSL++ +T     +   
Sbjct: 411 GRTVSGVSVKPVAAEVGGNIVQCSVSTISKLSRNVFFVGSTGSDSLVLGWTRKQAQN--- 467

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
           +  K    D   +            D          +   G+ +N   S     +F V D
Sbjct: 468 ARRKTRLVDDSFEYDLEDEDMDDGDDDDLYGETTTTMIQPGATANGV-SKGGDLTFRVHD 526

Query: 508 SLVNIGPLKDFSYGLR-INAD------------------------ASATGISKQSNYELV 542
           SL++I P+KD + G +  N D                        A A  I  Q+    V
Sbjct: 527 SLLSIAPVKDMTSGKQAFNPDSEEANNSVGVVADLQLACVVGRGNAGAVAILNQNIQPKV 586

Query: 543 ----ELPGCKGIWT--VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
               E P  +G WT  V     +    D    AA   E+ A    S+  + M++   DL 
Sbjct: 587 IGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVGSEFDAS---SIYDKFMIVSKVDL- 642

Query: 597 TEVTESVDYFV-----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSY- 638
            +  E+ D +                   G T+ AG +    R+IQV +   R  DG   
Sbjct: 643 -DGYETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLG 701

Query: 639 MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPA 698
           ++Q L     + E+G+      V+S SIADPY+LL   D SI +   D +     V+   
Sbjct: 702 LSQILPM--LDEETGA---EPRVVSASIADPYLLLVRDDSSIMVAQIDNNCELEEVEKQD 756

Query: 699 AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY 758
               S K ++ C LY D                +TG    +    G P  Q +I+  +  
Sbjct: 757 DAILSTKWLAGC-LYAD----------------TTGRFAPVQTDKGTPEGQ-NIFMFLLS 798

Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
            +GAL I+ +P+ +    V    +G T++                + +   GT Q   E 
Sbjct: 799 AAGALYIYALPDLSKPVYV---AAGLTYVPPLL----------SADYAVRRGTVQ---ET 842

Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLS 878
           +  + V +L         + P+L     +  +  Y+    E  + T      V  S++L 
Sbjct: 843 LTELLVADLG----DTTTTSPYLILRHANDDLTIYEPIRLESQDKT------VGLSKTLH 892

Query: 879 VSNVSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC 936
              ++     N   +++P++    E  E P   P +      NI+G+   FL G+ P + 
Sbjct: 893 FQKIT-----NPALAKSPVEVADDEANEQPRFVPLRPC---PNINGYSTVFLPGASPSFI 944

Query: 937 MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
           +   +       L    +   +  H   C  GFIY  S+G  ++ QLP+ + +
Sbjct: 945 IKSSKSSPKVIGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADTNF 997


>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
            CBS 112818]
          Length = 1398

 Score =  130 bits (328), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 237/1027 (23%), Positives = 391/1027 (38%), Gaps = 166/1027 (16%)

Query: 57   NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            NL+V   ++++++ +     GS  +    +  R    D    A L L   Y + G +  L
Sbjct: 28   NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117  AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
              +    + +    D+I+++  +AK+S++E+D   HG+   S+H +E  E  H+      
Sbjct: 85   QRVRISNSKSGG--DAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141

Query: 177  FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
                P  + VDP G C  +  +G+  + IL   Q G  LV D+               D 
Sbjct: 142  LGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSDA 200

Query: 220  FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
              S  G +A  +    S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201  QKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260

Query: 275  SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
              +      S  ++    +    + +   LP D +K++ +P P+GG L++G N  +H   
Sbjct: 261  IDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320

Query: 334  QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
               + A+ +N +A    +     +S   + L+      L +     LL    G + +L+ 
Sbjct: 321  AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380

Query: 392  VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
              DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+ ++  
Sbjct: 381  KVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSRP 436

Query: 441  SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
            S T+   S  K   G  E+ A           D   D +  ++L     AS   E     
Sbjct: 437  SSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLHSVNPASIRQEKQVVN 494

Query: 497  --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
              +   F+F   D L ++GP +D + G    + +     S  +    +EL   +G     
Sbjct: 495  GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKSG 554

Query: 551  -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
              TV  +    +  DS +M   DD Y  + I  ++ +   T +  + D   LL +     
Sbjct: 555  GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTRLSRSYDKYLLLAKAKGDD 611

Query: 601  --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
              ESV Y V                +  T+  G L    RV+QV     R  D  Y    
Sbjct: 612  KEESVVYSVGSSGLDSIDAPEFNPNEDCTVDIGTLATGTRVVQVLRTEIRSHD--YNLGL 669

Query: 643  LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
                P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ  AA 
Sbjct: 670  AQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726

Query: 701  ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYES 760
              S K VS C LY DK           + + S    E     + GP    +I   +    
Sbjct: 727  --SGKWVSGC-LYEDK----------MNIFFSDFDIE----NEAGP----NILLFLLDVD 765

Query: 761  GALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
            G L IF +PN +  +  VD                 L  S     SSS        +E +
Sbjct: 766  GNLSIFRLPNISEPLCRVDNL--------------NLLPSNLPYESSSRRPV---NRETL 808

Query: 820  HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
              + + +L      A H  P++        ++ Y+ Y   G    S         R L  
Sbjct: 809  TELLIADLG----DAIHKSPYMILRTKHDDLVLYEPYRIAGESGHSG-------LRFLKA 857

Query: 880  SN--VSASRLR---NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPC 934
             N  V   R     N   +R+P            + C+ +    ++ G++  F+SG  PC
Sbjct: 858  VNHVVMGPRTDQGVNHDINRSP------------SSCKLLRALPDVCGYKTVFMSGHNPC 905

Query: 935  WCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
            + ++     R H     G  V + +  H   C  GF YV    ++++ +LPS + +D+ W
Sbjct: 906  F-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGFAYVDEDNVIRMSRLPSNTRFDSGW 964

Query: 994  PVQKVVF 1000
              +K+  
Sbjct: 965  ATRKIAL 971


>gi|390599704|gb|EIN09100.1| hypothetical protein PUNSTDRAFT_67240 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 1439

 Score =  130 bits (327), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 203/924 (21%), Positives = 364/924 (39%), Gaps = 138/924 (14%)

Query: 97  SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
           + A L LV  +RLHG V  L  +    + N    D ++++FEDAKI+VLE+ +  H L  
Sbjct: 118 TVARLRLVREHRLHGMVTGLGRIKILSSLNDGL-DRLLISFEDAKIAVLEWSEEQHDLLT 176

Query: 157 TSMHCFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
            S+H +E +P+ + L     S   G L +VDP  RC  + +      I+   Q       
Sbjct: 177 VSIHTYERAPQLMSLN---ASLFHGWL-RVDPISRCAALALPCDAFAIIPFHQTLE---- 228

Query: 216 DEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
                       A    S +++L    D  + +V D  F+ G+  P + +L +   TW G
Sbjct: 229 -----------EAPYAPSFILDLTSEVDQRIHNVVDMSFLPGFNNPTVAVLFQPTQTWTG 277

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYH 332
           R++    T  +   ++    + +P+I S  NLP+D   + A  + +GGV+V+ +N+I + 
Sbjct: 278 RLTEYKDTMKLLVFTLDAVTRNYPVITSVDNLPYDCLSVHACSAAVGGVIVITSNSIIHV 337

Query: 333 SQSA-SCALALNNYAVSLDSSQELP----RSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
           SQS+   AL++N +A  +      P     ++ ++ L+ +   ++ +    L  K G + 
Sbjct: 338 SQSSRRVALSVNGWASRVTDMSLAPVQAEYATRNLALEGSRLAFVDDRTFFLFLKDGTVY 397

Query: 388 LLTVVYDGRVVQRLDLSKT-NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
            + +  DG VV  + +      S + + +T +     F+GS  G S+L++ T        
Sbjct: 398 PVELSLDGAVVSTISMGHALAQSAIPAVVTPVTQEHIFVGSTAGTSVLLKIT-------- 449

Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDM-------VNGEELSLYGSASNNTESAQK 499
              ++EE  D  +DA +   +  + S  + D        +  +  SL    +N T  + K
Sbjct: 450 --SVEEEVEDNASDAVAAAVVDTADSMVMDDDDDIYGVSMKTDAQSLSNGHANGTHLSVK 507

Query: 500 TFS---FAVRDSLVNIGPLKDFSYGLRINAD------ASATGISKQSNYELVE--LP--- 545
             S    ++ DSL   G + D S+ L  N +       +ATG      + L +  LP   
Sbjct: 508 KRSVTHLSLSDSLPGYGSISDMSFSLAKNGEKVVPELVAATGSGSMGGFTLFQRDLPART 567

Query: 546 --------GCKGIW------------TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
                   G +G+W            T Y ++     AD+  +    D   A  +     
Sbjct: 568 KRKLHAIGGGRGMWSLSLRPTVKVNGTSYERAVNPFQADNDTVVVSTDANPAPGLSRFSH 627

Query: 586 RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGS--YMTQ 641
           RT   E          S+   V G+TI A   F R  ++ V     R+L  DGS   + +
Sbjct: 628 RTPRTEI---------SITTRVPGQTIGAAPFFQRTAILHVMSNAIRVLEPDGSERQVIK 678

Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI- 700
           DL                 +   SI DP+VL+   D +I L +G+     +  +  + + 
Sbjct: 679 DLD---------GNMARPKIRHCSICDPFVLIVREDDTIGLFIGESERGKIRRKDMSPMG 729

Query: 701 ESSKKPVSSCTLYHDKGPEPWLRKTSTDA---WLSTGVGEAIDGADG-----------GP 746
           + + + ++ C    + G      + + ++     +T   + +  AD            G 
Sbjct: 730 DKTSRYLTGCFFTDNAGVFDLRSQANGNSGADKTATSTLQGVVNADSRSQWLLLVRPQGV 789

Query: 747 LDQGDIYSVV-CYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINS 805
           L+  D+  +  C      +I+ +P  + VF+V    +    + D+    AL         
Sbjct: 790 LEASDLSPIPGCRRLNEKQIWTLPKLSIVFSVRLASTLDWVLADSGDGPAL--------- 840

Query: 806 SSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTS 865
            S  G    R +    + V +  +        +P L   L  G +  YQA     P   S
Sbjct: 841 -SMPGESPRRPQE---LDVEQAVIAPLGETAPQPHLLLFLRSGQLAIYQAI----PMQAS 892

Query: 866 KSDDPVS-TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ 924
             D+ +S  S  +  + V+       R   +       ++         +T     +   
Sbjct: 893 SVDESLSRPSLGVRFAKVATRVFEIQRQDDSEKSILAEQKKISRVLIPFLTSPSPTTTFS 952

Query: 925 GFFLSGSRPCWCMVF-RERLRVHP 947
           G F +G  PCW +   R  +R+HP
Sbjct: 953 GVFFTGDHPCWILKPDRSGIRIHP 976


>gi|440466842|gb|ELQ36086.1| hypothetical protein OOU_Y34scaffold00669g71 [Magnaporthe oryzae Y34]
 gi|440481991|gb|ELQ62520.1| hypothetical protein OOW_P131scaffold01068g7 [Magnaporthe oryzae
            P131]
          Length = 1475

 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 230/1054 (21%), Positives = 399/1054 (37%), Gaps = 195/1054 (18%)

Query: 57   NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
            NLVV  +++++I+  R+   + +G+ +S  +       L D  G+ A+            
Sbjct: 51   NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 110

Query: 100  ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
                 L LV  + L G +  LA +       S            D +++AF+DAK+S++E
Sbjct: 111  RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 170

Query: 147  FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
            +D     L   S+H +E  E       + S    PL      +  DP  RC   L +G +
Sbjct: 171  WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 223

Query: 201  MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
             + +   +   G +G +D                T  + G     +E      S V+ L 
Sbjct: 224  SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 283

Query: 240  DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            +LD  + H     F++ Y EP   IL    +T +  ++ K H    +  ++    K    
Sbjct: 284  NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 341

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
            I S   LP D  +++A+P+P+GG L+VG+N  IH      +  +A+N    S  S     
Sbjct: 342  ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 401

Query: 357  RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNP------ 408
            +S   + L+      L  +         D  L T+V+  DGR V  L +    P      
Sbjct: 402  QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 461

Query: 409  -SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
                 S +T +G +  F GS  GDS++  +          S  K +  D + D       
Sbjct: 462  LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQ---VSKRKPKIQDPDLDLDIDYDD 518

Query: 468  RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
                 D   D+    E +   ++++  E+      F V D +V+I P++D ++G      
Sbjct: 519  LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDRMVSIAPIRDVTFGKPPPPT 578

Query: 523  ---RINADASA----------TGISKQSNYELV------------ELPGCKGIWTV---- 553
               R   D +A           G  K S+  ++            E P  +G+WT+    
Sbjct: 579  DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 638

Query: 554  -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
               K  +  N +    AA +     +Y  Y+I++ E      ET+D+        +    
Sbjct: 639  PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 697

Query: 607  -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
                   G TI AG +    ++IQV +   R  DG   +TQ +     + E+G       
Sbjct: 698  TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 752

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
              S SIADPY+L+   D S  +   +  +    ++    I SS K  + C LY D     
Sbjct: 753  ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 806

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
                       S G   A       P     I   +   +GAL I+ +P+ +        
Sbjct: 807  -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 847

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                      Y+ E L      +++      G  R E I  + V +L    + + H    
Sbjct: 848  -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 895

Query: 841  LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
               IL  ++  +  Y+ Y          ++D  S ++ L +      +L N   ++ P +
Sbjct: 896  --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRL-----RKLPNPAVAKAP-E 939

Query: 899  AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
            A   E+ P  +    +    NI+G+   F+ G  P + +   +  +  P+   L    + 
Sbjct: 940  ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 996

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            A +  H   C  GFIY  S G+ ++ Q+P  +++
Sbjct: 997  AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSF 1030


>gi|403411348|emb|CCL98048.1| predicted protein [Fibroporia radiculosa]
          Length = 1437

 Score =  130 bits (326), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 202/978 (20%), Positives = 388/978 (39%), Gaps = 153/978 (15%)

Query: 57  NLVVTAANVIEIYVVRVQE------------------------EGSKESKNSGE------ 86
           N+VV  +N++ I+ VR +                         EG  E   SGE      
Sbjct: 45  NVVVARSNLLRIFEVREEPAPFSTQKEDERDRRASMRKGTEAVEGEVEMDASGEGFVNMG 104

Query: 87  ----TKRRVLMDGISAASLELVCHYRLHG---NVESLAILSQGGADNSRRRDSIILAFED 139
               T +  ++   +     L+  +RLHG    +E + I++    ++S   D ++++F+D
Sbjct: 105 SVKSTGQNGILHQPTVNRFYLIREHRLHGIVTGIEGVRIIT--SIEDSF--DRLLVSFKD 160

Query: 140 AKISVLEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPL----VKVDPQGRCGGV 194
           AKI++LE+ +++H L   S+H +E +P+ + +          PL    ++ DP  RC  +
Sbjct: 161 AKIALLEWSEAMHDLITVSIHTYERAPQLMAID--------APLFRSQLRADPLSRCAAL 212

Query: 195 LVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIF 251
            +    + IL   Q  + L  D     +          S +++L    D  +++V DF+F
Sbjct: 213 SLPKDSIAILPFYQSQAEL--DIMEHETSQARDVPYSPSFILDLSADVDTRIRNVIDFVF 270

Query: 252 VHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKL 311
           + G+  P + +L + + TW GR+     T  +   ++    + +P+I +   LPHD + +
Sbjct: 271 LPGFNSPTIAVLFQYQQTWTGRLKEYKDTVGLILFTLDLVTRHYPVITAIDGLPHDCFAM 330

Query: 312 LAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFS-------VE 363
               + +GGV+V+ +N+I Y  Q+     L ++ +   L    +LP  S S       ++
Sbjct: 331 APCSTALGGVVVLASNSIIYVDQATRRVILPVSGW---LPRISDLPIPSLSHQDQQRDLQ 387

Query: 364 LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLS-KTNPSVLTSDITTIGNSL 422
           L+ +   ++ +    +  K G +  + ++ DG+ V RL ++     + + S +  + +  
Sbjct: 388 LEGSQFVFVDDRTLYVVLKDGTVYPVEIIVDGKTVSRLSMAPPVARTTMPSLVRKMQDDY 447

Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD--APSTKRLRRSSSDALQDMVN 480
            F+GS +G S+L++ T          G   E   + A   AP+         D       
Sbjct: 448 LFVGSIIGPSVLLKTT---RVEEDIEGDDVEMASVPATVVAPNNAMDLDDDDDLYGGSAV 504

Query: 481 GEELSLYGSASNNTESAQK---TFSFAVRDSLVNIGPLKDFSYGLRINAD------ASAT 531
            E+  + G   N + +  K       +  DSL   GP+ D ++ L  N D       +AT
Sbjct: 505 IEQPHMNGITQNGSTAISKKRTVVQLSFCDSLPAYGPIADMTFTLAKNGDRAVPELVAAT 564

Query: 532 GISKQSNYELVE--LP-----------GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
           G      + L++  LP           G +GIW++  + +   N  +    A  + YHA 
Sbjct: 565 GSGMLGGFTLLQRDLPTRTKRKMHAIGGGRGIWSLLVRQAVKVNGSTYERPA--NPYHAE 622

Query: 579 ---LIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
              ++IS +A     +   A    +    +   + G TI A   F    ++ V      +
Sbjct: 623 NDSIVISTDANPSPGLSRIASRNAQGDIQITTRIPGTTIGAAPFFQGTAILHVMINVTNV 682

Query: 634 LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
           +    +  D +      +         +   SI DP+VL+   D SI L +G+     + 
Sbjct: 683 I--RVLEPDGTERQVIKDWDGNIPRPKIRFCSICDPFVLIIRDDDSIGLFIGESERGKIR 740

Query: 694 VQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDI 752
            +  + + E + + ++ C  + D      + + +  A +           + G   Q   
Sbjct: 741 RKDMSPMGEKTSRYLAGC-FFTDTSGIFQVHQNAQAAGIEGATSTLQSVMNAGNRTQ--- 796

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++C   G +EI+ +P     F+        + + D Y   AL        S  ++   
Sbjct: 797 WLILCRPQGVIEIWTLPKLGLAFSTTHAAGLESVLTDLYDPPAL--------SVPQDPPR 848

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           + ++ +I  + V  L          RP L   L  G +  Y+ +      +T    +P+ 
Sbjct: 849 KPQELDIEQLLVAPLG-----ESSPRPHLMLFLRSGQLAVYEVH------STPVPAEPLP 897

Query: 873 TSRS--LSVSNVSA-SRLRNLRFSRTPLDAYTREET-------PHG---APCQRITIFKN 919
            +RS  L V  V   SR  N++ S     +   E+        P     +P Q  +    
Sbjct: 898 AARSSTLLVKFVKVLSRAFNIQHSDEVEKSVLAEQKRISHLLIPFATSPSPGQTFS---- 953

Query: 920 ISGHQGFFLSGSRPCWCM 937
                G FL+G RP W +
Sbjct: 954 -----GVFLTGDRPSWLL 966


>gi|378734083|gb|EHY60542.1| histone H2A [Exophiala dermatitidis NIH/UT8656]
          Length = 1361

 Score =  130 bits (326), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 205/976 (21%), Positives = 374/976 (38%), Gaps = 170/976 (17%)

Query: 97  SAASLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLR 155
           S   L LV  Y L G + SL  +      NS+   D++++AF DAK+S++E+D ++H + 
Sbjct: 49  SETKLVLVAEYNLAGTITSLGRVK---IPNSKSGGDAVLVAFRDAKLSLIEWDPALHSIS 105

Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
             S+H +E  +   +    +       + VDP  RC         + I+   Q    L  
Sbjct: 106 TLSIHYYEHHDLQSIPWQPDLSKCVSHLTVDPSSRCAAFNFGVSNLAIIPLHQVRDELAM 165

Query: 216 DE---------DTFGSGGGFSARIES-------SHVINLRDLD--MKHVKDFIFVHGYIE 257
           DE         +     G    + +S       S V+ L  LD  + H  D  F+H Y +
Sbjct: 166 DEFDEVDGEVKERLSPDGQNENKHDSPDTPFKPSFVLPLTALDPGLLHPVDMAFLHEYRD 225

Query: 258 PVMVILHERELTWAGRVSWKHH----TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA 313
           P + IL+    + A R S  +H      + +  ++    K    + S   LP+D Y+++A
Sbjct: 226 PTVGILY----STAARSSNMNHERRDVTIYAVYALDIGQKASTALQSVQKLPNDLYRVMA 281

Query: 314 VPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL 372
           +P P+GG L++G N  IH      + A+A+N  A    S      +++ ++L+      L
Sbjct: 282 LPPPVGGALLIGGNELIHIDQSGKTIAIAVNELAKEASSFPMADHANYRLKLEGCQIEHL 341

Query: 373 QNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV-------LTSDITTIGNSLF 423
            N     L+  KTG+L LL+   DGR+V  + L +             ++  T +G++  
Sbjct: 342 GNPSGDMLVILKTGELALLSFRMDGRMVSSMALRRVGEGQSQGLALGASTCSTNLGSNRL 401

Query: 424 FLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-----LQDM 478
           F+GS   DS+L+    G  T+ L                +  R++  + DA      ++ 
Sbjct: 402 FIGSEESDSILL--ATGRKTTQLRR--------------TNSRIQSQADDAGLFDDNEED 445

Query: 479 VNGEELSLYGSASN----NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
              +E  LY   ++    N  +     +F + D L +I P+ D +        A  + ++
Sbjct: 446 GIEDEDDLYAELADELNGNASTDVSGHNFRLLDRLPSIAPINDVALANVGKRRAEESEVT 505

Query: 535 KQ-----------------------SNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
           +Q                       S    ++     G+W  +   +RG         A 
Sbjct: 506 RQELAVAYGRGHAGGLAFLSRKLEPSVTRQIKFERPIGVW-CFSSGNRGQQ------GAE 558

Query: 572 DDEYHAYLIISL-----EARTMVLETAD-LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
           ++ +   ++IS        RT +L   D  L  + ES      G  I    L      IQ
Sbjct: 559 EENFDDLVMISQTTDDGAGRTKLLRLIDGDLNSMGESEFDESAGAAIGVFKLEATNHTIQ 618

Query: 626 VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
           V     R+ D  +    + F   + E G   + +  + VS  DPY+++   DGS+ LL  
Sbjct: 619 VLPTELRVYDAGFALSQI-FPIVDEEEG---QTARAVKVSFVDPYLVVVKDDGSMSLLKA 674

Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGG 745
           D +     V+ P  + +    + S TLY D           TD    T           G
Sbjct: 675 DKAGELDEVELPENLRAWS--ILSATLYQD-----------TDDMFQTSRFY------NG 715

Query: 746 PLDQGDIYSVVCYESGALEIFDVPNFNC-VFTVDKFVSGRTHIV-DTYMREALKDSETEI 803
               G I +++  + G   +  +PN +  VF  D      TH++ D  + +  ++     
Sbjct: 716 TATPGPILTILT-QDGHFCLLSLPNVSIQVFQCDSLPFLPTHLMQDLQLPKHWRN----- 769

Query: 804 NSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPEN 863
                       K+++  + + +L     ++   +P+L      G ++ Y+++       
Sbjct: 770 ------------KDDLGEVLLADLG----NSTDRQPYLVVRNLVGDVIIYESFAMP---- 809

Query: 864 TSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGH 923
                D + + R   V   +A  L +             EE    +  Q +    N++GH
Sbjct: 810 -----DVLGSFRFKKVFTKAAGELED------------GEEVGQPSTLQPMQAVTNVAGH 852

Query: 924 QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
              F+ G +P   M     +    +L    + +   +H   C  G + V +   +K C +
Sbjct: 853 ASVFIPGRQPLLIMREASTMPRVYELNPTKLKSMNSVHTGTCRQGLVLVDADDEIKFCNI 912

Query: 984 PSGSTYD-NYWPVQKV 998
           P  +    + W +++V
Sbjct: 913 PDSTVLGLSDWVIRRV 928


>gi|389641257|ref|XP_003718261.1| cft-1 [Magnaporthe oryzae 70-15]
 gi|351640814|gb|EHA48677.1| cft-1 [Magnaporthe oryzae 70-15]
          Length = 1452

 Score =  129 bits (325), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 230/1054 (21%), Positives = 399/1054 (37%), Gaps = 195/1054 (18%)

Query: 57   NLVVTAANVIEIYVVRV---QEEGSKESKNSGETKRRVLMD--GISAA------------ 99
            NLVV  +++++I+  R+   + +G+ +S  +       L D  G+ A+            
Sbjct: 28   NLVVAKSSLLQIFATRLVPAELDGTSQSAKATHNYDTKLNDDEGLEASFLGGDAAIIRSD 87

Query: 100  ----SLELVCHYRLHGNVESLAILSQGGADNSRRR---------DSIILAFEDAKISVLE 146
                 L LV  + L G +  LA +       S            D +++AF+DAK+S++E
Sbjct: 88   RNHTKLVLVAEFPLSGTITGLARVKANATKTSNGNGAGSSSSGGDFLLIAFKDAKLSLVE 147

Query: 147  FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQ 200
            +D     L   S+H +E  E       + S    PL      +  DP  RC   L +G +
Sbjct: 148  WDPDRRSLETISIHYYEQNEL------QSSPWAAPLSDYVNFLVADPGSRCAA-LKFGAR 200

Query: 201  MIILKASQGGSGLVGDED----------------TFGSGGGFSARIES-----SHVINLR 239
             + +   +   G +G +D                T  + G     +E      S V+ L 
Sbjct: 201  SLAIIPFKQADGDIGMDDWDEELDGPRPAQEKPATAATNGTTDNVVEDTPYTPSFVLRLP 260

Query: 240  DLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
            +LD  + H     F++ Y EP   IL    +T +  ++ K H    +  ++    K    
Sbjct: 261  NLDPALLHPVHLAFLYEYREPTFGILSS-NITPSTYLARKDH-LTYTVFTLDLQQKASTT 318

Query: 298  IWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP 356
            I S   LP D  +++A+P+P+GG L+VG+N  IH      +  +A+N    S  S     
Sbjct: 319  ILSVGGLPKDLTRVIALPAPVGGALLVGSNELIHIDQSGKANGVAVNPMTKSCTSFSLAD 378

Query: 357  RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLSKTNP------ 408
            +S   + L+      L  +         D  L T+V+  DGR V  L +    P      
Sbjct: 379  QSDLGLRLEGCMINVLSAEDGQFIIVLNDGRLATLVFHIDGRTVSGLKIKMVAPEAGGQL 438

Query: 409  -SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
                 S +T +G +  F GS  GDS++  +          S  K +  D + D       
Sbjct: 439  LQTSVSCLTRLGRNALFAGSDRGDSVVFGWNRKHNQ---VSKRKPKIQDPDLDLDIDYDD 495

Query: 468  RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL----- 522
                 D   D+    E +   ++++  E+      F V D +V+I P++D ++G      
Sbjct: 496  LEDDEDDDDDLYADTEKTKATTSASTGETKTDDLIFRVHDLMVSIAPIRDVTFGKPPPPT 555

Query: 523  ---RINADASA----------TGISKQSNYELV------------ELPGCKGIWTV---- 553
               R   D +A           G  K S+  ++            E P  +G+WT+    
Sbjct: 556  DAERNTKDPAAVQSELQLVAVVGRDKASSLAIINREMTPVSIGRFEFPEARGLWTLSTQK 615

Query: 554  -YHKSSRGHNADSSRMAAYDD----EYHAYLIISLEARTMVLETADLLTEVTESVDYF-- 606
               K  +  N +    AA +     +Y  Y+I++ E      ET+D+        +    
Sbjct: 616  PLPKPLQASNKNPKTAAATESILSAQYDQYMIVAKEDDDG-FETSDVYALTAAGFETLSG 674

Query: 607  -----VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
                   G TI AG +    ++IQV +   R  DG   +TQ +     + E+G       
Sbjct: 675  TEFEPAAGFTIEAGTMGDHTKIIQVLKSEVRCYDGDLGLTQIIPM--LDEETG---HEPR 729

Query: 661  VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
              S SIADPY+L+   D S  +   +  +    ++    I SS K  + C LY D     
Sbjct: 730  ATSASIADPYLLIIRDDSSAFIAHVNEDSEIEEIEKEDKIISSTKWSTGC-LYAD----- 783

Query: 721  WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF 780
                       S G   A       P     I   +   +GAL I+ +P+ +        
Sbjct: 784  -----------SKGAFAATQQTAKSPKSTPTIMMFLLSAAGALYIYALPDIS-------- 824

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                      Y+ E L      +++      G  R E I  + V +L    + + H    
Sbjct: 825  -------RPVYVAEGLCYVPPYLSADYSARKGMAR-ETISEILVTDLGDTVFKSPH---- 872

Query: 841  LFAIL--TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD 898
               IL  ++  +  Y+ Y          ++D  S ++ L +      +L N   ++ P +
Sbjct: 873  --VILRHSNHDLTIYEPYRI--------AEDSQSLTKILRL-----RKLPNPAVAKAP-E 916

Query: 899  AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIV 955
            A   E+ P  +    +    NI+G+   F+ G  P + +   +  +  P+   L    + 
Sbjct: 917  ATNSEDPPLMSRNMPLRACANIAGYSAVFMPGHSPSFLI---KSAKATPKVIGLRGSGVR 973

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            A +  H   C  GFIY  S G+ ++ Q+P  +++
Sbjct: 974  AMSSFHTEGCERGFIYADSAGVARVAQIPKDTSF 1007


>gi|156040479|ref|XP_001587226.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980]
 gi|154696312|gb|EDN96050.1| hypothetical protein SS1G_12256 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 1447

 Score =  129 bits (323), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 184/745 (24%), Positives = 307/745 (41%), Gaps = 153/745 (20%)

Query: 57  NLVVTAANVIEIYV-----VRVQEEGSKES---KNSGETKRRVL-MDGISAA-------- 99
           NLVV  A++++I+      V + E   K+S   K+   T  R    DG+ A+        
Sbjct: 28  NLVVAKASLLQIFTTKTVSVDLDELSGKDSSTVKDVTSTDPRAHDEDGVEASFLGADSIL 87

Query: 100 ---------SLELVCHYRLHGNVESLA----ILSQGGADNSRRRDSIILAFEDAKISVLE 146
                     L L+  Y L G V SL     I S+ G +      ++++ F+DAK+S++E
Sbjct: 88  PRSELARTTKLVLIAEYNLSGTVTSLVRVKTISSKTGGE------ALLVGFKDAKLSLVE 141

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           +D    G+   S+H +E  E                + VDP  RC  +      + IL  
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201

Query: 207 SQGGSGLVGDED--------------TFGSGGGFSARIESSHVINLRDLDMKHV--KDFI 250
            Q     + D D              +    G  +    SS V+ L  LD   +      
Sbjct: 202 KQDEDVNMDDWDEELDGPRPAKISQKSAAENGILATPYGSSFVLRLSSLDPSLIFPIHLE 261

Query: 251 FVHGYIEPVMVILHERELTWAGRVSWK--HHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
           F++ Y EP   IL       +  +  +  H T M+  L I    K    I S   LP+D 
Sbjct: 262 FLYEYREPTFGILSSTMAPSSALLQERKDHLTYMVFTLDIHQ--KASTTILSVGGLPYDL 319

Query: 309 YKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           + ++ +  P+GG L+VGAN  IH      +  +A+N +A    +   L +S  ++ L+  
Sbjct: 320 FMIVPLAPPVGGALLVGANELIHIDQAGKANGVAVNMFAKQCTNFSLLDQSDLALRLEGC 379

Query: 368 HATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS----VLT---SDITTI 418
               L  +N   L+   +GD+ +L+   DGR V  L + + +      +LT   S ++++
Sbjct: 380 KIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAELGGDILTGAASCVSSL 439

Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
           G    F+GS + DS+++ ++  SG                   PS ++ R  SS A+ D+
Sbjct: 440 GAGALFVGSEVSDSVILGWSRKSGQ------------------PSRRKSRLDSS-AIADV 480

Query: 479 ----------------VNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
                           + G+  ++  +A+N T S  K   ++F++ DS+VNI P+ + ++
Sbjct: 481 DEAMLDEEDLEDDDDDLYGDGPTISPTAANVTASNSKAGDYTFSIHDSMVNIAPITNITF 540

Query: 521 G-----------LRINADAS------ATGISKQSNYELV------------ELPGCKGIW 551
           G           L++N   S      A G  K  +  ++            ELP  +GIW
Sbjct: 541 GEVALSSDKEEELKLNGVQSELQLLAAVGREKGGSLAVINRNIQPNVIGRFELPEARGIW 600

Query: 552 TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
           T+  K  + +G   +  +         D +Y   +I+S   EA   + E+A     +   
Sbjct: 601 TMSAKKPAPKGLQVNKEKTVIGGDYGVDAQYDRLMIVSKASEAEDAIDESAVYALTNAGF 660

Query: 598 EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
           E     ++    G TI AG L    RVIQV +   R  DG   + Q L     + E+G+ 
Sbjct: 661 EALSGTEFEPAAGSTIEAGTLGNGMRVIQVLKSEVRSYDGDLGLAQILPM--LDDETGA- 717

Query: 656 SENSTVLSVSIADPYVLLGMSDGSI 680
                ++S S ADP++LL   D SI
Sbjct: 718 --EPKIISASFADPFLLLIRDDASI 740



 Score = 46.2 bits (108), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 26/119 (21%), Positives = 53/119 (44%), Gaps = 1/119 (0%)

Query: 872 STSRSLSVSNVSASRLRNLRFSRTP-LDAYTREETPHGAPCQRITIFKNISGHQGFFLSG 930
           STS +L  S +   ++ N   ++ P + A  + +       + +    N+ G+   F+ G
Sbjct: 879 STSPNLLSSTLQFLKIHNTHLAQAPDVSAEEQADETQQTSDKPMRAVSNLGGYSVVFMPG 938

Query: 931 SRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
             P + +   + L     L    +   +  H   C+ GFIY  ++GI+++ Q P  +T+
Sbjct: 939 GSPSFIVKSSKTLPKVLSLQGTGVRGLSSFHTEGCDRGFIYADTEGIVRVAQFPPTTTF 997


>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
           protein 1
 gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
          Length = 1383

 Score =  128 bits (322), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 210/1019 (20%), Positives = 398/1019 (39%), Gaps = 178/1019 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           +L+V   ++++IY +  +     E  ++ +   ++L++            Y L G V  L
Sbjct: 28  DLIVVRTSLLQIYSLH-KVASHAEGADAQQESTKLLLEK----------EYSLSGTVTGL 76

Query: 117 ----AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                + S+ G +      ++++AF +AK+S++E+D    G+   S+H +E  +      
Sbjct: 77  CRVKVLNSKSGGE------AVLVAFRNAKLSLIEWDPERRGISTISIHYYERDDLTRSPW 130

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS--------- 222
             +    G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS         
Sbjct: 131 VPDLNNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGISTD 187

Query: 223 ---GGG-----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
              GGG           +      S V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 188 HDLGGGTVADKAKEGIVYQTPYAPSFVLPLTTLDPSILHPISLAFLYEYREPTFGILYSQ 247

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
             T +  +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+
Sbjct: 248 VATSSALLPERKDVVFYTVFTLDLEQQASTVLLSVSRLPSDLFRVVALPPPVGGALLIGS 307

Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
           N  +H      + A+ +N ++  + S     +S  ++ L+      L +     LL   T
Sbjct: 308 NELVHIDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLTT 367

Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI-------TTIGNSLFFLGSRLGDSLLVQ 436
           G++ ++    DGR V  + +         + I       T IG+   FLGS  GDS+L+ 
Sbjct: 368 GEMAIVKFKLDGRSVSGISVHLLPAHAGLTSIYSAAAASTFIGDGKIFLGSEDGDSVLLG 427

Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNT 494
           ++  S ++       ++  D  AD        +S  D  +D +     + +L G   +  
Sbjct: 428 YSYSSSSTKKHRLQAKQVIDDSADMSEED---QSDDDVYEDDLYSTSPDTTLTGRRPSGE 484

Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
            SA   + F + D L+NIGPL+D + G R++ +   TG    S    +++   +G     
Sbjct: 485 SSAFGLYDFRIHDKLINIGPLRDITMGKRLSTNLEKTGDRTNSTSPELQIVASQGSHKSG 544

Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLL 596
              V  +    H   S  + + D  + A L    EA             R  V+ T    
Sbjct: 545 GLVVMAREIDPHVVASISLESVDCIWTASLTREEEAVSGTSEKMGQQSQRCYVIATEVKG 604

Query: 597 TEVTESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-M 639
           ++  ES+ + V G                 TI+ G    R+RV+QV +   R  D    +
Sbjct: 605 SDREESLIFVVDGHDLKPFRAPDFNPNEDVTISVGTQESRKRVVQVLKNEVRSYDFDLSL 664

Query: 640 TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAA 699
           TQ       ++     ++    +S S+AD  + +   D ++  L  D S     V     
Sbjct: 665 TQIYPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVFGED 719

Query: 700 IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYE 759
           + S K    SC LY DK                TG+  +ID     P+ + D++  +   
Sbjct: 720 VASGK--WISCCLYSDK----------------TGMFSSIDRTLSEPV-KNDMFLFLLSH 760

Query: 760 SGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
              L ++ V +   + ++ +   G + ++                 SSE     G +EN+
Sbjct: 761 DCKLFVYRVRD-QKLLSIIEGTDGLSPLL-----------------SSEPPKRSGTRENL 802

Query: 820 HSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSV 879
               V +L  + WSA    P+L        ++ Y+ ++             VST     +
Sbjct: 803 IEAIVADLG-ETWSAS---PYLILRSETDDLIIYKPFV-------------VSTGPVEGI 845

Query: 880 SNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
            ++  S+  N    R P    + + +      + + I  +ISG    F+ G+   + +  
Sbjct: 846 HSLKFSKETNSVLPRIPPGVSSTQPSGSDYRARPLRILPDISGLSAVFMPGASAGFII-- 903

Query: 940 RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
                        S   F  L   N        +    ++ C+LP  + +D  W +++V
Sbjct: 904 ---------RTSASAPHFLRLRGEN--------SRSSTVRFCKLPPMTRFDYQWTLKRV 945


>gi|380488833|emb|CCF37111.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
          Length = 1062

 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 226/1013 (22%), Positives = 390/1013 (38%), Gaps = 173/1013 (17%)

Query: 69  YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL----SQGGA 124
           Y  R+ ++   ES   G     V  D      L LV  Y + G V  LA +    S+ G 
Sbjct: 66  YDHRLNDDDGLESSFLGGDGMLVRADRAINTKLVLVAEYPIFGIVTGLAKIKLQYSKSGG 125

Query: 125 DNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPL-- 182
           +      ++++A   A++S++++D   H L   S+H +E  E         S   GPL  
Sbjct: 126 E------ALLIATRVARLSLVQWDPEKHALEDISIHYYEKEEL------EGSPFDGPLNN 173

Query: 183 ----VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES------ 232
               +  DP  RC  +      +  L   Q    +  D+      G   A+  S      
Sbjct: 174 YRTHLAADPGSRCAALRFGPRYIAFLPFKQADEDIDMDDWDEDVDGPRPAKEPSATAATN 233

Query: 233 ------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+V+ L  LD  + H     F+H Y EP   I+   +          H
Sbjct: 234 GTSNIADVPYSTSYVLPLPQLDPSLLHPVHLAFLHEYREPTFGIISSTQRRSNTLPRKDH 293

Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSAS 337
            +  +  L +    +    I S  NLP D +K++A+P P+GG L+VG N  IH       
Sbjct: 294 FSYKVFTLDLQQ--RASTAILSVNNLPQDLFKVIALPGPVGGALLVGTNELIHIDQSGKP 351

Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDG 395
             +A+N +     +     +S   + L+  +   +  +N   L+    G L ++T   DG
Sbjct: 352 NGVAVNPFTKETTNFPLADQSDLDLRLEHCYIELMSAENGELLMILSDGRLAIITFKIDG 411

Query: 396 RVVQ----RLDLSKTNPSVLTSDITTIGN---SLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           R V     +L  ++    ++   ++TI     ++FF+G+   DSL++ +T     +    
Sbjct: 412 RTVSGVGVKLVPTEVGGGIVQCSVSTISRLSRNVFFVGTTGSDSLVLGWTRKQAQNARK- 470

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS--LYGSASNNTESAQKTFSFAVR 506
             K    D   D+           D   D + GE  +  +   A+ N  S     +F V 
Sbjct: 471 --KTRLVD---DSFEYDLEDEDMEDDDDDDLYGETTTTMIQPGATANGVSKGGDLTFRVH 525

Query: 507 DSLVNIGPLKDFSYGLRI-------------------------NADASATGISKQSNYEL 541
           DSL++I P+KD + G +                            +A A  I  Q+    
Sbjct: 526 DSLLSIAPVKDMTSGKQAFIPDSEEEKNSVGVVADLQLACVVGRGNAGAVAIVNQNIQPK 585

Query: 542 V----ELPGCKGIWTV-----YHKSSRGHNADSSRMAAYDD---EYHAYLIISLEARTMV 589
           V    E P  +G WT+       KS +G    ++ +A+  D   +Y  ++I+S +     
Sbjct: 586 VIGKFEFPEARGFWTMCVQKPVPKSLQGDKGANAAVASEFDASSKYDKFMIVS-KVDLDG 644

Query: 590 LETADLLTEVTESVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQ 641
            ET+D+        +           G T+ AG +    R+IQV +   R  DG   ++Q
Sbjct: 645 YETSDVYALTGAGFEALTGTEFDPAAGFTVEAGTMGKHMRIIQVLKSEVRCYDGDLGLSQ 704

Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
            L     + E+G+      V+S SI DPY+LL   D SI +   D +     V+      
Sbjct: 705 ILPM--LDEETGA---EPRVISASITDPYLLLVRDDSSIMVAQIDNNCELEEVEKQDDTI 759

Query: 702 SSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG 761
            S K ++ C LY D                +TG+   +    G P  Q + +  +   +G
Sbjct: 760 LSTKWLAGC-LYTD----------------TTGLFAPMQTDKGTPEGQ-NTFMFLLSAAG 801

Query: 762 ALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
           AL I+ +PN +    V    +G T+ V  ++           + +   GT Q   E +  
Sbjct: 802 ALYIYALPNLSKPVYV---AAGLTY-VPPFL---------SADYAVRRGTVQ---ETLTE 845

Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
           + V +L         + P+L     +  +  Y+    E  + T      +  +++L    
Sbjct: 846 LLVADLG----DTTATSPYLIVRHANDDLTIYEPIRLESQDKT------LGLAKTLHFQK 895

Query: 882 VSASRLRNLRFSRTPLDAYTRE--ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
           ++     N   +++P++    E  E P   P +      NI+G+   FL G+ P   +  
Sbjct: 896 IT-----NPALAKSPVEVADDEANEQPRFVPLRPCA---NINGYSTVFLPGASPSLIV-- 945

Query: 940 RERLRVHPQ---LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            +  +  P+   L    +   +  H   C  GFIY  S+G  ++ QLP+ S +
Sbjct: 946 -KSAKSSPKVVGLQGIGVRGMSSFHTEGCERGFIYADSEGQTRVTQLPADSNF 997


>gi|336463425|gb|EGO51665.1| hypothetical protein NEUTE1DRAFT_89273 [Neurospora tetrasperma FGSC
           2508]
          Length = 1437

 Score =  127 bits (320), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96  DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + +S   + L+      L 
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      ++    I+              D   D + GEE 
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
           +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567

Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
              A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740



 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 23/178 (12%)

Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
           KE++  + V +L        H  P+L     +  +  YQ Y  +     + +  P S S 
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840

Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
                 +   ++ N  F++ P +    ++ PH A    P +R +   NISG+   FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891

Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            P + +   +       L    + A +  H   C HGFIY  + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSY 949


>gi|358390357|gb|EHK39763.1| hypothetical protein TRIATDRAFT_48211 [Trichoderma atroviride IMI
           206040]
          Length = 1441

 Score =  127 bits (319), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 218/1028 (21%), Positives = 388/1028 (37%), Gaps = 159/1028 (15%)

Query: 57  NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
           NLVV   ++++I+ V+                         V ++   ES   G     +
Sbjct: 28  NLVVAKGSLLQIFTVKAISTELDPEFQPSQPTETETRFDRQVNDDDGLESSFLGGESMFM 87

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
             D  +   L L+    L G V  LA +     + +   ++++LA++ AK+ + E+D   
Sbjct: 88  RTDRTNNTKLVLIAEIPLAGTVIGLARVKT--KNTASGGEALLLAYKAAKMCLAEWDPKK 145

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKVDPQGRCGGVLVYGLQMIILKASQGG 210
           + L   S+H +E  E +      E F      ++ DP  RC         + IL  ++  
Sbjct: 146 NELETISIHYYEK-EEMQGSPWEEVFGEYVNYLEADPGSRCAAFKFGTRNLAILPFTRSE 204

Query: 211 SGLVG---DED-------------TFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFV 252
             L     DED               G G    A    S V+ L  LD  + H     F+
Sbjct: 205 EDLEMEDWDEDLDGPRPVKEHTAAANGDGNNVEAAYTPSFVLRLPLLDPSLLHPVHLTFL 264

Query: 253 HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLL 312
           H Y EP   +L   +   +   S  H +  +  L +    +    I S   LPHD YK++
Sbjct: 265 HEYREPTFGVLSSSQAPASSLGSKDHLSYKVFTLDLQQ--RASTTILSVTGLPHDLYKVI 322

Query: 313 AVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD--AAHA 369
           A+P+P+GG L+VG N  IH         +A+N  A    S     +S  ++ L+  A   
Sbjct: 323 ALPAPVGGALLVGQNELIHVDQSGKPNGVAINPMAKLATSFNLTDQSDLNLRLESCAIEL 382

Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVLTSDITTI---GNSL 422
             ++N   LL    G L +++   DGR V  L +    +    +++ S +T I   G + 
Sbjct: 383 LAIENGELLLILNDGRLGIISFKIDGRTVSGLGVKLVGADCGGNIIKSRVTCISRLGKNA 442

Query: 423 FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
           FFLGS   DS+++ +   S         K    D +      +       +      +  
Sbjct: 443 FFLGSETSDSVVLGW---SRKQTQEKRRKSRLIDTDLALDVDELDLEDDEEDDDLYGDDS 499

Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------INAD- 527
             +     +N         SF + D+L++I P++D + G                ++AD 
Sbjct: 500 ATTKPNQTANGGTVKSGDISFRIHDTLLSIAPIQDITCGQSAFLPDSEEATLNKGVSADL 559

Query: 528 --ASATG---------ISKQSNYELV---ELPGCKGIWTVYHK----SSRGHNADSSRMA 569
             A A G         I+++   +++   E P  +G WT+  K     S G NA ++   
Sbjct: 560 QLACAVGRGEAGSIAVINREIQPKVIGRFEFPEARGFWTMCVKKPVPKSLGTNAGAAGDY 619

Query: 570 AYDDEYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               ++  ++I++       E   +   TA     + E+      G T+ AG +  +  V
Sbjct: 620 DAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGNQMVV 679

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
           IQV +   R  +G   + Q L       +  +G+E   V S SI DPY+L+   D S+ L
Sbjct: 680 IQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDASVFL 734

Query: 683 LVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGA 742
              D +     ++   +  +S K  + C LY D                + GV +A  G 
Sbjct: 735 AQIDSNNEIEEIEKTDSGLTSTKWAAGC-LYKD----------------TKGVFQANQG- 776

Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYMREALKDSET 801
           D       ++   +   +GAL I+ +P+ +  V+  +   S   H+   ++ + +     
Sbjct: 777 DQAKKSGEEVMMFLLNTAGALHIYALPDLSKPVYVAEGLSSIPPHLSADFVAKKV----- 831

Query: 802 EINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGP 861
                         +E +  + V +L        H  P+L    +   +  Y+       
Sbjct: 832 ------------ASREALTELVVADLG----DTVHYSPYLILRHSTDDLTIYEPIRL--- 872

Query: 862 ENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNIS 921
                +D P            SA+ +        PL+   ++  P   P +   I  N+ 
Sbjct: 873 ----PTDSPTRNLSDTLFFKKSANSILAKSTVEDPLEDTAQQ--PRYVPLR---ICANVG 923

Query: 922 GHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
           G+   FL G  P + +   + +     +    +   +  +   C+ GFIY  S+GI ++ 
Sbjct: 924 GYSTVFLPGPSPAFILKSSKSVPRVVGVQGLGVRGMSTFNTEGCDRGFIYSDSEGIARVT 983

Query: 982 QLPSGSTY 989
           QLPS + +
Sbjct: 984 QLPSKTNF 991


>gi|350297359|gb|EGZ78336.1| protein cft-1 [Neurospora tetrasperma FGSC 2509]
          Length = 1437

 Score =  127 bits (319), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 156/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96  DRANSAKLVLVAEVTLPGTITGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + +S   + L+      L 
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQSDLDLRLEGCAIDVLA 393

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMLAPEAGGSVIQSRVTSLSRMGRSTVF 453

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      ++    I+              D   D + GEE 
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRIQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
           +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508 TSPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEKERNSEGVRSDLQ 567

Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
              A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740



 Score = 63.2 bits (152), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 23/178 (12%)

Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
           KE++  + V +L        H  P+L     +  +  YQ Y  +     + +  P S S 
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840

Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
                 +   ++ N  F++ P +    ++ PH A    P +R +   NISG+   FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPVDDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891

Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            P + +   +       L    + A +  H   C HGFIY  + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSY 949


>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
           [Aspergillus kawachii IFO 4308]
          Length = 1372

 Score =  127 bits (318), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 210/1020 (20%), Positives = 400/1020 (39%), Gaps = 182/1020 (17%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           +L+V   ++++IY +  +   + E  ++ +   ++L++            Y L G V  L
Sbjct: 28  DLIVVRTSLLQIYSLH-KVTSNAEGADAQQELTKLLLEK----------EYSLSGTVTGL 76

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
             +      NSR   +++++AF +AK+S++E+D     +   S+H +E  +        +
Sbjct: 77  CRVK---VLNSRSGGEAVLVAFRNAKLSLIEWDPERRSISTISIHYYERDDLTRSPWVPD 133

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGS------------ 222
               G ++ VDP  RC  +  +G++ + I+   Q G  LV D+  +GS            
Sbjct: 134 LKNCGSILSVDPSSRCA-IFNFGIRNLAIIPFHQPGDDLVMDD--YGSDLGEGMSTDHDL 190

Query: 223 GGG---------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
           GGG         +      S V+ L  LD  + H     F++ Y EP   IL+ +  T +
Sbjct: 191 GGGPDKAKEGIAYQTPYAPSFVLPLTALDPSILHPISLAFLYEYREPTFGILYSQVATSS 250

Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
             +  +      +  ++    +   ++ S   LP D ++++A+P P+GG L++G+N  +H
Sbjct: 251 ALLPERKDVVFYTVFTLDLEQQASTILLSVSRLPSDLFRVVALPPPVGGALLIGSNELVH 310

Query: 331 YHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVL 388
                 + A+ +N ++  + S     +S  ++ L+      L +     LL   TG++ +
Sbjct: 311 IDQAGKTNAVGVNEFSRQVSSFSMTDQSDLALRLENCIVECLGDSSGDMLLVLSTGEMAI 370

Query: 389 LTVVYDGRVVQRLD---------LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
           +    DGR V  +          L+  N +   S  T IG+   FLGS  GDS+L+ ++C
Sbjct: 371 MKFKLDGRSVSGISVHLLPAHAGLTSMNSAAAAS--TFIGDGKIFLGSEDGDSVLLGYSC 428

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV--NGEELSLYGSASNNTESA 497
            S +S       ++  D  AD        +S  D  +D +     + +L G   +   SA
Sbjct: 429 SSSSSKKHRLQAKQAIDDSADMSEED---QSEDDVYEDDLYSTSPDTTLTGRRPSGESSA 485

Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WT 552
              + F + D L+NIGPL+D + G ++  +    G    S    +++   +G        
Sbjct: 486 FGLYDFRMHDKLINIGPLRDITIGRKLPTNQEKGGDRTNSTSPELQIVASQGSHKSGGLV 545

Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA-------------RTMVLETADLLTEV 599
           V  +    H   S  + + D  + A L    EA             R  V+ T    ++ 
Sbjct: 546 VMAREIDPHVVASISLESVDSIWTASLTWEEEAVSRTSENIGQRSQRCYVIATEAKASDR 605

Query: 600 TESVDYFVQGR----------------TIAAGNLFGRRRVIQVFERGARILDGSY-MTQD 642
            ES+ + V G                 TI  G    R+RV+QV +   R  D    +TQ 
Sbjct: 606 EESLIFVVDGHDLKPFRAPDFNPNEDVTINIGTQESRKRVVQVLKNEVRSYDIDLGLTQI 665

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES 702
                 ++     ++    +S S+AD  + +   D ++  L  D S     V     + S
Sbjct: 666 YPIWDDDT-----NDERMAVSASLADSCLAILRDDSTLLFLQADDSGDLDEVVLGEDVAS 720

Query: 703 SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA 762
            K    SC LY DK                TG+  +ID     P+ + D++  +      
Sbjct: 721 GK--WISCCLYSDK----------------TGLFSSIDRTLSEPV-KNDMFLFLLSHDSK 761

Query: 763 LEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSM 822
           L ++ V +   + ++ + + G + ++                 SSE     G +EN+   
Sbjct: 762 LFVYRVRD-QKLLSIIEGLDGLSPLL-----------------SSEPPKRSGTRENLVEA 803

Query: 823 KVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
            V +L  + WSA    P+L     +  ++ Y+ ++             + T  +  +  +
Sbjct: 804 IVADLG-ETWSAS---PYLILRSENDDLIIYKPFV-------------IPTGPTGEIHTL 846

Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRER 942
             S+  N        D  + + +      + + I  +ISG    F+ G+           
Sbjct: 847 KFSKENNSVLPMISPDVDSTQPSGSDYRVRPLRILPDISGLSAVFMPGAS---------- 896

Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ----GILKICQLPSGSTYDNYWPVQKV 998
                         F +  + +  H F+ +  +      ++ CQLP  + +D  W ++KV
Sbjct: 897 ------------AGFVLRTSASAPH-FLRLRGESPRCSTVRFCQLPPMTRFDYQWTLKKV 943


>gi|167526060|ref|XP_001747364.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774199|gb|EDQ87831.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1324

 Score =  126 bits (317), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 84/291 (28%), Positives = 142/291 (48%), Gaps = 29/291 (9%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LEL   +RL+G   ++  ++       + RD+++L+F DAKIS ++F+ S   L    + 
Sbjct: 34  LELAASFRLNGVATAMVAITL----PKQLRDTVVLSFADAKISAIQFEPSTRTLITQKLI 89

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
             E  E ++  +        P+++ DP  RC G LVYG +++I+ A              
Sbjct: 90  NLEI-EAVYGSKVNADLP--PVLQADPLHRCIGALVYGCRLVIIPAH------------- 133

Query: 221 GSGGGFSARIESS-HVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
                   R      VI+L  L   +   K F F+ GY  P  ++LHE    W GR +  
Sbjct: 134 ----ALQPRTNVQFRVIDLEKLSSPLGQAKSFCFLTGYTTPTALLLHEPRPVWVGRHAVG 189

Query: 278 HHTCMISALS--ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
             +C++SALS  + TT    P +W+  +LP D + L+  P P+GG L+V  N + + +Q+
Sbjct: 190 RDSCVLSALSCELDTTDDFAPTVWAKDSLPSDCFALVPTPQPLGGALIVSPNMVLHTNQA 249

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
           +S A+A+N  A          ++  S+ LD A  T++ +  A+ S ++G L
Sbjct: 250 SSSAVAVNAIAARATGYPHTTQAGLSLNLDNARVTFITSVDAIFSLQSGQL 300


>gi|358387835|gb|EHK25429.1| hypothetical protein TRIVIDRAFT_32877 [Trichoderma virens Gv29-8]
          Length = 1440

 Score =  126 bits (316), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 228/1029 (22%), Positives = 401/1029 (38%), Gaps = 170/1029 (16%)

Query: 57  NLVVTAANVIEIYVVR-------------------------VQEEGSKESKNSGETKRRV 91
           NLVV   ++++I+ V+                         V ++   ES   G     +
Sbjct: 28  NLVVAKGSLLQIFTVKSISTELDPEFQPNQPAEVDTRFDRQVNDDDGLESSFLGGETMFM 87

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
             D  +   L L+    L G V  LA L       +   + ++LA++ AK+ + ++D   
Sbjct: 88  RTDRTNNTKLVLIAEIPLAGTVIGLARLKTN--KTASGGEVLLLAYKAAKMCLAQWDPKK 145

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMIILKASQ 208
           + L   S+H +E  E L      E F  G  V   + DP  RC         + IL   +
Sbjct: 146 NELETISIHYYEK-EELQGSPWEEVF--GEYVNHLEADPGSRCAAFKFGTRNLAILPFRR 202

Query: 209 GGSGLVG---DEDTFG---------SGGGFSARIESSHV------INLRDLDMKHVKDFI 250
               L     DED  G         +  G S  +E+++       + L D  + H     
Sbjct: 203 SEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTPSFVLRLPLLDPSLLHPVHLT 262

Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
           F+H Y EP   +L   +   A  +  K H       ++    +    I S   LPHD YK
Sbjct: 263 FLHEYREPTFGVLSSSQAP-AASLGLKDHLSY-KVFTLDLQQRASTTILSVTGLPHDLYK 320

Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD--AA 367
           ++A+P+P+GG L+VG N  IH         +A+N  A  + S     ++  ++ L+  A 
Sbjct: 321 VIALPAPVGGALLVGQNELIHVDQSGKPNGVAVNPMAKLVTSFSLTDQADLNLRLENCAI 380

Query: 368 HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSD---ITTIGN 420
               ++N   LL    G L +++   DGR V     RL  +    +V+ S    I+ +G 
Sbjct: 381 ELLAVENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVGADCGGNVIKSRAACISRLGK 440

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD-APSTKRLRRSSSDALQDMV 479
           + FF+GS  GDS+++ +     +   +   + +   I+ D A     L     +   D+ 
Sbjct: 441 NTFFVGSETGDSVVLGW-----SRRQTQEKRRKSRLIDPDLALEVDELDLEDDEEDDDLY 495

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR--------------IN 525
             +  +     +N   +     SF + D L++I P++D + G                ++
Sbjct: 496 GDDSAATKPQTTNGGAAKSGDLSFRIHDVLLSIAPIQDITCGQAACLPDSEEATLIKGVS 555

Query: 526 AD---ASATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAA 570
           +D   A A G  +  +  ++            E P  +G WT+  K     +  S+   A
Sbjct: 556 SDLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFWTMCVKKPVPKSLGSNVGVA 615

Query: 571 --YDD--EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
             YD   ++  ++I++       E   +   TA     + E+      G T+ AG +  +
Sbjct: 616 GDYDAPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKETEFEPAAGFTVEAGTMGKQ 675

Query: 621 RRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
             VIQV +   R  +G   + Q L       +  +G+E   V S SI DPY+L+   DGS
Sbjct: 676 MMVIQVLKSEVRCYNGDLGLIQILPM----LDEETGAEPRAV-SASIVDPYLLIIRDDGS 730

Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI 739
           + L   D +     ++      +S K V+ C LY D                + GV ++ 
Sbjct: 731 VFLAQIDSNNEIEEMEKADGGLTSTKWVAGC-LYKD----------------TKGVFQSN 773

Query: 740 DGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHIVDTYM--REAL 796
             +  G  D+G +   +   +GAL I+ +P+ +  V+  +   S   H+   ++  R A 
Sbjct: 774 LNSAAGKADEG-VMMFLLNSAGALHIYSLPDLSKAVYIAEGLSSIPPHLSAGFVARRGAT 832

Query: 797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
           +++ TEI                    V +L      + HS P+L    +   +  Y+  
Sbjct: 833 RETLTEI-------------------VVADLG----DSVHSSPYLILRHSTDDLTIYEPI 869

Query: 857 LFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI 916
                  T    D +   +S + S+++ S + +      P D     + P   P +    
Sbjct: 870 RLPTASATHALSDTLFFKKSAN-SSLAKSAVED------PSD--DTAQPPRYVPLRTCA- 919

Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
             N+ G+   FL G  P + +   + +     L    +   +  H   C+ GFIY  S+G
Sbjct: 920 --NVGGYSAVFLPGPSPAFIIKSSKSIPRVVGLQGLGVRGMSTFHTEGCDRGFIYADSEG 977

Query: 977 ILKICQLPS 985
           I ++ QLPS
Sbjct: 978 IARVTQLPS 986


>gi|353234640|emb|CCA66663.1| related to cleavage and polyadenylation specificity factor, 160 kDa
           subunit [Piriformospora indica DSM 11827]
          Length = 1324

 Score =  125 bits (314), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 215/989 (21%), Positives = 393/989 (39%), Gaps = 161/989 (16%)

Query: 55  VPNLVVTAANVIEIYVVR---VQEEGSKES--KNSGETKRRVLMDGISAASLELVCHYRL 109
           V NLVV   N + IY VR     EE   ES  K+SG ++         +  L LV  + L
Sbjct: 36  VTNLVVGRNNRLRIYDVRRTIYTEETHVESDLKSSGPSRH--------SHRLCLVREHLL 87

Query: 110 HGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
           HG +  LA +           D ++++F+D+K++++E+ ++++ +   S+H +E    L 
Sbjct: 88  HGIIIGLAAVRTANPGLGSP-DRLLVSFQDSKLALMEWSNTLYDISTVSIHSYERSPLLL 146

Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
                E  A    ++ DP  RC  +++    + +L   Q  + L  D             
Sbjct: 147 NSDFTECRA---YLRTDPANRCAALVMPRDNIALLPWYQPQTEL--DVQDGIQSIAEELP 201

Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
              S+V N+  +D  ++++ D +F+ G+  P + IL + + TW GR+        +  +S
Sbjct: 202 YSPSYVTNVSAMDERIRNILDLVFLPGFNVPTIAILFQEQRTWTGRLKENKDNTSLFFIS 261

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYA 346
           +    + + +I +   LP+D+  +    + +GGVLVV AN+I H    S    L ++ +A
Sbjct: 262 LDLVSRSYQVIATIEKLPYDSLYMSPCHAKLGGVLVVTANSILHVDQASKITTLPMSGWA 321

Query: 347 VSL-DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
             + D+S     +   + L+ +   ++ +   +LS   G  + + + ++GR V  L    
Sbjct: 322 ARVSDTSHGFQDAVDDIHLEGSRMGYISDSQVILSLSNGKCLHIRIDHEGRTVWGLTAVH 381

Query: 406 T-----NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
           T      PSVL +      + L FLGS  GDS+L ++                       
Sbjct: 382 TFGISSPPSVLIAK-----DGLVFLGSTAGDSVLFEYA---------------------- 414

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFS- 519
                          QD+ +  +  L     N +E+   +FS    D+L + G     S 
Sbjct: 415 ---------------QDLSSHRDFML----PNASETIPTSFSLLPVDNLQDSGSYTAASF 455

Query: 520 YGLRINADA---SATGISKQSNYELV----------ELP---GCKGIWTVYHKSSRGHNA 563
           +GLR + +    +A G+     +  V          +LP   G +GIW     S R H  
Sbjct: 456 FGLRGSEEPALIAANGLDDLGGFSTVHKTMPLRLRKKLPAIAGRQGIW-----SMRVHQG 510

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR----TIAAGNLFG 619
           +   +    +      ++S +A T     + + T+    +D  +  R    TIA    F 
Sbjct: 511 NGIELPLGHNT-----LLSTDA-TPTPGASRIATKSQARLDINITTRIPMLTIAVAPFFD 564

Query: 620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
              ++QV     R+L     T D S      +  + +  + +   +I DPYVL+   D +
Sbjct: 565 GTHLLQVTSNSLRLL-----TTDGSEKQVIPDRDNSTARARIRHAAICDPYVLILREDDT 619

Query: 680 IRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD-----KGPEPWLRKTSTDAWLSTG 734
           + L VG+P+   +  +  + +   K    + T Y D     K  E  +R+T         
Sbjct: 620 LGLFVGEPTRGKLRRKDMSPLGDKKLCYWAATFYDDLTGRLKIDEDLMRETK-------A 672

Query: 735 VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMRE 794
           VG           ++G+ +  +C  +G LEI+ +P    VF V     G +         
Sbjct: 673 VG-----------NRGEKWLALCRSTGTLEIWSLPKLALVF-VSSISLGPS--------- 711

Query: 795 ALK-DSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCY 853
            LK D + E++S+++     G    +  + + +L     S H     L  +     ++ Y
Sbjct: 712 VLKHDQKKEVDSATKTELPVG-ATTLQQVIITDLGEIEPSPH-----LIVLYDSNLLIVY 765

Query: 854 QAYLFEGPENTSKSDDPVSTSRSLSVSNVS-ASRLRNLRFSRTPLDAYTREETPHGAPCQ 912
           Q      P    K+  P    RS+    +S   R+ +   + TP +  T   +      +
Sbjct: 766 QMV----PLEPDKAGLPQLDRRSVPSLRISFVKRMVHHLANPTPDENQTSGGSNEKRLPK 821

Query: 913 RITIFKNISGH----QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHG 968
            I  F  +        G F++G  P W +       +H      ++ +FT     + +  
Sbjct: 822 TIVPFSVLDWEGNSIYGAFVTGDNPAWILSKNHSGLLHLPCGYEAVHSFTPCSMWDFSPT 881

Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQK 997
           F+  T +G   + Q   G T+   +P  K
Sbjct: 882 FLMSTEEGSC-LVQWTPGITFHGQYPCSK 909


>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1408

 Score =  125 bits (313), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 184/750 (24%), Positives = 304/750 (40%), Gaps = 121/750 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V    ++++Y +     GS   ++  +T+ +        + L LV  Y L G +  L
Sbjct: 28  NLIVAKTTLLQVYNLVNVVYGSSPGQSDEKTRSQY-------SKLVLVAEYALSGTITDL 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    ++I++A  +AK+S++E+D   H +  TS+H +E  + +H+     +
Sbjct: 81  GRVKI--LDSKSGGEAILVATRNAKLSLIEWDPEKHQISTTSIHYYERDD-VHISPWTPN 137

Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-------------------- 214
            A  P  + VDP  RC  VL +G + + IL   Q G  LV                    
Sbjct: 138 LAACPSHLTVDPSSRCA-VLNFGKKNLAILPFHQMGDDLVMDDFDSDHDDERQIDTNHTA 196

Query: 215 --GDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTW 270
              DE     G  +     SS V+ +  L+  M H     F++ Y EP   IL+ +    
Sbjct: 197 EERDEANKPDGPVYQTPYASSFVLPIAALEPSMLHPISLAFLYEYREPTFGILYSQVAAS 256

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-I 329
           +  +  +      S  ++    +    + S   LP+D +K++ +P P+GG L+VG+N  +
Sbjct: 257 SALLHDRKDVVFYSVFTLDLEQRASTTLLSVPRLPNDLFKVIPLPPPVGGALLVGSNELV 316

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLV 387
           H      + A+ +N +A    S     +S   + L+      L  +N   LL    G + 
Sbjct: 317 HVDQAGRTNAVGVNEFAREASSFSMADQSDLEMRLEGCVVEQLGTENCDMLLVLLNGVMA 376

Query: 388 LLTVVYDGRVVQRLDLS-----------KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
           +++   DGR V  + L            +T PS        +G    F GS  GDS+L+ 
Sbjct: 377 VVSFKLDGRSVSGIYLRPVSDQAGGAILRTKPSC----SAPVGRGKIFFGSEEGDSILI- 431

Query: 437 FTCGSGTSMLSSGLK----EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLY----- 487
                G S LS+G K     E G+   D  +         D   D  +  E  LY     
Sbjct: 432 -----GWSRLSAGAKVSPAPETGE---DNVAELSEDEEDDDDDDDEEDAYEDDLYATPVT 483

Query: 488 -GSASNNTESAQKT----FSFAVRDSLVNIGPLKDFSYGL---RINADASATGISKQSNY 539
            G    NT S   T    + F + D L N+GP++D + G      + D   +  S  +  
Sbjct: 484 PGINPRNTASMNGTSLNDYIFRIHDRLWNLGPMRDITLGRPPGSRDKDKRQSVSSLSAYL 543

Query: 540 ELVELPG--------------------------CKGIWTVYHKSSRGHNADSSRMAAYDD 573
           ELV   G                            G+ +V+ K  +  +   S  A    
Sbjct: 544 ELVTTQGYGRAGGLAILRREIDPYVIDSLMIKDTDGVRSVHVKDPKLPSQSGSLPANAGS 603

Query: 574 EYHAYLIISL-----EARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVF 627
            Y  YL++S      + +++V + +    E T + ++   + RTI  G L G  RV+QV 
Sbjct: 604 NYDHYLLLSKSKGFDKEKSVVYKMSSGGLEETRAPEFNPNEDRTIDIGTLAGGTRVVQVL 663

Query: 628 ERGARILD-GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
           +   R  D G  + Q       ++     SE  +V+  S A+PYVL+   D SI LL  D
Sbjct: 664 KGEVRSYDSGLGLAQIYPVWDEDT-----SEERSVMHASFAEPYVLIIRDDSSILLLQAD 718

Query: 687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
            S     ++T   I+S+     S +LY DK
Sbjct: 719 ESGDLDEIETDGIIKSTT--WISGSLYQDK 746



 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 16/80 (20%), Positives = 41/80 (51%)

Query: 919 NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
           ++ G++  F+ G+ PC+ +     +     L   ++ + +  +   C  GF+YV +  ++
Sbjct: 900 DVCGYRTVFMPGNSPCFIIKSATSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVV 959

Query: 979 KICQLPSGSTYDNYWPVQKV 998
           ++C+ P  + +D  W  +K+
Sbjct: 960 RMCRFPRNTHFDGSWAARKI 979


>gi|164429683|ref|XP_964609.2| hypothetical protein NCU02082 [Neurospora crassa OR74A]
 gi|157073577|gb|EAA35373.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 1437

 Score =  125 bits (313), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 154/658 (23%), Positives = 267/658 (40%), Gaps = 84/658 (12%)

Query: 94  DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHG 153
           D  ++A L LV    L G +  LA + +    +S   D ++L+F DA++S++E++   + 
Sbjct: 96  DRANSAKLVLVAEVTLPGTMTGLARIKKPSGSSSGGADCLLLSFRDARLSLVEWNVERNT 155

Query: 154 LRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGL 213
           L   S+H +E  E +             L+  DP  RC  +      + IL   Q    +
Sbjct: 156 LETVSIHYYEKEELVGSPWVAPLHQYPTLLVADPASRCAALKFSERNLAILPFKQPDEDM 215

Query: 214 VGD------------EDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHG 254
             D            +D  G+    ++ IE      S V+ L  L+  + H     F+H 
Sbjct: 216 DMDNWDEELDGPRPKKDLSGAVANGASTIEDTPYSPSFVLRLSKLEASLLHPVHLAFLHE 275

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y +P + +L   +          H T M+  L +    +    I +   LP D ++++A+
Sbjct: 276 YRDPTIGVLSSTKTASNSLGHKDHFTYMVFTLDLQQ--RASTTILAVNGLPQDLFRVVAL 333

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
           P+P+GG L+VGAN  IH      S  +A+N       S   + ++   + L+      L 
Sbjct: 334 PAPVGGALLVGANELIHIDQSGKSNGIAVNPLTKQTTSFSLVDQADLDLRLEGCAIDVLA 393

Query: 374 NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLTSDITTI---GNSLFF 424
            ++   LL    G L L+T   DGR V  L +    P    SV+ S +T++   G S  F
Sbjct: 394 AELGEFLLILNDGRLGLITFRIDGRTVSGLSIKMIAPEAGGSVIQSRVTSLSRMGRSTMF 453

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +GS  GDS+L+ +T   G +      ++    ++              D   D + GEE 
Sbjct: 454 VGSEEGDSVLLGWTRRQGQT------QKRKSRLQDADLDLDLDDEDLEDDDDDDLYGEES 507

Query: 485 SLYGSASNNTESAQK-TFSFAVRDSLVNIGPLKDFSYGLRINADAS-------------- 529
           +    A +  ++ +    +F + D L++I P++  +YG  +    S              
Sbjct: 508 ASPEQAMSAAKAIKSGDLNFRIHDRLLSIAPIQKMTYGQPVTLPDSEEERNSEGVRSDLQ 567

Query: 530 ---ATGISKQSNYELV------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDD- 573
              A G  K S   ++            E P  +G WTV  K          +    +D 
Sbjct: 568 LVCAVGRGKASALAIMNLAIQPKIIGRFEFPEARGFWTVCAKKPIPKTLVGDKGPMNNDY 627

Query: 574 ----EYHAYLIIS------LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
               +YH ++I++       E   +   TA     +T +      G T+ AG +    R+
Sbjct: 628 DTSGQYHKFMIVAKVDLDGYETSDVYALTAAGFESLTGTEFEPAAGFTVEAGTMGKDSRI 687

Query: 624 IQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           +QV +   R  DG   ++Q +     + E+G+      V + SIADP++LL   D S+
Sbjct: 688 LQVLKSEVRCYDGDLGLSQIVPM--LDEETGA---EPRVRTASIADPFLLLIRDDFSV 740



 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 77/178 (43%), Gaps = 23/178 (12%)

Query: 816 KENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSR 875
           KE++  + V +L        H  P+L     +  +  YQ Y  +     + +  P S S 
Sbjct: 791 KESVAEILVADLG----DTTHKSPYLILRHANDDLTLYQPYRLK-----ATAGQPFSKS- 840

Query: 876 SLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA----PCQRITIFKNISGHQGFFLSGS 931
                 +   ++ N  F++ P +    ++ PH A    P +R +   NISG+   FL GS
Sbjct: 841 ------LFFQKVPNSTFAKAPEEKPADDDEPHNAQRFLPMRRCS---NISGYSTVFLPGS 891

Query: 932 RPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            P + +   +       L    + A +  H   C HGFIY  + GI ++ Q+P+ S+Y
Sbjct: 892 SPSFILKTAKSSPRVLSLQGSGVQAMSSFHTEGCEHGFIYADTNGIARVTQIPTDSSY 949


>gi|312077399|ref|XP_003141287.1| hypothetical protein LOAG_05705 [Loa loa]
          Length = 316

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/266 (30%), Positives = 132/266 (49%), Gaps = 34/266 (12%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LE +   RL   V+S AI        +   DS++L F+DAK+S++  + +   L+  S+H
Sbjct: 62  LECLLAVRLLAPVQSFAI---ARIPQNPDCDSLLLGFDDAKLSIVGVNPADRSLKTISLH 118

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
           CFE      LK G       P+++VDP  RC  +LV+G  + +L  +  G+ L       
Sbjct: 119 CFEDE---LLKDGFTKNLPRPVIRVDPGQRCAAMLVFGRYLAVLPFNDSGAQL------- 168

Query: 221 GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
                       S+ + L  +D  + +V D +F+ GY EP ++ L+E   T  GR   ++
Sbjct: 169 -----------HSYTVQLSQIDSRLVNVVDMVFLDGYYEPTLLFLYEPVQTTCGRACVRY 217

Query: 279 HTCMISALSISTTLKQHPL--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA 336
            T  +  L +S  +K+  L  +W   NLP D  ++LA+P P+GG+L+V  N + Y +QS 
Sbjct: 218 DT--MCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGGILLVATNELIYLNQSV 275

Query: 337 -SCALALNNYAVSLDSSQELPRSSFS 361
             C ++LN+    +D   + P   F 
Sbjct: 276 PPCGISLNS---CMDGFTKFPLRDFK 298


>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
           protein 1
 gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
           [Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
           nidulans FGSC A4]
          Length = 1339

 Score =  124 bits (310), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 229/1008 (22%), Positives = 379/1008 (37%), Gaps = 191/1008 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++I+ +R        S ++ +T+ R          L L   Y+L G V  +
Sbjct: 28  NLIVARTSLLQIFSLR------DVSLSALDTEVRPAQHRQETCKLVLEREYQLPGTVTDI 81

Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
                + ++ G D      ++++AF DAK+S++E+D   +GL   S+H +E  +      
Sbjct: 82  CRVKILKTKSGGD------AVLVAFRDAKLSLVEWDPERYGLSTISIHYYERDDMTRSPW 135

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIE- 231
             +    G ++  DP  RC         + I+   Q G  LV D+  FGS   +  R+E 
Sbjct: 136 ASDLSTCGSILSADPGSRCAIFQFGARSLAIIPFHQPGDDLVMDD--FGSEPDYENRVEG 193

Query: 232 --------------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
                               SS V+ L  LD  + H     F++ Y EP   IL+ +  T
Sbjct: 194 NSRSHEAKDKDAAEYQTPYASSFVLPLTALDPSVIHPISLAFLYEYREPTFGILYSQVAT 253

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
               +  +      + +++    +    + S   LP D +K++A+P P+GG L++G+N  
Sbjct: 254 SHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVALPPPVGGSLLIGSNEL 313

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
           +H      + A+ +N ++    S     +S  ++ L+        +D    LL+  TG  
Sbjct: 314 VHIDQAGKTNAVGVNEFSRQASSFSMTDQSDLALRLENCVVERFSDDNGDLLLALSTGVF 373

Query: 387 VLLTVVYDGRVVQRLD---LSKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCG 440
            L++   DGR V  +    LS  +   L S  ++   +GN   F GS   DS+L+     
Sbjct: 374 ALVSFKLDGRSVSGISVRPLSGPSKEFLASTASSSAFLGNGKVFFGSESADSVLL----- 428

Query: 441 SGTSMLSSGLKEEF-GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
            G S  SS  K+ F G    D         S  DA +D +     +       N  S   
Sbjct: 429 -GWSSASSATKKSFSGSTSND--------ESEDDAYEDDLYSSAPAAMTDNPQNQPSNSS 479

Query: 500 TFSFA---VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK--GIWTVY 554
             +F    + D L + GP++D   G    A +  T   K    ELV   G    G   + 
Sbjct: 480 VAAFGDLRIHDRLSSPGPIRDIVLGRSSEASSRDT---KDGVLELVAAQGSDEGGTMVIM 536

Query: 555 HK--------SSRGHNADS----SRMAAYDDEYHAYLIISL-------EARTMVLETADL 595
            +        S     A+S    S +   +D+   Y+I+S        E+   VLE  D 
Sbjct: 537 KREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVILSKQEKPDKEESEVFVLE--DK 594

Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG 655
           L  +T          T+  G L  + RVIQV     R  D  +   D             
Sbjct: 595 LRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVRSYDAVWDEDD------------- 641

Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
           S+    ++ ++ DPY+ +   D ++ LL  D S                  +   TL  D
Sbjct: 642 SDERVAVNATLVDPYLAIIRDDSTLLLLQADDS----------------GDLDEVTLSED 685

Query: 716 KGPEPWLRKT--STDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC 773
              + WL     S +A   T    +I                +  +   L ++ +P+F  
Sbjct: 686 VVSQKWLSACFYSDNAGFFTAPFASI--------------LFLLNQDHQLYVYRLPDF-A 730

Query: 774 VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWS 833
           V +V + V     I+ T   E  K S T              +EN+  + VVEL      
Sbjct: 731 VISVIEGVGCLPPILST---EPPKRSTT--------------RENVLQIAVVELG----D 769

Query: 834 AHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFS 893
           ++ S PFL     +  ++ Y+ +     E T          R L  +N +  +  N    
Sbjct: 770 SYSSLPFLILRTENDDLVVYKPFFTNSKELTGL--------RFLKEANHTLPKTPNTT-- 819

Query: 894 RTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLC 950
               D    E  P       + I  NI+G    F+ G  P    +FR      P   +L 
Sbjct: 820 ----DELQSEMKP-------LRILPNIAGCSSIFMPG--PSAGFIFRAS-TTSPHFIRLR 865

Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            G I         + + GF Y+ S G L + +LP G+     W ++ V
Sbjct: 866 GGFIKGLGCFD--SPDKGFAYLDSHG-LHLAKLPEGTQLGYPWIMRTV 910


>gi|403170487|ref|XP_003329830.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375168746|gb|EFP85411.2| hypothetical protein PGTG_11767 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 1513

 Score =  123 bits (309), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 173/771 (22%), Positives = 308/771 (39%), Gaps = 167/771 (21%)

Query: 48  SKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKE---SKNSGETKRRVLMDGISAASLELV 104
           SK    P+ NL+V  + +++++ + + E+   E   ++N  E K +          L  +
Sbjct: 37  SKTRPRPITNLIVARSTLLQVFELCLVEDDQAENNHTRNHHELKNK-------NYKLFHL 89

Query: 105 CHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
           C +RLHG V  L  L+          D ++++F+DAK+++LE+ +S   L   S+H FE 
Sbjct: 90  CEHRLHGRVTGLQRLTTLDTQEDGL-DRLLVSFQDAKMTLLEWSNSAADLVPISLHTFEK 148

Query: 165 -PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
            P+       R+   +   ++VDP  RC  +L+    + +L   Q    L    D+ G  
Sbjct: 149 LPQITQGDLPRDFQGQ---LEVDPLSRCAVLLLPQATLAVLPFFQDQLDL----DSLGLS 201

Query: 224 GGFSARI------------ESSHVINLRD----------LDMKH--VKDFI---FVHGYI 256
           GG  + +             SS +++             LD +H  +K  I   F+ G+ 
Sbjct: 202 GGLKSALGSEQQRFQTFPYASSFILDFNQQLLNHLPPSSLDSQHRPIKSVIALKFLPGFS 261

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           EP + +L++ + TW+ R+    +T  +  L++       P+I    NLP+DA+ L+A P 
Sbjct: 262 EPTLAVLYQSQYTWSARLENHANTAALIVLTLDLGSNHFPIISHTTNLPYDAHGLVACPK 321

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS------------------ 358
            + GVLV+ A+ I +  QS+       N  V   S  ++PR                   
Sbjct: 322 ELAGVLVLCADMILHVDQSSKIIGLATNGWVKHTSELQIPRQDTVRLITPTNKISGHRST 381

Query: 359 ---------------------------SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                                         V L+ A   + + D A +  +TG++  L  
Sbjct: 382 TNKSDERPEDLEDGEEQDESGVPEGHEKLLVRLENAKIVFSRADRAFVFLRTGEVFSLQF 441

Query: 392 VYDGRVVQRLDLSKTN-PSVLTSDITTIGNSLFFLGSRLGDSLL-----------VQFTC 439
           + DGR + +L L K +  S++ S +  + N   F+GS  GDS L                
Sbjct: 442 LRDGRTLTKLVLEKLDLLSIIPSTVLKVNNECLFVGSMAGDSALYILDHLRPRSSSDDDN 501

Query: 440 GSGTSMLSSGLKE-----------EFG-DIEADAPSTKRLRRSSSDALQD--MVNGEELS 485
             G  + SS + +           +F  DI  D   T  +RR+    L D    NG +  
Sbjct: 502 DDGHQLPSSSIIQPDKAAKNQSSLDFDEDIYGDRTETDPVRRTDHSQLYDDRPSNGADDG 561

Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
             G+ ++  E   +     + D +   GP++DF+         +ATG+        +EL 
Sbjct: 562 RPGAGAHLAEPFLR-----LGDVIQAHGPIRDFTM--------AATGVENMP----LELL 604

Query: 546 GCKGI-----WTVYH-----KSSRGHNADSSRMAAYDDEYHAYLII-----SLEARTMVL 590
            C G       TV+H     +  R  + +S   +  +  +   L++     S E + + +
Sbjct: 605 ACTGTGDLGGLTVFHREIPLRKRRKLSFESPSASHINALFFTSLVVESGGLSEERKVVWM 664

Query: 591 ETADLLTEVT---ES-----VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
             +   TE+    ES     ++ F + +T+A    FG++ V+QV     ++   S     
Sbjct: 665 GRSGPRTEIATYGESGELSLINTFPE-KTLAVSPFFGKQFVVQVTNTAIKLFTSSL---- 719

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS 693
                  ++         +L  SI D YV+L    G   +  GD  + T+S
Sbjct: 720 -----EEAQVIQPEPAVKILRASIVDDYVMLETHCGLKLIYQGDHDSKTLS 765


>gi|392572878|gb|EIW66021.1| hypothetical protein TREMEDRAFT_70300 [Tremella mesenterica DSM
           1558]
          Length = 1408

 Score =  123 bits (309), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 150/656 (22%), Positives = 279/656 (42%), Gaps = 94/656 (14%)

Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
           L L+C + LHG +  LA L   +   D     D ++++F+DAK+++LE+  S   +   S
Sbjct: 117 LHLLCQHTLHGWITGLAPLRTIESSVDG---LDRLLVSFKDAKMALLEW--SRGDIATVS 171

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
           +H +E  +   +  G   F   PL++ DP  R   + +    + IL   Q  S L   E+
Sbjct: 172 LHTYERCQ--QMVTGDLQFYT-PLLRSDPLSRLAVLTLPEDSLAILPVLQEQSDLDPLEN 228

Query: 219 TFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
            F     +S     S V++L D+   +K+++D +F+ G+  P + +L+    TWAGR   
Sbjct: 229 -FTKDAPYSP----SFVLSLADVAPTIKNLQDLLFLPGFHSPTLAVLYSPYHTWAGRYHS 283

Query: 277 KHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGV-LVVGANTIHYHSQ 334
           +  T  +   +   T    +PL+ S   LP D+  ++A P+ +GGV LV     +H    
Sbjct: 284 QRDTFCLEVRTFDITAGGSYPLLTSVSGLPSDSLYIVACPAELGGVVLVTTTGLLHIDQS 343

Query: 335 SASCALALNNY-----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
             + A ++N +      +  D S E    S  + L+ + + ++     LL  + GD+  +
Sbjct: 344 GRTVATSVNAWWSHITTLPCDKSSE----SRKISLEGSKSVFVTERDMLLVLQNGDVHQV 399

Query: 390 TVVYDGRVVQRLDLSKTNPSV-LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
               +GR +  + + + + +V   S + T GN   F+G   GDSLL         +    
Sbjct: 400 RFEMNGRAIGAIKVDEQSSNVPAPSSMVTTGNQAIFVGCAEGDSLLANVDIKRAVA---- 455

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE----SAQKTFSFA 504
            +++    IEA+A           D  +D+    ++ L   A+N  +    +       +
Sbjct: 456 -IEDRKPAIEAEA---------EVDWDEDLYGDIDVPLTNGATNGAKYQAITGPANIVLS 505

Query: 505 VRDSLVNIGPLKDFSYGLRINADASAT--------GISKQSNY-------------ELVE 543
             D L  +G + D  +G+    + + T        G SK+S +                E
Sbjct: 506 PADVLTGVGKIVDMEFGIASTDEGTRTYPQLVTIGGGSKRSTFNAFRRGIPISKRRRFNE 565

Query: 544 LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA---RTMVLETADLLTEVT 600
           L   + +W +  +   G +     + +  ++    ++ S EA   R   L       ++ 
Sbjct: 566 LFNTESVWFLPIQRPSGQH-----LKSIPEDRRTTMLFSSEATQTRIFSLSAKPNPEQIG 620

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
                 + G+++  G  F R  V+ V +    +LD    TQ             G+E   
Sbjct: 621 R-----ISGKSLTVGPFFQRSNVLVVTQTEVLLLDSDGKTQ----------QSIGNEGEE 665

Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHD 715
           ++S SI+DPYV++   +GS  + VGD     +S V+ P+  +S + P  +  ++ D
Sbjct: 666 IVSASISDPYVVIRRVNGSGSMFVGDTVARQLSEVKIPS--DSLQPPYQAIEVFSD 719


>gi|320591495|gb|EFX03934.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
            clavigera kw1407]
          Length = 1461

 Score =  123 bits (308), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 218/980 (22%), Positives = 371/980 (37%), Gaps = 169/980 (17%)

Query: 97   SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
            S + L LV  + L G V  LA +   G  +    +++++A +DA++S+LE+D   + L  
Sbjct: 100  SISKLVLVAEFPLAGTVTGLARIKIPGTKSGG--EAVLVALKDARLSLLEWDPDQNDLTT 157

Query: 157  TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD 216
             S+H +E  E                +  DP  RC  +      + IL   Q     V  
Sbjct: 158  ISIHYYEQEELQGAPWAAPLSDYANFLVADPGSRCAALKFGARNLAILPFRQADEEDVDM 217

Query: 217  ED-------------------TFGSGGGFS-ARIESSHVINLRDLD--MKHVKDFIFVHG 254
            +D                     G G G        S V+ L +LD  + H     F+H 
Sbjct: 218  DDWDEELDGPRPAKDPSSAAVVSGPGDGIEDTPFAPSFVLRLSNLDTTLLHPVHLAFLHE 277

Query: 255  YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
            Y EP   IL     T A  V  +         ++    K    I S  NLP D ++++ +
Sbjct: 278  YREPTFGILSSSVSTSA--VIGRRDKLSYLVFTLDLQQKASTTILSVANLPQDLFRVVPI 335

Query: 315  PSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ 373
            PSPIGG ++VGAN  IH      +  +A+N +     S     +S  ++ L+      L 
Sbjct: 336  PSPIGGAILVGANELIHIDQSGRANGVAVNPFTKQSTSFGLADQSDLALRLEGCTVDVLS 395

Query: 374  NDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLS----KTNPSVLTSDITT---IGNSLFF 424
             +    L+    G L +LT+  DGR V  L +     +    V+ S IT    IG  + F
Sbjct: 396  AEAGELLIVLHDGQLAVLTIRVDGRTVSGLSVKMVRREAGGDVIQSGITCLSRIGRQMLF 455

Query: 425  LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG-DIEADAPSTKRLRRSSSDALQDMVNGEE 483
             GS   DS+++ ++   G +          G D+ AD       R    +   D  + + 
Sbjct: 456  AGSDQADSVVLGWSRKQGQTARRKPRANRAGLDLGADEEYFDDEREEGEELDDDEDDDDL 515

Query: 484  LSLYGSAS------NNTESAQKTFSFAVRDSLVNIGPLKDFSYG-------LRINADAS- 529
                 SA+      N T       SF + D L++I P++D   G       L   +D + 
Sbjct: 516  YGDGPSAAQTLGIDNTTGRGGDDLSFRIHDRLLSIAPIRDMVIGKPALVGELAKRSDQAT 575

Query: 530  ---------ATG---------ISKQSNYELV---ELPGCKGIWTV-----YHKSSRGHNA 563
                     A G         +S++ N + +   E    + +WTV       ++ +G   
Sbjct: 576  IHSELNLVCAVGSGRAGALALLSREINPDPLGAFEFAEAQALWTVSSSKPIPRTIQGEKG 635

Query: 564  DSSRMAAYDDE--YHAYLIISLEARTMVLETADLLTEVTESVDYF-------VQGRTIAA 614
             ++    Y+    +  Y+I++ E      ET+D+        +           G T+ A
Sbjct: 636  GATVGEDYESPAMHDKYMIVAKEDDDG-FETSDVYAVTASGFETLKGTEFEPAAGFTVQA 694

Query: 615  GNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
            G +   RR+IQV +   R  DG   ++Q L       +  +G+E   VL  SIADPY+LL
Sbjct: 695  GTMGRNRRIIQVLKSEVRCYDGDLGLSQILPM----VDEDTGAE-PRVLFASIADPYLLL 749

Query: 674  GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST 733
               D S+ +   +       ++      +S K V+ C LYHD        KTS  A+L +
Sbjct: 750  IRDDASVLVAEMNKDFELEELERDDGSLASTKWVAGC-LYHDTA--SVFSKTSILAFLLS 806

Query: 734  GVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS--GRTHIVDTY 791
                                      SG   I+ +P+      V + ++   R  + D  
Sbjct: 807  A-------------------------SGTFYIYALPDLKQPVYVAEGLNYVPRLFLPDHT 841

Query: 792  MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTIL 851
            +R  +                   KE +  + V +L      A    P+L     +  + 
Sbjct: 842  VRRGMA------------------KEPLTEILVADLG----DAVSKAPYLIVRHANDDLT 879

Query: 852  CYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR--NLRFSRTPLDAYTREETPHGA 909
             YQ               P+ T  SL   + S   L+  N  F+++P+ + + ++     
Sbjct: 880  IYQ---------------PLRTPSSLGSLSESLRFLKVPNPVFAKSPV-SISSDDASSQL 923

Query: 910  PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ---LCDGSIVAFTVLHNVNCN 966
                + + +NI G+   FL GS   + +   +  +  P+   L   ++ + +  H  +  
Sbjct: 924  RAMPLRVCENIGGYSTVFLPGSSASFVL---KSAKSQPRVVSLQGTAVRSLSPFHTESSE 980

Query: 967  HGFIYVTSQGILKICQLPSG 986
              FIYV  +G  ++C +P+G
Sbjct: 981  RSFIYVDVEGSGRVCSMPAG 1000


>gi|340515387|gb|EGR45642.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1441

 Score =  122 bits (307), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 222/999 (22%), Positives = 379/999 (37%), Gaps = 156/999 (15%)

Query: 72  RVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
           +V ++   ES   G     V  +  +   L L+    L G V  LA L    +  +   +
Sbjct: 68  QVNDDDGLESSFLGGETMLVRTERTNNTKLVLITEIPLAGTVIGLARLRT--SRTASGGE 125

Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQ 188
            +++A++ AK+ + E+D   + L   S+H +E  E L      E F  G  V   + DP 
Sbjct: 126 VLLIAYKAAKLCMAEWDPRKNELETISIHYYEK-EELQGAPWEEVF--GEYVNHLEADPG 182

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVG---DEDTFG---------SGGGFSARIESSHV- 235
            RC  +      + IL   +    L     DED  G         +  G S  +E+++  
Sbjct: 183 SRCAALKFGTRNLAILPFRRSEEDLEMEDWDEDLDGPRPVKEQAAAVNGDSDNVEAAYTP 242

Query: 236 -----INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
                + L D  + H     F+H Y EP   +L   +   A   +  H +  +  L +  
Sbjct: 243 SFVLRLPLLDPSLLHPVHLTFLHEYREPTFGVLSSSQAPAASLGARDHLSYKVFTLDLQQ 302

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
             +    I S   LPHD Y+++A+P+P+GG L+VG N  IH      S  +A+N  A   
Sbjct: 303 --RASTTILSVTGLPHDLYRVIALPAPVGGALLVGQNELIHVDQSGKSNGVAVNPMAKLA 360

Query: 350 DSSQELPRSSFSVELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDL 403
            S     +S   + L+  A     ++N   LL    G L +++   DGR V     RL  
Sbjct: 361 TSFSLTDQSDLKLRLENCAIEVLAIENGELLLILNDGRLGIISFKIDGRTVSGLSVRLVG 420

Query: 404 SKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
           +    +VL S  T +   G +  F+GS   DS+++ +   S         K    D +  
Sbjct: 421 ADCGGNVLKSRATCVSRLGKNTLFVGSETSDSVVLGW---SRRQTQEKRKKSRLIDPDLA 477

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
               +       +      +    +      N   +     +F + D L++I P++D + 
Sbjct: 478 LEVDELDLEDDEEDDDLYGDDSVATKPQQLPNGGPAKSGDLTFRIHDVLLSIAPIQDVTC 537

Query: 521 GLR--------------INAD---ASATGISKQSNYELV------------ELPGCKGIW 551
           G                + AD   A A G  +  +  ++            E P  +G W
Sbjct: 538 GQAAFPPDSEEATLNRGVRADLQLACAVGRGEAGSLAIINREIQPRVIGRFEFPEARGFW 597

Query: 552 TVYHKSS--RGHNADSSRMAAYDD--EYHAYLIIS------LEARTMVLETADLLTEVTE 601
           T+  K    +   A++     YD   ++  ++I++       E   +   TA     + E
Sbjct: 598 TMCVKKPVPKSLGANAGVAGDYDTPIQHDKFMIVAKVDLDGYETSDVYALTAAGFETLKE 657

Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
           +      G T+ AG +  +  VIQV +   R  +G   + Q L       +  +G+E   
Sbjct: 658 TEFEPAAGFTVEAGTMGKQMVVIQVLKSEVRCYNGDLNLIQILPM----LDEETGAEPRA 713

Query: 661 VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP 720
           V S SI DPY+ +   DGS+ L   D +     ++   +  +S K V+ C     KG   
Sbjct: 714 V-SASIVDPYLFIVRDDGSVFLAQIDSNNEIEEMEKTDSSLTSTKWVAGCLYKDTKG--- 769

Query: 721 WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDK 779
             + + +D+   T   EA+                +   +GAL IF +P+ +  V+  + 
Sbjct: 770 IFQSSYSDSTKQTS--EAV-------------MMFLLNSTGALHIFALPDLSKAVYVAEG 814

Query: 780 FVSGRTHIVDTYM--REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS 837
             S   H+   Y   R A +++ TEI                    V +L      A H+
Sbjct: 815 LSSIPPHLSAGYAARRGATRETLTEI-------------------VVADLG----DAVHA 851

Query: 838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTP- 896
            P+L    +   +  Y+      P N         T+ +LS           L F ++P 
Sbjct: 852 SPYLILRHSTNDLTIYEPIRL--PAN--------ETAHTLS---------DTLFFKKSPN 892

Query: 897 -LDAYTREETPHGAPCQR-----ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
            + A +  E P     Q      + I  N+ G+   FL G  P + +     +     L 
Sbjct: 893 AVLAKSAVEDPSDDTAQPPRYVPLRICANVGGYSSVFLPGPSPAFVIKSSRSVPRVVGLQ 952

Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
              +   +  H   C+ GFIY  S+GI ++ QLPS + +
Sbjct: 953 GHGVRGMSTFHTEGCDRGFIYADSEGIARVTQLPSKTNF 991


>gi|402219312|gb|EJT99386.1| hypothetical protein DACRYDRAFT_17537 [Dacryopinax sp. DJM-731 SS1]
          Length = 1620

 Score =  122 bits (306), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 158/662 (23%), Positives = 265/662 (40%), Gaps = 102/662 (15%)

Query: 101 LELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITS 158
           L LV  +R+HG V  L  +     G D     D ++++F+DAKI++LE+ D+I+ L   S
Sbjct: 136 LHLVREHRMHGFVTGLEKVRTLASGEDGM---DRLLVSFKDAKIALLEWSDAIYDLSTVS 192

Query: 159 MHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD-- 216
           +H +E    +      E     PL++ DP+ RC  +L+    + IL   Q     + D  
Sbjct: 193 LHTYERSSQVSTSEASE---HRPLLRADPESRCAALLLPKDALAILPFVQRTGLDLADPA 249

Query: 217 EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
            D       ++     S+V  L D D  ++HV DF F+  +  P + IL++    W GR+
Sbjct: 250 RDKEREHQPYTP----SYVFPLSDADDTLRHVLDFCFLPSFHTPTLAILYQPAQNWTGRL 305

Query: 275 SWKHHTCMISALSISTTLK----------QHPLIWSAMNLPHDAYKLLAVP--SPIGGVL 322
           S       ++ +++    K             +I     LP+DA+ LL     S  GGV+
Sbjct: 306 SQTKDNTSLAIVTLDLVGKGAAAGGGAGGGGAVISRTHGLPYDAFSLLPAREGSTFGGVV 365

Query: 323 VVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--------LDAAHATWLQ 373
           V+  N++ H         LA + +     S+   P  +F+ E        L+ +   W  
Sbjct: 366 VLAGNSVLHVDPAGRIVGLAASGWHAQ-SSALRFPLWAFTAEEGETEERKLEGSRLCWAG 424

Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLD----LSKTN-PSVLT-------SDITTIGNS 421
               +L    G    L V  +GR V  L     L +T+ P+VL          +   G  
Sbjct: 425 EQQLILVGAQGWARELKVGVEGRNVSSLSAGRRLGRTSAPAVLCPVGEQSGRALKPTGRD 484

Query: 422 LFFLGSRLGDSLLVQFTCGSG--TSMLSSGLKEEF--GDIEADAPSTKRLRRSSSDALQD 477
           L +L S  G S+L+Q   G      +  +G ++E    D+E DA S K      +D L D
Sbjct: 485 LVWLASEAGQSVLLQVHKGEPRVEEVKPNGEEKEIEGEDMEIDADSDK------NDDLAD 538

Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS-------- 529
           +     L    ++      A    +  V D+L   G + D S+ L   +           
Sbjct: 539 IYGDSGLPAAAASGVTAGPALPWLTLEVLDALQGHGQIADMSFALSFRSGPDRPTPKLVC 598

Query: 530 -------------ATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAY----D 572
                          G+  +    +  + G +GIW++  +          R        D
Sbjct: 599 STPEGERGAWTVYENGLPIRVKRRVPAVAGTRGIWSLRVRRGDRARRGGRRERGEREWAD 658

Query: 573 DEYHAYLIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
            E    LI+S +A       RT+ +++   L  ++      +   T+AAG  F    V+Q
Sbjct: 659 GEERDNLIVSTDATPSPGISRTITVDSRGELQIISR-----LPALTLAAGVFFSHTCVMQ 713

Query: 626 VFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
           V      +LDG    ++L     N       E S ++   + DP+V++   +GS+ L +G
Sbjct: 714 VTPDSLHLLDGD--GKELQVLKDNE---GNKEASPIIKACVEDPWVVVTRENGSVALYLG 768

Query: 686 DP 687
           DP
Sbjct: 769 DP 770


>gi|392585051|gb|EIW74392.1| hypothetical protein CONPUDRAFT_133073 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 1490

 Score =  122 bits (306), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 196/913 (21%), Positives = 349/913 (38%), Gaps = 130/913 (14%)

Query: 57  NLVVTAANVIEIYVVR-------VQEEGSKESKN---------SGETKRRVLMDG----- 95
           NLV   +N+I IY VR        Q E  KE K+          GE +     DG     
Sbjct: 40  NLVTARSNIIRIYEVREDAASLSSQVEAEKERKSHVRKGTEAVEGEVEMDTGGDGWVNMG 99

Query: 96  ---------ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLE 146
                     +      V  + +HG V  +  + +  + N  R D ++++F+DAKI++LE
Sbjct: 100 SVKSTSSGPPTVTRFHFVREHVVHGIVTGMDCI-RTISSNEDRMDRLLVSFKDAKIALLE 158

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           + D+ H L   S+H +E  E   L        R  L +VDP  RC  + +    + IL  
Sbjct: 159 WSDAAHDLITVSIHTYERSE--QLMSIDAPLFRSSL-RVDPLSRCAALSLPNNALAILPF 215

Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVIL 263
            Q  +     E    + G        S +++L    D  + +V DF F+ G+  P + +L
Sbjct: 216 YQTQAEFDVIEGEGETEGMRDVPYSPSFILDLPVDVDSSLCNVIDFAFLPGFNNPTLAVL 275

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP------ 317
            + E TWAGR+     T ++   ++       P++ +   LP DA+ L     P      
Sbjct: 276 CQSEQTWAGRLKEHRDTTLVVTFTLDLLSCTFPILSTLRGLPSDAFSLSPATLPPDFTSG 335

Query: 318 -------IGGVLVVGANTIHYHSQSASCA-------------LALNNYAVSLDSSQELPR 357
                    GV+V+  + + Y  Q A C              L+++N  ++  ++++   
Sbjct: 336 LSGGASNAHGVVVLTPDAVLYADQ-ARCVGAAVSGWATRTSDLSISNAYLTGGTAKDAEG 394

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDIT 416
               + L+ A    L     LL  ++G++ ++ +V +GR V R+D+      +V+ + + 
Sbjct: 395 DVKPLALEGAFPLLLTPTALLLVLRSGEMHVVRLVTEGRSVGRVDVGPCVGQTVMPATVV 454

Query: 417 TIGNSLFFLGSRLGDS--------LLVQFTCGSGTSMLSSGLKEEFGDIEADA--PSTKR 466
            +      LG   G+         + V    G  T +LS+   EE      +    S   
Sbjct: 455 RVKAPQRALGQGQGEGEKAKERRMVFVGSIVGPAT-LLSAERVEETAAANGNGVNGSGAN 513

Query: 467 LRRSSSDALQDM--VNGEELSLYGSASNNTE----SAQKTFSFAVRDSLVNIGPLKDFSY 520
               + DA  +M     ++  LYG  +  ++    SA++   FA  D++   GP+ D ++
Sbjct: 514 GHVENKDAGMEMDVDLDDDDDLYGPTTLTSQPSSGSAEEALRFAFCDAIPAHGPILDMAF 573

Query: 521 GLRINAD------ASATGISKQSNYELVE-------------LPGCKGIWTVYHKSS-RG 560
            L    D       ++TG      + L +             L G +GIW++  K S RG
Sbjct: 574 ALGKWGDRYVPELVASTGAEHLGGFTLFQRDLPIRTKRKLHVLGGARGIWSISVKQSPRG 633

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTM--VLETADLLTEVTESVDYFVQGRTIAAGNLF 618
             A S+      +  +  ++IS +A     V   A   T    ++   + G T+ AG  F
Sbjct: 634 SAASSAGAGPNPELANDTVVISTDANPSPGVSRIATRSTRTDLAIPTRIPGTTVGAGPFF 693

Query: 619 GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           GR  ++ V     R+L+      D +   S  ++      + +   SI DP VL+   D 
Sbjct: 694 GRTAILHVMTNSIRVLE-----PDGTERQSIKDTDGNMPRAKIRWCSICDPVVLIIREDD 748

Query: 679 SIRLLVGDPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKG-----PEPWLRKTSTDAWLS 732
           ++ L +G+P    +  +  + + E S + ++ C      G      +P     S+     
Sbjct: 749 TLGLFIGEPERGRIRRKDMSPMGEKSSRYIAGCFFADTSGLFEAFMDPKAAAASSKGDKD 808

Query: 733 TGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYM 792
            G  + +        +    + V+    G LEI+ +P    VF+     +      D+Y 
Sbjct: 809 KGATQTMQSVVNAATNSQ--WLVLVRPQGVLEIWTLPKLTLVFSTTLIATLDNVCADSYD 866

Query: 793 REALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILC 852
             AL                Q        + V  + M +    +  P L   L  G +  
Sbjct: 867 PAALS-------------LPQDPPRKPQELDVENIVMAQLGESNPTPHLMVFLRSGQVAI 913

Query: 853 YQAYLFEGPENTS 865
           Y+      P + S
Sbjct: 914 YETVHHPPPPDPS 926


>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Talaromyces marneffei ATCC 18224]
 gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Talaromyces marneffei ATCC 18224]
          Length = 1383

 Score =  122 bits (306), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 181/751 (24%), Positives = 305/751 (40%), Gaps = 137/751 (18%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++IY +  +   +   +N  +    V  +   A  L L   Y L+G V  +
Sbjct: 28  NLIVVKTSLLQIYTLVAETSTTLILENDQQADDDVKNE---ATKLHLHAEYDLYGTVTDI 84

Query: 117 A----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKR 172
           +    + S+ G D      +++L+F +AK+S++E++    G+   S+H +E         
Sbjct: 85  SPVKILKSRSGGD------ALLLSFRNAKLSLIEWNPETQGISTMSIHYYE--------- 129

Query: 173 GRESFARGPLVK----------VDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE---- 217
            +E     P V           VDP  RC  +L +G++ I IL   Q G  LV DE    
Sbjct: 130 -KEDITLSPWVPDLSQCDSHLTVDPSSRCA-LLNFGVRNIAILPFHQAGDDLVMDEYDPD 187

Query: 218 ------------------DTFGSGGGF--SARIESSHVINLRDLD--MKHVKDFIFVHGY 255
                             D+  + G         +S V+ L  LD  + H     F+H Y
Sbjct: 188 LDMDDLTDQEENKKPSHTDSKKAEGDLIHQTPYAASFVLPLTALDPTLIHPIGLTFLHEY 247

Query: 256 IEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVP 315
            EP   IL+    T A  +  +    + S  ++    +    + S   LP D   ++A+P
Sbjct: 248 REPTFGILYSPIATSAALLEERKDVVVYSVFTLDLEQRASTPLLSIAKLPSDLLHIMALP 307

Query: 316 SPIGGVLVVGAN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQN 374
           +P+GG L++G+N  IH      + A+A+N +A  + +   + +S   + L+ +    + N
Sbjct: 308 APVGGTLLIGSNEMIHIDQSGKASAVAVNEFAKQVSAFPMVDQSDLELRLEGSVVEVINN 367

Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT--------IGNSLFF 424
           +    LL+  TG+LVL+    DGR V    +    P+V   D+ +        +G+   F
Sbjct: 368 ESGDILLTLSTGELVLVHFKIDGRSVSGFVVFPI-PAVSGGDVVSAVASCAVALGSGKVF 426

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS--SDALQDMVNGE 482
           +GS   +S+L+     S  S  S     +  D E +      +      S A ++ VN  
Sbjct: 427 IGSEDAESVLLDCYLPSAVSKKSRDYDRDHFDEEMNNEEDDDMYEDDLYSSAPKEAVN-- 484

Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
           +    G  S+N       ++F V D L+++GPL+  + G   + D++A     Q + + +
Sbjct: 485 KTVSNGRISDN-------YTFKVIDRLLSLGPLRAVAVGKPASRDSNAE--DAQQSVDDL 535

Query: 543 ELPGCKG-----------------------------IWTVYHKSSR-GHNADSSRMAAYD 572
           EL    G                             +W +   +++ GHN DS       
Sbjct: 536 ELAAAYGSGRGGGVALLQRTLHLDDVFTLGAESADSVWNITTSNTKSGHN-DSG------ 588

Query: 573 DEYHAYLIISL-----EARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFGRRRVIQV 626
           +E  +Y+I++         T+V    +   E   + D    G  TI    L G  RV+QV
Sbjct: 589 EENQSYVILTKANSPENEETLVYAVNERNLEPFNAPDVNPNGDPTIDIDVLAGNSRVVQV 648

Query: 627 FERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
                RI D +  M Q     P   E   G E   V S S AD Y+L+   D S+ LL  
Sbjct: 649 LTGEVRIYDTNLGMAQ---IYPVWDED-EGDERFAV-SASFADHYLLIIRDDSSVLLLHS 703

Query: 686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
           D S     +  P  +  S +P     LY D+
Sbjct: 704 DESGDLDELTKPETV--SSQPWLCGCLYTDR 732


>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
 gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
          Length = 1212

 Score =  122 bits (305), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 150/591 (25%), Positives = 262/591 (44%), Gaps = 75/591 (12%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           +RL G V  +  + Q  A     RD ++++F DAK++++E+DD    L   S+H FE   
Sbjct: 22  HRLFGQVTGIQSV-QTLASQVDGRDRLLVSFRDAKLALMEWDDVYGDLNSISIHTFERAP 80

Query: 167 WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGF 226
            L +     SF   P + VDP  RC  +L+    + IL   Q  S L G +D   +    
Sbjct: 81  QL-VDGLPPSFV--PRLLVDPASRCAALLLPQDALAILPFVQEASEL-GADDPRDAALLD 136

Query: 227 SARIESSHVINLR---DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMI 283
            A    S +++     D  +++V+D +F+ G+ +P++ +L+E ELTW G +S    T  +
Sbjct: 137 QAPYAPSFILSFSEDVDASIRNVRDCVFLPGFQKPMLAVLYEPELTWTGSLSRARLTTRV 196

Query: 284 SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALAL 342
             +++  T+ ++P+  ++  LP+D   L+A P  +GGVLVV  + + +  Q+A    L++
Sbjct: 197 CFITLDLTVTKYPVTVTSEALPYDTLYLVACPDSLGGVLVVTPSALLHLDQTARLVGLSV 256

Query: 343 NNYAVSLDSSQELPRSSFSV---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ 399
           + +     S   LP ++ ++   +L ++  T+ + +  LL  + G ++      +GR V 
Sbjct: 257 SRWTDFTSSELMLPNATATLGDCDLQSSVLTFTEANGGLLVLRDGRMLTFQCALEGRTVT 316

Query: 400 RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEA 459
            L L+     VL  +    G + F     L + L++  +    T + +  L E   +I A
Sbjct: 317 SLSLN----VVLVPERQ--GGASFV--QALPERLILCASFQDDTYLYAMNLLEAPTEIAA 368

Query: 460 D-APSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
              P  + L   +      +   G+      + S   + A       V D L  +GPL D
Sbjct: 369 STGPDQQSLEPDADVDADALDLYGDSFKPDVATSKQAQPA----GLDVLDVLPTLGPLND 424

Query: 518 FSYGLRINADASA---TGISKQSNYELVE----------LPGCKGIWTVYHKSSRGHNAD 564
            +YG+  NA   A      + Q +  ++E          +     IWTV        N  
Sbjct: 425 MTYGVVRNAHGKAHPHMVATMQHHLAVIEPRLRCDVVQNIAPAHAIWTV------SINGK 478

Query: 565 SSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
              + A+D+E    L+ SLE+ +            T  +   +Q RTIA G+   +  VI
Sbjct: 479 WLLLTAWDEE---CLVYSLESNS------------THFLSQHLQ-RTIACGS--TQAGVI 520

Query: 625 QVFERGARILD--GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
           +V  + A +LD  G  MT   +F   ++ +  G         SI D YV L
Sbjct: 521 RVTSKRAEVLDEHGRIMT---TFAECDANASYG-------DASIQDSYVAL 561


>gi|449299306|gb|EMC95320.1| hypothetical protein BAUCODRAFT_25380 [Baudoinia compniacensis UAMH
            10762]
          Length = 1437

 Score =  120 bits (301), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 219/1037 (21%), Positives = 381/1037 (36%), Gaps = 207/1037 (19%)

Query: 52   IGP-VPNLVVTAANVIEIY-VVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRL 109
            IGP   NLVV   ++++++ V R+ +       +  + + R          L L+  Y L
Sbjct: 22   IGPQADNLVVAKTSLLQVFEVKRISQAKDNGHHDHADAQSR----------LSLIGEYTL 71

Query: 110  HGNVESLAILS-----QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFES 164
             G V +L+ ++      GGA       +++ AF+DAK+S++E+D   + +   S+H +E 
Sbjct: 72   SGTVTALSPITLPSSRTGGA-------ALVCAFKDAKLSLIEWDPEHYRISTISIHYYEG 124

Query: 165  PEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS------------- 211
               L    G        ++ VDP  RC  +     Q+ IL   Q G              
Sbjct: 125  DNVLLPPFGAALSECESILTVDPGSRCAALKFGERQLAILPFRQQGDELADEAAEDADMA 184

Query: 212  --------GLVGDEDTFGSGGGFS----ARIESSHVINLRDLD--MKHVKDFIFVHGYIE 257
                    G V  + T  +    S       +SS V+ L  LD  + H     F+H Y E
Sbjct: 185  EAESEEQPGNVTLKRTSTTQALDSKDDITPYKSSFVLPLITLDPSLTHPVHLAFLHEYRE 244

Query: 258  PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
            P   IL   +      +  +      +  ++    +    + S   LP D +K++A+P P
Sbjct: 245  PTFGILSAPQQPSLALLDERKDCLSYTVFTLDLEQRASTNLMSVSKLPSDLWKVIALPPP 304

Query: 318  IGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV 376
            +GG L+VG N  IH      + A+A+N +A    +      S  +++L+      L +  
Sbjct: 305  VGGALLVGTNELIHIDQSGKTTAVAVNEFAKVASNFSMADHSDLNMKLEGCEIEMLDSST 364

Query: 377  --ALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSD----ITTIGNSLFFLGS 427
              AL+    G    L+    GR V  L +S+   TN   + +     + ++     F+GS
Sbjct: 365  GNALIVLNDGSFATLSFKMLGRTVGGLTVSRVADTNGGNVNASAPSCVASMQQQKLFVGS 424

Query: 428  RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD---------------APSTKRLRRSSS 472
              G S LV++   + T        +  G                    APS   ++R++S
Sbjct: 425  EDGSSSLVRWAKDTPTLSRKRSHAQMLGQDAPMDDADDAEELDEDDLYAPSAVAVKRAAS 484

Query: 473  DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RINAD--- 527
                             A+     A  T++F + DSL ++ P+ +   G   R   +   
Sbjct: 485  ----------------VANAAAVDASTTYTFELEDSLNSLAPMNNVCLGRSPRTGKEKLE 528

Query: 528  -ASATGISKQS-----NYELV-------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
              +  G  K S     N E++       ++ G K IW+V  +S  G    S+      D 
Sbjct: 529  LVAGIGRGKASSLAFMNREIIPNEIRSRDVAGAKDIWSVCARSREGDKVSSA------DT 582

Query: 575  YHAYLIISLEARTMVLETAD----LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
            Y   L +     T   + AD     + E+ E+ D+   G T+  G L     ++Q     
Sbjct: 583  YDNLLFVFDGESTKTYKYADSAEGSIIELDET-DFEGDGETVCVGTLANGSCIVQCRRTE 641

Query: 631  ARILDGSYMTQDLSFGPSNSESGSGSENST---VLSVSIADPYVLLGMSDGSIRLLVGDP 687
             R       T D   G S     S  E      +++ S  DPY+L+   D S+++L  D 
Sbjct: 642  IR-------TYDHQLGLSQIIPMSDDETDAELKIVATSFCDPYLLVIQDDSSVQILQVDK 694

Query: 688  STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL 747
                         +   +P+ +         E  LR+     WL+  +         G L
Sbjct: 695  -------------QGDVEPLDAA--------ESDLREGK---WLTGSL-------YAGEL 723

Query: 748  DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS 807
              G   + +  + G L++F +P    V++              ++   L         S+
Sbjct: 724  SDGQSAAFLLGQEGGLQVFSLPETKLVYSAPTL---------PFLPPVL---------SA 765

Query: 808  EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
            +    +G K  +  + VV+L  +      +RP+L        ++ Y+ + +         
Sbjct: 766  DAPQRRGGKVTLTEVLVVDLGAEGV----TRPYLIVRTAMDDLILYEPFHY--------- 812

Query: 868  DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTRE----ETPHGAPCQRITIFKNISGH 923
                    S +  +  A+   +LRF + P     +     +T  G P Q       I G 
Sbjct: 813  --------SATTLDARATGFTDLRFRKVPFTYLPKYDEGLDTADGRPAQLQPAV--IGGR 862

Query: 924  QGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
               +L G  P + +     L     L    + +F+ LH   C  GF  V   G LK  QL
Sbjct: 863  NALYLPGGTPSFLVKEATSLPKVLGLRARGVRSFSPLHRAGCQQGFALVDGDGKLKEYQL 922

Query: 984  PSGSTYDNYWPVQKVVF 1000
            P   ++   W V+ +  
Sbjct: 923  PGHVSFATGWSVRTLTL 939


>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus clavatus NRRL 1]
 gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two
           protein 1
 gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus clavatus NRRL 1]
          Length = 1401

 Score =  119 bits (297), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 167/723 (23%), Positives = 294/723 (40%), Gaps = 127/723 (17%)

Query: 57  NLVVTAANVIEIYV---VRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNV 113
           NLVV   +V++I+    V    EG   +  S         D + +  L L   Y L G V
Sbjct: 28  NLVVVKTSVLQIFSLLNVSCSAEGEIIAAKSARP------DQLQSTKLILEREYSLSGTV 81

Query: 114 ESLA----ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLH 169
             L     + ++ G D      +I+LAF +AK+S++E+D   +G+   S+H +E  +   
Sbjct: 82  SDLCRVKLLKTKSGGD------AILLAFRNAKLSLVEWDPERYGISTISIHYYERDDITR 135

Query: 170 LKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV-GD----------- 216
                +  + G ++ VDP  RC  V  +G++ + IL   Q G  LV GD           
Sbjct: 136 SPWVPDLSSCGSILSVDPSSRCA-VFNFGIRNLAILPFHQPGDDLVMGDYESDSQKQSHE 194

Query: 217 ---EDTFGS-----GGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHER 266
              +D+ G+     G        SS V+ L  LD  + H     F++ Y EP   IL+ +
Sbjct: 195 HEMDDSAGNSKSKEGAVHQTPYASSFVLPLTALDSAILHPVSLAFLYEYREPTFGILYSQ 254

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGA 326
             T    +  +      +  ++    +   ++ S   LP D +K++A+P P+GG L++G 
Sbjct: 255 IATSNSLLHERKDAIFYTVFTLDLEQRASTMLLSVTRLPSDLFKVVALPPPVGGALLIGY 314

Query: 327 NT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKT 383
           N  +H      + A+ +N ++  + +     +S  ++ L+      L N     LL+  +
Sbjct: 315 NELVHVDQAGKTNAVGVNEFSRQVSTFSMADQSELALRLEGCVVELLGNSSGDLLLALSS 374

Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLV 435
           G +VL+    DGR V  + + +  P     +I         ++G+   F GS   +S+L+
Sbjct: 375 GTMVLVHFKLDGRSVSGISI-RPLPGHAGGNILKAAASASASLGSDKVFFGSEDAESVLL 433

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN-- 493
            ++  S  +  S   + E   IE D         S  D  +D        LY +A +   
Sbjct: 434 GWSLSSSNARKS---RSESKRIEKDHEEGSDDSESEEDVYED-------DLYSAAPDTPA 483

Query: 494 -------TESAQKTFSFAVRDSLVNIGPLKDFSYG-------------------LRINAD 527
                    S   ++ F V D L N  PL+D + G                   L + A 
Sbjct: 484 LGHRLSVAPSTFASYKFKVHDVLPNTAPLRDIALGQPAMPVEDTGSHLDNICSELELVAA 543

Query: 528 ASATG-----ISKQSNYELVE----LPGCKGIWT---VYHKSSRGHNADSSRMAAYDDEY 575
             + G     + K+    +V+    +    G+WT       +++  + D + +    +E+
Sbjct: 544 YGSNGNGGLVVMKRELEPVVKASLNVGPIHGVWTASIALGSAAKPMSGDQTNI----EEW 599

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-------TIAAGNLFGRRRVIQVFE 628
             Y+I++ + +T+  E +++      ++  F           +I  G L  R+RV+QV  
Sbjct: 600 RQYVILT-KPQTIDKEESEVFIVDGLNLKPFKAPEFNPNNDISIQVGTLSNRKRVVQVLR 658

Query: 629 RGARILDGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG 685
              R  D      DL      P   E    S+    LS S+ADPY+ +   D ++ LL  
Sbjct: 659 NEVRSYDS-----DLELAQIYPVWDE--DTSDERMALSASLADPYIAILRDDSTLLLLQA 711

Query: 686 DPS 688
           D S
Sbjct: 712 DDS 714



 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 54/121 (44%), Gaps = 16/121 (13%)

Query: 889  NLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSG---------SRPCWCMVF 939
            N    R P D+ T       +  + + I  +ISG+   F+ G         SR C   + 
Sbjct: 861  NHVLPRIPPDSDTNISDKEPSNHRPLCILPDISGYSAVFMPGTSASFIFKTSRSC-PHIL 919

Query: 940  RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVV 999
            R R  V   L D     FT   + +   GFIYV S+ +++ICQLP  + YD  W ++KV 
Sbjct: 920  RLRGGVVRSLSD---FDFT---DPSLGRGFIYVDSKDVVRICQLPPETIYDYSWTLKKVA 973

Query: 1000 F 1000
             
Sbjct: 974  I 974


>gi|346319828|gb|EGX89429.1| protein CFT1 [Cordyceps militaris CM01]
          Length = 1452

 Score =  119 bits (297), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 225/989 (22%), Positives = 375/989 (37%), Gaps = 171/989 (17%)

Query: 91   VLMDGISAASLELVCHYRLHGNVESLAIL-----SQGGADNSRRRDSIILAFEDAKISVL 145
            +L D      L LV    + G +  LA L     S GG       ++++LA+  AK+ + 
Sbjct: 94   LLRDRSQHTKLVLVAELPVAGTIIGLARLKLPHTSSGG-------EALLLAYRGAKMCLT 146

Query: 146  EFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQMI 202
            E++     L   S+H +E  E   L+        G  V   + DP  RC         + 
Sbjct: 147  EWNPRRAALETVSIHFYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNLA 203

Query: 203  ILKASQGG--------------------SGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
            IL   Q                      + L  + D  G   G  ++   S V+ L  LD
Sbjct: 204  ILPFRQAEEDLEMEDWDEALDGPKPPKEASLATNGDANGDANGTQSQHSPSFVLRLPLLD 263

Query: 243  --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
              + H     F+H Y EP   IL   + T        H T  +  L +    +    I S
Sbjct: 264  PTLLHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHLTYKVFTLDLKQ--RASTTILS 321

Query: 301  AMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA-----VSLDSSQE 354
               LP D  +++ +P+P+GG L+VGAN  IH      +  +A+N  A      SL+   E
Sbjct: 322  VTGLPQDLSRVIPLPTPVGGALLVGANELIHIDQSGKANGVAVNPMARQMTSFSLNDQSE 381

Query: 355  LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL 411
            L   ++ +E  A     +++   LL      L +++   DGR V  + L   S+ N   L
Sbjct: 382  L---NYRLEGCAIEPVSMESGELLLILNDASLAIVSFKIDGRTVSGISLVPVSQENGGNL 438

Query: 412  ----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
                 S I+ IG S  F+GS  GDS+++ +     +   S   +++   ++A+       
Sbjct: 439  LKSHVSCISRIGKSSMFIGSEYGDSVVLGW-----SRKQSQEKRKKSRVLDAELALDVDD 493

Query: 468  RRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG----- 521
                     D + G E  +   S + N  +      F ++DSL+ + P+ D + G     
Sbjct: 494  IDLDDFDEDDDLYGTESTAAKPSLATNGVTKGGELIFRLQDSLLCLAPIHDVAPGKAVFP 553

Query: 522  -------LRINAD-----ASATGISKQS-----NYEL-------VELPGCKGIWTVYHK- 556
                   LR         A A G  K       N E+        E P  +G WT+  K 
Sbjct: 554  LDSEEVVLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWTMCVKK 613

Query: 557  ---SSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
                + G NA  S  + YD  E +   +I  +      ET+D+        +        
Sbjct: 614  PLPKALGSNAVVS--SEYDSMELYDRFMIVAKVDLDGYETSDVYALTDAGFESLKDTEFE 671

Query: 607  -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
               G T+ AG +  + R+IQV +   R  DG   ++Q L       +  +G+E   V+S 
Sbjct: 672  PAAGFTVMAGTMGKQMRIIQVLKSEVRCYDGDLGLSQILPM----MDEDTGAE-PRVVSA 726

Query: 665  SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
            SIADPY+++   D SI +               A I+S+ +      +  DKGP   ++ 
Sbjct: 727  SIADPYLMVIRDDNSIFI---------------AKIDSNDE---LDEVEKDKGPLASIK- 767

Query: 725  TSTDAWLSTGVGEAIDGA-DGGPLDQGDIYSVVCY---ESGALEIFDVPNFNCVFTVDKF 780
                 W +  +    DG       D+G    ++ +    +GAL I+D+ N +        
Sbjct: 768  -----WQTGCLYADHDGHFQPKQPDEGSSPRILMFLMSTTGALHIYDLDNLS-------- 814

Query: 781  VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                      Y+ E L  S     S++  G     KE +  + V +L           P+
Sbjct: 815  -------EPVYVAEGLT-STPPFLSANFTGRKAAAKETLTEILVADLG----DVVAKSPY 862

Query: 841  LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY 900
            L        +  Y+   +  P ++S        S +L     + S +     +    D  
Sbjct: 863  LILRHDTDDLTLYEPVRYHEPNSSS-----APLSDTLFFKKSTNSTIAKSAPASDKEDDE 917

Query: 901  TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVL 960
            T+++     P Q   +  N+ G+   FLSG  P + +   + +     L    +   +  
Sbjct: 918  TQQK--RFVPLQ---LCANVGGYSAVFLSGDSPSFILKSAKSIPRIVGLQGQGVQGMSTF 972

Query: 961  HNVNCNHGFIYVTSQGILKICQLPSGSTY 989
            H   C+ GFIY  ++GI ++ QLP+ + Y
Sbjct: 973  HTEGCDRGFIYADTKGIARVSQLPTDTNY 1001


>gi|400597740|gb|EJP65470.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1444

 Score =  118 bits (295), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 224/998 (22%), Positives = 375/998 (37%), Gaps = 182/998 (18%)

Query: 85  GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
           GET   +L D      L LV    + G V  LA L     ++    ++++LA+  AK+ +
Sbjct: 85  GET--LLLRDRAQNTKLVLVAEIPVAGTVIGLARLKLQNTESGG--EALLLAYRGAKMCL 140

Query: 145 LEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV---KVDPQGRCGGVLVYGLQM 201
            E++     L   S+H +E  E   L+        G  V   + DP  RC         +
Sbjct: 141 TEWNPQKAALDTVSIHYYEKDE---LQGAPWELPFGEYVNYLEADPASRCAAFKFGSRNL 197

Query: 202 IILKASQGGSGL-VGDEDTFGSG-----------GGFSARIESSH----VINLRDLD--M 243
            IL   Q    L + D D    G            G     ES H    V+ L  LD  +
Sbjct: 198 AILPFRQAEEDLEMEDWDEALDGPKPAKEAALATNGDDHETESQHSPSFVLRLPLLDPTL 257

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
            H     F+H Y EP   IL   + T        H T  +  L +    +    I S   
Sbjct: 258 LHPVHLAFLHQYREPTFGILSSAQSTSIALGFRDHMTYKVFTLDLKQ--RASTTILSVTG 315

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           LP D  +++ +P+P+GG L+VG N  IH      +  +A+N  A  + S     +S  + 
Sbjct: 316 LPQDLKRVIPLPTPVGGALLVGENELIHIDQSGKANGVAVNPMARQMTSFSLADQSELNY 375

Query: 363 ELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL---SKTNPSVL----TS 413
            L+  A     +++   LL      L +++   DGR V  + L   S+ N   L     S
Sbjct: 376 RLEGCAIEPISMESGELLLILNDASLAIISFKIDGRTVSGISLAAVSQENGGNLLKSRVS 435

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            I+ IG +  F+GS  GDS+++ +     +   S   +++   ++ D      L     D
Sbjct: 436 CISRIGKASMFIGSESGDSVVLGW-----SRKQSQEKRKKSRALDTD------LALDVED 484

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFS--------FAVRDSLVNIGPLKDFSYGLRIN 525
              D    E+  LYG+ S   + +Q            F ++D+L+ + P+ D + G  + 
Sbjct: 485 IDLDDDFDEDDDLYGTESAAAKPSQAGAGATKGGEPVFRLQDALLCLAPIHDVAPGKAVF 544

Query: 526 AD-----------------ASATGISKQS-----NYEL-------VELPGCKGIWTVYHK 556
                              A A G  K       N E+        E P  +G W +  K
Sbjct: 545 PSDSEEAFLRDGVTSELQLACAVGRGKAGAIAILNREIQPKVIGRFEFPEARGFWAMCVK 604

Query: 557 SSRGHNADSSRM--AAYD--DEYHAYLIISLEARTMVLETADLLTEVTESVDYF------ 606
                   SS +  + YD  ++Y  ++I++ +      ET+D+        +        
Sbjct: 605 KPVPKALGSSAVISSEYDSTEQYDRFMIVA-KVDLDGYETSDVYALTDAGFESLKDTEFE 663

Query: 607 -VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSV 664
              G T+ AG +  + R++QV +   R  DG   ++Q L       +  +G+E   V+S 
Sbjct: 664 PAAGFTVMAGTMGKQMRIVQVLKSEVRCYDGDLGLSQILPM----LDEDTGAE-PRVVSA 718

Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
           SIADPY+++   D S+ +           + +   +E  +K         DKGP      
Sbjct: 719 SIADPYLMIIRDDNSVFI---------AKIGSNDELEEVEK---------DKGP------ 754

Query: 725 TSTDAWLSTGVGEAIDGA--DGGPLDQGDIYSVVCYES--GALEIFDVPNFNCVFTVDKF 780
             +  W +  +    DG      P D     +++   S  GAL ++D+ N +        
Sbjct: 755 LVSTKWQTGCLYTDYDGTFQAKKPDDNASPRTMMFLMSTAGALHMYDLDNLS-------- 806

Query: 781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF 840
                     Y+ E L  S     S++  G     KE +  + V +L           PF
Sbjct: 807 -------EPVYVAEGLT-STPPFLSANFTGRKAAAKERLTEILVADLG----DVVSKSPF 854

Query: 841 LFAILTDGTILCYQAYLFEGPENTS---------KSDDPVSTSRSLSVSNVSASRLRNLR 891
           L        +  Y+   ++ P ++S         K     + ++S S  +      +  R
Sbjct: 855 LILRHDTDDLTLYEPVRYQEPNSSSPPLTDTLFFKKSANATIAKSASAFDKEEDETQQRR 914

Query: 892 FSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD 951
           F   PL            PC       N+ G+   FLSG  P + +   + +     L  
Sbjct: 915 F--VPLQ-----------PC------GNVGGYSTVFLSGDSPSFVLKSAKSIPRIVGLQG 955

Query: 952 GSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
             +   +  H   C+ GFIY  ++GI ++CQLP+ + Y
Sbjct: 956 QGVQGMSTFHTAGCDRGFIYADTKGIARVCQLPTDTNY 993


>gi|452979579|gb|EME79341.1| hypothetical protein MYCFIDRAFT_104419, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 1342

 Score =  118 bits (295), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 215/975 (22%), Positives = 366/975 (37%), Gaps = 188/975 (19%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  Y L G V SLA       DN+   D+II+AF DAK+S++E+D   H +   S+H
Sbjct: 46  LSLVAEYPLAGTVISLA--RTKPRDNASGGDAIIIAFRDAKLSLVEWDPENHRISTISLH 103

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD---- 216
            +E    +    G        ++ VDP  RC  +     Q+ IL     G  L G+    
Sbjct: 104 YYEGDNVITPPFGPTLAESESILTVDPSSRCAALKFGARQLAILPFRHFGDELAGEEEED 163

Query: 217 --------------EDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
                         E T  +G       ++S V+ L  LD  + H     F+H Y EP  
Sbjct: 164 GFENEPMSAVSKRRESTHLNGEEEQTPYKASFVLPLTALDPTLSHTVHLAFLHEYREPTF 223

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            IL          +  +      +  ++    +    + +   LP   +K+  +P PIGG
Sbjct: 224 GILSAPMEPSNALLEERKDVLTYTVYTLDLEQRASTNLITVPKLPSTLWKVKPLPLPIGG 283

Query: 321 VLVVGANTIHYHSQSASC-ALALNNYA-------VSLDSSQELPRSSFSVELDAAHATWL 372
            L+VG N + +  QS    A A+N +A       +S  S   L     S+E     +  L
Sbjct: 284 ALLVGTNELVHVDQSGKANATAVNEFAKLESDFGMSDQSHLNLKLEDCSIETIDPKSGQL 343

Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-------TNPSVLTSDITTIGNSLFFL 425
                LL T  G L ++     GR +  ++++        T+ S   S I  + N   F+
Sbjct: 344 -----LLVTSDGALAIIEFKLLGRSISAINVTPVTEDNGVTSLSAAPSCIANLANGSVFI 398

Query: 426 GSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD------IEADAPSTKRLRRSSSDALQDMV 479
           GS  G S L+ ++  +          +  G        +A       L  ++ +A +  V
Sbjct: 399 GSEDGASSLMGWSQPTAPLTRKRSHAQMLGKDGDEEDEDAIEEDDDDLYDAAPEAKKRAV 458

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD-----ASATGIS 534
           +  EL           S+   + F +RD L ++GP+     G +  +      A+ATG  
Sbjct: 459 SDTELG----------SSNAAYQFEIRDHLQSLGPIHRMCVGRQGKSSDKLQLAAATG-R 507

Query: 535 KQS------NYELVELPGCKGIWTVYHKSSRGHNADSS---RMAAYDDEYHAYLIISLEA 585
           KQS      N ++V  PG         ++SR  NA S+   R     DE       +L+ 
Sbjct: 508 KQSGRLTLLNRDVVPTPG---------RASRFENAKSAWAVRAHQAGDES------TLDN 552

Query: 586 RTMVLETADLLT-EVTESVDYFV--------------QGRTIAAGNLFGRRRVIQVFERG 630
           +  V E A+    E++ + ++FV              +G T+    L   + ++Q  ++ 
Sbjct: 553 KLFVFEGANTKAYEISSADEHFVEDRYPEHAKSEWESEGETLEVVALADGKIIVQFRKQE 612

Query: 631 ARILDGSY-MTQDLSFGPSNSESGSGSENS-TVLSVSIADPYVLLGMSDGSIRLLVGDPS 688
            R  D +  M Q L   P   E    +EN   ++ +++ DPYVL+   D SI++L     
Sbjct: 613 VRTYDANLAMNQIL---PMEDE----AENELNIVHIAVCDPYVLVIRDDSSIQIL----- 660

Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLD 748
               SVQ      +  +P+ +     +K             WL+  +         G L 
Sbjct: 661 ----SVQG-----NELEPLEAEGSVAEK------------KWLTGSLY-------AGTLT 692

Query: 749 QGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI-VDTYMREALKDSETEINSSS 807
           QG     +    G L  F +P+   +F +         I VD   R A            
Sbjct: 693 QGSAAVFLLNADGGLHAFALPDLQPLFAIPTLPHLPPVIAVDAAQRRA------------ 740

Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
                 G +E +  + V +L         ++P+L        ++ Y+ + +  P+ + + 
Sbjct: 741 ------GTRETLTEVLVSDLGQHGV----TQPYLVLRTAMDDVVLYEPFHY--PQTSGRK 788

Query: 868 DDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFF 927
               S  + L        R R + FS  P  + +  E+    P    ++   I  +    
Sbjct: 789 ----SWHQDL--------RFRKVPFSHIPKYSESIAESQSARPPPLKSV--KIDTYSAIA 834

Query: 928 LSGSRPCWCM----VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQL 983
           + G+ PC  +       + L +        +     ++ V C +GF  + +   L+  QL
Sbjct: 835 IPGAPPCLLLKEPSTLPKVLEIRQSAELNRLSMLCPINRVGCENGFFMINADEELEEQQL 894

Query: 984 PSGSTYDNYWPVQKV 998
           P  + Y   W V +V
Sbjct: 895 PLNTWYGTGWSVHQV 909


>gi|342877552|gb|EGU79002.1| hypothetical protein FOXB_10431 [Fusarium oxysporum Fo5176]
          Length = 1399

 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 211/980 (21%), Positives = 361/980 (36%), Gaps = 143/980 (14%)

Query: 69  YVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ-----GG 123
           Y  R  ++   ES   G     V  D  +   L LV    L G V  LA +       GG
Sbjct: 65  YDHRANDDDGLESSFLGGESMLVRTDRTNLTKLVLVAELPLSGTVTGLAKVKTKHSKCGG 124

Query: 124 ADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR-GPL 182
                  +++++A++ AK+ +  +D     L   S+H +E  E LH      SF      
Sbjct: 125 -------EALLIAYKAAKLCMAVWDPEKSNLETISIHYYEKEE-LHGAPWEVSFDEYTNY 176

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD 242
           ++ DP  RC         + IL   Q    L  D+      G    + ES+ V N     
Sbjct: 177 LEADPGSRCAAFQFGSRNLAILPFRQAEEDLEMDDWDEDLDGPRPVK-ESTTVANGDSDT 235

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
           ++            EP   IL   +          H T  +  L +    +    I S  
Sbjct: 236 LEPA----------EPTFGILSSSQERAHSLGQKDHLTYKVFTLDLQQ--RASTTILSVT 283

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFS 361
           +LP D +K++ +P+P+GG L++G N  IH      S  +A+N+ A  + S     ++  +
Sbjct: 284 DLPRDLFKIIPLPAPVGGSLLIGENELIHVDQSGKSNGVAVNSMARQITSFSLTDQADLN 343

Query: 362 VELD--AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQ----RLDLSKTNPSVLTSDI 415
           + L+        ++N   LL    G + ++T   DGR V     R+   +   +++ S  
Sbjct: 344 LRLEHCVIETLSIENGELLLVLNDGRIGIVTFQIDGRTVSGLTVRMVADENGGNLIKSRA 403

Query: 416 TT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           +T   +G + +F+GS +GDS+++ +T   G        K    D E              
Sbjct: 404 STASKLGKNAYFVGSEVGDSVVLGWTRKMGQEKRR---KPRLIDAEIGLEMDDLDLEDED 460

Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG-LRINADASAT 531
           D   D+   E  +   + + N        SF + D+L++I P+KD + G +  + D+   
Sbjct: 461 DEDDDLYGTESAAAKPAQALNGGGKTGELSFRIHDTLLSIAPIKDLTPGKVSFHPDSEEA 520

Query: 532 GISKQSNYEL----------------------------VELPGCKGIWTVYHK----SSR 559
            +S+    +L                             E P  +  WT+  K     + 
Sbjct: 521 TLSQGVVSDLHLACVVGRGKAGSLAILNRNIQPKIIGRFEFPEARDFWTMSVKKPMPKAL 580

Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT------EVTESVDY-FVQGRTI 612
           G N           ++  Y+I++ +      ET+D+        E  +  ++    G T+
Sbjct: 581 GGNVGMGNEYETFGQHDKYMIVA-KVDLDGYETSDVYALTGAGFETLKDTEFDPAAGFTV 639

Query: 613 AAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
            AG +  + R+IQV +   R  DG   +TQ L     + E+G+      V S SIADPY+
Sbjct: 640 EAGTMGKQMRIIQVLKSEVRSYDGDLGLTQILPM--LDEETGA---EPRVTSASIADPYL 694

Query: 672 LLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
           LL   D S+ L   D +     V+   A   + K  S C     KG     +  ++D   
Sbjct: 695 LLIRDDSSLMLAQIDSNNELEEVEKMDATLQNTKWHSGCLYADTKGA---FQPNASDKGA 751

Query: 732 STGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFN-CVFTVDKFVSGRTHI-VD 789
            T                  I   +   +GAL ++ +P+ +  V+  +       H+  D
Sbjct: 752 ET----------------EKIMMFLLSSTGALHVYALPDLSKPVYVAEGLCYVPPHLSAD 795

Query: 790 TYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGT 849
             +R  L                   KEN+  + V +L           P+L        
Sbjct: 796 YTLRRGLA------------------KENLREILVADLG----DTTSQSPYLILRNQTDD 833

Query: 850 ILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGA 909
           +  Y+      P    +     S S +L+    S + L  +       D     E P   
Sbjct: 834 LTIYE------PLRHVRDGGETSLSATLTFKKTSNTTLATIPVETEQDDV----EQPRFV 883

Query: 910 PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGF 969
           P +      NI+G+   FL G  P + +   + +     L    +   +  H   C+ GF
Sbjct: 884 PLRPCA---NINGYSTVFLPGPSPSFVIKSSKSIPRVIGLQGLGVRGMSTFHTEGCDRGF 940

Query: 970 IYVTSQGILKICQLPSGSTY 989
           IY   +GI ++ QLP  + +
Sbjct: 941 IYADDKGIARVTQLPPDTNF 960


>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
           delta SOWgp]
 gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
           delta SOWgp]
          Length = 1394

 Score =  117 bits (293), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 171/723 (23%), Positives = 290/723 (40%), Gaps = 78/723 (10%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     G+    N+ +  R   ++      L LV  Y L G +  L
Sbjct: 28  NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   HG+   S+H +E  E +H       
Sbjct: 85  GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
               P L+ VDP  RC  +L +G+  + IL   Q G  LV     GD D    G      
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200

Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                     +     SS V+ L  LD  + H     F++ Y EP   IL+    T +  
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +  +      S  ++    +    + +   LP D +K++ +P PIGG L++G+N  IH  
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L  D    LL    G + +L 
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380

Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           +  DGR V     +L   K   S+L +  +   ++G    F GS   DSLL+ ++  S  
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGWSRPS-- 438

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
            ++     E   D+  D   T+       +         + +L  + S      +  F F
Sbjct: 439 QLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLNKDDFVF 498

Query: 504 AVRDSLVNIGPLKDFSYGL-----RINADASATGISKQSNYELVELPGCKGIWTVYHKSS 558
              D L N+GP+ D + G        N   S++  S      + +  G  G   V  +  
Sbjct: 499 RSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQREL 558

Query: 559 RGHNADSSRMAAYDD-------------------EYHAYLIISL-----EARTMVLETAD 594
             +  DS +M   D                     Y  YL+ S      + +++V     
Sbjct: 559 DPYVIDSMKMDNVDGVWSIQVGAPDSTNTRTSSRNYDKYLVFSKSTEPGKEQSVVYSVGG 618

Query: 595 LLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
              E  ++ ++   +  T+  G L G  RV+QV +   R  D +     +   P   E  
Sbjct: 619 SGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY--PIWDE-- 674

Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
             S+  +V+S S A+PYVL+   D S+ LL  D S     V     I SS + +S C LY
Sbjct: 675 DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHRWLSGC-LY 732

Query: 714 HDK 716
            DK
Sbjct: 733 LDK 735



 Score = 46.6 bits (109), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 45/101 (44%), Gaps = 19/101 (18%)

Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-- 974
           + +I G++  F+SGS PC+ M          +L   ++ + +  H   C  GF YV +  
Sbjct: 866 YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASV 925

Query: 975 -----------------QGILKICQLPSGSTYDNYWPVQKV 998
                            Q ++++C+LP  + +DN W  +KV
Sbjct: 926 CVPKQYFVPWNKLILVIQNMVRMCRLPGNTRFDNSWVTRKV 966


>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
           Silveira]
          Length = 1387

 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 177/726 (24%), Positives = 289/726 (39%), Gaps = 84/726 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     G+    N+ +  R   ++      L LV  Y L G +  L
Sbjct: 28  NLIVAKTSILQVFSLVNVAYGTSALPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   HG+   S+H +E  E +H       
Sbjct: 85  GRVKI--LDSRSGGEALLVATRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLV-----GDEDTFGSGGG---- 225
               P L+ VDP  RC  +L +G+  + IL   Q G  LV     GD D    G      
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDGDLDEKPEGASNIPA 200

Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                     +     SS V+ L  LD  + H     F++ Y EP   IL+    T +  
Sbjct: 201 QIAVENDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +  +      S  ++    +    + +   LP D +K++ +P PIGG L++G+N  IH  
Sbjct: 261 LRDRKDIVSYSVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L  D    LL    G + +L 
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380

Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           +  DGR V     +L   K   S+L +  +   ++G    F GS   DSLL+ +   S  
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGW---SRP 437

Query: 444 SMLSSGLKEEFGDI---EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
           S L    K E  D    +              D     VN   LS   S +N     +  
Sbjct: 438 SQLMRKPKVESADDVFGDHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495

Query: 501 FSFAVRDSLVNIGPLKDFSYGL-----RINADASATGISKQSNYELVELPGCKGIWTVYH 555
           F F   D L N+GP+ D + G        N   S++  S      + +  G  G   V  
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555

Query: 556 KSSRGHNADSSRMAAYDD-------------------EYHAYLIISL-----EARTMVLE 591
           +    +  DS +M   D                     Y  YL+ S      + +++V  
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGAPDSTNTRTSSRNYDKYLVFSKSTEPGKEQSVVYS 615

Query: 592 TADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNS 650
                 E  ++ ++   +  T+  G L G  RV+QV +   R  D +     +   P   
Sbjct: 616 VGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY--PIWD 673

Query: 651 ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSC 710
           E    S+  +V+S S A+PYVL+   D S+ LL  D S     V     I SS + +S C
Sbjct: 674 E--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHRWLSGC 730

Query: 711 TLYHDK 716
            LY DK
Sbjct: 731 -LYLDK 735



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 44/82 (53%)

Query: 917 FKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG 976
           + +I G++  F+SGS PC+ M          +L   ++ + +  H   C  GF YV +  
Sbjct: 878 YSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLRGEAVSSLSSFHIPACEKGFAYVDASN 937

Query: 977 ILKICQLPSGSTYDNYWPVQKV 998
           ++++C+LP  + +DN W  +KV
Sbjct: 938 MVRMCRLPGNTRFDNSWVTRKV 959


>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
           alecto]
          Length = 820

 Score =  116 bits (290), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 74/247 (29%), Positives = 119/247 (48%), Gaps = 16/247 (6%)

Query: 753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTG 812
           + ++  E+GA+EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T 
Sbjct: 162 WCLLVRENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATR 217

Query: 813 QGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVS 872
           QG    +  + +V L      +  SRP+L  +  D  +L Y+A+    P ++      + 
Sbjct: 218 QGELPLVKEVLLVALG-----SRQSRPYLL-VHVDQELLVYEAF----PHDSQLGQGNLK 267

Query: 873 TSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSR 932
                   N++  R +  R S+   +    E         R   F++I G+ G F+ G  
Sbjct: 268 VRFKKVPHNINF-REKKPRPSKKKAEGGAEEGPGARGRVARFRYFEDIYGYSGVFICGPS 326

Query: 933 PCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDN 991
           P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD 
Sbjct: 327 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 386

Query: 992 YWPVQKV 998
            WPV+K+
Sbjct: 387 PWPVRKI 393


>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1392

 Score =  115 bits (288), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 170/731 (23%), Positives = 296/731 (40%), Gaps = 151/731 (20%)

Query: 57  NLVVTAANVIEIY--VVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRL 109
           NLVV   ++++++  V  V  +  KE     S  S + + +++++            Y L
Sbjct: 28  NLVVVRTSLLQVFSLVKIVSSQPQKEVPEPLSSQSSQPETKLVLEK----------EYPL 77

Query: 110 HGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
            G V  L   S+    N+R   ++I++A  +AK+S++E+D    G+   S+H +E  +  
Sbjct: 78  SGTVTDL---SRVKILNTRSGGEAILIAVRNAKLSLIEWDPERRGISTISIHYYERDDLT 134

Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDED 218
                 +    G ++ VDP  RC  V  +G++ + IL   Q G  LV         G+  
Sbjct: 135 RSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELDGERP 193

Query: 219 TFGSGGGFSARIE-------------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVIL 263
           +  SGGG  A+IE             SS V+ L  LD  + H     F++ Y EP   IL
Sbjct: 194 SQNSGGG--AQIEKRKEEPDHQTPYSSSFVLPLTALDPSLLHPISLAFLYEYREPTFGIL 251

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
           + +  T    +  +      +  ++    +    + S   LP D +K++A+P P+GG L+
Sbjct: 252 YSQVATSTALLHERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALL 311

Query: 324 VGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLS 380
           +G+N I H      + A+ +N ++  + S     +S  +  L+      L  D    LL+
Sbjct: 312 LGSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLA 371

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDS 432
             +G++ L+    DGR V  + +    P+    DI        T +G+   F+GS   DS
Sbjct: 372 LASGNMALIKFKLDGRSVSGITVHSL-PAYAGGDILKSAASCSTCLGDGNVFIGSEDADS 430

Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR---RSSSDALQDM----VNGEELS 485
           +L++++  S                     ST++ R   + ++D L D+       E+  
Sbjct: 431 VLLEWSHTSA--------------------STRKARLESKQTADGLDDLSDEDDQMEDDD 470

Query: 486 LYGSASNNTE---------SAQKTFSFAVRDSLVNIGPLKDFSYGL---RINADASATGI 533
           LY SA    +         S  + ++F + D L +IGPL+D + G      N  + AT  
Sbjct: 471 LYSSAPGPIQVDNRMGTDSSTPEFYNFRLNDKLSSIGPLRDITLGKAFSNTNRKSQATTG 530

Query: 534 SKQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADSSR 567
           +  +  ELV   G                             +W+      RG       
Sbjct: 531 TVAAELELVASQGSDRGGGLVVIKREIDPLTTMSLKVDDADAVWSASVTKRRG------- 583

Query: 568 MAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGR 620
            ++ D+    Y++IS    +   E  ++     +S+  F        +  T+  G+L G 
Sbjct: 584 ASSTDNPSCQYVVISRSTDSE-QEVNEVFIVEEQSLKPFKAPEFNPNEDCTVDIGSLAGN 642

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSE---SGSGSENSTVLSVSIADPYVLLGMSD 677
            R++QV     R    SY   D+  G S          S+     S S  DPY+++   D
Sbjct: 643 TRLVQVLRNEVR----SY---DIDLGLSQIYPVWDEDTSDERVAASASFIDPYLVIIRDD 695

Query: 678 GSIRLLVGDPS 688
            S+ LL  D S
Sbjct: 696 SSVLLLQADES 706



 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 47/90 (52%), Gaps = 8/90 (8%)

Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLCDGSIVAFTVLHNVNC--NHG 968
           + I  NISG    F+ G+   +  VFR   +  P   +L  G     +   +V+   ++G
Sbjct: 877 LRILPNISGFSTIFMPGASSSF--VFRTA-KSSPHIIRLRGGFTRWLSSFDSVDTGRDNG 933

Query: 969 FIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           FIYV SQ  ++ CQLPS + +D  W ++KV
Sbjct: 934 FIYVDSQNCVRACQLPSQTQFDYPWTLRKV 963


>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
           [Penicillium digitatum Pd1]
 gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
           [Penicillium digitatum PHI26]
          Length = 1271

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 174/755 (23%), Positives = 301/755 (39%), Gaps = 147/755 (19%)

Query: 57  NLVVTAANVIEIYVV------RVQEEGSK---ESKNSGETKRRVLMDGISAASLELVCHY 107
           NL+V   ++++I+ +      ++Q+EGS+      +  ETK            L L   Y
Sbjct: 28  NLIVIRTSLLQIFSLVKIVSSQLQKEGSEPHGSQFSQPETK------------LVLEKEY 75

Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            L G V  L+ +     +N    ++I++A  +AK+S++E+D   HG+   S+H +E  + 
Sbjct: 76  PLSGTVTDLSRVKI--LNNKSGGEAILIAVRNAKLSLIEWDPERHGISTISIHYYERDDL 133

Query: 168 LHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDE 217
                  +    G ++ VDP  RC  V  +G++ + IL   Q G  LV         G+ 
Sbjct: 134 TRSPWVPDLSRCGSILSVDPSSRCA-VYNFGIRNLAILPFHQAGDDLVMDDYDSELEGER 192

Query: 218 DTFGSGGGFSARIE-----------SSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILH 264
               SGGG   +             SS V+ L  LD  + H     F++ Y EP   IL 
Sbjct: 193 PIQNSGGGAEPKKSKEGPAYQTPYCSSFVLPLTALDPSLLHPISLAFLYEYREPTFGILF 252

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVV 324
            +  T    +  +      +  ++    +    + S   LP D +K++A+P P+GG L++
Sbjct: 253 SQVATSTALLYERKDVVFYAVFTLDLEQRASTTLLSVSRLPSDLFKVVALPLPVGGALLL 312

Query: 325 GANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLST 381
           G+N I H      + A+ +N ++  + S     +S  +  L+      L  D    LL+ 
Sbjct: 313 GSNEIVHVDQAGKTNAVGVNEFSRQVSSFSMTDQSDLAFRLEGCVVERLGGDSGDLLLAL 372

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI--------TTIGNSLFFLGSRLGDSL 433
            +GD+ L+    DGR V  + +    P+    D+        + +G+   F+GS   DS+
Sbjct: 373 ASGDMALIKFKLDGRSVSGITIHLL-PAHAGGDMLKSAASCSSCLGDGNVFIGSEDADSV 431

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA-------LQDMVNGEELSL 486
           L++++  S                     STK+ R  S            D    E+  L
Sbjct: 432 LLEWSRSSA--------------------STKKARLESKQTADGFDDLEDDDDQMEDDDL 471

Query: 487 YGSASNNTESAQKT---------FSFAVRDSLVNIGPLKDFSYGLRIN---ADASATGIS 534
           Y SA  +T+   +          ++F ++D L +IGPL+D + G   +    +  AT  +
Sbjct: 472 YSSAPGSTQVDNRMGTENLTTEFYNFRLKDCLPSIGPLRDITLGKVFSNTYREKQATCEA 531

Query: 535 KQSNYELVELPG--------------------------CKGIWTVYHKSSRGHNADSSRM 568
             +  ELV   G                            G+W+   K  RG        
Sbjct: 532 VSAELELVASQGSDRGGGLVVIKREIDPLTTMSLKIDDADGVWSASVKKRRG-------A 584

Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGRR 621
           ++ D+    Y+++S    +   E  ++     +++  F        +  T+  G+  G  
Sbjct: 585 SSTDNPSRQYVVVSRSTDSEQ-ELNEVFVAEEQNLKPFRAPEFNPNEDCTVDIGSFAGDT 643

Query: 622 RVIQVFERGARILDGSYMTQDLS-FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           R++QV     R  D   M   LS   P   E    S+    +S S  DPY+++   D S+
Sbjct: 644 RLVQVLRNEVRSYD---MELGLSQIYPVWDE--DTSDERVAVSASFIDPYLMIIRDDSSV 698

Query: 681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
            LL  D +     V     I SS+    S  LY+D
Sbjct: 699 LLLQADENGDLDEVPLSTLIISSR--WRSGCLYYD 731


>gi|258575565|ref|XP_002541964.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237902230|gb|EEP76631.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1376

 Score =  114 bits (284), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 186/745 (24%), Positives = 298/745 (40%), Gaps = 123/745 (16%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   +V++++ +     G+  S ++ +  R   ++      L L+  Y L G V  L
Sbjct: 28  NLIVAKTSVLQVFSLVNVAYGASTSPSTDDKTR---VERQQYTRLVLLAEYDLPGTVTGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   HG+   S+H +E  E LH       
Sbjct: 85  GRVKT--LDSKSGGEALLVATRNAKLSLVEWDHERHGISTVSIHYYER-EDLHNSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD---EDTFG---------- 221
               P L+ VDP  RC  +L +G+  + IL   Q G  LV D   ED  G          
Sbjct: 142 LKLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDDFDEDLRGEKPEDMDNAL 200

Query: 222 ---SGGGFSAR----IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
              +     AR      SS V+ L  LD  + H     F++ Y EP   IL+    T   
Sbjct: 201 VESTAANDVARHKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHVATSFA 260

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHY 331
            +  +      +  ++    +    + +   LP D + ++ +P PIGG L++G+N  IH 
Sbjct: 261 LLGERKDVVSYAVFTLDIQQRTSTTLVTVSRLPSDLWNVVPLPPPIGGSLLIGSNELIHV 320

Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWL---QNDVALLSTKTGDLVL 388
                + A+ +N +A          +S   + L+      L     D+AL+   +G + +
Sbjct: 321 DQAGKTNAVGVNEFARQASEFSMADQSDLELRLEGCVIEQLGTESGDIALV-LASGRMAI 379

Query: 389 LTVVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGS 441
           +    DGR V     +L  ++   S+L +  +   ++G    FLGS   DS+LV +T  S
Sbjct: 380 VRFKVDGRSVSGIFVQLVSTQAGGSILKARPSCSASLGRGKIFLGSEETDSVLVGWTRPS 439

Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-------DALQDMVNGEELSLYGSASNNT 494
                                S KRL+R SS       D   D  +  E  LY + +N T
Sbjct: 440 Q--------------------SIKRLKRDSSGPRAGETDTDDDEDDIYEDDLYSTPTNQT 479

Query: 495 ESAQKT----------FSFAVRDSLVNIGPLKDFSYGLRINA-DASATGISKQS-NYELV 542
              +            F F   D L ++GP+KD + G      D ++   SK S + ELV
Sbjct: 480 TVPKTVSQTNGLIKDEFVFRCHDRLWSLGPMKDITLGRTPGTRDQASKKTSKPSTDLELV 539

Query: 543 EL--PGCKGIWTVYHKSSRGHNADSSRMAAYDD-------------------EYHAYLII 581
                G  G  T+  K    +  DS +M   D                     Y  YL+ 
Sbjct: 540 VTHGQGDAGGLTILRKELDPYIIDSMKMDNVDGVWSVQIAPSNTSNPSTTSRNYDKYLVF 599

Query: 582 SLEARTMVLETADLLTEVTESVDYFV-------QGRTIAAGNLFGRRRVIQVFERGARIL 634
           S ++R    E + + T     +D          +  T+  G L G  RV+QV     R  
Sbjct: 600 S-KSRGHAKEQSVVYTVGGNGIDEMKAPEFNPNEDHTVDIGTLAGGTRVVQVLTSEVRSY 658

Query: 635 DGSYMTQDLSFG---PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
           D      DL+     P   E    S+  +V   S A+PY+L+   D S+ LL  D S   
Sbjct: 659 D-----TDLALAQIYPVWDE--DTSDELSVTGASFAEPYLLITRDDQSLLLLQPDSSGDL 711

Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
             V     + +S K +  C LY DK
Sbjct: 712 DEVNIDGLL-TSNKWLCGC-LYFDK 734



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 24/92 (26%), Positives = 46/92 (50%), Gaps = 12/92 (13%)

Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVFRE------RLRVHPQLCDGSIVAFTVLHNVNCN 966
           R+    ++ G++  F+ GS PC+ M          RL+  P      + + +  H   C 
Sbjct: 876 RLRAIPDLCGYKTMFMPGSNPCFIMKSSTSSPHVLRLKGEP------VSSLSSFHMPACE 929

Query: 967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            GF YV ++ ++++C+LP  + +DN W  +K+
Sbjct: 930 KGFAYVDAKNMVRMCRLPGNTRFDNAWAARKI 961


>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
 gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
          Length = 1387

 Score =  114 bits (284), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 172/731 (23%), Positives = 294/731 (40%), Gaps = 94/731 (12%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     G+    N+ +  R   ++      L LV  Y L G +  L
Sbjct: 28  NLIVAKTSILQVFSLVNVAYGTSAPPNADDKGR---VERQQYTKLILVAEYDLSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    ++++++  +AK+S++E+D   HG+   S+H +E  E +H       
Sbjct: 85  GRVKI--LDSRSGGEALLVSTRNAKLSLVEWDHERHGISTISIHYYER-EDVHSSPWTPD 141

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDE-----DTFGSGGG---- 225
               P L+ VDP  RC  +L +G+  + IL   Q G  LV DE     D    G      
Sbjct: 142 LRLCPSLLAVDPSSRCA-ILNFGIHSVAILPFHQTGDDLVMDEFDEDLDEKPEGASNIPA 200

Query: 226 ----------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                     +     SS V+ L  LD  + H     F++ Y EP   IL+    T +  
Sbjct: 201 QAAVANDTTMYKTPYASSFVLPLTALDPALVHPIHLAFLYEYREPTFGILYSHLTTSSAL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +  +      +  ++    +    + +   LP D +K++ +P PIGG L++G+N  IH  
Sbjct: 261 LHDRKDIVSYAVFTLDIQQRASTTLITVSRLPSDLWKVVPLPPPIGGALLIGSNELIHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L  D    LL    G + +L 
Sbjct: 321 QAGKTNAVGINEFARQASAFSMVDQSDLGLRLEGCVVEQLGTDSGDILLVLADGKMAILR 380

Query: 391 VVYDGRVVQ----RLDLSKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGT 443
           +  DGR V     +L   K   S+L +  +   ++G    F GS   DSLL+ ++  S  
Sbjct: 381 LKVDGRSVSGISAQLVSEKAGGSILKARPSCSASLGRGKVFFGSEETDSLLIGWSRPS-Q 439

Query: 444 SMLSSGLK---EEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT 500
           SM    ++   + FG  +              D     VN   LS   S +N     +  
Sbjct: 440 SMRKPKVESADDVFG--DHSETEDDEDDIYEDDLYSTPVNQTTLSKTTSQTNGLN--KDD 495

Query: 501 FSFAVRDSLVNIGPLKDFSYGL--------------RINADASATGISKQSN-------- 538
           F F   D L N+GP+ D + G               R +AD        + N        
Sbjct: 496 FVFRSHDRLWNLGPMSDVTLGRPPGSHDKNRKQSSSRTSADLELVVTQGKGNAGGLAVLQ 555

Query: 539 -------YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                   + +++    G+W++   +      DS+        Y  YL+ S      + +
Sbjct: 556 RELDPYVIDSMKMDNVDGVWSIQVGA-----PDSTNTRTSSRNYDKYLVFSKSTEPGKEQ 610

Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSF 645
           ++V        E  ++ ++   +  T+  G L G  RV+QV +   R  D +     +  
Sbjct: 611 SVVYSVGGSGIEEMKAPEFNPNEDSTVDIGTLAGGTRVVQVLKSEVRSYDTNLELAQIY- 669

Query: 646 GPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKK 705
            P   E    S+  +V+S S A+PYVL+   D S+ LL  D S     V     I SS +
Sbjct: 670 -PIWDE--DTSDELSVVSASFAEPYVLIVRDDQSLLLLQADKSGDLDEVNI-DGILSSHR 725

Query: 706 PVSSCTLYHDK 716
            +S C LY DK
Sbjct: 726 WLSGC-LYLDK 735



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 55/108 (50%), Gaps = 8/108 (7%)

Query: 891 RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLC 950
           RF  +P  AY     PH    + +  + +I G++  F+SGS PC+ M          +L 
Sbjct: 860 RFDPSP-KAYM----PHS---KFLRAYSDICGYKTVFMSGSNPCFVMKSSTSSPHVLRLR 911

Query: 951 DGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             ++ + +  H   C  GF YV +  ++++C+LPS + +DN W  +KV
Sbjct: 912 GEAVSSLSSFHIPACEKGFAYVDASNMVRMCRLPSNTRFDNSWVTRKV 959


>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex
           subunit cft1 cpsf subunit, partial [Rhipicephalus
           pulchellus]
          Length = 726

 Score =  113 bits (283), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 83/257 (32%), Positives = 117/257 (45%), Gaps = 46/257 (17%)

Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE-INSSSEEGTGQG 814
           V  E+G LEI+ +P +   F V  F  G+  +VD+    A   +++E ++  S E     
Sbjct: 73  VARENGVLEIYSLPEYKLCFLVKNFPMGQKVLVDSVQMTAPSGTKSEKLSDMSHESMPV- 131

Query: 815 RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTS 874
               +H + VV L ++     HSRP L A + D  +L Y+A+ F              T 
Sbjct: 132 ----VHEILVVGLGIR-----HSRPLLLARV-DEDLLIYEAFPF------------YETQ 169

Query: 875 RSLSVSNVSASRLRNLRFSRTPLDAYTRE-----ETPHGAPCQR-------ITIFKNISG 922
           R   +          LRF +   D + RE     + P     ++       +  F +ISG
Sbjct: 170 REGHL---------KLRFKKMSHDIFLRERKYKTQKPENEEEEKAFQSRQWLHPFSDISG 220

Query: 923 HQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKIC 981
           + G FL G RP W  M  R  LR HP   DG I  F   HNVNC  GF++   QG L+I 
Sbjct: 221 YSGVFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQGELRIS 280

Query: 982 QLPSGSTYDNYWPVQKV 998
            LP+  TYD  WPV+KV
Sbjct: 281 TLPTHLTYDAPWPVRKV 297


>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
 gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
          Length = 1458

 Score =  112 bits (281), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 200/917 (21%), Positives = 360/917 (39%), Gaps = 164/917 (17%)

Query: 57  NLVVTAANVIEIYVVRVQE-------EGSKESKN---------SGETKRRVLMDGISAAS 100
           NLVV  +N++ I+ VR +        E  +E K           GE       DG    S
Sbjct: 40  NLVVARSNLLRIFEVREEPCAVPHGVEDERERKGGIRRGTEAVEGELAMDAQGDGFINVS 99

Query: 101 ----------------LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
                           L LV  ++LHG V  L+ + +  A    + D ++++F+DAKI++
Sbjct: 100 KGMAMKSDVEHPKTTRLYLVREHKLHGMVTGLSGV-RIIASLEDKLDRLLVSFKDAKIAL 158

Query: 145 LEFDDSIHGLRITSMHCFE-SPEWLHLKRGRESFARGPLVK----VDPQGRCGGVLVYGL 199
           LE+ D++H L   S+H +E +P+   L          PL K    VDPQ RC  + +   
Sbjct: 159 LEWSDAVHDLVPVSIHTYERAPQLTSLT--------APLFKSQLRVDPQSRCAALGLPNH 210

Query: 200 QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR---DLDMKHVKDFIFVHGYI 256
            + IL               F            S +++L    + ++++V DF F+ G+ 
Sbjct: 211 ALAILP--------------FLDDAVSDVPYSPSFILDLAVSVNPNIRNVADFCFLPGFN 256

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           +P + ++ E   TW GR+     T  +   ++      +P+I S   LP D+  L  VP+
Sbjct: 257 KPTLAVMFEPLQTWMGRIGEYKDTVKLVIFTLDIKTSSYPIITSVDGLPMDSLGL--VPA 314

Query: 317 PIGGVLVVGANTIHYHSQSAS--CALALNNYA--VSLDSSQELPRSSFSVELDAAHATWL 372
             GGV++   N++ Y  QS+S   A+ +N +A  ++      LP    ++ L+ +    +
Sbjct: 315 -FGGVVITTPNSLIYIDQSSSRQIAVPVNGWASRITDLPLLPLPSPDLNLTLEGSKTVVV 373

Query: 373 QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGS 427
                 +    G +  + V+ DG+ V +L + K     T PSV+ S              
Sbjct: 374 DEKTLFVILANGIIYPIEVMADGKTVTKLQVGKPLAQATIPSVVES-------------- 419

Query: 428 RLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR-----RSSSDALQDMVNGE 482
            LGD  L   +      +L +   EE  D E +  + K +          D      + +
Sbjct: 420 -LGDGHLFVGSTVGVGVVLKTAWVEEEVDDEEEGTNAKVVEDDIDMDLYDDDDDLYGDSK 478

Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD------ASATG---- 532
             +   +   +T+  +     ++RD+L   GP+   ++ L    D       +ATG    
Sbjct: 479 NKTQVTAEVKDTKKYRSVLHLSLRDTLPAYGPISSLTFSLATEGDKPVPELVTATGSGIL 538

Query: 533 ---------ISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY----- 578
                    +  ++  +++ + G +G+W++  + S      SS   A +   HA      
Sbjct: 539 GGFTLFQRDLPTRTKKKILAVGGTRGLWSLPIRQSVKKGGSSSSTTAIE---HAKTERDT 595

Query: 579 LIISLEA-------RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
           LI+S +A       R          TEV  ++   V G T+ A   F R  ++ V     
Sbjct: 596 LILSTDATPSPGVSRIATRAPPGGKTEV--NITTRVPGTTVGAAPFFQRTAILVVMTNSI 653

Query: 632 RILDGSYMTQDLSFGPSNSESGSGSE------NSTVLSVSIADPYVLLGMSDGSIRLLVG 685
           ++L+           P  +E  +  +         + S SI DP+VL+   D S+ L +G
Sbjct: 654 KVLE-----------PDGTERQTIQDMDGKLLRPKIRSCSICDPFVLIIREDDSLGLFIG 702

Query: 686 DPSTCTVSVQTPAAI-ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAI--DGA 742
           +     +  +  + + E + K ++ C      G      +TS     +T   + +   G+
Sbjct: 703 ETERGKIRRKDMSPMGEKTSKYLAGCFFTDTSGLFGQQFETSVPVEGATATLQNVVSGGS 762

Query: 743 DGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETE 802
             G   Q   + ++    G +EI+ +P     F+V    S    +VD++ + AL  S   
Sbjct: 763 TSGGKPQHTQWLLLVRPQGVMEIWTLPKLTLAFSVSAVPSLFNVLVDSHDKPAL--SVPN 820

Query: 803 INSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE 862
                +   G+   E +   +V E           R  LF  L +G +  Y+A     P 
Sbjct: 821 PGDPPQRKPGEFDVEQVCVSRVGE-------DGRGRVCLFVFLRNGQLTIYEAL----PL 869

Query: 863 NTSKSDDPVSTSRSLSV 879
           +T+ S    S   ++ V
Sbjct: 870 STTASQPAASVDGAMDV 886


>gi|406602601|emb|CCH45811.1| hypothetical protein BN7_5397 [Wickerhamomyces ciferrii]
          Length = 1287

 Score =  112 bits (281), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 129/613 (21%), Positives = 261/613 (42%), Gaps = 62/613 (10%)

Query: 96  ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
           I + + +L+ ++    N + + I S    D+   + +I+ +   AK+S++ FD  ++ ++
Sbjct: 42  IDSKNDKLILNHEFKLNGKIIGIKSIKLPDSQYDQLAILTSL--AKLSIVSFDHDLNTIQ 99

Query: 156 ITSMHCFESPEWLH-LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
             S+H +ES  +   + +  ES      +K+DP  +   ++VY   +  L   Q    ++
Sbjct: 100 TNSLHYYESEFYTKSISKINES-----QLKIDPNNQTS-LVVYNDLLAFLPFKQDDDEII 153

Query: 215 GDEDTFGSGGGFSARIESSH--VI---NLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
            D+    S       IE  H  +I   N  +  + ++ D  F+H Y +P + ILH +E T
Sbjct: 154 DDDHHTQSNDQQQQNIELFHNSIILPANKLESTVSNIIDCDFLHSYRDPTLAILHNKEQT 213

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-T 328
           WA  +S K  T     LS+         I    NLP+D + +  +P PI G L++G N  
Sbjct: 214 WASDLSIKKDTVNFVVLSLDLLNDSSTAILLVENLPYDLWFVKPLPDPINGTLLIGCNEI 273

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           IH  +   +  + LN Y   +   +   +S  ++ L+ +    L +   L+  + G+   
Sbjct: 274 IHIDNSGNTKGIGLNKYYQDITDFKLKDQSDLNIFLEHSKVEILNDKNILIIDQFGESYN 333

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L    DG+ V+ L ++K    +       IT I     F+G +  DS+L+++        
Sbjct: 334 LQFFIDGKSVKDLLITKFEKDLQIRSPISITNIDEQNIFIGCQSSDSILIKY-------- 385

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
               LK+E  + +   P+  +      D           +      N        F+  +
Sbjct: 386 --EKLKQETNEAKPTTPAATKTNNDDDDEDLYEDEDLNNNNDDELIN--------FNLQI 435

Query: 506 RDSLVNIGPLKDFSYGLRINADASATGIS--KQSNYELVEL--PGCKGIWTVYHKSSRGH 561
           +D L N GPL  F+ G +IN ++   G++   Q++  +V     G +G  T++++S +  
Sbjct: 436 KDKLFNAGPLSSFTLG-KINPNSLIQGLTNPNQNDVSIVGTSGEGKQGKLTLFNQSIQPK 494

Query: 562 NADSSRMAAYDDEY---HAYLIIS-LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL 617
              S +    +  +   + YLI + L+     +   +   +  +S D+     TI    +
Sbjct: 495 IHSSLKFNNINKTWNILNKYLITTDLQNFKSEIFLINENFKNFQSFDFKNNNITINIDTI 554

Query: 618 FGRRRVIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLG 674
             ++R++Q+      + D ++   +  +  F               +++  I DP++++ 
Sbjct: 555 QSQKRILQITSNNVYLFDLNFKKLLQINFDF--------------EIINGKIFDPFIIIT 600

Query: 675 MSDGSIRLLVGDP 687
            S G +++   DP
Sbjct: 601 SSKGEVKIFEMDP 613


>gi|154320778|ref|XP_001559705.1| hypothetical protein BC1G_01861 [Botryotinia fuckeliana B05.10]
          Length = 1153

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 165/763 (21%), Positives = 300/763 (39%), Gaps = 135/763 (17%)

Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDS 351
           K    I S   LP+D ++++ +  P+GG L+VG N  IH      +  +A+N +A     
Sbjct: 10  KASTTILSVGGLPYDLFRIVPLAPPVGGALLVGTNELIHIDQAGKANGVAVNMFAKQCTG 69

Query: 352 SQELPRSSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP- 408
              L ++   + L+      L  +N   L+   +GD+ +L+   DGR V  L + + +  
Sbjct: 70  FSLLDQADLDLRLEGCKIDQLSIENGEMLIILHSGDIAILSFRMDGRSVSGLSIRRVSAE 129

Query: 409 ---SVLT---SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
              ++LT   S ++++G    F+GS + DS+++ +   SG +       +     E D  
Sbjct: 130 LGGAILTGAASCVSSLGAGSLFVGSEVSDSVILGWNRKSGQTSRRKSRLDSSAIAEVD-- 187

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--FSFAVRDSLVNIGPLKDFSY 520
                     +   D + G+  ++  + +N T S  KT  ++F + DS+VNI P+ + ++
Sbjct: 188 -EAMFDEEDLEDDDDDLYGDGPTITHATANITASNSKTGDYTFRIHDSMVNIAPITNIAF 246

Query: 521 G---LRINADASATGISKQSNYELV--------------------------ELPGCKGIW 551
           G   L +  D        QS  +LV                          +LP  +GIW
Sbjct: 247 GEAALSLGKDEELKSSGVQSELQLVAAVGREKGGSLAVINREIQPNVIGRFDLPEARGIW 306

Query: 552 TVYHK--SSRGHNADSSRMA-----AYDDEYHAYLIISL--EARTMVLETA-----DLLT 597
           T+  K  + +G   +  +         D +Y   +I+S   +A   + E+A     D   
Sbjct: 307 TMSAKRPAPKGLQVNKEKSVTSGDYGVDAQYDRLMIVSKASDAEDAIEESAVYALTDAGF 366

Query: 598 EVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSG 655
           E     ++    G TI AG L    RV+Q+ +   R  DG   + Q L     + E+G+ 
Sbjct: 367 EALTGTEFEPAAGSTIEAGTLGNGMRVVQILKSEVRSYDGDLGLAQILPM--LDDETGA- 423

Query: 656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
                ++S S ADP++LL   D SI +   D       ++    I  S K ++ C LY D
Sbjct: 424 --EPKIISASFADPFLLLIRDDASIFVAQCDDDNDLEEIERVDDILLSTKWLTGC-LYDD 480

Query: 716 KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVF 775
                      +D+  S   GE             ++   +    GAL I+ +P+ +   
Sbjct: 481 ------YSGAFSDSK-SNKAGE-------------NVKMFLLSAGGALHIYALPDLSKPV 520

Query: 776 TVDK---FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW 832
            V +   FV           + A +++ TEI                       L     
Sbjct: 521 YVAEGICFVPPVLSADYAARKSAARETLTEI-----------------------LVANLG 557

Query: 833 SAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF 892
            +    P+L    ++  +  Y+ +  +            S S  L  S +   +++N   
Sbjct: 558 DSVSQSPYLILRPSNDDLTIYEPFRVK------------SASPDLLSSTLQFLKIQNTHL 605

Query: 893 SRTPLDAYTREETPHGA------PCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVH 946
           ++ P    + EE   GA      P + I+   N+ G+   F+ G  P + +   +     
Sbjct: 606 TQAP--DVSAEEQVDGAQQTSDKPMRAIS---NLGGYSTVFMPGGSPSFIIKSSKTAPKV 660

Query: 947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
             L    + + +  H   C+ GFIY +++GI ++ Q P  +T+
Sbjct: 661 LSLQGTGVRSLSSFHTEGCDRGFIYASTEGIARVAQFPPNTTF 703


>gi|58268668|ref|XP_571490.1| cleavage and polyadenylation specific protein [Cryptococcus
           neoformans var. neoformans JEC21]
 gi|134113364|ref|XP_774707.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|338817789|sp|P0CM63.1|CFT1_CRYNB RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|338817790|sp|P0CM62.1|CFT1_CRYNJ RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|50257351|gb|EAL20060.1| hypothetical protein CNBF3860 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57227725|gb|AAW44183.1| cleavage and polyadenylation specific protein, putative
           [Cryptococcus neoformans var. neoformans JEC21]
          Length = 1431

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 161/716 (22%), Positives = 300/716 (41%), Gaps = 108/716 (15%)

Query: 57  NLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---------------- 96
           NLVV  A V+ ++ +R +     E  K  ++  E ++ V M+ +                
Sbjct: 48  NLVVAGAEVLRVFEIREESVPIIENVKLEEDVAEGEKDVQMEEVGDGFFDDGHAERAPLK 107

Query: 97  --SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
             +   L L+  + L+G +  LA  ++         D +I++F+DAK+++LE+  S   +
Sbjct: 108 YQTTRRLHLLTQHELNGTITGLAA-TRTLESTIDGLDRLIVSFKDAKMALLEW--SRGDI 164

Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
              S+H +E    ++     +S+   PL++ DP  R   + +    + +L   Q  S L 
Sbjct: 165 ATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDSLAVLPLIQEQSEL- 220

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDM--KHVKDFIFVHGYIEPVMVILHERELTWAG 272
              D    G    A    S V++L D+ +  K+++D +F+ G+  P + +L     TW+G
Sbjct: 221 ---DPLSEGFSRDAPYSPSFVLSLSDMSITIKNIQDLLFLPGFHSPTIALLFSPMHTWSG 277

Query: 273 RV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
           R+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS +GG+++V +  I 
Sbjct: 278 RLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSELGGIVLVTSTGIV 336

Query: 331 YHSQ----SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL 386
           +  Q    +A+C  A  +   SL  S  +   S  + L+ +   ++     LL  + G +
Sbjct: 337 HVDQGGRVTAACVNAWWSRITSLKCS--MASVSQKLTLEGSRCVFVTPHDMLLVLQNGAV 394

Query: 387 VLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
             +    +GR   V++ LD     P    SD+T  G+   F+GS  GDS L +       
Sbjct: 395 HQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSWLAKVNVVRQV 452

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
              S   K+E  +++ D    + L    +DA  D    E   L+G A+          + 
Sbjct: 453 VERSEKKKDEM-EVDWD----EDLYGDINDAALDEKAQE---LFGPAA---------ITL 495

Query: 504 AVRDSLVNIGPLKDFSYGL-----------------------RINADASATGISKQSNYE 540
           +  D L  +G + D  +G+                        IN       I+K+  + 
Sbjct: 496 SPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTINVFRRGIPITKRRRFN 555

Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
             EL   +G+W +      G      +     +   A +++S E          L ++ T
Sbjct: 556 --ELLNAEGVWFLPIDRQTGQ-----KFKDIPEAERATILLSSEGNAT--RVFALFSKPT 606

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENS 659
                 + G+T++A   F R  +++V      +LD +  + Q +         G G +  
Sbjct: 607 PQQIGRLDGKTLSAAPFFQRSCILRVSPLEVVLLDNNGKIIQTV------CPRGDGPK-- 658

Query: 660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD 715
            +++ SI+DP+V++  +D S+   VGD    TV+ + P   E       +  ++ D
Sbjct: 659 -IVNASISDPFVIIRRADDSVTFFVGDTVARTVA-EAPIVSEGESPVCQAVEVFTD 712


>gi|401889164|gb|EJT53104.1| cleavage and polyadenylation specific protein [Trichosporon asahii
           var. asahii CBS 2479]
          Length = 1358

 Score =  110 bits (276), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 228/968 (23%), Positives = 377/968 (38%), Gaps = 195/968 (20%)

Query: 45  ELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMD---------G 95
           E+P  + +G   NLVV     + ++ +R  EE +    +     ++  MD          
Sbjct: 39  EVPDVKVVG---NLVVAGGQDLRVFEIR--EESTPLPDDESAVPKQEDMDVGDSFFDSAP 93

Query: 96  ISAAS--------LELVCHYRLHGNVESLAILSQGGADNSRR-RDSIILAFEDAKISVLE 146
           I  A         L L+  + LHG V  LA L     D+S    D ++++FE AK S  +
Sbjct: 94  IERAPVRYKTTRRLHLLTRHTLHGVVTGLAGLRT--IDSSVDGLDRLLVSFEHAKWSRGD 151

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
                  +   S+H +E  + + +    + +   P+++ DP  R   + +    + +L  
Sbjct: 152 -------IATVSLHTYERCQQM-INGNFQGYV--PMLRSDPLSRLAILTLPEDALAVLPI 201

Query: 207 SQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHER 266
            Q  S L   +D+  S                   ++K++KDF+F+ G+  P + +L   
Sbjct: 202 VQEQSELDAMQDSVSSP------------------EIKNIKDFLFLPGFHSPTIALLFAP 243

Query: 267 ELTWAGRVSWKHHTCMISALSISTTLK-QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
             TWAGR      T  +   +I T+    +PLI S   LP D+  L+A PS +GGV+VV 
Sbjct: 244 MNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLITSVTGLPSDSQYLVACPSEVGGVVVVT 303

Query: 326 ANTIHYHSQSAS-CALALN---NYAVSL--DSSQELPRSSFSVELDAAHATWLQNDVALL 379
           A+ I +  QS    + ++N   NY  ++  DSS E    S  + LD +HA ++  +  LL
Sbjct: 304 ASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSYE----SQKLALDNSHAQFVTENDMLL 359

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT-SDITTIGNSLFFLGSRLGDSLLVQFT 438
             +TG++  +    DGR V  + + + + +V   S +   G+   F+GS  GDSLL    
Sbjct: 360 VLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPPSTLVPAGSDGIFVGSVEGDSLLAMVE 419

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE---ELSLYGSASNNTE 495
                S      +EE        P TK+      D  +++  G     +      S    
Sbjct: 420 KARDQSA-----QEE--------PETKQQEMDVDDWDEEVATGPVTVSVKAQDVLSGIGR 466

Query: 496 SAQKTFSFAVRD-------SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCK 548
            A   F  AV D        LV IG     S G  +N       I+K+  +E  +L    
Sbjct: 467 IADMEFGIAVTDLGTRTYPQLVCIG---GGSQGSTMNVFRRGIPITKRRLFE--QLRTAV 521

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
             W +  + +   NA   +     ++    +  + E  T +   +    +V E +  F +
Sbjct: 522 ATWFLPVERA---NAPKFKDIPESEQSTIAIAATQEGSTQIFALS--TRKVQERIAEFPE 576

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSES-GSGSENS---TVLSV 664
              IA G    R R++ V      +LD            SN+   G+  E S    +++ 
Sbjct: 577 P-AIATGTWLRRTRIVLVLPSQVLLLD------------SNANPVGTICEMSDAPPIVAA 623

Query: 665 SIADPYVLLGMSDGSIRLLVGD-----------PSTCTVSVQTPAAI------------- 700
           SIADPYVL+  +DGS+ + VGD           P    + V   A +             
Sbjct: 624 SIADPYVLIRRADGSVSVFVGDTVEGKWSEAPMPEGLALPVCQAAEVFTDTTGIYRTFEA 683

Query: 701 -----ESSKKPVSS----CTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD 751
                E   KPV +        H  G E   R   +   +S  V       +     +G 
Sbjct: 684 TQGVKEEPVKPVPTKQGQKAKIHLTG-EQLKRLQDSKPAISADVATTESAFNAA---RGT 739

Query: 752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGT 811
            +  +  +SG L+I  +P+F+ V   +        + D+    +  D +T      EEG 
Sbjct: 740 QWIALLAQSGELQIRSLPDFDLVLQSNG-------VYDS--EPSFTDDQTGELPELEEG- 789

Query: 812 GQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPV 871
                + +  M    +  +       RP +  +   G +  Y+A     P  T  + D  
Sbjct: 790 -----DEVSQMLFCPIGTRTL-----RPHVIVLHRSGRLNIYEAQ----PRFTVDARD-- 833

Query: 872 STSRSLSVSNVSASRLRNLRFSRTPLDAYTREET--PHGAPCQRITIFKNISGHQGFFLS 929
            + RSL+V      R R +    T L + T   T  P   P      F +I G  G F++
Sbjct: 834 QSRRSLAV------RFRKV---HTQLLSVTPSSTVKPAAIP------FTDIEGLTGAFIT 878

Query: 930 GSRPCWCM 937
           G RP W +
Sbjct: 879 GERPHWII 886


>gi|356527660|ref|XP_003532426.1| PREDICTED: disease resistance response protein 206-like [Glycine
           max]
          Length = 281

 Score =  110 bits (275), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 56/106 (52%), Positives = 76/106 (71%), Gaps = 8/106 (7%)

Query: 1   MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK--RGIGPVPNL 58
           MSFAAYKMM  PTGI NC  GF+THSR+D+VP    +Q +++D E PS+    +G +PNL
Sbjct: 1   MSFAAYKMMQCPTGIDNCAVGFLTHSRSDFVP----LQPDDIDVEWPSRPCHHVGSLPNL 56

Query: 59  VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELV 104
           +VT ANV+E+Y VR+QE+ S   K + +++   L+DGI  ASLELV
Sbjct: 57  IVTVANVLEVYAVRLQEDQSP--KAAIDSRSDTLLDGIVGASLELV 100


>gi|405121446|gb|AFR96215.1| cleavage and polyadenylation specific protein [Cryptococcus
           neoformans var. grubii H99]
          Length = 1431

 Score =  110 bits (274), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 160/726 (22%), Positives = 301/726 (41%), Gaps = 107/726 (14%)

Query: 45  ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
           + P  + IG   NLVV  A V+ ++ +R +     E +K  ++  E ++ V M+ +    
Sbjct: 39  DTPDVKVIG---NLVVAGAEVLRVFEIREESVPIIEKAKLEEDVAEGEKDVQMEEVGDGF 95

Query: 97  --------------SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKI 142
                         +   L L+  + L+G V  LA  ++         D +I++F+DAK+
Sbjct: 96  FDDGHAERAPLKYQTTRRLHLLTQHELNGTVTGLAA-TRTLESTIDGLDRLIVSFKDAKM 154

Query: 143 SVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
           ++LE+  S   +   S+H +E    ++     +S+   PL++ DP  R   + +    + 
Sbjct: 155 ALLEW--SRGDIATVSLHTYERCSQMNTG-DLQSYV--PLLRTDPLWRLAVLTLPEDSLA 209

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVM 260
           +L   Q  S L    D    G    A    S V++L D+   +K+++D +F+ G+  P +
Sbjct: 210 VLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFLPGFHSPTI 265

Query: 261 VILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            +L     TW+GR+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS +
Sbjct: 266 ALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPSEL 324

Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQNDV 376
           GG+++V +  I +  Q    A A  N   S  +S +   +S S  + L+ +   ++    
Sbjct: 325 GGIVIVTSTGIVHVDQGGRVAAACVNAWWSRITSLKCSTASVSQKLTLEGSRCVFVTPHD 384

Query: 377 ALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
            LL  + G +  +    +GR   V++ LD     P    SD+T  G+   F+GS  GDS 
Sbjct: 385 MLLVLQNGAVHQVRFSMEGRAVGVIEVLDKGCVVPP--PSDLTVAGDGAVFVGSAEGDSW 442

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
           L +          +   K+E  +++ D    + L    +DA  D    E+   +G A+  
Sbjct: 443 LAKVNVVRQVVERAEKKKDEM-EVDWD----EDLYGDINDAALDEKAQEQ---FGPAA-- 492

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSNYELV--- 542
                   + +  D L  +G + D  +G+  +           + +G S+ S + +    
Sbjct: 493 -------ITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNSTFNVFRRG 545

Query: 543 ----------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA---RTMV 589
                     EL    G+W +      G      +     +   A +++S E    R   
Sbjct: 546 IPITKRRRFNELLNADGVWFLPIDRQTGQ-----KFKDIPEAERATMLLSSEGNATRVFA 600

Query: 590 LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSN 649
           L +     ++       + G+T++A   F R  ++ V      +LD +        G   
Sbjct: 601 LSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNN--------GKII 647

Query: 650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSS 709
                  +   +++ SI+DP+V++  +D S+   VGD    TV  + P   E       +
Sbjct: 648 QTVCPRGDGPKIVNASISDPFVIIRRADDSVTFFVGDTVARTVG-EAPIVSEGESPVCQA 706

Query: 710 CTLYHD 715
             ++ D
Sbjct: 707 VEIFTD 712


>gi|50552095|ref|XP_503522.1| YALI0E03982p [Yarrowia lipolytica]
 gi|74634000|sp|Q6C740.1|CFT1_YARLI RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|49649391|emb|CAG79101.1| YALI0E03982p [Yarrowia lipolytica CLIB122]
          Length = 1269

 Score =  109 bits (272), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 202/935 (21%), Positives = 351/935 (37%), Gaps = 162/935 (17%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           A  LEL+  Y L G V  +  +     DN    DS+ ++ + AK  ++ ++ S   +   
Sbjct: 51  APRLELITEYYLDGTVTGVTRIKT--IDN-YDLDSLYISVKHAKAVIVAWNASSFTIDTK 107

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
           S+H +E  + L      E       V  +       +L    +M  L   + G   + D+
Sbjct: 108 SLHYYE--KGLVESNFFEPECSSVAVSDEANSFYTCLLFQNDRMAFLPIIEKG---LDDD 162

Query: 218 DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
           +   SG  F    + S ++    LD  +++V D  F+H Y E  M IL + +  W G  +
Sbjct: 163 EMPESGQVF----DPSFIVKASRLDKRIENVMDICFLHEYRETTMGILFQPKRAWVGMKN 218

Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
               T   + +S+    K   +I +   LP DA K++ +P+P+GG L++ ANTI Y   S
Sbjct: 219 ILKDTVSYAIVSVDVHQKNSTVIGTLNGLPVDAQKVIPLPAPLGGSLIICANTILYIDSS 278

Query: 336 ASCALALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQN--DVALLSTKTGDLVLLTV 391
           AS    + N     +S   + R  S+  + L+ A   ++Q   + ALL T+ G    L  
Sbjct: 279 ASYTGVMVNNTHRQNSDLIVSRDQSTLDLRLEGAEVCFIQELGNTALLVTEDGQFFSLLF 338

Query: 392 VYDGRVVQRLDLSKTNPS--VLT--SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
             DGR V  L+L    P   +L+  S +    +   FLGSR GDSLLV++  G   S   
Sbjct: 339 NKDGRRVASLELRPIEPDNFILSQPSSVAAGPDGTIFLGSRAGDSLLVKWYHGEPESQPE 398

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE-SAQKTFSFAVR 506
             L                          D  N  +  LYG  +  TE +  +     + 
Sbjct: 399 ETL--------------------------DDGNESDDDLYGGDTAQTEDTTNRPLKLRLA 432

Query: 507 DSLVNIGPLKDFSYGLRINAD----ASATGISKQSNYELV--------------ELPGCK 548
           D ++ +GP++  + G    +      + TG+   S   ++              ++PG +
Sbjct: 433 DRMLGMGPMQSLALGKNRGSQGVEFVTTTGVGANSALAILTSALMPYKRKSLYKDMPGGQ 492

Query: 549 GIWTVYHK-SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
             W+V  +    G  A S       D  ++YL     A   V+E   L T+  ++  +FV
Sbjct: 493 -FWSVPVRFEEEGEVAKSRTYVVSSDSENSYLYYVDAAG--VIEDVSLSTKKKKTKKHFV 549

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
              T    +      ++QV      I D                  S  + +T +   + 
Sbjct: 550 SNVTTIFSSSMLDSALLQVCLETVNIYDAKI---------GQPHKYSLPQGTTAVEARVL 600

Query: 668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST 727
             YVL+ +SDG +++L        VS+     +++++  +   +     G        +T
Sbjct: 601 GNYVLVLLSDGQVKILEA------VSINKRPFLKAAQVSIEPASESKAIG------IYAT 648

Query: 728 DAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHI 787
           D+ L+ G         G P        VVCY  G+L             +    S    I
Sbjct: 649 DSSLTFGAPSKKRTRQGSPAQDSRPVVVVCYADGSL------------LLQGLNSDDRLI 696

Query: 788 VDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTD 847
           +D           ++++   +E  GQ        +++V++A+      H       +LT 
Sbjct: 697 LDA----------SDLSGFIKEKDGQLYDA---PLELVDIALSPLGDDHILRDYLVLLTP 743

Query: 848 GTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH 907
             ++ Y+ Y +                               LRF +  L     E TP 
Sbjct: 744 QQLVVYEPYHYND----------------------------KLRFRKIFL-----ERTPT 770

Query: 908 GAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCD----GSIVAFTVLHNV 963
               +R+T    I+G     ++G       +  + L   P+L +       VAFT     
Sbjct: 771 INSDRRLTQVPLINGKHTLGVTGET---AYILVKTLHTSPRLIEFGETKGAVAFT----- 822

Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           + +  F Y+T  G +  C+     + +  WPV+ V
Sbjct: 823 SWDGKFAYLTQAGEVAECRFDPSFSLETNWPVKHV 857


>gi|321260384|ref|XP_003194912.1| cleavage and polyadenylation specific protein [Cryptococcus gattii
           WM276]
 gi|317461384|gb|ADV23125.1| cleavage and polyadenylation specific protein, putative
           [Cryptococcus gattii WM276]
          Length = 1431

 Score =  108 bits (271), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 162/734 (22%), Positives = 302/734 (41%), Gaps = 123/734 (16%)

Query: 45  ELPSKRGIGPVPNLVVTAANVIEIYVVRVQE----EGSKESKNSGETKRRVLMDGI---- 96
           + P  + IG   NLVV  A  + ++ +R +     E  K  ++  E K+ V M+ +    
Sbjct: 39  DTPDVKVIG---NLVVAGAEALRVFEIREESVPIIEKVKLEEDVAEGKKDVQMEEVGDGF 95

Query: 97  --------------SAASLELVCHYRLHGNVESLAILS--QGGADNSRRRDSIILAFEDA 140
                         +   L L+  + L+G V  LA     +   D     D +I++F+DA
Sbjct: 96  FDDGHAERAPLKYQTTRRLYLLAQHELNGTVTGLAATRTLESAIDG---LDRLIVSFKDA 152

Query: 141 KISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ 200
           K+++LE+  S   +   S+H +E    ++     +S+   PL++ DP  R   + +    
Sbjct: 153 KMALLEW--SRGDIATVSLHTYERCPQMNTG-DLQSYV--PLLRTDPLSRLAVLTLPEDS 207

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEP 258
           + +L   Q  S L    D    G    A    S V++L D+   +K+++D +FV G+  P
Sbjct: 208 LAVLPLIQEQSEL----DPLSEGFSRDAPYSPSFVLSLSDVSTTIKNIQDLLFVPGFHSP 263

Query: 259 VMVILHERELTWAGRV-SWKHHTCM-ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
            + +L     TW+GR+ + K   C+ I    +S+    +PL+ S   LP D+  L+A PS
Sbjct: 264 TIALLFSPMHTWSGRLQTVKDTFCLEIRTFDLSSG-TSYPLLTSVSGLPSDSLYLVACPS 322

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFS--VELDAAHATWLQN 374
            +GG+++V +  I +  Q    A A  N   S  +S +   +S S  + L+ +   ++  
Sbjct: 323 ELGGIVLVTSTGIVHIDQGGRVAAACVNAWWSRITSLKCSMASVSQKLTLEGSRCVFVTP 382

Query: 375 DVALLSTKTGDLVLLTVVYDGRVVQRLD-LSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
              LL  + G +  +    +GR V  ++ L K       SD+   G+   F+GS  GDS 
Sbjct: 383 HDMLLILQNGAVHQVRFSMEGRAVGLIEVLDKGCVVPPPSDLIVTGDGAVFVGSAEGDSW 442

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNN 493
           L +                            +R+ R+     +  V+ +E  LYG  ++ 
Sbjct: 443 LAKVNV-----------------------VRQRVERAEEKKDEMEVDWDE-DLYGDINDA 478

Query: 494 T--ESAQKTF-----SFAVRDSLVNIGPLKDFSYGLRINADA--------SATGISKQSN 538
              E AQ+ F     + +  D L  +G + D  +G+  +           + +G S+ S 
Sbjct: 479 ALDEKAQEQFGPAAITLSPYDILTGVGKIMDIEFGIAASDQGLRTYPQLVAVSGGSRNST 538

Query: 539 YELV-------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEA 585
           + +              EL   +G+W +      G      +     +   A +++S E 
Sbjct: 539 FNVFRRGIPITKRRRFNELLNAEGVWFLSIDRQTGQ-----KFKDIPEAERATILLSSEG 593

Query: 586 ---RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQ 641
              R   L +     ++       + G+T++A   F R  ++ V      +LD +  + Q
Sbjct: 594 NATRVFALSSKPTPQQIGR-----LDGKTLSAAPFFQRSCILHVSPLEVVLLDNNGKIIQ 648

Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIE 701
            +         G G +   +++ SI+DP+ ++  +D S+   VGD    TV+ + P   E
Sbjct: 649 TV------CPRGDGPK---IVNASISDPFAIIRRADDSVTFFVGDTVARTVA-EAPIVSE 698

Query: 702 SSKKPVSSCTLYHD 715
                  +  ++ D
Sbjct: 699 GESPVCQAVEVFTD 712


>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
           owczarzaki ATCC 30864]
          Length = 1725

 Score =  108 bits (270), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 76/261 (29%), Positives = 135/261 (51%), Gaps = 23/261 (8%)

Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHEREL-TWAGRVSWKHHTCMISA 285
           R+  S+ I L +L   + HV D  F+ GY EP + +L E    +W GR   +  TC + A
Sbjct: 294 RLRPSYEIKLTELQRHIHHVIDIEFLTGYFEPTLALLFEPNAPSWTGRTVQRKDTCSMVA 353

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNN 344
           LSI+T+   HP++WS   LP ++ +++AVP P+ G ++V  + I + SQS+ +  ++LN 
Sbjct: 354 LSINTSSHSHPVVWSVDKLPFNSMRVMAVPRPVCGTVIVTPDAILHLSQSSPTVGVSLNE 413

Query: 345 Y-AVSLDSSQELPR------SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV 397
             ++S +    +P       SS    +      +L  +  L  T+ G++ + T++ +GR 
Sbjct: 414 LSSMSTELRLGIPENKHPDGSSVVYNMQEGRCCFLTPETLLAVTEGGEMFVATLLTEGRT 473

Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLF-FLGSRLGDSLLVQF----TCGSGTSMLSSGLKE 452
           V R+ +     SVL   +T++ N  + F+GSR  DS+L++     T  +    L+S   +
Sbjct: 474 VVRIRIEPAGASVLPCCMTSLYNGQYCFIGSRASDSVLLRVMNNATAAADKRRLASAALD 533

Query: 453 EFGDIEADAPSTKRLRRSSSD 473
           +F        + KR R S ++
Sbjct: 534 DFS-------ANKRSRSSDTN 547



 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 52/84 (61%), Gaps = 5/84 (5%)

Query: 920  ISGHQ---GFFLSGSRPCWCMV--FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS 974
            + GHQ   G F+ G RP W ++   R+ LR H  L DGS+ AF+  +N  C  GF+Y T+
Sbjct: 1218 LGGHQLCSGVFVCGRRPLWLLMSPTRKALRAHLMLTDGSVSAFSAFNNNACPGGFVYFTT 1277

Query: 975  QGILKICQLPSGSTYDNYWPVQKV 998
            QG L+ CQL   + +DN WPV++V
Sbjct: 1278 QGTLRFCQLAPTTNHDNPWPVRRV 1301



 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 66/253 (26%), Positives = 120/253 (47%), Gaps = 30/253 (11%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHS--RADYVPQIPLIQTEELDSELPSKRGIGPVPNL-- 58
           FA ++  H PT + +C     T++  R   V +  L++   +D+   S  G G    L  
Sbjct: 2   FAYFRQQHPPTAVEHCVEASFTNAAERQLVVARANLLEVYRIDAATAS--GSGWRSELSS 59

Query: 59  --VVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAA---------SLELVCHY 107
              +TA     +++ R    G  +   S +    +    + +A          LELV  +
Sbjct: 60  GSALTAQTAGAMHLGRAAGYGGNDGGRSDDAATEINTRSLHSAPATPPALQHKLELVASF 119

Query: 108 RLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEW 167
            L GNVES+ +          +RDS++LAF++AK++V+++D +   L+  S+H +E    
Sbjct: 120 NLSGNVESIGVARLAHC----KRDSLLLAFKEAKVAVVDYDPATLDLKTISLHMYED--- 172

Query: 168 LHLKRGRESFARG----PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
           + ++ GR++ A      P+++VDP  +C   LVYG ++IIL   Q     + ++D + S 
Sbjct: 173 IEMRGGRDATALQAVWPPVIRVDPMRQCAAFLVYGTKLIILPFRQESH--LDEDDDYQSA 230

Query: 224 GGFSARIESSHVI 236
              +A +  S  I
Sbjct: 231 QAPAASVPPSAQI 243



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 57/192 (29%), Positives = 96/192 (50%), Gaps = 25/192 (13%)

Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET-ADLLTEVTE 601
           EL G +G+W+V+   S   +   + +++ D   H+ L+ S +  T+V  T  + L ++ E
Sbjct: 739 ELTGGRGLWSVF---STALDPSLAALSSLDGASHSLLVASRDDSTLVFTTTGEELEQIAE 795

Query: 602 SVDYFVQGRTIAAGNLF---GRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSEN 658
           S  +F  G TIA GN+F   G+  ++ VF  G R++DG  + Q+L     +S        
Sbjct: 796 S-GFFTAGATIAIGNVFAANGKILIVDVFAHGIRLVDGVNLRQELLLAQLSSV------- 847

Query: 659 STVLSVSIADPYVLLGMSDGSIRLL--VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK 716
           S ++  SIA+  VL   +DG++  +   GD      S  T AA     +PV + +LY D+
Sbjct: 848 SEIIHASIAESSVLALHADGAVSFVQFTGDTQELVASTATVAA----GQPVVAVSLYADR 903

Query: 717 G----PEPWLRK 724
                PE  L++
Sbjct: 904 SGLFVPEAVLQR 915


>gi|76157351|gb|AAX28300.2| SJCHGC08809 protein [Schistosoma japonicum]
          Length = 225

 Score =  107 bits (268), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 58/154 (37%), Positives = 91/154 (59%), Gaps = 5/154 (3%)

Query: 242 DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
            + +V D  F+HG+ EP +++L+E   TWAGRVS +  TC I ALS +   + +P+IW  
Sbjct: 52  KINNVLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQ 111

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS-ASCALALNNYA---VSLDSSQELPR 357
            +LP D   ++ VP PIGGV+++ AN+I Y  Q+  SC+L LN YA    +    Q++P 
Sbjct: 112 ESLPFDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNCYAQISTNFPMRQDVP- 170

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
           S   + +D      L     L+ T++G+L LL++
Sbjct: 171 SCGPLSIDGCRVVTLNETQFLIGTRSGNLYLLSL 204


>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
 gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
          Length = 1398

 Score =  107 bits (266), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 176/734 (23%), Positives = 287/734 (39%), Gaps = 96/734 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  +    +  R    +    A L L   Y + G + SL
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSTTAAQPDQKGRN---ERSQHAKLVLAAEYEVPGTITSL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +    + +    D+II++  +AK+S++E+D   HG+   S+H +E  E  H+      
Sbjct: 85  QRVKISNSKSGG--DAIIVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVPD 141

Query: 177 FARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIES-- 232
               P  +  DP G C  +  +G+  + IL   Q G  LV D+      G  SA + S  
Sbjct: 142 LGSCPSSLTADPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGNDSADVVSDP 200

Query: 233 ----------------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
                           S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201 QKSAPENTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
             +      S  ++    K    + +   LP D +K++ +P P+GG L++G N  +H   
Sbjct: 261 IDRKDIVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
              + A+ +N +A    +     +S   + L+      L +     LL    G + +L+ 
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIIEQLGSGTGDILLILADGRMSILSF 380

Query: 392 VYDGRVVQRLDL----SKTNPSVLTSDIT---TIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
             DGR V  + L     ++N S+  +  T   ++G +  F GS  GDS+L+ ++  S T 
Sbjct: 381 KVDGRSVSGISLHFVAEQSNGSITIARPTCSASLGRNKLFCGSEEGDSILLGWSRPSSTI 440

Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES------AQ 498
              S  K   G  E  A           D   D +  ++L     AS   E       + 
Sbjct: 441 KRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSANLASTRQEKQVVNGDSP 498

Query: 499 KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI-----WTV 553
             F F   D L ++GP +D + G    + +     S       +EL   +G        V
Sbjct: 499 ADFIFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPEIAAPLELVAARGFGKSGGLAV 558

Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-------------MVLETADLLTEVT 600
             +    +  DS +M   DD Y  + I  ++ ++             ++ +T D   +  
Sbjct: 559 LKREIDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAKTKD--DDKE 613

Query: 601 ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
           ESV Y V                +  TI  G L    RV+QV     R  D  Y      
Sbjct: 614 ESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLAAGTRVVQVLRTEIRSYD--YNLGLAQ 671

Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAIES 702
             P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ  AA   
Sbjct: 672 IYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA--- 726

Query: 703 SKKPVSSCTLYHDK 716
           S K +S C LY DK
Sbjct: 727 SGKWISGC-LYEDK 739



 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 47/91 (51%), Gaps = 2/91 (2%)

Query: 911  CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
            C+R+    ++ G++  F+SG  PC+ ++     R H     G  V + +  H   C  GF
Sbjct: 882  CKRLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 940

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKVVF 1000
             YV    ++++ +LPS + +D+ W  +K+ F
Sbjct: 941  AYVDEDNVIRMSRLPSNTRFDSGWATRKIAF 971


>gi|344229600|gb|EGV61485.1| hypothetical protein CANTEDRAFT_109087 [Candida tenuis ATCC 10573]
          Length = 1300

 Score =  106 bits (264), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 104/458 (22%), Positives = 200/458 (43%), Gaps = 45/458 (9%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  Y+L G + S+  +       + + D +++A + AKIS++ +D + H +R  S+H
Sbjct: 51  LNLVDQYKLFGTITSIKPIR---TIENPKLDYLLVATQLAKISLVRWDHASHSIRTVSLH 107

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            +E+   +      +      L+ V+P+  C  V    L   +             ++  
Sbjct: 108 YYEN---VIQTSTFDKLNSAELI-VEPKNACLCVRYKNLLTFLPFTRLKTEEDEYADEED 163

Query: 221 GS-GGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
           G+    +    +SS +IN ++LD +   + D  F+H Y +P + +L  ++  WAG + +K
Sbjct: 164 GAVTNSYDGIYDSSFLINGQNLDSRIGTIVDADFLHNYRQPTVALLSSKDQVWAGNLFFK 223

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
                   LS+    K+   +    +LP+D  +L+++PSP+ G L+VGAN  IH  +   
Sbjct: 224 KDNISYIVLSLDLNTKKSTTVLKIDDLPYDIDRLISLPSPLNGSLLVGANQLIHIDNGGI 283

Query: 337 SCALALNNYA--VSLDSSQELPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVY 393
           +  +++N +    + +S   +  S  ++ L+      L N+  +L    TG+  +LT   
Sbjct: 284 TRKISVNPFTDLTTKNSKNYINYSHMNLRLENCSVVPLPNENKVLVILSTGEFYMLTFEI 343

Query: 394 DGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
           DG+ ++RL       S               + N+L F+G++ G+S L+Q+         
Sbjct: 344 DGKTIKRLTFEVVETSRYNGINVTFPGQFAALDNNLLFVGNKNGNSPLIQYKY------- 396

Query: 447 SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
             G KE+   ++ DA   +           D    +  S            ++   F + 
Sbjct: 397 -EGAKEK--AVKEDAKDEEDNDGDEELYEDDEEKVKSFS------------KEKLDFTLC 441

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
           D L+N GP+  F++G   N    +  I+   NY+ V +
Sbjct: 442 DELINHGPISAFTFGFYSNEKFKSNLIN--PNYQEVSI 477


>gi|302652141|ref|XP_003017930.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
 gi|291181516|gb|EFE37285.1| hypothetical protein TRV_08062 [Trichophyton verrucosum HKI 0517]
          Length = 844

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 183/741 (24%), Positives = 288/741 (38%), Gaps = 110/741 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS       +  R    D    A L L   Y + G +  L
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSTTGTQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE-----WLHL 170
             +      NS+   D+I+++  DAK+S++E+D   HG+   S+H +E  E     W+  
Sbjct: 85  QRVR---ISNSKSGGDAILVSSRDAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVP- 140

Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLV---------GDEDT- 219
                S + G  + VDP G C  +  +G+  + IL   Q G  LV         GD+ T 
Sbjct: 141 --DLGSCSSG--LTVDPNGNC-AIFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTD 195

Query: 220 FGSGGGFSARIESSH--------VINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELT 269
             S    SA   +SH        V+ +  LD  + H     F+H Y EP   IL+ +   
Sbjct: 196 MVSDAQKSAPGNTSHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVAR 255

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT- 328
                  +      S  ++    K    + +   LP D +K++ +P P+GG L++G N  
Sbjct: 256 STSLTIDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNEL 315

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL 386
           +H      + A+ +N +A    +     +S   + L+      L +     LL    G +
Sbjct: 316 VHVDQAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRM 375

Query: 387 VLLTVVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
            +L+   DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+
Sbjct: 376 SILSFKVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLL 431

Query: 436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTE 495
            ++  S T+   S  K   G  E  A           D   D +  ++L     AS   E
Sbjct: 432 GWSRPSSTTKRPS--KSVDGVDENGAADLSDEADQDDDGDDDDMYEDDLYSVNPASTRQE 489

Query: 496 S------AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
                  +   F+F   D L ++GP +D + G    + +     S       +EL   +G
Sbjct: 490 KQVVNGDSPADFTFRAYDRLWSLGPYRDITLGKPSKSKSKDQQDSVPEIAAPLELVAARG 549

Query: 550 I-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART-----------MVLETA 593
                  TV  +    +  DS +M   DD Y  + I  ++ ++            +L   
Sbjct: 550 FGKSGGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKSKDTGLSRSYDKYLLLAK 606

Query: 594 DLLTEVTESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGS 637
               +  ESV Y V                +  TI  G L    RV+QV     R  D  
Sbjct: 607 SKGEDKEESVVYSVGSSGLDSIDAPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD-- 664

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQ 695
           Y        P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ
Sbjct: 665 YNLGLAQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQ 722

Query: 696 TPAAIESSKKPVSSCTLYHDK 716
             AA   S K +S C LY DK
Sbjct: 723 GSAA---SGKWISGC-LYEDK 739


>gi|328848896|gb|EGF98089.1| hypothetical protein MELLADRAFT_96156 [Melampsora larici-populina
           98AG31]
          Length = 1427

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 193/933 (20%), Positives = 350/933 (37%), Gaps = 171/933 (18%)

Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
           V  ++LHG V  L  ++     +    D ++++F+DAKI++LE+      L   S+H FE
Sbjct: 73  VLEHQLHGIVTGLQPITTIDT-HVDGLDRLLVSFKDAKITLLEWSHQQSDLVPISLHTFE 131

Query: 164 SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
               +        F +   ++ DPQ RC  + +    + +L   Q  +    D +T  S 
Sbjct: 132 KLPQITQGDFPTIFDQ---LETDPQSRCAILKLPQSTIAVLPFFQENN---LDLETLFSN 185

Query: 224 GGFSA---RIES-----SHVINLRD------------------LDMKHVKDFIFVHGYIE 257
              SA   RI+S     S +I+L                      +K +  F F+ G+ +
Sbjct: 186 SNPSANNQRIQSFPYAPSFIIDLNQSQSFKSQTQTHSQTQTQQKSIKSIISFKFLPGFSQ 245

Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
           P + IL+  + TWAGR+     +C +  +++  +     +I+   NLP+ A+ ++A P  
Sbjct: 246 PTLAILYTYQHTWAGRLENTTDSCSLIFITLDLSSNHFTIIFQIDNLPYHAHSIMACPKE 305

Query: 318 IGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP---------------RSSFSV 362
           +GGVLV+ A+ I +  QS+       N    L +  ++P                    V
Sbjct: 306 VGGVLVICADMILHIDQSSKLIGIATNGWSKLSTHLDVPTQQMVKIVTEDGQDQEERLKV 365

Query: 363 ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-TNPSVLTSDITTIGNS 421
            L+ +   ++  D AL+    G +  L +  DGR + +L L K    SV+ S    I + 
Sbjct: 366 RLENSKLVFVTIDRALMFLTDGQIFRLCLYQDGRTLIKLCLEKFPVVSVIPSVAVKISDH 425

Query: 422 LFFLGSRLGDSLLVQFTC------------------------GSGTSMLSSGLKEEFGDI 457
             F+GS LGDS+++                            G+   +  +   E +G  
Sbjct: 426 SVFVGSMLGDSIVMGIEFEGEKEVEVVEEVEVEVEAEVVHQNGNEMEIDQAEEDEIYGKE 485

Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKD 517
           E D   TK       D +  ++           + N +  ++  S  + DS+   GP++D
Sbjct: 486 EPDDKKTK-----DQDGIDSIIK----------ATNKKIHREIRSLRLHDSISGHGPIRD 530

Query: 518 FSYGLRINADASATGISKQSNYE-LVELPGCKGI-----WTVYHKS---SRGHNADSSRM 568
           F+             +SK   +E  +E+ GC G       T+++K     +    DS+  
Sbjct: 531 FT-------------MSKIGGFEDSLEMVGCTGSGETGGLTIFYKEMPLMKRKKLDSTNE 577

Query: 569 A---------AYDDEYHA----YLIISLEARTMVLETADLLTEVTESVDY----FVQGRT 611
           +         A++D   +       IS+  RT +        E   + D      +   T
Sbjct: 578 SMKITNLNSIAFNDPTGSPGCELAWISIHDRTKIFSMIKNPEEGNRTSDLKFMKTLNAST 637

Query: 612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYV 671
           I     F +   +Q+     ++L      +     P  +E    ++ + ++   +   Y+
Sbjct: 638 IYVAMFFDQTCFLQITSYEIKLLKVVGFGEVQVIRPIETE----NKKNKIIRAKVVQDYI 693

Query: 672 LLGMSDGSIRLLVGDPSTCTVS-VQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW 730
           LL  SD  + L  G   + T+  +Q P       KPV+  +L+    P  +     T+  
Sbjct: 694 LLETSDHRVMLYKGQVDSLTIDRIQLPQL----SKPVTYASLFSAHLP-LYDHDDQTN-- 746

Query: 731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF-----VSGRT 785
              G+G   D     P      +  V    G L I  +P    VFTV        +    
Sbjct: 747 ---GIGLDNDEDAEKP------WLFVTDLGGVLHILSLPELEIVFTVKGIENLPDLLDED 797

Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
              +   + A++    + +   EE      KEN     +         A  +RP L+  L
Sbjct: 798 EDEEQQQQPAIEYEHEDGDVKMEEDEKVEPKENSSIQMIYGFVT---GAKVARPHLYVEL 854

Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRF-SRTPLDAYTREE 904
            +G +  YQ  +        K  DP ++       ++  +++   +F S  P+    R  
Sbjct: 855 NNGALAVYQISI----AYDRKPGDPSTSKPRRQALSIRLNKVLGYQFESSEPISNLDR-- 908

Query: 905 TPHGAPCQRITIFKNISGHQGFFLSGSRPCWCM 937
                   ++ + K  +   G  LSG  P W +
Sbjct: 909 --------KVKVVKKNATFSGIHLSGLEPIWIV 933


>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
 gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
          Length = 1370

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 181/736 (24%), Positives = 287/736 (38%), Gaps = 100/736 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  +    +  R    D    A L L   Y + G +  L
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSTTATQPDQKGRH---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
             +      NS+   D+I+++  +AK+S++E+D   HG+   S+H +E  E        +
Sbjct: 85  QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEESHMSPWVPD 141

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------DT 219
             +    + VDP G C  +  +G+  + IL   Q G  LV D+               D 
Sbjct: 142 LGSCSSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDLVSDA 200

Query: 220 FGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
             S  G +A  +    S V+ +  LD  + H     F+H Y EP   IL+ +        
Sbjct: 201 QKSAPGNTAHDKPYAPSFVLPMTALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSLT 260

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHS 333
             +      S  ++    K    + +   LP D +K++ +P P+GG L++G N  +H   
Sbjct: 261 IDRKDVVSYSIFTLDLQQKASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVDQ 320

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTV 391
              + A+ +N +A    +      S   + L+      L +     LL    G + +L+ 
Sbjct: 321 AGKTNAVGVNEFARQASAFSMADHSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILSF 380

Query: 392 VYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCG 440
             DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+ ++  
Sbjct: 381 KVDGRSVSGISLHFVAEQSGGSITKARPSCSAS----LGRNKLFYGSEEGDSVLLGWSRP 436

Query: 441 SGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES---- 496
           S T+   S  K   G  E  A           D   D +  ++L     AS   E     
Sbjct: 437 SSTTKRPS--KAADGVDENGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASTRQEKQVVN 494

Query: 497 --AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI---- 550
             +   F+F   D L ++GP +D + G    + +     S       +EL   +G     
Sbjct: 495 GDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQQDSVPEIAAPLELVAARGFGKSG 554

Query: 551 -WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR---TMVLETAD---LLTEVT--- 600
             TV  +    +  DS +M   DD Y  + I  L+ +   T +  + D   LL +     
Sbjct: 555 GLTVLKREVDPYTIDSLKM---DDVYGVWSIRVLDPKSKDTGLSRSYDKYLLLAKAKGED 611

Query: 601 --ESVDYFV----------------QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
             ESV Y V                +  TI  G L    RV+QV     R  D  Y    
Sbjct: 612 KEESVVYSVGSSGLDSIDTPEFNPNEDCTIDIGTLATGTRVVQVLRTEIRSYD--YNLGL 669

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPS--TCTVSVQTPAAI 700
               P   E    SE  TV+  S A+PY+L    D S+ +L  D +     V VQ  AA 
Sbjct: 670 AQIYPVWDE--DTSEERTVIQASFAEPYLLTIRDDHSLLILQTDKNGDLDEVEVQGSAA- 726

Query: 701 ESSKKPVSSCTLYHDK 716
             S K +S C LY DK
Sbjct: 727 --SGKWISGC-LYEDK 739


>gi|406699110|gb|EKD02327.1| cleavage and polyadenylation specific protein [Trichosporon asahii
           var. asahii CBS 8904]
          Length = 1339

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 194/814 (23%), Positives = 316/814 (38%), Gaps = 160/814 (19%)

Query: 181 PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD 240
           P+++ DP  R   + +    + +L   Q  S L   +D+  S                  
Sbjct: 157 PMLRSDPLSRLAILTLPEDALAVLPIVQEQSELDAMQDSVSSP----------------- 199

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK-QHPLIW 299
            ++K++KDF+F+ G+  P + +L     TWAGR      T  +   +I T+    +PLI 
Sbjct: 200 -EIKNIKDFLFLPGFHSPTIALLFAPMNTWAGRYKSVKDTFRLEIRTIDTSAGGTYPLIT 258

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALN---NYAVSL--DSSQ 353
           S   LP D+  L+A PS +GGV+VV A+ I +  QS    + ++N   NY  ++  DSS 
Sbjct: 259 SVTGLPSDSQYLVACPSEVGGVVVVTASGIIHIDQSGRLVSTSVNGWWNYTTNMKSDSSY 318

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT- 412
           E    S  + LD +HA ++  +  LL  +TG++  +    DGR V  + + + + +V   
Sbjct: 319 E----SQKLALDNSHAQFVTENDMLLVLETGEVHQIRFEMDGRAVGAIKVDEQSSTVPPP 374

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           S +   G+   F+GS  GDSLL         S      +EE        P TK+      
Sbjct: 375 STLVPAGSDGIFVGSVEGDSLLAMVEKARDQSA-----QEE--------PETKQQEMDVD 421

Query: 473 DALQDMVNGE---ELSLYGSASNNTESAQKTFSFAVRD-------SLVNIGPLKDFSYGL 522
           D  +++  G     +      S     A   F  AV D        LV IG     S G 
Sbjct: 422 DWDEEVATGPVTVSVKAQDVLSGIGRIADMEFGIAVTDLGTRTYPQLVCIG---GGSQGS 478

Query: 523 RINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
            +N       I+K+  +E  +L      W +  + +   NA   +     ++    +  +
Sbjct: 479 TMNVFRRGIPITKRRLFE--QLRTAVATWFLPVERA---NAPKFKDIPESEQSTIAIAAT 533

Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQD 642
            E  T +   +    +V E +  F +   IA G    R R++ V      +LD       
Sbjct: 534 QEGSTQIFALS--TRKVQERIAEFPEP-AIATGTWLRRTRIVLVLPSQVLLLD------- 583

Query: 643 LSFGPSNSES-GSGSENS---TVLSVSIADPYVLLGMSDGSIRLLVGD-----------P 687
                SN+   G+  E S    +++ SIADPYVL+  +DGS+ + VGD           P
Sbjct: 584 -----SNANPVGTICEMSDAPPIVAASIADPYVLIRRADGSVSVFVGDTVEGKWSEAPMP 638

Query: 688 STCTVSVQTPAAI------------------ESSKKPVSS----CTLYHDKGPEPWLRKT 725
               + V   A +                  E   KPV +        H  G E   R  
Sbjct: 639 EGLALPVCQAAEVFTDTTGIYRTFEATQGVKEEPVKPVPTKQGQKAKIHLTG-EQLKRLQ 697

Query: 726 STDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRT 785
            +   +S  V       +     +G  +  +  +SG L+I  +P+F+ V   +       
Sbjct: 698 DSKPAISADVATTESAFNAA---RGTQWIALLAQSGELQIRSLPDFDLVLQSNG------ 748

Query: 786 HIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL 845
            + D+    +  D +T      EEG      + +  M    +  +       RP +  + 
Sbjct: 749 -VYDS--EPSFTDDQTGELPELEEG------DEVSQMLFCPIGTRTL-----RPHVIVLH 794

Query: 846 TDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET 905
             G +  Y+A     P  T  + D   + RSL+V      R R +    T L + T   T
Sbjct: 795 RSGRLNIYEAQ----PRFTVDARD--QSRRSLAV------RFRKV---HTQLLSVTPSST 839

Query: 906 --PHGAPCQRITIFKNISGHQGFFLSGSRPCWCM 937
             P   P      F +I G  G F++G RP W +
Sbjct: 840 VKPAAIP------FTDIEGLTGAFITGERPHWII 867


>gi|320583269|gb|EFW97484.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
           [Ogataea parapolymorpha DL-1]
          Length = 1309

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 107/464 (23%), Positives = 212/464 (45%), Gaps = 46/464 (9%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L+L+  YRL+G + ++    +  ++ +   D +I++ + AK+SV+++D  +H +   S+H
Sbjct: 51  LQLIGEYRLNGQIINI---DKFRSNENESLDYLIVSTKLAKLSVIKWDSQLHAISTVSLH 107

Query: 161 CFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMII--LKASQGGSGLVGDE 217
            +++  + L +++  ++  +    + DP   C  + +  L   +   K       L  D 
Sbjct: 108 YYDTALDALTVEKLEKTSVQH---RTDPNSLCTCLRLNELFTFLPFYKEYLDEEELKDDA 164

Query: 218 DTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERE-LTWAGRV 274
           +              S ++N   L  D+K++ D+ F+H Y +P M IL+  E +TWAG +
Sbjct: 165 EEAKDIKKRKKLFTESFILNASSLYPDIKNIVDYQFLHSYRDPTMAILYAPETMTWAGHL 224

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
                T  +  LS+    K+   I    NLP+D   +  + SP  G L+VG+N  IH +S
Sbjct: 225 PKAKDTLKVIVLSLDLENKKASAIMELTNLPYDVDYIYPLESPTNGFLLVGSNEIIHVNS 284

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY 393
             +   +  N Y   + + +   +SS  + L+ +    ++ D  L+ T++G+   L    
Sbjct: 285 LGSVRGIYTNEYFTDISNLKLKDQSSLGLMLENSRVGLVKEDQVLIITESGEFYQLNFEK 344

Query: 394 DG-----RVVQRLDLSK-----TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
            G       +Q+++ S       N  ++ + + ++   LFF+  + GDS L++ +  SG 
Sbjct: 345 IGGNSTITGLQKVETSNYKGIIVNHPIMITSVPSL--DLFFVCCQGGDSSLIRISSKSG- 401

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
            +L    KE+ GD +        L            + E+   + S+  N++       F
Sbjct: 402 -VLPQETKEQNGDTKETKDDDDWL-----------YDEEDQKSHKSSLVNSQ-------F 442

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
              D+++N GPL DF+ G R++ +    G+   +  E V +  C
Sbjct: 443 KKMDNILNCGPLVDFTLG-RVSIEQKIMGLPNPNYNEDVLVAAC 485


>gi|390358537|ref|XP_001201130.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Strongylocentrotus purpuratus]
          Length = 283

 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 79/273 (28%), Positives = 126/273 (46%), Gaps = 51/273 (18%)

Query: 3   FAAYKMMHWPTGIANC-GSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVT 61
           +A Y+ +H PTG+ +C    F +                      P ++      NLVV 
Sbjct: 2   YAFYREIHPPTGVEHCVYCHFFS----------------------PDQQ------NLVVA 33

Query: 62  AANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQ 121
             + + +Y +   +      K+S    +           LE    + + G V S+    Q
Sbjct: 34  KGSELTVYSMITVDSNKPTDKDSKPKNK-----------LEEAATFHIFGKVMSM----Q 78

Query: 122 GGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGP 181
                   RD+++L+F +AK+S++E+D ++H L+  SMH FE  E    K G       P
Sbjct: 79  SAQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEEDET---KEGVYRNIFHP 135

Query: 182 LVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL 241
           +VKVDP  RC  +L YG ++++L   +   GLV D D   S       +  S+VI L ++
Sbjct: 136 VVKVDPDHRCAIMLTYGSKLVVLPFRR--DGLVEDLDKSMSASTRRGALMPSYVIRLNEM 193

Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
           D  + +V D  F+HGY EP ++IL+E   TWAG
Sbjct: 194 DDPICNVLDIQFLHGYYEPTLLILYEPLRTWAG 226


>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 1348

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 161/745 (21%), Positives = 297/745 (39%), Gaps = 116/745 (15%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  + +  +  R    D    A L LV  Y++ G +  L
Sbjct: 28  NLIVAKTSLLQVFSLVNVTYGSSLANHPDQKSRH---DRSQHAKLVLVAEYQVSGTITGL 84

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +    + +    D+I+++  +AK+S++E+D   HG+   S+H +E  E  H+      
Sbjct: 85  ERVKISNSKSGG--DAILVSSRNAKLSLIEWDPRNHGISTISIHYYEGEES-HMSPWVPD 141

Query: 177 FAR-GPLVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE----------------- 217
                  + VDP G C  +  +G+  + IL   Q G  LV D+                 
Sbjct: 142 LGSCASNLTVDPNGNCA-IFNFGIHSLAILPFHQTGDDLVMDDYDSVLNGDSAADTINDT 200

Query: 218 --DTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
              T G     S   E S V+ L  LD  + H     F+H Y EP   IL+ +       
Sbjct: 201 QKPTAGDSTVHSKPYEPSFVLPLAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSL 260

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
              +      +  ++    +    + +   LP D +K++++P P+GG L++G N  +H  
Sbjct: 261 SIDRKDVVSYAIFTLDLQQRASTSLLTVSRLPSDMFKVVSLPPPVGGALLIGTNELVHVD 320

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +   + +S   + L+      L +D    LL    G + +LT
Sbjct: 321 QAGKTNAVGVNEFARQASAFSMVDQSDLEMRLEDCVVEQLGSDAGEVLLILTDGRMAILT 380

Query: 391 VVYDGRVVQRLDL----SKTNPSVLTSDITT---IGNSLFFLGSRLGDSLLVQFTCGSGT 443
              DGR V  + L     ++  S++ +  +    +G S  F GS  GDS+L+ ++  S  
Sbjct: 381 FKVDGRSVSGISLHYVAEQSGGSIIKARPSCSAGLGRSKLFCGSEEGDSILLGWSKPSSN 440

Query: 444 SMLSSGLKE---EFGDIEADAPSTKRLRR-----------SSSDALQD--MVNGEELSLY 487
           +   +   E   E G  E      +               + +  LQ+  +VNG++ +  
Sbjct: 441 TKKPTKANEDTNEDGTTEFSGEDEQDDDDDDIYEDDLYSANPAPTLQEKRVVNGDDTA-- 498

Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------LRINAD-----------ASA 530
                        F F + D L ++GP +D + G      L+   D            +A
Sbjct: 499 ------------DFVFKIHDRLWSLGPFRDITLGRPPKSKLKDKRDNVPSISASLELVAA 546

Query: 531 TGISKQSNYEL------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAY 578
            G  K     +            +++    G+W++     +   A ++       +Y  Y
Sbjct: 547 RGFGKSGGLAVLKREIDPFTIDSLKMDNVYGVWSIRVTDPKSKEASAT---GNSRDYDKY 603

Query: 579 LII-----SLEARTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGAR 632
           L++     S +  ++V    +   +  ++ ++   +  TI  G L    RV+QV     R
Sbjct: 604 LLLAKAKCSDKEESVVYSVGNSGLDSIDAPEFNPNEDCTIDIGTLAAGSRVVQVLRTEIR 663

Query: 633 ILDGSY-MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT 691
             D +  +TQ       ++     SE  TV+  S A+PY+L    D S+ +L  D +   
Sbjct: 664 SYDYNLGLTQIYPVWDEDT-----SEERTVVQASFAEPYLLAIRDDHSLLVLQADKTGDL 718

Query: 692 VSVQTPAAIESSKKPVSSCTLYHDK 716
             V+    + +S   VS C LY D+
Sbjct: 719 DEVEI-QGLATSADWVSGC-LYEDR 741



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 24/98 (24%), Positives = 47/98 (47%), Gaps = 6/98 (6%)

Query: 904 ETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHP---QLCDGSIVAFTVL 960
           E  H  P + +    +I G++  F+ G  PC+ +   +     P   +L   ++ + +  
Sbjct: 820 EGKHPFPRKPLRALSDICGYKTVFMPGQNPCFIL---KSAITQPHVLRLRGKAVQSLSGF 876

Query: 961 HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           H   C  GF YV    I+++ +LPS + +D+ W  +K+
Sbjct: 877 HIAACERGFAYVDEDNIIRMSRLPSNTRFDSTWATRKI 914


>gi|348679451|gb|EGZ19267.1| hypothetical protein PHYSODRAFT_492468 [Phytophthora sojae]
          Length = 736

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 111/427 (25%), Positives = 169/427 (39%), Gaps = 102/427 (23%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR+L++   V D  F+ GY+EP +++LHE   + +  GR++    T  I+ +SI+  
Sbjct: 261 LLRLRELEITGKVIDLAFLDGYLEPTLMVLHEENEKNSTCGRLAAGFDTYCITVISINMN 320

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
            + HP IW+  NLP D +KL+   +P+GGV+V+ AN   Y +Q+    LA N     L  
Sbjct: 321 TRLHPKIWTVKNLPSDCFKLIPCRAPLGGVVVLSANAFLYFNQTQFHGLATN----VLRE 376

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL-DLSKTNPSV 410
             +   +  ++ L      +L     LL+   GD  +L++ Y+ R V+    + KT    
Sbjct: 377 QDDHEMAQLNIVLYDCQFEYLHEKEVLLTMPNGDAYVLSLPYEDRSVRFWRSIKKT---- 432

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF------TCGSGTSMLSSG----LKEEFGDIEAD 460
                        F+GSR GDS+L         + G   S L       +KEE    E  
Sbjct: 433 ------------LFVGSRSGDSVLYALDQKKLTSAGGEASKLQEDEEMLIKEEVVKEEVT 480

Query: 461 -----------------------APSTKRLRRSSSDALQDMVNGEELSLYGSASN----N 493
                                  AP+ +    S S    + VNG   S      N     
Sbjct: 481 AEVKAEPAEEEEEDEDDLFLCGAAPTKEEPTTSGS---TEAVNGTNGSAVKKEENGHAVE 537

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV----------- 542
            ES    +     D L +IG +      +  NAD      S +   ELV           
Sbjct: 538 EESGPYDYVLHQIDVLPSIGQITSIELSIENNAD------SNEKREELVISGGYEHSGAI 591

Query: 543 ---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
                          EL GC+ +WTV         +   R       Y+AYLI+S+  RT
Sbjct: 592 SVLHNGLRPIVGTEAELNGCRAMWTVSSSLPSATKSSDGR------SYNAYLILSVAHRT 645

Query: 588 MVLETAD 594
           MVL T +
Sbjct: 646 MVLRTGE 652


>gi|346971831|gb|EGY15283.1| cft-1 [Verticillium dahliae VdLs.17]
          Length = 1445

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 226/1051 (21%), Positives = 399/1051 (37%), Gaps = 207/1051 (19%)

Query: 57  NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
           NL+V+  ++++I+ V+         +  +K + N+GET  R + D  G+ +A        
Sbjct: 28  NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSTSNAGETYDRRINDDDGLESAFLGGDGML 87

Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
                     L LV  Y +HG +  LA +      +SR   +++++    A++S+L++D 
Sbjct: 88  MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPL------VKVDPQGRCGGVLVYGLQMII 203
             HG+   S+H +E  EW      + S   GPL      ++ DPQ RC   L +GL+ I 
Sbjct: 145 EKHGVEDISIHFYEKEEW------QGSPMDGPLRQHATILQADPQSRCAA-LKFGLRKIA 197

Query: 204 L-------------KASQGGSGLVGDEDTFGSGGGFSARIES----------SHVINLRD 240
                            +   G    E+   +     +   S          S V+ L  
Sbjct: 198 FLPFRQIDGDIDMDDWDEEVDGPRPQEEPPAAAAVHGSSSNSSSLAPVPYTPSFVLALPQ 257

Query: 241 LD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
           LD  + H   F F+H Y EP + I+              H T  +  + +        L 
Sbjct: 258 LDPEILHPVHFAFLHEYREPTLGIISSTNRRLKMEPQKDHFTFKVFTVDL--------LQ 309

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPR 357
            +++N      K++A+P P+GG L++G N  IH      +  +A+N YA  +       +
Sbjct: 310 KASLN------KVIALPKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKMTKFPLADQ 363

Query: 358 SSFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDL----SKTNPSVL 411
           S   + L+      +  +N   LL T+ G++ ++T   DGR V  + +    ++    VL
Sbjct: 364 SELKLRLEHCEVELMSPENGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVATENGGDVL 423

Query: 412 ---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
               + +T +  +  F G+  GDS ++    G     + +  K+     E+         
Sbjct: 424 PFRAACLTKVTKNSMFYGTIGGDSKVI----GWSRQHVQTARKKARLLDESLDYDLDDDE 479

Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG------- 521
               D   D + GE       ++    +      F V DSL+++ P+ D +YG       
Sbjct: 480 ADDDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMTYGKTAFFPG 539

Query: 522 ---------LRINAD-ASATGISKQSNYELV------------ELPGCKGIWT--VYHKS 557
                    +R   D   A G  +  +  L+            + P  +G WT  V    
Sbjct: 540 SEDAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFDFPEARGFWTTRVQKTI 599

Query: 558 SRGHNADS-SRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTESVDYF----- 606
           ++    D  + +A  +D     +Y  ++I++ +      ET+D+        +       
Sbjct: 600 AKSLQGDKGANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGAGFEALSGTEF 658

Query: 607 --VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENSTVLS 663
               G TI AG +    R+IQV     R  DG   ++Q L     + E+G+      V+S
Sbjct: 659 DPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETGA---EPRVIS 713

Query: 664 VSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDK----GPE 719
            SI DPY+LL   D SI +           +        S K +S C LY D      P 
Sbjct: 714 ASIVDPYLLLLREDSSILVAQITNHNELEELDKEDETIVSTKWLSGC-LYKDSRGLFAPV 772

Query: 720 PWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK 779
              + TST   +   +  AI          G+++  +C       ++ +PN +    V  
Sbjct: 773 QTDKGTSTSESVFLFLLNAI----------GELHVRIC-------VYALPNLSKSIYV-- 813

Query: 780 FVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRP 839
             +G ++I           S    + ++  GT     E +  + V +L     ++ H   
Sbjct: 814 -AAGLSYI----------PSLLSADYTARRGTS---PETLTEILVADLGDSTSASAH--- 856

Query: 840 FLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDA 899
            L     +  +  Y+ +   G E   K D     + SL    VS S L     +++P++A
Sbjct: 857 -LILRHANDDMTIYEPFRIGGQEE--KED----LANSLFFKKVSNSHL-----AKSPVEA 904

Query: 900 YTREETPHGAPCQRITIFK---NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVA 956
              E         R+   +   NI G+   FL G+ P + +   +       L    +  
Sbjct: 905 AEDEAVQE----NRVIPLRACDNIGGYSTVFLPGASPSFILKSSKSTPKVIGLQGLGVNG 960

Query: 957 FTVLHNVNCNHGFIYVTSQGILKICQLPSGS 987
            +  H   C  GFIY  S+G  ++ Q P  +
Sbjct: 961 MSSFHTEGCERGFIYADSKGCARVTQFPDAA 991


>gi|159470705|ref|XP_001693497.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283000|gb|EDP08751.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 461

 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 57/164 (34%), Positives = 97/164 (59%), Gaps = 7/164 (4%)

Query: 230 IESSHVINL-RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
           + +S+V+NL + + ++ V+D +F+HGY EPV+++LHE + TWAGR+  +  TC ++A+S+
Sbjct: 128 VGNSYVLNLHKMMGIREVRDCVFLHGYTEPVLLLLHEPDPTWAGRLRERKDTCCLTAISV 187

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAV 347
           S  LK+H ++W A  LP+D Y+LL +P      LV+  + +   SQ++   A ALN+ A+
Sbjct: 188 SLRLKRHTVLWRAAGLPYDCYRLLPLPQ-RPAALVLSPSLVMLTSQASQPQAAALNSTAL 246

Query: 348 SLDSSQEL----PRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
             ++   L     R + SV      A +  ND A    ++  LV
Sbjct: 247 PGEAPPPLVFDPAREAPSVTAARMAAEFALNDCAPALGRSAALV 290


>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus flavus NRRL3357]
 gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
           [Aspergillus flavus NRRL3357]
          Length = 1204

 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 92/355 (25%), Positives = 159/355 (44%), Gaps = 40/355 (11%)

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
           ++I+LAF +AK++++E+D   +G+   S+H +E  +        +  + G ++ VDP  R
Sbjct: 88  EAILLAFRNAKLALIEWDPGRYGICTISIHYYERDDSTSSPWVPDLSSCGSILSVDPSSR 147

Query: 191 CGGVLVYGLQ-MIILKASQGGSGLVGDE------DTFGSGG--------------GFSAR 229
           C  V  +G++ + IL   Q G  LV D+      +  GS G                 A 
Sbjct: 148 CA-VFNFGIRNLAILPFHQPGDDLVMDDYGELDDERLGSHGLESGTDCDMTKESIAHRAP 206

Query: 230 IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALS 287
             SS V+ L  LD  + H     F++ Y EP   IL+ +  T    +  +      +  +
Sbjct: 207 YSSSFVLPLAALDPSILHPISLAFLYEYREPTFGILYSQVATSNALLHERKDVVFYTVFT 266

Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA 346
           +    +    + S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++
Sbjct: 267 LDLEQRASTTLLSVSRLPSDLFKVVALPPPVGGALLIGSNELVHVDQAGKTNAVGVNEFS 326

Query: 347 VSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVVQRLDLS 404
             + S     +S  ++ L+      L   N   LL   TG++VL+    DGR V  + + 
Sbjct: 327 RQVSSFSMTDQSDLALRLEGCIVERLSETNGDLLLVPTTGEIVLVKFRLDGRSVSGISVH 386

Query: 405 KTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKE 452
              P           S    +G+   FLGS   DS+L+      G S+ SSG K+
Sbjct: 387 PIPPHAGGDIVKSAASSSAFLGDKRVFLGSEDADSILL------GWSVPSSGTKK 435



 Score = 47.0 bits (110), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 42/101 (41%), Gaps = 3/101 (2%)

Query: 898 DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAF 957
           D  + EE     P   + I  NISG    F  G  P + +           L  G   + 
Sbjct: 678 DQSSTEEVIKSVP---LRIVSNISGFSAIFRPGVSPGFIVRTSTSSPHFLGLKGGYAQSL 734

Query: 958 TVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           +      C  GFI + S+G++ +CQ+P G   D  W +Q++
Sbjct: 735 SKFQTSECGEGFILLDSKGVIHVCQMPLGVQLDYPWTIQQI 775


>gi|150951283|ref|XP_001387581.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
           polyadenylation factor II complex, subunit CFT1 (CPSF
           subunit) RNA processing and modification
           [Scheffersomyces stipitis CBS 6054]
 gi|149388465|gb|EAZ63558.2| pre-mRNA 3'-end processing factor CF II mRNA cleavage and
           polyadenylation factor II complex, subunit CFT1 (CPSF
           subunit) RNA processing and modification
           [Scheffersomyces stipitis CBS 6054]
          Length = 1341

 Score = 97.8 bits (242), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 98/415 (23%), Positives = 183/415 (44%), Gaps = 55/415 (13%)

Query: 55  VPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVE 114
           V +LVV  A +++I+ V VQ + S  SK                  L+L+  ++LHG + 
Sbjct: 26  VKHLVVGKATLLQIFEV-VQLKSSTPSK--------------PQHRLKLIDQFKLHGLIT 70

Query: 115 SLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR 174
            +  +    + N    D ++++ + AK SV+++D  +H +   S+H +E+          
Sbjct: 71  DIKPIRTVESPNF---DYLLVSTKSAKFSVIKWDHHLHTISTVSLHYYENAIQ---NSTY 124

Query: 175 ESFARGPLVKVDPQGRCGGVLVYGL----------QMIILKASQGGSGLVGDEDTFGSGG 224
           E  ++  L+ ++P G C  +    L          ++    A      +V  E      G
Sbjct: 125 EKLSKSELL-LEPYGSCSCLRFKNLLCFLPFETAEELDDDDADSENEDMVKSEKKEHENG 183

Query: 225 GFSARI--------ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
             +  +        ++S +I+ + LD  +  + D  F+  Y EP   IL +R+  WAG +
Sbjct: 184 TVNVPVTDQPGSFFDTSFLIDGQSLDSSIGSIIDMQFLFKYREPTFGILSQRQQAWAGNL 243

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHS 333
                      L++  T K    +    NLP+D  +++ +PSP+ G L++G N  IH  +
Sbjct: 244 PKIKDNVQFCILTLDLTTKSTVSVLKIDNLPYDVDRIVPLPSPLNGCLLLGCNEIIHVDN 303

Query: 334 QSASCALALNNYAVSLDSSQEL--PRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLT 390
                 +A+N +   + +S +    ++  +++L+      L ND  ALL   TG+   L 
Sbjct: 304 GGIVRRIAVNQFTSLITASTKAYQDQTHLNLKLEDCSVVALPNDHRALLVLSTGEFYYLN 363

Query: 391 VVYDGRVVQRLDLSKTNPSVLTSD--------ITTIGNSLFFLGSRLGDSLLVQF 437
              DG+ +++  +   +  +L SD        I T+ N+L F  +  G+S LVQF
Sbjct: 364 FEVDGKSIKKFTIESVD-KLLYSDIKLTFPGQIATLDNNLLFFANHNGNSPLVQF 417



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/164 (22%), Positives = 68/164 (41%), Gaps = 26/164 (15%)

Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
           H   +L  +   G +L Y+ Y F+G                    N    + ++L  +  
Sbjct: 786 HKEEYLTILTIGGEVLLYKLY-FDG-------------------ENYEFKKEKDLAITGA 825

Query: 896 PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
           P +AY     P G   +R +  F N++G+   F++G  P   +     +    Q      
Sbjct: 826 PENAY-----PIGTAVERRLAYFPNLNGYTCIFVTGVTPYLILKSLHSIPRIYQFSKIPA 880

Query: 955 VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           V+ +  H+    +G I++ +Q   +ICQLP    Y+N WP++ +
Sbjct: 881 VSISPFHDSKVANGLIFLDNQQNARICQLPLDFNYENTWPMKLI 924


>gi|9794906|gb|AAF98387.1| cleavage and polyadenylation specificity factor [Drosophila
           melanogaster]
          Length = 279

 Score = 97.4 bits (241), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 81/300 (27%), Positives = 141/300 (47%), Gaps = 65/300 (21%)

Query: 425 LGSRLGDSLLVQFTCGSGTSMLS------------SGLKEEFGDIEADAPSTKRLRRSSS 472
           LGSRLG+SLL+ FT    +++++              L++E  ++E +     +L  + +
Sbjct: 1   LGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDEDQNLE-EIFDVDQLEMAPT 59

Query: 473 DALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD----- 527
            A    +  EEL +YGS +  +    + F F V DSL+N+ P+     G R+  +     
Sbjct: 60  QAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAPINYMCAGERVEFEEDGVT 119

Query: 528 ---------------ASATGISKQS---------NYELV---ELPGCKGIWTVYHKSSRG 560
                           +ATG SK           N +++   EL GC  +WTV+      
Sbjct: 120 LRPHAESLQDLKIELVAATGHSKNGALSVFVNCINPQIITSFELDGCLDVWTVFD----- 174

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
              D+++ ++ +D+ H ++++S    T+VL+T   + E+ E+  + V   TI  GNL  +
Sbjct: 175 ---DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQEINEI-ENTGFTVNQPTIFVGNLGQQ 229

Query: 621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           R ++QV  R  R+L G+ + Q++               S V+ VSIADPYV L + +G +
Sbjct: 230 RFIVQVTTRHVRLLQGTRLIQNVPIDVG----------SPVVQVSIADPYVCLRVLNGQV 279


>gi|452825139|gb|EME32137.1| cleavage and polyadenylation specificity factor subunit-like
           protein [Galdieria sulphuraria]
          Length = 1454

 Score = 96.7 bits (239), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 119/529 (22%), Positives = 200/529 (37%), Gaps = 91/529 (17%)

Query: 184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           KVDP+     VL+    ++++        ++   D+  +    +  +    +++LR L  
Sbjct: 166 KVDPEHGLIAVLIRKKNLLLI----AKYPILSHRDSLSAECSSNKLLSDPVILDLRRLGH 221

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
                F F+ GY  P + +L E+  TW+G  S    + ++S +    + K+   IW    
Sbjct: 222 FETIHFCFMFGYSLPTLALLEEKTPTWSGSFSVTRDSRLVSVVQFDLSDKKMKRIWQVEE 281

Query: 304 LPHDAYKLLAVPS-PIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFS 361
           LPH+ + + +VP    GG LV G N I Y    +    L+ N+      S   L      
Sbjct: 282 LPHECFMVSSVPFLQGGGFLVFGWNIILYFRDGSFVDGLSCNDLGDVYLSKWSLRSQDAP 341

Query: 362 VELDA--------AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
           + LD         +H T+++N V +L  + G    L +   G     + L      +  S
Sbjct: 342 ISLDGCEVVSEFDSHDTFMKNPVIIL--RDGAFFELCIPKKGG-DSVISLRYCKILIQPS 398

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            ++  GN L FLGS +  S L++    + T +                           D
Sbjct: 399 TVSYCGNGLIFLGSHVSPSALLEIIWKNSTEL-----------------------HPEDD 435

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            L+        S +G +SN     +   S   RDSL  IGP++D      I   +    +
Sbjct: 436 ELE--------SFFGKSSNKNFVVETIDS---RDSLFCIGPIQDLEVFDNIIGSSRKMEL 484

Query: 534 SK---QSNYELV---------------ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
                  NY  V                L  C+ IW V  +   G    S  +       
Sbjct: 485 IAAVGSRNYGAVIIFRRTVSPSLLTSIRLEDCQQIWNVLCQRKMGERNGSVPL------- 537

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL- 634
              LI+S +  T+VL  +D + E+ +S  +    RT+    +   R +IQVF+ G RIL 
Sbjct: 538 ---LILSTQRNTIVLSVSDTIDELVDS-QFQTSSRTLWVSRVLHDRYIIQVFDEGLRILG 593

Query: 635 DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
           +   +       P +           V    + DPYV+L +S   + +L
Sbjct: 594 NWDSLISLYELPPGD----------VVTQAFVCDPYVMLHLSSSYLVIL 632


>gi|301093651|ref|XP_002997671.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110061|gb|EEY68113.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 478

 Score = 96.3 bits (238), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 101/387 (26%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 106 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 165

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
            + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A     
Sbjct: 166 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 225

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
           + +L     +V L      +LQ    LL+   G + +L++ Y+    + L          
Sbjct: 226 TAQL-----NVVLYDCQFEYLQEKELLLTMPCGQVYVLSLPYEDTSSRGL---------- 270

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIE----------ADA 461
                  G    F+GSR GDS+L           L +  +EE  D E            A
Sbjct: 271 ---YGFGGKQTLFIGSRSGDSVLFVLD----KKKLVTATEEEPKDEEMPIKEVVIKQESA 323

Query: 462 PSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT--------------------- 500
           P  K     S  A ++  + ++L LYG+A    E A  +                     
Sbjct: 324 PEIK-----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAP 378

Query: 501 ------FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV---------- 542
                 + + +R  D L +IG +     G+  NAD      S +   ELV          
Sbjct: 379 EQDTGPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGA 432

Query: 543 ----------------ELPGCKGIWTV 553
                           EL GC+ +WTV
Sbjct: 433 ISVLHNGLRPIVGTEAELNGCRAMWTV 459


>gi|452841862|gb|EME43798.1| hypothetical protein DOTSEDRAFT_79774 [Dothistroma septosporum
           NZE10]
          Length = 1347

 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 195/965 (20%), Positives = 360/965 (37%), Gaps = 175/965 (18%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  Y L G V +LA +     D     D+++LAF+DAK++++E+D   H +   S+H
Sbjct: 51  LVLVGEYSLSGTVTNLAQVKL--PDTKTAGDALLLAFKDAKLTLIEWDPENHRISTISIH 108

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            +E    +    G        ++ VDP  RC  +     Q+ +L   Q    L  +ED  
Sbjct: 109 YYEGDNVVSQPFGPGLGECENILTVDPNWRCAALKFGTRQLAVLPFRQLDDELGVEEDGD 168

Query: 221 GSGGGFSAR-----------------IESSHVINLRDL--DMKHVKDFIFVHGYIEPVMV 261
                 + +                  ++S V+ L  L  D+++  D  F++GY E  + 
Sbjct: 169 AEPASTTLKRSESILQNVNGEVQQTPYKASFVLALSTLLEDIRYTVDLGFLYGYRESTLG 228

Query: 262 ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
           IL       +  +  +          +     +   +     LP+  +K++ +P+P+GG 
Sbjct: 229 ILSSSLQPSSSLLDIRKDELEYRMFKLELEQGESTELQVVKQLPNSLWKVVPLPAPVGGA 288

Query: 322 LVVGANT-IHYHSQSASCALALNNYAVSLDSSQELP-RSSFSVELDAAHATWL--QNDVA 377
           L+VG N+ +H    +   ++A+N +A +L+S + +  +S  +++L+      L  ++   
Sbjct: 289 LLVGTNSFVHVDLNAKVNSVAVNEFA-ALESDRGMEDQSDLNLKLEGCSVEILDAESRQV 347

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLG 426
           L+  + G L  +     GR +Q L           DL KT PS     +  + ++  F+G
Sbjct: 348 LVVLRDGSLATIYFEQSGRSIQGLKVSRVREEHGGDLVKTAPSC----VARLDHNKVFVG 403

Query: 427 SRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSL 486
           S  G S LV+++     S LS   K   G +               D         E   
Sbjct: 404 SEDGASSLVRWS--RSISTLSR--KRTHGQMLGQHGDEDDEEALEDDDDDLYDAAPETK- 458

Query: 487 YGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS------ATGISKQS--- 537
              A++ T++ +   SF ++D L ++GP+ D   G    A          TG  + S   
Sbjct: 459 -KRATSTTDAFETPPSFQIQDVLHSLGPINDVCLGKSDGAQVDKLQMMLGTGRGRSSRIS 517

Query: 538 --NYELVELPG-------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
             N ++V +          K  W V+ K +             DD++H  L+ + + +  
Sbjct: 518 CLNRDIVPVSARKSTIGRAKSAWAVHAKRND-----------RDDDFHDNLLFAYDGQET 566

Query: 589 VLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFG 646
            +   D +  +  +   F  +G TI    L     V+Q  +   R  D    ++Q +   
Sbjct: 567 KIYDVDEVGYMERTAQEFEHEGETIDVQMLAKDTIVVQCRKSEIRTYDADLALSQIIPMV 626

Query: 647 PSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP 706
              ++     E   ++ +S  DPY+L+  +D SI++L                     K 
Sbjct: 627 DEETD-----EEYEIVYLSFCDPYLLVVRNDSSIQVL-----------------HVRGKE 664

Query: 707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVV-----CYESG 761
           +       D   + WL                     GG +  G +   V         G
Sbjct: 665 IEPLEGEGDIAEKKWL---------------------GGSIHTGSLTKDVPALFLLSAQG 703

Query: 762 ALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
            + +F +P+   V                Y   AL      ++S + +    G KE +  
Sbjct: 704 TMHVFSLPSLEPV----------------YHAPALPHLPPVLSSDAPQRRA-GPKEALTE 746

Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
           + V EL     ++    P+L A      ++ Y+ +    P  + +             +N
Sbjct: 747 LLVAELG----ASGVDTPYLVARTALDDLVLYEPFRHPEPAPSDQ-----------WYTN 791

Query: 882 VSASRLRNLRFSRTPL--DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF 939
           +   R R +  +  P   +A  +EE+    P + I    ++  +    + GS P   ++ 
Sbjct: 792 L---RFRKVPVTYIPKYNEAIAQEESTRPLPLRSI----HVGDYDAVTIPGSPP--LLLV 842

Query: 940 RER------LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYW 993
           +E       L V        +     +H  +C  GF  V + G+L+   LP  + Y   W
Sbjct: 843 KEASSLPRVLEVRISNESNRVATLLPIHLDHCKKGFAAVNADGLLEEYHLPLSAWYGTGW 902

Query: 994 PVQKV 998
            VQ+V
Sbjct: 903 SVQQV 907


>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum
           CBS 127.97]
          Length = 1267

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 133/579 (22%), Positives = 232/579 (40%), Gaps = 65/579 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V   ++++++ +     GS  +    +  R    D    A L L   Y + G +  L
Sbjct: 28  NLIVVKTSLLQVFSLVNVTYGSTTATQPDQKGRN---DRSQHAKLVLAAEYEVPGTITGL 84

Query: 117 AILSQGGADNSRRR-DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
             +      NS+   D+I+++  +AK+S++E+D   HG+   S+H +E  E  H+     
Sbjct: 85  QRVR---ISNSKSGGDAILVSSRNAKLSLIEWDPEKHGISTISIHYYEGEES-HMSPWVP 140

Query: 176 SFARGPL-VKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDE---------------D 218
                P  + VDP G C  +  +G+  + IL   Q G  LV D+               D
Sbjct: 141 DLGSCPSSLTVDPNGNCA-IFNFGIHSLAILPFHQAGDDLVMDDYDATPNGDDSTDMVSD 199

Query: 219 TFGSGGGFSARIES---SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
              S  G +A  +    S V+ +  LD  + H     F+H Y EP   IL+ +       
Sbjct: 200 AQKSAPGNTAHDKPYAPSFVLPMAALDPALTHPIHMEFLHEYREPTFGILYSQVARSTSL 259

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
              +      S  ++    +    + +   LP D +K++ +P P+GG L++G N  +H  
Sbjct: 260 TIDRKDVVSYSIFTLDLQQRASTSLLTVSRLPSDVFKIVPLPPPVGGALLIGTNELVHVD 319

Query: 333 SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLT 390
               + A+ +N +A    +     +S   + L+      L +     LL    G + +L+
Sbjct: 320 QAGKTNAVGVNEFARQASAFSMADQSDLEMRLEGCIVEQLGSGTGDVLLILADGRMSILS 379

Query: 391 VVYDGRVVQRLDL-----------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
              DGR V  + L           +K  PS   S    +G +  F GS  GDS+L+ ++ 
Sbjct: 380 FKVDGRSVSGISLHFVAEQSGGLITKARPSCSAS----LGRNKLFYGSEEGDSILLGWSR 435

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES--- 496
            S T+   S  K   G  E+ A           D   D +  ++L     AS   E    
Sbjct: 436 PSSTTKRPS--KAADGVDESGAADLSDEAEQDDDGDDDDMYEDDLYSVNPASIRQEKQVV 493

Query: 497 ---AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGI--- 550
              +   F+F   D L ++GP +D + G    + +     S  +    +EL   +G    
Sbjct: 494 NGDSPADFTFRAYDRLWSLGPYRDITLGKPPKSKSKDQRDSVPAIAAPLELVAARGFGKS 553

Query: 551 --WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
              TV  +    +  DS +M   DD Y  + I  ++ ++
Sbjct: 554 GGLTVLKREVDPYTIDSLKM---DDVYGVWSIRVVDPKS 589



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/91 (25%), Positives = 45/91 (49%), Gaps = 2/91 (2%)

Query: 911  CQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV-AFTVLHNVNCNHGF 969
            C+ +    ++ G++  F+SG  PC+ ++     R H     G  V + +  H   C  GF
Sbjct: 751  CKLLRALPDVCGYKTVFMSGHNPCF-ILKSAIARPHVLRLRGKAVQSLSGFHIAACERGF 809

Query: 970  IYVTSQGILKICQLPSGSTYDNYWPVQKVVF 1000
             YV    ++++ +LPS + +D+ W  +K+  
Sbjct: 810  AYVDEDNVIRMSRLPSNTRFDSGWATRKIAL 840


>gi|443894082|dbj|GAC71432.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
           [Pseudozyma antarctica T-34]
          Length = 1543

 Score = 94.7 bits (234), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 101/406 (24%), Positives = 170/406 (41%), Gaps = 58/406 (14%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
            LV    +++ IY V  +      +  S  T      D     +L +   + L G V  L
Sbjct: 46  QLVTARDDLLTIYDVYDRSSSQSAASTSNGTANGTAGDAKPRHTLIVTRRHSLFGTVTGL 105

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +    +D   R   ++++F DAK+++LE++D+   L   S+H +E      L  G   
Sbjct: 106 QRVDTLASDKDARH-RLLVSFADAKLALLEWNDTTDDLETVSIHTYERAT--QLLNGTPP 162

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFS--------- 227
             R P + VDP  RC  +L+    + IL   +  +     E  F  G GF          
Sbjct: 163 LFR-PNLNVDPLSRCAALLLPHDALAILPFYRDNA-----EFDFDDGLGFDLANDALDAS 216

Query: 228 --------ARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
                   A +ES     S V+ +R++D  ++++KDF F+ G+ +P + +L +   TW G
Sbjct: 217 DAAAMAAAAHMESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFDHSPTWTG 276

Query: 273 RVSWKHHTCMIS--ALSISTTL------------------KQHPLIWSAMNLPHDAYKLL 312
            ++ +  +  +    L +S +L                    HP++ ++  LP+D   +L
Sbjct: 277 LLTHRKDSFAVYLFTLDLSASLDGATLGSAAALLDDGNMRSAHPVVTTSSQLPYDCLYML 336

Query: 313 AVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV----ELDAAH 368
             P  +GGVLVV  + I +  QS    +   N      S+ E P S   V    +L A+ 
Sbjct: 337 PCPQSLGGVLVVCMSAILHVDQSGRVVVTALNRWFKTTSAIE-PESVLDVPGLADLQASQ 395

Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
             +  +  A+LS   GDL  L    DGR V+   L + +     SD
Sbjct: 396 LVFTTDTDAVLSLSNGDLYRLRCHMDGRSVEGFRLERIDQLTAGSD 441


>gi|301103688|ref|XP_002900930.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
 gi|262101685|gb|EEY59737.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
          Length = 613

 Score = 94.7 bits (234), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 95/383 (24%), Positives = 159/383 (41%), Gaps = 93/383 (24%)

Query: 235 VINLRDLDMK-HVKDFIFVHGYIEPVMVILHER--ELTWAGRVSWKHHTCMISALSISTT 291
           ++ LR++++   V D  F+ GY+EP +++LHE   + +  GR++    T  ++ +SI+  
Sbjct: 241 LLRLREVEITGKVIDLAFLDGYLEPTLMVLHEENDKNSTCGRLAVGFDTYCLTVISINMK 300

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
            + HP IW+  NLP D ++L+   +P+GGV+V+ AN I Y +Q+    LA N +A     
Sbjct: 301 TRLHPKIWTVKNLPSDCFRLIPCRAPLGGVVVLSANAILYFNQTQFHGLATNVFASKTHE 360

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL 411
           + +L     +V L      +LQ    LL+  +G + +L++ Y+    + L          
Sbjct: 361 TVQL-----NVVLYDCQFEYLQEKELLLTMPSGQVYVLSLPYEDTSSRGL---------- 405

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI------EADAPSTK 465
                  G    F+GSR GDS+L         +      K+E   I      +  AP  K
Sbjct: 406 ---YGFGGKQTLFIGSRSGDSVLFVLDKKKLVTATEEEPKDEEMPIKEVVIKQESAPEIK 462

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------------------------- 500
                S  A ++  + ++L LYG+A    E A  +                         
Sbjct: 463 -----SEPAEEEEEDEDDLFLYGAAPTKEEPAATSSTECTNGVGVSSVKTEENGAPEQDT 517

Query: 501 --FSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYELV-------------- 542
             + + +R  D L +IG +     G+  NAD      S +   ELV              
Sbjct: 518 GPYDYELRQIDVLPSIGQITSIELGVENNAD------SNEKREELVISGGYERSGAISVL 571

Query: 543 ------------ELPGCKGIWTV 553
                       EL GC+ +WTV
Sbjct: 572 HNGLRPIVGTEAELNGCRAMWTV 594


>gi|298715583|emb|CBJ28136.1| cleavage and polyadenylation specificity factor CG10110-PA
           [Ectocarpus siliculosus]
          Length = 1906

 Score = 94.4 bits (233), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 93/303 (30%), Positives = 136/303 (44%), Gaps = 77/303 (25%)

Query: 215 GDEDTFGSGGGFSARIESSHVINLR-------DLDMKHVKDFI----FVHGYIEPVMVIL 263
           G+ED  G G G +A+ +     NL        DL+   +  FI    F+ G+ EP + +L
Sbjct: 265 GEEDG-GLGNGATAKGDGGAGGNLAVSKPFTIDLEEAGITGFIKAAAFLEGFHEPALALL 323

Query: 264 HERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLV 323
           +E   T AGR++ K  TC ++ LSI+ T  + P+IW   NLPHD++ L+ VPSPIGG+ V
Sbjct: 324 YEPIQTCAGRLASKRSTCRLALLSINLTQGRAPVIWQVENLPHDSWDLVPVPSPIGGLQV 383

Query: 324 VGANTIHYHSQS-ASCALALNNYA-VSLDSS-QELP------------------------ 356
           +  N + + +QS     LA+N YA  ++D +  E P                        
Sbjct: 384 ISTNAVMHVNQSEVRSILAVNGYARATVDPALLECPLRGGDSDWGWTSFRRSHPEREVVD 443

Query: 357 RSSFSV--ELDAAHATWLQNDVALLSTKTGD-----LVLLTVVY---------------- 393
            SS+ V  ELD     +L     LLS +TG+     L L TV                  
Sbjct: 444 LSSYDVCIELDVVRCAFLTPTSMLLSLRTGEVYALRLHLTTVTAAAADAAGCSRPPGGAA 503

Query: 394 ---DGRVVQR--LDLSKTNP-SVLT---------SDITTIGNSLFFLGSRLGDSLLVQFT 438
                RVV +    + + +P SVL             +     L F+GSR+GDSLLV ++
Sbjct: 504 FGTPNRVVGQSMRPVGRASPCSVLAVAASGGSGGDGGSGASKGLVFMGSRVGDSLLVDYS 563

Query: 439 CGS 441
             S
Sbjct: 564 VAS 566



 Score = 45.8 bits (107), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 49/221 (22%), Positives = 89/221 (40%), Gaps = 52/221 (23%)

Query: 3   FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTA 62
           +  Y+ +H PTG+ +   G +T + +                            +LVV  
Sbjct: 7   YTCYRQLHPPTGVDHAVFGSVTAAGSR---------------------------DLVVAK 39

Query: 63  ANVIEIYVVRVQEEGSKESKNSGETKRRVLM------DGISAASLELVCHYRLHGNVESL 116
           A+ +E+Y V   +  S  +  +    R          D  S   LEL   + L GN+ +L
Sbjct: 40  ASTLELYRVHRDDHSSTAAAAAAAAARDTSNGDERDDDDASGYYLELAGTFPLAGNITAL 99

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A++           D ++++F  AK++++ +D  +  L   S+H F++        G ES
Sbjct: 100 AVIP----------DILVVSFGVAKMALVAYDSVLGRLETISIHNFDAGAIGPGAGGVES 149

Query: 177 -FARGPLVK--------VDPQGRCGGVLVYGLQMIILKASQ 208
            +     +K         DP GRC   +V G Q+++L A +
Sbjct: 150 GYGLAAALKDRPRTISSSDPAGRCLAAVVAGCQLVVLPARR 190


>gi|402085944|gb|EJT80842.1| cft-1 [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 1450

 Score = 94.0 bits (232), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 159/679 (23%), Positives = 266/679 (39%), Gaps = 112/679 (16%)

Query: 91  VLMDGISAASLELVCHYRLHGNVESLA------ILSQGGADNSRRRDSIILAFEDAKISV 144
           V  D  S   + L+  + L G V  LA      +   GG   S   D +++AF+DAK+S+
Sbjct: 86  VRSDRASHTKIVLIAEFPLSGTVTGLARVKPPNVSKTGGG--SGVGDLLLIAFKDAKLSL 143

Query: 145 LEFDDSIHGLRITSMHCFESPE-----WLHLKRGRESFARGPLVKVDPQGRCGGVLVYGL 199
           + +D     L   S+H +E  E     W        +F     +  DP  RC  +     
Sbjct: 144 VAWDSERRSLETFSIHYYEQDELQGNPWECPLSDYANF-----LVADPGSRCAALKFGPR 198

Query: 200 QMIILKASQGGSGL-VGDEDTFGSGG------------GFSARIES-----SHVINLRDL 241
            + IL   Q    + +GD D    G               ++ IE      S V+ L +L
Sbjct: 199 SLAILPFKQADEDIGMGDWDEALDGPRPAQSQSAAVAINGTSTIEDTPYSPSFVLRLPNL 258

Query: 242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
           D  + H     F++ Y EP   IL    +T +  +  K H    +  ++    K    I 
Sbjct: 259 DPALLHPVHLAFLYEYREPTFGILSS-SITPSNCLDRKDH-LTYTVFTLDLQQKASTTIL 316

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRS 358
           S   LP D  +++A+P+P+GG L+VGAN  IH      +  +A+N +     S      S
Sbjct: 317 SVGGLPKDLTRVIALPAPVGGALLVGANELIHIDQSGKANGVAVNPFTKQCTSFGLADHS 376

Query: 359 SFSVELDAAHATWL--QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP----SVLT 412
             ++ L+      L  ++   L+    G L  +T   DGR V  L +    P    ++L 
Sbjct: 377 DLNLRLEGCTIEVLSAEHGELLVVLDDGRLATITFHIDGRTVSGLKVRIIPPEAGGNILP 436

Query: 413 SDITT---IGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
           + ++    IG +  F GS  GDS+++ +    S  S   S +++   D++ D    +   
Sbjct: 437 TSVSCLSRIGRNAMFAGSERGDSIVIGWNRKSSQVSRKKSRVQDPDLDLDIDFDDLEDDE 496

Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA 528
               D   D    +  ++ G AS   ++  +   F   D L++I P++D +YG       
Sbjct: 497 DDDDDLYGD--TEKTTTVAGLASG--QAKLEDLVFRCHDRLISIAPIRDMAYGKPPPPAE 552

Query: 529 SATGISK----QSNYELV--------------------------ELPGCKGIWTVY---- 554
             TG       QS  +LV                          + P  +G+WT+     
Sbjct: 553 GETGSRNSTPIQSELQLVAVVGRDRASSLAIMNREMTPVSIGRFDFPEARGLWTLACQKP 612

Query: 555 -------HKSSRGHNADSSRMAAYDDEYHAYLIIS------LEARTMVLETADLLTEVTE 601
                   K ++    D      YD     +++++       E+  + + TA    ++  
Sbjct: 613 LPKVLQGEKGTKPVGGDFGVPVQYDK----FMVVAKEDDDNFESSNIYVLTAAGFEKLVG 668

Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESGSGSENST 660
           +      G TI AG +    ++IQV +   R  DG   +TQ +   P   E  +    +T
Sbjct: 669 TEFEPAAGFTIEAGTMGNHTKIIQVLKSEVRCYDGDLGLTQII---PMLDEETNHEPRAT 725

Query: 661 VLSVSIADPYVLLGMSDGS 679
             S SIADPY+L+   D S
Sbjct: 726 --SASIADPYLLIIRDDSS 742



 Score = 46.2 bits (108), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 30/130 (23%), Positives = 57/130 (43%), Gaps = 7/130 (5%)

Query: 867  SDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPH-----GAPCQRITIFK--N 919
            S+D ++      ++  S S    LRF + P  A  + +         AP +R+ +    N
Sbjct: 871  SNDDLTIYEPFKIAESSQSLSGTLRFRKLPNPAVAKSQDTKVSDDAPAPMRRMPLRACGN 930

Query: 920  ISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILK 979
            I+G+   FL G  P + +   +       L    + A +  H   C+ GFIY   +G+ +
Sbjct: 931  IAGYSCVFLPGHSPSFLIKSSKSTPRVIGLQGPGVRAMSPFHTKGCDRGFIYADYEGVAR 990

Query: 980  ICQLPSGSTY 989
            + Q+P+  ++
Sbjct: 991  VAQIPNDCSF 1000


>gi|448105510|ref|XP_004200513.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|448108635|ref|XP_004201144.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|359381935|emb|CCE80772.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
 gi|359382700|emb|CCE80007.1| Piso0_003103 [Millerozyma farinosa CBS 7064]
          Length = 1344

 Score = 93.6 bits (231), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 105/502 (20%), Positives = 196/502 (39%), Gaps = 83/502 (16%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL- 116
           LVV  + +++++ +    + SKE K                  L+LV  ++LHG +  L 
Sbjct: 29  LVVGKSTLLQVFDIVQSNKKSKEYK------------------LKLVEQFKLHGLITDLK 70

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A+ +    D     D ++++ + AK+S++++D   + +   S+H +E+          E 
Sbjct: 71  AVRTVENPD----LDYLLVSTKSAKMSLVKWDHHENSISTVSLHYYENSIQ---SSTYEK 123

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMII-------------LKASQGGSGLVGDEDTFGSG 223
                L+ ++P   C  +    L   +              +   G SG  G  D   + 
Sbjct: 124 LTTTELI-MEPNNTCACLRFKNLLTFLPFEMPDEDDEEDGYENVDGASGSRGKHDNKATQ 182

Query: 224 GGFS-ARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHT 280
              + A   SS VI+ ++LD +  +V D  F++ Y EP + I+  +  TW G +      
Sbjct: 183 QDENQALFYSSFVIDAQNLDSRIGNVIDMKFLYNYKEPTLAIISSKNHTWTGLLPLTKDN 242

Query: 281 CMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSASCA 339
                LS+    K    +    NLP D   ++ +P P+ G L++G N  IH      +  
Sbjct: 243 ISFIVLSLDLVTKTSTTVLKIDNLPFDIDTIVPLPKPLNGTLLIGCNEIIHVDHGGITRR 302

Query: 340 LALNNYAVSLDSSQELPR--SSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGR 396
           LA+N +  S+ SS +  R  S  +++L+      + ND    L  K GD   +    DG+
Sbjct: 303 LAVNQFTSSITSSIKNYRDQSELNLKLENCCVKPIPNDHRVFLILKNGDFYYINFAIDGK 362

Query: 397 VVQRLDLSKTNP-------SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
            ++   L K N             D+  + N+L F+ ++ G+S L++             
Sbjct: 363 TIKNFYLEKVNSINQNEIGISYPEDVVHLDNNLMFICNKNGNSPLIELKF---------- 412

Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTES----------AQK 499
                    +++   +   +     +QD  NG          ++                
Sbjct: 413 ---------SESKDNQNAEQQKDTEMQDTENGTTDKNDNDDDDDIYEDDEDNEKVLIKNS 463

Query: 500 TFSFAVRDSLVNIGPLKDFSYG 521
              F   D L+N GP+  F++G
Sbjct: 464 VIEFTKHDELINNGPVSSFTFG 485



 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/177 (20%), Positives = 75/177 (42%), Gaps = 27/177 (15%)

Query: 824 VVELAMQRWSAHHSRPFLFAILT-DGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNV 882
           +  +        HS+     ILT  G ++ Y+ + F+G                    N 
Sbjct: 774 IKNIVFNELGDEHSKDEYLTILTIGGEVIIYKLF-FDG-------------------DNF 813

Query: 883 SASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF-KNISGHQGFFLSGSRPCWCMVFRE 941
              + ++L+ +  P +AY     P G   +R  ++  N++G+   F++G  P +      
Sbjct: 814 KFIKEKDLKITGAPDNAY-----PLGTTLERRLVYVPNVNGYSSIFVTGIIPYFITKTVH 868

Query: 942 RLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
            +    +      V+F+   + N  +GFIY+ +    ++C++P    Y+N WP++K+
Sbjct: 869 SVPRIFRFTKLPAVSFSSYSDSNIKNGFIYLDNSKNARMCEIPLDFNYENNWPIKKI 925


>gi|294659889|ref|XP_462318.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
 gi|218511978|sp|Q6BHK3.2|CFT1_DEBHA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|199434312|emb|CAG90824.2| DEHA2G17908p [Debaryomyces hansenii CBS767]
          Length = 1342

 Score = 92.8 bits (229), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 103/499 (20%), Positives = 206/499 (41%), Gaps = 82/499 (16%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           L+V  A V++++ +   E  +++ K                  L+LV  ++LHG +  + 
Sbjct: 29  LIVGKATVLQVFEIITTETKTQQYK------------------LKLVEQFKLHGLITDIK 70

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
            +     +NS+  D ++++ + AK+S++++D  ++ +   S+H +E+          E  
Sbjct: 71  AIRT--VENSQL-DYLLVSSKGAKMSLIKWDHHLNSISTVSLHYYENSIQ---SSTYEKL 124

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMII-----------------LKASQGGSGLVGDEDTF 220
               LV V+P   C  +    L   +                 +  S G      +++  
Sbjct: 125 TTTDLV-VEPNNNCTCLRFKNLLTFLPFETLDEEEEDDDDDEEMNGSSGSDKKATNKENG 183

Query: 221 GSGGG-FSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
            S G   S   ESS +I+ R LD +   + D  F++ Y EP + I+  +   WAG +   
Sbjct: 184 NSNGEEVSELFESSFMIDGRTLDSRIGDIIDMQFLYNYREPTIAIIFSKAHAWAGNLPKV 243

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHYHSQSA 336
                   LS+    K    +    NLP D  K++ +P P+ G L++G N  IH  +   
Sbjct: 244 KDNINFIVLSLDLVTKASTTVLKIDNLPFDIDKIIPLPQPLNGSLLMGCNEIIHVDNGGI 303

Query: 337 SCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVY 393
           +  LALN +  S+ +S +    +S  +++L+      + ND   L+    GD   +    
Sbjct: 304 TRRLALNQFTSSITTSLKNYHDQSDLNLKLENCSVKPIPNDNKVLMILNNGDFYYINFKI 363

Query: 394 DGRVVQRL-----------DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSG 442
           DG+ +++            D+  T P     +I T+ N+L F+ ++ G++ L++    + 
Sbjct: 364 DGKTIKKFFVEKVSDLNYDDIQLTYP----GEIATLDNNLMFISNKNGNNPLLELKYKNF 419

Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
             ++    +E                  +S+ L +    ++L      +      + +  
Sbjct: 420 EHVIVQENEE------------------NSNPLDNEDEEDDLYEEDEVNKKISINKSSIE 461

Query: 503 FAVRDSLVNIGPLKDFSYG 521
           F   D L+N GP+ +F+ G
Sbjct: 462 FIKHDELLNNGPISNFTLG 480



 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 26/118 (22%), Positives = 53/118 (44%), Gaps = 4/118 (3%)

Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR 940
           N    + ++L  +  P +AY+   T      +R+  F N++G    F++G  P +     
Sbjct: 810 NFKLVKEKDLIITGAPDNAYSLGTTIE----RRLVYFPNVNGFTSIFVTGITPYYISKTT 865

Query: 941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             +    +      V+F    +    +G IY+ +    +IC++P    Y+N WP++K+
Sbjct: 866 HSVPRIFKFTKLPAVSFAPYSDDKIKNGLIYLDNSKNARICEIPVDFNYENNWPIKKI 923


>gi|254564833|ref|XP_002489527.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
           [Komagataella pastoris GS115]
 gi|238029323|emb|CAY67246.1| RNA-binding subunit of the mRNA cleavage and polyadenylation factor
           [Komagataella pastoris GS115]
 gi|328349950|emb|CCA36350.1| Protein cft1 [Komagataella pastoris CBS 7435]
          Length = 1388

 Score = 89.7 bits (221), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 105/444 (23%), Positives = 177/444 (39%), Gaps = 51/444 (11%)

Query: 93  MDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
           +D      L LV  Y+L G V  L  +           D +++A +  K S++++D S +
Sbjct: 80  IDFSQNVKLSLVAEYKLDGLVTDLCKIR---TIEDSHHDYVLVATKGVKFSMIKWDQSSN 136

Query: 153 GLRITSMHCFESPEWLHLKRGRES-----FARGPLVKVDPQGRCGGVLVYGLQMIILKAS 207
            +   S+H        H K+  E+     F     +  DP   C  +L   + +  L   
Sbjct: 137 SISTVSLH--------HYKKIVENSLIDKFNVDTKLIADPNNHCSCLLANEI-LFFLPFL 187

Query: 208 QGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
           Q       DE+  G          ++ +    DL  ++K + D  F+HGY EP + +L+ 
Sbjct: 188 QHEV----DEELDGKFVENKKLYSNTFLQFSNDLQPNIKTIIDIEFLHGYSEPTLAVLYT 243

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
              T  G +     T  +   S++   K    I    NLP+D  ++L + SP+ G L++G
Sbjct: 244 SFPTCTGALPKAKDTVSLQVFSLNLQNKASTSIIEVNNLPYDTDRILPLSSPLNGCLLIG 303

Query: 326 AN-TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTG 384
           AN  IH +S   +  ++ N +A    + +   +S+  + L+      + ND  +L T+ G
Sbjct: 304 ANQIIHLNSMGTAKGISCNLFAAKCSNFKLSDQSNLDLRLEKCVLGQVYNDKVILITEKG 363

Query: 385 DLVLLTVVYDGRV-----VQRLDLSKTNPSVLT--SDITTIGNSLFFLGSRLGDSLLVQF 437
                +    G V     +Q++   K    VL+  +  T I    FF+G +  DS+L   
Sbjct: 364 AFYAFSFDIVGGVSSINEIQKIAAEKYQGLVLSLPTMFTNIDGKTFFIGCQGSDSVLF-- 421

Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
                      G K        D     ++  +  DAL       E  LY     N    
Sbjct: 422 -----------GSKARLNTQNVDVNGKSKV-ITEEDALY------EEDLYADDIQNVAQG 463

Query: 498 QKTFSFAVRDSLVNIGPLKDFSYG 521
                F   DSL+NIGP+ +F+ G
Sbjct: 464 IDHIDFVKLDSLLNIGPITNFTTG 487


>gi|343425828|emb|CBQ69361.1| related to cleavage and polyadenylation specificity factor, 160 kDa
           subunit [Sporisorium reilianum SRZ2]
          Length = 1567

 Score = 89.7 bits (221), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 85/349 (24%), Positives = 158/349 (45%), Gaps = 48/349 (13%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  + L G V  L  + Q  A +   RD ++++F+DAK+++LE++D    L   S+H
Sbjct: 92  LVLVRRHTLFGVVTGLQRV-QTLATDKDARDCLLVSFKDAKLALLEWNDLTDDLETVSIH 150

Query: 161 CFE-SPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-------KASQGGSG 212
            +E +P+ L+   G  +    P++ VDP  RC  +L+    + +L               
Sbjct: 151 TYERAPQLLN---GTPNLFH-PILNVDPLSRCAALLLPHDALAVLPFYRDAADFDFDLDD 206

Query: 213 LVGDEDTFGSGGGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHE 265
            +       +    +A +E+     S V+ +R++D  ++++KDF F+ G+ +P + +L  
Sbjct: 207 RLDLAKDDAAAVAAAAEMETLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQKPTVAVLFS 266

Query: 266 RELTWAGRVSWKHHTCMI-------------------SALSISTTLKQHPLIWSAMNLPH 306
              TW G ++ +  T  +                    AL   T    HP++ ++  LP+
Sbjct: 267 HTPTWTGLLAERKDTFSVYLFTLDLSASLDGTLSSAADALDDGTVRSAHPVVTTSTALPY 326

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSF 360
           D   +++ P  +GGVLVV  +++ +  QS    + ALN +     A+  +S  +LP    
Sbjct: 327 DCLYMVSCPQTLGGVLVVCMSSVLHVDQSGRVVVTALNGWFKTISAIEPESVLDLPEIP- 385

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             +L  +   +      +L+   GDL       DGR V+   L + + S
Sbjct: 386 --DLQGSQLVFTAETAGVLALVDGDLYRFRCQMDGRSVEGFRLERMDQS 432


>gi|402913617|ref|XP_003919276.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Papio anubis]
          Length = 132

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 48/121 (39%), Positives = 72/121 (59%), Gaps = 12/121 (9%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           LEL   +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct: 18  LELAASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 73

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL-----KASQGGSGLVG 215
            FE PE   L+ G       P V+VDP GRC  +LVYG ++++L       ++   GLVG
Sbjct: 74  YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVG 130

Query: 216 D 216
           +
Sbjct: 131 E 131


>gi|71021721|ref|XP_761091.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
 gi|46100541|gb|EAK85774.1| hypothetical protein UM04944.1 [Ustilago maydis 521]
          Length = 1597

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 95/399 (23%), Positives = 177/399 (44%), Gaps = 54/399 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKE-----SKNSGETKRRVLMDGISAASLELVCHYRLHG 111
            LV    +V+ IY V  Q   S       S+++  +         S  +L +  ++ L G
Sbjct: 46  QLVTARDDVLTIYDVYGQPHASASTIPGISRHTATSSVSSNTSACSHKNLVISRNHTLFG 105

Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE-SPEWLHL 170
            V  L  + Q  A +   RD ++++F+DAK+++LE++D+I  L   S+H +E +P+ L+L
Sbjct: 106 AVTGLQRV-QTLASDKDNRDRLLVSFKDAKLALLEWNDAIDDLETISIHTYERAPQLLNL 164

Query: 171 KRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE-------DTFGSG 223
                     P++ VDP  RC  +L+    + IL   +  +    D            + 
Sbjct: 165 A----PHLFHPILNVDPLSRCAALLLPHDSLAILPFYRDAADFDFDLDDHLEIAKDDVAA 220

Query: 224 GGFSARIES-----SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
              +A ++S     S V+ +R++D  ++++K F F+ G+ +P + +L     TW G +S 
Sbjct: 221 VVAAADLQSLPYSPSFVLTMREVDPKIRNLKHFCFLPGFQKPTVAVLFSHNPTWTGLLSE 280

Query: 277 KHHTCMI--------------------SALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           +  T  +                     AL   T    HP++ ++  LP+D   ++A P 
Sbjct: 281 RKDTFSVYLFTLDLSASLDGATFSSSAEALDDGTARSAHPVVTTSTPLPYDCLYMVACPQ 340

Query: 317 PIGGVLVVGANTIHYHSQSASCAL-ALNNY-----AVSLDSSQELPRSSFSVELDAAHAT 370
            +GGV+VV  +++ +  QS    + ALN +     A+  +S  EL   S   +L  +   
Sbjct: 341 TLGGVIVVCMSSLLHVDQSGRVMVTALNQWFKTTSAIEPESILEL---SDIADLQGSQLV 397

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
           +      +L+   G++       DGR V+ + L +   S
Sbjct: 398 FTSKTQGVLTLVNGEIYRFRCQTDGRSVEGIRLERMQES 436


>gi|354547787|emb|CCE44522.1| hypothetical protein CPAR2_403250 [Candida parapsilosis]
          Length = 1334

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 98/458 (21%), Positives = 181/458 (39%), Gaps = 64/458 (13%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L+LV  ++L G V  L  L       + + D II++ + AK S+++++  +H +   S+H
Sbjct: 57  LKLVEQFKLQGTVTGLKPLR---TSENPQLDYIIVSTKYAKFSIIKWNHQLHSISTVSLH 113

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
            +E+          E  A   L+ V+P       L Y   +  L                
Sbjct: 114 YYEN---CIQHSTFEKLAISDLI-VEPTYSSVSCLRYKNLLCFLPFEGVNDHDDDDDDDD 169

Query: 205 -----KASQG-GSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
                   +G    + G + + G+        +SS +I+   L+  +  V D  F+H Y 
Sbjct: 170 DDDDTDDEKGVAENVAGVDKSNGASNDNQPFYDSSFIIDAGTLESSVDSVLDLQFLHHYQ 229

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           E  + IL  +  +WAG +           +++    K    +++  NLP+D  +++ +  
Sbjct: 230 ETTIAILSSKSNSWAGNLIKNKDNVQFQVMTLDIQSKSTLPVFTIDNLPYDIDRIIPLSK 289

Query: 317 PIGGVLVVGAN-TIHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATW 371
           P+ G L++G N  IH  +   +  +A+N +      S+ S Q+    +  +E D +    
Sbjct: 290 PLNGCLLLGCNEIIHVDNGGIAKRIAVNAFTSLITASVKSYQDESELNLKLE-DCSIVPI 348

Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS-------VLTSDITTIGNSLFF 424
            ++   LL   TG+   L    DG+ ++R+ L               + ++ T+ N+L F
Sbjct: 349 PEDHRVLLILATGEFYFLNFELDGKSIKRIHLEAVEQKAYDAIKLTYSGEVATLDNNLLF 408

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
             +  GDS LV+    S   +           +E      K       D   +   GEE 
Sbjct: 409 FANMNGDSPLVEIKYSSSAKV-----------VEKQVLDKKEEDSDEEDLYNEDEEGEEQ 457

Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
            +   +            F + DSL+N GP+  F+ GL
Sbjct: 458 KVMRKSH---------IEFKLHDSLINNGPVSSFTLGL 486


>gi|260941626|ref|XP_002614979.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
 gi|238851402|gb|EEQ40866.1| hypothetical protein CLUG_04994 [Clavispora lusitaniae ATCC 42720]
          Length = 1363

 Score = 87.0 bits (214), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 60/220 (27%), Positives = 110/220 (50%), Gaps = 15/220 (6%)

Query: 232 SSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
           SS ++    LD K   + D  F+H Y +P + +L +++ TWAG +       + S LS+ 
Sbjct: 224 SSFILEASALDNKIGDIIDLQFLHHYRQPTIAVLSQQKSTWAGLLPQTKDNVIFSVLSLD 283

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVS 348
              +    +    NLP+D  K++A+PSP+ G L++G N  IH  +   +  +A+N Y   
Sbjct: 284 MQTRLTTTVLQIENLPYDLEKIIALPSPLNGSLLIGCNELIHVDTGGITRRIAVNQYTED 343

Query: 349 LDSSQE--LPRSSFSVELDAAHATWLQNDVALLST-KTGDLVLLTVVYDGRVVQRLDLSK 405
           + +S +    ++S  ++L+      + ND  LL   +TG++  +    DG+ ++R+ + +
Sbjct: 344 ITASLKNYADQTSLDLKLEDCSILPIPNDNKLLMVLRTGEMYFIVFEVDGKTIKRMSVEE 403

Query: 406 TNPSVLTSDI--------TTIGNSLFFLGSRLGDSLLVQF 437
             PS   S I         ++ N+L FL  R  +S LV+ 
Sbjct: 404 I-PSETYSQIKLMDPSSFASLDNNLLFLTGRSSNSHLVEL 442



 Score = 47.0 bits (110), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 57/124 (45%), Gaps = 16/124 (12%)

Query: 881 NVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI-FKNISGHQGFFLSGSRPCWCMVF 939
           N    +  +L  +  P +AY+     HG   +R  I F ++SG     ++G  P   M+ 
Sbjct: 829 NFQFVKQYDLPITGAPFNAYS-----HGTSIERRMIYFPDVSGTTCIMVTGVIPY--MIT 881

Query: 940 RERLRVHPQL-----CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWP 994
           R R   H Q+         IV+F         +G IY+ ++   +I +LPS  +YD  WP
Sbjct: 882 RSR---HSQVKVFKFSKIPIVSFVPFSTDKIKNGLIYLDTKKNARIVELPSEFSYDYNWP 938

Query: 995 VQKV 998
           ++KV
Sbjct: 939 IRKV 942


>gi|388856288|emb|CCF50097.1| related to cleavage and polyadenylation specificity factor, 160 kDa
           subunit [Ustilago hordei]
          Length = 1568

 Score = 86.3 bits (212), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 91/358 (25%), Positives = 164/358 (45%), Gaps = 61/358 (17%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L+  + L G V  L  + Q  + +   RD ++++F DAK+++LE++ +   L   S+H
Sbjct: 84  LVLIRKHSLFGTVTGLQRI-QTLSTSKDSRDRLLVSFTDAKLALLEWNHTTDDLETVSIH 142

Query: 161 CFE-SPEWL----HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS---- 211
            +E +P+ L    HL +        P + +DP  RC  +L+    + IL   +  +    
Sbjct: 143 TYERAPQLLNGIPHLFQ--------PNLNIDPLSRCAALLLPHDALAILPFYRDAAEFEF 194

Query: 212 --GLVGDEDTFGSGGGFSA-----RIES-----SHVINLRDLD--MKHVKDFIFVHGYIE 257
             GL  D +   +G   +A     +IES     S V+ +R++D  ++++KDF F+ G+ +
Sbjct: 195 DHGLHLDLNLDFAGEDKAAMQAAVQIESLPYSPSFVLTMREVDPKIRNLKDFCFLPGFQK 254

Query: 258 PVMVILHERELTWAGRVSWKHHT----------------CMISALSIS----TTLKQHPL 297
           P + +L     T  G ++ +                    M+ + S S    T    HP+
Sbjct: 255 PTVALLFAHSPTCTGLLAERKDNFSVYLFTLDLAASLDGAMLGSASYSFDDATLRSMHPV 314

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY-----AVSLDS 351
           + ++ +LP+D   +L  P  +GGVLVV  ++I +  QS    A ALN +     A+  +S
Sbjct: 315 LTTSSSLPYDCLYMLPCPQTLGGVLVVCMSSILHVDQSGRVVATALNGWFNLVSAIQPES 374

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             +LP  +   +L  +   +      +L+   GD+   T   DGR +Q   L +   S
Sbjct: 375 LLDLPEIA---DLQGSQLVFTAETEGVLTLVHGDVYTFTCQMDGRNIQGFRLERMQQS 429


>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
           melanogaster]
          Length = 813

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 99/391 (25%), Positives = 156/391 (39%), Gaps = 64/391 (16%)

Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD--- 715
           S V+ VSIADPYV L + +G +  L    +  T  +       SS   V + + Y D   
Sbjct: 10  SPVVQVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTISSSPAVVAISAYKDLSG 69

Query: 716 ----KG----------------------PEPWLRKTSTDAWLSTGVGEAI------DGAD 743
               KG                       EP ++    +  L    G A       D A 
Sbjct: 70  LFTVKGDDINLTGSSNSAFGHSFGGYMKAEPNMKVEDEEDLLYGDAGSAFKMNSMADLAK 129

Query: 744 GGPLDQGDIYS------------VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTY 791
                  D +             VV  +SG LEI+ +P+   V+ V+   +G   + D  
Sbjct: 130 QSKQKNSDWWRRLLVQAKPSYWLVVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLTDAM 189

Query: 792 MREALKDSETEINSSSEEGTGQG-RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTI 850
               +  +  E   +S+ G  Q    ++ +S   +EL++     +  RP L  + T   +
Sbjct: 190 EFVPISLTTQE---NSKAGIVQACMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRVEL 245

Query: 851 LCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAP 910
           L YQ  +F  P+   K        R L   N+   +  ++       D     E+    P
Sbjct: 246 LIYQ--VFRYPKGHLKI-----RFRKLDQLNLLDQQPTHIELDEN--DEQEEIESYQMQP 296

Query: 911 --CQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNH 967
              Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN  +
Sbjct: 297 KYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPN 356

Query: 968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           GF+Y  +   LKI  LPS  +YD+ WPV+KV
Sbjct: 357 GFLYFDTTYELKISVLPSYLSYDSVWPVRKV 387


>gi|255718033|ref|XP_002555297.1| KLTH0G05984p [Lachancea thermotolerans]
 gi|238936681|emb|CAR24860.1| KLTH0G05984p [Lachancea thermotolerans CBS 6340]
          Length = 1307

 Score = 84.3 bits (207), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 127/651 (19%), Positives = 262/651 (40%), Gaps = 131/651 (20%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI-HGLRITSM 159
           L L+  ++LHG +  +A++ Q         D ++++   AK+S++ FD S+   L   S+
Sbjct: 47  LVLLHEFKLHGQITGMALVPQMEGP----LDCLVVSTGKAKLSLVRFDPSMPMCLETLSL 102

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG---LQMIILKASQGGSGLVGD 216
           H +E+      ++     A+   +++DP+ RC  VL++    L ++ L  ++       D
Sbjct: 103 HYYEAE---FTRKNLIELAKTSKLRLDPERRC--VLLFNSDVLALLPLNINEEDE----D 153

Query: 217 EDTFGSGGGFSARIES---------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
           ++   +      ++E+         S V+++ DL  ++K+V D  F++ + +P + +L++
Sbjct: 154 DNQEPTHQAKKRKVENGDARRLAKQSSVLHVSDLSAELKNVVDIQFLNSFSQPTLAVLYQ 213

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
             L W+G         M   ++I+   K++  I+    LPHD + ++ + +     ++VG
Sbjct: 214 PRLAWSGNDKVAGKGSM-RLMAITPHEKKNTTIYQVKELPHDVHTIIPLAN---SCVLVG 269

Query: 326 ANTIHY--HSQSASCALALNNYAVSLDSSQELPRSSFSVELD-----AAHATWLQNDVAL 378
            N I    ++ +    + LN+++     S+++  SS  V        A+       ++ +
Sbjct: 270 VNEIVSVDNTGAIQSTIQLNSFSPKFTGSKQIDNSSLEVMFTEPIVWASAMVSKDREILI 329

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV--------LTSDITTIGNSL------FF 424
           L     D+  +T+  +GR++    L +  P V        L + I  +   +      FF
Sbjct: 330 LMDHKADMYSITLQSEGRLLIDFTLVRL-PIVNDIFKDQNLPTCIVALSGGIRLKTCQFF 388

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEEL 484
           +G   GD+++V+          S+ L+  F     +A           DAL     G++ 
Sbjct: 389 IGFSSGDAVVVK----------SNNLRSAFESQYREAIELPNDEDEDYDALY----GDDE 434

Query: 485 SLYGSASNNTESAQKTFSFAVR--DSLVNIGPLKDFSYGLRINADASATGISKQSNYEL- 541
            L    ++N  + +    F +   DSL+N+GP+     G   + +A+  G+   +  EL 
Sbjct: 435 DLARPVNDNKATVETAVPFEIELMDSLINVGPITSICTGRVSSINATIEGLPNPNRNELA 494

Query: 542 ---------------------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDE 574
                                      ++      IW +  +    +   +   A   D 
Sbjct: 495 IVSTSGHDSGTYLNVMEPSVRPLVQQALKFTSVTKIWNLKIRKKDKYLVTTDSGAEKSDV 554

Query: 575 YHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           Y       + A+   ++       VT          T+    L G +R++QV  +   + 
Sbjct: 555 YE------IGAKIASIKPKHFKRNVT----------TVEIAILGGGKRIVQVTTKAVYLF 598

Query: 635 DGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
           +  +   MT    F               V+ VSI DP++LL  S G I++
Sbjct: 599 NLGFKKLMTISFDF--------------EVVHVSILDPFILLTNSKGEIKI 635


>gi|154421858|ref|XP_001583942.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
 gi|121918186|gb|EAY22956.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
          Length = 1297

 Score = 84.3 bits (207), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 153/368 (41%), Gaps = 49/368 (13%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV   +  G +  +     GG       DSII+  + +K+ VL+  D+   L+ T  H
Sbjct: 48  LRLVWEKKFWGEIFGVYRHKSGG-----EYDSIIVGCDTSKVIVLQVIDN--DLKETEYH 100

Query: 161 CFESPEWLHLKRGRES--------FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG 212
            F  P        +               ++  DP G C  +L+    + +L  +     
Sbjct: 101 EFNRPGPPEPDPPKPERPFDISTRLRNKTIMDADPTGTCLALLLAQNILYVLPLANK--- 157

Query: 213 LVGDEDTFGSGGGFSAR---IESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHE 265
            +  E T  +G  + +    I+ +   ++   D K     ++D +F+ GY  P + I+HE
Sbjct: 158 -IKIESTEKAGDEYHSSWKVIKDAFAYDVHT-DFKSPLYRIRDMVFLDGYKNPTLAIIHE 215

Query: 266 RELTWAGRVSWKHHTCMISALSISTTLKQHPLI---------WSAMNLPHDAYKLLAVPS 316
              TW+ R+  +  T  +S +S     K+  LI         W++  LPH+++ L+ VP 
Sbjct: 216 LIPTWSVRLPLQKSTVAVSIVSPPLKKKETVLISASIDKVTMWTSRALPHNSFGLVHVPD 275

Query: 317 PIGGVLVVGANTIHYHSQSASCALALNNYA-----VSLDSSQELPRSSFSVELDAAHATW 371
           PIGG LV+  N I Y   +   ALALN  A     V +D +   P      EL +   T 
Sbjct: 276 PIGGFLVLSKNAIIYMDHTNIVALALNKLAYLDDEVPVDITANGPGCH---ELYSKVGTA 332

Query: 372 LQNDVALLSTKTGDLVLLTVVYDGRVV-----QRLDLSKTNPSVLTSDITTIGNSLFFLG 426
           +     LL+     L +LT+ Y+G  V           + +PS   S   T   SL F+G
Sbjct: 333 IDKSHILLTVDQHYLSILTLHYNGVKVTNLSLNVNLNLEFHPSCFLSLNYTNNRSLVFMG 392

Query: 427 SRLGDSLL 434
           S   DS L
Sbjct: 393 STTHDSTL 400


>gi|190348091|gb|EDK40482.2| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 1320

 Score = 84.0 bits (206), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 127/611 (20%), Positives = 237/611 (38%), Gaps = 68/611 (11%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L+  ++L+G V +L    +    +S   D I++A + AK+S++ +D   H +   S+H
Sbjct: 52  LRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESLH 108

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV-----G 215
            +E           E+     L+ V+P       + +   +  L  S            G
Sbjct: 109 YYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDEG 164

Query: 216 DEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAG 272
           +        GF + +  SS +++ + L+  +  + D  F+H Y EP + IL  +  TW G
Sbjct: 165 ETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSSKAATWTG 224

Query: 273 RVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIHY 331
            +           ++I    K    +    NLP D  +L+ +  P+ G L++G N  IH 
Sbjct: 225 LLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIHV 284

Query: 332 HSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLVL 388
            +      LA+N Y   + +S +    ++  ++ L+      L ND   LLS  TG L  
Sbjct: 285 DNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLSTGSLFH 344

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
           +    D + ++R  L     +  +S D+T  G   F       DS L+     +G S L 
Sbjct: 345 INFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL- 397

Query: 448 SGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVR 506
                    +E      + +  +  S+  +DM   EEL    +A       Q    +   
Sbjct: 398 ---------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFKH 448

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT-------- 552
           D L+N GP+ DF+ G        A  I+   N   +   G      C  I+         
Sbjct: 449 DELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPIIR 508

Query: 553 ---VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQG 609
               + + +R  N ++  +   DD      I  +E     L++ D + +           
Sbjct: 509 SSLTFSQVNRMWNINNKYLITSDDVNSKSEIFQIEKSYSRLKSKDFIND----------E 558

Query: 610 RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            TIA   L   + ++Q+  +   + +  +  + +SF     E     +   ++S ++ D 
Sbjct: 559 MTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHDD 614

Query: 670 YVLLGMSDGSI 680
           Y+++  + G +
Sbjct: 615 YLMIFFASGEV 625


>gi|398397855|ref|XP_003852385.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici
           IPO323]
 gi|339472266|gb|EGP87361.1| hypothetical protein MYCGRDRAFT_100364 [Zymoseptoria tritici
           IPO323]
          Length = 1333

 Score = 83.2 bits (204), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 133/320 (41%), Gaps = 32/320 (10%)

Query: 97  SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
           + + L L+  Y L G V S+A +     D     ++I+LAF++AK+S++E+D   H +  
Sbjct: 45  AQSKLVLIGGYPLAGTVTSIARVKT--LDTRTGGEAILLAFKNAKLSLIEWDPENHRIST 102

Query: 157 TSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL------------ 204
            S+H +E    +    G        ++ VDP  RC  +     Q+ IL            
Sbjct: 103 VSIHYYEGENVIAQPYGPSLGEYESILTVDPGSRCAALKFGARQLAILPFRQFGDELLGE 162

Query: 205 ------KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYI 256
                  A+ G +    D    G         + S V+ L  LD  + H  D  F+H Y 
Sbjct: 163 EEGEFENANDGTTSKKHDAMQNGEDEAEQTPYKQSFVLPLTTLDPALSHTIDLAFLHEYR 222

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           EP   I+             +      +  ++    K    + +  NLP   +K++ +PS
Sbjct: 223 EPTFGIISSAIEPSYALFDERKDILSYTVFTLDLEQKASTNLITVPNLPSTLWKVVPLPS 282

Query: 317 PIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND 375
           PIGG L++G N  IH      + A A+N +A+         +S  +++L+          
Sbjct: 283 PIGGALLIGTNEFIHVDQSGKANATAVNEFAMKESDFGMADQSGLNLKLEGC-------S 335

Query: 376 VALLSTKTGDLVLLTVVYDG 395
           V +L+  TG+  +L V+ DG
Sbjct: 336 VEILNASTGE--MLVVLRDG 353


>gi|159155577|gb|AAI54419.1| Cpsf1 protein [Danio rerio]
          Length = 400

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 68/240 (28%), Positives = 110/240 (45%), Gaps = 51/240 (21%)

Query: 483 ELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG--------LRINADAS---- 529
           E+ +YGS A + T+ A  T+SF V DS++NIGP    S G         + N +      
Sbjct: 54  EIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCASASMGEPAFLSEEFQTNPEPDLEVV 111

Query: 530 -ATGISKQSNYELV------------ELPGCKGIWTVYHKSSR---------GHNADSSR 567
             +G  K     ++            ELPGC  +WTV +   +         G + +  +
Sbjct: 112 VCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEK 171

Query: 568 MAAY---DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
                  D + H +LI+S E  TM+L+T   + E+  S  +  QG T+ AGN+   + +I
Sbjct: 172 REPTIEDDKKKHGFLILSREDSTMILQTGQEIMELDTS-GFATQGPTVYAGNIGDNKYII 230

Query: 625 QVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV 684
           QV   G R+L+G      L F P +         S ++  S+ADPYV++  ++G + + V
Sbjct: 231 QVSPMGIRLLEG---VNQLHFIPVDL-------GSPIVHCSVADPYVVIMTAEGVVTMFV 280


>gi|146415762|ref|XP_001483851.1| hypothetical protein PGUG_04580 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 1320

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 127/612 (20%), Positives = 237/612 (38%), Gaps = 68/612 (11%)

Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
            L L+  ++L+G V +L    +    +S   D I++A + AK+S++ +D   H +   S+
Sbjct: 51  KLRLLDQFKLYGTVTAL---KKFRTVDSPDLDYILVATKAAKVSMIRWDHQTHSIATESL 107

Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV----- 214
           H +E           E+     L+ V+P       + +   +  L  S            
Sbjct: 108 HYYEKSIQ---AATYETLDETELI-VEPNRYSCFCVRFKNLLTFLPFSTPDDDDDDMDDE 163

Query: 215 GDEDTFGSGGGFSARI-ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
           G+        GF + +  SS +++ + L+  +  + D  F+H Y EP + IL  +  TW 
Sbjct: 164 GETKKQKYVPGFDSEVFGSSFMVDAQTLEPSIGTIVDMQFLHNYREPTVAILSLKAATWT 223

Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN-TIH 330
           G +           ++I    K    +    NLP D  +L+ +  P+ G L++G N  IH
Sbjct: 224 GLLPKVKDNITYHVMTIDLATKATTTVLKIENLPFDIDRLVPLSHPLNGCLLLGCNEIIH 283

Query: 331 YHSQSASCALALNNYAVSLDSSQE--LPRSSFSVELDAAHATWLQND-VALLSTKTGDLV 387
             +      LA+N Y   + +S +    ++  ++ L+      L ND   LLS  TG L 
Sbjct: 284 VDNGGIVRRLAVNKYTEDITASVKNYHDQTDLNLMLENCAVIPLPNDNRVLLSLLTGSLF 343

Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTS-DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
            +    D + ++R  L     +  +S D+T  G   F       DS L+     +G S L
Sbjct: 344 HINFDVDIKTIKRFALEPVLETHYSSVDLTYPGQPAFL------DSNLLFIANNNGNSPL 397

Query: 447 SSGLKEEFGDIEADAPSTKRL-RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
                     +E      + +  +  S+  +DM   EEL    +A       Q    +  
Sbjct: 398 ----------LEVKYLRNEEVTEKVQSNGKEDMDGDEELYDDDNAGEKIVIRQGDIKYFK 447

Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG------CKGIWT------- 552
            D L+N GP+ DF+ G        A  I+   N   +   G      C  I+        
Sbjct: 448 HDELINHGPVSDFTLGKYSTEKFKANLINPNLNDVCIVSNGGSHKQSCLNIFAPSVQPII 507

Query: 553 ----VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
                + + +R  N ++  +   DD      I  +E     L++ D + +          
Sbjct: 508 RSSLTFSQVNRMWNINNKYLITSDDVNLKSEIFQIEKSYSRLKSKDFIND---------- 557

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
             TIA   L   + ++Q+  +   + +  +  + +SF     E     +   ++S ++ D
Sbjct: 558 EMTIAMHELNNGKYILQITPKHIEVFNSKF-KRHMSF---EDELKDAMKEDQIISSTVHD 613

Query: 669 PYVLLGMSDGSI 680
            Y+++  + G +
Sbjct: 614 DYLMIFFASGEV 625


>gi|328864890|gb|EGG13276.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
          Length = 1627

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 52/103 (50%), Gaps = 16/103 (15%)

Query: 912  QRITIFKNISGHQGFFLSG-SRPCWCMVFRERLRVHPQ---------------LCDGSIV 955
            +RI  F NI   +G F+SG S P W    +   R+HP                     I 
Sbjct: 1018 RRIIPFSNIGNKRGIFVSGVSTPIWIFSEKNFPRIHPMKQQQQTTSSSSSSSSSSKRPIT 1077

Query: 956  AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
             FT  HN+NC HGFIY    G+L IC+LP G+ Y+N WP++K+
Sbjct: 1078 TFTTFHNINCKHGFIYFDHTGMLCICRLPDGTNYENEWPIRKL 1120


>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
          Length = 1092

 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 128/601 (21%), Positives = 236/601 (39%), Gaps = 127/601 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+++  ++   L+ V    ++G + ++ + +  GA+    R+S+ +  E  K  ++E+D 
Sbjct: 37  RLVIYTLTPEGLQPVLDTGIYGRIAAIELYTVAGAE----RESLYILTERLKFCIVEYDS 92

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR----GPLVKVDPQGRCGGVLVY-GLQMIIL 204
           S   L   +M   +           +S  R    GP+  +DP+ R  G L+Y GL  +I 
Sbjct: 93  STGELITKAMGDVQ-----------DSVGRPVDGGPIAHIDPERRMIGFLLYDGLFKVIP 141

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
             ++ G               F+ R+E   V++++           F++GY +P +V+L 
Sbjct: 142 IDTRNGQ----------LREAFNIRLEELQVLDVQ-----------FLYGYAQPTIVLL- 179

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGV 321
                      ++    M    +   +++    I   WS   +   A  ++ VP+PIGG 
Sbjct: 180 -----------YQDPKEMRHLKTYQVSIRDKDFIAGPWSQTGVEIGATMIIPVPTPIGGC 228

Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
           +++G  TI Y +         +   + +D +  + R+   ++ D            LL  
Sbjct: 229 ILLGEQTISYLNGDKG-----DTKTIHMDMT--VIRAWGKIDEDGRR--------YLLGD 273

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
             G L +L + +DG  V  L L     +     IT + + + F+GS  GDS L++     
Sbjct: 274 HLGQLYVLVLEFDGNKVLGLKLDTLGETSSAKTITYLDSGVVFIGSCFGDSQLIRL---- 329

Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
                                  K    S+ + L+   N   +  +       +   +  
Sbjct: 330 --------------------HPDKDENDSNIEVLESFTNLGPIQDFCVVDLERQGQGQVV 369

Query: 502 SFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH 561
           + +        G LKD S  LR+  +    GI++Q+    VELPG KG+W++        
Sbjct: 370 TCS--------GTLKDGS--LRVVRN--GIGINEQAA---VELPGIKGLWSLRE------ 408

Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
                   + D +Y  YLI S    T VLE AD     TE   +    +TI   N+ G  
Sbjct: 409 --------SIDAQYDKYLIQSFVNETRVLEIADEELSETEIDGFDHNAQTIFCSNVLG-D 459

Query: 622 RVIQVFERGARILDGSYMTQDLSFGPSNSE--SGSGSENSTVLSVSIADPYVLLGMSDGS 679
            ++Q+ E   R++          + P N E  + +G     V+  S     + L +S+G 
Sbjct: 460 CLLQITEVSLRLVSTKSKQLLKEWFPPNGERITVAGGNVQQVVLTSGKRTLIYLDVSNGD 519

Query: 680 I 680
           +
Sbjct: 520 V 520


>gi|453082807|gb|EMF10854.1| CPSF_A-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 1349

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 144/680 (21%), Positives = 266/680 (39%), Gaps = 93/680 (13%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NLVV   ++++++       G K + N G  ++ VL           V  Y L G V S+
Sbjct: 28  NLVVAKTSLLQVF-------GVKAAGNDGGNEKLVL-----------VGEYSLAGTVTSI 69

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +     D     ++++L+F+DAK+S++E+D   + +   S+H +E    +    G   
Sbjct: 70  ARVKT--LDTKSGGEAVLLSFKDAKLSLVEWDPENYRISTISLHFYEGDNVISAPFGPPL 127

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQ---------------GGSGLVGDEDT-- 219
                ++ VDP  RC  +     Q+ IL   Q                   L   + T  
Sbjct: 128 ADCDSILTVDPSSRCAALKFGARQLAILPFRQFGDELAGEEEEGEFDADHALATSKRTES 187

Query: 220 --FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
               +G       ++S  + L  LD  + H     F+H Y EP   IL          + 
Sbjct: 188 VPHANGDTEHTPYKASFTLALTALDPSVSHAVHLAFLHEYREPTFGILSATVEPSYSLLE 247

Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQS 335
            +      + L++    +    + S   LP   ++++ +P P+GG L++G N + +  QS
Sbjct: 248 ERKDILTYTVLTLDLEQRASTNLISVPKLPSTLWEVVPLPLPVGGALLLGTNELVHVDQS 307

Query: 336 ASC-ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVV 392
               A A+N +A          +S  +++L+      L +     L+ T  G L +L+  
Sbjct: 308 GKANATAVNEFAKLESDFGMADQSHLNLKLEDCRVEVLDSKTGELLIVTNDGSLAILSFQ 367

Query: 393 YDGRVVQRLDLSKTNPSVLTSDITT-------IGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
             GR +  L++ +      ++ I T       +  S  F+GS  G S L+ ++    TS 
Sbjct: 368 MHGRSISALNVKRATSENGSTTIHTAPSCMARLEGSKIFIGSEDGASSLLGWS--RPTSA 425

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
           L+   K     +       +       D        E      S +  T +AQ TFS  +
Sbjct: 426 LNR--KRSHAQMLDKEADDEDEEMEEDDDDLYDAAPEPKKRASSETAVTSTAQYTFS--I 481

Query: 506 RDSLVNIGPLKDFSYG--------LRINADASATGISKQS--NYELV-------ELPGCK 548
            D L++ GP+ +   G        L I A A     S+ +  + ++V       +L   +
Sbjct: 482 IDELLSTGPIHEVCLGRSGPWKDRLEIAAGAGRKQASRLTLMHRDIVPTVRRKCKLGAAR 541

Query: 549 GIWTVYHKSSRGHNADSSRMA-AYD-DEYHAYLIISL--EARTMVLETADLLTEVTESVD 604
             W +  K       +   +   +D D+   Y I S   +  +    +A       E++D
Sbjct: 542 ATWALRPKQRNAALPEYDNLLFVFDGDDTKVYDIPSQDEDGSSYTERSAPEFESAGETLD 601

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARI-LDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
                 T+A G +  + R  ++    A++ LD           P   E     E+ +++ 
Sbjct: 602 M----ATVADGTIVVQTRRTELRTYNAKLGLD--------QIIPMTDE--ETDEDLSIVH 647

Query: 664 VSIADPYVLLGMSDGSIRLL 683
           ++++DPYVL+   D S+++L
Sbjct: 648 IAVSDPYVLVIRGDNSVQVL 667


>gi|224135035|ref|XP_002321967.1| predicted protein [Populus trichocarpa]
 gi|222868963|gb|EEF06094.1| predicted protein [Populus trichocarpa]
          Length = 60

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 36/48 (75%), Positives = 41/48 (85%)

Query: 684 VGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWL 731
           + DPSTC VSV TP+A +SSKK VS+CTLYHDKGPEP LRKTS +AWL
Sbjct: 1   MTDPSTCMVSVNTPSAFQSSKKSVSACTLYHDKGPEPLLRKTSPNAWL 48


>gi|254580509|ref|XP_002496240.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
 gi|238939131|emb|CAR27307.1| ZYRO0C13816p [Zygosaccharomyces rouxii]
          Length = 1331

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 140/656 (21%), Positives = 271/656 (41%), Gaps = 124/656 (18%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L   ++  G +  LA++ Q  +      D ++L    AKISV+ +D++ + +   S+H
Sbjct: 48  LILTHEFKFEGRITDLAVVPQKDSP----LDCLLLCTSIAKISVVRYDEASNSIETLSLH 103

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
            +E        R     A+   ++VDP  RC   L++   +I L   Q  S         
Sbjct: 104 YYEDS---FKDRSILELAKESTMRVDPGKRCA--LLFNNDVIALLPLQTTSLNDGEEEDE 158

Query: 212 ---GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
                  D+    + G  +A    S + N ++L  DM +V D  F+  +  P + ++ E 
Sbjct: 159 DMDDERPDKRQKNNKGRITA---PSAIFNAKELHQDMNNVIDVTFLRNFTRPTLAVIFEN 215

Query: 267 ELTWAGR-------VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
           +  WAG        V++   T  +++   ST +K   +I +   L  D + ++ + +   
Sbjct: 216 KPVWAGTSQVLPLPVTYMAFTLEVTSNEQSTDIKS-TVIATVKELSWDFHTMIPIAN--- 271

Query: 320 GVLVVGANTIHY--HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHA-TWLQND 375
           G ++VG+N + Y  ++ S    + LN+YA  ++  ++ + RS   + L       W  +D
Sbjct: 272 GCIIVGSNEMAYIDNTGSLQSIIFLNSYANKNMKKARIVDRSKSKILLHKPTTYNWSVSD 331

Query: 376 VALLSTKTGDLVLLT----------VVYDGRVVQRLDL-----------SKTNPSVLTSD 414
                ++TG+ +L+           + Y+GR++ + D+           + +N + ++  
Sbjct: 332 ---QKSETGETLLIMDHQAAFYYIQLEYEGRLLTKFDIINLPIVNDTLKNNSNATCISRL 388

Query: 415 ITTI-GNSL-FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
            +T+ GN +  F+G R GD+ +++       + L + ++      E  +P    + +   
Sbjct: 389 NSTLSGNYVDLFVGFRSGDASVLRL------NNLKAAIESRDEHKEITSPPENDIEKFED 442

Query: 473 DALQDMVNGEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADAS 529
              +D +  EE S       N E   +T   F   V  SL NI P+   + G   + D  
Sbjct: 443 ---EDDLYSEEASDADKEKENKEVVVETVLPFDIEVLSSLRNIAPITSLTPGKICSVDKF 499

Query: 530 ATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-EARTM 588
             G+S  +  E V L    G  T       G +    +M+   +   A   IS+ +   +
Sbjct: 500 VEGLSNPNRNE-VSLVATSGNGT-------GSHLTEIQMSVRPEVQLALKFISITQMWNL 551

Query: 589 VLETADLLTEVTES---------VD----YFVQGR-----TIAAGNLFGR-RRVIQVFER 629
            ++  D     T+S         +D     + +GR     T  + ++FG  +R++QV   
Sbjct: 552 KIKNKDKYLITTDSNKNKSDIYLIDKNFALYKEGRFRRDATTVSISMFGSDKRIVQVTTN 611

Query: 630 GARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
              + D ++    T    F               V+ VS+ DPY+L+ +S G I++
Sbjct: 612 HLYLYDTNFKRLTTMKFEF--------------EVVHVSVMDPYILITVSRGDIKV 653


>gi|255720869|ref|XP_002545369.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240135858|gb|EER35411.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 1351

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/318 (22%), Positives = 134/318 (42%), Gaps = 29/318 (9%)

Query: 222 SGGGFSAR--IESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
           +G  F  R   +SS +I+   LD  +  V D  F+H Y EP + +L  +   WAG +   
Sbjct: 198 NGNSFEPRQFYDSSFIIDATTLDSTVGTVIDMQFLHNYREPTIGVLSSKSEVWAGNLLKS 257

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSA 336
                   L++    K    ++   NLP++  +++ +PSP+ GV++VG N  IH  +   
Sbjct: 258 KDNIQFQVLTLDLNSKSTVSVFKIDNLPYEIDRVIPLPSPLNGVILVGCNELIHVDNGGV 317

Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTV 391
              +A+N +      S+ S Q+  +S  +++L+ +    + ND   LL  KTG+   +  
Sbjct: 318 MKRIAVNKFTGLTTASIKSFQD--QSDLNLKLEDSTIVPIPNDHRVLLVLKTGEFYYINF 375

Query: 392 VYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
             DG+ ++R+ +   +  +          ++  +  +L F  +  G+S LVQ       S
Sbjct: 376 ELDGKSIKRVHIDVIDKKLYEKVKLTYPGEVAVLDKNLLFFANSSGNSPLVQVKYRDSLS 435

Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
               G   E  D E +               ++    E+ +L          ++    F 
Sbjct: 436 DAKIGAPIEESDEEDETQKADEDDDEDDLYKEEEEEEEQKNL----------SKTHIEFV 485

Query: 505 VRDSLVNIGPLKDFSYGL 522
             D L+N GP   F+ G+
Sbjct: 486 YHDELINNGPSSSFTLGV 503


>gi|241954348|ref|XP_002419895.1| subunit of the mRNA cleavage and polyadenylation factor, putative
           [Candida dubliniensis CD36]
 gi|223643236|emb|CAX42110.1| subunit of the mRNA cleavage and polyadenylation factor, putative
           [Candida dubliniensis CD36]
          Length = 1420

 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 55/236 (23%), Positives = 110/236 (46%), Gaps = 17/236 (7%)

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
           +EDT G+        +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WAG 
Sbjct: 197 EEDTNGTNKESHLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGN 256

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYH 332
           +           L++    K    ++   NLP++  +++ +PSP+ G L+VG N  IH  
Sbjct: 257 LIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIVPLPSPLNGTLLVGCNELIHVD 316

Query: 333 SQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLV 387
           +      +A+N +      S+ S Q+  +S  +++L+      + +D   LL  +TG+  
Sbjct: 317 NGGVLKRIAVNKFTRLITASIKSFQD--QSDLNLKLENCSIVPIPDDHRVLLILQTGEFY 374

Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
            +    DG+ ++R+ +   +             ++  +  ++ F+ +  G+S L+Q
Sbjct: 375 FINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAVLDKNMLFIANSNGNSPLIQ 430


>gi|68471006|ref|XP_720510.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           [Candida albicans SC5314]
 gi|74591422|sp|Q5AFT3.1|CFT1_CANAL RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|46442380|gb|EAL01670.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           [Candida albicans SC5314]
          Length = 1420

 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 104/221 (47%), Gaps = 17/221 (7%)

Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
           +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WAG +           L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY-- 345
              LK    ++   NLP++  +++ +PSP+ G L+VG N  IH  +      +A+N +  
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336

Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
               S  S Q+  +S  +++L+      + +D   LL  +TG+   +    DG+ ++R+ 
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394

Query: 403 LSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
           +   +             ++  +  ++ F+ +  G+S L+Q
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 435


>gi|50305395|ref|XP_452657.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74606921|sp|Q6CTT2.1|CFT1_KLULA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|49641790|emb|CAH01508.1| KLLA0C10274p [Kluyveromyces lactis]
          Length = 1300

 Score = 76.6 bits (187), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 136/639 (21%), Positives = 257/639 (40%), Gaps = 111/639 (17%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           A  L L   ++L G +  + +L Q G   S  +   IL+   +K+S++ FD     L   
Sbjct: 45  AQKLVLAYEWKLAGKIIDMQLLPQIG---SPLKMLAILS-SKSKVSLVRFDPVAESLETL 100

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGD 216
           S+H +   ++++L     S     ++ VDP  RC  +LV+   ++ IL        +  D
Sbjct: 101 SLHYYHD-KFVNL--STSSLKTESIMAVDPLFRC--LLVFNEDVLAILPLKLNTEDMEID 155

Query: 217 EDTFGSGGGFSARIESSHVINLRDLDM---------KHVKDFIFVHGYIEPVMVILHERE 267
           ED  G     + R++ +  I    + M         KHV D  +++ + +P + IL++  
Sbjct: 156 EDENGIKEPMAKRLKRNQGITSDSIIMPISSLHKSLKHVYDIKWLNNFSKPTVGILYQPV 215

Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
           L W G      +T     LS+    ++  +I    +LP+D + L  VP   G VL +G N
Sbjct: 216 LAWCGNEKVLGNTMRYMVLSLDVEDEKTTVIAELADLPNDLHTL--VPLKRGYVL-IGVN 272

Query: 328 TIHYHSQSA---SCALALNNYAVSLDSSQELPRSSFSVELDAA----HATWLQNDVALLS 380
            + Y S S    SC + LN +A S  +++    S  ++ L  +    +    ++D+ +L 
Sbjct: 273 ELLYISASGALQSC-IRLNTFATSSINTRITDNSDMNIFLSKSSIYFYKALKRHDLLILI 331

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL-----GD---- 431
            +   +  +    +G ++ + D  +            I N + F  SRL     GD    
Sbjct: 332 DENCRMYNIITESEGNLLTKFDCVQ----------VPIVNEI-FKNSRLPLSVCGDLNLE 380

Query: 432 --SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
              +L+ F  G    +    LK  F        + ++L  +  D        E  +LYG 
Sbjct: 381 TGRVLIGFLSGDAMFLQLKNLKVAFA-------AKRQLVETVDDDDD-----EYSALYGE 428

Query: 490 ASNNTES----AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
           + NNT +     Q+ F  ++ DS+ NIGPL   + G   + + +   +   +  E   + 
Sbjct: 429 SQNNTHTRIVETQEPFDISLLDSIFNIGPLTSLTIGKVASVEPTIQRLPNPNKDEF-SIV 487

Query: 546 GCKGI-----WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
              G+      T  H + + H   + +  +    ++    + ++ +   L T D   E +
Sbjct: 488 ATSGVGRGSHLTALHSTVQPHIEQALKFTSATRIWN----LKIKGKDKYLVTTDADKEKS 543

Query: 601 E------------SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-----MTQDL 643
           +            + D+    RTI    +   +R++QV   G  + D  +     +T D+
Sbjct: 544 DVYQIDRNFEPFRAQDFRKDSRTIGMETMDDDKRILQVTSGGLYLFDVDFKRLARLTIDI 603

Query: 644 SFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
                            ++   I DPY+L   + G+I++
Sbjct: 604 E----------------IVHACIIDPYILFTDARGNIKI 626


>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
          Length = 1137

 Score = 76.6 bits (187), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 109/467 (23%), Positives = 182/467 (38%), Gaps = 108/467 (23%)

Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
           R+S  R   +     +DP+GR  G+ +Y     ++    G   L   +DTF         
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---KDTF--------- 154

Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
                  N+R LD   V D  F+HGY +P + +L+E             H      L   
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYTKPTICVLYED-------YKAARHIKTYHILLKE 199

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
               + P  WS  N+   A  L+ VP+P+GGVL+V   TI YH+ S   A+ + +  + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPVGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             + +   S F                 LL+ + G L ++ + + G+ V  + L     +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVTGVHLEVLGET 300

Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
            + S ++ + N + F+GS  GDS L++              ++E G              
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIKLNAD----------RDENG-------------- 336

Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           S  + L   VN   +  +     + +   +  T S A +D     G L+    G+ IN  
Sbjct: 337 SYIEVLDTYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           ASA            ELPG KG+W +               AA  D+Y     +S E R 
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKYLLQSYVS-EIRI 427

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           + +   D + E  + +  F   +T+   N++G    +QV E   R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDVW-LQVTESEVRLI 471


>gi|167384458|ref|XP_001736962.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165900458|gb|EDR26769.1| hypothetical protein EDI_171140 [Entamoeba dispar SAW760]
          Length = 836

 Score = 76.3 bits (186), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 83/326 (25%), Positives = 141/326 (43%), Gaps = 50/326 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+S+L +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSILRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTNPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++                    GF    ++S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIPL------------------GFD---KTSYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL---KQHPLIWSAMNL 304
            I + GY  P +  LH +  TW GR V+    T  I  LS+   +   KQ  +   +   
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIVLSLEPDVIHEKQDIVATVSYQF 228

Query: 305 PHDAYKLLAVPS--PIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF 360
            +  Y  L +    P  G+L++  N+I Y S ++  S  L    + V +  +   P SSF
Sbjct: 229 SYVPYNALQIVDCYPTNGILILTINSIIYLSTTSFESFILPFGKFFV-IPKNNNRPLSSF 287

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSVL 411
            +       T + N V  +   T  L ++         V+   +  R+ D+  TN P   
Sbjct: 288 QI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYKY 344

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
                TI ++  F+GS + DS +  +
Sbjct: 345 HPTTFTISSNHLFIGSTVHDSYIYNY 370


>gi|302403950|ref|XP_002999813.1| cft-1 [Verticillium albo-atrum VaMs.102]
 gi|261361315|gb|EEY23743.1| cft-1 [Verticillium albo-atrum VaMs.102]
          Length = 1349

 Score = 76.3 bits (186), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 117/507 (23%), Positives = 199/507 (39%), Gaps = 75/507 (14%)

Query: 233 SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           S V+ L  LD  + H   F F+H Y EP + I+              H T  +   ++  
Sbjct: 197 SFVLALPQLDPEILHPVHFAFLHEYREPTLGIISSSNRRLKMEPQMDHFTFKV--FTVDL 254

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL 349
             K    I +  NLP    K++A+  P+GG L++G N  IH      +  +A+N YA  +
Sbjct: 255 LQKASTAILTVSNLPQSLKKVVALSKPMGGALLIGENELIHIDQAGKAHGVAVNPYAAKM 314

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDL---- 403
                  +S   + L+      +  D    LL T+ G++ ++T   DGR V  + +    
Sbjct: 315 TKFPLADQSELKLRLEHCEVELMSPDNGEMLLVTRHGEMAVVTFKMDGRSVSGVSVKVVA 374

Query: 404 SKTNPSVL---TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
           ++    +L    + +T +  +  F G+  GDS ++ +   S   + ++  K    D   +
Sbjct: 375 TENGGDILPFRAACLTKVSKNSMFYGTIGGDSQVIGW---SRQHVQTARKKARLLD---E 428

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           +           D   D + GE       ++    +      F V DSL+++ P+ D +Y
Sbjct: 429 SLDYDLDEDELDDDDDDDLYGEGTVAPQPSAAAGSAKGGDVVFRVHDSLLSLSPIMDMAY 488

Query: 521 G----------------LRINAD-ASATGISKQSNYELV------------ELPGCKGIW 551
           G                +R   D   A G  +  +  L+            E P  +G W
Sbjct: 489 GKTAFFPGSEEAKNSEGVRSELDLVCAVGRHRGGSLALINQHIQPRVIGRFEFPEARGFW 548

Query: 552 TV-----YHKSSRGHNADSSRMAAYDD-----EYHAYLIISLEARTMVLETADLLTEVTE 601
           T        KS +G     + +A  +D     +Y  ++I++ +      ET+D+      
Sbjct: 549 TTRVQKTIAKSLQGEKG--ANLAVGNDYGSVTQYDKFMIVA-KVDLDGYETSDVYALTGA 605

Query: 602 SVDYF-------VQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSNSESG 653
             +           G TI AG +    R+IQV     R  DG   ++Q L     + E+G
Sbjct: 606 GFEALSGTEFDPAAGLTIEAGTMGNDMRIIQVLRSEVRCYDGDLGLSQILPM--LDEETG 663

Query: 654 SGSENSTVLSVSIADPYVLLGMSDGSI 680
           +      V+S SI DPY+LL   D SI
Sbjct: 664 A---EPRVISASIVDPYLLLLREDSSI 687



 Score = 42.7 bits (99), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 35/138 (25%), Positives = 66/138 (47%), Gaps = 30/138 (21%)

Query: 57  NLVVTAANVIEIYVVRV-------QEEGSKESKNSGETKRRVLMD--GISAA-------- 99
           NL+V+  ++++I+ V+         +  +K S  +GET  R + D  G+ +A        
Sbjct: 28  NLIVSKGSLLQIFAVKTVSTEIDTSQIQAKSSSKAGETYDRRINDDDGLESAFLGGDGML 87

Query: 100 ---------SLELVCHYRLHGNVESLAILSQGGADNSRRR-DSIILAFEDAKISVLEFDD 149
                     L LV  Y +HG +  LA +      +SR   +++++    A++S+L++D 
Sbjct: 88  MRADRTTNTRLVLVAEYPVHGVIAGLARVK---IQSSRSGGEALLVHSRTARLSLLQWDP 144

Query: 150 SIHGLRITSMHCFESPEW 167
             HG+   S+H +E  EW
Sbjct: 145 EKHGVEDVSIHFYEKEEW 162


>gi|363750592|ref|XP_003645513.1| hypothetical protein Ecym_3197 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356889147|gb|AET38696.1| Hypothetical protein Ecym_3197 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 1318

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 135/646 (20%), Positives = 266/646 (41%), Gaps = 105/646 (16%)

Query: 97  SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
           +   L L   ++L G+V S+A++ Q G++       +++     K+S+L+FD     L  
Sbjct: 44  AKGQLVLSYEWKLSGHVHSMALIPQPGSE----LYCLVILTGCGKLSILKFDHMSQSLDT 99

Query: 157 TSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRCGGV----LVYGLQMIILKASQGGS 211
            S+H +E   + L L       +  P + VD   RC  V     +  L + + K  +   
Sbjct: 100 LSLHYYEDKFKELSLLE----ISNTPSLIVDRSFRCLLVRNNDCIAILPLNVTKEEEEEE 155

Query: 212 GLVGDEDTFGSGGGFSAR------------IESSHVINLRDL--DMKHVKDFIFVHGYIE 257
                ++   +GG FS +            + SS ++    L  D+K+V D  F+HG+ +
Sbjct: 156 EDNEKDEDRSNGGRFSFKRHKLNGGSVKQFVNSSTIMPASHLHSDIKNVLDVQFLHGFNK 215

Query: 258 PVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP 317
           P + IL++  L W+G    +  T  +  LS+    ++  +I     LP+D + L+ + + 
Sbjct: 216 PTLAILYQPILAWSGNEKLRSQTVKVIILSLDFEDEKSTVINIIQGLPNDLHTLIPLSN- 274

Query: 318 IGGVLVVGANTIHYHSQSASC--ALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ-- 373
               +VVG N + Y   + +    ++LN+++ ++ +++    SS     +     +    
Sbjct: 275 --ASIVVGVNELIYIDNTGALQGTVSLNSFSKTVLNTKVKDNSSLQAFFNRPVCQYTTIS 332

Query: 374 --NDVALLSTKTGDLVLLTVVYDGRVVQ-----RL----DLSKTN--PSVLTSDITTIGN 420
              D+ LL  +   +  + +  +GR+V      RL    D+ K N  P+ +  D+     
Sbjct: 333 KGKDIMLLMDEKSQMYNVIIESEGRLVTAFNCVRLPIVNDIFKNNHLPTCICGDVDLETG 392

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
           +L F+G + GD++ V+      +S+ S G   E   +EAD    +               
Sbjct: 393 NL-FIGFKSGDAMRVRLN-NLRSSLASKGNVVE--TMEADEDYDE--------------- 433

Query: 481 GEELSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
                LYG ++   +    T   F     D+L+NIGPL   + G   + + +   ++  +
Sbjct: 434 -----LYGGSTEVEKKNMDTETPFDIETLDNLINIGPLTSLAVGKVSSIEPTIAKLTNPN 488

Query: 538 NYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD------------DEYHAYLIISLEA 585
             EL  +    G  T  H +   +    +   A               +   YL+ +  +
Sbjct: 489 RCEL-SIVATSGNSTGSHLTVFENTIVPTVEKALKFISVTQIWNLKIKDKDKYLVTTDSS 547

Query: 586 RTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQ 641
           ++   + + D   +  +S D+     T++       +R++QV  +G  + D ++   MT 
Sbjct: 548 QSKSDIYSIDRDFKPFKSFDFKKNDTTVSTAVTGAGKRIVQVTSKGVYLFDINFKRMMTM 607

Query: 642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP 687
           +  F               V+ V I DP++LL  S G I++   +P
Sbjct: 608 NFDF--------------EVVHVCINDPFLLLTNSKGDIKIYELEP 639


>gi|238881599|gb|EEQ45237.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1423

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/238 (23%), Positives = 110/238 (46%), Gaps = 19/238 (7%)

Query: 216 DEDTFGSGGGFSARI--ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWA 271
           +ED  G+      R+  +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WA
Sbjct: 203 EEDKNGTTTNQEPRLFYDSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWA 262

Query: 272 GRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IH 330
           G +           L++    K    ++   NLP++  +++ +PSP+ G L+VG N  IH
Sbjct: 263 GNLIKSKDNIQFQVLTLDLNSKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIH 322

Query: 331 YHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGD 385
             +      +A+N +      S  S Q+  +S  +++L+      + +D   LL  +TG+
Sbjct: 323 VDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGE 380

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQ 436
              +    DG+ ++R+ +   +             ++  +  ++ F+ +  G+S L+Q
Sbjct: 381 FYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQ 438


>gi|449019486|dbj|BAM82888.1| similar to cleavage and polyadenylation specificity factor subunit
           [Cyanidioschyzon merolae strain 10D]
          Length = 1880

 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 112/499 (22%), Positives = 194/499 (38%), Gaps = 128/499 (25%)

Query: 243 MKHVK--DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI----STTLKQHP 296
           + HV+  D  F+ G   P MV+L+E   TWAGRV    ++C ++A+ +    +    + P
Sbjct: 345 LGHVRILDCCFLTGTALPTMVMLYEERPTWAGRVEAVSNSCALAAIVLPPLPAGAAGEEP 404

Query: 297 LI-WSAMNLPHDAYKLLAVPS------PIGGVLVVGANTIHYHSQSASCALAL--NNYA- 346
           L+ W    LP DA K++ +PS         G+L++ AN + +   +     +L  N++  
Sbjct: 405 LVAWRIQGLPFDAEKVVPLPSVEWDRAAEQGLLLIAANVLFWIRGNGQIGASLSGNHFGD 464

Query: 347 --VSLDSSQELP---------------RSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
             + LD  Q LP               R+S  +    A    ++     L    G++  L
Sbjct: 465 TFMELDGCQ-LPGALYGGTDSDIISRCRTSQVLHFRGACIAPVRLHRYGLFLADGNVYQL 523

Query: 390 TVVYDGRVVQRLDL------SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGT 443
            +  D     RL+       S+  P+ L  D   +   L F+ + LG S+L + T     
Sbjct: 524 ALHADAEYPLRLEALRVRGESRLAPAPL--DAKLLSRDLLFVAAHLGSSVLYRMT----- 576

Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSF 503
                             P  +R R S+++              G+   N  + +  +  
Sbjct: 577 ---------------QVHPHGRRTRTSAAE-------------NGTLHKNATTKEAQWEL 608

Query: 504 AVRDSLVNIGPLKDF---------SYGLRINADA--SATGISKQS------------NYE 540
             RD++  +GP+ D            G  ++     +ATG   QS             ++
Sbjct: 609 QQRDTIFQLGPIVDLVVIPPRYSPPAGTLLDPGEILAATGHQHQSCLARCTYQVQTREWQ 668

Query: 541 LVELPGCKGIWTVY--HKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
            +   GC+ +W++Y  H  +  H  +     A+     +   + L+ R    + AD  T 
Sbjct: 669 RIPSAGCRRVWSLYADHDGTGMHQEEQ----AFLLLSLSKSSVILDIRRGFEQAAD--TR 722

Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY---MTQDL---SFGPSNSES 652
           V       +   TIAAGNL  RR + QV   G R+LD +      +D+   +  P  + S
Sbjct: 723 V------LLPSPTIAAGNLAQRRLIAQVHRTGIRLLDANLDVVYEEDMLLAALEPGTAVS 776

Query: 653 GSGSENSTVLSVSIADPYV 671
           G+          S+ DPY+
Sbjct: 777 GA----------SVVDPYI 785


>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
 gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
          Length = 1150

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 110/467 (23%), Positives = 183/467 (39%), Gaps = 108/467 (23%)

Query: 174 RESFARGPLV----KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSAR 229
           R+S  R   +     +DP+GR  G+ +Y     ++    G   L    DTF         
Sbjct: 107 RDSIGRSSEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGIL---RDTF--------- 154

Query: 230 IESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIS 289
                  N+R LD   V D  F+HGY +P + +L+E +   A  V   H       L   
Sbjct: 155 -------NIR-LDELRVIDIKFLHGYNKPTICVLYE-DYKAARHVKTYH------ILLKE 199

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
               + P  WS  N+   A  L+ VP+P GGVL+V   TI YH+ S   A+ + +  + +
Sbjct: 200 KDFAEGP--WSQSNVESGASLLIPVPAPTGGVLIVSNQTIVYHNGSTFHAIPMQSTVIQV 257

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
             + +   S F                 LL+ + G L ++ + + G+ V  + L     +
Sbjct: 258 YGAVDKDGSRF-----------------LLADQYGTLSVVALQHTGKEVSGVHLEVLGET 300

Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRR 469
            + S ++ + N + F+GS  GDS L++              ++E G              
Sbjct: 301 NIASCLSYLDNGVVFIGSTFGDSQLIKLNAD----------RDETG-------------- 336

Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           S  + L   VN   +  +     + +   +  T S A +D     G L+    G+ IN  
Sbjct: 337 SYIEVLDSYVNVGPIIDFCVMDLDRQGQGQIVTCSGADKD-----GTLRVIRNGIGINEQ 391

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           ASA            ELPG KG+W +               AA  D++     +S E R 
Sbjct: 392 ASA------------ELPGIKGMWAL-----------RETFAAEHDKFLLQSYVS-EVRI 427

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           + +   D + E  + +  F   +T+   N++G    +QV E   R++
Sbjct: 428 LAIGDEDEMEE--KEIPAFTNVKTLLCRNMYGDYW-LQVTESEVRLI 471


>gi|344305212|gb|EGW35444.1| pre-mRNA 3'-end processing factor CF II [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 1348

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 100/500 (20%), Positives = 196/500 (39%), Gaps = 77/500 (15%)

Query: 58  LVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLA 117
           L+V   N+++I+   + ++ S  +K                  L+++  ++L+G +  L 
Sbjct: 29  LIVAKGNLLQIFEPVLIKQQSTPTK--------------PKYKLQIIGQFKLNGLITDLH 74

Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGR--E 175
            L     +N    D +I++ + AK S+++++  +H +   S+H +E     H  R    E
Sbjct: 75  PLRT--VENPHL-DYLIVSTKYAKFSIIKWNHHLHTISTVSLHYYE-----HAIRNSTFE 126

Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE----------------DT 219
                 L+ V+P       L +   +  L  +        D+                D 
Sbjct: 127 KLGISELI-VEPTFNSCSCLRFKNLLCFLPFAVSDEEEEEDDEEDMDLDNKKEKKEKLDI 185

Query: 220 FGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWK 277
            G      +  +SS +I+ + LD  ++ V D  F+H Y EP + IL  +   WAG +   
Sbjct: 186 NGKPADAVSFYDSSFIIDAQTLDSSIETVVDIQFMHNYREPTIAILSSKSNVWAGNLLKV 245

Query: 278 HHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSA 336
                   +++    K    ++   NLP++  +++ +PSP+ G L++G N I H  +   
Sbjct: 246 KDNVSFQVMTLDLVSKSTVSVFKIDNLPYEIDRIIPLPSPLNGCLLLGCNEIFHVDNGGI 305

Query: 337 SCALALNNY----AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV 392
              +A+N++      S  S Q+    S S+E D        +   L+   TG    +   
Sbjct: 306 IKRIAVNSFTSLVTASTKSYQDQTDLSLSLE-DCCIIPIPGDHRVLMVLTTGQFFYINFE 364

Query: 393 YDGRVVQRLDLSKTNPSVLTS-------DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
            DG+ ++++ +   + ++ +        ++  + ++L F  +  G+S LVQF        
Sbjct: 365 LDGKAIKKVHIDTVDQALYSQIKLCYPGEVAVLDHNLLFFANENGNSPLVQF-------- 416

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ----KTF 501
                   + D+  D     +         +     +E  LY    N  E  Q       
Sbjct: 417 -------RYTDV--DQKRITQEAAKEEKKEEKDDEEDEDDLYMDEENEEEQKQIISNSPI 467

Query: 502 SFAVRDSLVNIGPLKDFSYG 521
            F   D L+N GP+  F+ G
Sbjct: 468 EFIHHDELINNGPISSFTLG 487



 Score = 43.1 bits (100), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 33/164 (20%), Positives = 67/164 (40%), Gaps = 26/164 (15%)

Query: 836 HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRT 895
           H   +L  +   G ++ Y+ Y F+G                    N    + ++LR +  
Sbjct: 783 HKEEYLTILTIGGEVIMYKLY-FDG-------------------ENYIFKKEKDLRITGA 822

Query: 896 PLDAYTREETPHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSI 954
           P +AY     P G   +R +  F N++G+   F++G  P   M     +    Q      
Sbjct: 823 PENAY-----PLGTTIERRLVYFPNLNGYTSIFVTGIIPYLIMKPMHSIPRIFQFSKIPA 877

Query: 955 VAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           ++ +   +    +G I++ +    +IC+L    TY+  WP++++
Sbjct: 878 LSISAFSDSKIKNGLIFLDNSKNARICELSLDFTYEFNWPMRQI 921


>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1120

 Score = 74.3 bits (181), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 127/549 (23%), Positives = 224/549 (40%), Gaps = 113/549 (20%)

Query: 96  ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
           + A  L+ V    ++G + ++++   G  D   R   + L  E    +VL +D++   L+
Sbjct: 58  LHAEGLKPVLDVPINGRIATMSLCQTGSGDGKAR---LYLTTERYGFTVLSYDEANEELK 114

Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLV 214
             +    +         GR +   G +  VD   R  G+ +Y GL  +I    +GG    
Sbjct: 115 TEAFGDVQD------NIGRPA-DDGQIGIVDDTCRAIGLRLYDGLFKVIPCDEKGG---- 163

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
                          ++ +  I L +L    V+D  F+HG  +P + +L+ R+   A  V
Sbjct: 164 ---------------VKEAFNIRLEEL---RVEDIKFLHGTPKPTIAVLY-RDTKDA--V 202

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
             K +   I      ++    P  W+  +L   + K++ VP+PIGGV+V+G   I Y   
Sbjct: 203 HIKTYEIGIREKEFVSS----P--WAQNDLEGGSNKIIPVPAPIGGVVVLGQEIIVY--- 253

Query: 335 SASCALALNNYAVSLD---SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                  LN +    D    +  +P       +    A        LL    G L LL +
Sbjct: 254 -------LNKFEDDADVFLKAINIPNIPDRTNITCYGAIDPDGSRYLLGDADGMLYLLVI 306

Query: 392 VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLK 451
           ++DG+ V+ L + +   + + S ++ + N + F+GS  GDS L++             L 
Sbjct: 307 LHDGKRVRELKIERLGDTSIASTLSYLDNGVVFVGSTYGDSQLIK-------------LH 353

Query: 452 EEFGDIEADAPSTKRLRRSSSDALQDMVNGE--ELSLYGSASNNTESAQKTFSFAVRDSL 509
            E   I+ D   T          L  +V+    +L  +G     T S             
Sbjct: 354 AEKTSIDKDGNPTYVQILEEFTNLGPIVDFAFVDLERHGQGQVVTCS------------- 400

Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMA 569
              G LKD S  LR+  +    GI +Q+   +++LPG KG++++        ++D S+M 
Sbjct: 401 ---GALKDGS--LRVVRN--GIGIDEQA---VIQLPGVKGLFSL-------RDSDDSQM- 442

Query: 570 AYDDEYHAYLIISLEARTMVL----ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
                   YL+++    T +L    +  D L E TE   +  + +T+  GN+ G    +Q
Sbjct: 443 ------DKYLVVTFINETRILGFVGDEGDTLDE-TEIAGFDAEAQTLCCGNMQG-NVFLQ 494

Query: 626 VFERGARIL 634
           V  RG R++
Sbjct: 495 VTHRGVRLV 503


>gi|448530371|ref|XP_003870046.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis Co
           90-125]
 gi|380354400|emb|CCG23915.1| mRNA cleavage and polyadenylation factor [Candida orthopsilosis]
          Length = 1327

 Score = 73.9 bits (180), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 83/375 (22%), Positives = 157/375 (41%), Gaps = 57/375 (15%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L+LV  ++L G V  L  L      +    D ++++ + AK S++ ++  +H +   S+H
Sbjct: 57  LKLVEQFKLQGTVSGLKALRTSECPH---LDYVVVSTKYAKFSIIRWNHQLHNISTVSLH 113

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIIL---------------- 204
            +E+          E  A   L  V+P       L Y   +  L                
Sbjct: 114 YYEN---CIQHSTFEKLAISDLT-VEPTYSSVSCLRYKNLLCFLPFEGVHEEDDEDDTDD 169

Query: 205 ----KASQGGS----GLVGDEDTFGSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHG 254
                  +GGS    GL  +   F          ++S +I+   LD  +  V D  F+H 
Sbjct: 170 EDIDNDKKGGSITKNGLSYENQPF---------YDASFIIDAGILDSTIDTVLDVQFLHN 220

Query: 255 YIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAV 314
           Y EP + IL  +  +WAG +           +++    K    +++  NLP+D  +++ +
Sbjct: 221 YQEPTIAILSAKSNSWAGNLIKNKDNVQFQVMTLDVQSKSTLPVFNIDNLPYDIDRVIPL 280

Query: 315 PSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELDAAHA 369
           P+P+ G L++G N  IH  +   +  +A+N +      S+ S Q+   S  +++L+    
Sbjct: 281 PNPLNGCLLIGCNELIHVDNGGIAKRIAVNAFTSLITASVKSYQD--ESDLNLKLENCAI 338

Query: 370 TWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITTIGNS 421
             + +D   LL   TG+   L    DG+ ++++ L   +  +  S        + ++  +
Sbjct: 339 VPIPDDHRVLLILATGEFYYLNFDLDGKSIKKIHLELVDQKMYDSIRLTYPGQVASLDKN 398

Query: 422 LFFLGSRLGDSLLVQ 436
           L F  +  GDS LV+
Sbjct: 399 LLFFANLNGDSSLVE 413


>gi|367014525|ref|XP_003681762.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
 gi|359749423|emb|CCE92551.1| hypothetical protein TDEL_0E03080 [Torulaspora delbrueckii]
          Length = 1327

 Score = 73.2 bits (178), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 130/638 (20%), Positives = 269/638 (42%), Gaps = 86/638 (13%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           +A L L   ++ HG +  LA++ Q  +      D ++L    AK+S+++FD   + +   
Sbjct: 45  SAKLFLTNEFKFHGKITDLALIPQVNSS----LDCLLLCTSIAKVSIVKFDPLSNSIETA 100

Query: 158 SMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYG-LQMIILKASQGGSGLV 214
           S+H +E    +   L+  ++S+ R     +DP  RC  +L    L ++  +A+       
Sbjct: 101 SLHYYEDKFRDLSLLEIAQQSYFR-----LDPSKRCAIILNNDVLALLPFRAA------T 149

Query: 215 GDEDTFGSGGGFSARIES--------SHVINLRDL--DMKHVKDFIFVHGYIEPVMVILH 264
            D++   +      R+++        S +   ++L  ++++V D  F++ + +P + IL 
Sbjct: 150 DDDEEADAENNDVKRMKTSSDKVTYPSKIFVAKELHSEIRNVIDVQFLNNFSKPTIAILF 209

Query: 265 ERELTWAG--RVSWKHHTCMISALSISTTLKQHPL----IWSAMNLPHDAYKLLAVPSPI 318
           E  L WAG  +++ +  + MI  L IS+T          I     L  D + L+ + +  
Sbjct: 210 EPTLIWAGNRQLNPQPISYMIFTLEISSTDNTTKFGATTIGKLTGLSWDFHSLVPISN-- 267

Query: 319 GGVLVVGANTIHYHSQSASC--ALALNNYA-VSLDSSQELPRSSFSVELDAAHA-TW--- 371
            G ++VGAN + +   S +    + LN+++  +L   + +  S + + L  + A  W   
Sbjct: 268 -GCMIVGANELAFADNSGALQSVILLNSFSDRNLRQGRIIDNSKYEILLPQSIARCWSPP 326

Query: 372 ----LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFF 424
               + ++  LL     ++  + +  +GR++ + D+ K    N ++  +   T  + L  
Sbjct: 327 TSDKVNDETLLLMDANSNVYYVQLESEGRLLIKFDIIKLPIVNDTLKNNQGCTCMSRLNS 386

Query: 425 LGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-SSDALQDMVNGEE 483
             S     LL+ F  G    +  + LK      +     ++ +  S   D  +D +  +E
Sbjct: 387 RSSNNNMDLLMGFKSGDALVVRLNNLKSAAESRDEHKIFSEAMESSFDKDEDEDNLYSDE 446

Query: 484 LSLYGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGI--SKQSN 538
            S  G A +N E   +T   F   +  ++ NIGP+   + G   + +    G+    ++ 
Sbjct: 447 ASDAGKADDNKEVIVETVTPFDIELLSTIKNIGPITSLAVGKVCSVEKYVKGLLNPNRNE 506

Query: 539 YELVELP--GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL- 595
           Y +V     G     T    S R     + +  +    ++    + ++ R   L T D  
Sbjct: 507 YSMVATSGNGSGSHLTEIQGSVRPTVEVALKFISVTQIWN----LKIKNRDKYLVTTDSN 562

Query: 596 -----LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLS 644
                + E+  +     +GR      T+      G +R++QV      + D ++  + L+
Sbjct: 563 KAKSDIYEIDNNFALHKEGRFRRDATTVCISMFGGDKRIVQVTTNNLILYDTNF--RRLT 620

Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL 682
               + E         V+ VS+ DPY+L+ +S G I++
Sbjct: 621 TMKFDYE---------VVHVSVMDPYILITVSRGDIKI 649


>gi|430810873|emb|CCJ31593.1| unnamed protein product, partial [Pneumocystis jirovecii]
          Length = 301

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 70/292 (23%), Positives = 128/292 (43%), Gaps = 41/292 (14%)

Query: 245 HVKDFIFV-HGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
           H+ D  F+ + Y EP + IL+    T  G + ++  T   +A            I++   
Sbjct: 6   HIVDLWFIFYDYREPTLAILYSAFQTSTGLLPYRQDTMTSTA------------IYTVDK 53

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC-ALALNNYAVSLDSSQELPRSSFSV 362
           LP+D + +L +P+PIGG L++G N + Y  Q+A   A+++N++A        +     ++
Sbjct: 54  LPYDLFSVLPLPNPIGGTLLIGNNELVYVDQAARVKAVSVNSFARKCTHLDFIEDYDLNL 113

Query: 363 ELDAAHATWL-----QNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV----LTS 413
            L+ A   +L     Q    LL  + G  V +    DGRVV  L +   + SV    L S
Sbjct: 114 RLNGAVGVYLELLDDQPGAVLLVIEDGRFVQVGFKLDGRVVSSLSVKILDQSVKNDFLKS 173

Query: 414 D---ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
           +   I  + N   F+GS++ +S+L+++   S  +             E      + +   
Sbjct: 174 EASCIVLLNNEQLFIGSKVSNSVLLEWKRQSEIA-------------EKLLSEPRVIFDE 220

Query: 471 SSDALQDMVNGEELSLYGSASN-NTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             + L D+  GE+  +  ++S            F + D+L + GP+ D + G
Sbjct: 221 DREVLNDLY-GEDFDIVDTSSILQRNGVFGDIQFRLFDTLYSCGPIVDMTIG 271


>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
          Length = 1148

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 114/511 (22%), Positives = 192/511 (37%), Gaps = 115/511 (22%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +D I L  +  +  VL +D ++  +   +           + R  E    G    +DP G
Sbjct: 74  QDWIFLVTQRFQFCVLAYDTTLQQIITKANGSLRDT----IGRNSEILTNG---NIDPDG 126

Query: 190 RCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDF 249
           R  G+ +Y     ++        L            F+ R++      LR LD+K     
Sbjct: 127 RLIGMNIYEGYFKVIPIDNHSKSL---------KAAFNIRLD-----ELRILDIK----- 167

Query: 250 IFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAY 309
            F++GY +P + +L+E             H      L       + P  WS  N+   A 
Sbjct: 168 -FLYGYNKPTICVLYED-------FKAARHVKTYFILLKEKDFAEGP--WSQSNVEAGAN 217

Query: 310 KLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHA 369
            L+ VP P GGVL++   TI YH+ +   A+ + N  + +  +     S F         
Sbjct: 218 LLIPVPMPYGGVLIISNQTIVYHNGTYFHAIPMQNTMIQVYGAVGDDGSRF--------- 268

Query: 370 TWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
                   LL+ + G L ++ +  +G+ V  + L     + + S ++ + N + F+GS  
Sbjct: 269 --------LLADQYGALHVVALQTEGKEVLDVYLEVLGQTSIASCVSYLDNGVVFVGSTF 320

Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS 489
           GDS LV+              ++E G              S  + L   VN   +  +  
Sbjct: 321 GDSQLVKLNSK----------RDESG--------------SYIEVLDSYVNIGPIIDFCV 356

Query: 490 ASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
              + +   +  T S A +D     G L+    G+ IN  ASA            ELPG 
Sbjct: 357 MDLDRQGQGQIVTCSGADKD-----GSLRVIRNGIGINEQASA------------ELPGI 399

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDY 605
           KG+W +    +               EY  YL+ S   E R M +  +D + EV   ++ 
Sbjct: 400 KGMWALRESLA--------------SEYDKYLVQSYLNEIRIMTIGDSDEMEEV--EIEA 443

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
           F+  +T+   N+      +QV E   RI+D 
Sbjct: 444 FLDAKTLYCRNV-NEDGWLQVTETEVRIIDA 473


>gi|407035910|gb|EKE37921.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
          Length = 836

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+SVL +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++         +G + T             S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
            I + GY  P +  LH +  TW GR+     T    +I +L      ++  ++    +  
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
             +P++A +++    P  G+L++  N+I Y S ++  S  L    + V +  +   P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTVNSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
           F +       T + N V  +   T  L ++         V+   +  R+ D+  TN P  
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
                 TI ++  F+GS + DS +  +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370


>gi|365984967|ref|XP_003669316.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
 gi|343768084|emb|CCD24073.1| hypothetical protein NDAI_0C04130 [Naumovozyma dairenensis CBS 421]
          Length = 1388

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 148/725 (20%), Positives = 286/725 (39%), Gaps = 145/725 (20%)

Query: 58  LVVTAANVIEIYVVR-VQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           L+V   N++ IY +  +    S  S +  ET     +     A L L+  ++L+G V+ +
Sbjct: 29  LLVIRTNILSIYHLETILSPRSNTSSSQLETIEDATVTTSKQAKLFLINEFKLNGKVQDI 88

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
           A +  G   NS   + I+L+   AK+S+L FD SI+     S+H +E            S
Sbjct: 89  ASIPLG---NSSSLECILLSTGTAKLSILNFDPSINSFETLSLHYYEEK---FKDISLVS 142

Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIIL--------------KASQGGSGLVGDEDTFGS 222
            A+   +++DP  RC  +L++   ++ L              +       ++ + +   S
Sbjct: 143 LAKKSQLRMDPLNRC--LLMFNNDVMALLPLHSNNEDEEEEEEDENEEDEVLDNYEANLS 200

Query: 223 GGGFSARIE--------SSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
               + RI+         S + N+  L  D+K++ D  F++ + +P + +L++  LTWAG
Sbjct: 201 KTSPNKRIKYNNNQFEGKSKIFNINKLHEDVKNISDIQFLNNFNKPTIAVLYQPTLTWAG 260

Query: 273 RVSWKHHTC--MISALSI----STTLKQHP-----------LIWSAMNLPHDAYKLLAVP 315
            V         MI  L I    ST    H            +I     L  D +K++ + 
Sbjct: 261 NVQLNPLPTHFMIFTLDILSENSTNNANHTTENNNNDLNLIIIAKLKELAWDWFKIIPIS 320

Query: 316 SPIGGVLVVGANTIHYHSQSA--SCALALNNYA-VSLDSSQELPRSSFSVELD------- 365
           +   G +V+G N I Y   +      + LN++A  +L  ++ +  S F +  +       
Sbjct: 321 N---GCVVIGNNEIAYIDNTGVLQSIILLNSFADKNLKKTRIIDESKFQIFFNENVTHVW 377

Query: 366 ----AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDL-----------SKTNPSV 410
               + + T   ++  LL     +L  + +  +GR++ + D+              NP+ 
Sbjct: 378 SPSTSKNKTTEDDETLLLMDAQSNLYYVRLEAEGRLLTKFDIINLPIVNDVLRENCNPTC 437

Query: 411 LTSDITTIGNSL--FFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
           ++   +   NS    F+G   GDSL+V+           + LK      +  + S +  +
Sbjct: 438 ISRLDSNATNSTMDLFIGFLSGDSLVVRL----------NNLKSAIDTRDEHSESNEHTQ 487

Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQ------------KTFSFAVRDSLVNIGPLK 516
            +  D        +E +LY     + E A+            + F      SL NIGP+ 
Sbjct: 488 LNGFDE------EDEDNLYSDDEVDVEDARSKRDMETIIHTVQPFDIEYLTSLKNIGPIT 541

Query: 517 DFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGH-NADSSRMAAYDDEY 575
             + G   + D +  G+   +  E         I T    S+  H N     +    ++ 
Sbjct: 542 SLTVGKVSSLDLNVKGLQNPNKNEF-------SIVTTSGNSTGSHLNVIQQTVQPIVEKA 594

Query: 576 HAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR-----TIAAGNLF 618
             ++ ++      ++ +   L T D       + ++  +     +GR     T     +F
Sbjct: 595 LKFISVTQIWNLKIKNKDKYLVTTDSTKSKSDIYDIDNNFSLHKEGRLRRDATTVYIAMF 654

Query: 619 GR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
           G  +RV+Q+      + D ++  + L+    + E         V+ VS+ DPY+L+ +S 
Sbjct: 655 GDGKRVVQITTNHLYLFDTNF--RRLTAIKFDFE---------VVHVSVMDPYILITVSR 703

Query: 678 GSIRL 682
           G I++
Sbjct: 704 GDIKI 708


>gi|67463896|ref|XP_648489.1| cleavage and polyadenylation specificity factor subunit [Entamoeba
           histolytica HM-1:IMSS]
 gi|56464653|gb|EAL43100.1| cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba histolytica HM-1:IMSS]
          Length = 1150

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+SVL +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++         +G + T             S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
            I + GY  P +  LH +  TW GR+     T    +I +L      ++  ++    +  
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
             +P++A +++    P  G+L++  N+I Y S ++  S  L    + V +  +   P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
           F +       T + N V  +   T  L ++         V+   +  R+ D+  TN P  
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
                 TI ++  F+GS + DS +  +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370


>gi|68471462|ref|XP_720279.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           fragment [Candida albicans SC5314]
 gi|46442139|gb|EAL01431.1| likely Cleavage and Polyadenylation Specificity Factor subunit
           fragment [Candida albicans SC5314]
          Length = 423

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 45/199 (22%), Positives = 93/199 (46%), Gaps = 15/199 (7%)

Query: 251 FVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
           F+H Y EP + +L  ++  WAG +           L++   LK    ++   NLP++  +
Sbjct: 3   FLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTLDLNLKSTISVFKIDNLPYEIDR 62

Query: 311 LLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNY----AVSLDSSQELPRSSFSVELD 365
           ++ +PSP+ G L+VG N  IH  +      +A+N +      S  S Q+  +S  +++L+
Sbjct: 63  VIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTRLITASFKSFQD--QSDLNLKLE 120

Query: 366 AAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS-------DITT 417
                 + +D   LL  +TG+   +    DG+ ++R+ +   +             ++  
Sbjct: 121 NCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIHIDNVDKKTYDKIQLNHPGEVAI 180

Query: 418 IGNSLFFLGSRLGDSLLVQ 436
           +  ++ F+ +  G+S L+Q
Sbjct: 181 LDKNMLFIANSNGNSPLIQ 199


>gi|149237256|ref|XP_001524505.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146452040|gb|EDK46296.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 1380

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 71/328 (21%), Positives = 139/328 (42%), Gaps = 60/328 (18%)

Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
           +SS +I   +LD  +  + D  F+H Y +P + +L  R  +WAG +        +  +S+
Sbjct: 223 DSSFIIEAGNLDSSIDTIIDLQFLHNYRDPTIALLSSRSHSWAGSLLKSKDNVHLEVMSL 282

Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNY-- 345
               K    I+   NLP++  +++ + +P+ G L+VG N I H  +   +  +++N++  
Sbjct: 283 DLLTKLSTSIFKIENLPYEVDRIVPLSAPLNGCLLVGCNEIMHVDNGGIAKRISVNDFTS 342

Query: 346 --AVSLDSSQELPRSSFSVELDAAHATWLQND-VALLSTKTGDLVLLTVVYDGRVVQRLD 402
               S+ S+Q+  +S+  ++L+      + +D   L+ T+ G         DG+ ++R+ 
Sbjct: 343 LTTASVKSNQD--QSNLGLKLENCSVVQIPDDHRVLIVTEQGSFYFANFELDGKSIKRVF 400

Query: 403 LSKTNPSV-------LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
           +   + ++          +I  +  +L F+ +  GDS LVQ             +K    
Sbjct: 401 IDVVDKNMYDKIKFTFPGEIAVLSKNLLFMSNLNGDSPLVQ-------------VKYRNS 447

Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA----------------QK 499
            I  D   T+R+ +           G E +    +SN  +                  QK
Sbjct: 448 KILEDTRGTRRVEKGK---------GAEKNKNNVSSNEVDDDDDDDDDLYKEEEEEEQQK 498

Query: 500 TFS-----FAVRDSLVNIGPLKDFSYGL 522
             S     F ++D L+N  P+  F+ GL
Sbjct: 499 VLSKSHIEFILQDRLINNSPISTFTLGL 526


>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1090

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 126/557 (22%), Positives = 220/557 (39%), Gaps = 121/557 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A+ L+ +    ++G + +L +    G      +D + ++FE  K  VL++D 
Sbjct: 39  RIEIHLLTASGLQPMLDVPIYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
              GL +T      S      + GR +   G +  VDP  R  G+ +Y GL  +I   ++
Sbjct: 95  ET-GLLVTRAMGDVSD-----RIGRPT-DNGQIGIVDPDCRLIGLHLYDGLFKVIPIDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCAKPTIAVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H              + P  W   NL + A  L+ VP P+GG +++G  T
Sbjct: 186 D-------ARHVKTYEVQLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQT 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y++ S   A+ +            + ++   V+ D +          LLS   G L L
Sbjct: 237 IVYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + +D   V  L++     +   S ++ + N + F+GS  GDS L++            
Sbjct: 280 LVISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
                  + +ADA      + S  + L+  VN G  + L           Q  T S A +
Sbjct: 329 -------NHQADA------KNSYVEVLESYVNLGPIVDLCVVDLERQGQGQVVTCSGAFK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  ASA            EL G KG+W++   SS        
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSLRASSS-------- 410

Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
                 D Y  +L++S   E R + + T D L E TE   +  + +T+   N     +++
Sbjct: 411 ------DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCYNAV-HDQLV 462

Query: 625 QVFERGARILDGSYMTQ 641
           QV     R++D     Q
Sbjct: 463 QVTAGSLRLVDAKTRRQ 479


>gi|449710759|gb|EMD49776.1| cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba histolytica KU27]
          Length = 836

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 80/327 (24%), Positives = 144/327 (44%), Gaps = 52/327 (15%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA----RGPLVKVDPQ 188
           ++L F++AK+SVL +D++ +   I S+HCFE P    LKR +E         P + +D +
Sbjct: 74  LVLLFKEAKVSVLRYDETNNKFVIHSLHCFELP----LKRMQEGLTPTTYTDPRLLIDKR 129

Query: 189 GRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           GRC  ++ Y   M ++         +G + T             S+ INL    +  + D
Sbjct: 130 GRCISLICYDRLMWVIP--------LGLDKT-------------SYSINLEKFGINRIID 168

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHT---CMISALSISTTLKQHPLI----WSA 301
            I + GY  P +  LH +  TW GR+     T    +I +L      ++  ++    +  
Sbjct: 169 CIVLDGYDLPSVAFLHMKIPTWEGRIVNTGETTNEIIILSLEPDVIHERQDIVATISYQF 228

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSS 359
             +P++A +++    P  G+L++  N+I Y S ++  S  L    + V +  +   P SS
Sbjct: 229 SYVPYNALQIVDC-YPTNGLLILTINSIIYLSTTSFESFILPFGKFFV-IPKNINGPLSS 286

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL-------TVVYDGRVVQRL-DLSKTN-PSV 410
           F +       T + N V  +   T  L ++         V+   +  R+ D+  TN P  
Sbjct: 287 FQI---LQMQTKIMNSVKSIFKLTNHLYIIFSMNGESYYVHLLSIANRICDVIITNSPYK 343

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
                 TI ++  F+GS + DS +  +
Sbjct: 344 YHPTTFTISSNHLFIGSTVHDSYIYNY 370


>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
           C-169]
          Length = 1101

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 125/551 (22%), Positives = 206/551 (37%), Gaps = 113/551 (20%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ V    ++G V ++ +    G      +D + L+ E  K  VLE+D 
Sbjct: 44  RIEIHTLTPEGLKGVADVAIYGRVATMELFRPVG----ESKDLLFLSTERYKFCVLEYDS 99

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
               L   +    E       + GR     G +  VDP          G +MI L    G
Sbjct: 100 ETGELVTRANGDIED------QVGRPC-DNGQIGIVDP----------GCRMIGLHLYDG 142

Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
              ++  +D       F+ RI+  +VI           D IF+ G  +P + +L++    
Sbjct: 143 LFKVIPIDDKGQLHEAFNMRIDELNVI-----------DMIFLEGCAKPTIAVLYQDN-- 189

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
                    H      +     L + P  W   NL   A +++AVP P+GG LVVG + I
Sbjct: 190 -----KDARHIKTYEVVLKEKDLTEGP--WRQSNLDAGASRVIAVPEPLGGALVVGESVI 242

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVL 388
            Y  Q                  Q +  +     +  AH    ++    LL    G+L L
Sbjct: 243 AYMGQ-----------------GQAMKCTPIKATIIRAHGRVDEDGSRYLLGDYVGNLYL 285

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + +DG  V  L +     +   S +T + N + F+GS  GDS LV+            
Sbjct: 286 LVLQHDGEHVAGLKVEPLGRTSAPSTLTYLDNGVVFVGSSGGDSQLVRL----------- 334

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDS 508
                        P T +   +  + L+ M N   +  +       +   +    +    
Sbjct: 335 ----------HPTPVTPQEPSNFVEVLETMTNLGPIIDFVVVDLERQGQGQVVMCS---- 380

Query: 509 LVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRM 568
               G + D S  LRI  +    G+ +Q+    VELPG KG+W +           +S M
Sbjct: 381 ----GIMADGS--LRIVRN--GIGMIEQAT---VELPGIKGMWALR----------ASHM 419

Query: 569 AAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQV 626
            A+D     +L+IS   E R + +   D L E  E   +    +T+  GN      ++QV
Sbjct: 420 DAFD----TFLVISFVGETRILAINADDELDE-AELPGFSADAQTLCCGNTVS-DHLVQV 473

Query: 627 FERGARILDGS 637
                R++D S
Sbjct: 474 AGADVRLVDAS 484


>gi|385304555|gb|EIF48567.1| rna-binding subunit of the mrna cleavage and polyadenylation factor
           [Dekkera bruxellensis AWRI1499]
          Length = 353

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 67/293 (22%), Positives = 129/293 (44%), Gaps = 39/293 (13%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERE-LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSA 301
           +K++ D+ F++ Y EP + IL+  E L+WAG +        +  LS++    +   I   
Sbjct: 69  VKNIMDYQFLYSYREPTIAILYAPEGLSWAGYLXKLKDNMKVVVLSLNLDTHKADSIMVL 128

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTI-HYHSQSASCALALNNYAVSLDSSQELPRSSF 360
            NLP+D   +  +PSPI G L++G+N I H +S  +   +  N Y       +    S  
Sbjct: 129 PNLPYDLNSIYPLPSPINGFLLIGSNEILHVNSLGSIKGVYTNKYFPETSDMKLRDESDL 188

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG------RVVQRLDLSKTNPSVLTSD 414
           ++E +    +++ +D  LL ++ G   +L+    G      ++++  + +  N SV  ++
Sbjct: 189 NLECEGCSVSFVGDDQVLLISQIGKFYVLSFNESGGISNLNKIIEIPEANYCNVSV--NN 246

Query: 415 ITTIGN----SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
           +  I N    +  FL  +  DS+L+ +                      + P+   + +S
Sbjct: 247 VLQITNIEDCNSAFLCCQGSDSILLHWN--------------------YNVPTRGTVSKS 286

Query: 471 SSDALQDMVNGEELSLY--GSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           ++   ++    E+  LY     S  +     + +F   D LVN GP  DF+ G
Sbjct: 287 NAGIEKE---DEDSWLYHEDETSQTSNRPLTSCTFTXIDKLVNCGPTSDFTIG 336


>gi|302769568|ref|XP_002968203.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
 gi|300163847|gb|EFJ30457.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
          Length = 1089

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 123/557 (22%), Positives = 217/557 (38%), Gaps = 121/557 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+    ++A  L+ +    ++G + +L +    G      +D + ++ E  K  VL++D 
Sbjct: 39  RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  VDP+ R  G+ +Y GL  +I   ++
Sbjct: 95  ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H              + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WSQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y+S SA  A+ +            + ++   V+ D +          LLS  TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     +   S ++ + N + ++GS  GDS L++            
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKLNA--------- 330

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
                    + D+      R S  + L+  VN G  + L           Q  T S A +
Sbjct: 331 ---------QVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  ASA            EL G KG+W++             
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
             A   D +  +L++S   E R + +   D L E TE   +  + +T+   N     ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462

Query: 625 QVFERGARILDGSYMTQ 641
           QV     R++D +   Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479


>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
          Length = 1065

 Score = 70.1 bits (170), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 90/387 (23%), Positives = 149/387 (38%), Gaps = 49/387 (12%)

Query: 349 LDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLS 404
           +D   + P   F    + LD    T +  +  LL  + G L  L +V D    V+ L+L 
Sbjct: 1   MDGFTKFPLRDFKHMVLTLDGCVVTVISTNKILLCDRNGRLFTLVLVTDATNSVKSLELK 60

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
               +V+   +T+      F+GSRL DS+ +       T             ++  AP  
Sbjct: 61  FQFKTVIPCTMTSCAPGYLFIGSRLCDSVFLHCIFEQST-------------LDESAPKK 107

Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNT---ESAQKTFSFAVRDSLVNIGPLKDFSYG 521
            +L  +  +A +D    E+  LYG         +SA++  +  V D L+N+GP K  + G
Sbjct: 108 IKLN-TELNANED----EDFELYGEVLPKVAKPDSAEELLNIRVLDKLLNVGPCKKITGG 162

Query: 522 LRINADASATGISKQSNYELVELPGCK--GIWTVYHKSSRGHNADSSRMAAY-------- 571
               +        K   ++LV   G    G   ++ +S R     SS +           
Sbjct: 163 CPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSSSIEGVVQYWAVGR 222

Query: 572 -DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
            +D+ H Y I S E  T+ LET + L E+ E+  +     TIAAG L      +QV    
Sbjct: 223 REDDTHMYFIASKELGTLALETDNDLVEL-EAPIFATSEPTIAAGELADGGLAVQVTTSS 281

Query: 631 ARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRL--LVGDPS 688
             ++      Q +                 V S SI DPY+ +   +G + +  L   P 
Sbjct: 282 LVMVAEGQQIQHIPL----------QLTFPVRSASIVDPYIAICTQNGRLLMYELTSHPH 331

Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHD 715
                +     +     P++S ++Y D
Sbjct: 332 VHLKEIDISKRLRHETSPITSLSIYRD 358



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 59/239 (24%), Positives = 104/239 (43%), Gaps = 29/239 (12%)

Query: 756 VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSE------TEINSSSEE 809
           +  E+G + I+ +P  + V+ V K     +H+ D    +   D E       +  S++  
Sbjct: 445 IARENGNMYIYSIPELHLVYMVKKI----SHLPDIATDQPYVDDEPATAESIDTMSATMT 500

Query: 810 GTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDD 869
            T   + E +    ++EL M     +  RP LF +L D T+  Y+ + +    N      
Sbjct: 501 DTFAAKPEEV----IMELLMVGMGMNQGRPMLF-LLIDDTVSVYEMFTY----NNGIQGH 551

Query: 870 PVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI--FKNISG-HQGF 926
                + L  + V+    R+ RF    LD     E+   A   +  +  F+ I     G 
Sbjct: 552 LAVRFKRLPYTVVT----RSCRFQG--LDGRAAVESVRDAVRHKTVLHFFERIGNVLNGV 605

Query: 927 FLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLP 984
           F+  S PC   +     R+HP   DG I++FT  +N  C +GFIY+T  + ++++ +LP
Sbjct: 606 FICSSYPCIFFLETGVPRLHPVNLDGPILSFTTFNNAACPNGFIYLTERERLMRVAKLP 664


>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
 gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
          Length = 1138

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 117/519 (22%), Positives = 194/519 (37%), Gaps = 126/519 (24%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D+DT       S R+E       
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DKDT-NELKATSLRMEE------ 161

Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
                 HV+D  F++G   P ++++H+        ++ +H    I    IS   K+   I
Sbjct: 162 -----MHVQDVEFLYGTTHPTLIVIHQD-------INGRH----IKTHEISLKDKEFTKI 205

Query: 299 -WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAVSL 349
            W   N+  +A  L+AVP P+GG +V+G  +I YH   +  A+A        +N YA   
Sbjct: 206 AWKQDNVETEATMLIAVPMPLGGAIVIGQESIVYHDGDSYVAVAPAIIKQSTINCYA--- 262

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLS 404
                         +D+    +L  ++A      G+L ++ +  +        V+ + + 
Sbjct: 263 -------------RIDSKGLRYLLGNMA------GNLFMMFLETEENAKGQTTVRDIKVE 303

Query: 405 KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST 464
                 +   IT + N + F+GSR GDS LV+    +G +     L E F ++   AP  
Sbjct: 304 LLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVMLMETFTNL---APIV 360

Query: 465 KRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI 524
                     L+    G+ ++  GS                       G L+    G+ I
Sbjct: 361 DMCVVD----LERQGQGQMITCSGSFKE--------------------GSLRIIRNGIGI 396

Query: 525 NADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
              A             ++LPG KG+W +             R+   D  Y   LI+S  
Sbjct: 397 QEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLILSFV 431

Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL--DGSYMTQD 642
             T VL  +    E TE        +T    N+    +++QV    AR++  D   M  +
Sbjct: 432 GHTRVLMLSGDEVEETEIAGILGDQQTFYCANV-SHGQILQVTPSSARLISCDNKAMICE 490

Query: 643 LSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR 681
               P N   G    N+T +  + A     + + DG + 
Sbjct: 491 WK-PPDNKRIGVVGANTTQIVCASAQDVYYVEIGDGKLE 528


>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
          Length = 1149

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 105/481 (21%), Positives = 180/481 (37%), Gaps = 110/481 (22%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G L  +DP+ R  G+ +Y     I+   +  S L                       N+R
Sbjct: 119 GILAVIDPKARVIGLRLYDGLFKIIPLEKDNSELKAS--------------------NIR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG   P ++++H+        V+ +H    +    IS   K+   + 
Sbjct: 159 -IDELQVHDVEFLHGCANPTLILIHQD-------VNGRH----VKTHEISLREKEFVKVP 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VPSP+GG +++G   I YH       +A             + +S
Sbjct: 207 WRQDNVETEASMIIPVPSPLGGAIIIGQENILYHDGITPVVVA----------PAVIKQS 256

Query: 359 SFS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVL 411
           +     ++D     +L  D+A      G L +L +  D R     VV+ L +        
Sbjct: 257 TIVCYAKVDPGGLRYLLGDMA------GHLFMLFLEVDNRGDGNDVVKDLKVELLGEIAT 310

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
              IT + N + F+GSRLGDS LV+ T     S     + E F ++   AP    L    
Sbjct: 311 PECITYLDNGVLFIGSRLGDSQLVKLTTKPNESGSYVTVMESFTNL---API---LDMCV 364

Query: 472 SDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASAT 531
            D L+    G+ ++  G+                       G L+    G+ I   AS  
Sbjct: 365 VD-LERQGQGQLVTCSGAFKE--------------------GSLRIIRNGIGIQEHAS-- 401

Query: 532 GISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE 591
                     ++LPG KG+W +                A D  Y   L+++   +T VL 
Sbjct: 402 ----------IDLPGIKGMWAL--------------QVASDGRYDNTLVLAFVGQTRVLS 437

Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
                 E T+   +    +T   GN+    +++Q+    AR++     T    + P + +
Sbjct: 438 LNGEEVEETDIAGFASDQQTFFCGNVI-HEQIVQITPISARLISAQNKTLLAEWKPPSDK 496

Query: 652 S 652
           +
Sbjct: 497 N 497


>gi|440302955|gb|ELP95261.1| hypothetical protein EIN_430670 [Entamoeba invadens IP1]
          Length = 1175

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 79/362 (21%), Positives = 150/362 (41%), Gaps = 59/362 (16%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCG 192
           +IL F+ A++SV+ ++   +   + S+HCFE PE    ++   +    P + +D +GRC 
Sbjct: 74  LILLFKQARLSVMRYNTETNRFVVHSLHCFEYPELRIREKCTPTAYDDPRMFIDKKGRCI 133

Query: 193 GVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFV 252
            +L Y   + ++                GS         SS+ ++L    +  + D I +
Sbjct: 134 SLLCYDRLLWVIP--------------LGSN-------RSSYRVDLEKFGVSRIVDVISL 172

Query: 253 HGYIEPVMVILHERELTWAGR-VSWKHHTCMISALSISTTL--KQHPLIWSAMN----LP 305
            GY  P +  LH    TW  R V+    T  I+ ++++  +  ++     + +N    LP
Sbjct: 173 SGYETPTLAFLHMTVPTWDARTVNTGEATNEIAIINVNPGVVGEEEQECANVVNRISRLP 232

Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE----------L 355
           ++  K++    P+ G+L++ + ++ Y S ++S +  L  +    +  +           L
Sbjct: 233 YNTLKMVEC-YPLPGILLLASVSVLYISTTSSESFIL-PFGTYFNPPEVWKGVVPFLKLL 290

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
           P     ++L  +     QN + L  T  GD   + +     +VQ + LS   P     + 
Sbjct: 291 PMKIRIIQLVKSIHQLSQN-LYLTFTDKGDSYYIHLNCVEGIVQEIVLSNA-PYKFIPNT 348

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT---------------CGSGTSMLSSGLKEEFGDIEAD 460
            ++ +   FLGS   DS L  +T               CG    +    L+E  G +E D
Sbjct: 349 VSLYDDYIFLGSVFHDSYLFNYTICEYGKGDIKPFGIHCGDAVRI--KNLQERSGQMEED 406

Query: 461 AP 462
            P
Sbjct: 407 YP 408


>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1283

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 113/541 (20%), Positives = 202/541 (37%), Gaps = 106/541 (19%)

Query: 501 FSFAVRDSLVNIGPLKDFSYGLRI---NADASATGISKQSNYELVELPG----------- 546
           + F + D L N+GP++D + G      + D      S  +N ELV   G           
Sbjct: 376 YIFRIHDRLWNLGPMRDLTLGRPPGPRDKDKRQPVSSILANLELVTTQGYGKAGGLAILR 435

Query: 547 ---------------CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL-----EAR 586
                            G  +VY K  +  +   S        Y  YL++S      + +
Sbjct: 436 REIDPFVIDSLMIKDTDGARSVYVKDPKLPSQSGSLPLNPGSNYDHYLLLSKSKGLDKEK 495

Query: 587 TMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILD-GSYMTQDLS 644
           ++V   +    E T++ ++   + RTI  G L    RV+QV +   R  D G  + Q   
Sbjct: 496 SVVYRMSSGGLEETKAPEFNPNEDRTIDIGTLASGTRVVQVLKGEVRSYDSGLGLAQIFP 555

Query: 645 FGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSK 704
               +      SE  +V+  S ADPYVL+   D SI LL  D S      +T   I S+ 
Sbjct: 556 VWDEDM-----SEEKSVVHTSFADPYVLIIRDDQSILLLQADDSGDLDEAETDGIINSTT 610

Query: 705 KPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP-LDQGD-IYSVVCYESGA 762
               S +LY DK                     +    +G P + Q D +   +      
Sbjct: 611 --WISGSLYQDKY-------------------RSFKSHEGPPNMKQSDNVLLFLLSSESK 649

Query: 763 LEIFDVPNF-NCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS 821
           L +F +PN    VFT +                   D   +I S+         +E I  
Sbjct: 650 LYVFHLPNAREPVFTTESI-----------------DLLPQILSTEPPPRRVTYRETITE 692

Query: 822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSN 881
           + V +L      +    P+L    ++  ++ Y+ Y +             ST R  S   
Sbjct: 693 LLVADLG----DSVSRSPYLILRSSNSDLILYEPYHYTS-----------STERQFS--G 735

Query: 882 VSASRLRNLRFSRTPLDAYTREETPHGAPCQRIT----IFKNISGHQGFFLSGSRPCWCM 937
           +   ++ N  F ++  ++   +   H A C  I+    +  ++ G++  F+ G+ PC+ +
Sbjct: 736 LRFVKIANHHFPKSHSESNAGK---HPANCTAISKPLRVLGDVCGYRTVFMPGNSPCFII 792

Query: 938 VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
                +     L   ++ + +  +   C  GF+YV +  ++++C+ P  + +D  W  +K
Sbjct: 793 KSSTSIPHVMNLRGKTVHSLSSFNIPACEKGFVYVDTDNVVRMCRFPRNTHFDGSWAARK 852

Query: 998 V 998
           +
Sbjct: 853 I 853



 Score = 47.8 bits (112), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 45/180 (25%), Positives = 84/180 (46%), Gaps = 21/180 (11%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL+V    +++++ +     GS   +   +T+ +          L LV  Y L G +  L
Sbjct: 28  NLIVAKTTLLQVFNLVNVVYGSGPGQPDEKTRSQY-------TKLVLVAEYALSGTITDL 80

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             +     D+    +++++A  +AK+S++E+D   H +  TS+H +E  + +++     +
Sbjct: 81  GRVKI--LDSKSGGEAVLVATRNAKLSLIEWDPERHQISTTSIHYYERDD-VNISPWTPN 137

Query: 177 FARGP-LVKVDPQGRCGGVLVYGLQ-MIILKASQGGSGLVGDEDTFGSGGGFSARIESSH 234
            A  P  + VDP  RC  VL +G + + IL   Q G  LV D+        F + +E  H
Sbjct: 138 LASCPSYLTVDPNSRC-AVLNFGKKNLAILPFHQVGDDLVMDD--------FDSDVEEQH 188


>gi|242089089|ref|XP_002440377.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
 gi|241945662|gb|EES18807.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
          Length = 1783

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 108/464 (23%), Positives = 189/464 (40%), Gaps = 112/464 (24%)

Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
           +D + +A E  K  VL++D     L   +M           + GR +   G +  +DP  
Sbjct: 75  QDFLFIATERYKFCVLQWDAEKSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDC 127

Query: 190 RCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKD 248
           R  G+ +Y GL  +I   ++G                F+ R+E   V++++         
Sbjct: 128 RLIGLHLYDGLFKVIPFDNKGQLK-----------EAFNIRLEELQVLDIK--------- 167

Query: 249 FIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDA 308
             F+HG ++P +V+L++           +H      AL       + P  WS  N+ + A
Sbjct: 168 --FLHGCVKPTIVVLYQ------DNKDVRHVKTYEVALK-DKDFVEGP--WSQNNVDNGA 216

Query: 309 YKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAH 368
             L+ VP+P+GGV+++G   I Y + +++          ++   Q + R+   V+ D + 
Sbjct: 217 GLLIPVPAPLGGVIIIGEEQIVYCNANSTFK--------AIPIKQSIIRAYGRVDPDGSR 268

Query: 369 ATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSR 428
                    LL   TG L LL + ++   V  L +     + + S I+ + N + ++GSR
Sbjct: 269 Y--------LLGDNTGILHLLVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSR 320

Query: 429 LGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
            GDS LV+                   +++ADA        S  + L+  VN   +  + 
Sbjct: 321 FGDSQLVKL------------------NLQADASG------SFVEILERYVNLGPIVDFC 356

Query: 489 SASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
               + +   +  T S A +D     G L+    G+ IN  AS            VEL G
Sbjct: 357 VVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQG 399

Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
            KG+W++  KSS            ++D Y  YL++S  + T  L
Sbjct: 400 IKGLWSL--KSS------------FNDPYDMYLVVSFISETRFL 429


>gi|403218521|emb|CCK73011.1| hypothetical protein KNAG_0M01580 [Kazachstania naganishii CBS
           8797]
          Length = 1345

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 147/706 (20%), Positives = 284/706 (40%), Gaps = 146/706 (20%)

Query: 61  TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
           T A+ +E+ +VR       +   SG+              L L   ++L   +  LA++ 
Sbjct: 22  TTADYVELLIVRTNLLSIYKVTESGK--------------LLLTHEFKLQARITDLALV- 66

Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF--- 177
            G  +N+   + ++L   + K+S+++F+   + L   S+H +E       K    SF   
Sbjct: 67  -GSVENTGL-NYLLLGIGNCKLSIVKFNSLNNSLETISLHYYEE------KFKANSFIEL 118

Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA--------R 229
           A+   +++DPQ RC  +L     ++IL  SQ        E+                  +
Sbjct: 119 AKKTELRIDPQNRCA-LLFNNDNIVILPFSQQQEEEDYGEEEEEEDNYNMEDGPNVKKLK 177

Query: 230 IES--------SHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH- 278
           +ES        S + + + LD  +++V D  F+  +  P + IL++ +LTWAG +     
Sbjct: 178 LESASTNLTLPSIITDSKKLDSTIENVVDIQFLRNFSRPTLGILYQPKLTWAGNLQLNPL 237

Query: 279 -HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA- 336
               ++ +L+I+ +  +  +I     LP D++ L  +P+  G VL+ G+N + Y   +  
Sbjct: 238 PTKFLVISLNIAVSELEGTVITKLEGLPWDSHTL--IPTWNGCVLL-GSNEVSYIDNTGV 294

Query: 337 -SCALALNNYA-VSLDSSQELPRSSFSVEL--DAAHATWLQ--------NDVALLSTKTG 384
              A+ LN+YA  SL   + +  +   + L  D   + W          +++ LL  ++ 
Sbjct: 295 LQSAIFLNSYADASLRKVRVVDHTDQQITLNKDLVKSLWSAPTKESGGADEILLLMDESS 354

Query: 385 DLVLLTVVYDGRVVQRLDL-----------SKTNPSVLT--SDITTIGNSLFFLGSRLGD 431
           +L  + + ++GR++ + D+              +P+ +T   +     N   F+G + GD
Sbjct: 355 NLYYIQLEFEGRLMTKFDMINLPIVNDIFVHNLHPTCITRIDESKHNININLFIGFQTGD 414

Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELS---LYG 488
           SL+V+                   +I +   +    +++SS++    V  E+     LYG
Sbjct: 415 SLVVRL-----------------NNIRSAIETRHEYKQTSSESGLGKVEDEDEDEDDLYG 457

Query: 489 -------SASNNTESAQ----KTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
                  +AS N ++A     + F   +   L NIGP+     G   +      G+   +
Sbjct: 458 DDGAHDKNASVNNDNAVVHTVQPFDIEMMSCLRNIGPVTSLVIGEASSVQPVIKGLPNPN 517

Query: 538 NYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL--------EARTMV 589
             E   +  C         +  G N    +++   +   A   IS+        + R   
Sbjct: 518 KGEYSLVATCG--------NGTGSNLMVGQISVQPEVELALKFISVTQIWNLKVKNRDKY 569

Query: 590 LETADL------LTEVTESVDYFVQGR------TIAAGNLFGRRRVIQVFERGARILDGS 637
           L T D       + E+  +   + QGR      T+      G +R++QV      + D +
Sbjct: 570 LITTDSTKTKSDIYEIENNFALYKQGRLRRDATTVYISMFGGEKRIVQVTTNHLYLYDTN 629

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLL 683
           +  + L     N E         V+ VS+ DPY+L+ +S G I + 
Sbjct: 630 F--RRLFLNKFNYE---------VVHVSVMDPYLLITLSRGDIMIF 664


>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName:
           Full=UV-damaged DNA-binding protein 1
 gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
          Length = 1095

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 119/540 (22%), Positives = 204/540 (37%), Gaps = 143/540 (26%)

Query: 57  NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESL 116
           NL++     IEI+++  Q                    G+    L+ +    ++G + +L
Sbjct: 31  NLIIAKCTRIEIHLLTPQ--------------------GLQCICLQPMLDVPIYGRIATL 70

Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
            +    G      +D + +A E  K  VL++D     +   +M           + GR +
Sbjct: 71  ELFRPHG----ETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSD------RIGRPT 120

Query: 177 FARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
              G +  +DP  R  G+ +Y GL  +I   ++G                F+ R+E   V
Sbjct: 121 -DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK-----------EAFNIRLEELQV 168

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           ++++           F++G  +P +V+L++           +H        +   +LK  
Sbjct: 169 LDIK-----------FLYGCPKPTIVVLYQ------DNKDARH------VKTYEVSLKDK 205

Query: 296 PLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS 352
             I   W+  NL + A  L+ VP P+ GVL++G  TI Y S SA  A+ +          
Sbjct: 206 DFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR--------- 256

Query: 353 QELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLT 412
             + R+   V+ D +          LL    G L LL + ++   V  L +     + + 
Sbjct: 257 PSITRAYGRVDADGSR--------YLLGDHNGLLHLLVITHEKEKVTGLKIELLGETSIA 308

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
           S I+ + N+  F+GS  GDS LV+                         P TK    S  
Sbjct: 309 STISYLDNAFVFIGSSYGDSQLVKLNL---------------------QPDTK---GSYV 344

Query: 473 DALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
           + L+  VN   +  +       +   +  T S A +D     G L+    G+ IN  AS 
Sbjct: 345 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIGINEQAS- 398

Query: 531 TGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
                      VEL G KG+W++               +A DD Y  +L++S  + T VL
Sbjct: 399 -----------VELQGIKGMWSL--------------RSATDDPYDTFLVVSFISETRVL 433


>gi|115465791|ref|NP_001056495.1| Os05g0592400 [Oryza sativa Japonica Group]
 gi|48475231|gb|AAT44300.1| putative DNA damage binding protein 1 [Oryza sativa Japonica Group]
 gi|113580046|dbj|BAF18409.1| Os05g0592400 [Oryza sativa Japonica Group]
 gi|215694552|dbj|BAG89545.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222632766|gb|EEE64898.1| hypothetical protein OsJ_19757 [Oryza sativa Japonica Group]
          Length = 1090

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +       ++  +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G ++P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y + +++          ++   Q + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N + ++GSR GDS LV+            
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++AD         S  + L+  VN   +  +     + +   +  T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++  KSS        
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
               ++D Y  YL++S  + T  L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429


>gi|12082087|dbj|BAB20761.1| UV-damaged DNA binding protein [Oryza sativa Japonica Group]
          Length = 1090

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +       ++  +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G ++P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y + +++          ++   Q + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N + ++GSR GDS LV+            
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++AD         S  + L+  VN   +  +     + +   +  T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++  KSS        
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
               ++D Y  YL++S  + T  L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429


>gi|218197365|gb|EEC79792.1| hypothetical protein OsI_21216 [Oryza sativa Indica Group]
          Length = 1089

 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 111/504 (22%), Positives = 205/504 (40%), Gaps = 116/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +       ++  +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMIDVPIYGRIATLELFRP----HNETQDFLFIATERYKFCVLQWDG 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELLTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G ++P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCVKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP+P+GGV+++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y + +++          ++   Q + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCNANSTFR--------AIPIKQSIIRAYGRVDPDGSR--------YLLGDNAGILHL 280

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N + ++GSR GDS LV+            
Sbjct: 281 LVLTHERERVTGLKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKL----------- 329

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++AD         S  + L+  VN   +  +     + +   +  T S A +
Sbjct: 330 -------NLQADPNG------SYVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFK 376

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++  KSS        
Sbjct: 377 D-----GSLRVVRNGIGINEQAS------------VELQGIKGLWSL--KSS-------- 409

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
               ++D Y  YL++S  + T  L
Sbjct: 410 ----FNDPYDMYLVVSFISETRFL 429


>gi|357132340|ref|XP_003567788.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
           distachyon]
          Length = 1090

 Score = 67.4 bits (163), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 152/367 (41%), Gaps = 83/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G + P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCLRPTIVVLYQ------DNKDARHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  WS  NL + A  L+ VP+P+GGV+++G  TI Y + +++        
Sbjct: 197 LK-DKDFVEGP--WSQNNLDNGAGLLIPVPAPLGGVIIIGEETIVYCNANSTFK------ 247

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
             ++   Q + R+   V+ D +          LL   TG L LL +  +   V  L +  
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSR--------YLLGDNTGILHLLVLTQERERVTGLKIEH 297

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N + ++GSR GDS LV+                   +++ADA    
Sbjct: 298 LGETSVASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADATG-- 337

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
               S  + L+  VN   +  +     + +   +  T S A +D     G ++    G+ 
Sbjct: 338 ----SFVEVLERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSIRVVRNGIG 388

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++  KSS            ++D Y  +L++S 
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------FNDPYDTFLVVSF 422

Query: 584 EARTMVL 590
            + T  L
Sbjct: 423 ISETRFL 429


>gi|302788810|ref|XP_002976174.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
 gi|300156450|gb|EFJ23079.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
          Length = 1089

 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 122/557 (21%), Positives = 216/557 (38%), Gaps = 121/557 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+    ++A  L+ +    ++G + +L +    G      +D + ++ E  K  VL++D 
Sbjct: 39  RIEFHLLTAQGLQPLLDVPIYGRIATLELFRPPG----ETQDVLFVSTERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  VDP+ R  G+ +Y GL  +I   ++
Sbjct: 95  ETTELVTRAMGDVSD------RIGRPT-DNGQIGIVDPECRLIGLHLYDGLFKVIPIDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIAVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H              + P  W   NL + A  L+ VP+P+GGV+++G  T
Sbjct: 186 D-------ARHVKTYEIQLKEKDFGEGP--WLQNNLDNGAGMLIPVPTPLGGVIIIGEQT 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y+S SA  A+ +            + ++   V+ D +          LLS  TG L L
Sbjct: 237 IVYYSGSAFKAIPIR---------PSITKAYGKVDADGSR--------YLLSDHTGSLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     +   S ++ + N + ++GS  GDS L++            
Sbjct: 280 LVITHERDRVLGLKVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKLNA--------- 330

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVR 506
                    + D+      R S  + L+  VN G  + L           Q  T S A +
Sbjct: 331 ---------QVDS------RNSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  ASA            EL G KG+W++             
Sbjct: 376 D-----GSLRIVRNGIGINEQASA------------ELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVI 624
             A   D +  +L++S   E R + +   D L E TE   +  + +T+   N     ++I
Sbjct: 406 -RATSKDVFDIFLVVSFISETRILAMNMDDELEE-TEIEGFDSEAQTLFCHNAI-HDQII 462

Query: 625 QVFERGARILDGSYMTQ 641
           QV     R++D +   Q
Sbjct: 463 QVTSTSLRLVDATSRRQ 479


>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
 gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
           pigmentation protein 1; AltName: Full=UV-damaged
           DNA-binding protein 1
 gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
          Length = 1090

 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 113/507 (22%), Positives = 196/507 (38%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----ETQDLLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               +   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EASEVITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    I   W+  NL + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S SA  A+ +            + R+   V+ D +          LL    G 
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITRAYGRVDADGSR--------YLLGDHNGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L LL + ++   V  L +     + + S I+ + N+  F+GS  GDS LV+         
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNL------ 330

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                           P TK    S  + L+  VN   +  +       +   +  T S 
Sbjct: 331 ---------------QPDTK---GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++          
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                +A DD Y  +L++S  + T VL
Sbjct: 406 ----RSATDDPYDTFLVVSFISETRVL 428


>gi|159470707|ref|XP_001693498.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283001|gb|EDP08752.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 366

 Score = 67.4 bits (163), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 42/77 (54%), Gaps = 1/77 (1%)

Query: 923 HQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS-QGILKIC 981
           H G F++G+RP W +  R  L  H    +G + A T  HNVNC  GFI   S +G LK+C
Sbjct: 166 HSGVFVAGARPLWLVAGRGGLAAHAMWSEGPVAALTPFHNVNCPLGFITACSARGQLKVC 225

Query: 982 QLPSGSTYDNYWPVQKV 998
            LP  +  D  W  ++V
Sbjct: 226 CLPPHTRLDGAWATRRV 242



 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 47/89 (52%), Gaps = 6/89 (6%)

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDL------SFGPSNSESGSGSE 657
           +Y     TIAAGNLF    ++Q    G R+L+G  + QDL      + G   + S  G  
Sbjct: 4   EYITDQPTIAAGNLFHNAVIVQACPGGVRLLEGMSLVQDLPLSELQALGGVAAASRPGVA 63

Query: 658 NSTVLSVSIADPYVLLGMSDGSIRLLVGD 686
             T+  + +ADPYVL+ +S+G+  LL  D
Sbjct: 64  PPTITHMQVADPYVLVSLSNGTACLLEAD 92


>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1088

 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 112/513 (21%), Positives = 203/513 (39%), Gaps = 135/513 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    L+G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQTILDVPLYGRIATLELFRPHG----EAQDFLFVATERYKFCVLQWD- 93

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
                       +ES E +    G  S         G +  +DP  R  G+ +Y GL  +
Sbjct: 94  ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKV 141

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I   ++G                F+ R+E   V++++           F++G  +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK+   +   WS  NL + A  L+ VPSP+ 
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKEKDFVEGPWSQNNLDNGADLLIPVPSPLC 227

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S +A  A+ +            + ++   V+LD +          LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
              +G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++   
Sbjct: 271 GDHSGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           +++ DA        S  + L+  VN   +  +       +   +
Sbjct: 329 ----------------NLQPDATG------SYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428


>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1089

 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 122/556 (21%), Positives = 214/556 (38%), Gaps = 119/556 (21%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A+ L+ +    L+G + +L +    G      +D + ++FE  K  VL++D 
Sbjct: 39  RIEIHLLTASGLQSMLDVPLYGRIATLELFRPPG----ESQDVLFISFERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
              G  IT      S      + GR +   G +  VDP  R  G+ +Y     ++     
Sbjct: 95  ET-GSPITRAMGDVSD-----RTGRPT-DNGQIGIVDPDCRLIGLHLYDGMFKVIPIDNK 147

Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
           G               F+ R+E   V++++           F++G   P + +L++    
Sbjct: 148 GQ----------LKEAFNIRLEELQVLDIK-----------FLYGCANPTIAVLYQDNKD 186

Query: 270 WAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTI 329
                    H              + P  W   NL + A  L+ VP P+GG +++G  TI
Sbjct: 187 -------ARHVKTYEVNLKEKDFGEGP--WLQNNLDNGAGLLIPVPLPLGGAIIIGEQTI 237

Query: 330 HYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
            Y++ S   A+ +            + ++   V+ D +          LLS   G L LL
Sbjct: 238 VYYNGSVFKAIPIR---------PSITKAYGRVDSDGSR--------YLLSDHNGMLYLL 280

Query: 390 TVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
            + +D   V  L++     +   S ++ + N + F+GS  GDS L++             
Sbjct: 281 VISHDKERVSALNVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRL------------ 328

Query: 450 LKEEFGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRD 507
                 + +AD      ++ S  + L+  VN G  + L           Q  T S A +D
Sbjct: 329 ------NHQAD------VKGSYVEVLESFVNLGPIVDLCVVDLERQGQGQVVTCSGAFKD 376

Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
                G L+    G+ IN  AS            VEL G KG+W++   SS         
Sbjct: 377 -----GSLRIVRNGIGINEQAS------------VELQGIKGMWSLRASSS--------- 410

Query: 568 MAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
                D Y  +L++S   E R + + T D L E TE   +  + +T+   N     +++Q
Sbjct: 411 -----DVYDTFLVVSFISETRILAMNTDDELEE-TEIDGFDSEAQTLFCHNAV-HDQLVQ 463

Query: 626 VFERGARILDGSYMTQ 641
           V     R+++     Q
Sbjct: 464 VTAGSLRLVNAKTRKQ 479


>gi|219109892|ref|XP_002176699.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411234|gb|EEC51162.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1678

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 133/328 (40%), Gaps = 69/328 (21%)

Query: 251 FVHGYIEPVMVILHE--RELTWAGRVSWKHHTC-----MISALSISTTLKQHPLIWSAMN 303
           F+ GY+EPV+V+LH       W+GR+  +          ++ALSIS    +  ++WS + 
Sbjct: 243 FLSGYLEPVLVLLHSDVEGPVWSGRLGRERGVAGAPPLFVTALSISVVHGRTAVLWSQV- 301

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSL------DSSQELP 356
           +  DA K+L+      G LVVGANT +          +A+N +A S        + Q  P
Sbjct: 302 VSADATKILSFGKT--GCLVVGANTLVILEIGKVQQVIAMNGWARSTCPAALQTALQANP 359

Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLD-------------- 402
               +++LD    TWL    A+++ +TG L +L    D   V  L               
Sbjct: 360 VVKLAIQLDGCCVTWLSEHSAIMALRTGQLYVLQRTDDRWAVMPLGQTLGAVGEVAHLAS 419

Query: 403 --------LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE- 453
                   L K       +    +G  + F GSR GDSL + +      +M  + +K E 
Sbjct: 420 LPIGGLRWLEKMKMDENKASEMQMG--VLFAGSRTGDSLFLGYAL-EIVTMPWAAIKSEG 476

Query: 454 --FGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS----------ASNNTESAQ--- 498
             F + E    S        ++ L  ++  EE +LYG+           S   E+A    
Sbjct: 477 QTFINFEGSELSKVATTAPIANGLDRILQLEEEALYGTDRSTPLHIVRDSEEEETADIPS 536

Query: 499 -----KTFSFAV------RDSLVNIGPL 515
                +  +F V       D LVN+GPL
Sbjct: 537 DAKRLRPVAFTVVRTIVPLDVLVNLGPL 564


>gi|147779836|emb|CAN63685.1| hypothetical protein VITISV_020449 [Vitis vinifera]
          Length = 64

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/47 (74%), Positives = 42/47 (89%)

Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL 401
           +PRSSFSVELDAA+ATWL NDVA+LSTKTG+L+LLT+ YDGR+   L
Sbjct: 1   MPRSSFSVELDAANATWLSNDVAMLSTKTGELLLLTLXYDGRLFTDL 47


>gi|357135348|ref|XP_003569272.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
           distachyon]
          Length = 1074

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 112/482 (23%), Positives = 196/482 (40%), Gaps = 105/482 (21%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G +  +DPQ R  G+ +Y GL  +I                F + G           +N+
Sbjct: 118 GQIGVIDPQNRLIGLSLYDGLFKVI---------------PFDNKGNLK------EALNI 156

Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
           R L    V D  F++G   P +V+LH+           +H      AL     ++     
Sbjct: 157 R-LQEFLVLDIKFLYGCARPTVVVLHQ------DNKDSRHVKTYEVALEDKDFVEGS--- 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           WS  NL + A+  L +P P+GGV+++G +TI Y S +   AL++          Q + R+
Sbjct: 207 WSQSNLDNSAH--LLIPVPLGGVIIIGEHTIVYCSATTFKALSIK---------QSIIRA 255

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
              V+ D +   +  N        TG L L+ + ++   V  L       + + S I+ +
Sbjct: 256 VGRVDPDGSRYLYGDN--------TGALHLIVITHEWGRVTDLKTHYMGETSIASTISYL 307

Query: 419 GNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM 478
            + L ++GSR GDS L++                   +I+ADA +      S  + L+  
Sbjct: 308 DSGLVYIGSRFGDSQLIKL------------------NIQADASA------SFVEILEQF 343

Query: 479 VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSN 538
           +N   +  +           +  + +        G  KD S    I A  +   I+ Q++
Sbjct: 344 MNTGPIVDFCVVDTERRGQGQVITCS--------GAYKDGS----IRAVRNGVVITDQAS 391

Query: 539 YELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTE 598
              VEL G KG+W++  KSS     D+  +  + +E H +L +++E     LE  D+   
Sbjct: 392 ---VELRGMKGLWSM--KSSLNDPYDTFLVVTFINETH-FLAMNMENE---LEEVDIKGF 442

Query: 599 VTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQDLSFGPSNSESGSGSE 657
            +E+       +T+A G+     ++IQV  R  R++   S    D  F P+       + 
Sbjct: 443 DSET-------QTLACGSAI-HNQLIQVTSRSVRLVSSVSLELLDQWFAPARFSVNVAAA 494

Query: 658 NS 659
           N+
Sbjct: 495 NA 496


>gi|413946716|gb|AFW79365.1| hypothetical protein ZEAMMB73_562969 [Zea mays]
          Length = 1089

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 151/367 (41%), Gaps = 83/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F+HG  +P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLHGCAKPTIVVLYQ------DNKDVRHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  WS  N+ + A  L+ VP+P+GGV+++G   I Y + +++        
Sbjct: 197 LK-DKDFVEGP--WSQNNVDNGAGLLIPVPAPLGGVIIIGEEQIVYCNANSTFK------ 247

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
             ++   Q + R+   V+ D +          LL   TG L LL + ++   V  L +  
Sbjct: 248 --AIPIKQSIIRAYGRVDPDGSRY--------LLGDNTGILHLLVLTHERERVTGLKIEY 297

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N + ++GSR GDS LV+                   +++ADA    
Sbjct: 298 LGETSIASSISYLDNGVVYVGSRFGDSQLVKL------------------NLQADASG-- 337

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
               S  + L+  VN   +  +     + +   +  T S A +D     G L+    G+ 
Sbjct: 338 ----SFVEILERYVNLGPIVDFCVVDLDRQGQGQVVTCSGAFKD-----GSLRVVRNGIG 388

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++  KSS             +D +  YL++S 
Sbjct: 389 INEQAS------------VELQGIKGLWSL--KSS------------INDPFDMYLVVSF 422

Query: 584 EARTMVL 590
            + T  L
Sbjct: 423 ISETRFL 429


>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
 gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName:
           Full=UV-damaged DNA-binding protein 1b; Short=DDB1b
 gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
 gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
          Length = 1088

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    L+G + ++ +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
                       +ES E +    G  S         G +  +DP  R  G+ +Y GL  +
Sbjct: 94  ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I   ++G                F+ R+E   V++++           F++G  +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK    +   WS  NL + A  L+ VPSP+ 
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S +A  A+ +            + ++   V+LD +          LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
               G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++   
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           +++ DA      + S  + L+  VN   +  +       +   +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428


>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
          Length = 1088

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 111/513 (21%), Positives = 202/513 (39%), Gaps = 135/513 (26%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    L+G + ++ +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQTILDVPLYGRIATMELFRPHG----EAQDFLFVATERYKFCVLQWD- 93

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRES------FARGPLVKVDPQGRCGGVLVY-GLQMI 202
                       +ES E +    G  S         G +  +DP  R  G+ +Y GL  +
Sbjct: 94  ------------YESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKV 141

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I   ++G                F+ R+E   V++++           F++G  +P + +
Sbjct: 142 IPFDNKGQLK-----------EAFNIRLEELQVLDIK-----------FLYGCTKPTIAV 179

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK    +   WS  NL + A  L+ VPSP+ 
Sbjct: 180 LYQ------DNKDARH------VKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLC 227

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S +A  A+ +            + ++   V+LD +          LL
Sbjct: 228 GVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDLDGSR--------YLL 270

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
               G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++   
Sbjct: 271 GDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL-- 328

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           +++ DA      + S  + L+  VN   +  +       +   +
Sbjct: 329 ----------------NLQPDA------KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQ 366

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 367 VVTCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KS 407

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 408 S------------IDEAFDTFLVVSFISETRIL 428


>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine
           max]
          Length = 1089

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 110/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ETAELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL     L + P  WS  NL + A  L+ VP P+ GVL++G  T
Sbjct: 183 ---DNKDARHVKTYEVALKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + ++   V+ D +          LL   TG L L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLLSL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS L++            
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NLQPDA------KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|449488592|ref|XP_004158102.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
           [Cucumis sativus]
          Length = 570

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 108/504 (21%), Positives = 194/504 (38%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A  L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR + + G +  +DP  R  G+ +Y GL  +I     
Sbjct: 95  ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVI----- 142

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
                            F  + +     N+R L+   V D  F++G   P +V+L++   
Sbjct: 143 ----------------PFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSRPTIVVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H      +       + P  WS  NL + A  L+ VP P+ GV+++G  T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS LV+            
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|157128864|ref|XP_001655231.1| DNA repair protein xp-e [Aedes aegypti]
 gi|108882186|gb|EAT46411.1| AAEL002407-PB [Aedes aegypti]
          Length = 1138

 Score = 64.3 bits (155), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D DT              H +  
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154

Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
             L M+  HV+D  F++G   P ++++H+        ++ +H    I    I+   K   
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
            I W   N+  +A  L+ VP+P+GG +V+G  ++ YH   +  A+A        +N YA 
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                      + S  L        +N   LLS K   + LL  +             T 
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
           P      IT + N + F+GSR GDS LV+    +G +     + E F ++   AP     
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
                  L+    G+ ++  GS                       G L+    G+ I   
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           A             ++LPG KG+W +             R+   D  Y   L++S    T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
            +L  +    E TE   +    +T    N+ FG  ++IQV    AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480


>gi|157128866|ref|XP_001655232.1| DNA repair protein xp-e [Aedes aegypti]
 gi|108882187|gb|EAT46412.1| AAEL002407-PA [Aedes aegypti]
          Length = 980

 Score = 64.3 bits (155), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 109/468 (23%), Positives = 174/468 (37%), Gaps = 119/468 (25%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D DT              H +  
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154

Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
             L M+  HV+D  F++G   P ++++H+        ++ +H    I    I+   K   
Sbjct: 155 TSLRMEEVHVQDVEFLYGTQHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
            I W   N+  +A  L+ VP+P+GG +V+G  ++ YH   +  A+A        +N YA 
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYAR 263

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                      + S  L        +N   LLS K   + LL  +             T 
Sbjct: 264 VDSKGFRYLLGNMSGHLFMMFLETEENSKGLLSVKDIKVELLGDI-------------TI 310

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
           P      IT + N + F+GSR GDS LV+    +G +     + E F ++   AP     
Sbjct: 311 PEC----ITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMETFTNL---APIIDMC 363

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
                  L+    G+ ++  GS                       G L+    G+ I   
Sbjct: 364 IVD----LEKQGQGQMITCSGSYKE--------------------GSLRIIRNGIGIQEH 399

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           A             ++LPG KG+W +             R+   D  Y   L++S    T
Sbjct: 400 AC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLSFVGHT 434

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
            +L  +    E TE   +    +T    N+ FG  ++IQV    AR++
Sbjct: 435 RILTLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPTTARLI 480


>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine
           max]
          Length = 1089

 Score = 63.9 bits (154), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 108/504 (21%), Positives = 200/504 (39%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  +S   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLSPQGLQPMLDVPIYGRIATLELFRPHG----EAQDYLFIATERYKFCVLQWDS 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ETGELVTRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  NL + A  L+ VP P+ GVL++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + ++   V+ D +          LL   TG + L
Sbjct: 237 IVYCSANAFKAIPIR---------PSITKAYGRVDPDGSR--------YLLGDHTGLVSL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L ++++   V  L +     + + S I+ + N+  ++GS  GDS L++            
Sbjct: 280 LVIIHEKEKVTGLKIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|119580419|gb|EAW60015.1| hCG2010549, isoform CRA_a [Homo sapiens]
          Length = 323

 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 26/56 (46%), Positives = 38/56 (67%)

Query: 943 LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
           LR+HP   +G + +F + HNVNC  GF+Y   QG L+I  LP+  +YD+ WPV+K+
Sbjct: 184 LRLHPVGINGPVNSFALFHNVNCPRGFLYFNRQGKLRISVLPAYLSYDSPWPVRKI 239


>gi|357623954|gb|EHJ74904.1| putative DNA repair protein xp-e [Danaus plexippus]
          Length = 1128

 Score = 63.2 bits (152), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 102/460 (22%), Positives = 176/460 (38%), Gaps = 107/460 (23%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G L  +DPQ R  G+ +Y     I+   +  + L             S R+E    +N+ 
Sbjct: 120 GILAVIDPQARVIGLRLYDGLFKIIPLDKDSTEL----------KAASLRLEE---LNVY 166

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
           DL+        F+HG   P ++++H+        ++ +H    I    I+   K+   I 
Sbjct: 167 DLE--------FLHGCSNPTLILIHQD-------LNGRH----IKTHEINLRDKEFMKIP 207

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSP+GG +V+G  +I YH   +  A+A       ++        
Sbjct: 208 WKQDNVETEASILIPVPSPLGGAIVIGQESIVYHDGQSYVAVAPPQIKTPINC------- 260

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----VVQRLDLSKTNPSVLTSD 414
                +D     +L  D+A      G L +L +    R     V+ L +       +   
Sbjct: 261 --YCRVDVRGLRYLLGDIA------GRLFMLLLELSERDGTASVRDLKVELLGDIPIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           +T + N + F+GSRLGDS LV+                    +  DA    +   + + +
Sbjct: 313 MTYLDNGVVFVGSRLGDSALVRLAA-----------------VRDDASQYVQPMETFT-S 354

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L     N   +    F          +G L+    G+ I   AS     
Sbjct: 355 LAPIVDMCVVDLERQGQNQLITCSGAF---------KMGSLRIIRNGIGIQEQAS----- 400

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +    + G              +H  L++S   +T VL    
Sbjct: 401 -------IDLPGIKGMWAL----TLGQGP-----------HHDTLVLSFVGQTRVLTLNG 438

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
              E TE   +    +T   GN+    ++IQV + G R++
Sbjct: 439 EEVEETEIKGFVSDRQTFFTGNVC-HDQLIQVTDEGIRLI 477


>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
 gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
          Length = 1140

 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 114/267 (42%), Gaps = 51/267 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+++Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D     +      LL    G L +L +       G  V+ + + K     +   
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
           IT + N   ++G+R GDS LV+ +  S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339


>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
          Length = 1093

 Score = 62.8 bits (151), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 107/504 (21%), Positives = 197/504 (39%), Gaps = 117/504 (23%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A  L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTAQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR + + G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPTDS-GQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G   P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCSRPTIVVLYQDNK 185

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                     H      +       + P  WS  NL + A  L+ VP P+ GV+++G  T
Sbjct: 186 D-------ARHVKTYEVVLKDKDFVEGP--WSQNNLDNGAAVLIPVPPPLCGVIIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +A  A+ +            + R+   V+ D +          LL    G L L
Sbjct: 237 IVYCSATAFKAIPVR---------PSITRAYGRVDADGSR--------YLLGDHAGLLHL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSS 448
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS LV+            
Sbjct: 280 LVITHEKERVTGLKIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKL----------- 328

Query: 449 GLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVR 506
                  +++ DA      + S  + L+  VN   +  +       +   +  T S A +
Sbjct: 329 -------NVQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 375

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSS 566
           D     G L+    G+ IN  AS            VEL G KG+W++             
Sbjct: 376 D-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL------------- 405

Query: 567 RMAAYDDEYHAYLIISLEARTMVL 590
             ++ DD +  +L++S  + T +L
Sbjct: 406 -RSSTDDPFDTFLVVSFISETRIL 428


>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
 gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
          Length = 1140

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 67/267 (25%), Positives = 113/267 (42%), Gaps = 51/267 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+++Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMVLYQGLFTIIPMDKEASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELSVYDVEFLHGCLNPTIIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D     +      LL    G L +L +       G  V+ + + K     +   
Sbjct: 260 CYA-RVDGKGLRY------LLGNMDGQLYMLFLGTSETSKGVTVKDIKVEKLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGS 441
           IT + N   ++G+R GDS LV+ +  S
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLSSES 339


>gi|167998730|ref|XP_001752071.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162697169|gb|EDQ83506.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 172

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 8/96 (8%)

Query: 114 ESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRG 173
           E+ A+ ++  A    RR S+ +        +L F +     R   +HCFE PE+ +L R 
Sbjct: 28  ETAALRTEAAAPGIHRRPSLTMRLR----IILAFTEC----RCLLIHCFEYPEYQYLNRS 79

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
           RE FA    V+ D  GRC  VL+Y  Q++ LKA  G
Sbjct: 80  RERFAMDLSVRADLVGRCASVLIYNSQLVTLKAGHG 115


>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
 gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
          Length = 1102

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 85/370 (22%), Positives = 150/370 (40%), Gaps = 90/370 (24%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P + +L++           +H       
Sbjct: 158 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCTKPTIAVLYQ------DNKDARH------V 204

Query: 286 LSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
            +   +LK    +   WS  NL + A  L+ VPSP+ GVL++G  TI Y S +A  A+ +
Sbjct: 205 KTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSANAFKAIPI 264

Query: 343 NNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLD 402
                       + ++   V+LD +          LL    G + LL + ++   V  L 
Sbjct: 265 R---------PSITKAYGRVDLDGSR--------YLLGDHAGLIHLLVITHEKEKVTGLK 307

Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
           +     + + S I+ + N++ F+GS  GDS L++                   +++ DA 
Sbjct: 308 IELLGETSIASSISYLDNAVVFVGSSYGDSQLIKL------------------NLQPDA- 348

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSY 520
                + S  + L+  VN   +  +       +   +  T S A +D     G L+    
Sbjct: 349 -----KGSYVEILEKYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRN 398

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
           G+ IN  AS            VEL G KG+W++  KSS             D+ +  +L+
Sbjct: 399 GIGINEQAS------------VELQGIKGMWSL--KSS------------IDEAFDTFLV 432

Query: 581 ISLEARTMVL 590
           +S  + T +L
Sbjct: 433 VSFISETRIL 442


>gi|410079681|ref|XP_003957421.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
 gi|372464007|emb|CCF58286.1| hypothetical protein KAFR_0E01320 [Kazachstania africana CBS 2517]
          Length = 1350

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 144/669 (21%), Positives = 270/669 (40%), Gaps = 133/669 (19%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L L   ++  G +  +A+L +  A      D ++L    AKIS+++FD   + +   S+H
Sbjct: 48  LFLTNEFKFDGRITDIALLPRQDA----ALDYLLLCTAVAKISIVKFDLESNSIETVSLH 103

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC------GGVLVYGLQMIILKASQGGSGLV 214
            +E  ++  L        R   +++DP  RC        + V    M   +      G  
Sbjct: 104 YYED-KFKDLSLAE--LTRESKLRLDPASRCLVLFNEDNIAVLPFVMKEDEEDDDEEGEE 160

Query: 215 GDEDTFGSG-GGFSARIE-----SSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHER 266
            DEDT+      F A I       S +++ + +  D++++ D  F++ Y +P + IL++ 
Sbjct: 161 EDEDTYEPRIKRFRANINGRVTFPSTILSAKTIHEDIQNIIDIEFLNNYSKPTVAILYQP 220

Query: 267 ELTWAGRVSWKH--HTCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
           +LTW G +         +I  L  +T    T   H +I     LP D ++L+ V +   G
Sbjct: 221 KLTWVGNLQLHPLPTKLLIVTLECNTNGFETSLSHIVIARLNELPWDWHRLIPVTN---G 277

Query: 321 VLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSF---SVELDAAHATWLQND 375
           +++VG N + Y   +      + LN++A      + L +S     S E    + + +++ 
Sbjct: 278 IVIVGINELAYVDNTGVLQTVILLNSFA-----DRNLKKSRIIDHSKEESVFNHSAMKH- 331

Query: 376 VALLSTKTGD---------------LVLLTVVYDGRVVQRLDLSK--------------T 406
           + +L T  G+               L  + ++ +GR++ + D+ K              T
Sbjct: 332 ICILKTTDGNEDDADLLLLMDDRSNLYYVQMISEGRLMTQFDIIKLPIINNIFINNLNPT 391

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
           + S L S  + +   LFF G + GD+    F C       +   ++E  D+  D PS   
Sbjct: 392 SISRLDSSSSRVNLDLFF-GFQSGDA----FVCRLNNIKSAVETRKEHKDV-LDYPS--- 442

Query: 467 LRRSSSDALQDMVNGEEL----SLYGSASNNTESAQ-------------KTFSFAVRDSL 509
               ++D   +  +G +L     LY   + +T+ A              + F  A+  SL
Sbjct: 443 ----NADEYDE--DGADLYGDDDLYSDEATSTQRANSKENGRSNMIETVEPFDIALLSSL 496

Query: 510 VNIGPLKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADS 565
            NIGPL   + G     D +  G+S  +N EL  +     G     T    S R     +
Sbjct: 497 NNIGPLTSLTSGKVSAVDQNNKGLSNPNNNELSIVATSGNGTGSHLTAVLPSVRPEIELA 556

Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADL------LTEVTESVDYFVQGR-----TIAA 614
            +  +    ++    +  + +   L T D       + E+  +     +GR     T  +
Sbjct: 557 LKFISITQIWN----LKFKGKDKFLVTTDSTKSKSDIYEIDNNFALHREGRLRRDATTVS 612

Query: 615 GNLFGR-RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLL 673
             +FG  +R++QV      +LD ++               +   +  V+ VS+ DPY+L+
Sbjct: 613 IAMFGSDKRIVQVTTNHLYLLDTTF-----------RRLNTIKFDYEVVHVSVMDPYILI 661

Query: 674 GMSDGSIRL 682
            +S G I++
Sbjct: 662 TVSRGDIKV 670


>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine
           max]
          Length = 1068

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 85/367 (23%), Positives = 148/367 (40%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L     L + P  WS  NL + A  L+ VP P+ GVL++G  TI Y S +A  A+ +   
Sbjct: 176 LKDKDFL-EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL   TG L LL + ++   V  L +  
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLLSLLVITHEKEKVTGLKIEP 275

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                   +++ DA    
Sbjct: 276 LGETSIASTISYLDNAFVYIGSSYGDSQLIKL------------------NLQPDA---- 313

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 314 --KGSYVEGLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400

Query: 584 EARTMVL 590
            + T +L
Sbjct: 401 ISETRIL 407


>gi|340059653|emb|CCC54046.1| putative mitochondrial carrier protein [Trypanosoma vivax Y486]
          Length = 1481

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 61/231 (26%), Positives = 103/231 (44%), Gaps = 41/231 (17%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMISALSISTTL 292
           +++V+D  F+    EP++ IL ER+ TWAGRV    W+         T  ++ + IS ++
Sbjct: 268 IRYVRDLQFIGSSGEPLLAILCERQPTWAGRVKLVEWRTKVVESNTLTMHVTWVQISASM 327

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHYHSQSASCALALNNY- 345
             HP   LI     +P++   +L V      + GV+  G N I + +         N+  
Sbjct: 328 TAHPKLLLIGEVEGVPYNVTHMLPVEPFSQTMSGVVCFGTNVIMHITTKRGYGAYFNDTG 387

Query: 346 ----------AVS----------LDSSQELPRSSFSVELDAAHATW--LQNDVALLSTKT 383
                     AVS          LD S  L R + S+   AA +    + +++ +L+   
Sbjct: 388 REECINSKFSAVSFGKAVWSDPQLDKSSALARVNMSLANCAATSMVGKMGDELQVLALLE 447

Query: 384 GDLVLLTV--VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
            D V++T+  V  G  V+ + ++        S ++ IG  L FLGS +GDS
Sbjct: 448 EDGVVITLHFVARGSSVEEVRITMLGSGCYCSSVSRIGRQLVFLGSTVGDS 498


>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
 gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
          Length = 1088

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 109/507 (21%), Positives = 193/507 (38%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ ++ ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEINLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ETSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+HG  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLHGCSKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +    LK    I   WS  NL + A  L+ VP P  GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVALKDKDFIEGPWSQNNLDNGADLLIPVPPPFCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S +   A+ +            + ++   V+ D +          LL    G 
Sbjct: 234 EETIVYCSANVFRAIPIR---------PSITKAYGRVDADGSR--------YLLGDHAGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L LL + ++   V  L +     + + S I+ + N+  F+GS  GDS LV+         
Sbjct: 277 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA  T        + L   VN   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDAKGT------YVEVLDRYVNLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++          
Sbjct: 373 AYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL---------- 405

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                +  DD +  +L++S  + T +L
Sbjct: 406 ----RSLTDDPFDTFLVVSFISETRIL 428


>gi|345498295|ref|XP_001607743.2| PREDICTED: DNA damage-binding protein 1-like [Nasonia vitripennis]
          Length = 1140

 Score = 60.8 bits (146), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 88/420 (20%), Positives = 160/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D ++V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MDEQNVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +         Y   +    +    S
Sbjct: 211 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTT--------YVTVVPPIIKQSTIS 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L +  D +     V++ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDLA------GHLFMLFLEQDKKADGSMVIKDLKVELLGEVSIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS L++       +       E F ++   AP    +   +   
Sbjct: 317 ITYLDNGVIFIGSRLGDSQLIKLNTKPDENGSYCSTMETFTNL---AP----IVDMAVVD 369

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L+    G+ ++  G+                       G L+    G+ I   AS     
Sbjct: 370 LERQGQGQIVTCSGAFKE--------------------GSLRIIRNGIGIQEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +   S    N                L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWALKVDSVNFDNT---------------LVLSFVGQTRILMLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E TE   +    +T   GN+     +IQ+    AR++     +    + P N  + S
Sbjct: 443 EEVEETEIPGFVADEQTFHTGNV-TNDVIIQITPTSARLISNKSSSVISEWEPDNKRTIS 501


>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine
           max]
          Length = 1068

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 83/367 (22%), Positives = 148/367 (40%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 123 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 175

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  WS  NL + A  L+ VP P+ GVL++G  TI Y S +A  A+ +   
Sbjct: 176 LK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-- 230

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL   TG + LL ++++   V  L +  
Sbjct: 231 -------PSITKAYGRVDPDGSR--------YLLGDHTGLVSLLVIIHEKEKVTGLKIEP 275

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                   +++ DA    
Sbjct: 276 LGETSIASTISYLDNAFVYVGSSYGDSQLIKL------------------NLQPDA---- 313

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 314 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRVVRNGIG 366

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 367 INEQAS------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSF 400

Query: 584 EARTMVL 590
            + T +L
Sbjct: 401 ISETRIL 407


>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
 gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
          Length = 1140

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 113/264 (42%), Gaps = 51/264 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKEASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLITVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D+    +      LL    G L +L +       G  V+ + + +     +   
Sbjct: 260 CYA-RVDSKGFRY------LLGNMDGQLYMLFLGTSETSKGITVKDIKVEQLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
           IT + N   ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336


>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
 gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
          Length = 1140

 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 65/263 (24%), Positives = 114/263 (43%), Gaps = 49/263 (18%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       +LR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATSLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
            +D   V D  F+HG + P ++++H+           +H  C    L     +K   L W
Sbjct: 159 -MDELIVYDVEFLHGCLNPTVIVIHKDN-------DGRHVKCHEINLRDKEFMK---LAW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   + 
Sbjct: 208 KQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRV--VQRLDLSKTNPSVLTSDI 415
           ++  +D+    +      LL    G L +L +  +  G+V  V+ + + +     +   I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGINETGKVPTVKDIKVEQLGEISIPECI 313

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
           T + N   ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336


>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1138

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 105/473 (22%), Positives = 175/473 (36%), Gaps = 129/473 (27%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L  +DP+ R  G+ +Y GL  II            D DT              H +  
Sbjct: 119 GILAVIDPKARVIGMRLYEGLFKIIPL----------DRDT--------------HELKA 154

Query: 239 RDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
             L M+  HV+D  F++G   P ++++H+        ++ +H    I    I+   K   
Sbjct: 155 TSLRMEEMHVQDVEFLYGTAHPTLIVIHQD-------LNGRH----IKTHEINLKDKDFT 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--------LNNYAV 347
            I W   N+  +A  L+ VP+P+GG +V+G  ++ YH   +  A+A        +N YA 
Sbjct: 204 KIAWKQDNVETEATMLIPVPTPLGGAIVIGQESVVYHDGDSYVAVAPAIIKQSTINCYA- 262

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTN 407
                           +D+    +      LL    G L ++ +  +     +L +    
Sbjct: 263 ---------------RVDSRGFRY------LLGNMIGHLFMMFLETEENTRGQLTVKDIK 301

Query: 408 PSVL-----TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAP 462
             +L        IT + N + F+GSR GDS LV+    +  S     + E F ++   AP
Sbjct: 302 VELLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAAASGAYVTVMETFTNL---AP 358

Query: 463 STKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
                       L+    G+ ++  GS                       G L+    G+
Sbjct: 359 IIDMCIVD----LERQGQGQMITCSGSYKE--------------------GSLRIIRNGI 394

Query: 523 RINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIIS 582
            I   A             ++LPG KG+W +             R+   D  Y   L++S
Sbjct: 395 GIQEHAC------------IDLPGIKGMWAL-------------RVGIDDSPYDNTLVLS 429

Query: 583 LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNL-FGRRRVIQVFERGARIL 634
               T +L  +    E TE   +    +T    N+ FG  ++IQV    AR++
Sbjct: 430 FVGHTRILMLSGEEVEETEIPGFLSDQQTFYCANVDFG--QIIQVTPMTARLI 480


>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
 gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
          Length = 1171

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 79/349 (22%), Positives = 145/349 (41%), Gaps = 60/349 (17%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++A  L+ +    L+G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTAQGLQSILDVPLYGRIATLELFRPHG----ETQDFLFIATERYKFCVLQWDT 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   SM           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  EKSELVTRSMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F++G  +P +V+L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLYGCPKPTIVVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
                   +H      AL       + P  WS  +L + A  L+ VP P+ GVL++G  T
Sbjct: 183 ---DNKDARHVKTYEVALK-DKDFVEGP--WSQNSLDNGADLLIPVPPPLCGVLIIGEET 236

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I Y S +   A+ +            + ++   V+ D +          LL   TG L L
Sbjct: 237 IVYCSANGFKAIPIR---------AAITKAYGRVDPDGSRY--------LLGDHTGLLSL 279

Query: 389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L + ++   V  L +     + + S I+ + N+  ++GS  GDS L++ 
Sbjct: 280 LVITHEKEKVTGLKIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKL 328


>gi|194901554|ref|XP_001980317.1| GG19434 [Drosophila erecta]
 gi|190652020|gb|EDV49275.1| GG19434 [Drosophila erecta]
          Length = 1140

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCMNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
          Length = 1135

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 96/418 (22%), Positives = 166/418 (39%), Gaps = 94/418 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
             A+ +     N+R L+   V D  F+ G  +  + +L++ +       + +H    I  
Sbjct: 143 MDAKGQLKDAFNIR-LEELEVLDIQFLSGCPKATIAVLYQDQR------NARH----IKT 191

Query: 286 LSISTTLKQHPL-IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNN 344
            +IST  K+     W+ +N+ H+A +L+ VP+P GGVL++G  TI YHS  A   + + N
Sbjct: 192 YTISTRDKEFDTGPWAQLNVEHNASELIPVPAPFGGVLILGHQTICYHSGKAFITIPIQN 251

Query: 345 YAVSLDSSQELPRSSFSVELDAAHATWLQNDVA--LLSTKTGDL--VLLTVVYDGRVVQR 400
             +                   A+  W+  D +  L+S  +G L  V+LT       V+ 
Sbjct: 252 TRM------------------CAYG-WVDADGSRLLVSDHSGGLHVVILTPDATNTAVET 292

Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
             +     +   S I+ + N + F+GS  GDS L++                   + E D
Sbjct: 293 AHIEALGETSCASSISYLDNGVVFIGSASGDSQLIKL------------------NPEKD 334

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
           A  T      + D L  +++              +    T S   +D     G L+    
Sbjct: 335 AQGTYIQVLETYDNLGPILD----MCVADLDRQGQGQAVTCSGCSKD-----GSLRIIRN 385

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
           G+ IN  A+            +EL G KG+W++     R  N +  +          YL+
Sbjct: 386 GIGINEHAA------------IELAGIKGMWSL-----RPSNTNHDK----------YLV 418

Query: 581 ISLEARTMVL---ETADLLTEVTE-SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
            +  + T VL   E  D   ++ E  +  F +G T+  G   G    +QV +RG  ++
Sbjct: 419 QAFISETRVLAFEEDEDGDHQLAEGEIAGFQEGCTLFCG-CVGGNMAVQVTKRGVVLI 475


>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
 gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
          Length = 1140

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|405970039|gb|EKC34976.1| DNA damage-binding protein 1 [Crassostrea gigas]
          Length = 1160

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/231 (23%), Positives = 101/231 (43%), Gaps = 45/231 (19%)

Query: 225 GFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW--------AGRVSW 276
            F+ R+E   VI+++           F+HG   P ++++H+  L             +S+
Sbjct: 154 AFNIRLEELTVIDIQ-----------FLHGCTTPTLILIHQANLNCYHLMTLCITNLLSF 202

Query: 277 KH--HTCMISALSISTTLKQ-HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHS 333
           K   H   +    IS   K+     W   N+  +A  L+AVP P GG L++G  +I YH 
Sbjct: 203 KQDQHGRHVKTYEISLRDKEFQKGPWKQDNVETEACMLIAVPEPFGGALIIGQESITYHK 262

Query: 334 QSASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                 +A             + +S+ +   ++DA  + +L  D+       G L +L +
Sbjct: 263 GDNFIPIA----------PPAIKQSTLTCYGKVDANGSRYLLGDMM------GRLFMLML 306

Query: 392 VYDGRV-----VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
             + ++     V+ L +     + +   IT + N++ ++GSRLGDS LV+ 
Sbjct: 307 EKEEKMDSTVTVKDLKVELLGETTIAECITYLDNAVVYIGSRLGDSQLVKL 357


>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
 gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
           Full=Damage-specific DNA-binding protein 1; AltName:
           Full=Protein piccolo
 gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
           melanogaster]
 gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
 gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
          Length = 1140

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 108/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D  +V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
 gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
          Length = 1033

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 146/362 (40%), Gaps = 85/362 (23%)

Query: 231 ESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST 290
           E+S +I    L+   V D  F++G  +P +V+L++           +H      AL    
Sbjct: 95  ETSELIT--RLEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVALK-DK 145

Query: 291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLD 350
              + P  W+  NL + A  L+ VP P+ GVL++G  TI Y S +A  A+ +        
Sbjct: 146 DFGEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR------- 196

Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSV 410
               + R+   V+ D +          LL    G L LL + ++   V  L +     + 
Sbjct: 197 --PSITRAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETS 246

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
           + S I+ + N++ ++GS  GDS LV+                   +++ DA      + S
Sbjct: 247 IASTISYLDNAVVYIGSSYGDSQLVKL------------------NLQPDA------KGS 282

Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLRINADA 528
             + L+  VN   +  +       +   +  T S A +D     G L+    G+ IN  A
Sbjct: 283 YVEVLESYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIGINEQA 337

Query: 529 SATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
           S            VEL G KG+W++               ++ DD +  +L++S  + T 
Sbjct: 338 S------------VELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISETR 371

Query: 589 VL 590
           +L
Sbjct: 372 IL 373


>gi|407410979|gb|EKF33219.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi marinkellei]
          Length = 1436

 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 63/260 (24%), Positives = 105/260 (40%), Gaps = 54/260 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
           NN  +                   D   E   + F V L  A  T     + N+   + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVSWYDPKMETSTALFKVNLSLASCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S IT IG+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQDIRIAILGTGCYCSGITRIGDQIVFLGSAFGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIE 458
           C +   M  S   + F  IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499


>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
 gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
          Length = 1140

 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 65/264 (24%), Positives = 112/264 (42%), Gaps = 51/264 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTIIPMEKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELMVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D+    +      LL    G L +L +       G  V+ + + +     +   
Sbjct: 260 CYA-RVDSKGLRY------LLGNMHGQLYMLFLGTSESSKGITVKDIKVEQLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
           IT + N   ++G+R GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGARHGDSQLVRLS 336


>gi|350410909|ref|XP_003489174.1| PREDICTED: DNA damage-binding protein 1-like [Bombus impatiens]
          Length = 1141

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 90/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501


>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
 gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
          Length = 1140

 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 107/264 (40%), Gaps = 53/264 (20%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GVMAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H+      GR    H         I+   K+   I 
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLREKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+          
Sbjct: 207 WKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA--------PL 251

Query: 359 SFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVLTS 413
           +F       +A    N +  LL    G L +L +       G  V+ + + +     +  
Sbjct: 252 TFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTSETSKGVTVKDIKVEQLGEISIPE 311

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            IT + N   ++G+R GDS LV+ 
Sbjct: 312 CITYLDNGFLYIGARHGDSQLVRL 335


>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
 gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
          Length = 1114

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 84/197 (42%), Gaps = 22/197 (11%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
           L+  +V D  F+HG   P + +L+E             H           TL+  P  WS
Sbjct: 157 LEELNVVDVKFMHGCATPTICVLYED-------TKEARHVKTYEVDVKEKTLRDGP--WS 207

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
             ++   +  ++ VP+P+GG +VVG + I Y ++                +      ++ 
Sbjct: 208 QSDVEGGSSLIIPVPAPLGGAIVVGESVIVYLNKDGG-------------NGAGGAIATK 254

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
           SV + A           LLS  TG L LL +V+D R V  L L     + + S ++ + N
Sbjct: 255 SVNVMAHGVVDADGSRYLLSDSTGMLHLLVLVHDRRRVHALKLESLGQTSIASTLSYLDN 314

Query: 421 SLFFLGSRLGDSLLVQF 437
            + ++GS  GDS LV+ 
Sbjct: 315 GVVYVGSAYGDSQLVRL 331


>gi|328788389|ref|XP_396048.3| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Apis
           mellifera]
          Length = 1141

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKADGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501


>gi|380025901|ref|XP_003696702.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
           [Apis florea]
          Length = 1141

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 91/420 (21%), Positives = 161/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKTDGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSRLGDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRLGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIVDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KIGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISYETKTVVSEWEPENKRTIS 501


>gi|307205760|gb|EFN83990.1| DNA damage-binding protein 1 [Harpegnathos saltator]
          Length = 1138

 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 50/205 (24%), Positives = 94/205 (45%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYIAVV----------PPIIKQST 257

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +  + +     VV+ L +       + 
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTQVVKDLKVELLGEISIP 311

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + F+GSRLGDS L++ 
Sbjct: 312 ECITYLDNGVIFVGSRLGDSQLIKL 336


>gi|224000243|ref|XP_002289794.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975002|gb|EED93331.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1820

 Score = 58.2 bits (139), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 91/410 (22%), Positives = 145/410 (35%), Gaps = 141/410 (34%)

Query: 246 VKDFIFVHGYIEPVMVILHERE-----LTWAGRVSWKHHTCM------------------ 282
           + D  F+ GYIEP +++LH          WAGR+       +                  
Sbjct: 398 IVDIAFLSGYIEPTLLVLHSNPKRGGGRAWAGRLGRTEEVPLSNNGGSGESKDDYGEDID 457

Query: 283 -----------------------ISALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPSPI 318
                                  ++A+S++   ++  ++WS ++ LP DA+KL  VP P 
Sbjct: 458 LEGGDAAKKGPDLVSTGTKYGLSLTAISLAIHQRRSVVLWSLLDALPADAWKL--VPHPS 515

Query: 319 GGVLVVGANTIHYHSQSA--SCALALNNY------------------AVSLDSSQELPRS 358
            GV+V G NT  Y S     SCALA N +                  AV L+ +   P  
Sbjct: 516 DGVIVWGVNTAVYVSMGGKISCALAANGFAKIGCPIGLIPPSGRIGSAVYLEPNPS-PLP 574

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV--------------------------- 391
             +++LD A   ++  DVA++    G L  L +                           
Sbjct: 575 MLALQLDGARVGFVTEDVAIVCLGNGSLYSLELHRAKSMVSPSMFLSMSPLGHRVGGLGV 634

Query: 392 -------------------VYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFFLGSRL 429
                              + D   V+  D +K   +  SV    I + G  L F GSR+
Sbjct: 635 ASCLSVLAMACHSNSVGHFLVDNEGVKDEDHAKETISKESVSGPKIRSRG--LIFAGSRM 692

Query: 430 GDSLLVQFT---------------CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SD 473
           GD  L+ F+                G+G   L     E+   +    P+ K+L++   S 
Sbjct: 693 GDCSLLAFSLNVPIHLVITDVDSETGAGKRKLGGSRPEQLSSMP--EPAQKQLKKEEISP 750

Query: 474 ALQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
           +  D  +GEE  +    S   +  +     + +  DSL  +GPL    YG
Sbjct: 751 SRTDSEDGEEDIVCAMSSPRRSVRTLSMFRTVSALDSLTGLGPLGQGCYG 800


>gi|71654693|ref|XP_815961.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
           strain CL Brener]
 gi|50363265|gb|AAT75335.1| cleavage polyadenylation specificity factor CPSF160 [Trypanosoma
           cruzi]
 gi|70881056|gb|EAN94110.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 1436

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 62/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
           NN  +                   D   E   + F V L  A+ T     + N+   + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S I  IG+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
           C +   M  S + + F  IE+
Sbjct: 480 CIAKVDMFHSDVAKRFQIIES 500


>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
 gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
          Length = 1140

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 66/264 (25%), Positives = 109/264 (41%), Gaps = 51/264 (19%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI- 298
            +D   V D  F+HG + P ++++H       GR    H         I+   K+   I 
Sbjct: 159 -MDELTVYDVEFLHGCLNPTVIVIHRDN---DGRHVKSHE--------INLRDKEFMKIA 206

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  L+ VPSPI GV+V+G  +I YH  S       N +AV+  + ++   +
Sbjct: 207 WKQDNVETEATMLIPVPSPICGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTIN 259

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSD 414
            ++  +D     +      LL    G L +L    T    G  V+ + + +     +   
Sbjct: 260 CYA-RIDEKGLRY------LLGNMDGQLYMLFLGTTETSKGITVKDIKVEQLGEISIPEC 312

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFT 438
           IT + N   ++GSR GDS LV+ +
Sbjct: 313 ITYLDNGFLYIGSRHGDSQLVRLS 336


>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
          Length = 1088

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 109/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPMYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+ G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    +   WS  NL + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S +A  A+ +            + ++   V++D +          LL    G 
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+         
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA      + S  + L+  VN   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++  KSS     
Sbjct: 373 AFKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                   D+ +  +L++S  + T VL
Sbjct: 409 -------IDEAFDTFLVVSFISETRVL 428


>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
 gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
          Length = 1140

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 64/263 (24%), Positives = 107/263 (40%), Gaps = 49/263 (18%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           G +  +DP+ R  G+ +Y     I+   +  S L                       NLR
Sbjct: 119 GFIAAIDPKARVIGMCLYQGLFTIIPLDKDASEL--------------------KATNLR 158

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
            +D   V D  F+HG   P ++++H+      GR    H   +     I          W
Sbjct: 159 -MDELTVYDVEFLHGCQNPTVIVIHKDN---DGRHVKSHEINLRDKEFIKVA-------W 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  L+ VPS IGGV+V+G  +I YH  S       N +AV+  + ++   + 
Sbjct: 208 KQDNVETEATMLIPVPSSIGGVIVIGRESIVYHDGS-------NYHAVAPLTFRQSTINC 260

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDI 415
           ++  +D+    +      LL    G L +L    T    G  V+ + + +     +   I
Sbjct: 261 YA-RVDSKGLRY------LLGNMDGQLYMLFLGTTETSKGTTVKDIKVEQLGEISIPECI 313

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT 438
           T + N   ++GSR GDS LV+ +
Sbjct: 314 TYLDNGFLYIGSRHGDSQLVRLS 336


>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
           queenslandica]
          Length = 1142

 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 104/447 (23%), Positives = 165/447 (36%), Gaps = 92/447 (20%)

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           I L DL   ++ D  F+HG   P +  + E      GRV        +    IS   K+ 
Sbjct: 156 IRLEDL---YITDIQFLHGTENPTIAYISEEPSVATGRV--------LKTFVISQRDKEL 204

Query: 296 -PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQE 354
            P  W    +   A  L +VPSP  G++VVGA+++ Y           N+ + ++D    
Sbjct: 205 LPGPWKPNTIEGQASLLCSVPSPYNGLIVVGADSVAY----------FNDTSHTVDPIV- 253

Query: 355 LPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV------VQRLDLSKTNP 408
           +  S  S      H+ +L  D        G L+ L + +   +      +  + L     
Sbjct: 254 IKESVISCIEPLDHSRYLLGDFR------GRLLTLFLEFSEEMESGMTNIVNMKLEVLGE 307

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
             +   ++ + N + F+GS  GDS LV+         LSS   E  G I           
Sbjct: 308 ISIPHTLSYLDNGVVFVGSTKGDSQLVK---------LSSSPLENGGYI----------- 347

Query: 469 RSSSDALQDMVN-GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
               D L+ M N G  L +    S      Q          L   G L+    G+ IN  
Sbjct: 348 ----DVLESMTNIGPILDM----SVVDLDKQGRDVLVCCSGLGKDGALRIVKSGIGINEA 399

Query: 528 ASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEART 587
           AS            ++LPG KGIW++             + A  +DE    ++++   +T
Sbjct: 400 AS------------IDLPGIKGIWSL-------------KCAGREDELDDTVVLTFVGQT 434

Query: 588 MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGP 647
           M L  A    E TE        +T    N+ G   +IQ+  +  R++D   M     + P
Sbjct: 435 MALRLAGEEVEETELPALVTDQQTFYCSNVTG-NAIIQITTKSVRLMDDKAMELICDWSP 493

Query: 648 SNSE--SGSGSENSTVLSVSIADPYVL 672
            +    S +   +S V+     D Y L
Sbjct: 494 PDGRGISTAACNSSQVMVAVGCDLYYL 520


>gi|342186481|emb|CCC95967.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 1456

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 130/334 (38%), Gaps = 77/334 (23%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
           +++V+D  FV    EP++ +L ER  TWAGRV    W+      + LS       IS  L
Sbjct: 254 LRYVRDLQFVGSSGEPLLGVLCERRPTWAGRVKLVEWRTKAVDTNTLSMQVAWVQISGAL 313

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHYHSQSASCALALNNYA 346
             HP   L+    ++P++   ++ V S      GV+  G NT+ + +      +  N+  
Sbjct: 314 TTHPKLLLVGEVDSVPYNVTHMIPVESSSQTPSGVICFGINTVMHITTKRGYGVYFNSTG 373

Query: 347 V---------------------SLDSSQELPRSSFSVELDAAHATWLQN------DVALL 379
           +                      L+SS  L R +FS  L    AT           +  +
Sbjct: 374 MEECGSNKSSAMSYGKMSWCDAKLESSTALFRVNFS--LANCTATIFSPRSSDSLQILAV 431

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
           S + G + +L  +  G  V  + +S        S +T I ++LFFLGS       V F+C
Sbjct: 432 SEEDGVVAVLEFLSQGANVHDIQISVLASGCYCSSLTPISDNLFFLGSA------VSFSC 485

Query: 440 GSGTSMLSSGLKEEFGDIE--------------------ADAPSTKRLRRSSSDALQDMV 479
            +  +  +SG   +F  +E                    AD  S  R  +S+S  L+D  
Sbjct: 486 IASITPTNSGAIGKFKVVESIEAIGSIRDVDVVDCSNDAADCISGPRGNQSNSSWLEDTP 545

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIG 513
             E       A N T       S A R +++++ 
Sbjct: 546 FAE------LAGNTTLDPMPNLSVAQRRAIMDLA 573


>gi|383863765|ref|XP_003707350.1| PREDICTED: DNA damage-binding protein 1-like [Megachile rotundata]
          Length = 1138

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 95/205 (46%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   I W
Sbjct: 162 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKIPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 211 RQDNVEREATMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 260

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DG-RVVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +      DG +VV+ L +       + 
Sbjct: 261 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKNPDGTQVVKDLKVELLGEISIP 314

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + F+GSRLGDS L++ 
Sbjct: 315 ECITYLDNGVIFVGSRLGDSQLIKL 339


>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1156

 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 90/401 (22%), Positives = 154/401 (38%), Gaps = 81/401 (20%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
           V+D  F+HG   P +V+LH+         S   H       +   +LK    +   W   
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHM-----KTYEVSLKDKEFVKGPWKQD 210

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  +A  ++AVP P  G L++G  +I YH+         + Y V    +  L R S  V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259

Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
              ++DA  + +L  D+A      G L +L +  + ++     V+ L L       +   
Sbjct: 260 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 313

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + ++GSRLGDS L++             + E F ++                 
Sbjct: 314 ITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFTNL---------------GP 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           + DM                +    T S A ++     G L+    G+ I+  AS     
Sbjct: 359 IVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS----- 401

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +        +    R      E    L++S   +T VL  + 
Sbjct: 402 -------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAERDNTLVLSFVRQTRVLMLSG 454

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
              E TE   +    +T   GN+   +++IQV     R++D
Sbjct: 455 EEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 494


>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1067

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 100/453 (22%), Positives = 178/453 (39%), Gaps = 108/453 (23%)

Query: 147 FDDSIHGLRITSMHCF----ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI 202
            D  I+G RI ++  F    E+ ++L +   R  F    +++ DP+             +
Sbjct: 54  LDVPIYG-RIATLELFRPHGEAQDFLFIATERYKFC---VLQWDPES----------SEL 99

Query: 203 ILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVI 262
           I +A    S  +G     G    F  + +     N+R L+   V D  F+ G  +P + +
Sbjct: 100 ITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLFGCAKPTIAV 158

Query: 263 LHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIG 319
           L++           +H        +   +LK    +   WS  +L + A  L+ VP P+ 
Sbjct: 159 LYQ------DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLC 206

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVL++G  TI Y S SA  A+ +            + ++   V++D +          LL
Sbjct: 207 GVLIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLL 249

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
               G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+   
Sbjct: 250 GDHAGMIHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-- 307

Query: 440 GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK 499
                           ++  DA      + S  + L+  +N   +  +       +   +
Sbjct: 308 ----------------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQ 345

Query: 500 --TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
             T S A +D     G L+    G+ IN  AS            VEL G KG+W++  KS
Sbjct: 346 VVTCSGAFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KS 386

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
           S             D+ +  +L++S  + T +L
Sbjct: 387 S------------IDEAFDTFLVVSFISETRIL 407


>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
          Length = 1068

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 88/391 (22%), Positives = 153/391 (39%), Gaps = 84/391 (21%)

Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMV 261
           +I +A    S  +G     G    F  + +     N+R L+   V D  F++G  +P +V
Sbjct: 99  VITRAMGDVSDRIGRPTDNGQVIPFDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIV 157

Query: 262 ILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGV 321
           +L++           +H      AL       + P  W+  NL + A  L+ VP P+ GV
Sbjct: 158 VLYQ------DNKDARHVKTYEVALK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGV 208

Query: 322 LVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLST 381
           L++G  TI Y S SA  A+ +            + ++   V+ D +          LL  
Sbjct: 209 LIIGEETIVYCSASAFKAIPIR---------PSITKAYGRVDADGSR--------YLLGD 251

Query: 382 KTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
             G L LL + ++   V  L +     + + S I+ + N+  ++GS  GDS L++     
Sbjct: 252 HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKI---- 307

Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK-- 499
                          ++ DA      + S  + L+  VN   +  +       +   +  
Sbjct: 308 --------------HLQPDA------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVV 347

Query: 500 TFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSR 559
           T S A +D     G L+    G+ IN  AS            VEL G KG+W++      
Sbjct: 348 TCSGAYKD-----GSLRIVRNGIGINEQAS------------VELQGIKGMWSL------ 384

Query: 560 GHNADSSRMAAYDDEYHAYLIISLEARTMVL 590
                    ++ DD +  +L++S  + T +L
Sbjct: 385 --------RSSTDDPHDTFLVVSFISETRIL 407


>gi|50288865|ref|XP_446862.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74609915|sp|Q6FSD2.1|CFT1_CANGA RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|49526171|emb|CAG59795.1| unnamed protein product [Candida glabrata]
          Length = 1361

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 132/674 (19%), Positives = 265/674 (39%), Gaps = 122/674 (18%)

Query: 96  ISAASLELVCHYRLHGNVESLAIL---SQGGADNSRRRDSIILAFEDAKISVLEFDDSIH 152
           I +  L L+  ++L G +  +A++   S G   N      ++L+   AK+S+L +++   
Sbjct: 43  IRSGRLYLMEEHKLSGRINDVALIPKHSNGSNGNGINLSYLLLSTGVAKLSLLMYNNMTS 102

Query: 153 GLRITSMHC----FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
            +   S+H     FES   L L       AR   ++++P G     +++   ++ +    
Sbjct: 103 SIETISLHFYEDKFESATMLDL-------ARNSQLRIEPNGNYA--MLFNNDVLAILPFY 153

Query: 209 GGSGLVGDED----------------TFGSGGGFSARIESSH---VINLRDL--DMKHVK 247
            G     DED                 F    G +   + +H   +IN  +L   +K++K
Sbjct: 154 TGINEDEDEDYINNDKSKINDNSKKSLFKRKKGKTQNNKVTHPSIIINCSELGPQIKNIK 213

Query: 248 DFIFVHGYIEPVMVILHERELTWAGR---VSWKHHTCMIS---ALSISTTLKQHPLIWSA 301
           D  F+ G+ +  + +L++ +L W G    V    +  +IS     SI  T     +I   
Sbjct: 214 DIQFLCGFTKSTIGVLYQPQLAWCGNSQLVPLPTNYAIISLDMKFSIDATTFDKAIISEI 273

Query: 302 MNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYAVS-LDSSQELPRS 358
             LP D +    +   + G L++G N I +   +      L LN+Y+   L   + + +S
Sbjct: 274 SQLPSDWH---TIAPTLSGSLILGVNEIAFLDNTGVLQSILTLNSYSDKVLPKVRVIDKS 330

Query: 359 SFSVELDAAHATWL----QNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLS------ 404
           S  V  +      L    +N+ +    LL  + G +  + +  +GR++ + +++      
Sbjct: 331 SHEVFFNTGSKFALIPSNENERSVENILLFDENGCIFNVDLKSEGRLLTQFNITKLPLGE 390

Query: 405 -----KTNP---SVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFG 455
                K+NP   S++ +D   +     F+G + GD+ +++     S   +      +++ 
Sbjct: 391 DVLSQKSNPSSVSIIWAD-GRLDTYTIFIGFQSGDATMLKLNHLHSAIEVEEPTFMKDYV 449

Query: 456 DIEADAPSTKRLRRSS-------SDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRD 507
           + +A A                 SD   D VN +    +G+  SN   +AQ+        
Sbjct: 450 NKQASAAYNNEDDDDDDDDFNLYSDEENDQVNNKNDRTFGTNESNEPFTAQELM------ 503

Query: 508 SLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR 567
            L NIGP+     G   + + +  G+   +  E+        + T  +      NA  + 
Sbjct: 504 ELRNIGPINSMCVGKVSSIEDNVKGLPNPNKQEI------SIVCTSGYGDGSHLNAILAS 557

Query: 568 MAAYDDEYHAYLIIS------LEARTMVLETADL------LTEVTESVDYFVQGR----- 610
           +    ++   ++ I+      ++ +   L T D       + E+  +     QGR     
Sbjct: 558 VQPRVEKALKFISITKIWNLHIKGKDKFLITTDSTQSQSNIYEIDNNFSQHKQGRLRRDA 617

Query: 611 -TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP 669
            TI    +   +R++QV      + D ++               +   +  V+ VS+ DP
Sbjct: 618 TTIHIATIGDNKRIVQVTTNHLYLYDLTF-----------RRFSTIKFDYEVVHVSVMDP 666

Query: 670 YVLLGMSDGSIRLL 683
           YVL+ +S G I++ 
Sbjct: 667 YVLITLSRGDIKVF 680


>gi|407850337|gb|EKG04765.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 1436

 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 61/261 (23%), Positives = 107/261 (40%), Gaps = 54/261 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIESSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFSQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
           NN  +                   D   E   + F V L  A+ T     + N+   + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFKVNLSLANCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S I  +G+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTDCYCSGIARLGDQIVFLGSACGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIEA 459
           C +   M  S + + F  IE+
Sbjct: 480 CIAKVDMFHSDVAKRFRIIES 500


>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp.
           lyrata]
          Length = 1088

 Score = 57.0 bits (136), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDA 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+ G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    +   WS  NL + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S +A  A+ +            + ++   V++D +          LL    G 
Sbjct: 234 EETIVYCSANAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGL 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+         
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA      + S  + L+  +N   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++  KSS     
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                   D+ +  +L++S  + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428


>gi|307186138|gb|EFN71863.1| DNA damage-binding protein 1 [Camponotus floridanus]
          Length = 1136

 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    I+   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCTNPTLILIHQD-------INGRH----VKTHEINLREKEFSKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +  + +     VV+ L +       + 
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLELEKKPDGTQVVKDLKVELLGEISIP 311

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + ++GSRLGDS L++ 
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336


>gi|358338734|dbj|GAA31211.2| DNA damage-binding protein 1, partial [Clonorchis sinensis]
          Length = 1515

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 64/266 (24%), Positives = 111/266 (41%), Gaps = 37/266 (13%)

Query: 183 VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGD--EDTFGSGGGFSARIESSHVINLRD 240
           V VDP   C  V +Y   + I+  +  G  L  D  E    +   ++ RIE  +++    
Sbjct: 101 VLVDPGANCVVVRLYHGLLRIIPLNGIGEKLTTDSLEVNQYAANTYNVRIEEGNIV---- 156

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
                  D  F+HGY  P   +++E EL          H            L+   L   
Sbjct: 157 -------DMAFLHGYTLPTFAMIYEDELVL--------HMKTYEISGREPALRNVQLTLD 201

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
           ++    D+  L+ VP P GGV++VG N I+YH++       ++ Y     +SQ L  ++ 
Sbjct: 202 SIE--PDSKLLIPVPKPFGGVILVGDNIIYYHTKDGP---HISQYIPQAKASQVLCYAAV 256

Query: 361 SVEL----DAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQ-RLDLSKTNPSVL 411
             +     D A   ++ + +A   T +G+ +L +     V   R+   R++L     +  
Sbjct: 257 DAQRYLLGDMAGRLYMVHLLAEDHTPSGNGLLGSTSSAAVPSARIGSIRIEL--LGETAT 314

Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQF 437
              I  + N + F+G  LGDS L++ 
Sbjct: 315 PESIAYVDNGVVFIGCTLGDSQLIRL 340


>gi|340714589|ref|XP_003395809.1| PREDICTED: DNA damage-binding protein 1-like [Bombus terrestris]
          Length = 1141

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 88/420 (20%), Positives = 160/420 (38%), Gaps = 86/420 (20%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           ++   V+D  F+HG   P ++++H+        ++ +H    +    IS   K+   + W
Sbjct: 162 MEEHQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEISLRDKEFVKVPW 210

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH          N Y   +    +    +
Sbjct: 211 RQDNVEREAMIVIPVPSPICGAIIIGQESILYHDG--------NTYVAVVPPIIKQSTIT 262

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L V  + +     VV+ L +       +   
Sbjct: 263 CYAKVDNQGLRYLLGDMA------GHLFMLFVEQEKKPDGTQVVKDLKVELLGEISIPEC 316

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           IT + N + F+GSR GDS LV+                     +AD   +  +   +   
Sbjct: 317 ITYLDNGVIFVGSRFGDSQLVKLIT------------------KADENGSYCVPMETFTN 358

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +++   + L        +    T S A ++     G L+    G+ I   AS     
Sbjct: 359 LAPIIDMAVVDL----ERQGQGQMVTCSGAFKE-----GSLRIIRNGIGIEEHAS----- 404

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KG+W +      G N D++            L++S   +T +L    
Sbjct: 405 -------IDLPGIKGMWAL---KVGGGNFDNT------------LVLSFVGQTRILTLNG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              E T+   +    +T   GN+      IQ+    AR++     T    + P N  + S
Sbjct: 443 EEVEETDIPGFVADEQTFHTGNV-TNDLFIQITPTSARLISHETKTVVSEWEPENKRTIS 501


>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
 gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
          Length = 1140

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 92/207 (44%), Gaps = 33/207 (15%)

Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
           NLR +D  +V D  F+HG + P ++++H+      GR    H         I+   K+  
Sbjct: 156 NLR-MDELNVYDVEFLHGCLNPTVIVIHKDN---DGRHVKSHE--------INLRDKEFM 203

Query: 297 LI-WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
            I W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+       
Sbjct: 204 KIAWKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA------- 249

Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSV 410
              +F       +A    N +  LL    G L +L +       G  V+ + + +     
Sbjct: 250 -PLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEIS 308

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQF 437
           +   IT + N   ++G+R GDS LV+ 
Sbjct: 309 IPECITYLDNGFLYIGARHGDSQLVRL 335


>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
          Length = 1089

 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  W+  NL + A  L+ VP P+ GVL++G  TI Y S SA  A+ +   
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL    G L LL + ++   V  L +  
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                    ++ DA    
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421

Query: 584 EARTMVL 590
            + T +L
Sbjct: 422 ISETRIL 428


>gi|297740793|emb|CBI30975.3| unnamed protein product [Vitis vinifera]
          Length = 1043

 Score = 56.6 bits (135), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 83/367 (22%), Positives = 145/367 (39%), Gaps = 84/367 (22%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F  + +     N+R L+   V D  F++G  +P +V+L++           +H      A
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCSKPTIVVLYQ------DNKDARHVKTYEVA 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           L       + P  W+  NL + A  L+ VP P+ GVL++G  TI Y S SA  A+ +   
Sbjct: 197 LK-DKDFVEGP--WAQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPIR-- 251

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                    + ++   V+ D +          LL    G L LL + ++   V  L +  
Sbjct: 252 -------PSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 296

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + N+  ++GS  GDS L++                    ++ DA    
Sbjct: 297 LGETSIASTISYLDNAFVYVGSSYGDSQLIKI------------------HLQPDA---- 334

Query: 466 RLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSFAVRDSLVNIGPLKDFSYGLR 523
             + S  + L+  VN   +  +       +   +  T S A +D     G L+    G+ 
Sbjct: 335 --KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD-----GSLRIVRNGIG 387

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
           IN  AS            VEL G KG+W++               ++ DD +  +L++S 
Sbjct: 388 INEQAS------------VELQGIKGMWSL--------------RSSTDDPHDTFLVVSF 421

Query: 584 EARTMVL 590
            + T +L
Sbjct: 422 ISETRIL 428


>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName:
           Full=UV-damaged DNA-binding protein 1a; Short=DDB1a
 gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
 gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
 gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1088

 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 107/507 (21%), Positives = 199/507 (39%), Gaps = 123/507 (24%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ +  ++   L+ +    ++G + +L +    G      +D + +A E  K  VL++D 
Sbjct: 39  RIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDP 94

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQ 208
               L   +M           + GR +   G +  +DP  R  G+ +Y GL  +I   ++
Sbjct: 95  ESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK 147

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
           G                F+ R+E   V++++           F+ G  +P + +L++   
Sbjct: 148 GQLK-----------EAFNIRLEELQVLDIK-----------FLFGCAKPTIAVLYQ--- 182

Query: 269 TWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                   +H        +   +LK    +   WS  +L + A  L+ VP P+ GVL++G
Sbjct: 183 ---DNKDARH------VKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIG 233

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
             TI Y S SA  A+ +            + ++   V++D +          LL    G 
Sbjct: 234 EETIVYCSASAFKAIPIR---------PSITKAYGRVDVDGSR--------YLLGDHAGM 276

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+         
Sbjct: 277 IHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKL-------- 328

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQK--TFSF 503
                     ++  DA      + S  + L+  +N   +  +       +   +  T S 
Sbjct: 329 ----------NLHPDA------KGSYVEVLERYINLGPIVDFCVVDLERQGQGQVVTCSG 372

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNA 563
           A +D     G L+    G+ IN  AS            VEL G KG+W++  KSS     
Sbjct: 373 AFKD-----GSLRVVRNGIGINEQAS------------VELQGIKGMWSL--KSS----- 408

Query: 564 DSSRMAAYDDEYHAYLIISLEARTMVL 590
                   D+ +  +L++S  + T +L
Sbjct: 409 -------IDEAFDTFLVVSFISETRIL 428


>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
          Length = 1158

 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 89/395 (22%), Positives = 149/395 (37%), Gaps = 83/395 (21%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-WSAMNL 304
           ++D  F++G   P +VI+H+             H   +    IS   K+     W   N+
Sbjct: 164 IQDIAFLYGCANPTVVIIHQ-----------DAHGRHVKTREISLRDKEFAKTSWKQDNV 212

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVEL 364
             +A  LL VP P GG L++G  +I YH+          NY        +    +   ++
Sbjct: 213 ETEAAMLLPVPEPYGGALIIGQESITYHNG--------QNYVTIAPPIIKQSTVTCYGKV 264

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDITTIG 419
           D   + +L  D+A      G L +L +      DG V V+ + +       +   +T + 
Sbjct: 265 DPNGSRYLLGDLA------GHLFMLVLEKEEKMDGTVTVRDIKIELLGEVSIPECLTYLD 318

Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
           N + F+GSR GDS LV+       +     + E F ++   AP            + DM 
Sbjct: 319 NGVVFIGSRFGDSQLVKLNVTPDDNNSYVTVMETFTNL---AP------------IVDMT 363

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
                          +    T S A ++     G L+    G+ I+  AS          
Sbjct: 364 -------IVDLDRQGQGQLVTCSGAYKE-----GSLRIIRNGIGIHEQAS---------- 401

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
             ++LPG KGIW +   SS   + D +            +++S   +T VL       E 
Sbjct: 402 --IDLPGIKGIWALKMGSSGNPSVDDT------------VVLSFVGQTRVLMLNGEEMEE 447

Query: 600 TESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
           TE        +T   GN+ G+  V+Q+     R++
Sbjct: 448 TEIPGLTADQQTFFCGNV-GKDSVLQITTGSVRLI 481


>gi|332030156|gb|EGI69950.1| DNA damage-binding protein 1 [Acromyrmex echinatior]
          Length = 1138

 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 94/205 (45%), Gaps = 35/205 (17%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    I+   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFAKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSPI G +++G  +I YH  +   A+              + +S+
Sbjct: 208 RQDNVEREAMMVIPVPSPICGAIIIGQESILYHDGTTYVAVV----------PPIIKQST 257

Query: 360 FS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-----VVQRLDLSKTNPSVLT 412
            +   ++D     +L  D+A      G L +L +  + +     VV+ L +       + 
Sbjct: 258 ITCYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGSQVVKDLKVELLGEISIP 311

Query: 413 SDITTIGNSLFFLGSRLGDSLLVQF 437
             IT + N + ++GSRLGDS L++ 
Sbjct: 312 ECITYLDNGVIYVGSRLGDSQLIKL 336


>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
 gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
          Length = 1148

 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 94/403 (23%), Positives = 155/403 (38%), Gaps = 95/403 (23%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
           V+D  F+HG   P +V+LH+         S   H   +    IS  LK    +   W   
Sbjct: 166 VQDMEFLHGCKTPTIVLLHQD--------SQARH---MKTYEIS--LKDKEFVKGPWKQD 212

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  +A  ++AVP P      +G  +I YH+      +           +  L R S  V
Sbjct: 213 HVESEATIVIAVPEPFCDARCIGQESITYHNGDQDVVI-----------TPHLIRQSTIV 261

Query: 363 ---ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSD 414
              ++DA  + +L  D+A      G L +L +  + ++     V+ L L       +   
Sbjct: 262 CYGKVDANGSRYLLGDMA------GRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAEC 315

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           +T + N + ++GSRLGDS L++         L+S   E+   +E     T          
Sbjct: 316 MTYLDNGVVYVGSRLGDSQLIK---------LNSERNEQGSYVEVMEVFTN--------- 357

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
           L  +V+   + L         +    F           G L+    G+ I+  AS     
Sbjct: 358 LGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS----- 403

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ++LPG KGIW +        N DSSR           L++S   +T VL  + 
Sbjct: 404 -------IDLPGIKGIWPLR------VNTDSSR--------DNTLVLSFVGQTRVLMLSG 442

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
              E TE   + +  +T   GN+    ++IQV     R++DG 
Sbjct: 443 EEVEETELAGFDISQQTFFCGNV-RNNQLIQVTAAAVRLVDGK 484


>gi|390342012|ref|XP_793599.3| PREDICTED: uncharacterized protein LOC588842 [Strongylocentrotus
           purpuratus]
          Length = 1161

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 68/281 (24%), Positives = 111/281 (39%), Gaps = 46/281 (16%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           GP+  +DP+ R  G+ +Y     I+   +    L            F+ R+E  +VI+++
Sbjct: 50  GPIGIIDPECRMIGLRLYDGLFKIIPLDRDNKEL----------KAFNIRLEELNVIDVQ 99

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
                      F++G  +P +V LH+      GR     H              + P  W
Sbjct: 100 -----------FLYGCHQPTIVFLHQDP---HGR-----HVKTYEVNLREKEFNRGP--W 138

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++AVP P GG L++G  +I YH      A+A             +  S+
Sbjct: 139 KQDNVETEATMVIAVPQPYGGALIIGQESITYHKGDNYVAIA----------PPTIKNST 188

Query: 360 FSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDIT 416
                 LD   + +L  D  L       L+      DG   V+ L L     + +   +T
Sbjct: 189 LVCYGRLDNNGSRYLLGD--LTGRLFLLLLDKEESMDGAATVKDLKLEFLGETSIAECLT 246

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
            + N + F+GSRLGDS LV+    S  S     + E F ++
Sbjct: 247 YLDNGVVFIGSRLGDSQLVRLNTESDESGSYVTMMETFTNL 287


>gi|366994686|ref|XP_003677107.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
 gi|342302975|emb|CCC70752.1| hypothetical protein NCAS_0F02680 [Naumovozyma castellii CBS 4309]
          Length = 1340

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 83/386 (21%), Positives = 165/386 (42%), Gaps = 64/386 (16%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L LV  + L+  +  +A++ Q  +  S     +++A   AKIS++ FD   + L   S+H
Sbjct: 48  LNLVEEFNLNAKITDIALIPQEKSPLS----CLVIASGVAKISIVRFDAVTNSLETLSLH 103

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--------- 211
            +E            + A+   ++VDP  R   +L++    I L     G+         
Sbjct: 104 YYEDKLS---DISLVTLAKTSKLRVDPMNR--ALLLFNNDSIALLPLFSGNHEDEDEDDE 158

Query: 212 ----GLVGDEDTFGSGGGFSARIESSHVINLRDL--DMKHVKDFIFVHGYIEPVMVILHE 265
                +   E T          +  S + ++++L  ++++V D  F++ + +P + +L++
Sbjct: 159 EDDYDVTRGEVTTKRSKKNEKHVGQSKIFHVKELHQELQNVLDIQFLNDFTKPTLAVLYQ 218

Query: 266 RELTWAGRVSWKHH--TCMISALSIST----TLKQHPLIWSAMNLPHDAYKLLAVPSPIG 319
            +LTW G         + MI  L++ T    T     +I +  +L  D ++LL +     
Sbjct: 219 PKLTWVGNTELNPQPTSFMIFTLNLRTNELETAFDVVIIATLHDLSWDWFQLLPISR--- 275

Query: 320 GVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSFSVELDA---AHATW--- 371
           G +V+G N + Y   +      + LN++A   D S +  R     EL+       T+   
Sbjct: 276 GCVVMGNNEMAYIDNTGVLQSIIHLNSFA---DKSLQRARIIDETELEVFFNEKVTYFWS 332

Query: 372 -------LQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK-----------TNPS-VLT 412
                  + ++  L+   + +L  + +  +GR++ + DL K           +NP+ V  
Sbjct: 333 ASTDKKNIDDETLLIIDASANLYYVRLEAEGRLLTKFDLIKLPIVNDALKDTSNPTCVAR 392

Query: 413 SDITTIGNSL-FFLGSRLGDSLLVQF 437
            D  +  +S+  F+G   GDSL+V+ 
Sbjct: 393 VDPNSSNSSMDLFIGYLSGDSLVVRL 418


>gi|45184764|ref|NP_982482.1| AAL060Wp [Ashbya gossypii ATCC 10895]
 gi|74695871|sp|Q75EY8.1|CFT1_ASHGO RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two
           protein 1
 gi|44980110|gb|AAS50306.1| AAL060Wp [Ashbya gossypii ATCC 10895]
 gi|374105681|gb|AEY94592.1| FAAL060Wp [Ashbya gossypii FDAG1]
          Length = 1305

 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 120/624 (19%), Positives = 248/624 (39%), Gaps = 124/624 (19%)

Query: 140 AKISVLEFDDSIHGLRITSMHCFESP--EWLHLKRGRESFARGPLVKVDPQGRCGGVLVY 197
            ++S++ FD     L   S+H +++   E   L  G       P ++ +P  RC  +LV+
Sbjct: 82  GRVSIVRFDAENQTLETESLHYYDAKFEELSALTVGA-----APRLEQEPAARC--LLVH 134

Query: 198 GLQMIILKASQGGSGLV-------------GDEDTFGSGGGFSARIESSHVINLRDLDMK 244
               + +   +G                     D  G   G S  + +SH+ +    D+K
Sbjct: 135 NGDCLAVLPLRGHEEEGEEAEEEEEHPAKRARTDADGRLVGASTVMPASHLHS----DIK 190

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
           +VKD  F+ G  +  + +L++ +L+W G       T     LS+    ++  +I     L
Sbjct: 191 NVKDMRFLRGLNKSAVGVLYQPQLSWCGNEKLTRQTMKFIILSLDLDDEKSTVINMLQGL 250

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC--ALALNNYAVS-----------LDS 351
           P+  + ++ + +   G ++ G N + Y   + +   A++LN ++ S           L +
Sbjct: 251 PNTLHTIIPLSN---GCVLAGVNELLYVDNTGALQGAISLNAFSNSGLNTRIQDNSKLQA 307

Query: 352 SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRL---------D 402
             E P   F+ + +         D+ LL  +   +  + +  +GR++            +
Sbjct: 308 FFEQPLCYFATQSNG-------RDILLLMDEKARMYNVIIEAEGRLLTTFNCVQLPIVNE 360

Query: 403 LSKTN--PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
           + K N  P+ +  ++     SL F+G + GD++ V+       + L S L+         
Sbjct: 361 IFKRNMMPTSICGNMNLETGSL-FIGFQSGDAMHVRL------NNLKSSLEH-------- 405

Query: 461 APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKT------FSFAVRDSLVNIGP 514
                  + + S+ L+   + + + LYG   NN E  +K       F     D L+NIGP
Sbjct: 406 -------KGTVSETLE--TDEDYMELYG---NNAEKEKKNLETESPFDIECLDRLLNIGP 453

Query: 515 LKDFSYGLRINADASATGISKQSNYELVELP----GCKGIWTVYHKSSRGHNADSSRMAA 570
           +   + G   + + +   ++  +  EL  +     G     T+   +       + +  +
Sbjct: 454 VTSLAVGKASSIEHTVAKLANPNKDELSIVATSGNGTGSHLTILENTIVPTVQQALKFIS 513

Query: 571 YDDEYH-------AYLIISLEARTMV-LETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
               ++        YL+ +  ++T   + + D   +  ++ D+     T++     G +R
Sbjct: 514 VTQIWNLKIKGKDKYLVTTDSSQTRSDIYSIDRDFKPFKAADFRKNDTTVSTAVTGGGKR 573

Query: 623 VIQVFERGARILDGSY---MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGS 679
           ++QV  +G  + D ++   MT +  F               V+ V I DP++LL  S G 
Sbjct: 574 IVQVTSKGVHLFDINFKRMMTMNFDF--------------EVVHVCIKDPFLLLTNSKGD 619

Query: 680 IRLLVGDPSTCTVSVQT--PAAIE 701
           I++   +P      V+T  P A++
Sbjct: 620 IKIYELEPKHKKKFVKTVLPDALK 643


>gi|320163506|gb|EFW40405.1| UV-damaged DNA binding protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1123

 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 75/318 (23%), Positives = 127/318 (39%), Gaps = 69/318 (21%)

Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
           N+R L+   V D  F+ GY  P +++L++             H      L       + P
Sbjct: 208 NIR-LEELQVFDIKFLRGYDRPTILVLYQD-------TKETRHVKTYQVLLKEKEFAEGP 259

Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
             W+  N+   A  L+ V  P+GGVL+VG  TI YHS SA  ++A+    +         
Sbjct: 260 --WAQNNVEGGASLLIPVLMPLGGVLIVGEQTITYHSGSAFRSVAMRPAII--------- 308

Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
              +SV         +  +  LL+   G+L+ + + +D +  V  + + +   + + S +
Sbjct: 309 -KCYSV---------IDTNRFLLADSEGNLLSVLLTHDRQDKVTAIKIDRLGVTSILSCL 358

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDAL 475
           T + N + F GS+ GDS L++              ++E G       S   L      A+
Sbjct: 359 TYLDNGVVFGGSQFGDSQLLRLATE----------RDETGSFVRVLESFSNLGPICDMAV 408

Query: 476 QDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISK 535
            D+                +    T S A +D     G L+    G+         GI +
Sbjct: 409 VDL------------ERQGQCQVVTCSGAFKD-----GSLRVVRNGV---------GIEE 442

Query: 536 QSNYELVELPGCKGIWTV 553
           Q+    +ELPG KGIW++
Sbjct: 443 QAT---IELPGIKGIWSL 457


>gi|71413926|ref|XP_809084.1| cleavage and polyadenylation specificity factor-like protein
           [Trypanosoma cruzi strain CL Brener]
 gi|70873410|gb|EAN87233.1| cleavage and polyadenylation specificity factor-like protein,
           putative [Trypanosoma cruzi]
          Length = 499

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 62/260 (23%), Positives = 105/260 (40%), Gaps = 54/260 (20%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS----------IS 289
           +++V+D  F+    EP++  L ER  TWAGRV    W+        LS           S
Sbjct: 250 IRYVRDMQFIDSSGEPIVAFLCERHPTWAGRVKLVEWRTKAVESKMLSSQIVWVQISAAS 309

Query: 290 TTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-------GVLVVGANTIHYHSQSASCALAL 342
           T+ ++  LI    ++P++   +    +P+G       GV+  G NT+ + +      + L
Sbjct: 310 TSNRKLLLIGEVDDVPYNVTHM----TPVGPFAQIPSGVICYGINTVMHVTTKRGYGVYL 365

Query: 343 NNYAVS-----------------LDSSQELPRSSFSVELDAAHATW----LQND---VAL 378
           NN  +                   D   E   + F V L  A+ T     + N+   + +
Sbjct: 366 NNGGMEECANSKSSAMSYGKVGWCDPKMEASTALFMVNLSLANCTASFMSIVNEMLHLLV 425

Query: 379 LSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
           +S + G ++ L++      VQ + ++        S I  IG+ + FLGS  GDS      
Sbjct: 426 VSEEDGVVLTLSITAQSSSVQGIRIAILGTGCYCSGIARIGDQIVFLGSACGDS------ 479

Query: 439 CGSGTSMLSSGLKEEFGDIE 458
           C +   M  S   + F  IE
Sbjct: 480 CIAKVDMFHSDAAKRFQIIE 499


>gi|259155222|ref|NP_001158852.1| DNA damage-binding protein 1 [Salmo salar]
 gi|223647700|gb|ACN10608.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 1139

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 77/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG VV + L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S  S     + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+   G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479


>gi|190345965|gb|EDK37945.2| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 1206

 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 123/611 (20%), Positives = 236/611 (38%), Gaps = 102/611 (16%)

Query: 98  AASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRIT 157
           +  ++ +CH ++ G ++++  + +GG++     D +++  +  ++S+LEFD         
Sbjct: 55  SGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD------- 103

Query: 158 SMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDE 217
            +  F   +  H K G      G  + VDPQ R   +       ++ KA    + L    
Sbjct: 104 ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL---- 159

Query: 218 DTFGSGGGFSARIES----SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGR 273
                    S+ +ES    +  I +  LD           GY  P++  +   E  +A  
Sbjct: 160 -------ELSSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHY 199

Query: 274 VSWKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANT 328
            +   +    S L++     +  L + A     + +P  +  L+ +P+PIGGV+V G++ 
Sbjct: 200 DASLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSF 259

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           I YH+ +    L L     S   S  +P   ++V     H     N   LL  + GD   
Sbjct: 260 IFYHNPTIDQQLYLP--IPSRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFR 312

Query: 389 LTVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSM 445
           + + Y  D   V  L +   +    ++ I        F      D +L Q    G   S 
Sbjct: 313 VLIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSY 372

Query: 446 LSSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
           +SS       D+ + +     + R   + AL  +++       G+    +ES +   +  
Sbjct: 373 ISSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMI 432

Query: 505 VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNAD 564
              S      LK  ++G+ I+               LV  P      +V+          
Sbjct: 433 TGHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF---------- 463

Query: 565 SSRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
           ++R++A + +   Y++IS  A  +T+VL   +++ EV +S   FV  +        G + 
Sbjct: 464 TTRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKS 520

Query: 623 VIQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSD 677
           +IQ++  G R +     +G    +   + P            T++S S     VL+G+S+
Sbjct: 521 LIQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSN 572

Query: 678 GSIRLLVGDPS 688
             +     DP+
Sbjct: 573 RELCYFEIDPT 583


>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1181

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 92/420 (21%), Positives = 156/420 (37%), Gaps = 94/420 (22%)

Query: 246 VKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI---WSAM 302
           V+D  F+HG   P +V+LH+         S   H       +   +LK    +   W   
Sbjct: 164 VQDMEFLHGCKTPTIVLLHQD--------SQARHMK-----TYEVSLKDKEFVKGPWKQD 210

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           ++  +A  ++AVP P  G L++G  +I YH+         + Y V    +  L R S  V
Sbjct: 211 HVESEANLVIAVPEPFCGALIIGQESITYHNG--------DQYVV---ITPHLIRQSTIV 259

Query: 363 ---ELDAAHATWLQNDVA-------------------LLSTKTGDLVLLTVVYDGRV--- 397
              ++DA  + +L  D+A                   LL    G L +L +  + ++   
Sbjct: 260 CYGKVDANGSRYLLGDMAGRLFMLLLEREDKMDGTXYLLGDMAGRLFMLLLEREDKMDGT 319

Query: 398 --VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
             V+ L L       +   IT + N + ++GSRLGDS L++             + E F 
Sbjct: 320 TTVKDLKLEFLGEITIAECITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEVFT 379

Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
           ++                 + DM                +    T S A ++     G L
Sbjct: 380 NL---------------GPIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSL 412

Query: 516 KDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEY 575
           +    G+ I+  AS            ++LPG KG+W +        +    R      E 
Sbjct: 413 RIIRNGIGIHEHAS------------IDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAER 460

Query: 576 HAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILD 635
              L++S   +T VL  +    E TE   +    +T   GN+   +++IQV     R++D
Sbjct: 461 DNTLVLSFVRQTRVLMLSGEEVEETELAGFDTSQQTFFCGNV-RNKQLIQVTAAAVRLVD 519


>gi|223647932|gb|ACN10724.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 1139

 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 77/341 (22%), Positives = 131/341 (38%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG VV + L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLRVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S  S     + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+   G R++
Sbjct: 440 GEEVEETELPGFVDNLQTFYCGNV-AHQQLIQITSGGVRLV 479


>gi|156389050|ref|XP_001634805.1| predicted protein [Nematostella vectensis]
 gi|156221892|gb|EDO42742.1| predicted protein [Nematostella vectensis]
          Length = 1157

 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 95/212 (44%), Gaps = 40/212 (18%)

Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
           N+R L+  HV D  F++G   P +V +++             H   +    I+  L+ H 
Sbjct: 155 NIR-LEELHVVDIQFLYGCANPTIVFIYQDP-----------HGRHVKTYEIN--LRDHE 200

Query: 297 LI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
                W   N+  +A +++AVP+P+GG L++G  +I YH  S   A+A            
Sbjct: 201 FAKGPWKQDNVEVEACRVIAVPNPLGGALIIGQESITYHKGSNYHAIA----------PP 250

Query: 354 ELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKT 406
            L +SS +   ++D   + +L  D+       G L +L +    + DG   V+ L L   
Sbjct: 251 ALKQSSLTCHGKIDTNGSRYLLGDM------NGRLYMLLLERQELIDGTYEVKDLKLEML 304

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
             + +   +  + N + F+GS LGDS L + +
Sbjct: 305 GETSIAHCLVYLDNGVVFIGSMLGDSQLAKLS 336


>gi|19114492|ref|NP_593580.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe 972h-]
 gi|46395602|sp|O13807.1|DDB1_SCHPO RecName: Full=DNA damage-binding protein 1; AltName:
           Full=Damage-specific DNA-binding protein 1
 gi|2330717|emb|CAB11219.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe]
          Length = 1072

 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 95/492 (19%), Positives = 190/492 (38%), Gaps = 97/492 (19%)

Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
           RES   GPL+ VDP  R   + VY   + I+   +     +   +       FS RI+  
Sbjct: 111 RES-QSGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQEL 169

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V+           D   ++    P + +L++   +     ++K              ++
Sbjct: 170 NVV-----------DIAMLYNSSRPSLAVLYKDSKSIVHLSTYK------------INVR 206

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
           +  +    + + HD  +   +PS  GGV V G   ++Y S+    +  L  Y        
Sbjct: 207 EQEIDEDDV-VCHDIEEGKLIPSENGGVFVFGEMYVYYISKDIQVSKLLLTY-------- 257

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
             P ++FS  +     T L + + +++ ++G L     ++    V  ++L K   S + S
Sbjct: 258 --PITAFSPSISNDPETGLDSSIYIVADESGMLYKFKALFTDETVS-MELEKLGESSIAS 314

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +  + ++  F+GS   +S+L+Q                         PS  +      +
Sbjct: 315 CLIALPDNHLFVGSHFNNSVLLQL------------------------PSITK-NNHKLE 349

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            LQ+ VN   +S +    + T S+  T S A +D     G L+     + I         
Sbjct: 350 ILQNFVNIAPISDFIIDDDQTGSSIITCSGAYKD-----GTLRIIRNSINI--------- 395

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
               N  L+E+ G K  ++V            S  A YD+  + +L +  E R +++   
Sbjct: 396 ---ENVALIEMEGIKDFFSV------------SFRANYDN--YIFLSLICETRAIIVSPE 438

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESG 653
            +    + + D   +  TI    ++G  +++Q+  +  R+ DG  +   +S  P +   G
Sbjct: 439 GVF---SANHDLSCEESTIFVSTIYGNSQILQITTKEIRLFDGKKLHSWIS--PMSITCG 493

Query: 654 SGSENSTVLSVS 665
           S   ++  ++V+
Sbjct: 494 SSFADNVCVAVA 505


>gi|325189950|emb|CCA24429.1| splicing factor putative [Albugo laibachii Nc14]
          Length = 1644

 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 73/316 (23%), Positives = 130/316 (41%), Gaps = 47/316 (14%)

Query: 299 WSAMNLPHDAYKLLAVPS----PIGGVLVVGANTIHYHSQS---ASCALALNNYAVSLDS 351
           WS + +P  A KL+AVP     P GGVLV+    I Y +++    SC+  L +     + 
Sbjct: 660 WSQV-VPRSANKLVAVPGGNDGP-GGVLVIAQGLIQYQNENHPPLSCSFPLRSTG-GPNP 716

Query: 352 SQELPRSSFSVELDAAHATWLQNDV--ALLSTKTGDLVLLTVVYDGRVVQRLDLS--KTN 407
            Q+  +  + + +  + AT  Q D+   L+ ++ GDL  +++ Y G  VQ+L +    T 
Sbjct: 717 VQDERKQGYPMMI-VSTATHKQRDLFFVLMQSEWGDLFKISLEYAGSSVQKLRIQYFDTI 775

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL 467
           P  L   IT  G  L F  S   +  L QF         +  +     D E + PS    
Sbjct: 776 PVALALCITKTG--LLFAASEFSNHYLFQFLSIGEDDDAAQCVSAAENDQEPEIPSFSVR 833

Query: 468 RRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD 527
           +  +   + ++ +   ++         E   + ++   +    N   L+   +GL +   
Sbjct: 834 KLKNLAMISNIPSISPITQLLVDDFANEQTPQLYALCGQG---NRSSLRILRHGLPVMEM 890

Query: 528 ASATGISKQSNYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEAR 586
           A++             LPG  K +W +                ++ D    Y+++S E  
Sbjct: 891 AASA------------LPGVAKAVWCLKE--------------SFTDTCDKYIVVSFEDA 924

Query: 587 TMVLETADLLTEVTES 602
           T+VLE  D + E+T+S
Sbjct: 925 TLVLEIGDTVEEITDS 940


>gi|322787057|gb|EFZ13281.1| hypothetical protein SINV_13198 [Solenopsis invicta]
          Length = 986

 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 48/203 (23%), Positives = 92/203 (45%), Gaps = 31/203 (15%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI-W 299
           +D + V+D  F+HG   P ++++H+        ++ +H    +    I+   K+   I W
Sbjct: 159 MDEQQVQDVNFLHGCANPTLILIHQD-------INGRH----VKTHEINLRDKEFSKIPW 207

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
              N+  +A  ++ VPSP+ G +++G  +I YH          N+Y   +    +    +
Sbjct: 208 RQDNVEREAMMVIPVPSPMCGAIIIGQESILYHDG--------NSYVAVVPPIIKQSTIT 259

Query: 360 FSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV-VQRLDLSKTNPSVLTSD 414
              ++D     +L  D+A      G L +L +      DG + V+ L +       +   
Sbjct: 260 CYAKVDNQGLRYLLGDMA------GHLFMLFLEQEKKPDGTLSVKDLKVELLGEISIPEC 313

Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
           IT + N + ++GSRLGDS L++ 
Sbjct: 314 ITYLDNGVIYVGSRLGDSQLIKL 336


>gi|242010743|ref|XP_002426118.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
 gi|212510165|gb|EEB13380.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
          Length = 1148

 Score = 52.8 bits (125), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 60/276 (21%), Positives = 116/276 (42%), Gaps = 61/276 (22%)

Query: 173 GRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIES 232
           G++S   G +  +DP+ R  G+ +Y   + I+   +  S L             S R+E 
Sbjct: 113 GKQS-ETGIIAVIDPEARVIGLRLYDGLLKIIPLGKDNSELKAS----------SIRMEE 161

Query: 233 SHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTL 292
             V           +D  F+HG   P ++++H+        ++ +H    +    IS   
Sbjct: 162 VEV-----------QDLNFLHGCQNPTIILIHQD-------INGRH----VKTHEISLRD 199

Query: 293 KQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---LNNYAVS 348
           K+   + W   N+  DA  ++ VP P+ G +++G  +I YH+ +   A+A   +N   ++
Sbjct: 200 KEFVKMPWKQDNVEPDASIVIPVPEPLCGAIIIGQESILYHNGAGYVAVAPPVINQSTIT 259

Query: 349 LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-------VQRL 401
             +           ++D+  + +L  D+A      G L +L +  + ++          L
Sbjct: 260 CYT-----------QVDSNGSRYLLGDMA------GHLFMLLLETEEKIDGTPCVKENGL 302

Query: 402 DLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
            +       +   IT + N + F+GSR GDS LV+ 
Sbjct: 303 KVELLGEISIPEAITYLDNGVLFIGSRCGDSQLVKL 338


>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
           occidentalis]
          Length = 1154

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 100/431 (23%), Positives = 167/431 (38%), Gaps = 88/431 (20%)

Query: 257 EPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS 316
           +PV+ I++E + T       +H    + AL     L + P  W   NL  +A  L+ V  
Sbjct: 178 DPVLAIVYEEQQT-------RHMKTHVIALR-DKELMKGP--WGQRNLDLEADMLIPVED 227

Query: 317 PIGGVLVVGANTIHYH-SQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND 375
              GV++VG  TI YH  Q   C                  + SF      +    + N+
Sbjct: 228 TETGVIIVGGETIVYHYGQDYICI-----------------QPSFLRTTKISCYCRIDNN 270

Query: 376 --VALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
             V +L    G L +LT+  + + V    L       +   ++ + N + F+GSRLGDS 
Sbjct: 271 RLVFILGGICGRLFILTLRRENKKVVSHSLDLLGSVSIPECLSYLDNGVVFVGSRLGDSQ 330

Query: 434 LVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-DALQDMVNGEELSLYGSASN 492
           L++                    + A  P  + L   ++  A+ DM+   +L   G    
Sbjct: 331 LIR--------------------MHAQEPFIEVLESYTNLGAILDMI-VVDLEKQGQDQL 369

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWT 552
            T S Q              G L+    G+ I+  A             VEL G KGIW 
Sbjct: 370 ITCSGQGA-----------CGSLRIIRNGIGIHELAC------------VELSGIKGIWA 406

Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--TADLLTEVTESVDYFVQGR 610
           +     R + A        DD     L++S   +T V    + + L +VT    + +  +
Sbjct: 407 L-----RMNTAQLEEDTPTDDT----LVLSFVGQTRVFNCSSTEELEQVTLPAAFDIDSQ 457

Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQ-DLSFGPSNSESGSGSENSTVLSVSIADP 669
           T  A N+ G  +VIQV ++   ++  +  T+ D  F P        + N   +++++ + 
Sbjct: 458 TFCARNVLG-NQVIQVTDKRVNLISVTSKTRVDQWFPPEGEIITQCACNDVQVALALKNV 516

Query: 670 YVLLGMSDGSI 680
            V L + DGS+
Sbjct: 517 LVYLEIRDGSL 527


>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
          Length = 1286

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 127/604 (21%), Positives = 225/604 (37%), Gaps = 114/604 (18%)

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
           L+       L L+    + G +  L      G++    +D +++  +  ++ +L+F +  
Sbjct: 41  LLRADKQGKLNLIASKDVFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
           +      +HC       + K G      G  + VDP+GR   +        VY L     
Sbjct: 97  NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149

Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
            Q+ I              D  G   GF   I +S   N    D K V +   + G    
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPIFASIEQNYEMYD-KQVTNTNEIDGCTRK 208

Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            ++ L E +L                   ++  +++H        LP D+   L +P P 
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDSSAHLLIPIPG 243

Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           G     GV+V   N + Y         CA     Y   L++ QE    + S+   A H  
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSALHRI 295

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
             +    L+ ++ GDL  + + ++  +V+ +     +   + + I  + +   F+ +  G
Sbjct: 296 R-KFFFILIQSEFGDLYKIEMDHEDGIVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354

Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
           +    QF+  G   +      K   G     A  TK+L   ++  L D V      L + 
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411

Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
              + N  S Q         +L   GP   L+   +GL I   A              EL
Sbjct: 412 ILDAKNANSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN------------EL 452

Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG  K IWT+     +  NA          +Y  Y+I+S E  T++LE  + + EV +S+
Sbjct: 453 PGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDSL 498

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
              +   T    N+     +IQV + G R ++G  + + +   P N +  + + NST + 
Sbjct: 499 --LLTNVTTIHVNILYDNTLIQVHDTGIRHINGKVVHEWVP--PKNKQIKAATSNSTQIV 554

Query: 664 VSIA 667
           +S++
Sbjct: 555 ISLS 558


>gi|302406266|ref|XP_003000969.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
 gi|261360227|gb|EEY22655.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
          Length = 1059

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 132/578 (22%), Positives = 217/578 (37%), Gaps = 141/578 (24%)

Query: 104 VCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFE 163
           V  + + G + S+A     G++    +D +ILA +  +I+++E+        + + + F+
Sbjct: 59  VLSHDVFGIIRSMAAFRIAGSN----KDYLILATDSGRIAIIEY--------LPAQNRFQ 106

Query: 164 SPEWLHL----KRGRESFARGPLVKVDPQGRCGGVLVYGLQ-----MIILKASQGGSGLV 214
               LHL    K G      G  +  DP+GR    L+  L+      ++ + SQ      
Sbjct: 107 R---LHLETFGKSGIRRVVPGEFLACDPKGRA--CLIASLEKNKLVYVLNRNSQA----- 156

Query: 215 GDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRV 274
             E T  S     A     HV+++  LD+          GY  PV   L E + T A + 
Sbjct: 157 --ELTISSP--LEAHKPGVHVLSMVALDV----------GYANPVFAAL-ETDYTEADQD 201

Query: 275 SWKHHTCMISALSISTTLKQHPL------IWSAMNLPHDAYKLLAVPSPIG-----GVLV 323
                    +AL + T L  + L      +    + P D    L    P G     GVLV
Sbjct: 202 PTGQ-----AALDVETQLVYYELDLGLNHVVRKWSEPVDNTASLLFQVPGGNDGPSGVLV 256

Query: 324 VGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA----L 378
            G   I Y HS   +  + +         + E P    S+     H   L+        L
Sbjct: 257 CGEENITYRHSNQEAFRVPVPRRR----GATEDPSRKRSIVAGVMHK--LKGSAGAFFFL 310

Query: 379 LSTKTGDLVLLTVVY----DGRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGD 431
           L T+ GDL  +T+      DG     V+RL +   +   + S +  + +   ++ S+ G+
Sbjct: 311 LQTEDGDLFKITIDMIEDRDGNPTGEVKRLKIKYFDTIPVASSLCILKSGFLYVASQFGN 370

Query: 432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSAS 491
               QF              E+ GD        + L  SS D   D     E   +    
Sbjct: 371 YQFYQF--------------EKLGD------DDEELEFSSDDFPTDPKQSYEAVFF---- 406

Query: 492 NNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV----- 542
                 ++  + A+ +S+ ++ PL D         DA    +A G   +S + ++     
Sbjct: 407 ----HPRELENLALVESIDSMNPLIDCKVANLTGEDAPQIYTACGNGARSTFRILKHGLE 462

Query: 543 -------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                  ELPG    +WT+  K SRG            D+Y AY+++S    T+VL   +
Sbjct: 463 VNEIVASELPGIPSAVWTL--KLSRG------------DQYDAYIVLSFTNATLVLSIGE 508

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            + EV +S   F+      A  L G   +IQV  +G R
Sbjct: 509 TVEEVNDS--GFLTSVPTLAAQLLGGEGLIQVHPKGIR 544


>gi|428180158|gb|EKX49026.1| hypothetical protein GUITHDRAFT_68305 [Guillardia theta CCMP2712]
          Length = 1202

 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 110/552 (19%), Positives = 202/552 (36%), Gaps = 97/552 (17%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           ++ +C     G + S+A     G++    +D ++L  +  +ISVLEF    +      + 
Sbjct: 49  IQSICQMECFGLIRSMASFRLPGSN----KDYLVLGADSGRISVLEFSKERNQFERVHLE 104

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
            +        K G      G  +  DP+GR   +     Q ++          V + D  
Sbjct: 105 TYG-------KSGCRRIVPGQFLASDPKGRAVMISAIEKQKLVY---------VFNRDA- 147

Query: 221 GSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSW 276
                 S+++  S  +        H        G+  P+   L     + +    G+ S 
Sbjct: 148 ------SSKLTISSPLEAHKASTIHFSIVGVDVGFDNPIFAALEMDYSDADADETGQ-SA 200

Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSP-----IGGVLVVGANTIHY 331
           +    +++   +   L     +    + P DA   + +P P       GVLV   N I Y
Sbjct: 201 EEFNKVLTFYELDLGLNH---VVRKASEPIDAASNMLIPVPGDTDGPSGVLVCAENKIAY 257

Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV 391
                   +AL      +   Q L  + +S      H         LL ++ GDL  LT+
Sbjct: 258 KKPDHEDVVALIPRRQGMPLDQPLLITGYS------HLKQKDGFFFLLQSEIGDLYRLTL 311

Query: 392 VYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS---MLSS 448
            Y+   V  ++++  +   +   IT +     F+ S  G+  L QF    G+    M+  
Sbjct: 312 TYNDEEVSEINITYFDTVPVAQSITILKTGFLFVASEFGNHALYQFLSIKGSDESDMMPV 371

Query: 449 GLKEEFGDIE----ADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
            ++ E   IE    A  P    L     ++L  +++   L L G      E   + ++  
Sbjct: 372 EVEIEGETIEIPHFAPRPLKNLLLVDEMESLSPILDMRVLDLAG------EETPQIYA-- 423

Query: 505 VRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHKSSRG 560
               L   GP   L+   +GL +            +   + ELP     +WTV     +G
Sbjct: 424 ----LCGKGPRSTLRTLRHGLAV------------AEMAVSELPSNPLAVWTV-----KG 462

Query: 561 HNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR 620
            + D++           Y++++    T+VL   D + EVT+S  +    +T++  +L G 
Sbjct: 463 SSKDAA---------DKYIVVTFANATIVLSIGDTVEEVTDS-GFLATNKTLSV-SLLGD 511

Query: 621 RRVIQVFERGAR 632
             ++QV   G R
Sbjct: 512 DSLLQVHPNGLR 523


>gi|146420838|ref|XP_001486372.1| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 1206

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 120/610 (19%), Positives = 237/610 (38%), Gaps = 96/610 (15%)

Query: 96  ISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLR 155
           + +  ++ +CH ++ G ++++  + +GG++     D +++  +  ++S+LEFD       
Sbjct: 53  LESGKIKQICHQQVIGVIQNIDRIRKGGSN----LDLLVITSDSGRLSILEFDKD----- 103

Query: 156 ITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVG 215
              +  F   +  H K G      G  + VDPQ R   +       ++ KA    + L  
Sbjct: 104 --ELKFFPVVQEPHSKNGMNRTTPGEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKL-- 159

Query: 216 DEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVS 275
                       +  +++  I +  LD           GY  P++  +   E  +A   +
Sbjct: 160 -----ELLSPLESVSKNTLTIQMVSLDT----------GYENPMLAAI---ECNYAHYDA 201

Query: 276 WKHHTCMISALSISTTLKQHPLIWSA-----MNLPHDAYKLLAVPSPIGGVLVVGANTIH 330
              +    S L++     +  L + A     + +P  +  L+ +P+PIGGV+V G++ I 
Sbjct: 202 SLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTLVPLPTPIGGVIVAGSSFIF 261

Query: 331 YHSQSASCALALNNYAVSLDS-SQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL 389
           YH+ +    L L    + L + S  +P   ++V     H     N   LL  + GD   +
Sbjct: 262 YHNPTIDQQLYL---PIPLRAGSSPVPIVCYAV-----HKLKKNNFFILLHNELGDCFRV 313

Query: 390 TVVY--DGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGSGTSML 446
            + Y  D   V  L +   +    ++ I        F      D +L Q    G   S +
Sbjct: 314 LIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDKMLYQIEDLGDNDSYI 373

Query: 447 SSGLKEEFGDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
           SS       D+ + +     + R   + AL  +++       G+    +ES +   +   
Sbjct: 374 SSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGALVKTSESKESRIAMIT 433

Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADS 565
             S      LK  ++G+ I+               LV  P      +V+          +
Sbjct: 434 GHS-----HLKLKTHGIPIST--------------LVSSPLPMIATSVF----------T 464

Query: 566 SRMAAYDDEYHAYLIISLEA--RTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRV 623
           +R++A + +   Y++IS  A  +T+VL   +++ EV +S   FV  +        G + +
Sbjct: 465 TRLSA-ESKNDEYMVISSSASSKTLVLAIGEVVEEVQDSS--FVTDQPTIGVQQVGLKSL 521

Query: 624 IQVFERGARIL-----DGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDG 678
           IQ++  G R +     +G    +   + P            T++S S     VL+G+S+ 
Sbjct: 522 IQIYSNGIRHIRQTETEGKITKKTFDWYP--------PAGITIISASTNQEQVLIGLSNR 573

Query: 679 SIRLLVGDPS 688
            +     DP+
Sbjct: 574 ELCYFEIDPT 583


>gi|339235331|ref|XP_003379220.1| DNA damage-binding protein 1 [Trichinella spiralis]
 gi|316978142|gb|EFV61158.1| DNA damage-binding protein 1 [Trichinella spiralis]
          Length = 1329

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 152/355 (42%), Gaps = 66/355 (18%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R  +  +SA  L+ V   ++ G + +  + +  G + +     +++      ++++E+D+
Sbjct: 205 RFEVHSVSAEGLQYVTEGKMFGRIGAAKLFTPKGENKAL----MVIVTLKQDVAIVEYDN 260

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQG 209
                RI ++      E      GR + + G L+ V P G   G+ +       +  ++ 
Sbjct: 261 G----RIKTLASRNISE----NFGRPA-SNGILLSVHPDGEVIGLRIMSSTFKCITWNRA 311

Query: 210 GSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELT 269
            S L                  S++ +N     + H+ DF+F+HG+  PV+ +++     
Sbjct: 312 TSKL------------------STYSLNY---SLTHLSDFVFLHGFQFPVIALIY----- 345

Query: 270 WAGRVSWKHH-TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT 328
             G +  +H  TC IS        +  P  WS  ++  +A+ L+AVP P+ GV+VVG ++
Sbjct: 346 --GDLVGRHVITCRISL--DEQEFENGP--WSRGHIEWEAHTLIAVPPPLCGVIVVGCSS 399

Query: 329 IHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL 388
           + Y           +N  +S  S   L +S  +   DAA           L    G L L
Sbjct: 400 LLY---------IRDNSTISTVSPPFLSKSIVNC-YDAAP----DGLTYFLGQLDGTLSL 445

Query: 389 LTVVYDGRVVQRLDLSKTNPSVL--TSDITTIG----NSLFFLGSRLGDSLLVQF 437
           L +  +     ++ LS+   ++L  TS   ++      SL F+GSR+ DS L++ 
Sbjct: 446 LKLDIETDAEGKVTLSRMRATILGVTSPPDSLSYMHKESLLFVGSRIADSKLLRL 500


>gi|261335516|emb|CBH18510.1| cleavage and polyadenylation specificity factor-like protein,
           putative [Trypanosoma brucei gambiense DAL972]
          Length = 1452

 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
           +++V+D  F+    EP++ IL ER+ TWAGRV    W+      + LS       IS T 
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
              P   L+     +P++   +L V S    + GV+  G NTI +               
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373

Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
                 S+S++ +    N+    L+SS  L R + S+    A     ++D        +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
              G +++L  +  G  +  + ++        S IT I   L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487


>gi|74025892|ref|XP_829512.1| cleavage and polyadenylation specificity factor-like protein
           [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
 gi|70834898|gb|EAN80400.1| cleavage and polyadenylation specificity factor-like protein,
           putative [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
          Length = 1452

 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 59/234 (25%), Positives = 98/234 (41%), Gaps = 42/234 (17%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHHTCMISALS-------ISTTL 292
           +++V+D  F+    EP++ IL ER+ TWAGRV    W+      + LS       IS T 
Sbjct: 254 IRYVRDVQFIGTLGEPLLAILCERKPTWAGRVKLVEWRTKAVESNMLSQQVTWVQISGTA 313

Query: 293 KQHP---LIWSAMNLPHDAYKLLAVPS---PIGGVLVVGANTIHY--------------- 331
              P   L+     +P++   +L V S    + GV+  G NTI +               
Sbjct: 314 SALPKLLLVGEVDGVPYNVTHMLPVGSISQAMSGVICFGVNTIMHITTRRGYGAYWNETG 373

Query: 332 -----HSQSASCALALNNYA-VSLDSSQELPRSSFSVELDAAHATWLQND-----VALLS 380
                 S+S++ +    N+    L+SS  L R + S+    A     ++D        +S
Sbjct: 374 KEECTSSKSSAVSYGKINWCDKKLESSTALFRVNLSLANCVAATLEGKDDEGSLQAVAVS 433

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLL 434
              G +++L  +  G  +  + ++        S IT I   L FLGS + DS +
Sbjct: 434 EDDGVVLMLQFLSQGSNIHDIRIAVLTSGCYCSSITPISERLMFLGSAVSDSCI 487


>gi|402595041|gb|EJW88967.1| hypothetical protein WUBG_00126 [Wuchereria bancrofti]
          Length = 621

 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 82/363 (22%), Positives = 135/363 (37%), Gaps = 104/363 (28%)

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
           +W   NL  +A  +++VP P+GG L+ G + I YH +    AL    YA    S      
Sbjct: 201 LWKHDNLEGEANIVISVPEPVGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY---------DGRVVQRLDLSKTNP 408
            +    +D     +L  D+A      G+L +L +              +V+ + +     
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLELGKDQEQDENSAVIVRDMKVESLGE 310

Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
           + +   +  + N + F+GSR GDS                                 +L 
Sbjct: 311 TCIAECMCYLDNGVCFIGSRFGDS---------------------------------QLI 337

Query: 469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA 528
           R S++                       A  T   ++ DS  N+ P++D +  +R N   
Sbjct: 338 RLSTEP---------------------RADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQ 375

Query: 529 ---SATGISKQSNY----------EL--VELPGCKGIWTVYHKSSRGHNADSSRMAAYDD 573
              + +G  K              EL  VEL G K ++T+    +RG            D
Sbjct: 376 QILTCSGAYKDGTIRIIRNGIGIEELASVELKGIKNMFTL---RTRG------------D 420

Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
           E+  YLI+S ++ T VL       E TE   + V G T+ AG LF  + ++QV      +
Sbjct: 421 EFDDYLILSFDSETHVLFINGEELEDTEITGFAVDGATLWAGCLFHSKTILQVTHGEVIL 480

Query: 634 LDG 636
           +DG
Sbjct: 481 IDG 483


>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
 gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
          Length = 1103

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 77/358 (21%), Positives = 151/358 (42%), Gaps = 60/358 (16%)

Query: 82  KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAK 141
           KN      R+  +G+S+     V  +   G ++++++    G     ++D + +  ED  
Sbjct: 35  KNQYLQVNRLSEEGVSS-----VVEFEAPGRIDTMSLFRPSG----EKQDLLFITIEDTF 85

Query: 142 ISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQ 200
            ++   D  I  L   S+   + P       GR S   G +  +DP  R   + +Y GL 
Sbjct: 86  FTLGFIDGKIETLSSGSI---DDP------VGRRS-ESGSITTIDPLCRAVALSIYEGLL 135

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
            II            ++  F     F+ R+E  +VI++  L+    K          P  
Sbjct: 136 KII--------PFENNKHQFKEA--FNVRLEELNVIDIAFLESLGSK------SKSGPTF 179

Query: 261 VILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG 320
            +L++  +          H       ++   +++  L  + +N+ H A  L+ VP+P+GG
Sbjct: 180 ALLYQDHV-------GSRHVKTYEVKTLDKDMEESSL--NQLNVDHGANILIPVPAPLGG 230

Query: 321 VLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLS 380
           V+ VG   + Y ++S        N++V+  ++  +   S+  +LD  +  W   D     
Sbjct: 231 VICVGEAQVSYINESN------KNHSVASPANSRMAIRSYG-KLD--NTRWFLGD----- 276

Query: 381 TKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
            ++G L LL++      V  L L +   + ++S I+ + N   F+GS  GDS +++ +
Sbjct: 277 -QSGQLYLLSLQVSDSEVTGLTLKELGVTSISSCISYLDNGYVFIGSNYGDSQVIRIS 333


>gi|401883281|gb|EJT47496.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 2479]
          Length = 1216

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 130/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)

Query: 85  GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
           G T+  +L    S   L+ +C     G V ++A     G      +D I+L+ +  ++S+
Sbjct: 34  GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89

Query: 145 LEFDDSIHGLRITSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
           +E       L I+    FES  + ++ K G      G  + VDP+GR    G V    L 
Sbjct: 90  IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
            I+ + ++G                  A    + V+N+   D           GY  P+ 
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183

Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
             L   EL +          A R + KH T     L ++  +++    WS    P D   
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233

Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
            L V  P             GGVLV   + +   H  +++    +      ++       
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289

Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
            +SS  + + AA    ++     LL ++ GDL   T+ ++G  V+ L +   +   + + 
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
           +  + +   F+ S  GD  L QF            S T     GL EE          P 
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
              L   +  +L  +++ + ++L G+AS+  +     ++   R      GP    +   +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458

Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
           GL IN               LVE  LPG    +WT+                + DDEY +
Sbjct: 459 GLDINV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           Y+++S    T+VL   + + EV ++  +   G T+A   L G   ++QV   G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543


>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
           DNA-binding protein 1) (UV-damaged DNA-binding factor)
           (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
           (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
           pigmentosum group E-co... isoform 1 [Ciona intestinalis]
          Length = 1150

 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 92/438 (21%), Positives = 167/438 (38%), Gaps = 109/438 (24%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F+ RIE   VI+ +           F+HGY  P +VI+++           +H    I  
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS-------QGRHVKTYIVD 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA---- 341
           +     +      W   N+  +A  ++ VP P+ G +++G  +I YH+      +A    
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIAPPQI 253

Query: 342 ---LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYD 394
              +N YA                 +D   + +L  D+A      G L +L +    + D
Sbjct: 254 KDTINCYA----------------PVDKDGSRYLLGDLA------GHLFILLLESDEMMD 291

Query: 395 G-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEE 453
           G   V+ L +       +   I+ + N + ++GSRLGDS L++                 
Sbjct: 292 GTNTVRDLKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIR----------------- 334

Query: 454 FGDIEADAPSTKRLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVN 511
              +  D+    R + S    L    N G  + +     +     Q  T S A ++    
Sbjct: 335 ---LPTDSSMEGRPKPSLISVLDTYTNLGPIIDMCVVDLDRQGQGQVVTCSGAFKE---- 387

Query: 512 IGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAY 571
            G L+    G+ I   AS            ++LPG KG+W +          D+SR  +Y
Sbjct: 388 -GSLRIIRNGIGIQEHAS------------IDLPGIKGLWPL-------RVFDTSR--SY 425

Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
           D      L+IS    + +L+ +    E T+   +  + +T    N+    +++Q+ E+  
Sbjct: 426 DT-----LVISFVGHSRILQLSGEEVEETDLPGFDDESQTFYCSNVC-HNQLVQITEKSI 479

Query: 632 RILDGSYMTQDLSFGPSN 649
           R++  +   Q   + P N
Sbjct: 480 RLISHTERRQVHEWKPKN 497


>gi|348667612|gb|EGZ07437.1| hypothetical protein PHYSODRAFT_565381 [Phytophthora sojae]
          Length = 1197

 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 68/301 (22%), Positives = 116/301 (38%), Gaps = 76/301 (25%)

Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDV- 376
           VLV+G NT+ Y ++     +CA+            Q  PR    V    + AT  Q D+ 
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAIP---------RRQGEPRDIVIV----SAATHKQRDLF 293

Query: 377 -ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLV 435
             LL ++ GDL  +++ Y G  V+ + +   +   + S +      L F  S   +  L 
Sbjct: 294 FVLLQSELGDLYKISLDYSGNAVEEIKIQFFDTVPVASSMCITKTGLLFCASEFSNHYLF 353

Query: 436 QF-TCGSG------------TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGE 482
           QF + G G             + LS+    +  +++  A S   L   +   + D+ N +
Sbjct: 354 QFLSIGEGDDTAKCSSLAMDPTELSTFPLRKLTNLQL-ASSMPSLSPVTQLLVDDLANEQ 412

Query: 483 ELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV 542
              +Y    N+  S+                 L+   +GL I   A++            
Sbjct: 413 TPQMYALCGNSNRSS-----------------LRVLRHGLPITEMAASA----------- 444

Query: 543 ELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
            LPG  K +W +                +Y D Y  Y+++S E  T+VLE  + + EVT+
Sbjct: 445 -LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETVEEVTQ 489

Query: 602 S 602
           S
Sbjct: 490 S 490


>gi|406698009|gb|EKD01256.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 8904]
          Length = 1216

 Score = 50.8 bits (120), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 129/595 (21%), Positives = 223/595 (37%), Gaps = 132/595 (22%)

Query: 85  GETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
           G T+  +L    S   L+ +C     G V ++A     G      +D I+L+ +  ++S+
Sbjct: 34  GSTRLEILKLNPSTGQLDSICSSEAFGTVRNVAAFRLAGMG----KDYIVLSSDSGRLSI 89

Query: 145 LEFDDSIHGLRITSMHCFESP-EWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQ 200
           +E       L I+    FES  + ++ K G      G  + VDP+GR    G V    L 
Sbjct: 90  IE-------LVISPTPHFESLYQEVYGKSGSRRTIPGQFLAVDPKGRSAMFGAVEKQKLC 142

Query: 201 MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM 260
            I+ + ++G                  A    + V+N+   D           GY  P+ 
Sbjct: 143 YILNRNTEG---------KVYPSSPLEAHKNHTLVVNMIACDT----------GYDNPMF 183

Query: 261 VILHERELTW----------AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYK 310
             L   EL +          A R + KH T     L ++  +++    WS    P D   
Sbjct: 184 AAL---ELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRK----WSE---PTDRRA 233

Query: 311 LLAVPSP------------IGGVLVVGANTI---HYHSQSASCALALNNYAVSLDSSQEL 355
            L V  P             GGVLV   + +   H  +++    +      ++       
Sbjct: 234 NLLVQVPGGQNANTDRFDGPGGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPG---- 289

Query: 356 PRSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
            +SS  + + AA    ++     LL ++ GDL   T+ ++G  V+ L +   +   + + 
Sbjct: 290 -QSSRGIIIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATS 348

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTC---------GSGTSMLSSGLKEEFGDIE--ADAPS 463
           +  + +   F+ S  GD  L QF            S T     GL EE          P 
Sbjct: 349 LCILKSGYLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPL 408

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSY 520
              L   +  +L  +++ + ++L G+AS+  +     ++   R      GP    +   +
Sbjct: 409 QNLLLADTLSSLDPILDAQVVNLLGNASDTPQ----IYAACGR------GPRSTFRSLKH 458

Query: 521 GLRINADASATGISKQSNYELVE--LPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHA 577
           GL +N               LVE  LPG    +WT+                + DDEY +
Sbjct: 459 GLDVNV--------------LVESPLPGVPNAVWTL--------------KLSEDDEYDS 490

Query: 578 YLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           Y+++S    T+VL   + + EV ++  +   G T+A   L G   ++QV   G R
Sbjct: 491 YIVLSFPNGTLVLSIGETIEEVNDT-GFLSSGPTLAVQQL-GSAGLLQVHPAGLR 543


>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
 gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
          Length = 1230

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 127/607 (20%), Positives = 220/607 (36%), Gaps = 120/607 (19%)

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
           L+       L L+    + G +  L      G++    +D +++  +  ++++L+F +  
Sbjct: 41  LLRADKQGKLNLIASKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLTILQFSNEK 96

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
           +      +HC       + K G      G  + VDP+GR   +        VY L     
Sbjct: 97  NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDTK 149

Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
            Q+ I              D  G   GF   + +S   N   LD K V +   +  Y   
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEALD-KQVTNTSEIDSYTRK 208

Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            ++ L      W   +   H                   +      P DA   L +P P 
Sbjct: 209 TLLSL------WEMDLGLNH-------------------VIRKYTFPIDASAHLLIPIPG 243

Query: 319 G-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           G     GV+V   N + Y         CA     Y   L++ QE   S     L      
Sbjct: 244 GQQGPSGVIVCCDNFLVYKKVDHADVYCA-----YPRRLETGQEKNLSIVCSTLHRIRKF 298

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
           +      L+ ++ GDL  + + ++  VV+ +     +   + + I  + +   F+ +  G
Sbjct: 299 FF----ILIQSELGDLYKIEMEHEDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354

Query: 431 DSLLVQFTCGSG----TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--L 484
           +    QF+ G G     +M +S  K   G     A  TK+L   ++  L D V      L
Sbjct: 355 NHFFYQFS-GIGDEDNEAMCTS--KHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPIL 408

Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYEL 541
            +    + N  S Q         +L   GP   L+   +GL I   A             
Sbjct: 409 DMKVIDAKNASSPQIY-------ALCGRGPRSSLRILQHGLSIEELADN----------- 450

Query: 542 VELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
            ELPG  K IWT+     +  NA          +Y  Y+I+S E  T++LE  + + EV 
Sbjct: 451 -ELPGRPKFIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVV 495

Query: 601 ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST 660
           +S+   +   T    N+     +IQV + G R ++G  + + +   P N +  + + N  
Sbjct: 496 DSL--LLTNVTTIHVNILYDNSLIQVHDAGIRHINGKVIHEWVP--PKNKQIKAATSNCA 551

Query: 661 VLSVSIA 667
            + +S++
Sbjct: 552 QIVISLS 558


>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
           Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
 gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
           Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
          Length = 1159

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 96/461 (20%), Positives = 172/461 (37%), Gaps = 116/461 (25%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP+ R  G+ +Y     ++   +    L            F+ R+E  HVI+++     
Sbjct: 143 IDPECRMIGLRLYDGLFKVIPLDRDNKEL----------KAFNIRLEELHVIDVK----- 187

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
                 F++G   P +  +++      GR     H              + P  W   N+
Sbjct: 188 ------FLYGCQAPTICFVYQDP---QGR-----HVKTYEVSLREKEFNKGP--WKQENV 231

Query: 305 PHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV-- 362
             +A  ++AVPSP GG +++G  +I YH+     A+A             + + S  V  
Sbjct: 232 EAEASMVIAVPSPFGGAIIIGQESITYHNGDKYLAIA-----------PPIIKQSTIVCH 280

Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
             +D   + +L  D+       G L +L +      DG V ++ L +     + +   +T
Sbjct: 281 NRVDPNGSRYLLGDME------GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLT 334

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS---GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            + N + F+GSRLGDS LV+    S   G+ +++       G I  D       R+    
Sbjct: 335 YLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPI-VDMCVVDLERQGQGQ 393

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            +                        T S A ++     G L+    G+ I+  AS    
Sbjct: 394 LV------------------------TCSGAFKE-----GSLRIIRNGIGIHEHAS---- 420

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +    +R              E    L++S   +T VL   
Sbjct: 421 --------IDLPGIKGLWPLRSDPNR--------------ETDDTLVLSFVGQTRVLMLN 458

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE + +    +T   GN+   +++IQ+     R++
Sbjct: 459 GEEVEETELMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 498


>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis
           niloticus]
          Length = 1140

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 74/341 (21%), Positives = 129/341 (37%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S        + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVAVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+     R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479


>gi|432851195|ref|XP_004066902.1| PREDICTED: DNA damage-binding protein 1-like [Oryzias latipes]
          Length = 1140

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 74/341 (21%), Positives = 130/341 (38%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+    S        + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVTVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PILDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G  +D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRESDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+   +++IQ+     R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHQQLIQITSGSVRLV 479


>gi|402222132|gb|EJU02199.1| hypothetical protein DACRYDRAFT_21931 [Dacryopinax sp. DJM-731 SS1]
          Length = 1209

 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 87/376 (23%), Positives = 150/376 (39%), Gaps = 83/376 (22%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ ++   V+ + +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIDHEDEEVKTMKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 -TCGSGTSML--SSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN-- 492
              G     +  SS    + G  +    +  R R   +  L D +N  +  +    +N  
Sbjct: 368 QKLGDDDDEIEYSSVSYPDNGMADPIPQAYFRPRPLENLVLADELNSFDPIVDAKVTNLL 427

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           NT++ Q  F+   R +  +   L+   +GL +    S+            ELPG    +W
Sbjct: 428 NTDTPQ-IFAACGRGARSSFRMLR---HGLDVEETVSS------------ELPGIPNAVW 471

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  K+              DD+Y AY+I+S    T+VL   + + EV+++  +     T
Sbjct: 472 TVKLKA--------------DDQYDAYIILSFVNGTLVLSIGETIEEVSDT-GFLSSSPT 516

Query: 612 IAAGNLFGRRRVIQVFERGAR------------------ILDGS---------------- 637
           IA   + G   ++QV+  G R                  I+  +                
Sbjct: 517 IAVQQI-GEDSLLQVYPHGIRHVLSDRRVNEWRCPQHTTIVAATTNSRQVAIALSSAQLV 575

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDPST 689
           Y   DL  G  N      S  S VL++SIA+        PY+ +G  D ++R++  DP T
Sbjct: 576 YFELDLE-GQLNEYQDRKSLGSGVLAMSIAEVPEGRQRTPYLAVGCEDQTVRIISLDPDT 634

Query: 690 C--TVSVQTPAAIESS 703
               +S+Q   A  SS
Sbjct: 635 TLENISLQALTAPPSS 650


>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1223

 Score = 49.7 bits (117), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 62/262 (23%), Positives = 113/262 (43%), Gaps = 35/262 (13%)

Query: 180 GPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLR 239
           GP+  VDP+ R  G+ +Y     ++   Q G               FS R+E   V +++
Sbjct: 130 GPIGAVDPECRMYGLHLYDGLFKVIPMDQTGQ----------LREAFSVRLEELQVFDVK 179

Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
                      F+ G  +P + +L++   T  GR    +  C+            +P  W
Sbjct: 180 -----------FLAGTPKPTIAVLYQD--TKEGRHIKTYEVCLKDK-------DFNPGPW 219

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
           +  ++   +  L+AVP+P+GGV+VVG   I Y ++  +  +              + ++ 
Sbjct: 220 AQNDVESGSRFLIAVPAPLGGVVVVGEKVIAYLNKETTHGVGDGGGGGGGGGGGMIVKA- 278

Query: 360 FSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
            +++ DA   T+   D      LLS   G L LL +++D   V+ L L     + + S +
Sbjct: 279 IAMQSDATIMTYGAVDKDGSRYLLSDSAGRLHLLVLMHDKTRVRALKLESLGQTSIASSL 338

Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
           + + N + ++GS  GDS LV+ 
Sbjct: 339 SYLDNGVVYVGSAYGDSQLVRL 360


>gi|68531971|ref|XP_723667.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23478038|gb|EAA15232.1| Drosophila melanogaster CG13900 gene product [Plasmodium yoelii
           yoelii]
          Length = 1235

 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 62/295 (21%), Positives = 121/295 (41%), Gaps = 43/295 (14%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L+ ++ GDL  + V ++  +V+ +     +   + + I  + +   F+ +  G+    QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361

Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
           +     S  +M +S      G     A  T++L+     D +  +    ++ +  + ++N
Sbjct: 362 SGIGNDSNDAMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 419

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWT 552
                       R SL      +   +GL I   A+             ELPG  + IWT
Sbjct: 420 LPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 461

Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
           V   +S               EY  Y+I+S E  T++LE  + + EV +S+   +   T 
Sbjct: 462 VKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 505

Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
              NL      IQV++ G R ++G  + + +   P N +  + + N + + VS++
Sbjct: 506 IHINLLYDNSFIQVYDTGIRHINGKIVQEWIP--PKNKQINAATSNGSQIVVSLS 558


>gi|324502823|gb|ADY41238.1| DNA damage-binding protein 1, partial [Ascaris suum]
          Length = 1129

 Score = 49.3 bits (116), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 41/144 (28%), Positives = 65/144 (45%), Gaps = 12/144 (8%)

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           P +W   N+  +A  ++ +P P GGV+VVG   I YH  +       N Y+         
Sbjct: 200 PPLWKQDNIEAEACMVIPIPQPYGGVIVVGHEAISYHKDA-------NAYSAIAPPLIHQ 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVL-LTVVYDGRV-VQRLDLSKTNPSVLTS 413
            + S   ++D     +L  D   LS +   L+L L V  DG   V+ L +     + +  
Sbjct: 253 SQISCYGKIDRDGQRYLLGD---LSGRIFMLLLDLDVATDGTASVKDLKVELLGETSIPE 309

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            +  + N + F+GSR GDS LV+ 
Sbjct: 310 CVVYLDNGVVFIGSRFGDSQLVRL 333


>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
 gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
          Length = 1135

 Score = 49.3 bits (116), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 130/628 (20%), Positives = 229/628 (36%), Gaps = 177/628 (28%)

Query: 90  RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
           R+ ++  +   L+ V  + L+G +  + +        + ++D + +  E     +LE+ D
Sbjct: 43  RIEVNLSTQTGLKPVTEFNLYGRIAVIEVFRY----KNEKKDCLFILTESCYACILEYVD 98

Query: 150 SIHGLRITSMHCFESPEWLHLKRGRESFAR-GPLVKVDPQGRCGGVLVYGLQMIILKASQ 208
              G  IT         +  ++    S ++ G    VDP+ RC  + +Y   + I+  + 
Sbjct: 99  ---GKIITRA-------YGDMRDKNYSVSQSGMHACVDPEARCIALRLYDGVLKIINLNS 148

Query: 209 GGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHEREL 268
               L   E           RIE   V+           D  F+H   +P + +L++   
Sbjct: 149 SSKHLTSAEQ----------RIEEILVV-----------DMCFLHTANKPTLALLYDDN- 186

Query: 269 TWAGRVSWKHHTCMISALSIS---TTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVG 325
                 S +H + +   L  S    ++ + P  +    +  D   ++AVP P+ G+L++G
Sbjct: 187 ------SSRHLSTIAITLDNSGSGASIHKGP--FRHTQVEQDTILIVAVPEPLAGILLLG 238

Query: 326 ANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGD 385
              I YH        ++ N                 V+      T +     L     G+
Sbjct: 239 HVNITYHDSKNRSTCSIENI----------------VKRTIECVTPIDKHRYLCGDSNGE 282

Query: 386 LVLLTVVYDGRVV----QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
           L LL + Y+   +     RL       + L + ++ I N + F+GS  GDS L++     
Sbjct: 283 LFLLLLDYNENRIPEERMRLATKYLGRTTLPNTLSYIDNYVVFVGSTFGDSELIRI---- 338

Query: 442 GTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTF 501
                      E  D                                   NN  S Q   
Sbjct: 339 -----------EVSD-----------------------------------NN--SGQHFT 350

Query: 502 SFAVRDSLVNIGPLKDFSY--------GLRINADASATGISKQ--------SNYELVELP 545
           S    D+L   GP+KD           G  + A    TG S +          Y  ++L 
Sbjct: 351 SLHQYDNL---GPIKDMCIVDFEKQGQGQLVTASGVGTGGSLRIIRNGVGIHEYASIDLE 407

Query: 546 GCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMV--LETADLLTEVTESV 603
           G KG+W + + SS      S++  +        L++S   +T+   LE  D +TEV E +
Sbjct: 408 GVKGLWALKYLSS------STKQDS--------LLLSFVGQTIFLRLEGQD-VTEV-EEI 451

Query: 604 DYFVQG-RTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGS----EN 658
             F  G +T+ AGN+   ++ +Q+ E+  R++                ES  GS    EN
Sbjct: 452 PGFTNGEQTMYAGNV-TDQQFLQITEKQVRLI--------------ADESLKGSWEPEEN 496

Query: 659 STVLSVSIADPYVLLGMSDGSIRLLVGD 686
           + +   S+    VLLG+   +I L + D
Sbjct: 497 TQINLCSVNKNQVLLGVGSTAIYLEIND 524


>gi|268536658|ref|XP_002633464.1| C. briggsae CBR-DDB-1 protein [Caenorhabditis briggsae]
          Length = 1134

 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 87/357 (24%), Positives = 148/357 (41%), Gaps = 82/357 (22%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           D+  L+ VP+P+GGV+V+GAN+  Y +   +  +    Y+ SL     L  + F+    +
Sbjct: 210 DSQVLIPVPAPVGGVIVLGANSALYKASDVNGDVV--PYSCSL-----LKNTIFTCHGIV 262

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTV-VYDGR---VVQRLDLSKTNPSVLTSDITTIGN 420
           DA+       D  LL+   G L++L + + +GR    V+ + +     + +   +  + N
Sbjct: 263 DAS------GDRFLLADTDGRLLMLLLNIGEGRSGTTVKEMRIEYLGETSVADSVNYVDN 316

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
            + F+GSRLGDS L++          S  L+                  +++  ++DMV 
Sbjct: 317 GVVFVGSRLGDSQLIRLMTAPNGGSYSVVLET----------------YTNTGPIRDMVL 360

Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
            E         ++ +    T S A +D     G L+    G+ I   AS           
Sbjct: 361 VE---------SDGQPQLVTCSGADKD-----GSLRVIRNGIGIEELAS----------- 395

Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
            V+L    G++ +  +S+     D+  + +  DE H   I   E     LE   LL   T
Sbjct: 396 -VDLAKVIGMFPIRLRST----TDNFVIVSLPDETHVLKITGEE-----LEDVQLLEIET 445

Query: 601 ESVDYFVQGRTIAAGNLFG---RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
           E         T+ A +LFG      ++QV E   R +  S+  Q   + P+N ES S
Sbjct: 446 ERT-------TMYASSLFGPDDSELILQVTEEEIRFM--SFQKQVKIWRPTNGESVS 493


>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
           DNA-binding protein 1) (UV-damaged DNA-binding factor)
           (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
           (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
           pigmentosum group E-co... isoform 2 [Ciona intestinalis]
          Length = 1142

 Score = 48.9 bits (115), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 51/217 (23%), Positives = 91/217 (41%), Gaps = 40/217 (18%)

Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
           F+ RIE   VI+ +           F+HGY  P +VI+++           +H    I  
Sbjct: 155 FNIRIEELSVIDAK-----------FLHGYTTPTLVIIYQNS-------QGRHVKTYIVD 196

Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNY 345
           +     +      W   N+  +A  ++ VP P+ G +++G  +I YH+      +A    
Sbjct: 197 VRDKEVVAGP---WKQENIDAEANFIINVPKPLAGSIIIGQESITYHNGDKYIPIA---- 249

Query: 346 AVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDG-RVVQR 400
              L   Q+       V+ D +   +L  D+A      G L +L +    + DG   V+ 
Sbjct: 250 --PLCFFQDTINCYAPVDKDGSR--YLLGDLA------GHLFILLLESDEMMDGTNTVRD 299

Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L +       +   I+ + N + ++GSRLGDS L++ 
Sbjct: 300 LKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIRL 336


>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium
           falciparum 3D7]
 gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium
           falciparum 3D7]
          Length = 1329

 Score = 48.5 bits (114), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 80/379 (21%), Positives = 147/379 (38%), Gaps = 64/379 (16%)

Query: 304 LPHDAYKLLAVPSPIG-----GVLVVGANTIHYHS---QSASCALALNNYAVSLDSSQEL 355
           LP D    L +P P G     GVL+   N + Y     +   CA     Y   L+  Q+ 
Sbjct: 261 LPIDITAHLLIPLPGGQQGPSGVLICCENFLVYKKVDHEDIYCA-----YPRRLEIGQDK 315

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
             S     +      +      L+ ++ GDL  + V ++  +V+ +     +   + + I
Sbjct: 316 NISIICWTMHRIKKFFF----ILIQSEYGDLYKIEVDHEDGIVKEIVCKYFDTVPIGNSI 371

Query: 416 TTIGNSLFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           + + +   F+ +  G+    QF+  G              G     A  T +L+      
Sbjct: 372 SVLKSGSLFVAAEFGNHYFYQFSGIGDDNKQFMCTSNHPLGKNAIIAFKTNKLKNL---Y 428

Query: 475 LQDMVNGEE--LSLYGSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADAS 529
           L D +      L +    + NT + Q  ++   R      GP   L+   +GL I   A 
Sbjct: 429 LVDQIYSLSPILDMKIIDAKNTHTPQ-IYTLCGR------GPRSSLRILQHGLSIEELAD 481

Query: 530 ATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM 588
                        ELPG  K IWT+   +                EY  Y+++S E  T+
Sbjct: 482 N------------ELPGKPKYIWTIKKDNL--------------SEYDGYIVVSFEGNTL 515

Query: 589 VLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPS 648
           +LE  + + EV++++   +   T    N+      IQV++ G R ++G  + + ++  P 
Sbjct: 516 ILEIGESVEEVSDTL--LLNNVTTLHINILYDNSFIQVYDTGIRHINGKVVQEWVA--PK 571

Query: 649 NSESGSGSENSTVLSVSIA 667
           N +  + S NS+ + +S++
Sbjct: 572 NKQIKAASSNSSQIVISLS 590


>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
 gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
           strain H]
          Length = 1276

 Score = 48.5 bits (114), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 123/604 (20%), Positives = 222/604 (36%), Gaps = 114/604 (18%)

Query: 92  LMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
           L+       L L+    + G +  L      G++    +D +++  +  ++ +L+F +  
Sbjct: 41  LLRADKQGKLNLIVSKDIFGIIRCLQTFRLTGSN----KDYVVIGSDSGRLVILQFSNEK 96

Query: 152 HGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGL----- 199
           +      +HC       + K G      G  + VDP+GR   +        VY L     
Sbjct: 97  NDF--VRVHC-----ETYGKSGLRRIIPGEYIAVDPKGRALMICAIERQKFVYILNRDNK 149

Query: 200 -QMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEP 258
            Q+ I              D  G   GF   + +S   N    D K V +   +      
Sbjct: 150 EQLTISSPLDAHKSHTICHDVVGMDVGFENPMFASIEQNYEMYD-KQVTNTTEIDACTRK 208

Query: 259 VMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
            ++ L E +L                   ++  +++H        LP D    L +P P 
Sbjct: 209 TLLCLWEMDL------------------GLNHVIRKH-------TLPIDMSAHLLIPIPG 243

Query: 319 G-----GVLVVGANTIHYHSQS---ASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           G     GV+V   N + Y         CA     Y   L++ QE    + S+     H  
Sbjct: 244 GQQGPSGVIVCCDNYLVYKKVEHVDVYCA-----YPRRLETGQE---KNISIVCSTVHRI 295

Query: 371 WLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
             +    L+ ++ GDL  + + +   VV+ +     +   + + I  + +   F+ +  G
Sbjct: 296 R-KFFFILIQSEYGDLYKIEMDHQDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354

Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE--LSLY 487
           +    QF+  G   +      K   G     A  TK+L   ++  L D V      L + 
Sbjct: 355 NHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKL---TNLFLIDQVYSLSPILDMK 411

Query: 488 GSASNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVEL 544
              + N  S Q  ++   R      GP   L+   +GL I   A              EL
Sbjct: 412 ILDAKNANSPQ-IYALCGR------GPRSSLRILQHGLSIEELADN------------EL 452

Query: 545 PG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG  K IWT+     +  NA          +Y  Y+I+S E  T++LE  + + EV +++
Sbjct: 453 PGRPKYIWTI-----KKDNAS---------DYDGYIIVSFEGSTLILEIGETVEEVVDTL 498

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLS 663
              +   T    N+     +IQV + G R ++G  + + +   P N +  + + N+T + 
Sbjct: 499 --LLTNVTTIHVNILYDNSLIQVHDTGIRHINGKVINEWVP--PKNKQVKAATSNATQIV 554

Query: 664 VSIA 667
           +S++
Sbjct: 555 ISLS 558


>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
          Length = 1156

 Score = 48.5 bits (114), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 58/267 (21%), Positives = 109/267 (40%), Gaps = 59/267 (22%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G +  +DP  R  G+ +Y GL  II              D  G    +  R+E    + +
Sbjct: 122 GAMAVIDPSARVIGLKLYDGLFKII------------PLDKEGELKAYCLRMEE---VEV 166

Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH-PL 297
           +D+D        F++G   P ++I+H+  +   GR         I A  +S   K+    
Sbjct: 167 QDID--------FLYGCANPTIIIIHQDTM---GR--------HIKAKELSIKDKEFVKT 207

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
            W   N+  +A  ++ VP P+ G +++G  ++ YH+ S+  A+          S   + +
Sbjct: 208 PWKQENVETEASMIIPVPEPLCGAIIIGRESVLYHNGSSFIAI----------SPPVIKQ 257

Query: 358 SSFS--VELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVL---- 411
           S+      +D     +L  D+A      G L +L + Y+        +      +L    
Sbjct: 258 STIVCYARIDPEGTRYLLGDMA------GHLFMLLLNYEKNPDGTFKIKDPKVDLLGEIS 311

Query: 412 -TSDITTIGNSLFFLGSRLGDSLLVQF 437
               +T + N + ++ SR+GDS L++ 
Sbjct: 312 IPESLTYLDNKIIYVASRVGDSQLIKL 338


>gi|68075683|ref|XP_679761.1| splicing factor 3b, subunit 3, 130kD [Plasmodium berghei strain
           ANKA]
 gi|56500578|emb|CAH95367.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium berghei]
          Length = 1216

 Score = 48.5 bits (114), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 62/297 (20%), Positives = 120/297 (40%), Gaps = 48/297 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L+ ++ GDL  + V ++  +V+ +     +   + + I  + +   F+ +  G+    QF
Sbjct: 302 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 361

Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
           +     S  SM +S      G     A  T++L+      L D +    +          
Sbjct: 362 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNL---YLVDQIYSLPIVDMKILDAKN 416

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +  + ++   R      GP   L+   +GL I   A+             ELPG  + I
Sbjct: 417 SNIPQIYALCGR------GPRSSLRILQHGLSIEELANN------------ELPGKPRYI 458

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WT+   +S               EY  Y+I+S E  T++LE  + + EV +S+   +   
Sbjct: 459 WTIKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNV 502

Query: 611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
           T    NL      IQV++ G R ++G  + + +   P N +  + + N + + +S++
Sbjct: 503 TTIHINLLYDNSFIQVYDTGIRHINGKIVQEWVP--PKNKQINAATSNGSQIVISLS 557


>gi|17541566|ref|NP_502299.1| Protein DDB-1 [Caenorhabditis elegans]
 gi|74965443|sp|Q21554.2|DDB1_CAEEL RecName: Full=DNA damage-binding protein 1; AltName:
           Full=Damage-specific DNA-binding protein 1
 gi|5824558|emb|CAA92824.2| Protein DDB-1 [Caenorhabditis elegans]
          Length = 1134

 Score = 48.1 bits (113), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 100/396 (25%), Positives = 166/396 (41%), Gaps = 96/396 (24%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           D+  L+ VP  IGGV+V+G+N++ Y        +    Y  SL     L  ++F+    +
Sbjct: 210 DSSVLIPVPHAIGGVIVLGSNSVLYKPNDNLGEVV--PYTCSL-----LENTTFTCHGIV 262

Query: 365 DAAHATWLQNDVALLSTKTGDLVLL----TVVYDGRVVQRLDLSKTNPSVLTSDITTIGN 420
           DA+   +L      LS   G L++L    T    G  V+ + +     + +   I  I N
Sbjct: 263 DASGERFL------LSDTDGRLLMLLLNVTESQSGYTVKEMRIDYLGETSIADSINYIDN 316

Query: 421 SLFFLGSRLGDSLLVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
            + F+GSRLGDS L++  T  +G S   S + E + +I                 ++DMV
Sbjct: 317 GVVFVGSRLGDSQLIRLMTEPNGGSY--SVILETYSNI---------------GPIRDMV 359

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
             E         ++ +    T + A +D     G L+    G+ I+  AS          
Sbjct: 360 MVE---------SDGQPQLVTCTGADKD-----GSLRVIRNGIGIDELAS---------- 395

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEV 599
             V+L G  GI+ +   S    NAD+            Y+I+SL   T VL+      E 
Sbjct: 396 --VDLAGVVGIFPIRLDS----NADN------------YVIVSLSDETHVLQITGEELED 437

Query: 600 TESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSESGSGS 656
            + ++      TI A  LFG      ++Q  E+  R++  S +++   + P+N E  S  
Sbjct: 438 VKLLEINTDLPTIFASTLFGPNDSGIILQATEKQIRLMSSSGLSK--FWEPTNGEIISK- 494

Query: 657 ENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
                +SV+ A+  ++L   D ++ LL     TC V
Sbjct: 495 -----VSVNAANGQIVLAARD-TVYLL-----TCIV 519


>gi|260790329|ref|XP_002590195.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
 gi|229275385|gb|EEN46206.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
          Length = 1152

 Score = 48.1 bits (113), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 59/268 (22%), Positives = 111/268 (41%), Gaps = 56/268 (20%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP+ R  G+ +Y     ++   +    L            F+ R+E  +VI+++     
Sbjct: 124 IDPECRMIGLRLYDGLFKVIPLDRDNREL----------KAFNIRLEELNVIDVK----- 168

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL-IWSAMN 303
                 F++G   P +V +++             H   +    IS   K+     W   N
Sbjct: 169 ------FLYGCQVPTVVFVYQ-----------DPHGRHVKTYEISVRDKEFSKGPWKQDN 211

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV- 362
           +  +A  ++AVP P  G L++G  +I YH+     A+A             + +S+    
Sbjct: 212 VETEASMVIAVPEPFCGSLIIGQESITYHNGDKYVAVA----------PPAIKQSTLICH 261

Query: 363 -ELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTSDIT 416
             +DA  + +L  D++      G L +L +    + DG V V+ L +     + +   +T
Sbjct: 262 GRVDANGSRYLLGDMS------GRLFMLLLEKEELIDGSVTVKDLKVELLGETSIAECLT 315

Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
            + N + +LGSRLGDS L++    +  S
Sbjct: 316 YLDNGVVYLGSRLGDSQLIKLNVDADDS 343


>gi|47230701|emb|CAF99894.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 953

 Score = 47.8 bits (112), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 71/341 (20%), Positives = 130/341 (38%), Gaps = 64/341 (18%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 68  WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 123

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 124 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 173

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGDS LV+       S L         +++++   +      +  
Sbjct: 174 CLTYLDNGVVFVGSRLGDSQLVKVRVTHSLSEL---------NVDSNDQGSFVTVMETFT 224

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            L  +V+   + L         +    F           G L+    G+ I+  AS    
Sbjct: 225 NLGPIVDMCVVDLERQGQGQLVTCSGAFKE---------GSLRIIRNGIGIHEHAS---- 271

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  ++ R              E    L++S   +T VL  +
Sbjct: 272 --------IDLPGIKGLWPLRSEAGR--------------ETDDMLVLSFVGQTRVLMLS 309

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+    ++IQ+     R++
Sbjct: 310 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 349


>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
          Length = 1140

 Score = 47.4 bits (111), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 73/341 (21%), Positives = 127/341 (37%), Gaps = 73/341 (21%)

Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
           W   N+  +A  ++ VP P GG +++G  +I YH+     A+A      S          
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIVCHN---- 262

Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRV-VQRLDLSKTNPSVLTS 413
                +D   + +L  D+       G L +L +    + DG V ++ L +     + +  
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGTVALKDLHVELLGETSIAE 312

Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSD 473
            +T + N + F+GSRLGD  LV+    S        + E F ++                
Sbjct: 313 CLTYLDNGVVFVGSRLGDPQLVKLNVDSNDQGSFVTVMETFTNL---------------G 357

Query: 474 ALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGI 533
            + DM                +    T S A ++     G L+    G+ I+  AS    
Sbjct: 358 PIVDMC-------VVDLERQGQGQLVTCSGAFKE-----GSLRIIRNGIGIHEHAS---- 401

Query: 534 SKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   ++LPG KG+W +  +S  G   D              L++S   +T VL  +
Sbjct: 402 --------IDLPGIKGLWPL--RSEAGRETDD------------MLVLSFVGQTRVLMLS 439

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
               E TE   +    +T   GN+    ++IQ+     R++
Sbjct: 440 GEEVEETELPGFVDNQQTFYCGNV-AHNQLIQITSGSVRLV 479


>gi|390357128|ref|XP_001198237.2| PREDICTED: splicing factor 3B subunit 3-like [Strongylocentrotus
           purpuratus]
          Length = 949

 Score = 47.4 bits (111), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 61/259 (23%), Positives = 100/259 (38%), Gaps = 39/259 (15%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + +   +   + + +  +     F+ S  G+  L Q 
Sbjct: 34  LAQTEQGDIFKITLETDDDMVTEIRMKYFDTVPVATSMNVLKTGFLFIASEYGNHYLYQI 93

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS    E GD    AP T R        L+++   E LS   S     
Sbjct: 94  AHLGDDDDEPEFSSATPLEEGDTFFFAPRTLR-------NLEEVDQLESLSPILSCQIAD 146

Query: 495 ESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTV 553
            +++ T    V         ++   +GL +            S   + ELPG    +WTV
Sbjct: 147 LASEDTPQLYVACGRGPRSSMRVLRHGLEV------------SEMAVSELPGNPNAVWTV 194

Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIA 613
             KS              DDEY AY+I+S    T+VL   + + EVT+S   F+      
Sbjct: 195 KKKS--------------DDEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTPTL 238

Query: 614 AGNLFGRRRVIQVFERGAR 632
           + +L G   ++Q++  G R
Sbjct: 239 SSSLIGDDALLQIYPDGIR 257


>gi|400597418|gb|EJP65151.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1212

 Score = 47.4 bits (111), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 143/614 (23%), Positives = 217/614 (35%), Gaps = 133/614 (21%)

Query: 64  NVIEIYVVRVQEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
           NV++   V  Q  G+KE      SG     +  D      + L+ H  + G + S+A+  
Sbjct: 19  NVVQ--AVLGQFAGTKEQLIITGSGSQLTILRPDPAQGKVIPLLSH-DIFGVLRSIAVFR 75

Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             G+     +D IILA +  +I+VLE+  S +      M  F        K G      G
Sbjct: 76  LAGSS----KDYIILATDSGRITVLEYLPSPNRFSRLHMETFG-------KTGIRRVVPG 124

Query: 181 PLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             +  DP+GR      V    L  ++ + SQ        E T  S     A      VI 
Sbjct: 125 EYLACDPKGRACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIA 175

Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMISALSISTTLK 293
           L  LD+          GY  PV   L     E +    G    +  T ++    +   L 
Sbjct: 176 LTALDV----------GYANPVFAALEIDYTEVDQDNTGEALSEVETHLVY-YELDLGLN 224

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNYAV 347
                WS    P D    L    P G     GVLV G   + Y HS   +  + +     
Sbjct: 225 HVVRKWSD---PVDPTASLLFQVPGGNDGPSGVLVCGEENVTYRHSNQDALRVPIPRRR- 280

Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYDGR----- 396
               + E P    ++     H   L+        LL T  GDL  +T+  V D       
Sbjct: 281 ---GATEDPSRKRNIVAGVMHK--LKGSAGAFFFLLQTDDGDLFKITIDMVEDEEGAPTG 335

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
            VQR+ +   +   + + +  + +   ++ S+ G+    QF              E+ GD
Sbjct: 336 EVQRMKIKYFDTVPVATSLCILKSGFLYVASQFGNYAFYQF--------------EKLGD 381

Query: 457 IEADAPSTKRLRRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL 515
                     L  SS D   D +   E +  Y   + N          A+ DS+  + PL
Sbjct: 382 ------DDDELEFSSDDFPVDPLAAYEPVYFYPRPAEN---------LALVDSIPAMNPL 426

Query: 516 KDFSYGLRINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSS 558
            D         DA    S  G   +S +  +            ELPG    +WT+   S 
Sbjct: 427 LDCKVANLTGEDAPQIYSICGNGARSTFRTIKHGLEVNEIVASELPGVPSAVWTLKLNS- 485

Query: 559 RGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLF 618
                        D++Y  Y+++S    T+VL   + + EV++S  +     TIAA  L 
Sbjct: 486 -------------DEQYDTYIVLSFTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLL 530

Query: 619 GRRRVIQVFERGAR 632
           G   +IQV  RG R
Sbjct: 531 GTDGLIQVHPRGIR 544


>gi|195996153|ref|XP_002107945.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
 gi|190588721|gb|EDV28743.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
          Length = 1134

 Score = 47.4 bits (111), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 49/200 (24%), Positives = 91/200 (45%), Gaps = 29/200 (14%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
           L+   V D  F++G+ EP + +++E   +   R    +   + +A      + + P  W+
Sbjct: 158 LEELQVLDVKFLYGFTEPTIALIYE---SGQNRYLKTYEISLQNA-----DIHRQP--WN 207

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
              +  +A+ +L VP P  G++V+GA +I Y+    S    L+    SL       R + 
Sbjct: 208 IGKVEEEAFMILPVPPPSCGMVVIGAGSISYYKGQDS----LHITPASLKD-----RITC 258

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD----GRVVQRLDLSKTNPSVLTSDIT 416
              +D+    +L  D +      G L +L +V +    G  V+ L L     + + S IT
Sbjct: 259 FGRVDSNGCRYLLGDYS------GRLFMLILVQEHSQSGIKVKDLCLEYLGETSIPSCIT 312

Query: 417 TIGNSLFFLGSRLGDSLLVQ 436
            + N+  ++GS  GDS L++
Sbjct: 313 YLDNAFAYIGSSCGDSQLIK 332


>gi|341884150|gb|EGT40085.1| CBN-DDB-1 protein [Caenorhabditis brenneri]
          Length = 1134

 Score = 47.0 bits (110), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 87/354 (24%), Positives = 143/354 (40%), Gaps = 82/354 (23%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           D+  L+ VPSPI GV+V+G +++ Y S         N+  V   SS  L  + F+    +
Sbjct: 210 DSSMLIPVPSPISGVVVLGTHSLLYKSSE-------NDGEVVPYSSPLLENTIFTSHSIV 262

Query: 365 DAAHATWLQNDV--ALLSTKTGDLVLLTVVYD--GRVVQRLDLSKTNPSVLTSDITTIGN 420
           D     ++ +D    LL      ++LL  V +  G  V+ + +     + +   I  I N
Sbjct: 263 DPTGERFIVSDTDGRLL------MLLLNAVENQSGLSVKEIRIDLLGDTSVAESINYIDN 316

Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
            + F+GSR GDS L++       S   S L   +                +   ++DM+ 
Sbjct: 317 GVVFIGSRFGDSQLIRLLSEKTNSSYISVLDTYY----------------NIGPIRDMIM 360

Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
            E         ++ +    T S A +D     G L+    G+ I   A+           
Sbjct: 361 VE---------SDGQPQLVTCSGAEKD-----GSLRVIRNGIGIEELAT----------- 395

Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
            V+LPG  GI+ +   SS    AD+            Y+I+SL   T VL+      E  
Sbjct: 396 -VDLPGVVGIFPIRLDSS----ADN------------YVIVSLVEETHVLQITGEELEDV 438

Query: 601 ESVDYFVQGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQDLSFGPSNSE 651
           + +       T+ AG LFG      V+QV ER  R++    +++   + P+N E
Sbjct: 439 QFLQIDTALPTMFAGTLFGPNDSGLVVQVTERQVRLMSNGGLSK--FWEPANGE 490


>gi|68060004|ref|XP_671977.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56488645|emb|CAI04030.1| hypothetical protein PB301494.00.0 [Plasmodium berghei]
          Length = 346

 Score = 47.0 bits (110), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 57/269 (21%), Positives = 108/269 (40%), Gaps = 41/269 (15%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L+ ++ GDL  + V ++  +V+ +     +   + + I  + +   F+ +  G+    QF
Sbjct: 90  LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 149

Query: 438 T---CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS-SDALQDMVNGEELSLYGSASNN 493
           +     S  SM +S      G     A  T++L+     D +  +    ++ +  + ++N
Sbjct: 150 SGIGNDSNESMCTSN--HPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSN 207

Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG-CKGIWT 552
                       R SL      +   +GL I   A+             ELPG  + IWT
Sbjct: 208 IPQIYALCGRGPRSSL------RILQHGLSIEELANN------------ELPGKPRYIWT 249

Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTI 612
           +   +S               EY  Y+I+S E  T++LE  + + EV +S+   +   T 
Sbjct: 250 IKKDNS--------------SEYDGYIIVSFEGNTLILEIGETVEEVYDSL--LLTNVTT 293

Query: 613 AAGNLFGRRRVIQVFERGARILDGSYMTQ 641
              NL      IQV++ G R ++G  + +
Sbjct: 294 IHINLLYDNSFIQVYDTGIRHINGKIVQE 322


>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
           nagariensis]
 gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
           nagariensis]
          Length = 1104

 Score = 46.6 bits (109), Expect = 0.062,   Method: Compositional matrix adjust.
 Identities = 60/242 (24%), Positives = 94/242 (38%), Gaps = 55/242 (22%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL  + G + LL + +DG  V  L       +   S +  + + L F+GSR GDS LV+ 
Sbjct: 281 LLGNRQGGMQLLVLAHDGSRVSGLRTEPLGYTCAPSCLAYLDSGLTFVGSRSGDSQLVRI 340

Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
           +                     + P T      S  +L  +V+   + L           
Sbjct: 341 SAQP-----------------VNQPPTYLELVDSFPSLAPIVDFVVMDL---------ER 374

Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKS 557
           Q      +   + + G L+    G+ IN  A+            VELPG KG+W++    
Sbjct: 375 QGQGQLVMCSGIDSDGSLRVVRNGIGINRQAT------------VELPGIKGVWSL---- 418

Query: 558 SRGHNADSSRMAAYDDEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAG 615
            R H         YDDEY  YL+++   E R + L T + L E  E   +    +T+  G
Sbjct: 419 -RSH---------YDDEYDKYLLLTFVGETRLLALNTEEELDE-AELPGFDSGSQTLWCG 467

Query: 616 NL 617
           N+
Sbjct: 468 NM 469


>gi|328869269|gb|EGG17647.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
          Length = 1194

 Score = 46.6 bits (109), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 114/564 (20%), Positives = 210/564 (37%), Gaps = 111/564 (19%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           L+ V +    G + S+A     G      +D +I+  +  ++ +LE++ S +        
Sbjct: 49  LDHVLYSEAFGVIRSIAPFRLTGGS----KDYLIVGSDSGRVVILEYNPSKNVFEKVHQE 104

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDE 217
            F        + G      G  +  DP+GR    G +    L  I+ + SQ    +    
Sbjct: 105 TFG-------RSGCRRIVPGQYISTDPKGRAFMIGAIEKQKLVYILNRDSQAKLSI---- 153

Query: 218 DTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVM--VILHERELTWAGRVS 275
                     A    + V ++  +D+          G+  P+   + +   E T    V 
Sbjct: 154 -----SSPLEAHKAHTIVFSMCGVDV----------GFENPIFATISVDYSEETNIEDVE 198

Query: 276 WKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGGVLVVGANTIHY 331
             H+T +++   +   L      WS   +   A  +++VP     P GGVLV     ++Y
Sbjct: 199 ETHNTKVLTFYELDLGLNNVVRKWSE-EVDRSANLVVSVPGGSDGP-GGVLVCAQGRVYY 256

Query: 332 HSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQNDVA--LLSTKTGD 385
            +   +            D S  +PR +   E    +  +HA+  Q D+   L+ ++ GD
Sbjct: 257 RNIGHA------------DISVSIPRRNGMTEEKSLMIVSHASHKQRDMFFFLVQSEYGD 304

Query: 386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSM 445
           L  +T+ Y G +V  + ++  +     + IT + N   F+ S  GD  L  F        
Sbjct: 305 LYKITLDYSGEMVSGMQIAYFDTFPTANCITMLKNGFLFVASEFGDHGLYLFK------- 357

Query: 446 LSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAV 505
            S GL         DAP+      +     + +     L L  + S    S      F V
Sbjct: 358 -SLGLD--------DAPTASSAGNTEMVFFEPVFEPRNLVLTATIS----SLSPIVDFKV 404

Query: 506 RDSLVNIGPLKDFSYGLRINADASATGISKQSNYELV------------ELPGC-KGIWT 552
            D L   G  + ++           +G+S+++N  ++            +LPG   GIWT
Sbjct: 405 AD-LAQEGTPQMYAL----------SGVSERANLRVLRHGLPITQMVDSQLPGTPAGIWT 453

Query: 553 VYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE----SVDYFVQ 608
           +    +   N     +   +     Y+++S    T+VL   + + EV +    S    + 
Sbjct: 454 IPQSLTTMRNPQYQGIGTVESPADRYIVVSFVGSTLVLGVGETVEEVQDSGILSTTTTIL 513

Query: 609 GRTIAAGNLFGRRRVIQVFERGAR 632
            R++ A NL     ++Q+F +G R
Sbjct: 514 IRSMGA-NL---DSIVQIFAQGIR 533


>gi|301110252|ref|XP_002904206.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
 gi|262096332|gb|EEY54384.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
          Length = 1197

 Score = 46.6 bits (109), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 68/306 (22%), Positives = 114/306 (37%), Gaps = 86/306 (28%)

Query: 321 VLVVGANTIHYHSQ---SASCALALNNYAVSLDSSQELPRSSFSVE--LDAAHATWLQND 375
           VLV+G NT+ Y ++     +CA+               PR        +  + AT  Q D
Sbjct: 247 VLVLGENTVQYKNEGHPELTCAI---------------PRREGEHRDIIIVSAATHKQRD 291

Query: 376 V--ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSL 433
           +   LL ++ GDL  +++ Y G VV+ + +   +   + S +      L F  S   +  
Sbjct: 292 LFFVLLQSELGDLYKISLDYSGNVVEEIKIQFFDTIPVASSMCITKTGLLFCASEFSNHY 351

Query: 434 LVQF-TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA---------------LQD 477
           L QF + G G        K     ++    ST  LR+ ++ A               + D
Sbjct: 352 LFQFLSIGEG----DDAAKCSSLAMDPTEFSTFPLRKLTNLALASSSASLSPVTQLLVDD 407

Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
           + N +   +Y    NN  S+                 L+   +GL I   A++       
Sbjct: 408 LANEQTPQMYALCGNNNRSS-----------------LRVLRHGLPITEMAASA------ 444

Query: 538 NYELVELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
                 LPG  K +W +                +Y D Y  Y+++S E  T+VLE  + +
Sbjct: 445 ------LPGVAKAVWCLKE--------------SYADPYDKYIVVSFEDATLVLEVGETV 484

Query: 597 TEVTES 602
            EV +S
Sbjct: 485 EEVAQS 490


>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
 gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
          Length = 1170

 Score = 46.6 bits (109), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 81/340 (23%), Positives = 136/340 (40%), Gaps = 48/340 (14%)

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
           +P+DA  L  VP  IGGVLV GAN I Y          L N ++ L   +   ++S  + 
Sbjct: 229 VPNDANYLAPVPGHIGGVLVCGANWIMYDK--------LGNESILLPLLRRKDQTSVIIS 280

Query: 364 LDAAHATWLQND--VALLSTKTGDLVLLTVVYDG--RVVQRLDLSKTNPSVLTSDITTIG 419
               HA   +N     LL    GDL  L + YD    +++ ++++  +   +  ++    
Sbjct: 281 -HVTHALKKKNYGFFILLQNDLGDLFRLIIDYDSNRELIKDIEITYFDTIPVCYNLNIFK 339

Query: 420 NSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMV 479
           N L F        LL QF            L EE    E D    K ++  +    ++  
Sbjct: 340 NGLCFANCINRSQLLYQF----------EKLGEEIS--EEDIRINKTVQMDNIQLTKEKY 387

Query: 480 NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNY 539
              E  L G  +       ++ S  + DS++N   L   S   ++      T  +     
Sbjct: 388 --FEFKLKGLDNLALIDVVESLS-PITDSILNDDTLVTLSTKSKLKTIVHGTPTTTLVES 444

Query: 540 ELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII--SLEARTMVLETADLLT 597
           +L   P    I+T             +   A DDE   YL+I  +L  +T+VL   +++ 
Sbjct: 445 QLPIKP--TNIFTT-----------KTSANAVDDE---YLVITSTLSFKTLVLSLGEVIE 488

Query: 598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
           EV +S   FV  +   A    G+  ++Q++  G R ++G+
Sbjct: 489 EVNDS--EFVLDQPTVAVQQVGKSSIVQIYSNGLRHINGN 526


>gi|312076588|ref|XP_003140928.1| xeroderma Pigmentosum Group E Complementing protein [Loa loa]
          Length = 516

 Score = 46.6 bits (109), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
           +W   NL  +A  ++ VP P GG L+ G + I YH +    AL    YA    S      
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
            +    +D     +L  D+A      G+L +L + + G+           V+ + +    
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
            + +   +  + N + F+GSR GDS L++ +      GT  +S  L + + ++   AP  
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362

Query: 465 KRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                     ++DM     NG++  L  S +    + +   +    + L +         
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
                                VEL G K ++T+  +               D E+  YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
           +S ++ T VL       E T+   + V G T+ AG LF    ++QV      ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483


>gi|444313909|ref|XP_004177612.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
 gi|387510651|emb|CCH58093.1| hypothetical protein TBLA_0A02930 [Tetrapisispora blattae CBS 6284]
          Length = 1459

 Score = 46.6 bits (109), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 29/116 (25%), Positives = 58/116 (50%), Gaps = 17/116 (14%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC----F 162
           ++  G +  + ++ Q G++     D ++L   +AKIS+++FD+ ++ L+  S+H     F
Sbjct: 54  FKFSGKITDIVLIPQRGSE----LDCLLLVTPNAKISIIKFDEELNTLKTISLHYYTDEF 109

Query: 163 ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
           E    L L       AR   ++V+P+ +C  VL++  + I +        +  DED
Sbjct: 110 EKLSMLQL-------ARTSQLRVEPKKKC--VLLFNTESIAILPFTQQFNIDNDED 156


>gi|448111975|ref|XP_004201977.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
 gi|359464966|emb|CCE88671.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
          Length = 1249

 Score = 46.2 bits (108), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 57/241 (23%), Positives = 99/241 (41%), Gaps = 47/241 (19%)

Query: 95  GISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGL 154
            I    L+ +C + +   ++SL  +   G+    ++D +++  +  K+++L++D   + L
Sbjct: 52  NIDTGKLDKICVHNVFSVIQSLEKVRLTGS----QKDYLVVTSDSGKLAILQYDTGRNRL 107

Query: 155 RITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ--MIILKASQGGSG 212
               +  F+ P   H K G      GP +  DPQ R   +L+  L+   +I K      G
Sbjct: 108 ----VTVFQEP---HSKTGFRRNTPGPYLLTDPQNR--AILIGALERNKLIYKVHSDDKG 158

Query: 213 LVGDEDTFGSGGGFSARIESS--HVINLR--DLDMKHVKDFIFVHGYIEPVMVILHEREL 268
                     G   S+ +ES   H I L    LD           GY  PV V +     
Sbjct: 159 ----------GMQISSPLESQIRHTITLAMCALDT----------GYENPVFVAIEAEYG 198

Query: 269 TWAGRV----SWKHHTCMISALSISTTLKQHPLIWSAMN--LPHDAYKLLAVPSPIGGVL 322
               +     S  H T + ++  +   L    ++   +N  LP  A  L+ +PSP+GGVL
Sbjct: 199 ALDSKEYSIDSQAHQTLLFTSYELDQGLNH--VVRRVVNNKLPISATHLIPLPSPVGGVL 256

Query: 323 V 323
           +
Sbjct: 257 I 257


>gi|392593521|gb|EIW82846.1| hypothetical protein CONPUDRAFT_81012 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 1213

 Score = 46.2 bits (108), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 87/384 (22%), Positives = 146/384 (38%), Gaps = 97/384 (25%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ +D   V+ L +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIDHDEDEVKSLKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
                       S TS  S G+ E F  +     +  R R   + AL D +   +  L  
Sbjct: 368 QKLGDDDDEPEFSSTSFPSFGMAESFIPLPH---AHFRPRGLDNLALADEIESLDPILDA 424

Query: 489 SASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
              N   N+++ Q  F+   R S      L+   +GL +    S+            ELP
Sbjct: 425 KVMNILPNSDTPQ-IFTACGRGSRSTFRMLR---HGLEVEESVSS------------ELP 468

Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           G    +WT                   DD Y +Y+I+S    T+VL   + + EV ++  
Sbjct: 469 GIPNAVWTTKRTE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT-G 513

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGA------------RILDGS--------------- 637
           +     T+A   + G   ++QV  +G             R+  G                
Sbjct: 514 FLSSAPTLAVQQI-GSDALLQVHPQGIRHVLSDRRVNEWRVPQGKTIVCATTNKRQVVVA 572

Query: 638 -------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
                  Y   DL  G  N      +  STVL++S+ +        PY+ +G  D ++R+
Sbjct: 573 LSSAELVYFELDLD-GQLNEYQDWKAMGSTVLALSVGEVPEGRQRTPYLAVGCEDQTVRI 631

Query: 683 LVGDPSTC--TVSVQT----PAAI 700
           +  DP +   T+S+Q     P+AI
Sbjct: 632 ISLDPESTLETISLQALTAPPSAI 655


>gi|195586770|ref|XP_002083143.1| GD13507 [Drosophila simulans]
 gi|194195152|gb|EDX08728.1| GD13507 [Drosophila simulans]
          Length = 1227

 Score = 46.2 bits (108), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G+SD ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLSDNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|195169735|ref|XP_002025674.1| GL20829 [Drosophila persimilis]
 gi|194109167|gb|EDW31210.1| GL20829 [Drosophila persimilis]
          Length = 1225

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     S +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP T          L+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG-CKGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|125977518|ref|XP_001352792.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
 gi|54641542|gb|EAL30292.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
          Length = 1228

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 80/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     S +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP T          L+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPRT----------LKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|346971485|gb|EGY14937.1| pre-mRNA-splicing factor RSE1 [Verticillium dahliae VdLs.17]
          Length = 1230

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 121/543 (22%), Positives = 201/543 (37%), Gaps = 125/543 (23%)

Query: 133 IILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCG 192
           +ILA +  +I+++E+  + +  +   +  F        K G      G  +  DP+GR  
Sbjct: 102 LILATDSGRIAIIEYLPAQNRFQRLHLETFG-------KSGIRRVVPGEFLACDPKGRA- 153

Query: 193 GVLVYGLQ-----MIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVK 247
             L+  L+      ++ + SQ        E T  S     A     HV+++  LD+    
Sbjct: 154 -CLIASLEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVHVLSMVALDV---- 199

Query: 248 DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL------IWSA 301
                 GY  PV   L E + T A +          +AL + T L  + L      +   
Sbjct: 200 ------GYANPVFAAL-ETDYTEADQDPTGQ-----AALDVETQLVYYELDLGLNHVVRK 247

Query: 302 MNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVSLDSSQ 353
            + P D    L    P G     GVLV G   I Y HS   +  + +     A    S +
Sbjct: 248 WSEPVDNTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQEAFRVPVPRRRGATEDPSRK 307

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDLSKT 406
               +    +L  +   +      LL T+ GDL  +T+      DG     V+RL +   
Sbjct: 308 RCIVAGVMHKLKGSAGAFF----FLLQTEDGDLFKITIDMIEDRDGNPTGEVKRLKIKYF 363

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
           +   + S +  + +   ++ S+ G+    QF              E+ GD        + 
Sbjct: 364 DTIPVASSLCILKSGFLYVASQFGNYQFYQF--------------EKLGD------DDEE 403

Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
           L  SS D   D     E   +          ++  + A+ +S+ ++ PL D         
Sbjct: 404 LEFSSDDFPTDPKQSYEAVFF--------HPRELENLALVESIDSMNPLIDCKVANLTGE 455

Query: 527 DA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSSRMA 569
           DA    +A G   +S + ++            ELPG    +WT+  K SRG         
Sbjct: 456 DAPQIYTACGNGARSTFRILKHGLEVNEIVASELPGIPSAVWTL--KLSRG--------- 504

Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
              D+Y AY+++S    T+VL   + + EV +S   F+      A  L G   +IQV  +
Sbjct: 505 ---DQYDAYIVLSFTNATLVLSIGETVEEVNDS--GFLTSVPTLAAQLLGGEGLIQVHPK 559

Query: 630 GAR 632
           G R
Sbjct: 560 GIR 562


>gi|426192113|gb|EKV42051.1| hypothetical protein AGABI2DRAFT_229642 [Agaricus bisporus var.
           bisporus H97]
          Length = 1213

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ ++   V+ L +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
                       S TS  SSG+ E     +A  P    + R   + AL D +   +  + 
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423

Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
               N   N+++ Q  F+   R +  +   L+   +GL +    S+            +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467

Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG    +WT                   DD Y +Y+I+S    T+VL   + + EV ++ 
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
            +     T+A   + G   ++QV   G R                  I+  +        
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKIIVAATTNKRQVVV 571

Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
                   Y   DL  G  N      +  STVL++SI D        PY+ +G  D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630

Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
           ++  DP +   T+S+Q     P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655


>gi|380490733|emb|CCF35810.1| pre-mRNA-splicing factor rse-1 [Colletotrichum higginsianum]
          Length = 1212

 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 137/603 (22%), Positives = 226/603 (37%), Gaps = 129/603 (21%)

Query: 74  QEEGSKESK--NSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRD 131
           Q  G+KE     +  ++  +L    S   +  V  + + G + S+A     G++    +D
Sbjct: 27  QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITVLSHDIFGIIRSMAAFRLAGSN----KD 82

Query: 132 SIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVDP 187
            +ILA +  +I+++E+        I + + F+    LHL    K G      G  +  DP
Sbjct: 83  YLILATDSGRITIIEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACDP 131

Query: 188 QGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +GR      V    L  ++ + SQ        E T  S     A      V+++  LD+ 
Sbjct: 132 KGRACLIASVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVLSMVALDV- 181

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLIW 299
                    GY  PV   L E E T A     G  + +  T ++    +   L      W
Sbjct: 182 ---------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRKW 230

Query: 300 SAMNLPHDAYKLLAVP---SPIGGVLVVGANTIHY-HSQSASCALALNNY--AVSLDSSQ 353
           S    P  A  L  VP       GVLV G   I Y HS   +  + +     A    S +
Sbjct: 231 SESVDP-TASMLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRK 289

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDLSKT 406
               S    +L  +   +      LL T+ GDL   T+      DG     V+RL +   
Sbjct: 290 RHAVSGVMHKLKGSAGAFF----FLLQTEDGDLFKATLDMVEDTDGNPTGEVKRLKIKYF 345

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
           +   ++S +  + +   +  S+ G+    QF              E+ GD + +      
Sbjct: 346 DTIPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDDE------ 385

Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINA 526
           L  SS D   D   G +   +          +   + A+ +S+ ++ PL D         
Sbjct: 386 LEFSSDDFPTDPKAGYDAVYF--------HPRPLENLALVESIDSMNPLLDCKVANLTGE 437

Query: 527 DA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSSRMA 569
           DA    +A G   +S + ++            ELPG    +WT+  K +RG         
Sbjct: 438 DAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLNRG--------- 486

Query: 570 AYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFER 629
              D+Y AY+++S    T+VL   + + EV++S   F+      A  L G   +IQV  +
Sbjct: 487 ---DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQVHPK 541

Query: 630 GAR 632
           G R
Sbjct: 542 GIR 544


>gi|409075182|gb|EKM75565.1| hypothetical protein AGABI1DRAFT_64324 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 1213

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 86/385 (22%), Positives = 148/385 (38%), Gaps = 99/385 (25%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ ++   V+ L +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLFVASEFGNHYLYQF 367

Query: 438 TC---------GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSDALQDMVNGEELSLY 487
                       S TS  SSG+ E     +A  P    + R   + AL D +   +  + 
Sbjct: 368 QKLGDDDEEPEFSSTSFPSSGMAEP----QAALPRVYFKPRPLDNLALADELESLDPIID 423

Query: 488 GSASN---NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
               N   N+++ Q  F+   R +  +   L+   +GL +    S+            +L
Sbjct: 424 SKVLNLLPNSDTPQ-IFAACGRGARSS---LRTLQHGLEVEESVSS------------DL 467

Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG    +WT                   DD Y +Y+I+S    T+VL   + + EV ++ 
Sbjct: 468 PGIPNAVWTTKRNE--------------DDPYDSYIILSFVNGTLVLSIGETIEEVQDT- 512

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS-------- 637
            +     T+A   + G   ++QV   G R                  I+  +        
Sbjct: 513 GFLSSAPTLAVQQI-GSDALLQVHPHGIRHVLADRRVNEWRVPSNKTIVAATTNKRQVVV 571

Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
                   Y   DL  G  N      +  STVL++SI D        PY+ +G  D ++R
Sbjct: 572 ALSSAELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGCEDQTVR 630

Query: 682 LLVGDPSTC--TVSVQT----PAAI 700
           ++  DP +   T+S+Q     P+AI
Sbjct: 631 IISLDPESTLETISLQALTAPPSAI 655


>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
          Length = 1145

 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 78/356 (21%), Positives = 136/356 (38%), Gaps = 90/356 (25%)

Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
           +W   NL  +A  ++ VP P GG L+ G + I YH +    AL    YA    S      
Sbjct: 201 LWKHDNLEGEASMVIGVPEPAGGCLIAGPDAISYH-KGGDDAL---RYAGVPGSRLHNTH 256

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR----------VVQRLDLSKTN 407
            +    +D     +L  D+A      G+L +L + + G+           V+ + +    
Sbjct: 257 PNCYAPVDRDGQRYLLADLA------GNLYMLLLEF-GKGQEQDESSTVSVKDMKVESLG 309

Query: 408 PSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPST 464
            + +   +  + N + F+GSR GDS L++ +      GT  +S  L + + ++   AP  
Sbjct: 310 NTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS--LLDSYTNL---AP-- 362

Query: 465 KRLRRSSSDALQDMV----NGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSY 520
                     ++DM     NG++  L  S +    + +   +    + L +         
Sbjct: 363 ----------IRDMTVMRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS--------- 403

Query: 521 GLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLI 580
                                VEL G K ++T+  +               D E+  YLI
Sbjct: 404 ---------------------VELKGIKNMFTLRTR---------------DHEFDDYLI 427

Query: 581 ISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
           +S ++ T VL       E T+   + V G T+ AG LF    ++QV      ++DG
Sbjct: 428 LSFDSDTHVLLINGEELEDTQITGFVVDGATLWAGCLFQSTTILQVTHGEVILIDG 483


>gi|430813298|emb|CCJ29330.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 1197

 Score = 45.8 bits (107), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 118/544 (21%), Positives = 206/544 (37%), Gaps = 92/544 (16%)

Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
           +HG + +L      G +    +D +I+  +  +I++LE+    +         +      
Sbjct: 65  VHGIIRTLVGFRLAGTN----KDHLIVGSDSGRITILEYKPDSNAFSKVHQETYG----- 115

Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
             K G      G  + VDP+GR   +       ++   ++  +                A
Sbjct: 116 --KSGVRRVVPGQYLAVDPKGRATMIASIEKNKLVYVLNRDSA------TNLTISSPLEA 167

Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHHTCMIS 284
               S V +L  +D+          GY  PV   L     E E   +G+ +++    +++
Sbjct: 168 HKSCSLVFHLIGMDV----------GYENPVFAALEVDYTEAESDPSGK-AYREIQKVLT 216

Query: 285 ALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASC 338
              +   L      WS    P D    L V  P G     G LV    +I Y H    + 
Sbjct: 217 YYELDLGLNHVVRKWSD---PVDRKANLLVTVPGGSDGPSGALVCTEGSIFYKHKGKKTH 273

Query: 339 ALALNNYAVSLDSSQ--ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGR 396
            + +     SL++SQ  ++  SS   ++  A    LQN+        GDL  +T+  +  
Sbjct: 274 RIPIPTRIGSLENSQKKQIIVSSVVHKMRGAFFFLLQNE-------DGDLFKVTIDSNDG 326

Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF-TCGSGTSMLS-SGLKEEF 454
            V+ L +   +   +++ ++ + +   F+ S  G+  L QF   G   + +  S +    
Sbjct: 327 EVESLKIKYFDTVPVSTGLSILKSGFLFVASEYGNHHLYQFEKLGDDNNEIEFSSVDFPV 386

Query: 455 GDI-EADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLVNI 512
            D+ E   PS  R R   +  L D +N     +     N T E A + ++   R      
Sbjct: 387 LDLNEGYEPSYFRPRSLENLLLVDDLNSMNPLMDSKILNLTDEDAPQIYALCGR------ 440

Query: 513 GPLKDFS---YGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRM 568
           GP   F    YGL +N +  A+G           LPG    +WT    SS          
Sbjct: 441 GPRSTFRTLRYGLEVN-EIVASG-----------LPGSPTAVWTTKLTSS---------- 478

Query: 569 AAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
               D+Y AY+++S    T+VL   + + EV+++  +     T+A   L G   +IQV  
Sbjct: 479 ----DQYDAYIVLSFVNGTLVLSIGETVEEVSDT-GFLSSSPTLAVQQL-GDDALIQVHP 532

Query: 629 RGAR 632
           +G R
Sbjct: 533 KGIR 536


>gi|308477185|ref|XP_003100807.1| CRE-DDB-1 protein [Caenorhabditis remanei]
 gi|308264619|gb|EFP08572.1| CRE-DDB-1 protein [Caenorhabditis remanei]
          Length = 1154

 Score = 45.8 bits (107), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 87/354 (24%), Positives = 145/354 (40%), Gaps = 82/354 (23%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
           DA  L+ VP+PI GVLV+ AN+I Y S          N  V   +S  L  + F+    +
Sbjct: 210 DASVLIPVPAPISGVLVLAANSILYKSSDV-------NGDVVPYASPLLDNTVFTCHGLV 262

Query: 365 DAAHATWLQNDVALLSTKTGDLVLLTVVYDGR---VVQRLDLSKTNPSVLTSDITTIGNS 421
           D +   ++ +D     T+   L+L+  + +GR    V+ + +     + +   I  I   
Sbjct: 263 DPSGERFILSD-----TEGRLLMLILNIGEGRSGITVKDMRIEYLGETSIADSINYIDAG 317

Query: 422 LFFLGSRLGDSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN 480
           + F+GSRLGDS L++     SG S   S + E + +I                 ++DM+ 
Sbjct: 318 VVFVGSRLGDSQLIRLMPTPSGGSY--SVVLETYSNI---------------GPIRDMIM 360

Query: 481 GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE 540
            E         ++ ++   T S A +D     G L+    G+ I   AS           
Sbjct: 361 VE---------SDGQAQLVTCSGAEKD-----GSLRVIRNGIGIEELAS----------- 395

Query: 541 LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT 600
            VEL G  GI+ +   S+  +                Y+I+SL   T VL+      E  
Sbjct: 396 -VELAGVIGIFPIRLNSTTDN----------------YVIVSLAEETHVLQINGEELEDV 438

Query: 601 ESVDYFVQGRTIAAGNLFG---RRRVIQVFERGARILDGSYMTQDLSFGPSNSE 651
           + +    +  TI A  +FG      ++QV E+  R +  S +++   + P N E
Sbjct: 439 QLLQICTEMPTIFASTIFGPDNSEVLLQVTEKHVRFMAFSGLSK--IWEPPNGE 490


>gi|367001853|ref|XP_003685661.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
 gi|357523960|emb|CCE63227.1| hypothetical protein TPHA_0E01320 [Tetrapisispora phaffii CBS 4417]
          Length = 1357

 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 70/365 (19%), Positives = 153/365 (41%), Gaps = 71/365 (19%)

Query: 97  SAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRI 156
           S   L L   ++L+G V  +A++ Q  +    + D +I+    AK+S++ F+   + L  
Sbjct: 45  STNKLHLNYEFKLNGRVSDIALIKQVDS----KLDYLIILTATAKLSLVNFNVFTNSLET 100

Query: 157 TSMHCFESPEWLH--LKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLV 214
            S+H +E     +  LK  +ES  R     +D    C  VL++    I +      +   
Sbjct: 101 ISLHYYEDKFRQNSILKLAKESKLR-----IDQAKNC--VLLFNNDNIAILPISSTTDEF 153

Query: 215 GDED-----------------TFGSGGGFSARIESSHVINLRDLDM----KHVKDFIFVH 253
            DED                  F S      +I +S +I L+  ++    +++ D  F+ 
Sbjct: 154 EDEDLGQESSAKTVKRGNMSIKFPSQSQKKNKITNSSII-LKSTELNSKIQNIIDIQFLS 212

Query: 254 GYIEPVMVILHERELTWAGR-----VSWKHHTCMISAL-------------SISTTLKQH 295
            + +P + +L++ +L W G      +  ++    ++ L             S++  L + 
Sbjct: 213 NFSKPTLSVLYQPKLAWIGNSNLVTLPTQYMILTLNILERENIKSQENGENSLNQDLIET 272

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY--HSQSASCALALNNY--AVSLDS 351
            +I     LP++ + ++ + +   G  +VG+N I Y  H+      + +N +    +L  
Sbjct: 273 TIIGQVSELPYELHTIIPLNN---GSTLVGSNEIIYIDHTGVLQSLIIINQFQDKETLKK 329

Query: 352 SQELPRSSFSVELDA------AHATWLQNDV-----ALLSTKTGDLVLLTVVYDGRVVQR 400
            + + +S  ++ L+       A +    N+V      L+  +  ++ L+ +  +GR++  
Sbjct: 330 GRVIDKSKQNIILNKPIKFINAGSRVESNNVDDKNNVLIFDENNNIYLVNITLEGRLLIN 389

Query: 401 LDLSK 405
            D++K
Sbjct: 390 FDINK 394


>gi|301124447|ref|XP_002909707.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262106897|gb|EEY64949.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 328

 Score = 45.4 bits (106), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 24/83 (28%), Positives = 36/83 (43%), Gaps = 19/83 (22%)

Query: 925 GFFLSGSRPCWCMVFRERLRVHPQLCDGS-------------------IVAFTVLHNVNC 965
           G F  G+ P W +  R      P     S                   +++FT  H+ +C
Sbjct: 3   GAFFRGAHPMWILGDRGHASFVPMCVPSSAPPKANGTSKNAAPRVSVPVLSFTPFHHWSC 62

Query: 966 NHGFIYVTSQGILKICQLPSGST 988
            +GFIY  S+G L++C+LPS  T
Sbjct: 63  PNGFIYFHSRGALRVCELPSSKT 85


>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
           putative [Candida dubliniensis CD36]
 gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
          Length = 1187

 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 93/395 (23%), Positives = 162/395 (41%), Gaps = 62/395 (15%)

Query: 292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDS 351
           +K+ P   ++  LP D   ++ +P  IGG+LV G+N   Y          L+   + L  
Sbjct: 218 VKKKPASLNSDPLPDDVNYMIPLPGHIGGMLVCGSNWCFYD--------KLDGPRIYLPL 269

Query: 352 SQELPRSSFSVELD-AAHATWLQNDVALLSTKTGDLVLLTVVY--DGRVVQRLDLS--KT 406
            +   ++  S+ ++   H    +N   LL    GDL  LTV Y  D   ++ + ++   T
Sbjct: 270 PRRDGQTQESIIVNHVTHVLKKKNFFILLQNTLGDLFKLTVDYDFDKETIKNISITYFDT 329

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKR 466
            P  L+ +I    N   F+     D LL QF              E+ GD  A+      
Sbjct: 330 IPPALSLNI--FKNGFLFVNVLNNDKLLYQF--------------EKLGDDLAE----NE 369

Query: 467 LRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL--RI 524
           L  +SSD             Y S  N   +   TF     D+L  I  L+  S  +  RI
Sbjct: 370 LVINSSD-------------YDSLDNVRGTDTTTFKLKGLDNLALIDVLETLSPIIDSRI 416

Query: 525 NADASATGISKQSNYELVE--LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLII- 581
           N D+    +S  S  + +   +P    + +    +          + + +DE   YL+I 
Sbjct: 417 N-DSKLVTLSSHSYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDE---YLVIS 472

Query: 582 -SLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG---ARILDGS 637
            SL ++T+VL   +++ +V +S   FV  ++  +    G   V+QV+  G    R ++G 
Sbjct: 473 SSLSSKTLVLSIGEVVEDVEDS--EFVLDQSTISVQQVGIASVVQVYSNGIKHIRTVNGK 530

Query: 638 YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVL 672
             T D  F P+       S N+  + +++++  V+
Sbjct: 531 KKTTDW-FPPAGITITHASTNNQQVLIALSNLNVV 564


>gi|449459948|ref|XP_004147708.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
 gi|449513493|ref|XP_004164340.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
          Length = 1214

 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 73/324 (22%), Positives = 126/324 (38%), Gaps = 57/324 (17%)

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVLV   N + Y +Q      A+      +    +LP     + + AA          LL
Sbjct: 246 GVLVCAENFVIYKNQGHPDVRAV------IPRRADLPAERGVLIVSAAMHKQKTMFFFLL 299

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
            T+ GD+  +T+ ++   V+ L +   +   +T+ +  + +   F  S  G+  L QF  
Sbjct: 300 QTEYGDIFKVTLEHNNDSVKELKIKYFDTIPVTASMCVLKSGFLFAASEFGNHSLYQFQA 359

Query: 440 -GSGTSMLSSG-----LKEEFGDIEADAPSTKRLRR-SSSDALQDMVNGEELSLYGSASN 492
            G    + SS       +E F  +       K L R    ++L  +++ + ++L+     
Sbjct: 360 IGEDADVESSSATLMETEEGFQPVFFQPRRLKNLMRIDQVESLMPIMDMKIINLF----- 414

Query: 493 NTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-K 548
             E   + F+   R      GP   L+    GL I            S   + ELPG   
Sbjct: 415 -EEETPQIFTLCGR------GPRSSLRILRPGLAI------------SEMAVSELPGVPS 455

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            +WTV                  +DE+ AY+++S    T+VL   + + EV++S   F+ 
Sbjct: 456 AVWTVKKN--------------INDEFDAYIVVSFANATLVLSIGETVEEVSDS--GFLD 499

Query: 609 GRTIAAGNLFGRRRVIQVFERGAR 632
                A +L G   ++QV   G R
Sbjct: 500 TTPSLAVSLIGDDSLMQVHPNGIR 523


>gi|353232348|emb|CCD79703.1| putative dna repair protein xp-E [Schistosoma mansoni]
          Length = 1329

 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)

Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
           R  DS+ L    A ++++E    +DS+  + + S    +        R      +G  V 
Sbjct: 72  RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           +DP      V +Y GL  IIL    G        +  G+    + +IE  +++       
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
               D  F++GY  P   +++E EL          H            L+   L   ++ 
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
              D+  L+ VP P GGV++VG N I YH++       ++ Y     +SQ L  ++   +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271

Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
                D A   +    L  D+ A  +  T +   L+ V  G +   L      P      
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327

Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
           I  + N + F+GS LGDS L++ 
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350


>gi|256088964|ref|XP_002580590.1| DNA repair protein xp-E [Schistosoma mansoni]
          Length = 1329

 Score = 45.1 bits (105), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 75/323 (23%), Positives = 126/323 (39%), Gaps = 57/323 (17%)

Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
           R  DS+ L    A ++++E    +DS+  + + S    +        R      +G  V 
Sbjct: 72  RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVED--------RSARIIDQGFDVL 123

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           +DP      V +Y GL  IIL    G        +  G+    + +IE  +++       
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIG--------EKIGTDFLDTNQIEEGNIV------- 168

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
               D  F++GY  P   +++E EL          H            L+   L   ++ 
Sbjct: 169 ----DMAFIYGYSLPTFAMIYEDELVL--------HMKTYEIYGREPVLRNVQLTLDSIE 216

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
              D+  L+ VP P GGV++VG N I YH++       ++ Y     +SQ L  ++   +
Sbjct: 217 --PDSKLLIPVPKPYGGVILVGDNIICYHTKDGP---HISQYIPQAKASQVLCYAAVDAQ 271

Query: 364 L----DAAHATW----LQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
                D A   +    L  D+ A  +  T +   L+ V  G +   L      P      
Sbjct: 272 RYLLGDMAGRLYMVHLLSEDISAAANNGTSNSDSLSAVRIGSIRIELLGETATP----ES 327

Query: 415 ITTIGNSLFFLGSRLGDSLLVQF 437
           I  + N + F+GS LGDS L++ 
Sbjct: 328 IAYLDNGVVFIGSTLGDSQLIRL 350


>gi|194864680|ref|XP_001971056.1| GG14635 [Drosophila erecta]
 gi|190652839|gb|EDV50082.1| GG14635 [Drosophila erecta]
          Length = 1227

 Score = 45.1 bits (105), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPSGELNEYTERSEMPAEIMCMALGTVPDGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|239613967|gb|EEQ90954.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ER-3]
 gi|327353314|gb|EGE82171.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 1199

 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
           W Q D    LL+   G L  L ++ D    VQ   L +       S +  +G  + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400

Query: 428 RLGDSLLVQFTCGS 441
             GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414


>gi|24654874|ref|NP_728546.1| CG13900, isoform A [Drosophila melanogaster]
 gi|23092721|gb|AAF47416.2| CG13900, isoform A [Drosophila melanogaster]
 gi|60678131|gb|AAX33572.1| LD01809p [Drosophila melanogaster]
 gi|220950356|gb|ACL87721.1| CG13900-PA [synthetic construct]
 gi|289803030|gb|ADD20765.1| FI04459p [Drosophila melanogaster]
          Length = 1227

 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|195336406|ref|XP_002034829.1| GM14250 [Drosophila sechellia]
 gi|194127922|gb|EDW49965.1| GM14250 [Drosophila sechellia]
          Length = 1227

 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|60677959|gb|AAX33486.1| RE01065p [Drosophila melanogaster]
          Length = 1227

 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|346327528|gb|EGX97124.1| pre-mRNA splicing factor RSE1 [Cordyceps militaris CM01]
          Length = 1206

 Score = 45.1 bits (105), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 146/656 (22%), Positives = 239/656 (36%), Gaps = 132/656 (20%)

Query: 64  NVIEIYVVRVQEEGSKES---KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS 120
           NV++   V  Q  G+KE      SG     +  D      + L+ H  + G + S+A+  
Sbjct: 13  NVVQ--AVLGQFAGTKEQLIITGSGSQLTLLRPDPAQGKVIALLSH-DIFGILRSIAVFR 69

Query: 121 QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG 180
             G++    +D IILA +  +I++LE+    +      M  F        K G      G
Sbjct: 70  LAGSN----KDYIILATDSGRITILEYLPGPNRFNRLHMETFG-------KSGIRRVVPG 118

Query: 181 PLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
             +  DP+GR      V    L  ++ + SQ        E T  S     A      VI 
Sbjct: 119 EYLACDPKGRACLISAVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVIA 169

Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
           +  LD+          GY  PV   L          + +      I+  ++S    Q  L
Sbjct: 170 MVALDV----------GYANPVFAALE---------IEYTEVDQDITGEALSEVETQ--L 208

Query: 298 IWSAMNL-----------PHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCAL 340
           ++  ++L           P D    L    P G     GVLV G   I Y HS   +  +
Sbjct: 209 VYYELDLGLNHVVRKWSDPVDPTASLLFQVPGGNDGPSGVLVCGEENITYRHSNQDALRV 268

Query: 341 ALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTV--VYD 394
            +         + E P    ++     H   L+        LL +  GDL  +T+  V D
Sbjct: 269 PIPRRR----GATEDPSRKRNIVAGVMHK--LKGSAGAFFFLLQSDDGDLFKITIDMVED 322

Query: 395 GR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSG 449
                   VQR+ +   +   + + +  + +   ++ S+ G+    QF            
Sbjct: 323 EEGAPTGEVQRMKIKYFDTVPVATSLCILKSGFLYVASQFGNYAFYQFEKLGDDDDEVEF 382

Query: 450 LKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVR 506
             E+F    + A  P     R + + AL D +      L    +N T E A + F+    
Sbjct: 383 SSEDFPVDPLAAYEPVYFYPRLAENLALVDSIPAMNPLLDCKVANLTGEDAPQIFTICGN 442

Query: 507 DSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADS 565
            +      LK   +GL +N   ++            ELPG    +WT+   S        
Sbjct: 443 GARSTFRTLK---HGLEVNEIVAS------------ELPGVPSAVWTLKLNS-------- 479

Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
                 D++Y AY+++S    T+VL   + + EV++S  +     TIAA  L G   +IQ
Sbjct: 480 ------DEQYDAYIVLSFTNGTLVLSIGETVEEVSDS-GFLTSVPTIAA-QLLGTDGLIQ 531

Query: 626 VFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI 680
           V  RG R I +G            N    S  ++ ++++ S     V + +S G I
Sbjct: 532 VHPRGIRHIRNG------------NVNEWSAPQHRSIVAASTNSHQVAIALSSGEI 575


>gi|261193401|ref|XP_002623106.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
 gi|239588711|gb|EEQ71354.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
          Length = 1168

 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 61/134 (45%), Gaps = 21/134 (15%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 299 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLEEATIFV-------A 340

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGS 427
           W Q D    LL+   G L  L ++ D    VQ   L +       S +  +G  + F+GS
Sbjct: 341 WEQVDGQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIPRASVLVYMGGGVTFIGS 400

Query: 428 RLGDSLLVQFTCGS 441
             GDS L++ T GS
Sbjct: 401 HQGDSQLIRITEGS 414


>gi|194749950|ref|XP_001957397.1| GF24063 [Drosophila ananassae]
 gi|190624679|gb|EDV40203.1| GF24063 [Drosophila ananassae]
          Length = 1228

 Score = 45.1 bits (105), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|195490209|ref|XP_002093045.1| GE20993 [Drosophila yakuba]
 gi|194179146|gb|EDW92757.1| GE20993 [Drosophila yakuba]
          Length = 1227

 Score = 45.1 bits (105), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 79/383 (20%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  ++              DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR-ILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS---- 665
           T+    L G   ++QV+  G R I     + +  + G  +    + ++   V+++S    
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIRHIRSDKRVNEWKAPGKKSITKCAVNQRQVVITLSGREL 563

Query: 666 ---IADP---------------------------------YVLLGMSDGSIRLLVGDPST 689
                DP                                 ++ +G++D ++R+L  DP+ 
Sbjct: 564 VYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGLADNTVRILSLDPNN 623

Query: 690 CTVSVQTPAAIESSKKPVSSCTL 712
           C     TP ++++   P  S  L
Sbjct: 624 CL----TPCSMQALPSPAESLCL 642


>gi|240275059|gb|EER38574.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H143]
          Length = 1134

 Score = 45.1 bits (105), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 64/136 (47%), Gaps = 25/136 (18%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 301 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 342

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
           W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  +G  + F+
Sbjct: 343 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 400

Query: 426 GSRLGDSLLVQFTCGS 441
           GS  GDS L++ T GS
Sbjct: 401 GSHQGDSELIRITEGS 416


>gi|325094412|gb|EGC47722.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H88]
          Length = 1201

 Score = 45.1 bits (105), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 64/136 (47%), Gaps = 25/136 (18%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 301 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 342

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
           W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  +G  + F+
Sbjct: 343 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 400

Query: 426 GSRLGDSLLVQFTCGS 441
           GS  GDS L++ T GS
Sbjct: 401 GSHQGDSELIRITEGS 416


>gi|226291941|gb|EEH47369.1| DNA damage-binding protein 1a [Paracoccidioides brasiliensis Pb18]
          Length = 1209

 Score = 44.7 bits (104), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 51/174 (29%), Positives = 74/174 (42%), Gaps = 38/174 (21%)

Query: 275 SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
           +W+   C I+       LK+         L   A  L+ VP+P+GG+LV+G  +I Y   
Sbjct: 279 AWQDTGC-IAVFKALDLLKEE--------LEMGASFLIPVPAPLGGLLVLGETSIRYLD- 328

Query: 335 SASCALALNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLT 390
                          D++ E      S+ LD A     W Q D    LL+   G L  L 
Sbjct: 329 ---------------DATNE----CISLPLDEATIFVAWEQVDGQRWLLADDYGRLFFLM 369

Query: 391 VVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGS 441
           ++ D    VQ  +LDL    P    S +  +G  + F+GS  GDS L++ T GS
Sbjct: 370 LILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVTFIGSHQGDSQLIRITEGS 421


>gi|225558618|gb|EEH06902.1| DNA damage-binding protein 1a [Ajellomyces capsulatus G186AR]
          Length = 1201

 Score = 44.7 bits (104), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 43/136 (31%), Positives = 64/136 (47%), Gaps = 25/136 (18%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 301 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 342

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
           W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  +G  + F+
Sbjct: 343 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 400

Query: 426 GSRLGDSLLVQFTCGS 441
           GS  GDS L++ T GS
Sbjct: 401 GSHQGDSELIRITEGS 416


>gi|308808936|ref|XP_003081778.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
 gi|116060244|emb|CAL56303.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
          Length = 1282

 Score = 44.7 bits (104), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 102/208 (49%), Gaps = 28/208 (13%)

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
              N+R L+   V+D  F+HG  +P + +L+ R++  A  V  K +   +      ++  
Sbjct: 376 EAFNIR-LEELRVEDIQFLHGTAKPTIAVLY-RDMKEA--VHIKTYEIGVREKEFVSS-- 429

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
                W+  +L   + K++ VP+P+GGV+V+G  TI Y ++++      ++  V L +  
Sbjct: 430 ----PWAQNDLEGGSSKIIPVPAPVGGVVVLGEETIVYLNKTS------DDTDVFLKAIN 479

Query: 354 ELPRSSF----SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
              RSS     +++ D +          LL    G L LL +V+DG+ V  L + +   +
Sbjct: 480 IPERSSIVCYGAIDPDGSRY--------LLGDHDGTLYLLVLVHDGKRVNELKIERLGET 531

Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQF 437
            + S ++ + N + F+GS  GDS L++ 
Sbjct: 532 SIPSTVSYLDNGVVFVGSAYGDSQLIKL 559


>gi|225680146|gb|EEH18430.1| DNA damage-binding protein [Paracoccidioides brasiliensis Pb03]
          Length = 1138

 Score = 44.7 bits (104), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+LV+G  +I Y                  D++ E      S+ LD A   
Sbjct: 318 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 357

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
             W Q D    LL+   G L  L ++ D    VQ  +LDL    P    S +  +G  + 
Sbjct: 358 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 415

Query: 424 FLGSRLGDSLLVQFTCGS 441
           F+GS  GDS L++ T GS
Sbjct: 416 FIGSHQGDSQLIRITEGS 433


>gi|310793065|gb|EFQ28526.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1212

 Score = 44.7 bits (104), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 137/606 (22%), Positives = 225/606 (37%), Gaps = 135/606 (22%)

Query: 74  QEEGSKESKNSGETKRRVLM---DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
           Q  G+KE      +  R+ +   D      + L+ H  + G + S+A     G++    +
Sbjct: 27  QFSGTKEQNIITASGSRLTLLRPDPSQGKVITLLSH-DIFGIIRSMAAFRLAGSN----K 81

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVD 186
           D +ILA +  +I+++E+        I + + F+    LHL    K G      G  +  D
Sbjct: 82  DYLILATDSGRITIIEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACD 130

Query: 187 PQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           P+GR      V    L  ++ + SQ        E T  S     A      V+++  LD+
Sbjct: 131 PKGRACLIASVEKNKLVYVLNRNSQA-------ELTISSP--LEAHKPGVLVLSMVALDV 181

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLI 298
                     GY  PV   L E E T A     G  + +  T ++    +   L      
Sbjct: 182 ----------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRK 229

Query: 299 WSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVSLD 350
           WS    P D    L    P G     GVLV G   I Y HS   +  + +     A    
Sbjct: 230 WSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDP 286

Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY----DGRV---VQRLDL 403
           S +    S    +L  +   +      L+ T+ GDL   T+      DG     V+RL +
Sbjct: 287 SRKRHVVSGVMHKLKGSAGAFF----FLIQTEDGDLFKATIDMVEDADGNPTGEVKRLKI 342

Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
              +   ++S +  + +   +  S+ G+    QF              E+ GD       
Sbjct: 343 KYFDTIPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGD------D 382

Query: 464 TKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLR 523
            + L  SS D   D   G +   +          +   + A+ +S+ ++ PL D      
Sbjct: 383 DEELEFSSDDFPTDPKAGYDAVYF--------HPRPLENLALVESIDSMNPLLDCKVANL 434

Query: 524 INADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADSS 566
              DA    +A G   +S + ++            ELPG    +WT+  K +RG      
Sbjct: 435 TGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLNRG------ 486

Query: 567 RMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQV 626
                 D+Y AY+++S    T+VL   + + EV++S   F+      A  L G   +IQV
Sbjct: 487 ------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQV 538

Query: 627 FERGAR 632
             +G R
Sbjct: 539 HPKGIR 544


>gi|295667673|ref|XP_002794386.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226286492|gb|EEH42058.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1195

 Score = 44.7 bits (104), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 44/138 (31%), Positives = 63/138 (45%), Gaps = 29/138 (21%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+LV+G  +I Y                  D++ E      S+ LD A   
Sbjct: 292 LIPVPAPLGGLLVLGETSIRYLD----------------DATNE----CISLPLDEATIF 331

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLF 423
             W Q D    LL+   G L  L ++ D    VQ  +LDL    P    S +  +G  + 
Sbjct: 332 VAWEQVDGQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPR--ASVLVYLGGGVT 389

Query: 424 FLGSRLGDSLLVQFTCGS 441
           F+GS  GDS L++ T GS
Sbjct: 390 FIGSHQGDSQLIRITEGS 407


>gi|358366432|dbj|GAA83053.1| UV-damaged DNA binding protein [Aspergillus kawachii IFO 4308]
          Length = 1643

 Score = 44.7 bits (104), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 67/264 (25%), Positives = 106/264 (40%), Gaps = 29/264 (10%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP GR   + VY   + ++   Q  S   G +    SG       E       R +D  
Sbjct: 62  IDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTTAR-IDEL 120

Query: 245 HVKDFIFVHGYI-EPVMVILHE-RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
            V+   F+H     P + +L+E  +     +V   H++   S+       ++  L   + 
Sbjct: 121 FVRSSAFLHVQSGPPRLALLYEDNQKKVRLKVRALHYSAATSSTGADAAFEES-LDGFSQ 179

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
            L   A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R     
Sbjct: 180 ELDLGASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP---- 220

Query: 363 ELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITT 417
            LD A     W Q D    LL+   G L  L +V D    VQ   L     +   S +  
Sbjct: 221 -LDEATIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIY 279

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +G  + F+GS  GDS +++   GS
Sbjct: 280 LGGGVIFVGSHQGDSQVLRIGNGS 303


>gi|170589359|ref|XP_001899441.1| Xeroderma Pigmentosum Group E Complementing protein [Brugia malayi]
 gi|158593654|gb|EDP32249.1| Xeroderma Pigmentosum Group E Complementing protein, putative
           [Brugia malayi]
          Length = 521

 Score = 44.7 bits (104), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 67/155 (43%), Gaps = 31/155 (20%)

Query: 497 AQKTFSFAVRDSLVNIGPLKDFSYGLRINADA---SATGISKQSNY----------EL-- 541
           A  T   ++ DS  N+ P++D +  +R N      + +G  K              EL  
Sbjct: 345 ADGTGYISLLDSYTNLAPIRDMTV-MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELAS 403

Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
           VEL G K ++T+  +               DDE+  YLI+S ++ T VL       E TE
Sbjct: 404 VELKGIKNMFTLRTR---------------DDEFDDYLILSFDSETHVLLINGEELEDTE 448

Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG 636
              + V G T+ AG LF  + ++QV      ++DG
Sbjct: 449 ITGFTVDGATLWAGCLFHSKTILQVTHGEVILIDG 483


>gi|91092128|ref|XP_972649.1| PREDICTED: similar to AGAP005549-PA [Tribolium castaneum]
 gi|270004662|gb|EFA01110.1| hypothetical protein TcasGA2_TC010322 [Tribolium castaneum]
          Length = 1219

 Score = 44.3 bits (103), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 73/320 (22%), Positives = 124/320 (38%), Gaps = 59/320 (18%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + S +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVASAMCVLKTGFLFVTSEFGNHYLYQI 361

Query: 438 T-CGSGTSML--SSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
              G     L  SS +  E GD    AP + R  +     ++L  +++     L G    
Sbjct: 362 AHLGDDDDELEFSSAMPLEEGDTFFFAPRSLRNLVLVDEMESLSPILSCRVADLAG---- 417

Query: 493 NTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-K 548
             E   + +    R      GP   L+   +GL +            S   + ELPG   
Sbjct: 418 --EDTPQLYMLCGR------GPRSSLRVLRHGLEV------------SEMAVSELPGNPN 457

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            +WTV  +S              DDEY AY+I+S    T+VL   + + EVT+S   F+ 
Sbjct: 458 AVWTVKRRS--------------DDEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLG 501

Query: 609 GRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD 668
                + +      ++QV+  G R     ++  D      N     G +  T++  +I  
Sbjct: 502 TTPTLSCSALSDDALVQVYPGGIR-----HICSDKRV---NEWKAPGKK--TIVKCAINQ 551

Query: 669 PYVLLGMSDGSIRLLVGDPS 688
             V++ +S G +     DP+
Sbjct: 552 RQVVIALSGGELAYFEMDPT 571


>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1144

 Score = 44.3 bits (103), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 39/142 (27%), Positives = 61/142 (42%), Gaps = 21/142 (14%)

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
           NL   A  L+ VP P+GG+L++G   I Y          ++N  ++L            +
Sbjct: 239 NLELGAEILVPVPLPLGGILILGEKCIKYVD-------TISNETITL-----------PL 280

Query: 363 ELDAAHATW--LQNDVALLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIG 419
           E +     W  L N   LL+   G L  L +V D    V+   +     +   S +  +G
Sbjct: 281 EYNTVFVAWEQLDNQRWLLADDYGRLFFLMLVLDSANAVRTWKVDLLGETSRASVLVHLG 340

Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
             + FLGS  GDS +++ T GS
Sbjct: 341 GGVVFLGSHQGDSHVIRITEGS 362


>gi|429859776|gb|ELA34542.1| pre-mRNA-splicing factor rse1 [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 1212

 Score = 44.3 bits (103), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 138/607 (22%), Positives = 226/607 (37%), Gaps = 137/607 (22%)

Query: 74  QEEGSKESKNSGETKRRVLM---DGISAASLELVCHYRLHGNVESLAILSQGGADNSRRR 130
           Q  G+KE      +  R+ +   D      + L+ H  + G + S+A     G++    +
Sbjct: 27  QFSGTKEQNIVTASGSRLTLLRPDPSQGKVITLLSH-DIFGIIRSMAAFRLAGSN----K 81

Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVD 186
           D +ILA +  +I+++E+        I + + F+    LHL    K G      G  +  D
Sbjct: 82  DYLILATDSGRITIVEY--------IPAQNRFQR---LHLETFGKSGVRRVIPGEYLACD 130

Query: 187 PQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           P+GR      V    L  ++ + +Q        E T  S     A      V+++  LD+
Sbjct: 131 PKGRACLIASVEKNKLVYVLNRNAQA-------ELTISSP--LEAHKPGVLVLSMVALDV 181

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWA-----GRVSWKHHTCMISALSISTTLKQHPLI 298
                     GY  PV   L E E T A     G  + +  T ++    +   L      
Sbjct: 182 ----------GYANPVFAAL-EIEYTEADQDPTGEAAREAETQLV-YYELDLGLNHVVRK 229

Query: 299 WSAMNLPHDAYKLLAVPSPIG-----GVLVVGANTIHY-HSQSASCALALNNY--AVSLD 350
           WS    P D    L    P G     GVLV G   I Y HS   +  + +     A    
Sbjct: 230 WSE---PVDPTASLLFQVPGGQDGPSGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDP 286

Query: 351 SSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDL--VLLTVVYDGR-----VVQRLDL 403
           S +    S    +L  +   +      LL T+ GDL   ++ +V D        V+RL +
Sbjct: 287 SRKRHIVSGVMHKLKGSAGAFF----FLLQTEDGDLFKAVIDMVEDADGNPTGEVKRLKI 342

Query: 404 SKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS 463
              +   ++S +  + +   +  S+ G+    QF              E+ GD + +   
Sbjct: 343 KYFDTVPVSSSLCILKSGFLYAASQFGNHQFYQF--------------EKLGDDDEEK-- 386

Query: 464 TKRLRRSSSDALQDMVNG-EELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGL 522
                 SS D   D   G + +  Y     N          A+ +S+ ++ PL D     
Sbjct: 387 ----EFSSDDFPADPKAGYDAVYFYPRPLEN---------LALVESIDSMNPLLDCKVAN 433

Query: 523 RINADA----SATGISKQSNYELV------------ELPGC-KGIWTVYHKSSRGHNADS 565
               DA    +A G   +S + ++            ELPG    +WT+  K SRG     
Sbjct: 434 LTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWTL--KLSRG----- 486

Query: 566 SRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQ 625
                  D+Y AY+++S    T+VL   + + EV++S   F+      A  L G   +IQ
Sbjct: 487 -------DQYDAYIVLSFTNGTLVLSIGETVEEVSDS--GFLTSVPTLAAQLLGEDGLIQ 537

Query: 626 VFERGAR 632
           V  +G R
Sbjct: 538 VHPKGIR 544


>gi|384490729|gb|EIE81951.1| hypothetical protein RO3G_06656 [Rhizopus delemar RA 99-880]
          Length = 967

 Score = 43.9 bits (102), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 72/154 (46%), Gaps = 17/154 (11%)

Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
           S + +    + L+ VP P+GG+LV+G   I Y        L   N  +S+D ++    ++
Sbjct: 198 STIKVEASTHALVPVPEPLGGLLVIGEYIITYFD-----PLTNTNRELSIDPAR---VTA 249

Query: 360 FSVELDAAHATWLQND-----VALLSTKTGDLVLLTVVYDGRVV---QRLDLSKTNPSV- 410
           +    D ++   L ++     V  + T    +V L+  + G+V    Q ++    +P V 
Sbjct: 250 WEFMKDESNRYLLGDEEGYLYVFSIETSHNKVVNLSSTFIGQVPSFNQNIESKANHPQVS 309

Query: 411 LTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTS 444
             S I  +GN +F++GS  GDS L+Q   G   S
Sbjct: 310 RPSCIVDLGNLMFYIGSTHGDSCLIQLIKGQEKS 343


>gi|195428692|ref|XP_002062402.1| GK16677 [Drosophila willistoni]
 gi|194158487|gb|EDW73388.1| GK16677 [Drosophila willistoni]
          Length = 1273

 Score = 43.9 bits (102), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 347 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 406

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 407 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIVTSQ 456

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG-CKGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 457 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 504

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 505 WTVKKR--------------VDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 549

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 550 TLCCAAL-GDDALVQVYPDGIR 570


>gi|157873900|ref|XP_001685450.1| cleavage and polyadenylation specificity factor-like protein
           [Leishmania major strain Friedlin]
 gi|68128522|emb|CAJ08654.1| cleavage and polyadenylation specificity factor-like protein
           [Leishmania major strain Friedlin]
          Length = 1541

 Score = 43.9 bits (102), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 81/185 (43%), Gaps = 37/185 (20%)

Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
           GGG S  +    V + R  D+K    +++D  FV    EP++  L E++ TWAGRV    
Sbjct: 282 GGGTSLLLRVGTVTHWRLQDVKSALRNIRDVQFVQSAGEPLLAFLFEKQPTWAGRVKLLE 341

Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGV 321
           W+         TC I    ++++ +   H L  S ++ LP+D   +  +P+    P    
Sbjct: 342 WRSKTVESHMLTCSIEWMKVTLANSATPHMLSLSEVDGLPYDVTSMTPLPAFQDLPSAVF 401

Query: 322 LVVGANTIHYHSQSA----------SCALALNNYAVSLD------SSQELPRSSFSVELD 365
            V     +H  ++S             A +L + AVSL+      +SQ L      V L+
Sbjct: 402 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLEAVQWRSASQALSTDLVKVNLN 461

Query: 366 AAHAT 370
            A+AT
Sbjct: 462 FANAT 466


>gi|226480826|emb|CAX73510.1| glyceraldehyde 3-phosphate dehydrogenase [Schistosoma japonicum]
          Length = 332

 Score = 43.9 bits (102), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 92/212 (43%), Gaps = 34/212 (16%)

Query: 128 RRRDSIILAFEDAKISVLEF---DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVK 184
           R  DS+ L    A ++++E    +DS+  + + S     S E     R      +G  V 
Sbjct: 72  RETDSLFLLTHKAGVAIIECVRNNDSVEFVTVAS----GSVE----DRSARIIDQGFDVL 123

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFG-SGGGFSARIESSHVINLRDLD 242
           +DP      V +Y GL  IIL    G        DT   +   +S RIE  +++      
Sbjct: 124 IDPGANYIVVRLYHGLLKIILLQCIGDKIGTDFLDTNQWTVNTYSVRIEEGNIV------ 177

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
                D  F++GY  P   +++E EL    + +++ +    +  ++  TL          
Sbjct: 178 -----DMAFIYGYSLPTFAMIYEDELVLHMK-TYEIYGREPALRNVQLTLD--------- 222

Query: 303 NLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQ 334
           ++  D+  L+ VP P GGV++VG N I YH++
Sbjct: 223 SIEPDSKLLIPVPKPYGGVILVGDNIICYHTK 254


>gi|66811906|ref|XP_640132.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|74854972|sp|Q54SA7.1|SF3B3_DICDI RecName: Full=Probable splicing factor 3B subunit 3
 gi|60468134|gb|EAL66144.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1256

 Score = 43.9 bits (102), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 83/338 (24%), Positives = 124/338 (36%), Gaps = 75/338 (22%)

Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE----LDAAHATWLQN 374
           GGVLV   + I Y +Q  +            +    +PR   S      L  +H++  Q 
Sbjct: 256 GGVLVASEDYIVYRNQDHA------------EVRSRIPRRYGSDPNKGVLIISHSSHKQK 303

Query: 375 DVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDS 432
            +   L+ ++ GDL  +T+ Y G  V  ++++  +  VL + +T + N   F  S  GD 
Sbjct: 304 GMFFFLVQSEHGDLYKITLDYQGDQVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDH 363

Query: 433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRL----RRSSSDALQDMVNGEELSLYG 488
            L  F         S G +EE G  +        L    R S    ++++ N E  S   
Sbjct: 364 TLYFFK--------SIGDEEEEGQAKRLEDKDGHLWFTPRNSCGTKMEELKNLEPTSHLS 415

Query: 489 SASNNTESAQKTFSFAVRDSLVNIGP-------------LKDFSYGLRINADASATGISK 535
           S S           F V D +    P             LK   +GL +    +A     
Sbjct: 416 SLS-------PIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTAN---- 464

Query: 536 QSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                   LPG   GIWTV   +S   NA         D+   Y+++S    T VL   D
Sbjct: 465 --------LPGVPSGIWTVPKSTS--PNA--------IDQTDKYIVVSFVGTTSVLSVGD 506

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
            + E  ES    ++  T       G   +IQVF  G R
Sbjct: 507 TIQENHES--GILETTTTLLVKSMGDDAIIQVFPTGFR 542


>gi|402077250|gb|EJT72599.1| pre-mRNA-splicing factor RSE1 [Gaeumannomyces graminis var. tritici
           R3-111a-1]
          Length = 1216

 Score = 43.5 bits (101), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 60/279 (21%), Positives = 110/279 (39%), Gaps = 68/279 (24%)

Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
           LL T+ GDL  +T+  + D        VQRL +   +   ++S++  + +   F+ S  G
Sbjct: 310 LLQTEDGDLFKVTIDMLEDAEGNTTGEVQRLKIKYFDTIPVSSNLCILKSGFLFVASEFG 369

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSA 490
           +    QF              E+ GD        + L  SS +   D     E + +   
Sbjct: 370 NHHFYQF--------------EKLGD------DDEELEFSSENFPSDPAEPYEPAYF--- 406

Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELV---- 542
                  + T + A+ +S+ ++ PL D       + DA    + +G   +S + ++    
Sbjct: 407 -----YPRPTENLALVESVESMNPLMDLKVANLTDEDAPQIYTVSGNGARSTFRMLKHGL 461

Query: 543 --------ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETA 593
                   +LPG    +WT                 A DD+Y +Y+++S    T+VL   
Sbjct: 462 EVNEIVASQLPGTPSAVWTT--------------KIARDDQYDSYIVLSFTNGTLVLSIG 507

Query: 594 DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           + + EV+++   F+   +  A    G   ++QV  RG R
Sbjct: 508 ETVEEVSDT--GFLSSVSTLAVQQLGEDGLVQVHPRGIR 544


>gi|213405251|ref|XP_002173397.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
           yFS275]
 gi|212001444|gb|EEB07104.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
           yFS275]
          Length = 1166

 Score = 43.5 bits (101), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 121/561 (21%), Positives = 202/561 (36%), Gaps = 99/561 (17%)

Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
           + L+     +G V ++A L   G     ++D ++L  +  + ++LE+D   + L      
Sbjct: 56  MNLMISQNCYGIVRNIAPLRLTGF----KKDYLVLTSDSGRFTILEYDIGKNKLVSVYQE 111

Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGV-------LVYGLQ------MII---L 204
            F        K G      G  + +D +GR   V       LVY L       + I   L
Sbjct: 112 AFG-------KSGIRRIVPGEYLALDAKGRAAMVASTEKNKLVYVLNRDSEANLTISSPL 164

Query: 205 KASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH 264
           +A + G+      D  G   G+   I ++  +   DLD   + +                
Sbjct: 165 EAHKAGTICF---DLVGLDTGYENPIFAALEVEYSDLDHDPLGEL--------------- 206

Query: 265 ERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS----PIGG 320
                      +KH   +++   +   L      WS + +   AYKL+ VP     P  G
Sbjct: 207 -----------YKHSEKVLTYYELDLGLNHVVKRWSKV-VDRSAYKLIRVPGGNDGP-SG 253

Query: 321 VLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           V+V+    I Y H Q  S  + +        ++  LP     + + A       +   LL
Sbjct: 254 VIVISTGWISYRHLQRQSHFVPIPTRETKATTNTALP-----IIVSAVMHKMRDSFFYLL 308

Query: 380 STKTGDLVLLTVVYDGRV-VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
               GDL+ LT+  D    V+ L +   +     + +  + + L F G   G+  L QF 
Sbjct: 309 QNSDGDLLKLTMELDDHSQVKELRIKYFDTIPFAAILNILKSGLLFAGCEGGNHHLYQFE 368

Query: 439 CGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ 498
                S+     + EF         +K   +  +  L  + N   L    S    T++  
Sbjct: 369 -----SLAIDDDEPEFSSANFSEEQSKHSPKKLTYKLHPLQNISLLDEIPSLFPLTDAIV 423

Query: 499 KTFSFAVRDSLVNI-GPLKDFSYGLRINADASATGISKQSNYELVELPGCK-GIWTVYHK 556
              S      L  + G  K+ S  L +    SAT +       L ELPG    IWTV  K
Sbjct: 424 TRTSTDANSQLYTLCGRHKEASLRL-LKRGVSATEVV------LSELPGAPIAIWTVKQK 476

Query: 557 SSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGN 616
                          +D Y  Y+++S    T+VL   + + EV +S        T+    
Sbjct: 477 --------------LNDPYDKYMVLSFTNGTLVLSIGETVEEVLDS-GLLSSVSTLNVRQ 521

Query: 617 LFGRRRVIQVFERGARILDGS 637
           L GR  V+Q+  +G R +  +
Sbjct: 522 L-GRSSVVQIHSKGIRCISAN 541


>gi|195126264|ref|XP_002007593.1| GI12293 [Drosophila mojavensis]
 gi|193919202|gb|EDW18069.1| GI12293 [Drosophila mojavensis]
          Length = 1227

 Score = 43.5 bits (101), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525


>gi|195012560|ref|XP_001983703.1| GH16029 [Drosophila grimshawi]
 gi|193897185|gb|EDV96051.1| GH16029 [Drosophila grimshawi]
          Length = 1228

 Score = 43.5 bits (101), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525


>gi|195376606|ref|XP_002047087.1| GJ13230 [Drosophila virilis]
 gi|194154245|gb|EDW69429.1| GJ13230 [Drosophila virilis]
          Length = 1229

 Score = 43.5 bits (101), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 61/262 (23%), Positives = 99/262 (37%), Gaps = 45/262 (17%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL T+ GD+  +T+  D  VV  + L   +     + +  +     F+ S  G+  L Q 
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E G+    AP           AL+++V  +EL  +     + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGETFFFAPR----------ALKNLVLVDELPSFAPIITSQ 411

Query: 495 ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-KGI 550
            +            L   GP   L+   +GL +            S   + ELPG    +
Sbjct: 412 VADLANEDTPQLYVLCGRGPRSTLRVLRHGLEV------------SEMAVSELPGNPNAV 459

Query: 551 WTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR 610
           WTV  +               DDE+ AY+I+S    T+VL   + + EVT+S  +     
Sbjct: 460 WTVKKR--------------IDDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTP 504

Query: 611 TIAAGNLFGRRRVIQVFERGAR 632
           T+    L G   ++QV+  G R
Sbjct: 505 TLCCAAL-GDDALVQVYPDGIR 525


>gi|302423344|ref|XP_003009502.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
 gi|261352648|gb|EEY15076.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
          Length = 1119

 Score = 43.5 bits (101), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 60/136 (44%), Gaps = 13/136 (9%)

Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVS----LDSSQELPRSSFSV 362
           L +P P    L+    V ++   YH +  + A A L    V+    L     L +   S 
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280

Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            LD      +W + DV   LL+   G+L LLT+  DG +V  L L     +   S +  +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340

Query: 419 GNSLFFLGSRLGDSLL 434
           GN + FLGS  GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356


>gi|350629921|gb|EHA18294.1| damage-specific DNA binding protein [Aspergillus niger ATCC 1015]
          Length = 1140

 Score = 43.1 bits (100), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R      LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 284

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
                W Q D    LL+   G L  L +V D    VQ   L     +   S +  +G  +
Sbjct: 285 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 344

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++   GS
Sbjct: 345 IFVGSHQGDSQVLRIGNGS 363


>gi|317031116|ref|XP_001392900.2| UV-damaged DNA binding protein [Aspergillus niger CBS 513.88]
          Length = 1124

 Score = 43.1 bits (100), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R      LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
                W Q D    LL+   G L  L +V D    VQ   L     +   S +  +G  +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++   GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347


>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1158

 Score = 43.1 bits (100), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 63/136 (46%), Gaps = 25/136 (18%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT 370
           L+ VP+P+GG+LV+G  +I Y   +++  +           SQ L  ++  V        
Sbjct: 258 LVPVPAPLGGLLVLGETSIRYLDDASNECI-----------SQPLKEATIFV-------A 299

Query: 371 WLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQ--RLDLSKTNPSVLTSDITTIGNSLFFL 425
           W Q D    LL+   G L  L +V D    VQ  +LDL    P    S +  +G  + F+
Sbjct: 300 WEQVDGQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLLGDIPR--ASVLVYMGGGITFI 357

Query: 426 GSRLGDSLLVQFTCGS 441
           GS  GD  L++ T GS
Sbjct: 358 GSHQGDPELIRITEGS 373


>gi|198420618|ref|XP_002125906.1| PREDICTED: similar to Splicing factor 3B subunit 3
           (Spliceosome-associated protein 130) (SAP 130)
           (Pre-mRNA-splicing factor SF3b 130 kDa subunit)
           (SF3b130) (STAF130) [Ciona intestinalis]
          Length = 1216

 Score = 43.1 bits (100), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 100/463 (21%), Positives = 168/463 (36%), Gaps = 115/463 (24%)

Query: 304 LPHDAYKLLAVPS----PIGGVLVVGANTIHYHS--QSASCALALNNYAVSLDSSQELPR 357
           L   A  L++VP     P GGVLV   N I Y +          +      LD     P 
Sbjct: 228 LEERANHLISVPGGNDGP-GGVLVCAENYITYKNFGDQPDIRTPIPRRRNDLDD----PE 282

Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
               V   A H T       L+ T+ GD+  +T+  D  +V  + L   +   ++  +  
Sbjct: 283 RGMIVVCSATHKTK-SMFFFLIQTEQGDIFKVTLETDEDMVTEIRLKYFDTVPVSMAMCV 341

Query: 418 IGNSLFFLGSRLGDSLLVQFTC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           +     F+ + +G+  L Q          +  SS +  E GD    AP           A
Sbjct: 342 LRTGFLFVAAEMGNHCLYQIAHLGDDDDETEFSSAMPLEEGDTFFYAPR----------A 391

Query: 475 LQDMVNGEELS---------LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRIN 525
           L+++V  +EL          +   A+ +T     T     R SL      +   +GL + 
Sbjct: 392 LRNLVLVDELDSLSPIMTCLISDLANEDTPQLYVTCGRGPRSSL------RVLRHGLEV- 444

Query: 526 ADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLE 584
                      S   + ELPG    +WTV  K               ++E+ +Y+I+S  
Sbjct: 445 -----------SEMAVSELPGNPNAVWTVKIKE--------------EEEFDSYIIVSFV 479

Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS------- 637
             T+VL   + + EVT+S   F+      + +L G   ++QV+  G R +          
Sbjct: 480 NATLVLSIGETVEEVTDS--GFLGTTPTLSCSLLGENALVQVYPDGIRHIRADKRVNEWK 537

Query: 638 ---------------------------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD-- 668
                                      Y   D S G  N  +     NS V+ + ++   
Sbjct: 538 TPGKKTILRCAVNQRQVVIALTGGELVYFEMDQS-GQLNEYTERKEMNSEVVCMDLSKVP 596

Query: 669 ------PYVLLGMSDGSIRLLVGDPSTC--TVSVQT-PAAIES 702
                  ++ +G++D ++R++  DP+ C   +S+Q  PA  ES
Sbjct: 597 PTEQRTRFLAVGLADNTVRIISLDPTDCLQPLSMQALPATPES 639


>gi|134077422|emb|CAK45676.1| unnamed protein product [Aspergillus niger]
          Length = 1133

 Score = 43.1 bits (100), Expect = 0.86,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 61/139 (43%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G  +I Y               V  DS++ + R      LD A
Sbjct: 229 ASHLIPVPAPLGGLLVLGETSIKY---------------VDTDSNEIVSRP-----LDEA 268

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYD-GRVVQRLDLSKTNPSVLTSDITTIGNSL 422
                W Q D    LL+   G L  L +V D    VQ   L     +   S +  +G  +
Sbjct: 269 TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGV 328

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++   GS
Sbjct: 329 IFVGSHQGDSQVLRIGNGS 347


>gi|154320780|ref|XP_001559706.1| hypothetical protein BC1G_01862 [Botryotinia fuckeliana B05.10]
          Length = 238

 Score = 43.1 bits (100), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 45/182 (24%), Positives = 73/182 (40%), Gaps = 36/182 (19%)

Query: 57  NLVVTAANVIEIYVVR--------VQEEGSKESKNSGETKRRVLMD-GIS---------- 97
           NLVV  +++++I+  +        + E+ S  +K+      RV  D G+           
Sbjct: 28  NLVVAKSSLLQIFTTKTVSVDLDELSEKDSSTAKDDTNIDPRVNNDDGVEDSFLGTDSIM 87

Query: 98  -------AASLELVCHYRLHGNVESL----AILSQGGADNSRRRDSIILAFEDAKISVLE 146
                     L LV  Y L G V SL     I S+ G +      +I++ F+DAK+S++E
Sbjct: 88  QRPELARTTKLVLVAEYNLSGTVTSLVRVKTISSKTGGE------AILVGFKDAKLSLVE 141

Query: 147 FDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKA 206
           +D    G+   S+H +E  E                + VDP  RC  +      + IL  
Sbjct: 142 WDPERPGISTISVHFYEQDELQGSPWAPSLSDCVNYLTVDPGSRCAALKFGARNLAILPF 201

Query: 207 SQ 208
            Q
Sbjct: 202 KQ 203


>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
          Length = 1116

 Score = 42.7 bits (99), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 44/196 (22%), Positives = 77/196 (39%), Gaps = 41/196 (20%)

Query: 504 AVRDSLVNIGPLKDF----------------SYGLR---INADASATGISKQSNYELVEL 544
            V D+  N+GP+ DF                S G +   +    +  GI++Q++   ++L
Sbjct: 335 TVLDTFANLGPIPDFCLVDIEKQGQNQIVACSGGFKEGSLRVIRNGIGITEQAS---IDL 391

Query: 545 PGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           PG K IW++   S R                  YLI+S  + T VLE      E TE   
Sbjct: 392 PGIKAIWSLARGSDR------------------YLILSFISSTKVLEFQGEDIEETEIAG 433

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSV 664
           + +Q  T+  GN+   ++++Q+   G  ++D         + PS+      S     + +
Sbjct: 434 FDLQSPTLYCGNV-ADKQILQISTSGIYLVDHETNLNYDVWKPSSGSINLASHQGNQILI 492

Query: 665 SIADPYVLLGMSDGSI 680
           S     +   + D  I
Sbjct: 493 SFGKTLIYFEIKDQKI 508


>gi|328700785|ref|XP_001945395.2| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
          Length = 1072

 Score = 42.7 bits (99), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 41/202 (20%), Positives = 88/202 (43%), Gaps = 38/202 (18%)

Query: 241 LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS 300
           ++  +++D  F++G+  P ++I++E                  +A+  +  +K+      
Sbjct: 188 MEETNIQDIGFLYGFTNPTIIIIYE------------------NAMGRTIKIKKIIDSKK 229

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
             ++  +A  ++ VPSP+ G +++G N+I YH  + SC +              LP    
Sbjct: 230 YKSIEKEASMVIPVPSPLCGAIIIGENSIFYH--NGSCNII------------RLPIRQ- 274

Query: 361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRV-----VQRLDLSKTNPSVLTSDI 415
            +E+       L+    LL   +G L++L + Y+  +     V  L L       +   +
Sbjct: 275 KIEIVCYTRVDLEGTRYLLGDHSGCLLMLFLKYEKTLNGKFKVTDLYLRYFGEISIPISL 334

Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
           T + N + ++ S+ GDS L++ 
Sbjct: 335 TYLDNKVIYVASKFGDSQLIKL 356


>gi|346970653|gb|EGY14105.1| hypothetical protein VDAG_00787 [Verticillium dahliae VdLs.17]
          Length = 1160

 Score = 42.7 bits (99), Expect = 0.99,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 60/136 (44%), Gaps = 13/136 (9%)

Query: 312 LAVPSPIGGVLV----VGANTIHYHSQSASCALA-LNNYAVS----LDSSQELPRSSFSV 362
           L +P P    L+    V ++   YH +  + A A L    V+    L     L +   S 
Sbjct: 221 LEIPDPFARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISK 280

Query: 363 ELDAAH--ATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTI 418
            LD      +W + DV   LL+   G+L LLT+  DG +V  L L     +   S +  +
Sbjct: 281 ALDEPRIFVSWAKYDVTRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYM 340

Query: 419 GNSLFFLGSRLGDSLL 434
           GN + FLGS  GDS L
Sbjct: 341 GNEILFLGSHHGDSQL 356


>gi|389602597|ref|XP_001567507.2| cleavage and polyadenylation specificity factor-like protein
           [Leishmania braziliensis MHOM/BR/75/M2904]
 gi|322505515|emb|CAM42945.2| cleavage and polyadenylation specificity factor-like protein
           [Leishmania braziliensis MHOM/BR/75/M2904]
          Length = 1536

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 68/138 (49%), Gaps = 20/138 (14%)

Query: 243 MKHVKDFIFVHGYIEPVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSIST 290
           +++++D  FV    EP++  L E++ TWAGRV    W+         TC I    ++++ 
Sbjct: 298 LRNIRDVQFVASAGEPLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLAN 357

Query: 291 TLKQHPLIWSAMN-LPHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSASCALALNNY 345
           T   H L  S ++ LP+DA  +  +P+    P   VL V  N + + S  +   + +N  
Sbjct: 358 TAAPHMLSLSEVDGLPYDATSMTPLPAFQDVP-SAVLCVSRNMMVHVSTKSGYGVYVN-- 414

Query: 346 AVSLDSSQELPRSSFSVE 363
           A+  + ++ L  S+ S E
Sbjct: 415 AMGEEQARSLKSSAVSCE 432


>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
           FGSC A4]
          Length = 1132

 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 64/260 (24%), Positives = 107/260 (41%), Gaps = 31/260 (11%)

Query: 185 VDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           +DP GR   + +Y   ++++   Q  S   G +    +G       E    I  R +D  
Sbjct: 122 IDPSGRFMTLEIYDGMIVVIPIIQLPSKRRGRQVALPTGPDAPRIGELGEPIITR-IDEL 180

Query: 245 HVKDFIFVHGYI-EPVMVILHE-RELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAM 302
            V+   F+H     P + +L+E  +     +V    ++    A S  T++  +     A 
Sbjct: 181 FVRSSAFLHVQAGSPRLALLYEDNQKKVKLKVRELKYSTAAGAESEFTSIADY-----AQ 235

Query: 303 NLPHDAYKLLAVPSPI---GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSS 359
            L   A  L+ VP+P+   GG+L++G  +I Y         A NN  VS    Q L  ++
Sbjct: 236 ELDLGASHLIPVPAPLAAAGGLLILGETSIKYVD-------ADNNEIVS----QPLEEAT 284

Query: 360 FSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
             V        W Q D    LL+   G L  L +V     V+R +L     +   S +  
Sbjct: 285 IFV-------AWEQVDSQRWLLADDYGRLFFLMLVLRNSEVERWELHSLGNTSRASVLVY 337

Query: 418 IGNSLFFLGSRLGDSLLVQF 437
           +G  + F+GS  GDS +++ 
Sbjct: 338 LGGGVVFVGSHQGDSQVIRI 357


>gi|391341059|ref|XP_003744849.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Metaseiulus
           occidentalis]
          Length = 1223

 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 77/391 (19%), Positives = 133/391 (34%), Gaps = 102/391 (26%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+ +D   V  + L   +   +   +  + +   F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLEFDDDAVTEIKLKYFDSLPVAQTMHVLKSGFLFVASEFGNHSLYQI 361

Query: 438 T-CGSGTSM--LSSGLKEEFGDIEADAPSTKR----------LRRSSSDALQDMVNGEEL 484
              G  T     SS    E GD     P   +          L    +  + D+ N +  
Sbjct: 362 AHLGDNTDEPEFSSIFPLEEGDTFFFLPRELKNLVLVDEMDSLSPIMTARVADLTNEDTP 421

Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
            LY +      S  +                      LR   + S   +S        EL
Sbjct: 422 QLYAACGRGPRSTMRV---------------------LRHGLEVSEMAVS--------EL 452

Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG    +WTV  ++              DDEY AY+++S    T+VL   + + EVT+S 
Sbjct: 453 PGNPSAVWTVKKRA--------------DDEYDAYIVVSFINATLVLSIGETVEEVTDS- 497

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-------------------------- 637
             F+      A +  G   ++Q++  G R +                             
Sbjct: 498 -GFLGTTPTLACHQIGHDALVQIYPEGIRHIRADRRVNEWRTSGKKLIVKCAVNQRQVVI 556

Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
                   Y   D S G  N  +     NS VL +++           ++ +G SDG++ 
Sbjct: 557 ALTGGELIYFEMD-SSGQLNEYAERKEMNSDVLCMALGSVPAGEQRTKFLAVGSSDGTVH 615

Query: 682 LLVGDPSTCTVSVQTPAAIESSKKPVSSCTL 712
           ++  DP +C   +      ES+ + ++   L
Sbjct: 616 VISLDPKSCLSILSVQGMTESNPESLAIVEL 646


>gi|383847297|ref|XP_003699291.1| PREDICTED: splicing factor 3B subunit 3-like [Megachile rotundata]
          Length = 1217

 Score = 42.4 bits (98), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
                       SS +  E GD    AP   R  +     D+L  ++  +   L   A+ 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG-CKGIW 551
           +T     T     R +L      +   +GL +            S   + ELPG    +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  +               D+EY AY+I+S    T+VL   + + EVT+S   F+    
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504

Query: 612 IAAGNLFGRRRVIQVFERGAR 632
             + +  G   ++QV+  G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525


>gi|391341057|ref|XP_003744848.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Metaseiulus
           occidentalis]
          Length = 1211

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 78/397 (19%), Positives = 137/397 (34%), Gaps = 103/397 (25%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+ +D   V  + L   +   +   +  + +   F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLEFDDDAVTEIKLKYFDSLPVAQTMHVLKSGFLFVASEFGNHSLYQI 361

Query: 438 T-CGSGTSM--LSSGLKEEFGDIEADAPSTKR----------LRRSSSDALQDMVNGEEL 484
              G  T     SS    E GD     P   +          L    +  + D+ N +  
Sbjct: 362 AHLGDNTDEPEFSSIFPLEEGDTFFFLPRELKNLVLVDEMDSLSPIMTARVADLTNEDTP 421

Query: 485 SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVEL 544
            LY +      S  +                      LR   + S   +S        EL
Sbjct: 422 QLYAACGRGPRSTMRV---------------------LRHGLEVSEMAVS--------EL 452

Query: 545 PGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESV 603
           PG    +WTV  ++              DDEY AY+++S    T+VL   + + EVT+S 
Sbjct: 453 PGNPSAVWTVKKRA--------------DDEYDAYIVVSFINATLVLSIGETVEEVTDS- 497

Query: 604 DYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-------------------------- 637
             F+      A +  G   ++Q++  G R +                             
Sbjct: 498 -GFLGTTPTLACHQIGHDALVQIYPEGIRHIRADRRVNEWRTSGKKLIVKCAVNQRQVVI 556

Query: 638 --------YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIR 681
                   Y   D S G  N  +     NS VL +++           ++ +G SDG++ 
Sbjct: 557 ALTGGELIYFEMD-SSGQLNEYAERKEMNSDVLCMALGSVPAGEQRTKFLAVGSSDGTVH 615

Query: 682 LLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY-HDKG 717
           ++  DP +C   +      ES+ + ++   +  H++G
Sbjct: 616 VISLDPKSCLSILSVQGMTESNPESLAIVDMSGHEEG 652


>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
 gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
          Length = 1064

 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 128/565 (22%), Positives = 214/565 (37%), Gaps = 117/565 (20%)

Query: 109 LHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWL 168
           ++G +  L + S GG     ++D + ++ E  K  +L +D     +   +    E     
Sbjct: 14  IYGRISVLKLFSAGG-----KQDYLFISTESFKFCILAYDSEKKEIVTKASGNAED---- 64

Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
               GR + A G L  +DP GR          +I L   +G   L+  E       G + 
Sbjct: 65  --TIGRPTEA-GQLGIIDPDGR----------LIALHLYEGLLKLINIEK------GLNN 105

Query: 229 RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
            I+ +   N R L+   V D  F++G   P + +L      +      KH    I    +
Sbjct: 106 PIQKTAA-NTR-LEELQVMDMTFLYGCKIPTIAVL------FKDTKDEKH----IVTYEV 153

Query: 289 STTLKQH-PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
           S   ++  P  WS  N+    Y  + V  P+GGVLVV  N I Y +   + ++A      
Sbjct: 154 SQKDQELCPGPWSQSNV--GVYSSMLVAVPLGGVLVVADNGITYMNGRTTRSIA------ 205

Query: 348 SLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK 405
                  +P + F     +D   + +L  D        G L +L ++   + V  L    
Sbjct: 206 -------IPYTKFLAYDRVDKDGSRYLFGD------HFGRLSVLVLLNHQQRVTELKFET 252

Query: 406 TNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTK 465
              + + S I+ + + + F+GS  GDS L++                   + E D P+T 
Sbjct: 253 LGRTSIPSSISYLDSGVVFIGSSSGDSQLIRL------------------NTEKD-PATD 293

Query: 466 RLRRSSSDALQDMVN-GEELSLYGSASNNTESAQ-KTFSFAVRDSLVNIGPLKDFSYGLR 523
               S    L++  N G  +      +     AQ  T S   RD     G L+    G+ 
Sbjct: 294 ----SYISHLENFTNIGPIVDFCLVDTEKQGQAQIVTCSGTYRD-----GTLRVIRNGI- 343

Query: 524 INADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISL 583
                   GI++++   L+EL G KG+W +        N  S  +   D     YLI+S 
Sbjct: 344 --------GIAEKA---LIELEGVKGLWPI------KENDPSDPLNPKD----QYLIVSF 382

Query: 584 EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDG-SYMTQD 642
              T VL+      E TE         TI   N+     ++QV  +   +++  ++   D
Sbjct: 383 IGYTKVLQFQGEEIEETEFEGLDSNSSTILCSNIDKENVIVQVTNQAINLINPITFKRVD 442

Query: 643 LSFGPSNSESGSGSENSTVLSVSIA 667
               PS S     S N + +++SI 
Sbjct: 443 QWKSPSGSPINLVSSNQSQIALSIG 467


>gi|393212467|gb|EJC97967.1| hypothetical protein FOMMEDRAFT_162310 [Fomitiporia mediterranea
           MF3/22]
          Length = 1161

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 72/331 (21%), Positives = 128/331 (38%), Gaps = 73/331 (22%)

Query: 306 HDAYKLLAVPSPI-------GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
            D+  L+ VP  I       GGVLV+G +TI ++S         ++      S+ ++P++
Sbjct: 224 EDSNLLIPVPPQIKSSWNVNGGVLVLGGSTIAFYSIDRKQKKKNSSSQSKS-STSKIPQA 282

Query: 359 SFSVELDAAHATWLQNDVA----LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSD 414
             +       A W Q D      LL    G L LL +      +  + L + +P    + 
Sbjct: 283 EVNWPYFDITA-WAQIDEDGLRYLLGDSFGRLALLAINPQYAYLDIVLLGEVSPP---TS 338

Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDA 474
           +T + +   ++GS  GDS L++ T    ++     + + F +I   AP    +   + D+
Sbjct: 339 LTPLASQYIYVGSHFGDSQLIRVTSERSSNGSYLEISDTFKNI---APIMDAVFEDTDDS 395

Query: 475 LQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGIS 534
            Q  +    ++  G  S                     G L+    G   N DA   GI+
Sbjct: 396 GQPTI----ITCSGGEST--------------------GSLRVIRNGANFNEDARIEGIA 431

Query: 535 KQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE--T 592
                         G+W +  +              YDD +H Y++++ +  T +LE   
Sbjct: 432 N-----------ITGMWPIRRQ--------------YDDTFHHYMLVTTDTNTHLLELPN 466

Query: 593 ADLLTEVTESVDY---FVQGRTIAAGNLFGR 620
           +   T V+ S D+    +  RT+ AGN+  R
Sbjct: 467 SQQETAVSRSNDFSDLTIDSRTLVAGNMLTR 497


>gi|340721347|ref|XP_003399083.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus terrestris]
 gi|350406701|ref|XP_003487854.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus impatiens]
          Length = 1217

 Score = 42.0 bits (97), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
                       SS +  E GD    AP   R  +     D+L  ++  +   L   A+ 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           +T     T     R +L      +   +GL +            S   + ELPG    +W
Sbjct: 419 DTPELYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  +               D+EY AY+I+S    T+VL   + + EVT+S   F+    
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504

Query: 612 IAAGNLFGRRRVIQVFERGAR 632
             + +  G   ++QV+  G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525


>gi|66553024|ref|XP_623333.1| PREDICTED: splicing factor 3B subunit 3 isoform 1 [Apis mellifera]
 gi|380015815|ref|XP_003691890.1| PREDICTED: splicing factor 3B subunit 3-like [Apis florea]
          Length = 1217

 Score = 42.0 bits (97), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 59/261 (22%), Positives = 100/261 (38%), Gaps = 43/261 (16%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKR--LRRSSSDALQDMVNGEELSLYGSASN 492
                       SS +  E GD    AP   R  +     D+L  ++  +   L   A+ 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMACQVADL---ANE 418

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIW 551
           +T     T     R +L      +   +GL +            S   + ELPG    +W
Sbjct: 419 DTPQLYITCGRGPRSTL------RVLRHGLEV------------SEMAVSELPGNPNAVW 460

Query: 552 TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
           TV  +               D+EY AY+I+S    T+VL   + + EVT+S   F+    
Sbjct: 461 TVKRR--------------VDEEYDAYIIVSFVNATLVLSIGETVEEVTDS--GFLGTTP 504

Query: 612 IAAGNLFGRRRVIQVFERGAR 632
             + +  G   ++QV+  G R
Sbjct: 505 TLSCSALGEDALVQVYPDGIR 525


>gi|302831461|ref|XP_002947296.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f.
           nagariensis]
 gi|300267703|gb|EFJ51886.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f.
           nagariensis]
          Length = 1221

 Score = 42.0 bits (97), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 71/353 (20%), Positives = 130/353 (36%), Gaps = 76/353 (21%)

Query: 271 AGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGAN 327
           A  ++ KH T     L ++  L++    W+   + + A  L+AVP      GGVLV   N
Sbjct: 199 AASMAQKHLTFYEMDLGLNNVLRK----WTE-PIDNGANLLVAVPGGADGPGGVLVCAEN 253

Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
            I Y +Q      A+      L   + +   S++     A++ +L      + ++ GD+ 
Sbjct: 254 FIIYKNQDHEEVRAVIPRRSDLPGDRGVLIVSYATHKKKAYSFFL------VQSEYGDIY 307

Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
            +T+ Y+G  V  L +   +     + I  +     F  S  G+  L QF          
Sbjct: 308 KVTLAYEGEAVTELKIKYFDTIPPCTSIAVLKTGFLFAASEYGNHALYQFV--------- 358

Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRD 507
            G  E+  D+E           SSS AL     G +   +          +   +  + D
Sbjct: 359 -GTGEDDEDVE-----------SSSAALVQTEEGFQPVFF--------EPRPLKNLLLID 398

Query: 508 SLVNIGPLKDFS-----------------YGLRINADASATGISKQSNYELVELPGCK-G 549
            + ++ P+ D                   +G R +      G++  +   +  LPG    
Sbjct: 399 EMASLMPITDMKVANLLNEEIPQIYALCGHGPRASLSVLRPGLAV-TELAVSPLPGAPTA 457

Query: 550 IWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
           +WTV   ++              DE+ A++++S    T+V    + + E  ES
Sbjct: 458 VWTVRRNAT--------------DEFDAFIVVSFANATLVFSIGEEVKETNES 496


>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
           ATCC 10500]
 gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus
           ATCC 10500]
          Length = 1140

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 63/137 (45%), Gaps = 21/137 (15%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G   I Y   +       NN  +    S+ L  ++  V     
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKYIDDA-------NNETI----SRPLDEATIFV----- 288

Query: 368 HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSLFF 424
              W+Q D    LL+   G L  L +V D R  V+   +     +   S +  +G  + F
Sbjct: 289 --AWVQVDGQRWLLADDYGRLFFLMLVLDSRNEVEGWKIDYLGSASRASVLIYLGAGMTF 346

Query: 425 LGSRLGDSLLVQFTCGS 441
           +GS  GDS +++ + GS
Sbjct: 347 IGSHQGDSQVIRISEGS 363


>gi|242018509|ref|XP_002429717.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
 gi|212514723|gb|EEB16979.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
          Length = 1218

 Score = 41.6 bits (96), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 68/323 (21%), Positives = 122/323 (37%), Gaps = 65/323 (20%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ S  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVATSMCVMKTGFLFVASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E GD    AP           AL+++V  +E+      S + 
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPR----------ALRNLVQVDEMD-----SLSP 406

Query: 495 ESAQKTFSFAVRDS-----LVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG 546
             A +    A  D+     L   GP   L+   +GL +            S   + ELPG
Sbjct: 407 IMACQVADLANEDTPQLYMLCGRGPRSTLRVLRHGLEV------------SEMAVSELPG 454

Query: 547 -CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
               +WTV  +               ++EY AY+I+S    T+VL   + + EVT+S   
Sbjct: 455 NPNAVWTVKRR--------------VEEEYDAYIIVSFVNATLVLSIGETVEEVTDS--G 498

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVS 665
           F+      + +  G   ++QV+  G R +              N     G +  T++  +
Sbjct: 499 FLGTTPTLSCSALGDDALVQVYPDGIRHIRADKRV--------NEWKAPGKK--TIMKCA 548

Query: 666 IADPYVLLGMSDGSIRLLVGDPS 688
           +    V++ ++ G +     DP+
Sbjct: 549 VNQRQVVIALTAGELVYFEMDPT 571


>gi|384490247|gb|EIE81469.1| hypothetical protein RO3G_06174 [Rhizopus delemar RA 99-880]
          Length = 1197

 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 46/205 (22%), Positives = 84/205 (40%), Gaps = 60/205 (29%)

Query: 543 ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
           ELPG    +WT   ++              DD+YHAY+++S    T+VL   + + EVT+
Sbjct: 443 ELPGNPSAVWTTKLRA--------------DDQYHAYIVVSFANATLVLSIGETVEEVTD 488

Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGSYMTQDL 643
           +  +     T+A   + G   ++QV   G R                  I++ +  ++ +
Sbjct: 489 T-GFLTNAPTLAVQQI-GEDALVQVHPHGIRHIRADRRVNEWRAPQGQTIVEAATNSRQI 546

Query: 644 SFGPSNSE--------SGSGSENS---------TVLSVS------IADPYVLLGMSDGSI 680
           +   SN E         G  +E+          T L++       +   Y+ +G  D ++
Sbjct: 547 AIALSNGEIVYFEMDNMGQLNEHQEHRQMSAYITTLALGEVPEGRVRARYIAVGCEDQTV 606

Query: 681 RLLVGDPSTC--TVSVQTPAAIESS 703
           R+L  DP +C   +S+Q    + SS
Sbjct: 607 RILSLDPDSCLEPISMQALQGVPSS 631


>gi|121699866|ref|XP_001268198.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
           1]
 gi|119396340|gb|EAW06772.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
           1]
          Length = 1140

 Score = 41.6 bits (96), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 25/136 (18%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+L++G  +I Y               V  D+++ + R      LD A   
Sbjct: 248 LIPVPAPLGGLLILGETSIKY---------------VDADNNEIISRP-----LDEATIF 287

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFL 425
             W Q D    LL+   G L  L +V D    V+   L     +   S +  +G  + FL
Sbjct: 288 VAWEQVDSQRWLLADDYGRLFFLMLVLDSDNQVESWKLDLLGKTSRASVLVYLGGGVLFL 347

Query: 426 GSRLGDSLLVQFTCGS 441
           GS  GDS +++ + GS
Sbjct: 348 GSHQGDSQVLRISNGS 363


>gi|302680006|ref|XP_003029685.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
 gi|300103375|gb|EFI94782.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
          Length = 1213

 Score = 41.6 bits (96), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 80/380 (21%), Positives = 141/380 (37%), Gaps = 87/380 (22%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           LL ++ GDL  +T+ ++   V+ + +   +   + S +  + +   F+ S  G+  L QF
Sbjct: 308 LLQSEDGDLFKVTIEHEDEDVKEVKIKYFDTVPVASALCILKSGFLFVASEFGNHYLYQF 367

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTK-RLRRSSSD--ALQDMVNGEELSLYGSAS 491
                       SS    +FG  ++  P      +    D  AL D V   +  +     
Sbjct: 368 QKLGDDDDEPEFSSSSYPQFGMADSSMPLPHVHFKPHPLDNLALADEVESLDPIIDSKVL 427

Query: 492 NNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC- 547
           N   ++     FA        GP   L+   +GL +    S+            +LPG  
Sbjct: 428 NLMPNSDTPQIFAA----CGRGPRSSLRTLRHGLEVEESVSS------------DLPGIP 471

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
             +WT   K               DD + +Y+I+S    T+VL   + + EV ++  +  
Sbjct: 472 NAVWTTKKKE--------------DDAFDSYIILSFVNGTLVLSIGETIEEVQDT-GFLS 516

Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR------------------ILDGS------------ 637
              T+A   + G   ++QV  +G R                  I+  +            
Sbjct: 517 SAPTLAVQQI-GADALLQVHPQGIRHVLSDRRVNEWRVPQGKSIVQATTNKRQVVVALSS 575

Query: 638 ----YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVG 685
               Y   DL  G  N      +  STVL++SI +        P++ +G  D ++R++  
Sbjct: 576 AELVYFELDLD-GQLNEYQDRKAMGSTVLALSIGEVPEGRQRTPFLAVGCEDQTVRIISL 634

Query: 686 DPSTC--TVSVQTPAAIESS 703
           DP +   T+S+Q   A  SS
Sbjct: 635 DPESTLDTISLQALTAPPSS 654


>gi|258570355|ref|XP_002543981.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
 gi|237904251|gb|EEP78652.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
          Length = 1209

 Score = 41.6 bits (96), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 65/265 (24%), Positives = 108/265 (40%), Gaps = 40/265 (15%)

Query: 378 LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
           LL T+ GDL  +T+  V D        V+RL L   +   + S +  + N   F+ S  G
Sbjct: 307 LLQTEDGDLFKVTIDMVEDDNGQPTGEVRRLKLKYFDTVPIASSLCILKNGFLFVASENG 366

Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGD--IEADAPSTKRLRRSSSDALQDMVNGEELSLYG 488
           +    QF         +    ++F    +E  AP   R R + +  L + +N     +  
Sbjct: 367 NHHFYQFEKLGDDDEETEFTSDDFSSDPLEPLAPVYFRPRPAENLNLVESINSVNPLMSC 426

Query: 489 SASNNTES-AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC 547
             +N TE  A + ++     +      LK   +GL +         S+    EL  +P  
Sbjct: 427 KVANLTEDDAPQLYTLCGTGARSTFRTLK---HGLEV---------SEIVESELPSVPS- 473

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
             +WT   K +R            +D+Y AY+I+S    T+VL   + + EVT++  +  
Sbjct: 474 -AVWTT--KLTR------------NDQYDAYIILSFTNGTLVLSIGETVEEVTDT-GFLS 517

Query: 608 QGRTIAAGNLFGRRRVIQVFERGAR 632
              T+A   L G   +IQV  +G R
Sbjct: 518 SAPTLAVQQL-GEDSLIQVHPKGIR 541


>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
           24927]
          Length = 1133

 Score = 41.6 bits (96), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 63/269 (23%), Positives = 107/269 (39%), Gaps = 46/269 (17%)

Query: 180 GPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINL 238
           G L   DP GR  G+ +Y G+   I    Q              G G  A++  + + NL
Sbjct: 116 GHLYLADPGGRLLGLYLYEGIFTAIPIKRQS------------KGRGRHAQLPEAEIGNL 163

Query: 239 RD---LDMKHVK--DFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
            D   + M  +K  + +F++G   PV+ +L+         ++++        L+++    
Sbjct: 164 DDPCPIRMNELKVINMVFLYGTSVPVIAVLYTDSKKLVHLITYE--------LNVAKRAV 215

Query: 294 QHPLI--W--SAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSL 349
           + P    W   A NL H A  L+ V +P GG+LV+G   + Y     +         V +
Sbjct: 216 KDPEFAQWGIKANNLDHGAKLLIPVDNPTGGILVIGEQVVSYFHPERT---------VPM 266

Query: 350 DSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPS 409
                 P S  +      H   +  +  LLS + G L LL ++ +   +  + +      
Sbjct: 267 KKPLHEPTSFVT------HGK-IDPERYLLSDELGHLYLLLLIIENNKLINMRIENLGEV 319

Query: 410 VLTSDITTIGNSLFFLGSRLGDSLLVQFT 438
                I  + N   FLGS  GDS LV+ +
Sbjct: 320 CQARAIVYLDNGYVFLGSHFGDSTLVRIS 348


>gi|347829304|emb|CCD45001.1| similar to pre-mRNA-splicing factor rse1 [Botryotinia fuckeliana]
          Length = 1212

 Score = 41.2 bits (95), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 129/600 (21%), Positives = 226/600 (37%), Gaps = 94/600 (15%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + + G + ++A     G++    +D II+  +  +I+++EF  + +      +  F    
Sbjct: 62  HDVFGIIRAIAAFRLAGSN----KDYIIITSDSGRITIVEFVPAQNKFNRLHLETFG--- 114

Query: 167 WLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
               K G      G  + VDP+GR      V    L  ++ + SQ        E T  S 
Sbjct: 115 ----KSGVRRVVPGQYLAVDPKGRACLTASVEKNKLVYVLNRNSQA-------ELTISSP 163

Query: 224 GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHH 279
               A    + V  L  LD+          GY  PV   L     E +    G+ ++   
Sbjct: 164 --LEAHKAQTLVFALVALDV----------GYANPVFAALEIDYGESDQDPTGQ-AYDEI 210

Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHY-HSQS 335
              +    +   L      WS   +   A  L  VP       GVLV G + I Y HS  
Sbjct: 211 EKQLVYYELDLGLNHVVRKWSE-PVDRTANILFQVPGGTDGPSGVLVCGEDNITYRHSNQ 269

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV-- 391
            +  +A+     + +  Q        V  +L  A   +      LL T  GDL  +T+  
Sbjct: 270 EAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF----FLLQTDDGDLFKITIEM 325

Query: 392 VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
           V D        V+RL +   +   + + +  + +   F+ S  G+    QF         
Sbjct: 326 VEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEFGNHQFYQFEKLGDDDEE 385

Query: 447 SSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSF 503
           +  + ++F  G  E+  P     R + + +L + ++     +    +N T E A + +S 
Sbjct: 386 TEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMDCKVANLTDEDAPQIYSI 445

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHN 562
               +      LK   +GL ++    +            ELPG    +WT   K +RG  
Sbjct: 446 CGTGARSTFRTLK---HGLEVSEIVES------------ELPGVPSAVWTT--KLTRG-- 486

Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
                     D Y AY+I+S    T+VL   + + EVT++  +     T+A   L G   
Sbjct: 487 ----------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GFLSSAPTLAVQQL-GEDS 534

Query: 623 VIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD-PYVLLGM-SDGSI 680
           +IQV  +G R +   +   + +  P +    + + N   ++V+++    V   M SDGS+
Sbjct: 535 LIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSL 593


>gi|302820387|ref|XP_002991861.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
 gi|300140399|gb|EFJ07123.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
          Length = 1292

 Score = 41.2 bits (95), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 47/216 (21%), Positives = 85/216 (39%), Gaps = 38/216 (17%)

Query: 493 NTESAQKTFSFAVRDSLVNIGPLKDFS----YGLRINADASATGISKQSNYELVE----- 543
             E  Q +F   V+    NI P+ DFS    YG + +   +  G  ++ +  ++      
Sbjct: 419 KVEDGQLSFQSFVQ----NIAPILDFSLVDYYGEKQDQMFACCGGDEEGSVRIIRNGNSV 474

Query: 544 ---------LPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETAD 594
                      G  GIWT+ ++              + D YHA+ +IS    T VL    
Sbjct: 475 EKLICTPPVYQGVSGIWTMRYR--------------FKDPYHAFFLISFVEETRVLSVGL 520

Query: 595 LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGS 654
              ++T++V +  Q  T+A G L     V QV+    ++   +          SN  S +
Sbjct: 521 NFVDITDAVGFESQVNTLACG-LVEDGWVAQVWRYEVKLCSPTKAAHPAGVSGSNPLSTT 579

Query: 655 GSENSTVLSV-SIADPYVLLGMSDGSIRLLVGDPST 689
             +    +SV ++    V+L ++   + L++G   T
Sbjct: 580 WRKPGYPISVGAVCRSRVILALARPGLLLMLGATQT 615


>gi|313235544|emb|CBY10999.1| unnamed protein product [Oikopleura dioica]
          Length = 1185

 Score = 41.2 bits (95), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 87/428 (20%), Positives = 157/428 (36%), Gaps = 105/428 (24%)

Query: 319 GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE------LDAAHATWL 372
           GGV+V   N + Y            N+    D    +PR    ++      +  AHAT  
Sbjct: 246 GGVIVCAENYLIY-----------KNFGDQPDIRFPIPRRRNDLDDPERGMIIVAHATHK 294

Query: 373 QNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
              +   LL T+ GDL  +T+  +  +V  + L   +   ++S +  +     F+    G
Sbjct: 295 TRSMFFFLLQTEQGDLFKVTLETEEDIVTEIRLKYFDTVPVSSSLCVLRTGFLFVAGEFG 354

Query: 431 DSLLVQFT-CGSGTSMLSSGLKEEFGDIEADAPSTKRLRR----SSSDALQDMVNGEELS 485
           +  L Q T  G           E   + E    + + LR        D+L  ++N E   
Sbjct: 355 NHNLYQITRLGEDDDEPEFSSAEPLEEGETFFFTPRGLRNLALTDEMDSLSPVLNCEVAD 414

Query: 486 LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELP 545
           L   A+ +T     T     R +L      +   +GL +            S   + ELP
Sbjct: 415 L---ANEDTPQLYVTCGRGPRSTL------RVLRHGLEV------------SEMAVSELP 453

Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           G    +WTV                + D ++ +Y+I+S    T+VL   + + E+T+S  
Sbjct: 454 GNPNAVWTV--------------KTSADADHDSYIIVSFVNATLVLSIGETVEEITDS-G 498

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGAR-------------------------------I 633
           +     T+++G L G   ++Q++  G R                                
Sbjct: 499 FLGTTPTLSSG-LMGEDALVQIYPEGIRHIRSDRRVNEWRAPDRKQIVRCACNRQQVVIA 557

Query: 634 LDGS---YMTQDLSFGPSNSESGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRL 682
           L G    Y   D + G  N  +      S ++++ + D         ++ +G+SDG++R+
Sbjct: 558 LTGGEIVYFEMDPT-GQLNEYTERREFGSEIIALDVGDVPAGEQRCRFLAVGLSDGTVRI 616

Query: 683 LVGDPSTC 690
           +  DP+ C
Sbjct: 617 ISLDPNDC 624


>gi|146096490|ref|XP_001467824.1| cleavage and polyadenylation specificity factor-like protein
           [Leishmania infantum JPCM5]
 gi|134072190|emb|CAM70891.1| cleavage and polyadenylation specificity factor-like protein
           [Leishmania infantum JPCM5]
          Length = 1542

 Score = 41.2 bits (95), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)

Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
           +   A+  G G    +     GG  S  +    V + R  D+K    +++D  FV    E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321

Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
           P++  L E++ TWAGRV    W+         TC I    ++++ +   H L  S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381

Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
           P+D   +  +P+    P     V     +H  ++S             A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441

Query: 351 ------SSQELPRSSFSVELDAAHAT 370
                 +SQ L      V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467


>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
          Length = 1128

 Score = 41.2 bits (95), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           I L +LD   V D  F++G+ +P + +L      +      +H      +L         
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           PL     NL   A  L+ VP+PIGGV+V+G  T+ Y S S      L  Y  S+  S  +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
            R+   ++ D            LL  + G L LL         T       +  L +   
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
             + L S I  + N   ++GS  GDS L++ 
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335


>gi|170041368|ref|XP_001848437.1| splicing factor 3B subunit 3 [Culex quinquefasciatus]
 gi|167864946|gb|EDS28329.1| splicing factor 3B subunit 3 [Culex quinquefasciatus]
          Length = 1215

 Score = 41.2 bits (95), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 67/321 (20%), Positives = 118/321 (36%), Gaps = 61/321 (19%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L+ T+ GD+  +T+  D  VV  + L   +     + +  +     F+    G+  L Q 
Sbjct: 302 LVQTEQGDIFKVTLETDDDVVAEIKLKYFDTVPPATAMCVLKTGFLFVACDFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGS----- 489
                       SS +  E GD    AP            L+++V  +E+  +       
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPR----------PLKNLVMVDEIHSFAPILGCQ 411

Query: 490 -ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC- 547
            A    E   + +    R    +I  L+   +GL +            S   + ELPG  
Sbjct: 412 VADLANEDTPQLYLACGRGPRSSIRVLR---HGLEV------------SEMAVSELPGNP 456

Query: 548 KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV 607
             +WTV  ++              DDE+ AY+I+S    T+VL   D + EVT+S   F+
Sbjct: 457 NAVWTVKKRA--------------DDEFDAYIIVSFVNATLVLSIGDTVEEVTDS--GFL 500

Query: 608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA 667
                   +  G   ++QV+  G R +              N     G +  T++  ++ 
Sbjct: 501 GTTPTLCCSALGDDALVQVYPDGIRHIRADKRV--------NEWKAPGKK--TIIKCAVN 550

Query: 668 DPYVLLGMSDGSIRLLVGDPS 688
              V++ +S G +     DP+
Sbjct: 551 SRQVVIALSGGELVYFEMDPT 571


>gi|398020786|ref|XP_003863556.1| cleavage and polyadenylation specificity factor-like protein
           [Leishmania donovani]
 gi|322501789|emb|CBZ36871.1| cleavage and polyadenylation specificity factor-like protein
           [Leishmania donovani]
          Length = 1542

 Score = 41.2 bits (95), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 86/206 (41%), Gaps = 38/206 (18%)

Query: 202 IILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIE 257
           +   A+  G G    +     GG  S  +    V + R  D+K    +++D  FV    E
Sbjct: 263 VAFGAASAGPGTASSQKV-TQGGVTSLLLRVGTVTHWRLQDVKTALRNIRDVQFVESAGE 321

Query: 258 PVMVILHERELTWAGRVS---WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-L 304
           P++  L E++ TWAGRV    W+         TC I    ++++ +   H L  S ++ L
Sbjct: 322 PLLAFLFEKQPTWAGRVKLLEWRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGL 381

Query: 305 PHDAYKLLAVPS----PIGGVLVVGANTIHYHSQSA----------SCALALNNYAVSLD 350
           P+D   +  +P+    P     V     +H  ++S             A +L + AVSL+
Sbjct: 382 PYDVTSMTPLPAFQDVPSAVFCVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSLE 441

Query: 351 ------SSQELPRSSFSVELDAAHAT 370
                 +SQ L      V L+ A+AT
Sbjct: 442 AVQWRSASQALSTDLVKVNLNFANAT 467


>gi|154295205|ref|XP_001548039.1| pre-mRNA splicing factor 3b [Botryotinia fuckeliana B05.10]
          Length = 1020

 Score = 40.8 bits (94), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 129/600 (21%), Positives = 226/600 (37%), Gaps = 94/600 (15%)

Query: 107 YRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
           + + G + ++A     G++    +D II+  +  +I+++EF  + +      +  F    
Sbjct: 62  HDVFGIIRAIAAFRLAGSN----KDYIIITSDSGRITIVEFVPAQNKFNRLHLETFG--- 114

Query: 167 WLHLKRGRESFARGPLVKVDPQGRC---GGVLVYGLQMIILKASQGGSGLVGDEDTFGSG 223
               K G      G  + VDP+GR      V    L  ++ + SQ        E T  S 
Sbjct: 115 ----KSGVRRVVPGQYLAVDPKGRACLTASVEKNKLVYVLNRNSQA-------ELTISSP 163

Query: 224 GGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILH----ERELTWAGRVSWKHH 279
               A    + V  L  LD+          GY  PV   L     E +    G+ ++   
Sbjct: 164 --LEAHKAQTLVFALVALDV----------GYANPVFAALEIDYGESDQDPTGQ-AYDEI 210

Query: 280 TCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHY-HSQS 335
              +    +   L      WS   +   A  L  VP       GVLV G + I Y HS  
Sbjct: 211 EKQLVYYELDLGLNHVVRKWSE-PVDRTANILFQVPGGTDGPSGVLVCGEDNITYRHSNQ 269

Query: 336 ASCALALNNYAVSLDSSQELPRSSFSV--ELDAAHATWLQNDVALLSTKTGDLVLLTV-- 391
            +  +A+     + +  Q        V  +L  A   +      LL T  GDL  +T+  
Sbjct: 270 EAFRVAIPRRRGATEDPQRKRNIVAGVMHKLKGAAGAFF----FLLQTDDGDLFKITIEM 325

Query: 392 VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSML 446
           V D        V+RL +   +   + + +  + +   F+ S  G+    QF         
Sbjct: 326 VEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGFLFVASEFGNHQFYQFEKLGDDDEE 385

Query: 447 SSGLKEEF--GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSF 503
           +  + ++F  G  E+  P     R + + +L + ++     +    +N T E A + +S 
Sbjct: 386 TEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESIDSMNPLMDCKVANLTDEDAPQIYSI 445

Query: 504 AVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHN 562
               +      LK   +GL ++    +            ELPG    +WT   K +RG  
Sbjct: 446 CGTGARSTFRTLK---HGLEVSEIVES------------ELPGVPSAVWTT--KLTRG-- 486

Query: 563 ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR 622
                     D Y AY+I+S    T+VL   + + EVT++  +     T+A   L G   
Sbjct: 487 ----------DTYDAYIILSFSNGTLVLSIGETVEEVTDT-GFLSSAPTLAVQQL-GEDS 534

Query: 623 VIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD-PYVLLGM-SDGSI 680
           +IQV  +G R +   +   + +  P +    + + N   ++V+++    V   M SDGS+
Sbjct: 535 LIQVHPKGIRHIRADHRVNEWA-APQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSL 593


>gi|452824086|gb|EME31091.1| DNA damage-binding protein 1 isoform 2 [Galdieria sulphuraria]
          Length = 1150

 Score = 40.8 bits (94), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 55/211 (26%), Positives = 84/211 (39%), Gaps = 35/211 (16%)

Query: 236 INLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH 295
           I L +LD   V D  F++G+ +P + +L      +      +H      +L         
Sbjct: 151 IRLEELD---VLDIQFLYGHSKPTIAVL------YTDSEENRHLKTYTVSLK-DKDFGNG 200

Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
           PL     NL   A  L+ VP+PIGGV+V+G  T+ Y S S      L  Y  S+  S  +
Sbjct: 201 PLFQG--NLESGASMLIPVPTPIGGVVVLGQETVTYISGS-----GLRGYH-SIPVSATI 252

Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLL---------TVVYDGRVVQRLDLSKT 406
            R+   ++ D            LL  + G L LL         T       +  L +   
Sbjct: 253 FRAYGRIDKDGTR--------YLLGDEKGILYLLVLEQSTSLSTFTETETKITGLKIQTL 304

Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
             + L S I  + N   ++GS  GDS L++ 
Sbjct: 305 GETSLPSTIDYLDNGFVYIGSCHGDSQLIRL 335


>gi|406602265|emb|CCH46158.1| Pre-mRNA-splicing factor [Wickerhamomyces ciferrii]
          Length = 1123

 Score = 40.8 bits (94), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 61/255 (23%), Positives = 100/255 (39%), Gaps = 81/255 (31%)

Query: 493 NTESAQKTFSFA-VRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVE--LPG-CK 548
           N ++  K +S + V+DS      LK   YGL IN              E+VE  LPG   
Sbjct: 406 NDDAFTKIYSLSGVKDS----SSLKILQYGLSIN--------------EIVESDLPGIAN 447

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            +WT                   +DE+  YL+IS    T+VL   + + E+T+S    + 
Sbjct: 448 KVWTTKLNK--------------NDEFDKYLVISFMDTTLVLSIGENVEEITDS-GLALN 492

Query: 609 GRTIAAGNLFGRRRVIQVFERGAR------------------ILDGSYMTQDLSFGPSNS 650
             TI    + G   ++Q+   G R                  IL  S   + ++ G SN 
Sbjct: 493 EETIGIQQI-GINSLVQIHSNGIRNIKNGELINEWQPPAGIKILTTSTTNRQIAIGLSND 551

Query: 651 E---------------SGSGSENSTVLSVSIAD--------PYVLLGMSDGSIRLLVGDP 687
           E               +      S ++S+S+ D        P++++G  D +IR+L  DP
Sbjct: 552 ELVYFEVDDRDRLIEYNERKELTSRIVSLSLGDIPEGRLRSPFLIVGCQDSTIRVLSTDP 611

Query: 688 STC--TVSVQTPAAI 700
            +    +S+Q  ++I
Sbjct: 612 GSTLELLSLQALSSI 626


>gi|171691144|ref|XP_001910497.1| hypothetical protein [Podospora anserina S mat+]
 gi|170945520|emb|CAP71632.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1158

 Score = 40.8 bits (94), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 43/188 (22%), Positives = 72/188 (38%), Gaps = 53/188 (28%)

Query: 282 MISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA 341
           +I    +   +K+H    +    PH           +GGV+VVG   + Y          
Sbjct: 233 LIPVRKVEEEVKRHNFRNTGSAKPH-----------LGGVIVVGETRLLY---------- 271

Query: 342 LNNYAVSLDSSQELPRSSFSVELDAA--HATWLQNDVA--LLSTKTGDLVLLTVVYDGRV 397
                       ++ +++   +LD A     W + +V    L+   G L LLT+  DG  
Sbjct: 272 ----------IDDVTKATVESKLDKASIFVKWAEYNVQTYFLADDYGSLHLLTINTDGAE 321

Query: 398 VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
           V+ + L+K   +   S++  +GN + F+ S  GDS L Q                   D+
Sbjct: 322 VKGMVLTKIGVTSRASELVYLGNEMLFVASHHGDSRLFQL------------------DL 363

Query: 458 EADAPSTK 465
            AD P+ K
Sbjct: 364 SADKPADK 371


>gi|115397303|ref|XP_001214243.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114192434|gb|EAU34134.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 1140

 Score = 40.8 bits (94), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 60/146 (41%), Gaps = 25/146 (17%)

Query: 301 AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVS--LDSSQELPRS 358
           A  L   A  L+ VP+P+GG+L++G  +I Y           NN  VS  LD +      
Sbjct: 237 AQELDLGASHLIPVPAPLGGLLILGETSIKYVDDD-------NNEIVSRLLDEA------ 283

Query: 359 SFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDI 415
                       W Q D    LL+   G L  L +V D    VQ   L     +   S +
Sbjct: 284 -------TIFVAWEQVDSQRWLLADDYGRLFFLMLVLDSENQVQGWQLDHLGNTSRASTL 336

Query: 416 TTIGNSLFFLGSRLGDSLLVQFTCGS 441
             +G  + F+GS  GDS +++   GS
Sbjct: 337 VYLGGGVIFVGSHQGDSQVLRVGDGS 362


>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
           phaseolina MS6]
          Length = 1129

 Score = 40.8 bits (94), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 63/262 (24%), Positives = 102/262 (38%), Gaps = 32/262 (12%)

Query: 185 VDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDM 243
           +DP GR   + +Y G+  ++    +G     GD +    G    +RIE   V +   L  
Sbjct: 121 LDPTGRFMTLELYEGIVTVVPLTEKGKRK--GDPEVSALGEPVPSRIEEMFVRSSAFLHR 178

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
           K  +         +P++ +L+E +     R+  +      +     +     P+      
Sbjct: 179 KSPESE-------KPLVALLYEEDEDSKIRLRLRQLAFQTAGTEEQSVAALEPVEGLKEE 231

Query: 304 LPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE 363
           L   A  L+ VP P  GVLV+G   I Y           N+Y  +L   + L  S+  V 
Sbjct: 232 LDLGASHLIPVPGPCYGVLVLGETCITY----------FNDYTKAL-VKKPLQDSTIFV- 279

Query: 364 LDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGR--VVQRLDLSKTNPSVLTSDITTIG 419
                  W Q  N   LL+   G L L  ++ D    VV+   L K   +   S +  + 
Sbjct: 280 ------AWEQIDNQRFLLADDFGGLYLFMLLLDDNSGVVEGWRLDKIGETSRASVLVYLD 333

Query: 420 NSLFFLGSRLGDSLLVQFTCGS 441
               F+GS  GDS +++ T GS
Sbjct: 334 AGHVFVGSHEGDSQVIRITEGS 355


>gi|18410222|ref|NP_567015.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
 gi|18410226|ref|NP_567016.1| putative splicing factor [Arabidopsis thaliana]
 gi|7019653|emb|CAB75754.1| spliceosomal-like protein [Arabidopsis thaliana]
 gi|7019655|emb|CAB75756.1| spliceosomal-like protein [Arabidopsis thaliana]
 gi|332645831|gb|AEE79352.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
 gi|332645833|gb|AEE79354.1| putative splicing factor [Arabidopsis thaliana]
          Length = 1214

 Score = 40.8 bits (94), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 73/326 (22%), Positives = 125/326 (38%), Gaps = 61/326 (18%)

Query: 320 GVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALL 379
           GVLV   N + Y +Q      A+      +    +LP     + + AA          L+
Sbjct: 246 GVLVCAENFVIYMNQGHPDVRAV------IPRRTDLPAERGVLVVSAAVHKQKTMFFFLI 299

Query: 380 STKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC 439
            T+ GD+  +T+ ++G  V  L +   +   + S I  +     F  S  G+  L QF  
Sbjct: 300 QTEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQA 359

Query: 440 --------GSGTSMLSSGLKEEFGDIEADAPSTKRLRR-SSSDALQDMVNGEELSLYGSA 490
                    S ++++ +  +E F  +       K L R    ++L  +++ + L+++   
Sbjct: 360 IGEEPDVESSSSNLMET--EEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVLNIF--- 414

Query: 491 SNNTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPG- 546
               E   + FS   R      GP   L+    GL I   A +            +LPG 
Sbjct: 415 ---EEETPQIFSLCGR------GPRSSLRILRPGLAITEMAVS------------QLPGQ 453

Query: 547 CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYF 606
              +WTV    S              DE+ AY+++S    T+VL   + + EV +S   F
Sbjct: 454 PSAVWTVKKNVS--------------DEFDAYIVVSFTNATLVLSIGEQVEEVNDS--GF 497

Query: 607 VQGRTIAAGNLFGRRRVIQVFERGAR 632
           +      A +L G   ++QV   G R
Sbjct: 498 LDTTPSLAVSLIGDDSLMQVHPNGIR 523


>gi|1399512|gb|AAC47162.1| repE [Dictyostelium discoideum]
          Length = 1139

 Score = 40.8 bits (94), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V N+R L+   V D  F++G   P + +L +           + H       S  T L 
Sbjct: 150 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 201

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             P  WS  N+    Y  L VP P+GGVLVV  N I Y +   + ++A+ +Y   L  ++
Sbjct: 202 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 256

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
                   V+ D +          L     G L +L +++  + V  L   +     + S
Sbjct: 257 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 300

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            I+ + + + ++GS  GDS L++ 
Sbjct: 301 SISYLDSGVVYIGSSSGDSQLIRL 324


>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
          Length = 1504

 Score = 40.4 bits (93), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 36/135 (26%), Positives = 58/135 (42%), Gaps = 21/135 (15%)

Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD 365
           H+        + +GG+LVVG   + Y   +  C +             E+P  + S+   
Sbjct: 562 HNVRNTATATANLGGLLVVGETRLLYIDSTTKCTV-------------EVPLRAASI--- 605

Query: 366 AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN-SL 422
                W + D    LL+ + G L LLT++  G VV  LD+S    +   S +  + +  L
Sbjct: 606 --FVAWARYDATHYLLADEYGTLHLLTILVSGAVVDNLDVSPIGKTSRASCLVYLPDRRL 663

Query: 423 FFLGSRLGDSLLVQF 437
            F+GS  GDS L + 
Sbjct: 664 LFVGSHNGDSQLFRL 678


>gi|427798971|gb|JAA64937.1| Putative damage-specific dna binding complex subunit ddb1, partial
           [Rhipicephalus pulchellus]
          Length = 1259

 Score = 40.4 bits (93), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 60/268 (22%), Positives = 103/268 (38%), Gaps = 57/268 (21%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + + +  +     F+ +  G+  L Q 
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIKLKYFDTIPVAASMCVLKTGFLFVAAEFGNHCLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNT 494
                       SS +  E GD    AP           AL++++  EEL     A   T
Sbjct: 362 ARLGEEDEEPEFSSAIPLEEGDTFFFAPR----------ALRNLLPVEELDSLSPAMGCT 411

Query: 495 ------ESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELP 545
                 E   + +    R      GP   ++   +GL +            S   + ELP
Sbjct: 412 IADLANEDTPQLYVACGR------GPRSCIRVLRHGLEV------------SEMAVSELP 453

Query: 546 GC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVD 604
           G    +WTV  K+              D++Y AY+I+S    T+VL   + + EVT+S  
Sbjct: 454 GNPNAVWTVKRKA--------------DEDYDAYIIVSFVNATLVLSIGETVEEVTDS-G 498

Query: 605 YFVQGRTIAAGNLFGRRRVIQVFERGAR 632
           +     T++   + G   ++QV+  G R
Sbjct: 499 FLGTTPTLSCAQI-GDDALVQVYPEGIR 525


>gi|166240328|ref|XP_637896.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
 gi|238064940|sp|B0M0P5.1|DDB1_DICDI RecName: Full=DNA damage-binding protein 1; AltName: Full=DNA
           repair protein E; AltName: Full=UV-damaged DNA-binding
           protein 1
 gi|165988543|gb|EAL64385.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
          Length = 1181

 Score = 40.4 bits (93), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 87/204 (42%), Gaps = 29/204 (14%)

Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
           +V N+R L+   V D  F++G   P + +L +           + H       S  T L 
Sbjct: 192 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFKD-------TKDEKHISTYEISSKDTELV 243

Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQ 353
             P  WS  N+    Y  L VP P+GGVLVV  N I Y +   + ++A+ +Y   L  ++
Sbjct: 244 VGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAV-SYTKFLAFTR 298

Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
                   V+ D +          L     G L +L +++  + V  L   +     + S
Sbjct: 299 --------VDKDGSR--------FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPS 342

Query: 414 DITTIGNSLFFLGSRLGDSLLVQF 437
            I+ + + + ++GS  GDS L++ 
Sbjct: 343 SISYLDSGVVYIGSSSGDSQLIRL 366


>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei
           ATCC 18224]
 gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei
           ATCC 18224]
          Length = 1139

 Score = 40.0 bits (92), Expect = 5.8,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 25/139 (17%)

Query: 308 AYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA 367
           A  L+ VP+P+GG+LV+G   I Y                 +D ++     + S  LD A
Sbjct: 245 ASHLIPVPAPLGGLLVLGETCIKY-----------------IDDAK---NETISNPLDEA 284

Query: 368 --HATWLQNDVA--LLSTKTGDLVLLTVVYDGR-VVQRLDLSKTNPSVLTSDITTIGNSL 422
                W+Q D    LL+   G L  L +V D +  V+   L     +   S +  +G  +
Sbjct: 285 TIFVAWVQVDGQRWLLADDYGRLFFLMLVLDSQNEVEGWKLDYLGEASRASVLIYLGAGM 344

Query: 423 FFLGSRLGDSLLVQFTCGS 441
            F+GS  GDS +++ + GS
Sbjct: 345 TFIGSHQGDSQVIRISEGS 363


>gi|380481704|emb|CCF41690.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
          Length = 932

 Score = 40.0 bits (92), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 58/144 (40%), Gaps = 14/144 (9%)

Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSS-----QELPRSSFS 361
           D Y  + +P PI     V      YH +  + A A     + +  +       L R+   
Sbjct: 179 DPYARIVIPVPI-----VEDEVKRYHKRDTTGAKAQLGGLIVVGETLLVYVDTLTRTVVE 233

Query: 362 VELD--AAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
             L+  A    W   D     LS   G+L LLT+  +G VV  L L     +   S +  
Sbjct: 234 SGLNSPAIFVAWAAYDDTNYFLSDDYGNLHLLTIETEGVVVTNLSLRLLGVTSRASCLVH 293

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGS 441
           +GN L FLGS  GDS L+Q    S
Sbjct: 294 MGNGLLFLGSHYGDSQLLQINMES 317


>gi|401426989|ref|XP_003877978.1| cleavage and polyadenylation specificity factor-like protein
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322494225|emb|CBZ29522.1| cleavage and polyadenylation specificity factor-like protein
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 1542

 Score = 40.0 bits (92), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 84/185 (45%), Gaps = 37/185 (20%)

Query: 223 GGGFSARIESSHVINLRDLDMK----HVKDFIFVHGYIEPVMVILHERELTWAGRVS--- 275
           GGG S  +    V + R  D+K    +++D  FV    EP++  L E++ TWAGRV    
Sbjct: 283 GGGTSLLLRIGTVTHWRLQDVKTALRNIRDIQFVESAGEPLLAFLFEKQPTWAGRVKLLE 342

Query: 276 WKHH-------TCMIS--ALSISTTLKQHPLIWSAMN-LPHDAYKLLA------VPSPIG 319
           W+         TC I    ++++ +   H L  S ++ LP+D   +        VPS + 
Sbjct: 343 WRSKTVESHMLTCSIEWMKVTLANSTAPHMLSLSEVDGLPYDVTSMTPLTAFQDVPSAVF 402

Query: 320 GV---LVVGANT-----IHYHSQSASCALALNNYAVSLD------SSQELPRSSFSVELD 365
            V   ++V  +T     ++ ++     A +L + AVS +      +SQ L      V L+
Sbjct: 403 CVSRNMMVHVSTKSGYGVYVNATGEEQARSLKSSAVSFEAVQWRSASQALSTDLVKVNLN 462

Query: 366 AAHAT 370
            ++AT
Sbjct: 463 FSNAT 467


>gi|440636768|gb|ELR06687.1| pre-mRNA-splicing factor rse1 [Geomyces destructans 20631-21]
          Length = 1212

 Score = 39.7 bits (91), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 74/327 (22%), Positives = 125/327 (38%), Gaps = 49/327 (14%)

Query: 320 GVLVVGANTIHY-HSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA- 377
           GVLV G + I Y HS   +  +A+         + E P+   S+     H          
Sbjct: 253 GVLVCGEDNITYRHSNQEAFRVAIPRRK----GATEDPQRKRSIVAGVMHKMRGAAGAFF 308

Query: 378 -LLSTKTGDLVLLTV--VYDGR-----VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRL 429
            LL +  GDL  +T+  + D        V+RL +   +   + + +  + +   F+ S  
Sbjct: 309 FLLQSDDGDLFKITIEMIEDDNGQPTGEVRRLKIKYFDTVPIATSLCILKSGFLFVASEF 368

Query: 430 GDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD--APSTKRLRRSSSDALQDMVNGEELSLY 487
           G+    QF         +  + + F    A+   P     R + +  L + ++     + 
Sbjct: 369 GNHQFYQFEKLGDDDDETEYISDNFPTDPAEPYTPVYFHPRPAENLNLVESIDSMNPLMD 428

Query: 488 GSASNNTES-AQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPG 546
              +N TE  A + +S     +      LK   +GL +N    +            ELPG
Sbjct: 429 CKVANLTEEDAPQIYSICGTGARSTFRTLK---HGLEVNEIVES------------ELPG 473

Query: 547 C-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
               +WT   K +RG            DEY AY+I++    T+VL   + + EVT++   
Sbjct: 474 VPSAVWTT--KLTRG------------DEYDAYIILAFSNGTLVLSIGETVEEVTDT--G 517

Query: 606 FVQGRTIAAGNLFGRRRVIQVFERGAR 632
           F+   T  A    G   +IQV  +G R
Sbjct: 518 FLSSATTLAVQQLGEDGLIQVHPKGIR 544


>gi|443694993|gb|ELT96001.1| hypothetical protein CAPTEDRAFT_155561 [Capitella teleta]
          Length = 1215

 Score = 39.7 bits (91), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 59/264 (22%), Positives = 100/264 (37%), Gaps = 49/264 (18%)

Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
           L  T+ GD+  +T+  D  +V  + L   +   + S +  +     F+ S  G+  L Q 
Sbjct: 302 LTQTEQGDVFKITLETDEDMVTEVRLKYFDTVPVASSMCVLKTGFLFIASEFGNHYLYQI 361

Query: 438 TC---GSGTSMLSSGLKEEFGD--IEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASN 492
                       SS +  E GD    A  P    +     D+L  +++ +   L      
Sbjct: 362 AHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLKNLVMVDEMDSLSPIMHCQIADL------ 415

Query: 493 NTESAQKTFSFAVRDSLVNIGP---LKDFSYGLRINADASATGISKQSNYELVELPGC-K 548
             E   + F+   R      GP   L+   +GL +            S   + ELPG   
Sbjct: 416 ANEDTPQLFAMCGR------GPRSTLRVLRHGLEV------------SEMAVSELPGNPN 457

Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQ 608
            +WTV                  +DE+ AY+I+S    T+VL   + + EVT+S  +   
Sbjct: 458 AVWTVKRN--------------IEDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGT 502

Query: 609 GRTIAAGNLFGRRRVIQVFERGAR 632
             T++   L G   ++Q++  G R
Sbjct: 503 TPTLSCSQL-GDDALVQIYPDGIR 525


>gi|70992271|ref|XP_750984.1| UV-damaged DNA binding protein [Aspergillus fumigatus Af293]
 gi|66848617|gb|EAL88946.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
           Af293]
 gi|159124553|gb|EDP49671.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
           A1163]
          Length = 1140

 Score = 39.7 bits (91), Expect = 8.2,   Method: Compositional matrix adjust.
 Identities = 38/135 (28%), Positives = 60/135 (44%), Gaps = 25/135 (18%)

Query: 311 LLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAA--H 368
           L+ VP+P+GG+L++G  +I Y               V  D+++ + R      LD A   
Sbjct: 248 LIPVPAPLGGLLILGEMSIKY---------------VDADNNEIISRP-----LDEATIF 287

Query: 369 ATWLQNDVA--LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFL 425
             W Q D    LL+   G L  L +V D    V+   L     +   S +  +G  + FL
Sbjct: 288 VAWEQVDSQRWLLADDYGRLFFLMLVLDSDSQVESWKLDHLGNTSRASVLVYLGGGILFL 347

Query: 426 GSRLGDSLLVQFTCG 440
           GS  GDS +++ + G
Sbjct: 348 GSHQGDSQVLRISNG 362


>gi|322700233|gb|EFY91989.1| Pre-mRNA-splicing factor rse-1 [Metarhizium acridum CQMa 102]
          Length = 1039

 Score = 39.7 bits (91), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 31/105 (29%), Positives = 47/105 (44%), Gaps = 19/105 (18%)

Query: 540 ELV--ELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL 596
           ELV  ELPG    +WT     S              D+Y AY+I++    TMVL   + +
Sbjct: 461 ELVASELPGTPSAVWTTKLTQS--------------DDYDAYIILTFLHDTMVLSVGETV 506

Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQ 641
           T+VT+S   F+      A    G+  + QV+ +G R +     T+
Sbjct: 507 TQVTDS--GFITTVATLAVQQIGKNSLFQVYSKGIRHIQSGQFTE 549


>gi|296411833|ref|XP_002835634.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295629420|emb|CAZ79791.1| unnamed protein product [Tuber melanosporum]
          Length = 1053

 Score = 39.7 bits (91), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 65/262 (24%), Positives = 106/262 (40%), Gaps = 49/262 (18%)

Query: 186 DPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMK 244
           DP     G+ VY G+ ++I +  Q   G          G      I +  V+ L++L+  
Sbjct: 123 DPGKNMLGIHVYKGIFLVIPQIQQSIKGSRRSRADLDVG-----NIGNPCVVRLKELE-- 175

Query: 245 HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNL 304
            + D  F+ G I PV+ +L++     +G      +T  +S  S    L    L W   +L
Sbjct: 176 -ILDLKFLFGTISPVLAVLYKP----SGADEMAVNTYELSVKSGEVKL----LDWRIRDL 226

Query: 305 P--HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSV 362
               +A  L+ V  P  G+L++G   I Y           +NY           ++   V
Sbjct: 227 KGGREALFLIPVRPPSNGLLLIGVTKIQY----------FDNYG---------NKTFLPV 267

Query: 363 ELDAAHATW--LQNDVALLSTKTGDLVLLTV---VYDGRVVQRLDL--SKTNPSVLTSDI 415
           +      TW  L  +  +L  + G L +LT+   + D +V   L L  + + P +L    
Sbjct: 268 DPPMVWVTWEMLSPERYILGDEAGGLHMLTLSAGLMDTKVGLHLKLVGNASIPEILVH-- 325

Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
             +   L FLGS  GDS L+Q 
Sbjct: 326 --LNQGLLFLGSHSGDSQLLQL 345


>gi|384500266|gb|EIE90757.1| hypothetical protein RO3G_15468 [Rhizopus delemar RA 99-880]
          Length = 1057

 Score = 39.7 bits (91), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 52/220 (23%), Positives = 85/220 (38%), Gaps = 39/220 (17%)

Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMN 303
           K V    F+   ++P ++IL+E  L                 L    T+K   L+   + 
Sbjct: 142 KKVISLAFLQDTLDPTLLILYEDALE--------------QRLLQMFTIKDRQLVPGDII 187

Query: 304 LPH---DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF 360
           L H   DA  L+A+P  +GGVL+V +  I Y                 L  +Q  P  + 
Sbjct: 188 LDHFESDASLLIAMPPAVGGVLLVASKFIRY-----------------LKPNQ--PPIAI 228

Query: 361 SVELDAAHATWLQNDV---ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
            +     ++  + N+     LL    G L LL +    + V+ L         + S +  
Sbjct: 229 GIRSSTINSHCIMNEEGSRVLLGDAEGLLYLLALNTTNQCVESLSFIYLGSISIPSCLAY 288

Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
           + N + F+GS L DS LV     +G S     + E F ++
Sbjct: 289 LDNDIVFVGSNLADSQLVYIQRTTGESEDILQIIETFANL 328


>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
          Length = 1054

 Score = 39.7 bits (91), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 45/199 (22%), Positives = 73/199 (36%), Gaps = 39/199 (19%)

Query: 503 FAVRDSLVNIGPLKDFSY---------------------GLRINADASATGISKQSNYEL 541
            +V D   N+GP+ DF                        LRI  +    GI++Q++   
Sbjct: 302 ISVIDQFTNLGPITDFCVVDVEKQGQGQLVTCSGTFQDGSLRIIRNG--IGIAEQAS--- 356

Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
           +ELPG +G+W             S    +     H +LI+S    T VL  +    E TE
Sbjct: 357 IELPGIRGLW-------------SLSNNSNPSSLHRHLIVSFINSTKVLTFSGEEIEETE 403

Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTV 661
              +     T+  GN       IQ+   G  ++D S + +   + P        S N + 
Sbjct: 404 IAGFDSNATTLYCGNTTENNHFIQIATSGIYLVDSSSLMRLDQYTPEKGSINLASCNGSQ 463

Query: 662 LSVSIADPYVLLGMSDGSI 680
           + +S       L +SD  +
Sbjct: 464 ILISQGSNLTYLEISDSKL 482


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.134    0.395 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 15,589,231,647
Number of Sequences: 23463169
Number of extensions: 661713671
Number of successful extensions: 1528851
Number of sequences better than 100.0: 646
Number of HSP's better than 100.0 without gapping: 339
Number of HSP's successfully gapped in prelim test: 307
Number of HSP's that attempted gapping in prelim test: 1525297
Number of HSP's gapped (non-prelim): 1649
length of query: 1004
length of database: 8,064,228,071
effective HSP length: 153
effective length of query: 851
effective length of database: 8,769,330,510
effective search space: 7462700264010
effective search space used: 7462700264010
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 82 (36.2 bits)